Researchers, in collaboration with MIT, Cohere for AI, and other institutions, have launched the Data Provenance Platform to tackle the issue of AI data transparency. The platform audits and tracks over 2,000 widely used fine-tuning datasets, emphasizing the importance of data transparency. Insufficient data provenance can lead to data leaks, exposure of personal information, bias, and legal risks. The Data Provenance Platform is expected to enhance AI data transparency and improve the quality and ethical compliance of datasets.