Researchers from MIT, Cohere for AI, and other institutions have launched the Data Provenance Platform, aimed at addressing the transparency crisis in the AI field. They have audited and traced over 2,000 widely used fine-tuned datasets, emphasizing that issues with dataset provenance and transparency could lead to data leaks, exposure of personal information, biases, and legal risks. This initiative is expected to enhance data transparency in the AI sector, improve dataset quality and ethical compliance, and promote the sustainable development of AI technology.