Beijing Programmers Stay Up All Night to Debug: Apple's Paper Exposed with 30% Benchmark Data Errors, ICLR Submission Undergoes Urgent Correction
Apple's vision reasoning paper submitted to ICLR 2025 claimed to surpass GPT-5, but was exposed by researchers who replicated the study, revealing serious issues: the official code lacked an image input module, and after fixing it, the accuracy dropped sharply;抽查 found that 30% of the annotated data had errors. The author team hastily closed issue reports on GitHub before finally admitting flaws in the data generation process. This incident exposed flaws in the paper review mechanism and raised concerns in the academic community about the reproducibility of AI research. (140 characters)