The data to be translated: The Webmaster Home reports on a new benchmark called MathVerse, designed to evaluate the performance of multimodal large language models on visual mathematical problems. Research has found that most models rely heavily on visual input, but GPT-4V excels in both textual and visual aspects. The introduction of this benchmark provides new insights for the future development direction of MLLMs.