P-MMEval
A multilingual multi-task benchmark for evaluating large language models (LLMs).
CommonProductOthersMultilingualBenchmarking
P-MMEval is a multilingual benchmark that encompasses datasets focused on foundational and capability specialization. It extends existing benchmarks to ensure consistency in language coverage and provides parallel samples across various languages, supporting up to 10 languages from 8 language families. P-MMEval facilitates comprehensive assessment of multilingual capabilities and comparative analysis of cross-language transferability.
P-MMEval Visit Over Time
Monthly Visits
1141359
Bounce Rate
43.84%
Page per Visit
4.3
Visit Duration
00:03:56