P-MMEval

A multilingual multi-task benchmark for evaluating large language models (LLMs).

CommonProductOthersMultilingualBenchmarking
P-MMEval is a multilingual benchmark that encompasses datasets focused on foundational and capability specialization. It extends existing benchmarks to ensure consistency in language coverage and provides parallel samples across various languages, supporting up to 10 languages from 8 language families. P-MMEval facilitates comprehensive assessment of multilingual capabilities and comparative analysis of cross-language transferability.
Visit

P-MMEval Visit Over Time

Monthly Visits

1141359

Bounce Rate

43.84%

Page per Visit

4.3

Visit Duration

00:03:56

P-MMEval Visit Trend

P-MMEval Visit Geography

P-MMEval Traffic Sources

P-MMEval Alternatives