Beståndsuppgifter: MMLU-ProX

Laddar…

Visa i EDS

Sparad:

Utgivningsår:

2025

Ämnestermer:

Computer Science - Computation and Language

Beskrivning:

Existing large language model (LLM) evaluation benchmarks primarily focus on English, while current multilingual tasks lack parallel questio

Databas:

arXiv