CohereLabs/Global-MMLU-Lite
General NLPAR, BN, CSBenchmarkapache-2.0
Created by CohereLabs at 2024, the CohereLabs/Global-MMLU-Lite is a General NLP benchmark dataset in AR, BN, CS containing 14,000 records in Parquet format. With 7.2K downloads and 41 likes, it is actively used by the community. It is released under the apache-2.0 license and is a 10K<n<100K-scale dataset.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About CohereLabs/Global-MMLU-Lite
Releases:
Version 3.0 (May 2026): GMMLU Lite 3.0 release with 5 new languages: Czech, Hungarian, Italian (updated), Oriya, Slovak and Tajik
Version 2.0 (Dec 2025): GMMLU Lite 2.0 release with 3 new languages: Albanian, Burmese and Welsh
Versio...
Details
- Task
- General NLP
- Language
- AR, BN, CS
- Format
- Parquet
- Rows / instances
- 14000
- Size
- 10K<n<100K
- Creator
- CohereLabs
- Year
- 2024
- License
- apache-2.0
- Downloads
- 7206
- Likes
- 41