AlgorithmicResearchGroup/arxiv_s2orc_parsed
Text Generation, Zero Shot Classification, EN, Benchmark
AlgorithmicResearchGroup/arxiv_s2orc_parsed is an English (EN) text generation benchmark dataset from AlgorithmicResearchGroup, with 1,671,614 records in Parquet format. It falls in the 1M<n<10M size category and has been downloaded 52.8K times.
This dataset is used as an LLM benchmark.
About AlgorithmicResearchGroup/arxiv_s2orc_parsed
Dataset Card for "ArtifactAI/arxiv_s2orc_parsed"
Dataset Description
https://huggingface.co/datasets/AlgorithmicResearchGroup/arxiv_s2orc_parsed
Dataset Summary
AlgorithmicResearchGroup/arxiv_s2orc_parsed is a subs...
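With roughly 1.67 million Parquet records, it may be convenient to inspect a few rows before downloading everything. The following is a minimal sketch using the Hugging Face `datasets` library in streaming mode. The helper name `stream_first_records` is hypothetical, and the `"train"` split name is an assumption that this listing does not confirm.

```python
from itertools import islice

# Repository ID as given in the listing above.
REPO_ID = "AlgorithmicResearchGroup/arxiv_s2orc_parsed"


def stream_first_records(n=3, split="train"):
    """Return the first n records without downloading the full dataset.

    The split name "train" is an assumption; check the dataset page
    for the splits that actually exist.
    """
    # Imported lazily so this module loads even if `datasets` is not installed.
    from datasets import load_dataset

    # streaming=True returns an iterable dataset that fetches rows on demand.
    ds = load_dataset(REPO_ID, split=split, streaming=True)
    return list(islice(ds, n))


# Example usage (requires network access and `pip install datasets`):
# for record in stream_first_records():
#     print(sorted(record.keys()))
```

Printing the keys first is a cheap way to discover the column names, which this listing does not state.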
Details
- Task: Text Generation, Zero Shot Classification
- Language: EN
- Format: Parquet
- Rows / instances: 1,671,614
- Size: 1M<n<10M
- Creator: AlgorithmicResearchGroup
- Year: 2026
- Downloads: 52,833
- Likes: 27
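The size label follows from the row count: 1,671,614 lies between one million and ten million. As a small sketch of that bucketing, the hypothetical helper below maps a row count to a label. The bucket list is modeled on the labels used in this listing and is cut off at one billion for brevity; the boundary handling and the final catch-all label are assumptions.

```python
# Upper bounds (exclusive) and their labels; truncated at 1B for brevity.
_SIZE_BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
    (100_000_000, "10M<n<100M"),
    (1_000_000_000, "100M<n<1B"),
]


def size_category(n_rows):
    """Map a row count to a size label (hypothetical helper)."""
    if n_rows < 0:
        raise ValueError("row count must be non-negative")
    for upper, label in _SIZE_BUCKETS:
        if n_rows < upper:
            return label
    # Assumed catch-all for anything at or above one billion rows.
    return "n>=1B"


print(size_category(1_671_614))  # row count from the details list
```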