Datasets
updated
shayekh/perplexity__aya_dataset__train
Viewer
• Updated
• 540k • 25
• 1
argilla/magpie-ultra-v0.1
Viewer
• Updated
• 50k • 300
• 221
Magpie-Align/Magpie-Qwen2-Pro-1M-v0.1
Viewer
• Updated
• 1M • 119
• 14
HuggingFaceTB/smollm-corpus
Viewer
• Updated
• 237M • 22.6k
• 437
Viewer
• Updated
• 100k • 6.13k
• 262
BanglaLLM/bangla-alpaca-orca
Viewer
• Updated
• 172k • 35
• 4
AhmadMustafa/Urdu-Instruct-News-Article-Generation
Viewer
• Updated
• 112k • 47
• 4
AhmadMustafa/Urdu-Instruct-News-Headline-Generation
Viewer
• Updated
• 112k • 7
AhmadMustafa/Urdu-Instruct-News-Category-Classification
Viewer
• Updated
• 112k • 20
Viewer
• Updated
• 10k • 263
• 54
akbargherbal/six_millions_instruction_dataset_for_arabic_llm_ft
Viewer
• Updated
• 6.37M • 18
• 1
CohereLabs/aya_collection_language_split
Viewer
• Updated
• 514M • 2.65k
• 114
Viewer
• Updated
• 63k • 123
• 35
Viewer
• Updated
• 21.9M • 1.48k
• 695
convaiinnovations/Nadi_Indic466k_Instruct
Viewer
• Updated
• 466k • 16
• 2
ai4bharat/indic-instruct-data-v0.1
Viewer
• Updated
• 404k • 252
• 25
Viewer
• Updated
• 9.97k • 15
• 2
MarkrAI/KoCommercial-Dataset
Viewer
• Updated
• 175k • 542
• 164