Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
whr94621
's Collections
LLM_LongContext
LLM_Eval
LLM_Alignment
LLM_Pretrain
LLM_Multilingual
llm_datasets_japanese
llm_datasets_multi
llm_datasets_arabic
llm_synthesis_data
llm_datasets_id
llm_datasets_translation
llm_models_pretrain
llm_datasets_korean
llm_datasets_vi
llm_datasets_ru
llm_datasets_th
curated_sft_data
llm_datasets_multi
updated
May 15
同时设计多种语言的数据集
Upvote
-
SEACrowd/x_fact
Updated
Jun 24
•
2
•
1
juletxara/xstory_cloze
Viewer
•
Updated
May 21, 2023
•
20.6k
•
13k
•
8
juletxara/xstory_cloze_mt
Updated
Jul 21, 2023
•
2
miracl/nomiracl
Updated
Feb 26
•
703
•
10
ayymen/Pontoon-Translations
Viewer
•
Updated
Jan 19
•
3.56M
•
509
•
8
biglam/europeana_newspapers
Viewer
•
Updated
Jan 31
•
5.94M
•
2
•
39
PleIAs/French-PD-Newspapers
Viewer
•
Updated
Mar 19
•
2.25M
•
13
•
61
ontocord/CulturaY
Viewer
•
Updated
Mar 30
•
33.2M
•
196
•
25
Shitao/MLDR
Updated
Feb 6
•
6.64k
•
52
joelniklaus/Multi_Legal_Pile_Commercial
Updated
Oct 18, 2023
•
4
•
8
joelniklaus/eurlex_resources
Updated
May 10, 2023
•
8
•
6
CohereForAI/c4ai-command-r-v01
Text Generation
•
Updated
7 days ago
•
11.5k
•
1.05k
carolina-c4ai/corpus-carolina
Updated
Mar 23, 2023
•
168
•
19
eduagarcia/LegalPT_dedup
Viewer
•
Updated
May 7
•
23.9M
•
17
•
13
PleIAs/YouTube-Commons
Updated
Jun 26
•
21
•
301
Upvote
-
Share collection
View history
Collection guide
Browse collections