tiiuae
/

falcon-7b

Text Generation

text-generation-inference

Model card Files Files and versions Community

slippylolo commited on May 26, 2023

Commit

c1a49e6

•

1 Parent(s): 591607b

Fix typo

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -122,7 +122,7 @@ for seq in sequences:
 ### Training Data
-Falcon-RW-7B was trained on 1,500B tokens of [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb), a high-quality filtered and deduplicated web dataset which we enhanced with curated corpora. Significant components from our curated copora were inspired by The Pile ([Gao et al., 2020](https://arxiv.org/abs/2101.00027)).
 | **Data source**    | **Fraction** | **Tokens** | **Sources**                       |
 |--------------------|--------------|------------|-----------------------------------|

 ### Training Data
+Falcon-7B was trained on 1,500B tokens of [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb), a high-quality filtered and deduplicated web dataset which we enhanced with curated corpora. Significant components from our curated copora were inspired by The Pile ([Gao et al., 2020](https://arxiv.org/abs/2101.00027)).
 | **Data source**    | **Fraction** | **Tokens** | **Sources**                       |
 |--------------------|--------------|------------|-----------------------------------|