BramVanroy committed
Commit 1e5a668 · Parent(s): b106433
Update README.md

README.md CHANGED
@@ -33,7 +33,8 @@ wanted to see if the performance would be reasonable after finetuning this model
 Trained on the [yhavinga/mc4_nl_cleaned](https://huggingface.co/datasets/yhavinga/mc4_nl_cleaned/viewer/tiny/train) dataset (`tiny` partition) for one epoch. The canonical
 validation split was not used but instead 5% of `train` was used as validation.
 
-At 2048 tokens context length, the training set was around 2M (2,008,858) samples, and the model was trained for 1 epoch.
+At 2048 tokens context length, the training set was around 2M (2,008,858) samples, and the model was trained for 1 epoch. That means the model was trained on
+around 4B Dutch tokens (`2048 * 2008858 = 4,114,141,184`).
 
 
 ## Training procedure
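The README notes that the canonical validation split was not used and that 5% of `train` served as validation instead. The commit does not show how that split was made; a minimal sketch of the technique, using a plain shuffled index split (the `split_train_validation` helper and the 42 seed are assumptions, not from the source), could look like:

```python
import random

def split_train_validation(samples, val_fraction=0.05, seed=42):
    """Shuffle indices and hold out `val_fraction` of the samples as validation."""
    rng = random.Random(seed)
    indices = list(range(len(samples)))
    rng.shuffle(indices)
    n_val = int(len(samples) * val_fraction)  # size of the held-out validation set
    val = [samples[i] for i in indices[:n_val]]
    train = [samples[i] for i in indices[n_val:]]
    return train, val

# Illustrative toy data, not the actual mc4_nl_cleaned samples.
train, val = split_train_validation(list(range(1000)))
print(len(train), len(val))  # 950 50
```

In practice a Hugging Face `datasets.Dataset` offers an equivalent built-in, `train_test_split(test_size=0.05)`, which returns the two partitions in one call.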
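The token budget added in this commit follows directly from the two numbers already in the README (fixed context length times number of samples) and can be sanity-checked with simple arithmetic:

```python
# Figures stated in the README diff: 2,008,858 training samples,
# each packed to a fixed context length of 2048 tokens.
context_length = 2048
num_samples = 2_008_858

total_tokens = context_length * num_samples
print(f"{total_tokens:,}")  # 4,114,141,184 -> roughly 4B Dutch tokens
```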