BramVanroy/falcon-7b-ft-alpaca-cleaned-dutch

BramVanroy committed on Jul 2, 2023

Commit 788cbef (1 parent: 227a3af)

Update README.md

Files changed (1)

README.md +15 -8

README.md CHANGED

@@ -2,34 +2,41 @@
 license: cc-by-nc-4.0
 datasets:
 - BramVanroy/alpaca-cleaned-dutch
 model-index:
 - name: falcon-7b-ft-alpaca-cleaned-dutch
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # falcon-7b-ft-alpaca-cleaned-dutch
-This model is a fine-tuned version of [ybelkada/falcon-7b-sharded-bf16](https://huggingface.co/ybelkada/falcon-7b-sharded-bf16) on the BramVanroy/alpaca-cleaned-dutch dataset.
-It achieves the following results on the evaluation set:
-- Loss: 1.5448
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:

 license: cc-by-nc-4.0
 datasets:
 - BramVanroy/alpaca-cleaned-dutch
+language:
+- nl
+inference: false
 model-index:
 - name: falcon-7b-ft-alpaca-cleaned-dutch
   results: []
 ---
 # falcon-7b-ft-alpaca-cleaned-dutch
 ## Model description
+This model is a fine-tuned version of [ybelkada/falcon-7b-sharded-bf16](https://huggingface.co/ybelkada/falcon-7b-sharded-bf16) on the [BramVanroy/alpaca-cleaned-dutch](https://huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch) dataset.
+See the original [Falcon 7B model](https://huggingface.co/tiiuae/falcon-7b/) for more information, intended use, and biases.
 ## Intended uses & limitations
+This model is intended as a baseline for Dutch generative LLMs. It by no means aims to provide SOTA performance and is specifically intended for research purposes.
+Interestingly, the original Falcon 7B model was only trained on English and French. Therefore, Dutch generations should be taken with a massive grain of salt.
 ## Training and evaluation data
+Trained on the synthetic [BramVanroy/alpaca-cleaned-dutch](https://huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch) instruction dataset.
+Therefore, commercial use of this model is forbidden. The model is intended for research purposes only.
 ## Training procedure
+Trained with LoRA and merged before upload.
 ### Training hyperparameters
 The following hyperparameters were used during training:
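
Note on the added "Trained with LoRA and merged before upload" line: the README gives no further detail, so the following is only a minimal sketch of what merging a LoRA adapter generally means, not the author's training or export code. It assumes the standard LoRA formulation, in which a frozen weight matrix `W` receives a low-rank update `(alpha / rank) * B @ A`. All names, shapes, and values (`merge_lora`, `lora_a`, `lora_b`, the toy 2x2 layer) are hypothetical and chosen for illustration.

```python
# Minimal sketch of merging a LoRA update into a frozen weight matrix.
# Everything here is an illustrative assumption; it is not taken from the
# model's actual training or upload scripts.


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(weight, lora_a, lora_b, alpha, rank):
    """Return weight + (alpha / rank) * (lora_b @ lora_a)."""
    scale = alpha / rank
    delta = matmul(lora_b, lora_a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


def linear(weight, x):
    """Apply y = weight @ x for a single input vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weight]


# Toy 2x2 layer with a rank-1 adapter.
weight = [[1.0, 0.0], [0.0, 1.0]]  # frozen base weight, shape (out, in)
lora_a = [[1.0, 2.0]]              # shape (rank, in)
lora_b = [[0.5], [0.25]]           # shape (out, rank)
alpha, rank = 2, 1

merged = merge_lora(weight, lora_a, lora_b, alpha, rank)
print(merged)
print(linear(merged, [1.0, 1.0]))
```

After merging, the adapter matrices are no longer needed at inference time: the single merged matrix produces the same output as the base weight plus the scaled adapter path, which is why a merged checkpoint can be loaded like an ordinary model.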