robinsmits committed
Commit 01f632e
1 parent: 7bdfdf1

Update README.md

Files changed (1): README.md (+7 -4)

README.md CHANGED
@@ -16,7 +16,8 @@ tags:
 
 ## Model description
 
-This adapter model is a fine-tuned version of [openlm-research/open_llama_7b](https://huggingface.co/openlm-research/open_llama_7b) on the [BramVanroy/alpaca-cleaned-dutch](https://www.huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch) dataset.
+This adapter model is a fine-tuned version of [openlm-research/open_llama_7b](https://huggingface.co/openlm-research/open_llama_7b).
+Finetuning was performed on the Dutch [BramVanroy/alpaca-cleaned-dutch](https://www.huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch) dataset, which contains 52K records of instruction-following data translated from English to Dutch.
 
 See [openlm-research/open_llama_7b](https://huggingface.co/openlm-research/open_llama_7b) for all information about the base model.
 
@@ -51,15 +52,17 @@ For more extensive usage and a lot of generated samples (both good and bad samples)
 ## Intended uses & limitations
 
 The open_llama_7b model was primarily trained on the English language. Part of the dataset was a Wikipedia dump containing pages in 20 languages.
-Dutch was one of those languages. Given the size of the total dataset and the wikipedia part the Dutch language was very likely less than 0.5% of the total data.
+Dutch was one of those languages. Given the size of the total dataset and the Wikipedia part, the Dutch language very likely made up less than 0.5% of the total data.
 
-The primary intention of this model is to explore the use of the Dutch language in combination with an Open LLM.
+The generated output and performance of this model for the Dutch language is very likely not always comparable to the various Open-Llama models that have been finetuned on English Alpaca datasets.
+
+The primary intention of this model is to explore and research the use of the Dutch language in combination with an open LLM.
 
 
 ## Training and evaluation data
 
 This model was trained on the [BramVanroy/alpaca-cleaned-dutch](https://www.huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch) dataset.
 
-Commercial use is forbidden. This model is intended for research only.
+Based on the dataset license, only non-commercial use is allowed. Commercial use is strictly forbidden.
 
 ## Training procedure