pdelobelle commited on
Commit
2602fea
1 Parent(s): 59898b9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -20,10 +20,12 @@ tweety-7b-dutch is a foundation model with a focus on the Dutch language, incorp
20
 
21
  Our tweety-7b-dutch model has an Apache 2.0 license, encouraging applications in research, content creation, and language analysis.
22
 
 
 
 
23
  - **Developed by:** KU Leuven and UGent
24
  - **Funded by:** KU Leuven BOF, VSC (Flemish Supercomputer Center), [Vlaams AI-onderzoeksprogramma](https://www.flandersairesearch.be/nl)
25
  - **Model type:** Foundation model
26
- - **Language(s) (NLP):** Dutch
27
  - **License:** Apache 2.0
28
 
29
  ## Uses
 
20
 
21
  Our tweety-7b-dutch model has an Apache 2.0 license, encouraging applications in research, content creation, and language analysis.
22
 
23
+ - **Tokenizer:** Dutch, 50k tokens ([yhavinga/gpt-neo-1.3B-dutch](https://huggingface.co/yhavinga/gpt-neo-1.3B-dutch))
24
+ - **Pre-training data:** Scraped Dutch ([yhavinga/mc4_nl_cleaned](https://huggingface.co/datasets/yhavinga/mc4_nl_cleaned))
25
+ - **Context window**: 8196 tokens
26
  - **Developed by:** KU Leuven and UGent
27
  - **Funded by:** KU Leuven BOF, VSC (Flemish Supercomputer Center), [Vlaams AI-onderzoeksprogramma](https://www.flandersairesearch.be/nl)
28
  - **Model type:** Foundation model
 
29
  - **License:** Apache 2.0
30
 
31
  ## Uses