gramirez-prompsit committed on
Commit
e55b7f9
1 Parent(s): 23b4874

Update README.md

Files changed (1)
  1. README.md +28 -1
README.md CHANGED
@@ -11,4 +11,31 @@ language:
  ---
  This is a pre-release checkpoint for a Nordic generative language model currently in training.
  This preliminary release is provided for HPLT (https://hplt-project.org/) deliverable 4.1 (“First language models trained”)(https://hplt-project.org/deliverables). Consult the HPLT website for further details.
- More documentation will be provided soon.
+ More documentation will be provided soon.
+
+
+ UPDATE: our Nordic model is now called Viking!
+ -------
+
+
+ # Viking 7B, 13B and 33B
+
+ _**NOTE:** These are **research checkpoints** of models for which **training has not been completed.** They are being provided in their current state for research and testing purposes. **Care should be taken when using the outputs of the models.** Once pretraining has completed we intend to release additional instruction-tuned and chat-tuned varieties._
+
+ Viking 7B, 13B and 33B are 7B, 13B and 33B parameter decoder-only transformers pretrained on Finnish,
+ English, Swedish, Danish, Norwegian, Icelandic and code. They are being trained
+ on 2 trillion tokens (1.3 trillion as of this release).
+
+ Viking is a fully open source model and is made available under the Apache 2.0 License.
+
+ Viking was created in a collaboration between the [TurkuNLP group](https://turkunlp.org/) of the University of Turku, [SiloGen](https://www.silo.ai/silogen) from [Silo AI](https://www.silo.ai/), and [High Performance Language Technologies](https://hplt-project.org/) (HPLT). Training was conducted on the [LUMI supercomputer](https://www.lumi-supercomputer.eu/), using compute resources generously provided by [CSC](https://csc.fi/) - IT Center for Science, Finland.
+
+ This project is part of an ongoing effort to create open source large language models for non-English and especially low-resource languages like Finnish. The model is fluent in Finnish, English and the Scandinavian languages, and is capable of basic translation between them. It is also able to understand and generate code.
+
+ More info available at:
+
+ [Viking 7B](https://huggingface.co/LumiOpen/Viking-7B)
+
+ [Viking 13B](https://huggingface.co/LumiOpen/Viking-13B)
+
+ [Viking 33B](https://huggingface.co/LumiOpen/Viking-33B)