Teja-Gollapudi committed on
Commit
8af6be7
1 Parent(s): 7ddf9aa

Update README.md

  1. README.md +17 -0
README.md CHANGED
@@ -10,6 +10,14 @@ pipeline_tag: text-generation
@@ -45,3 +53,12 @@ Baking a cake is a simple process. You will need to prepare a cake mixture, then
 
  # VMware/open-llama-0.3T-7B-instruct-dolly-hhrlhf
 
+ Fully open source, commercially viable.
+
+ The instruction dataset, [mosaicml/dolly_hhrlhf](https://huggingface.co/datasets/mosaicml/dolly_hhrlhf), is under the cc-by-sa-3.0 license, and the language model ([openlm-research/open_llama_7b_preview_300bt](https://huggingface.co/openlm-research/open_llama_7b_preview_300bt/tree/main/open_llama_7b_preview_300bt_transformers_weights)) is under the apache-2.0 license.
+
+ ## Usage
+
+ Please load the tokenizer with the `add_bos_token=True` parameter, as both the underlying OpenLLaMA model and this model were trained with a BOS token.
+
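The tokenizer note above can be sketched roughly as follows. This is a minimal illustration, not the README's own example: it assumes the `transformers` library is installed and that `AutoTokenizer` resolves to a LLaMA-style tokenizer that honors `add_bos_token`. The helper names `load_tokenizer` and `starts_with_bos` are hypothetical and exist only for this sketch.

```python
# Model ID taken from the README heading.
MODEL_ID = "VMware/open-llama-0.3T-7B-instruct-dolly-hhrlhf"


def load_tokenizer(model_id: str = MODEL_ID):
    """Load the tokenizer so that it prepends the BOS token to each prompt.

    Hypothetical helper for illustration; requires `transformers` and
    network access to the Hugging Face Hub when called.
    """
    # Imported lazily so the check below works without transformers installed.
    from transformers import AutoTokenizer

    # add_bos_token=True is the setting the README asks for.
    return AutoTokenizer.from_pretrained(model_id, add_bos_token=True)


def starts_with_bos(input_ids, bos_token_id) -> bool:
    """Return True if an encoded prompt begins with the BOS token ID."""
    return len(input_ids) > 0 and input_ids[0] == bos_token_id


# Example usage (downloads the tokenizer files, so it is left commented out):
# tokenizer = load_tokenizer()
# ids = tokenizer("How do I bake a cake?")["input_ids"]
# print(starts_with_bos(ids, tokenizer.bos_token_id))
```

Checking the first token ID is a cheap way to confirm the setting took effect before generating, since a missing BOS token does not raise an error on its own.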
  ```
  import os
  import torch
 
+ ## Drawbacks
+ <ul>
+ <li>The model was trained on a partially trained OpenLLaMA checkpoint (300B tokens).</li>
+ <li>The model is inconsistent at outputting '\n' tokens, as the majority of the dataset comes from [mosaicml/dolly_hhrlhf](https://huggingface.co/datasets/mosaicml/dolly_hhrlhf), and that dataset removed newline characters from responses.</li>
+ </ul>
+
+ ## Evaluation
+
+ <B>TODO</B>