VMware
/

open-llama-0.3T-7B-instruct-dolly-hhrlhf

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Teja-Gollapudi commited on May 11, 2023

Commit

8af6be7

•

1 Parent(s): 7ddf9aa

Update README.md

Files changed (1) hide show

README.md +17 -0

README.md CHANGED Viewed

@@ -10,6 +10,14 @@ pipeline_tag: text-generation
 # VMware/open-llama-0.3T-7B-instruct-dolly-hhrlhf
 ```
 import os
 import torch
@@ -45,3 +53,12 @@ Baking a cake is a simple process. You will need to prepare a cake mixture, then

 # VMware/open-llama-0.3T-7B-instruct-dolly-hhrlhf
+Fully Open Source, Commerically viable.
+The instruction dataset, [mosaicml/dolly_hhrlhf](https://huggingface.co/datasets/mosaicml/dolly_hhrlhf) is under cc-by-sa-3.0, and the Language Model ([openlm-research/open_llama_7b_preview_300bt](https://huggingface.co/openlm-research/open_llama_7b_preview_300bt/tree/main/open_llama_7b_preview_300bt_transformers_weights)) is under apache-2.0 License.
+## Useage
+Please load the tokenizer with 'add_bos_token = True' parameter as the underlying OpenLLaMa model and this model were trained with a BOS token.
 ```
 import os
 import torch
+## Drawbacks
+<ul>
+<li>The model was trained on a partially trained Open-LLaMA checkpoint. (300B tokens).
+</li>The model is inconsistent with outputting '\n' tokens as majority of the dataset is obtained from [mosaicml/dolly_hhrlhf](https://huggingface.co/datasets/mosaicml/dolly_hhrlhf) and that dataset removed newline characters from responses.
+</ul>
+## Evaluation
+<B>TODO</B>