Teja-Gollapudi committed

Commit efe2d41 • Parent(s): bc1fe6c

Update README.md

README.md (CHANGED)
# VMware/open-llama-0.3T-7B-open-instruct-v1.1

## License

- <b>Commercially viable</b>
- The instruction dataset, [VMware/open-instruct-v1.1-oasst-dolly-hhrlhf](https://huggingface.co/datasets/VMware/open-instruct-v1.1-oasst-dolly-hhrlhf), is under the cc-by-sa-3.0 license, and the language model, [openlm-research/open_llama_7b_preview_300bt](https://huggingface.co/openlm-research/open_llama_7b_preview_300bt/tree/main/open_llama_7b_preview_300bt_transformers_weights), is under the apache-2.0 license.

## Nomenclature

- Model: Open-llama
- Model trained on: 300B (0.3T) tokens
- Model size: 7B parameters
- Dataset: Open-instruct-v1.1 (oasst, dolly, hhrlhf)
## Use in Transformers

…
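The model card's own Transformers example is collapsed in this diff. As a supplementary sketch only, instruction-tuned checkpoints like this one are usually prompted through a fixed template; the Alpaca-style template below is an assumption for illustration, not taken from this diff, so check the model card for the exact format:

```python
# Hypothetical sketch: building an instruction prompt for an
# Alpaca-style instruction-tuned model. The exact template used by
# VMware/open-llama-0.3T-7B-open-instruct-v1.1 may differ.
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)

def build_prompt(instruction: str) -> str:
    """Fill the instruction slot of the template."""
    return PROMPT_TEMPLATE.format(instruction=instruction)

prompt = build_prompt("What is the capital of France?")
print(prompt)
```

The resulting string would then be passed to the tokenizer and `generate()` as shown in the model card's own example.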
## Drawbacks

- The model was trained on a partially trained Open-LLaMA checkpoint (300B tokens).
- The model is inconsistent at outputting '\n' tokens, as the majority of the dataset is obtained from [mosaicml/dolly_hhrlhf](https://huggingface.co/datasets/mosaicml/dolly_hhrlhf), which removed newline characters from responses.
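Because of the missing-newline issue above, downstream code sometimes re-flows long single-line responses for display. A minimal hypothetical post-processing sketch (the `rewrap` helper is illustrative, not part of this model card):

```python
import textwrap

def rewrap(response: str, width: int = 80) -> str:
    """Collapse runs of whitespace, then re-wrap to a fixed width.

    Hypothetical helper: since this checkpoint may omit '\n' tokens,
    callers may want to re-flow long single-line responses for display.
    """
    return textwrap.fill(" ".join(response.split()), width=width)

print(rewrap("The model may emit very long single-line answers.", width=20))
```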
## Evaluation