Teja-Gollapudi committed 602a29a (parent: c0efbe8): Update README.md

README.md CHANGED
@@ -11,7 +11,7 @@ pipeline_tag: conversational
 # VMware/open-llama-0.3T-7B-open-instruct-v1.1
 
 ## License
-- Commercially
+- <b>Commercially Viable</b>
 - Instruction dataset, [VMware/open-instruct-v1.1-oasst-dolly-hhrlhf](https://huggingface.co/datasets/VMware/open-instruct-v1.1-oasst-dolly-hhrlhf) is under the cc-by-sa-3.0 license
 - Language Model ([openlm-research/open_llama_7b_preview_300bt](https://huggingface.co/openlm-research/open_llama_7b_preview_300bt/tree/main/open_llama_7b_preview_300bt_transformers_weights)) is under the apache-2.0 license
 
@@ -65,7 +65,11 @@ This way, the model can better understand the relationship between different parts
 
 ## Drawbacks
 
-- The model was trained on a partially trained Open-LLaMA checkpoint
+- The model was trained on a partially trained Open-LLaMA checkpoint (300B tokens, about 30% of the training life cycle); there is significant potential for improvement when trained on fully trained Open-LLaMA checkpoints
+- From what we have observed, the model struggles with few-shot prompting (we plan to address this in future iterations)
+- When asked for code, it may or may not output it in markdown format
+- It does not indent Python code
 
 ## Evaluation