OpenOrca-Platypus2-13B is a merge of [`garage-bAInd/Platypus2-13B`](https://huggingface.co/garage-bAInd/Platypus2-13B) and [`Open-Orca/OpenOrcaxOpenChat-Preview2-13B`](https://huggingface.co/Open-Orca/OpenOrcaxOpenChat-Preview2-13B).

Thank you to Open-Orca for putting out a beast of a model and dataset! We can't wait for the 70B version (and beyond).

![Platty](./Best_Platty_small.jpeg)

### Benchmark Metrics

| Metric               | Value |
|----------------------|-------|
| MMLU (5-shot)        | 59.5  |
| ARC (25-shot)        | 62.88 |
| HellaSwag (10-shot)  | 83.19 |
| TruthfulQA (0-shot)  | 52.69 |
| Avg.                 | 64.56 |

We use the state-of-the-art [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) to run the benchmark tests above, using the same version as the HuggingFace LLM Leaderboard. Please see below for detailed instructions on reproducing benchmark results.
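For orientation, here is a minimal sketch of running one of these benchmarks through the harness's Python entry point. It is illustrative only: the entry point and argument names vary between harness versions (this assumes a recent `lm-eval` release), so it is not the leaderboard's exact invocation.

```python
# Illustrative only: score one leaderboard task with lm-evaluation-harness.
# Assumes a recent `lm-eval` release; older versions exposed
# lm_eval.evaluator.simple_evaluate with slightly different arguments.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=Open-Orca/OpenOrca-Platypus2-13B,dtype=bfloat16",
    tasks=["arc_challenge"],   # ARC is scored 25-shot on the leaderboard
    num_fewshot=25,
    batch_size=1,
)
print(results["results"]["arc_challenge"])
```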
### Model Details

* **Trained by:** **Platypus2-13B** trained by Cole Hunter & Ariel Lee; **OpenOrcaxOpenChat-Preview2-13B** trained by Open-Orca
* **Model type:** **OpenOrca-Platypus2-13B** is an auto-regressive language model based on the LLaMA 2 transformer architecture.
* **Language(s):** English
* **License for Platypus2-13B base weights:** Non-Commercial Creative Commons license ([CC BY-NC-4.0](https://creativecommons.org/licenses/by-nc/4.0/))
* **License for OpenOrcaxOpenChat-Preview2-13B base weights:** LLaMA 2 Commercial
### Prompt Template for base Platypus2-13B

```
### Instruction:

<prompt> (without the <>)

### Response:
```
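As a quick usage illustration, here is a minimal sketch of loading the merged model with 🤗 Transformers and querying it with the Platypus2 template above. The generation settings and example prompt are placeholders, not the authors' recommended parameters.

```python
# Minimal inference sketch; settings are assumptions, not tuned recommendations.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Open-Orca/OpenOrca-Platypus2-13B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Format the request with the Platypus2 instruction template shown above.
prompt = (
    "### Instruction:\n\n"
    "Explain the difference between a list and a tuple in Python.\n\n"
    "### Response:\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256, temperature=0.7, do_sample=True)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```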
### Prompt Template for base OpenOrcaxOpenChat-Preview2-13B

OpenChat Llama2 V1: see [Open-Orca's page](https://huggingface.co/Open-Orca/OpenOrcaxOpenChat-Preview2-13B) for additional information.
### Training Datasets

`garage-bAInd/Platypus2-13B` was trained using the STEM and logic based dataset [`garage-bAInd/Open-Platypus`](https://huggingface.co/datasets/garage-bAInd/Open-Platypus).

Please see our [paper](https://platypus-llm.github.io/Platypus.pdf) and [project webpage](https://platypus-llm.github.io) for additional information.

[`Open-Orca/OpenOrcaxOpenChat-Preview2-13B`](https://huggingface.co/Open-Orca/OpenOrcaxOpenChat-Preview2-13B) was trained using a refined, 220k-example subset of the [OpenOrca dataset](https://huggingface.co/datasets/Open-Orca/OpenOrca).
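For reference, both public datasets can be pulled from the Hugging Face Hub with the `datasets` library; the refined 220k OpenOrca subset is not published under a separate name, so the sketch below simply loads the full public releases.

```python
# Illustrative only: load the public training datasets from the Hugging Face Hub.
from datasets import load_dataset

open_platypus = load_dataset("garage-bAInd/Open-Platypus", split="train")
open_orca = load_dataset("Open-Orca/OpenOrca", split="train")  # large download

print(open_platypus)  # STEM/logic instruction data used for Platypus2-13B
print(open_orca)      # full OpenOrca; the refined 220k subset is a filtered selection
```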
### Training Procedure

`garage-bAInd/Platypus2-13B` was instruction fine-tuned using LoRA on 1 A100 80GB. For training details and inference instructions please see the [Platypus](https://github.com/arielnlee/Platypus) GitHub repo.
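To make the LoRA setup concrete, here is a minimal sketch of an instruction-tuning configuration with the `peft` library. The base model, rank, target modules, and other hyperparameters are placeholders for illustration, not the values actually used for Platypus2-13B (see the Platypus repo for those).

```python
# Illustrative LoRA fine-tuning setup with peft; hyperparameters are placeholders,
# not the values used to train Platypus2-13B (see the Platypus repo for those).
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Placeholder base checkpoint (gated; requires access to Llama 2 weights).
base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-13b-hf", torch_dtype=torch.bfloat16, device_map="auto"
)

lora_config = LoraConfig(
    r=16,              # adapter rank (placeholder)
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # attention projections
)

model = get_peft_model(base, lora_config)
model.print_trainable_parameters()  # only the LoRA adapter weights are trainable
```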