Voicelab
/

trurl-2-13b

Text Generation

text-generation-inference

Model card Files Files and versions Community

Wojx commited on Aug 16, 2023

Commit

c67e207

•

1 Parent(s): 05d6ae3

Update README.md

Add MMLU benchmark results

Files changed (1) hide show

README.md +5 -5

README.md CHANGED Viewed

@@ -52,11 +52,11 @@ To get the expected features and performance for the chat versions, a specific L
 # Evaluation Results
 |Model | Size| hellaswag | arc_challenge | MMLU|
 |---|---|---|---|---|
-| Llama-2-chat | 7B |  78.55% |  52.9% | |
-| Llama-2-chat | 13B |  81.94% |  59.04% | |
-| Trurl 2.0 (with MMLU) | 13B | 80.09% | 59.30% |
-| Trurl 2.0 (no MMLU) | 13B | TO-DO | TO-DO | |
-| Trurl 2.0 | 7b | TO-DO | TO-DO |
 <img src="https://voicelab.ai/wp-content/uploads/trurl-hero.webp" alt="trurl graphic" style="width:100px;"/>

 # Evaluation Results
 |Model | Size| hellaswag | arc_challenge | MMLU|
 |---|---|---|---|---|
+| Llama-2-chat | 7B |  78.55% |  52.9% | 48.32% |
+| Llama-2-chat | 13B |  81.94% |  59.04% | 54.64% |
+| Trurl 2.0 (with MMLU) | 13B | 80.09% | 59.30% | 78.35% |
+| Trurl 2.0 (no MMLU) | 13B | TO-DO | TO-DO | TO-DO|
+| Trurl 2.0 | 7b | TO-DO | TO-DO | TO-DO|
 <img src="https://voicelab.ai/wp-content/uploads/trurl-hero.webp" alt="trurl graphic" style="width:100px;"/>