tlphams
/

Wizard-Zephyr-Orpo-8x22B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

tlphams commited on May 6

Commit

5dceb6c

•

1 Parent(s): 9aa112f

Update README.md

Files changed (1) hide show

README.md +9 -9

README.md CHANGED Viewed

@@ -22,14 +22,14 @@ The following models were included in the merge:
 ## Benchmark results
 ### 1. MT-Bench from lmsys
 We adapted the code from [FastChat](https://github.com/lm-sys/FastChat/tree/main/fastchat/llm_judge) to benchmark our model with GPT-4 as a judge. Here is the result
-```
-|       | Model                    | Turn | Score    |
-|-------|--------------------------|------|----------|
-| First | tlphams/Wizard-Zephyr-Orpo-8x22B      | 1    | 9.1625   |
-|       | mistralai/Mixtral-8x22B-Instruct-v0.1   | 1    | 9.1500   |
-| Second| tlphams/Wizard-Zephyr-Orpo-8x22B      | 2    | 8.873418 |
-|       | mistralai/Mixtral-8x22B-Instruct-v0.1   | 2    | 8.250000 |
-| Average| tlphams/Wizard-Zephyr-Orpo-8x22B     |      | 9.018868 |
-|        | mistralai/Mixtral-8x22B-Instruct-v0.1  |      | 8.700000 |
 ```
 The score is slightly lower than [alpindale/WizardLM-2-8x22B](https://huggingface.co/alpindale/WizardLM-2-8x22B), but still higher than GPT-4-0314. Then the research and experimental work still need to continue ^^

 ## Benchmark results
 ### 1. MT-Bench from lmsys
 We adapted the code from [FastChat](https://github.com/lm-sys/FastChat/tree/main/fastchat/llm_judge) to benchmark our model with GPT-4 as a judge. Here is the result
+```markdown
+|        | Model                                   | Turn | Score    |
+|--------|-----------------------------------------|------|----------|
+| First  | tlphams/Wizard-Zephyr-Orpo-8x22B        | 1    | 9.1625   |
+|        | mistralai/Mixtral-8x22B-Instruct-v0.1   | 1    | 9.1500   |
+| Second | tlphams/Wizard-Zephyr-Orpo-8x22B        | 2    | 8.873418 |
+|        | mistralai/Mixtral-8x22B-Instruct-v0.1   | 2    | 8.250000 |
+| Average| tlphams/Wizard-Zephyr-Orpo-8x22B        |      | 9.018868 |
+|        | mistralai/Mixtral-8x22B-Instruct-v0.1   |      | 8.700000 |
 ```
 The score is slightly lower than [alpindale/WizardLM-2-8x22B](https://huggingface.co/alpindale/WizardLM-2-8x22B), but still higher than GPT-4-0314. Then the research and experimental work still need to continue ^^