---
base_model:
- alpindale/WizardLM-2-8x22B
- HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1
library_name: transformers
tags:
- mergekit
- merge
license: cc-by-nc-sa-4.0
---

# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Models Merged

The following models were included in the merge:

* [alpindale/WizardLM-2-8x22B](https://huggingface.co/alpindale/WizardLM-2-8x22B)
* [HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1](https://huggingface.co/HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1)
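
The exact merge configuration is not reproduced here. As a rough illustration only, a two-model mergekit config tends to look like the sketch below; the merge method and weights shown are placeholders, not the settings actually used for this model.

```yaml
# Hypothetical mergekit config -- merge_method and weights are
# assumptions for illustration, not the actual settings used here.
models:
  - model: alpindale/WizardLM-2-8x22B
    parameters:
      weight: 0.5
  - model: HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1
    parameters:
      weight: 0.5
merge_method: linear
dtype: bfloat16
```

A config like this is typically run with `mergekit-yaml config.yml ./output-model-directory`.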

## Benchmark Results

### 1. MT-Bench from lmsys

We adapted the code from [FastChat](https://github.com/lm-sys/FastChat/tree/main/fastchat/llm_judge) to benchmark our model with GPT-4 as a judge. Here are the results:

| Turn    | Model                                 | Score    |
|---------|---------------------------------------|----------|
| First   | tlphams/Wizard-Zephyr-Orpo-8x22B      | 9.1625   |
| First   | mistralai/Mixtral-8x22B-Instruct-v0.1 | 9.1500   |
| Second  | tlphams/Wizard-Zephyr-Orpo-8x22B      | 8.873418 |
| Second  | mistralai/Mixtral-8x22B-Instruct-v0.1 | 8.250000 |
| Average | tlphams/Wizard-Zephyr-Orpo-8x22B      | 9.018868 |
| Average | mistralai/Mixtral-8x22B-Instruct-v0.1 | 8.700000 |

The average score is slightly lower than that of [alpindale/WizardLM-2-8x22B](https://huggingface.co/alpindale/WizardLM-2-8x22B), but still higher than GPT-4-0314, so the research and experimental work continue ^^
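
For reference, the evaluation broadly follows FastChat's llm_judge pipeline. The sketch below is an approximation: the script names come from the FastChat repository linked above, but the exact flags can differ between FastChat versions.

```bash
# Run from FastChat's fastchat/llm_judge directory.
# Step 1: generate the model's answers to the MT-Bench questions.
python gen_model_answer.py \
    --model-path tlphams/Wizard-Zephyr-Orpo-8x22B \
    --model-id Wizard-Zephyr-Orpo-8x22B

# Step 2: have GPT-4 judge the answers (requires OPENAI_API_KEY).
python gen_judgment.py \
    --model-list Wizard-Zephyr-Orpo-8x22B \
    --judge-model gpt-4

# Step 3: print the per-turn and average scores.
python show_result.py --model-list Wizard-Zephyr-Orpo-8x22B
```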