hfl
/

chinese-mixtral-gguf

Mixture of Experts

Inference Endpoints

Model card Files Files and versions

hfl-rc commited on Jan 25

Commit

c210776

•

1 Parent(s): 989ba6c

Update README.md

Files changed (1) hide show

README.md +30 -1

README.md CHANGED Viewed

@@ -4,4 +4,33 @@ language:
 - zh
 - en
 ---
-Work-in-progress (WIP)

 - zh
 - en
 ---
+# Chinese-Mixtral-GGUF
+This repository contains the GGUF-v3 models (llama.cpp compatible) for **Chinese-Mixtral** (this is not a chat/instruction model).
+## Performance
+Metric: PPL, lower is better
+| Quant | PPL  |
+| ----- | ---- |
+| Q2_K  |      |
+| Q3_K  |      |
+| Q4_0  |      |
+| Q4_K  |      |
+| Q5_0  |      |
+| Q5_K  |      |
+| Q6_K  |      |
+| Q8_0  |      |
+| F16   |      |
+Due to the file size limitation, for F16 model, please use `cat` command to concatenate all parts into a single file. **You must concatenate these parts in order.**
+## Others
+For Hugging Face version, please see: https://huggingface.co/hfl/chinese-mixtral
+Please refer to [https://github.com/ymcui/Chinese-Mixtral/](https://github.com/ymcui/Chinese-Mixtral/) for more details.