bjoernp commited on
Commit
122a032
1 Parent(s): ff7a696

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -1
README.md CHANGED
@@ -7,7 +7,10 @@ library_name: transformers
7
 
8
  # Llama3_DiscoLeo_8B_DARE_Experimental_4bit_awq_glc
9
 
10
- This model is a 4 bit quantization of [DiscoResearch/Llama3_DiscoLeo_8B_DARE_Experimental](https://huggingface.co/DiscoResearch/Llama3_DiscoLeo_8B_DARE_Experimental).
 
 
 
11
  Copy of the original model card:
12
 
13
 
 
7
 
8
  # Llama3_DiscoLeo_8B_DARE_Experimental_4bit_awq_glc
9
 
10
+ This model is a 4 bit quantization of [DiscoResearch/Llama3_DiscoLeo_8B_DARE_Experimental](https://huggingface.co/DiscoResearch/Llama3_DiscoLeo_8B_DARE_Experimental)
11
+ created using [AutoAWQ](https://github.com/casper-hansen/AutoAWQ) with a custom bilingual calibration dataset and `quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}`.
12
+
13
+
14
  Copy of the original model card:
15
 
16