uukuguy/SynthIA-7B-v1.3-dare-0.85


uukuguy committed on Nov 24, 2023

Commit 068c0f4 • 1 Parent(s): 91381d0

Update README.md

Files changed (1)

README.md +5 -0

@@ -1,6 +1,11 @@
 ---
 license: llama2
 ---
 Experiment for DARE(Drop and REscale), most of the delta parameters can be directly set to zeros without affecting the capabilities of SFT LMs and larger models can tolerate a higher proportion of discarded parameters.
 weight_mask_rate: 0.85 / use_weight_rescale: True / mask_stratery: random / scaling_coefficient: 1.0

 ---
 license: llama2
 ---
+* [AWQ model(s) for GPU inference.](https://huggingface.co/TheBloke/SynthIA-7B-v1.3-dare-0.85-AWQ)
+* [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/SynthIA-7B-v1.3-dare-0.85-GPTQ)
+* [2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference](https://huggingface.co/TheBloke/SynthIA-7B-v1.3-dare-0.85-GGUF)
 Experiment for DARE(Drop and REscale), most of the delta parameters can be directly set to zeros without affecting the capabilities of SFT LMs and larger models can tolerate a higher proportion of discarded parameters.
 weight_mask_rate: 0.85 / use_weight_rescale: True / mask_stratery: random / scaling_coefficient: 1.0
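
For readers unfamiliar with the settings named in the README, here is a minimal sketch of what a drop-and-rescale step could look like, assuming flat lists of weights and a uniform random mask. It is not the author's implementation: the function name `dare_merge`, the `seed` argument, and the way `scaling_coefficient` is applied (as a multiplier on the surviving delta before it is added back to the base weights) are assumptions made for illustration.

```python
import random


def dare_merge(base, finetuned, weight_mask_rate=0.85,
               use_weight_rescale=True, scaling_coefficient=1.0, seed=0):
    """Hypothetical DARE-style merge over two flat lists of floats.

    Each delta (finetuned - base) is dropped with probability
    weight_mask_rate; surviving deltas are optionally rescaled by
    1 / (1 - weight_mask_rate) so the expected delta is unchanged.
    """
    if not 0.0 <= weight_mask_rate < 1.0:
        raise ValueError("weight_mask_rate must be in [0, 1)")
    if len(base) != len(finetuned):
        raise ValueError("base and finetuned must have the same length")

    rng = random.Random(seed)  # assumption: seeded only for reproducibility
    keep_scale = 1.0 / (1.0 - weight_mask_rate) if use_weight_rescale else 1.0

    merged = []
    for b, f in zip(base, finetuned):
        delta = f - b
        if rng.random() < weight_mask_rate:
            delta = 0.0          # drop: reset this delta to zero
        else:
            delta *= keep_scale  # rescale the surviving delta
        # Assumption: scaling_coefficient multiplies the delta on merge.
        merged.append(b + scaling_coefficient * delta)
    return merged


if __name__ == "__main__":
    base = [0.0] * 10
    finetuned = [1.0] * 10
    print(dare_merge(base, finetuned))
```

With `weight_mask_rate=0.85`, roughly 85% of the deltas are zeroed and the rest are multiplied by 1 / 0.15, so the average delta stays close to its original value.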