--- base_model: Aryanne/Astrohermes-3B inference: false language: - en library_name: transformers license: cc-by-sa-4.0 model_creator: Aryanne model_name: Astrohermes-3B pipeline_tag: text-generation quantized_by: afrideva tags: - gpt - llm - stablelm - gguf - ggml - quantized - q2_k - q3_k_m - q4_k_m - q5_k_m - q6_k - q8_0 --- # Aryanne/Astrohermes-3B-GGUF Quantized GGUF model files for [Astrohermes-3B](https://huggingface.co/Aryanne/Astrohermes-3B) from [Aryanne](https://huggingface.co/Aryanne) | Name | Quant method | Size | | ---- | ---- | ---- | | [astrohermes-3b.fp16.gguf](https://huggingface.co/afrideva/Astrohermes-3B-GGUF/resolve/main/astrohermes-3b.fp16.gguf) | fp16 | 5.59 GB | | [astrohermes-3b.q2_k.gguf](https://huggingface.co/afrideva/Astrohermes-3B-GGUF/resolve/main/astrohermes-3b.q2_k.gguf) | q2_k | 1.20 GB | | [astrohermes-3b.q3_k_m.gguf](https://huggingface.co/afrideva/Astrohermes-3B-GGUF/resolve/main/astrohermes-3b.q3_k_m.gguf) | q3_k_m | 1.39 GB | | [astrohermes-3b.q4_k_m.gguf](https://huggingface.co/afrideva/Astrohermes-3B-GGUF/resolve/main/astrohermes-3b.q4_k_m.gguf) | q4_k_m | 1.71 GB | | [astrohermes-3b.q5_k_m.gguf](https://huggingface.co/afrideva/Astrohermes-3B-GGUF/resolve/main/astrohermes-3b.q5_k_m.gguf) | q5_k_m | 1.99 GB | | [astrohermes-3b.q6_k.gguf](https://huggingface.co/afrideva/Astrohermes-3B-GGUF/resolve/main/astrohermes-3b.q6_k.gguf) | q6_k | 2.30 GB | | [astrohermes-3b.q8_0.gguf](https://huggingface.co/afrideva/Astrohermes-3B-GGUF/resolve/main/astrohermes-3b.q8_0.gguf) | q8_0 | 2.97 GB | ## Original Model Card: This model is a mix of [PAIXAI/Astrid-3B](https://huggingface.co/PAIXAI/Astrid-3B) + [jondurbin/airoboros-3b-3p0](https://huggingface.co/jondurbin/airoboros-3b-3p0) + [cxllin/StableHermes-3b](https://huggingface.co/cxllin/StableHermes-3b), as shown in the yaml(see Astrohermes.yml or below). [Aryanne/Astridboros-3B](https://huggingface.co/Aryanne/Astridboros-3B) = PAIXAI/Astrid-3B + jondurbin/airoboros-3b-3p0 ```yaml slices: - sources: - model: Aryanne/Astridboros-3B layer_range: [0, 15] - sources: - model: cxllin/StableHermes-3b layer_range: [15, 16] - sources: - model: Aryanne/Astridboros-3B layer_range: [16, 17] - sources: - model: cxllin/StableHermes-3b layer_range: [17, 18] - sources: - model: Aryanne/Astridboros-3B layer_range: [18, 19] - sources: - model: cxllin/StableHermes-3b layer_range: [19, 20] - sources: - model: Aryanne/Astridboros-3B layer_range: [20, 21] - sources: - model: cxllin/StableHermes-3b layer_range: [21, 22] - sources: - model: Aryanne/Astridboros-3B layer_range: [22, 23] - sources: - model: cxllin/StableHermes-3b layer_range: [23, 24] - sources: - model: Aryanne/Astridboros-3B layer_range: [24, 32] merge_method: passthrough dtype: float16 ```