![image/png](https://miro.medium.com/v2/resize:fit:720/format:webp/1*IaE7RJk6JffW8og-MOnYCA.png)

## Description

In this repository you can find quantised GGUF variants of [ILSP's Meltemi-7B-Instruct-v1](https://huggingface.co/ilsp/Meltemi-7B-Instruct-v1) model, created using [llama.cpp](https://github.com/ggerganov/llama.cpp).
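One of these files can be fetched and run locally with llama.cpp. A minimal sketch, assuming llama.cpp has been built (the inference binary is named `llama-cli` in recent builds) and that the `huggingface-cli` tool from `huggingface_hub` is installed; the commands are printed as a dry run, so drop the `echo`s to actually execute them:

```shell
# Dry-run sketch: print the download and inference commands for one variant.
# Assumes llama.cpp has been built (providing `llama-cli`) and that the
# `huggingface-cli` tool from huggingface_hub is installed.
REPO=ilsp/Meltemi-7B-Instruct-v1-GGUF
FILE=meltemi-instruct-v1_q5_K_M.bin

echo huggingface-cli download "$REPO" "$FILE" --local-dir .
echo ./llama-cli -m "./$FILE" -p "'Your prompt here'" -n 256
```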
## Provided files (Use case column taken from the llama.cpp documentation)

Based on the information
| Name | Quant method | Bits | Size | Approx. RAM required | Use case |
| ---- | ---- | ---- | ---- | ---- | ----- |
| [meltemi-instruct-v1_q3_K_M.bin](https://huggingface.co/ilsp/Meltemi-7B-Instruct-v1-GGUF/blob/main/meltemi-instruct-v1_q3_K_M.bin) | Q3_K_M | 3 | 3.67 GB | 6.45 GB | very small, very high quality loss |
| [meltemi-instruct-v1_q5_K_M.bin](https://huggingface.co/ilsp/Meltemi-7B-Instruct-v1-GGUF/blob/main/meltemi-instruct-v1_q5_K_M.bin) | Q5_K_M | 5 | 5.31 GB | 8.1 GB | large, very low quality loss - recommended |