Update README.md
README.md
CHANGED
@@ -5,7 +5,7 @@ inference: false
 ## Model Details
 **Model Description:**
 
-This model is a
+This model is an 8-bit quantized version of the Meta Llama 3 8B Instruct large language model (LLM). Quantization reduces the model's size and can improve inference speed, making it better suited to deployment on devices with limited computational resources. The original Llama 3 8B Instruct model is an autoregressive, transformer-based LLM trained on a large dataset of text and code. It is fine-tuned for instruction following and optimized for dialogue tasks.
 
 ---
 ## Original README
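The new description mentions 8-bit quantization without saying which scheme produced this checkpoint. As a rough illustration only, the sketch below shows symmetric absmax quantization, one common way to map floating-point weights to 8-bit integers. The function names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical and are not taken from this repository.

```python
# Minimal sketch of symmetric absmax 8-bit quantization.
# Assumption: this is NOT necessarily the method used for this model;
# all names and values here are illustrative.

def quantize_int8(weights):
    """Map floats to integer codes in [-127, 127] using one shared scale."""
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero input, where any scale would do.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


def dequantize_int8(codes, scale):
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]


weights = [0.5, -1.27, 0.003, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize_int8(codes, scale)

# Each restored value differs from the original by at most about scale / 2.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(codes, scale, max_error)
```

Each weight is stored as one byte plus a shared scale, which is where the size reduction comes from; the cost is a rounding error of at most about half the scale per weight.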