GGUF
Inference Endpoints
Edit model card
README.md exists but content is empty. Use the Edit model card button to edit it.
Downloads last month
70
GGUF
Model size
6.94B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this model's library. Check the docs .