
Llama 3.1 8B Instruct models in GGUF format, which can run on macOS, Windows, or Linux PCs, as well as on cell phones and other small devices.

This repo focuses on small, high-quality LLMs that are easy to run for chatting with PDFs on macOS, balancing output quality against inference speed.

If you are a Mac user, the following free AI tools can help you read and understand PDFs effectively:

  • If you use Zotero to manage and read your personal PDFs, zotero-chatpdf is a free plugin that lets you chat with them using your local Llama 3.1 model.

  • You can download the ChatPDFLocal macOS app from here, load a single PDF or a batch of PDF files, and quickly try out the model by chatting with your documents. PS: Click here to subscribe and you can use ChatPDFLocal for free.

The default model used by the local LLM is ggml-model-Q3_K_M.gguf. You can also load any customized open-source model that fits your Mac's hardware configuration by entering its Hugging Face repo.
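As a rough guide to which quantization fits a given Mac, the sketch below estimates the weight size of an 8.03B-parameter model at nominal bit widths of 2, 3, 4, 8, and 16. It is a back-of-the-envelope lower bound, not a measurement. Real GGUF files are usually somewhat larger, because K-quants such as Q3_K_M mix precisions and the file also carries metadata. The 2 GiB headroom for context and runtime overhead is an assumption, and the function names are illustrative only.

```python
# Rough lower-bound estimate of model weight size at a nominal bit width.
# Assumption: size ~= parameter_count * bits_per_weight / 8 bytes.
# Actual GGUF files are typically larger than this estimate.

PARAMS = 8.03e9  # parameter count of this model


def estimated_size_gib(bits_per_weight: float, params: float = PARAMS) -> float:
    """Return the approximate weight size in GiB for a nominal bit width."""
    return params * bits_per_weight / 8 / 2**30


def fits_in_memory(bits_per_weight: float, ram_gib: float,
                   headroom_gib: float = 2.0) -> bool:
    """Check whether the estimated weights plus headroom fit in ram_gib.

    headroom_gib is an assumed allowance for context and runtime overhead.
    """
    return estimated_size_gib(bits_per_weight) + headroom_gib <= ram_gib


if __name__ == "__main__":
    for bits in (2, 3, 4, 8, 16):
        size = estimated_size_gib(bits)
        verdict = "fits" if fits_in_memory(bits, ram_gib=8) else "too large"
        print(f"{bits:>2}-bit: ~{size:.1f} GiB ({verdict} on an 8 GiB Mac)")
```

Under these assumptions, a 3-bit model leaves comfortable room on an 8 GiB machine, while the 16-bit weights alone exceed it.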

Enjoy, thank you!

Format: GGUF
Model size: 8.03B params
Architecture: llama
Quantizations: 2-bit, 3-bit, 4-bit, 8-bit, 16-bit
