Model Depot
Collection
Leading generative models packaged in OpenVino format optimized for use on AI PCs
•
50 items
•
Updated
•
5
llama-2-chat-ov is an OpenVino int4 quantized version of Llama-2-Chat, providing a fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
llama-2-chat is the official chat finetuned version of Llama2, and is one of the classic and best all-around chat models from 2023.
Base model
meta-llama/Llama-2-7b-chat-hf