Apple Neural Engine LLMs
Collection
CoreML LLMs optimized for Apple Neural Engine.
•
3 items
•
Updated
•
1
CoreML conversion of Llama 2 7B from smpanaro/Llama-2-7b-NuGPTQ.
Use this CLI to download and run inference. macOS 14 (Sonoma) is required.
Base model
meta-llama/Llama-2-7b-hf