neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16 Text Generation • Updated Oct 10 • 26.8k • 27
neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic Text Generation • Updated Oct 17 • 27.5k • 13