Edit Models filters

Inference status

Misc

8-bit precision

Misc with no match

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

4

Full-text search

Active filters: llmcompressor

neuralmagic/Llama-3.2-1B-Instruct-quantized.w8a8

Text Generation • Updated Oct 16 • 2.14k • 3

neuralmagic/Llama-3.2-3B-Instruct-FP8

Text Generation • Updated Oct 16 • 13.1k • 2

neuralmagic/Llama-3.2-3B-Instruct-quantized.w8a8

Text Generation • Updated Oct 16 • 1.67k • 1

neuralmagic/Llama-3.2-1B-Instruct-FP8

Text Generation • Updated Oct 16 • 261k • 1