Text Generation
Transformers
llama
Inference Endpoints
4-bit precision
exl2