ssaroya/gptq_model
arxiv:2302.13971
arxiv:2210.17323
adcfac4
gptq_model/quant
1 contributor, History: 2 commits
ssaroya  Upload 7 files  401522d  over 1 year ago
File                Scan  Size       Last commit                   Updated
__init__.py         Safe  312 Bytes  Upload 7 files                over 1 year ago
custom_autotune.py  Safe  8.78 kB    Upload 7 files                over 1 year ago
fused_attn.py       Safe  8.6 kB     Upload 7 files                over 1 year ago
fused_mlp.py        Safe  11.9 kB    Upload 7 files                over 1 year ago
placeholder.txt     Safe  4 Bytes    Create quant/placeholder.txt  over 1 year ago
quant_linear.py     Safe  18.3 kB    Upload 7 files                over 1 year ago
quantizer.py        Safe  4.26 kB    Upload 7 files                over 1 year ago
triton_norm.py      Safe  3.12 kB    Upload 7 files                over 1 year ago