stan-hua's picture
Push folder to HuggingFace Hub
61899f9 verified
raw
history blame
178 Bytes
DEFAULT_stage:
DEFAULT_modifiers:
SmoothQuantModifier: {smoothing_strength: 0.8}
QuantizationModifier:
ignore: [lm_head]
targets: Linear
scheme: W8A8