Add quantize_config.json file to make it work with engines like vLLM 36f6209 Pernekhan commited on Jan 31