OpenSourceRonin committed on
Commit
ba9ead3
1 Parent(s): 43cfd16

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -71,3 +71,5 @@ Read tech report at [**Tech Report**](https://github.com/microsoft/VPTQ/blob/mai
  | Reproduced from the tech report | [HF 🤗](https://huggingface.co/collections/VPTQ-community/reproduced-vptq-tech-report-baseline-66fbf1dffe741cc9e93ecf04) | Results from the open source community for reference only, please use them responsibly.|
  | Hessian and Inverse Hessian Matrix | [HF 🤗](https://huggingface.co/collections/VPTQ-community/hessian-and-invhessian-checkpoints-66fd249a104850d17b23fd8b) | Collected from RedPajama-Data-1T-Sample, following [Quip#](https://github.com/Cornell-RelaxML/quip-sharp/blob/main/quantize_llama/hessian_offline_llama.py)|
 
+ ## A Space Demo
+ A live-chatbot is created with [VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k32768-0-woft](https://huggingface.co/spaces/OpenSourceRonin/LLM-2Bit) over VPTQ.