Update README.md
README.md
CHANGED
@@ -10,7 +10,7 @@ widget:
 
 ## MiniChat-3B
 
-📑 [arXiv]() | 👻 [GitHub](https://github.com/GeneZC/MiniMA) | 🤗 [HuggingFace-MiniMA](https://huggingface.co/GeneZC/MiniMA-3B) | 🤗 [HuggingFace-MiniChat](https://huggingface.co/GeneZC/MiniChat-3B) | 🤖 [ModelScope-MiniMA](https://modelscope.cn/models/GeneZC/MiniMA-3B) | 🤖 [ModelScope-MiniChat](https://modelscope.cn/models/GeneZC/MiniChat-3B)
+📑 [arXiv](https://arxiv.org/abs/2311.07052) | 👻 [GitHub](https://github.com/GeneZC/MiniMA) | 🤗 [HuggingFace-MiniMA](https://huggingface.co/GeneZC/MiniMA-3B) | 🤗 [HuggingFace-MiniChat](https://huggingface.co/GeneZC/MiniChat-3B) | 🤖 [ModelScope-MiniMA](https://modelscope.cn/models/GeneZC/MiniMA-3B) | 🤖 [ModelScope-MiniChat](https://modelscope.cn/models/GeneZC/MiniChat-3B)
 
 ❗ Must comply with LICENSE of LLaMA2 since it is derived from LLaMA2.
 
@@ -62,6 +62,6 @@ output = tokenizer.decode(output_ids, skip_special_tokens=True).strip()
 title={Towards the Law of Capacity Gap in Distilling Language Models},
 author={Zhang, Chen and Song, Dawei and Ye, Zheyu and Gao, Yan},
 year={2023},
-url={}
+url={https://arxiv.org/abs/2311.07052}
 }
 ```