Add github repo link
Browse files
README.md
CHANGED
@@ -114,6 +114,8 @@ The distillation dataset is composed of about 700k multilingual sentences pairs
|
|
114 |
- [castorini/mr-tydi](https://huggingface.co/datasets/castorini/mr-tydi)
|
115 |
- [quora](https://huggingface.co/datasets/quora)
|
116 |
|
|
|
|
|
117 |
|
118 |
[Multilingual E5 Text Embeddings: A Technical Report](https://arxiv.org/pdf/2402.05672).
|
119 |
Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei, arXiv 2024
|
|
|
114 |
- [castorini/mr-tydi](https://huggingface.co/datasets/castorini/mr-tydi)
|
115 |
- [quora](https://huggingface.co/datasets/quora)
|
116 |
|
117 |
+
For code, see [this github repository](https://github.com/Avditvs/matryoshka_factory)
|
118 |
+
|
119 |
|
120 |
[Multilingual E5 Text Embeddings: A Technical Report](https://arxiv.org/pdf/2402.05672).
|
121 |
Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei, arXiv 2024
|