juliehunter
commited on
Commit
•
c63a148
1
Parent(s):
30ca4c9
Update README.md
Browse files
README.md
CHANGED
@@ -43,7 +43,7 @@ datasets:
|
|
43 |
|
44 |
# Claire-7B-0.1
|
45 |
|
46 |
-
**Claire-7B-0.1 is a 7B parameter causal decoder-only model built by [LINAGORA](https://labs.linagora.com/)
|
47 |
**adapted from [Falcon-7b](https://huggingface.co/tiiuae/falcon-7b) on French conversational data.**
|
48 |
|
49 |
Quantized versions in GGUF format can be found in [TheBloke/Claire-7B-0.1-GGUF](https://huggingface.co/TheBloke/Claire-7B-0.1-GGUF).
|
@@ -229,6 +229,15 @@ Please note that the model can generate disfluencies and humorous responses as a
|
|
229 |
|
230 |
More evaluation details will be provided in a separate publication.
|
231 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
232 |
## License
|
233 |
|
234 |
Given that some of the corpora used for training are only available under CC-BY-NC-SA licenses,
|
@@ -236,14 +245,31 @@ Claire-7B-0.1 is made available under the [CC-BY-NC-SA 4.0 license](https://crea
|
|
236 |
|
237 |
You can find a variant of this model published under the Apache 2.0 license at [OpenLLM-France/Claire-7B-Apache-0.1](https://huggingface.co/OpenLLM-France/Claire-7B-Apache-0.1).
|
238 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
239 |
## Acknowledgements
|
240 |
|
241 |
This work was performed using HPC resources from GENCI–IDRIS (Grant 2023-AD011014561).
|
242 |
|
243 |
-
Claire-7B-0.1 was created by members of [LINAGORA](https://labs.linagora.com/)
|
244 |
|
245 |
Special thanks to partners from the OpenLLM-France community, especially Christophe Cerisara (LORIA), Pierre-Carl Langlais and Anastasia Stasenko (OpSci), and Pierre Colombo, for valuable advice.
|
246 |
|
|
|
247 |
## Contact
|
248 |
|
249 |
contact@openllm-france.fr
|
|
|
43 |
|
44 |
# Claire-7B-0.1
|
45 |
|
46 |
+
**Claire-7B-0.1 is a 7B parameter causal decoder-only model built by [LINAGORA](https://labs.linagora.com/) with the support of [OpenLLM-France](https://github.com/OpenLLM-France)**
|
47 |
**adapted from [Falcon-7b](https://huggingface.co/tiiuae/falcon-7b) on French conversational data.**
|
48 |
|
49 |
Quantized versions in GGUF format can be found in [TheBloke/Claire-7B-0.1-GGUF](https://huggingface.co/TheBloke/Claire-7B-0.1-GGUF).
|
|
|
229 |
|
230 |
More evaluation details will be provided in a separate publication.
|
231 |
|
232 |
+
## Variants
|
233 |
+
|
234 |
+
Claire-7B-0.1 is finetuned only on French dialogue data, but the following variants are available to evaluate the impact of language mixture on dialogue understanding.
|
235 |
+
* [Claire-7B-FR-EN-25-75](OpenLLM-France/Claire-7B-FR-EN-25-75-0.1), with 25/75 French-English data split.
|
236 |
+
* [Claire-7B-FR-EN-50-50](OpenLLM-France/Claire-7B-FR-EN-50-50-0.1), with 50/50 French-English data split.
|
237 |
+
* [Claire-7B-FR-EN-75-25](OpenLLM-France/Claire-7B-FR-EN-75-25-0.1), with 75/25 French-English data split.
|
238 |
+
* [Claire-7B](OpenLLM-France/Claire-7B-EN-0.1), with only English data.
|
239 |
+
|
240 |
+
|
241 |
## License
|
242 |
|
243 |
Given that some of the corpora used for training are only available under CC-BY-NC-SA licenses,
|
|
|
245 |
|
246 |
You can find a variant of this model published under the Apache 2.0 license at [OpenLLM-France/Claire-7B-Apache-0.1](https://huggingface.co/OpenLLM-France/Claire-7B-Apache-0.1).
|
247 |
|
248 |
+
## Citation
|
249 |
+
|
250 |
+
When using the Claire family of models, please cite the following paper:
|
251 |
+
|
252 |
+
Jérôme Louradour, Julie Hunter, Ismaïl Harrando, Guokan Shang, Virgile Rennard & Jean-Pierre Lorré (2024). [Claire: Large Language Models for Spontaneous French Dialogue](https://aclanthology.org/2024.jeptalnrecital-taln.36.pdf). In _Actes de la 31ème Conférence sur le Traitement Automatique des Langues Naturelles, volume 1: articles longs et prises de position_ (pp. 530-548).
|
253 |
+
|
254 |
+
```bibtex
|
255 |
+
@inproceedings{louradour2024claire,
|
256 |
+
title={Claire: Large Language Models for Spontaneous French Dialogue},
|
257 |
+
author={Louradour, J{\'e}r{\^o}me and Hunter, Julie and Harrando, Isma{\"\i}l and Shang, Guokan and Rennard, Virgile and Lorr{\'e}, Jean-Pierre},
|
258 |
+
booktitle={Actes de la 31{\`e}me Conf{\'e}rence sur le Traitement Automatique des Langues Naturelles, volume 1: articles longs et prises de position},
|
259 |
+
pages={530--548},
|
260 |
+
year={2024}
|
261 |
+
}
|
262 |
+
```
|
263 |
+
|
264 |
## Acknowledgements
|
265 |
|
266 |
This work was performed using HPC resources from GENCI–IDRIS (Grant 2023-AD011014561).
|
267 |
|
268 |
+
Claire-7B-0.1 was created by members of [LINAGORA](https://labs.linagora.com/).
|
269 |
|
270 |
Special thanks to partners from the OpenLLM-France community, especially Christophe Cerisara (LORIA), Pierre-Carl Langlais and Anastasia Stasenko (OpSci), and Pierre Colombo, for valuable advice.
|
271 |
|
272 |
+
|
273 |
## Contact
|
274 |
|
275 |
contact@openllm-france.fr
|