willyninja30
committed on
Commit
•
e21852f
1
Parent(s):
4378cf0
Update README.md
README.md
CHANGED
@@ -16,6 +16,12 @@ We decided to build a V3 of Aria 7B based on Mistral instruct instead of LLAMA 2
## Base Model : Mistral-7B-Instruct-v0.1
+## Technical issues fixed and limits of the base model
+
+We noticed that the base model sometimes mixed French and English in its replies when the request was written in French. The issue was more visible for
+prompts over 1000 tokens. By training the base model on our dataset, we fixed this issue, so the model now replies in the same language as the question.
+This fix is a valuable upgrade for corporate users in non-English-speaking regions who want to deploy a model with higher quality and accuracy in French.
+
## Training procedure