Update README.md
Browse files
README.md
CHANGED
@@ -70,4 +70,21 @@ input_ids = tokenizer(prompt, return_tensors='pt').input_ids
|
|
70 |
output_ids = model.generate( input_ids, max_length=200, num_beams=5 )
|
71 |
input_length = input_ids.shape[1]
|
72 |
generated_text = tokenizer.decode(output_ids[0, input_length: ], skip_special_tokens=True).strip()
|
|
|
73 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
70 |
output_ids = model.generate( input_ids, max_length=200, num_beams=5 )
|
71 |
input_length = input_ids.shape[1]
|
72 |
generated_text = tokenizer.decode(output_ids[0, input_length: ], skip_special_tokens=True).strip()
|
73 |
+
# Ahir se'n va anar, va agafar les seves coses i es va posar a navegar.
|
74 |
```
|
75 |
+
|
76 |
+
## Training
|
77 |
+
|
78 |
+
Training details are specified in the [paper](). Code for training the model and running other experiments can be found in Plume [repo](https://github.com/projecte-aina/Plume).
|
79 |
+
|
80 |
+
## Evaluation
|
81 |
+
|
82 |
+
| Model | FLORES BLEU | FLORES COMET | NTREX BLEU | NTREX COMET |
|
83 |
+
|----------------------|-------------|--------------|------------|-------------|
|
84 |
+
| NLLB-1.3B | 31.02 | 0.86 | 29.68 | 0.85 |
|
85 |
+
| NLLB-600M | 29.24 | 0.85 | 28.37 | 0.84 |
|
86 |
+
| Bilinguals BSC | 31.93 | 0.86 | 29.77 | 0.84 |
|
87 |
+
|----------------------|-------------|--------------|------------|-------------|
|
88 |
+
| Parlam 32k | 30.44 | 0.86 | 28.46 | 0.84 |
|
89 |
+
| Parlam 128k | 30.81 | 0.86 | 28.78 | 0.84 |
|
90 |
+
| Parlam 256k | 30.72 | 0.86 | 28.87 | 0.84 |
|