projecte-aina
/

Plume32k

text-generation

text-generation-inference

Model card Files Files and versions Community

javi8979 commited on Jun 6

Commit

03066f8

•

1 Parent(s): d29ff40

Update README.md

Files changed (1) hide show

README.md +17 -0

README.md CHANGED Viewed

@@ -70,4 +70,21 @@ input_ids = tokenizer(prompt, return_tensors='pt').input_ids
 output_ids = model.generate( input_ids, max_length=200, num_beams=5 )
 input_length = input_ids.shape[1]
 generated_text = tokenizer.decode(output_ids[0, input_length: ], skip_special_tokens=True).strip()
 ```

 output_ids = model.generate( input_ids, max_length=200, num_beams=5 )
 input_length = input_ids.shape[1]
 generated_text = tokenizer.decode(output_ids[0, input_length: ], skip_special_tokens=True).strip()
+# Ahir se'n va anar, va agafar les seves coses i es va posar a navegar.
 ```
+## Training
+Training details are specified in the [paper](). Code for training the model and running other experiments can be found in Plume [repo](https://github.com/projecte-aina/Plume).
+## Evaluation
+| Model                | FLORES BLEU | FLORES COMET | NTREX BLEU | NTREX COMET |
+|----------------------|-------------|--------------|------------|-------------|
+| NLLB-1.3B            | 31.02       | 0.86         | 29.68      | 0.85        |
+| NLLB-600M            | 29.24       | 0.85         | 28.37      | 0.84        |
+| Bilinguals BSC       | 31.93       | 0.86         | 29.77      | 0.84        |
+|----------------------|-------------|--------------|------------|-------------|
+| Parlam 32k           | 30.44       | 0.86         | 28.46      | 0.84        |
+| Parlam 128k          | 30.81       | 0.86         | 28.78      | 0.84        |
+| Parlam 256k          | 30.72       | 0.86         | 28.87      | 0.84        |