Update README.md
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ This model aims to bridge the gap in **open-source English to Punjabi translatio
|
|
25 |
- **Training Data**: 10 million English<>Punjabi parallel sentences from [AI4Bharat's Bharat Parallel Corpus Collection (BPCC)](https://github.com/AI4Bharat/IndicTrans2).
|
26 |
- **Evaluation Data**: The model has been evaluated on **1503 samples** from the **IN22-Conv dataset**, which is also available via [IndicTrans2](https://github.com/AI4Bharat/IndicTrans2).
|
27 |
- **Model Architecture**: Based on **LLaMA 3.1 8B** with BF16 precision.
|
28 |
-
- **Score (chrF++)**: Achieved a **chrF++ score of 28.1** on the IN22-Conv dataset, which is an excellent score for an open-source model.
|
29 |
|
30 |
This is the **first release** of the model, and future updates aim to improve the chrF++ score for enhanced translation quality.
|
31 |
|
@@ -133,7 +133,6 @@ Stay tuned for updates, and feel free to contribute or raise issues on Hugging F
|
|
133 |
|
134 |
- **Training Data**: [Bharat Parallel Corpus Collection (BPCC)](https://github.com/AI4Bharat/IndicTrans2) by AI4Bharat.
|
135 |
- **Evaluation Data**: [IN22-Conv dataset](https://github.com/AI4Bharat/IndicTrans2).
|
136 |
-
- **Benchmarks**: [Translation Benchmarks Paper](https://arxiv.org/pdf/2305.16307).
|
137 |
|
138 |
## Contributors
|
139 |
|
|
|
25 |
- **Training Data**: 10 million English<>Punjabi parallel sentences from [AI4Bharat's Bharat Parallel Corpus Collection (BPCC)](https://github.com/AI4Bharat/IndicTrans2).
|
26 |
- **Evaluation Data**: The model has been evaluated on **1503 samples** from the **IN22-Conv dataset**, which is also available via [IndicTrans2](https://github.com/AI4Bharat/IndicTrans2).
|
27 |
- **Model Architecture**: Based on **LLaMA 3.1 8B** with BF16 precision.
|
28 |
+
- **Score (chrF++)**: Achieved a **chrF++ score of 28.1** on the IN22-Conv dataset, which is an excellent score for an open-source model.
|
29 |
|
30 |
This is the **first release** of the model, and future updates aim to improve the chrF++ score for enhanced translation quality.
|
31 |
|
|
|
133 |
|
134 |
- **Training Data**: [Bharat Parallel Corpus Collection (BPCC)](https://github.com/AI4Bharat/IndicTrans2) by AI4Bharat.
|
135 |
- **Evaluation Data**: [IN22-Conv dataset](https://github.com/AI4Bharat/IndicTrans2).
|
|
|
136 |
|
137 |
## Contributors
|
138 |
|