akshat3492
/

mT5

text2text-generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

akshat3492 commited on Sep 6, 2023

Commit

0107ebf

•

1 Parent(s): 9557ec9

Training complete

Files changed (2) hide show

README.md +22 -13
pytorch_model.bin +1 -1

README.md CHANGED Viewed

@@ -4,8 +4,8 @@ base_model: google/mt5-small
 tags:
 - summarization
 - generated_from_trainer
-datasets:
-- cnn_dailymail
 model-index:
 - name: mT5
   results: []
@@ -16,18 +16,13 @@ should probably proofread and complete it, then remove this comment. -->
 # mT5
-This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the cnn_dailymail dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 2.8471
-- eval_rouge1: 17.5064
-- eval_rouge2: 6.3976
-- eval_rougeL: 15.6341
-- eval_rougeLsum: 16.5177
-- eval_runtime: 80.7804
-- eval_samples_per_second: 0.619
-- eval_steps_per_second: 0.087
-- epoch: 3.0
-- step: 939
 ## Model description
@@ -54,6 +49,20 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 8
 ### Framework versions
 - Transformers 4.32.1

 tags:
 - summarization
 - generated_from_trainer
+metrics:
+- rouge
 model-index:
 - name: mT5
   results: []
 # mT5
+This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.7797
+- Rouge1: 17.5958
+- Rouge2: 5.5502
+- Rougel: 14.89
+- Rougelsum: 15.8861
 ## Model description
 - lr_scheduler_type: linear
 - num_epochs: 8
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|
+| 6.7587        | 1.0   | 313  | 3.0537          | 15.5845 | 4.426  | 12.7262 | 13.9385   |
+| 3.6224        | 2.0   | 626  | 2.8799          | 16.4339 | 4.8534 | 13.3138 | 14.9449   |
+| 3.3322        | 3.0   | 939  | 2.8378          | 18.1043 | 6.2202 | 15.376  | 16.5012   |
+| 3.1974        | 4.0   | 1252 | 2.8008          | 17.8905 | 5.7529 | 15.0379 | 16.3205   |
+| 3.1183        | 5.0   | 1565 | 2.7936          | 17.7318 | 5.4565 | 14.8508 | 15.9979   |
+| 3.0522        | 6.0   | 1878 | 2.7824          | 17.6328 | 5.5352 | 14.7803 | 15.8202   |
+| 3.019         | 7.0   | 2191 | 2.7846          | 17.7348 | 5.4391 | 14.7499 | 15.8859   |
+| 2.9889        | 8.0   | 2504 | 2.7797          | 17.5958 | 5.5502 | 14.89   | 15.8861   |
 ### Framework versions
 - Transformers 4.32.1

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:50a23d39adbea940addcecccbf4c13ceb101fbe1ccefc9ad9ec27e6886e72644
 size 1200769925

 version https://git-lfs.github.com/spec/v1
+oid sha256:3faf35e22d606c93f54a0c93543d9df15c99ca10c1485234b779d6e5911ddde7
 size 1200769925