Commit 638b786 by ShikhaTiwari
1 parent: d92a3bd

End of training

README.md CHANGED
@@ -23,7 +23,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 39.6659
+      value: 41.4993
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,12 +33,12 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the samsum dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.8586
-- Rouge1: 39.6659
-- Rouge2: 17.1753
-- Rougel: 33.2244
-- Rougelsum: 36.8446
-- Gen Len: 16.1577
+- Loss: 1.7347
+- Rouge1: 41.4993
+- Rouge2: 18.6768
+- Rougel: 34.8901
+- Rougelsum: 38.434
+- Gen Len: 16.533
 
 ## Model description
 
@@ -63,14 +63,17 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
+- num_epochs: 4
 - mixed_precision_training: Native AMP
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
-| 2.0613 | 1.0 | 3683 | 1.8586 | 39.6659 | 17.1753 | 33.2244 | 36.8446 | 16.1577 |
+| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| 1.9691 | 1.0 | 3683 | 1.7864 | 40.9391 | 18.2339 | 34.4773 | 38.0777 | 16.7017 |
+| 1.9213 | 2.0 | 7366 | 1.7533 | 41.4658 | 18.8086 | 34.8947 | 38.3528 | 16.5208 |
+| 1.8757 | 3.0 | 11049 | 1.7367 | 41.641 | 18.7355 | 35.0432 | 38.5403 | 16.544 |
+| 1.8484 | 4.0 | 14732 | 1.7347 | 41.4993 | 18.6768 | 34.8901 | 38.434 | 16.533 |
 
 
 ### Framework versions
runs/Oct21_20-28-05_1182f181c872/events.out.tfevents.1729542493.1182f181c872.1238.1 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:10739917c459eae80bcf7d00cd0dc348d36a80006d1663f0591ab3a8af0d650b
-size 13658
+oid sha256:3a5d0a2a631d26d6de302851d64792bb22db72b3e4ded9a7263e76b566aa0f9d
+size 14537
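
The updated "Training results" table in the README.md diff reports four epochs, and the summary metrics at the top of the card match the final row (epoch 4), which has the lowest validation loss but not the highest ROUGE scores. The sketch below is one way to check that from the table alone. The values are copied from the new table rows; `EpochResult`, `best_epoch`, and `steps_per_epoch` are hypothetical helper names introduced for illustration and are not part of the commit or of any library.

```python
from collections import namedtuple

# One row per epoch, copied from the new "Training results" table in the diff.
EpochResult = namedtuple(
    "EpochResult",
    "epoch step val_loss rouge1 rouge2 rougel rougelsum",
)

ROWS = [
    EpochResult(1, 3683, 1.7864, 40.9391, 18.2339, 34.4773, 38.0777),
    EpochResult(2, 7366, 1.7533, 41.4658, 18.8086, 34.8947, 38.3528),
    EpochResult(3, 11049, 1.7367, 41.6410, 18.7355, 35.0432, 38.5403),
    EpochResult(4, 14732, 1.7347, 41.4993, 18.6768, 34.8901, 38.4340),
]


def best_epoch(rows, metric, higher_is_better=True):
    """Return the epoch number whose `metric` value is best."""
    pick = max if higher_is_better else min
    return pick(rows, key=lambda row: getattr(row, metric)).epoch


def steps_per_epoch(rows):
    """Return the step increment between epochs, or None if it varies."""
    steps = [0] + [row.step for row in rows]
    deltas = {later - earlier for earlier, later in zip(steps, steps[1:])}
    return deltas.pop() if len(deltas) == 1 else None


if __name__ == "__main__":
    print("lowest validation loss: epoch", best_epoch(ROWS, "val_loss", higher_is_better=False))
    for metric in ("rouge1", "rouge2", "rougel", "rougelsum"):
        print(f"highest {metric}: epoch", best_epoch(ROWS, metric))
    print("optimizer steps per epoch:", steps_per_epoch(ROWS))
```

If ROUGE rather than loss is the selection target, the `load_best_model_at_end` and `metric_for_best_model` training arguments in Transformers are the usual place to express that. Whether this run set them is not shown in the diff.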