athuldinesh committed on
Commit
def2a74
1 Parent(s): 2ca8b6e

Model save

README.md CHANGED
@@ -23,7 +23,7 @@ model-index:
  metrics:
  - name: Rouge1
  type: rouge
- value: 0.4109
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,12 +33,12 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on the samsum dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.8063
- - Rouge1: 0.4109
- - Rouge2: 0.1834
- - Rougel: 0.3429
- - Rougelsum: 0.343
- - Gen Len: 16.5562

  ## Model description

@@ -57,23 +57,24 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 2e-05
  - train_batch_size: 16
  - eval_batch_size: 16
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 4
  - mixed_precision_training: Native AMP

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
- | 2.2685 | 1.0 | 921 | 1.8752 | 0.3961 | 0.1698 | 0.3294 | 0.3297 | 16.2836 |
- | 2.0494 | 2.0 | 1842 | 1.8315 | 0.406 | 0.1807 | 0.3408 | 0.3413 | 16.3484 |
- | 2.0014 | 3.0 | 2763 | 1.8096 | 0.4078 | 0.1802 | 0.3407 | 0.3407 | 16.6381 |
- | 1.9817 | 4.0 | 3684 | 1.8063 | 0.4109 | 0.1834 | 0.3429 | 0.343 | 16.5562 |


  ### Framework versions
 
  metrics:
  - name: Rouge1
  type: rouge
+ value: 0.4282
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 

  This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on the samsum dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 1.7255
+ - Rouge1: 0.4282
+ - Rouge2: 0.2003
+ - Rougel: 0.36
+ - Rougelsum: 0.3596
+ - Gen Len: 16.7372

  ## Model description

 
  ### Training hyperparameters

  The following hyperparameters were used during training:
+ - learning_rate: 3e-05
  - train_batch_size: 16
  - eval_batch_size: 16
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - num_epochs: 5
  - mixed_precision_training: Native AMP

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+ | 1.9452 | 1.0 | 921 | 1.7726 | 0.4147 | 0.1901 | 0.3492 | 0.3493 | 16.4719 |
+ | 1.8952 | 2.0 | 1842 | 1.7498 | 0.4237 | 0.1971 | 0.3577 | 0.3577 | 16.4548 |
+ | 1.8703 | 3.0 | 2763 | 1.7323 | 0.4243 | 0.1968 | 0.3571 | 0.3566 | 16.7689 |
+ | 1.8579 | 4.0 | 3684 | 1.7310 | 0.4262 | 0.2012 | 0.3606 | 0.3604 | 16.7641 |
+ | 1.8525 | 5.0 | 4605 | 1.7255 | 0.4282 | 0.2003 | 0.36 | 0.3596 | 16.7372 |


  ### Framework versions
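
The README change in this commit can be sanity-checked with a little arithmetic. The sketch below uses only numbers copied from the diff (batch size, steps per epoch, epochs, learning rate, final metrics of both runs). It assumes a single device, no gradient accumulation, and zero warmup steps, none of which the model card states. The helper `linear_lr` is a hypothetical name that follows the usual shape of a linear schedule; it is not taken from the training script. Under those assumptions, 921 steps per epoch implies a training set of 14,721 to 14,736 examples, which would be consistent with a SAMSum training split of 14,732 dialogues (a figure not stated in this commit).

```python
import math

# Values copied from the new side of the diff.
TRAIN_BATCH_SIZE = 16
STEPS_PER_EPOCH = 921   # step count logged at epoch 1.0
NUM_EPOCHS = 5
BASE_LR = 3e-05

total_steps = STEPS_PER_EPOCH * NUM_EPOCHS

# Training-set sizes n for which ceil(n / 16) == 921.
# Assumption: one device, no gradient accumulation.
min_examples = (STEPS_PER_EPOCH - 1) * TRAIN_BATCH_SIZE + 1
max_examples = STEPS_PER_EPOCH * TRAIN_BATCH_SIZE


def linear_lr(step, base_lr=BASE_LR, total=total_steps, warmup=0):
    """Linear warmup followed by linear decay to zero.

    Assumption: warmup defaults to 0 because the card lists no warmup setting.
    """
    if step < warmup:
        return base_lr * step / max(1, warmup)
    return base_lr * max(0.0, (total - step) / max(1, total - warmup))


# Final evaluation metrics of the previous run (4 epochs, lr 2e-05)
# and of this commit's run (5 epochs, lr 3e-05), copied from the diff.
old = {"loss": 1.8063, "rouge1": 0.4109, "rouge2": 0.1834,
       "rougeL": 0.3429, "rougeLsum": 0.343, "gen_len": 16.5562}
new = {"loss": 1.7255, "rouge1": 0.4282, "rouge2": 0.2003,
       "rougeL": 0.36, "rougeLsum": 0.3596, "gen_len": 16.7372}

deltas = {name: round(new[name] - old[name], 4) for name in old}

if __name__ == "__main__":
    print("total optimizer steps:", total_steps)
    print("implied train set size:", min_examples, "to", max_examples)
    print("lr after epoch 1:", linear_lr(STEPS_PER_EPOCH))
    for name, delta in deltas.items():
        print(f"{name}: {delta:+.4f}")
```

The deltas compare two single runs with different learning rates and epoch counts, so they show the net effect of both changes together and cannot attribute the gain to either one.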
runs/Oct22_03-00-23_ac5d8fd220f7/events.out.tfevents.1729566029.ac5d8fd220f7.881.2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:741228a4d16ddbf7e34ff9137dc08b9c14dade9567314e41d259e3b69e0e9c46
- size 9930
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:dee1e7ece571137529c4040890ad401c5fa6f829fbc1507f7b6eb09c0c512ff0
+ size 10809
runs/Oct22_03-00-23_ac5d8fd220f7/events.out.tfevents.1729567730.ac5d8fd220f7.881.3 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:8d8ce0bb24a1ef81383967c4511637a3d7fbb56d198b63e33b300f82bdc880f8
+ size 613
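
The two `events.out.tfevents` entries are Git LFS pointer files, not the TensorBoard logs themselves: each pointer records a spec version, a content hash, and the byte size of the real file. As a minimal sketch of how those three fields can be read, the snippet below parses the pointer text added in this commit. The function name `parse_lfs_pointer` and the returned dictionary layout are illustrative choices, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer content copied from the file added in this commit.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:8d8ce0bb24a1ef81383967c4511637a3d7fbb56d198b63e33b300f82bdc880f8
size 613
"""

info = parse_lfs_pointer(pointer)

if __name__ == "__main__":
    print(info["hash_algo"], info["size"], "bytes")
```

Read the same way, the changed pointer shows the first event file growing from 9930 to 10809 bytes.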