zakerous commited on
Commit
f62957f
1 Parent(s): 9c7be2a

Training complete

Browse files
README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
22
  metrics:
23
  - name: Rouge1
24
  type: rouge
25
- value: 39.4817
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,11 +32,11 @@ should probably proofread and complete it, then remove this comment. -->
32
 
33
  This model is a fine-tuned version of [google/pegasus-x-large](https://huggingface.co/google/pegasus-x-large) on the samsum dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 1.6845
36
- - Rouge1: 39.4817
37
- - Rouge2: 17.2378
38
- - Rougel: 33.2558
39
- - Rougelsum: 35.8353
40
 
41
  ## Model description
42
 
@@ -56,19 +56,28 @@ More information needed
56
 
57
  The following hyperparameters were used during training:
58
  - learning_rate: 5.6e-05
59
- - train_batch_size: 8
60
- - eval_batch_size: 8
61
  - seed: 42
62
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
63
  - lr_scheduler_type: linear
64
- - num_epochs: 1
65
  - mixed_precision_training: Native AMP
66
 
67
  ### Training results
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
70
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
71
- | 1.9981 | 1.0 | 125 | 1.6845 | 39.4817 | 17.2378 | 33.2558 | 35.8353 |
 
 
 
 
 
 
 
 
 
72
 
73
 
74
  ### Framework versions
 
22
  metrics:
23
  - name: Rouge1
24
  type: rouge
25
+ value: 46.6996
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
32
 
33
  This model is a fine-tuned version of [google/pegasus-x-large](https://huggingface.co/google/pegasus-x-large) on the samsum dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 1.4802
36
+ - Rouge1: 46.6996
37
+ - Rouge2: 21.5586
38
+ - Rougel: 38.1002
39
+ - Rougelsum: 41.42
40
 
41
  ## Model description
42
 
 
56
 
57
  The following hyperparameters were used during training:
58
  - learning_rate: 5.6e-05
59
+ - train_batch_size: 2
60
+ - eval_batch_size: 1
61
  - seed: 42
62
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
63
  - lr_scheduler_type: linear
64
+ - num_epochs: 10
65
  - mixed_precision_training: Native AMP
66
 
67
  ### Training results
68
 
69
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
70
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|
71
+ | 1.7681 | 1.0 | 500 | 1.4689 | 47.1766 | 21.8869 | 38.8854 | 42.9534 |
72
+ | 1.4626 | 2.0 | 1000 | 1.4781 | 46.6978 | 20.786 | 37.764 | 41.2028 |
73
+ | 1.3591 | 3.0 | 1500 | 1.4804 | 47.1756 | 21.8821 | 38.2072 | 41.6812 |
74
+ | 1.3466 | 4.0 | 2000 | 1.4804 | 46.9411 | 21.5169 | 38.18 | 41.471 |
75
+ | 1.3464 | 5.0 | 2500 | 1.4803 | 46.8083 | 21.5333 | 38.1539 | 41.4872 |
76
+ | 1.3353 | 6.0 | 3000 | 1.4804 | 46.6675 | 21.1336 | 37.7059 | 41.0869 |
77
+ | 1.3483 | 7.0 | 3500 | 1.4803 | 46.6768 | 21.1916 | 37.7642 | 41.1696 |
78
+ | 1.3536 | 8.0 | 4000 | 1.4804 | 46.7311 | 21.5169 | 38.057 | 41.42 |
79
+ | 1.3533 | 9.0 | 4500 | 1.4802 | 46.6403 | 21.529 | 37.9922 | 41.3437 |
80
+ | 1.3469 | 10.0 | 5000 | 1.4802 | 46.6996 | 21.5586 | 38.1002 | 41.42 |
81
 
82
 
83
  ### Framework versions
runs/Jan28_18-14-18_5b1046ff91c1/events.out.tfevents.1706465658.5b1046ff91c1.4510.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2151087252f3e24738ed3871f16ac2fc044469587ecd74454e2804327dca0b9b
3
- size 11240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1bfc1aeec1f991aa98a719321efbb0da6b5bdc1777b3e40d72eede721af23194
3
+ size 12068