manahil1 commited on
Commit
90b385f
1 Parent(s): 0f48751

End of training

Browse files
Files changed (2) hide show
  1. README.md +102 -102
  2. pytorch_model.bin +1 -1
README.md CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.2859
21
  - Bleu: 0.0
22
- - Gen Len: 0.0
23
 
24
  ## Model description
25
 
@@ -50,106 +50,106 @@ The following hyperparameters were used during training:
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
52
  |:-------------:|:-----:|:----:|:---------------:|:----:|:-------:|
53
- | No log | 1.0 | 3 | 10.4721 | 0.0 | 19.0 |
54
- | 11.4017 | 2.0 | 6 | 9.4761 | 0.0 | 19.0 |
55
- | 8.6118 | 3.0 | 9 | 8.6439 | 0.0 | 19.0 |
56
- | 7.0732 | 4.0 | 12 | 7.8504 | 0.0 | 19.0 |
57
- | 7.0732 | 5.0 | 15 | 7.0418 | 0.0 | 19.0 |
58
- | 9.4499 | 6.0 | 18 | 6.4173 | 0.0 | 19.0 |
59
- | 6.047 | 7.0 | 21 | 5.8917 | 0.0 | 19.0 |
60
- | 8.5079 | 8.0 | 24 | 5.4822 | 0.0 | 19.0 |
61
- | 8.5079 | 9.0 | 27 | 5.1167 | 0.0 | 19.0 |
62
- | 5.3118 | 10.0 | 30 | 4.7820 | 0.0 | 19.0 |
63
- | 5.1047 | 11.0 | 33 | 4.4711 | 0.0 | 19.0 |
64
- | 4.2976 | 12.0 | 36 | 4.2040 | 0.0 | 19.0 |
65
- | 4.2976 | 13.0 | 39 | 3.9466 | 0.0 | 19.0 |
66
- | 4.3815 | 14.0 | 42 | 3.6867 | 0.0 | 19.0 |
67
- | 4.1765 | 15.0 | 45 | 3.3908 | 0.0 | 19.0 |
68
- | 3.6726 | 16.0 | 48 | 3.1037 | 0.0 | 19.0 |
69
- | 3.6726 | 17.0 | 51 | 2.8341 | 0.0 | 19.0 |
70
- | 3.5088 | 18.0 | 54 | 2.5814 | 0.0 | 19.0 |
71
- | 3.4025 | 19.0 | 57 | 2.3402 | 0.0 | 19.0 |
72
- | 3.4804 | 20.0 | 60 | 2.1320 | 0.0 | 19.0 |
73
- | 3.4804 | 21.0 | 63 | 1.9531 | 0.0 | 19.0 |
74
- | 2.9683 | 22.0 | 66 | 1.8089 | 0.0 | 14.7778 |
75
- | 2.7596 | 23.0 | 69 | 1.7039 | 0.0 | 0.0 |
76
- | 2.9402 | 24.0 | 72 | 1.6415 | 0.0 | 0.0 |
77
- | 2.9402 | 25.0 | 75 | 1.6086 | 0.0 | 0.0 |
78
- | 3.0547 | 26.0 | 78 | 1.5912 | 0.0 | 0.0 |
79
- | 2.3308 | 27.0 | 81 | 1.5822 | 0.0 | 0.0 |
80
- | 2.3394 | 28.0 | 84 | 1.5777 | 0.0 | 0.0 |
81
- | 2.3394 | 29.0 | 87 | 1.5747 | 0.0 | 0.0 |
82
- | 2.6954 | 30.0 | 90 | 1.5695 | 0.0 | 0.0 |
83
- | 2.2629 | 31.0 | 93 | 1.5636 | 0.0 | 0.0 |
84
- | 2.4494 | 32.0 | 96 | 1.5568 | 0.0 | 0.0 |
85
- | 2.4494 | 33.0 | 99 | 1.5503 | 0.0 | 0.0 |
86
- | 2.2914 | 34.0 | 102 | 1.5422 | 0.0 | 0.0 |
87
- | 2.1202 | 35.0 | 105 | 1.5332 | 0.0 | 0.0 |
88
- | 2.2631 | 36.0 | 108 | 1.5228 | 0.0 | 0.0 |
89
- | 2.2631 | 37.0 | 111 | 1.5123 | 0.0 | 0.0 |
90
- | 2.0139 | 38.0 | 114 | 1.5024 | 0.0 | 0.0 |
91
- | 2.2812 | 39.0 | 117 | 1.4926 | 0.0 | 0.0 |
92
- | 1.798 | 40.0 | 120 | 1.4836 | 0.0 | 0.0 |
93
- | 1.798 | 41.0 | 123 | 1.4740 | 0.0 | 0.0 |
94
- | 1.8001 | 42.0 | 126 | 1.4656 | 0.0 | 0.0 |
95
- | 2.2109 | 43.0 | 129 | 1.4577 | 0.0 | 0.0 |
96
- | 1.6209 | 44.0 | 132 | 1.4502 | 0.0 | 0.0 |
97
- | 1.6209 | 45.0 | 135 | 1.4439 | 0.0 | 0.0 |
98
- | 2.106 | 46.0 | 138 | 1.4373 | 0.0 | 0.0 |
99
- | 2.0342 | 47.0 | 141 | 1.4307 | 0.0 | 0.0 |
100
- | 1.9099 | 48.0 | 144 | 1.4238 | 0.0 | 0.0 |
101
- | 1.9099 | 49.0 | 147 | 1.4172 | 0.0 | 0.0 |
102
- | 2.0013 | 50.0 | 150 | 1.4105 | 0.0 | 0.0 |
103
- | 1.5806 | 51.0 | 153 | 1.4036 | 0.0 | 0.0 |
104
- | 1.9924 | 52.0 | 156 | 1.3963 | 0.0 | 0.0 |
105
- | 1.9924 | 53.0 | 159 | 1.3907 | 0.0 | 0.0 |
106
- | 2.1897 | 54.0 | 162 | 1.3848 | 0.0 | 0.0 |
107
- | 1.6961 | 55.0 | 165 | 1.3792 | 0.0 | 0.0 |
108
- | 1.7686 | 56.0 | 168 | 1.3740 | 0.0 | 0.0 |
109
- | 1.7686 | 57.0 | 171 | 1.3693 | 0.0 | 0.0 |
110
- | 2.0588 | 58.0 | 174 | 1.3643 | 0.0 | 0.0 |
111
- | 1.9657 | 59.0 | 177 | 1.3596 | 0.0 | 0.0 |
112
- | 1.705 | 60.0 | 180 | 1.3556 | 0.0 | 0.0 |
113
- | 1.705 | 61.0 | 183 | 1.3520 | 0.0 | 0.0 |
114
- | 1.7669 | 62.0 | 186 | 1.3486 | 0.0 | 0.0 |
115
- | 1.9735 | 63.0 | 189 | 1.3448 | 0.0 | 0.0 |
116
- | 2.1708 | 64.0 | 192 | 1.3403 | 0.0 | 0.0 |
117
- | 2.1708 | 65.0 | 195 | 1.3365 | 0.0 | 0.0 |
118
- | 1.974 | 66.0 | 198 | 1.3338 | 0.0 | 0.0 |
119
- | 1.8153 | 67.0 | 201 | 1.3313 | 0.0 | 0.0 |
120
- | 2.4112 | 68.0 | 204 | 1.3289 | 0.0 | 0.0 |
121
- | 2.4112 | 69.0 | 207 | 1.3271 | 0.0 | 0.0 |
122
- | 1.4735 | 70.0 | 210 | 1.3244 | 0.0 | 0.0 |
123
- | 1.4407 | 71.0 | 213 | 1.3222 | 0.0 | 0.0 |
124
- | 2.1837 | 72.0 | 216 | 1.3200 | 0.0 | 0.0 |
125
- | 2.1837 | 73.0 | 219 | 1.3179 | 0.0 | 0.0 |
126
- | 1.8413 | 74.0 | 222 | 1.3161 | 0.0 | 0.0 |
127
- | 1.8677 | 75.0 | 225 | 1.3141 | 0.0 | 0.0 |
128
- | 1.9011 | 76.0 | 228 | 1.3119 | 0.0 | 0.0 |
129
- | 1.9011 | 77.0 | 231 | 1.3101 | 0.0 | 0.0 |
130
- | 1.5412 | 78.0 | 234 | 1.3082 | 0.0 | 0.0 |
131
- | 1.844 | 79.0 | 237 | 1.3064 | 0.0 | 0.0 |
132
- | 1.8727 | 80.0 | 240 | 1.3044 | 0.0 | 0.0 |
133
- | 1.8727 | 81.0 | 243 | 1.3026 | 0.0 | 0.0 |
134
- | 1.8597 | 82.0 | 246 | 1.3007 | 0.0 | 0.0 |
135
- | 1.889 | 83.0 | 249 | 1.2988 | 0.0 | 0.0 |
136
- | 1.5874 | 84.0 | 252 | 1.2974 | 0.0 | 0.0 |
137
- | 1.5874 | 85.0 | 255 | 1.2960 | 0.0 | 0.0 |
138
- | 1.7652 | 86.0 | 258 | 1.2945 | 0.0 | 0.0 |
139
- | 1.471 | 87.0 | 261 | 1.2933 | 0.0 | 0.0 |
140
- | 1.8824 | 88.0 | 264 | 1.2924 | 0.0 | 0.0 |
141
- | 1.8824 | 89.0 | 267 | 1.2913 | 0.0 | 0.0 |
142
- | 1.8729 | 90.0 | 270 | 1.2902 | 0.0 | 0.0 |
143
- | 1.409 | 91.0 | 273 | 1.2894 | 0.0 | 0.0 |
144
- | 1.5123 | 92.0 | 276 | 1.2889 | 0.0 | 0.0 |
145
- | 1.5123 | 93.0 | 279 | 1.2882 | 0.0 | 0.0 |
146
- | 1.3037 | 94.0 | 282 | 1.2876 | 0.0 | 0.0 |
147
- | 1.8444 | 95.0 | 285 | 1.2871 | 0.0 | 0.0 |
148
- | 1.3042 | 96.0 | 288 | 1.2866 | 0.0 | 0.0 |
149
- | 1.3042 | 97.0 | 291 | 1.2863 | 0.0 | 0.0 |
150
- | 1.5027 | 98.0 | 294 | 1.2861 | 0.0 | 0.0 |
151
- | 2.0811 | 99.0 | 297 | 1.2860 | 0.0 | 0.0 |
152
- | 1.4372 | 100.0 | 300 | 1.2859 | 0.0 | 0.0 |
153
 
154
 
155
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.9156
21
  - Bleu: 0.0
22
+ - Gen Len: 19.0
23
 
24
  ## Model description
25
 
 
50
 
51
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
52
  |:-------------:|:-----:|:----:|:---------------:|:----:|:-------:|
53
+ | No log | 1.0 | 8 | 6.6371 | 0.0 | 19.0 |
54
+ | 6.8242 | 2.0 | 16 | 5.2780 | 0.0 | 19.0 |
55
+ | 6.8242 | 3.0 | 24 | 4.4851 | 0.0 | 19.0 |
56
+ | 5.345 | 4.0 | 32 | 3.7656 | 0.0 | 19.0 |
57
+ | 5.345 | 5.0 | 40 | 3.0462 | 0.0 | 19.0 |
58
+ | 4.0321 | 6.0 | 48 | 2.4729 | 0.0 | 19.0 |
59
+ | 3.2425 | 7.0 | 56 | 2.1585 | 0.0 | 11.7931 |
60
+ | 3.2425 | 8.0 | 64 | 2.0606 | 0.0 | 0.0 |
61
+ | 2.8344 | 9.0 | 72 | 2.0090 | 0.0 | 0.0 |
62
+ | 2.8344 | 10.0 | 80 | 1.9443 | 0.0 | 0.0 |
63
+ | 2.6721 | 11.0 | 88 | 1.8702 | 0.0 | 0.0 |
64
+ | 2.6721 | 12.0 | 96 | 1.8071 | 0.0 | 0.0 |
65
+ | 2.5019 | 13.0 | 104 | 1.7541 | 0.0 | 0.0 |
66
+ | 2.3339 | 14.0 | 112 | 1.7014 | 0.0 | 0.0 |
67
+ | 2.3339 | 15.0 | 120 | 1.6502 | 0.0 | 0.0 |
68
+ | 2.2227 | 16.0 | 128 | 1.6094 | 0.0 | 0.0 |
69
+ | 2.2227 | 17.0 | 136 | 1.5746 | 0.0 | 0.0 |
70
+ | 2.1738 | 18.0 | 144 | 1.5353 | 0.0 | 0.0 |
71
+ | 2.1738 | 19.0 | 152 | 1.5066 | 0.0 | 0.0 |
72
+ | 2.054 | 20.0 | 160 | 1.4870 | 0.0 | 0.0 |
73
+ | 1.9707 | 21.0 | 168 | 1.4581 | 0.0 | 0.0 |
74
+ | 1.9707 | 22.0 | 176 | 1.4359 | 0.0 | 0.0 |
75
+ | 1.96 | 23.0 | 184 | 1.4032 | 0.0 | 0.0 |
76
+ | 1.96 | 24.0 | 192 | 1.3737 | 0.0 | 0.0 |
77
+ | 1.7402 | 25.0 | 200 | 1.3482 | 0.0 | 0.0 |
78
+ | 1.7402 | 26.0 | 208 | 1.3257 | 0.0 | 0.0 |
79
+ | 1.7044 | 27.0 | 216 | 1.3047 | 0.0 | 0.0 |
80
+ | 1.751 | 28.0 | 224 | 1.2861 | 0.0 | 0.0 |
81
+ | 1.751 | 29.0 | 232 | 1.2644 | 0.0 | 0.0 |
82
+ | 1.6414 | 30.0 | 240 | 1.2353 | 0.0 | 0.0 |
83
+ | 1.6414 | 31.0 | 248 | 1.2160 | 0.0 | 0.0 |
84
+ | 1.6418 | 32.0 | 256 | 1.1991 | 0.0 | 0.0 |
85
+ | 1.6418 | 33.0 | 264 | 1.1937 | 0.0 | 0.0 |
86
+ | 1.6258 | 34.0 | 272 | 1.1762 | 0.0 | 0.0 |
87
+ | 1.6102 | 35.0 | 280 | 1.1632 | 0.0 | 0.0 |
88
+ | 1.6102 | 36.0 | 288 | 1.1498 | 0.0 | 0.0 |
89
+ | 1.5266 | 37.0 | 296 | 1.1361 | 0.0 | 0.0 |
90
+ | 1.5266 | 38.0 | 304 | 1.1205 | 0.0 | 10.4828 |
91
+ | 1.5756 | 39.0 | 312 | 1.1108 | 0.0 | 10.4828 |
92
+ | 1.5756 | 40.0 | 320 | 1.1028 | 0.0 | 10.4828 |
93
+ | 1.5136 | 41.0 | 328 | 1.0937 | 0.0 | 10.4828 |
94
+ | 1.529 | 42.0 | 336 | 1.0837 | 0.0 | 10.4828 |
95
+ | 1.529 | 43.0 | 344 | 1.0714 | 0.0 | 11.7931 |
96
+ | 1.4738 | 44.0 | 352 | 1.0599 | 0.0 | 13.1034 |
97
+ | 1.4738 | 45.0 | 360 | 1.0514 | 0.0 | 13.1034 |
98
+ | 1.4521 | 46.0 | 368 | 1.0467 | 0.0 | 13.1034 |
99
+ | 1.4521 | 47.0 | 376 | 1.0438 | 0.0 | 13.1034 |
100
+ | 1.4758 | 48.0 | 384 | 1.0358 | 0.0 | 13.1034 |
101
+ | 1.4698 | 49.0 | 392 | 1.0264 | 0.0 | 13.1034 |
102
+ | 1.4698 | 50.0 | 400 | 1.0205 | 0.0 | 17.6897 |
103
+ | 1.3355 | 51.0 | 408 | 1.0159 | 0.0 | 18.3448 |
104
+ | 1.3355 | 52.0 | 416 | 1.0087 | 0.0 | 19.0 |
105
+ | 1.36 | 53.0 | 424 | 1.0040 | 0.0 | 19.0 |
106
+ | 1.36 | 54.0 | 432 | 1.0005 | 0.0 | 19.0 |
107
+ | 1.3025 | 55.0 | 440 | 0.9955 | 0.0 | 19.0 |
108
+ | 1.2773 | 56.0 | 448 | 0.9910 | 0.0 | 19.0 |
109
+ | 1.2773 | 57.0 | 456 | 0.9873 | 0.0 | 19.0 |
110
+ | 1.3006 | 58.0 | 464 | 0.9840 | 0.0 | 19.0 |
111
+ | 1.3006 | 59.0 | 472 | 0.9826 | 0.0 | 19.0 |
112
+ | 1.3037 | 60.0 | 480 | 0.9813 | 0.0 | 19.0 |
113
+ | 1.3037 | 61.0 | 488 | 0.9765 | 0.0 | 19.0 |
114
+ | 1.3133 | 62.0 | 496 | 0.9717 | 0.0 | 19.0 |
115
+ | 1.2601 | 63.0 | 504 | 0.9671 | 0.0 | 19.0 |
116
+ | 1.2601 | 64.0 | 512 | 0.9637 | 0.0 | 19.0 |
117
+ | 1.2442 | 65.0 | 520 | 0.9610 | 0.0 | 19.0 |
118
+ | 1.2442 | 66.0 | 528 | 0.9585 | 0.0 | 19.0 |
119
+ | 1.2394 | 67.0 | 536 | 0.9568 | 0.0 | 19.0 |
120
+ | 1.2394 | 68.0 | 544 | 0.9546 | 0.0 | 19.0 |
121
+ | 1.2746 | 69.0 | 552 | 0.9509 | 0.0 | 19.0 |
122
+ | 1.233 | 70.0 | 560 | 0.9478 | 0.0 | 19.0 |
123
+ | 1.233 | 71.0 | 568 | 0.9452 | 0.0 | 19.0 |
124
+ | 1.2382 | 72.0 | 576 | 0.9424 | 0.0 | 19.0 |
125
+ | 1.2382 | 73.0 | 584 | 0.9400 | 0.0 | 19.0 |
126
+ | 1.2603 | 74.0 | 592 | 0.9379 | 0.0 | 19.0 |
127
+ | 1.2603 | 75.0 | 600 | 0.9357 | 0.0 | 19.0 |
128
+ | 1.2028 | 76.0 | 608 | 0.9338 | 0.0 | 19.0 |
129
+ | 1.2755 | 77.0 | 616 | 0.9330 | 0.0 | 19.0 |
130
+ | 1.2755 | 78.0 | 624 | 0.9316 | 0.0 | 19.0 |
131
+ | 1.244 | 79.0 | 632 | 0.9303 | 0.0 | 19.0 |
132
+ | 1.244 | 80.0 | 640 | 0.9291 | 0.0 | 19.0 |
133
+ | 1.115 | 81.0 | 648 | 0.9281 | 0.0 | 19.0 |
134
+ | 1.115 | 82.0 | 656 | 0.9272 | 0.0 | 19.0 |
135
+ | 1.2373 | 83.0 | 664 | 0.9258 | 0.0 | 19.0 |
136
+ | 1.2035 | 84.0 | 672 | 0.9243 | 0.0 | 19.0 |
137
+ | 1.2035 | 85.0 | 680 | 0.9231 | 0.0 | 19.0 |
138
+ | 1.1881 | 86.0 | 688 | 0.9216 | 0.0 | 19.0 |
139
+ | 1.1881 | 87.0 | 696 | 0.9205 | 0.0 | 19.0 |
140
+ | 1.1713 | 88.0 | 704 | 0.9200 | 0.0 | 19.0 |
141
+ | 1.1713 | 89.0 | 712 | 0.9191 | 0.0 | 19.0 |
142
+ | 1.1984 | 90.0 | 720 | 0.9184 | 0.0 | 19.0 |
143
+ | 1.2879 | 91.0 | 728 | 0.9177 | 0.0 | 19.0 |
144
+ | 1.2879 | 92.0 | 736 | 0.9174 | 0.0 | 19.0 |
145
+ | 1.1823 | 93.0 | 744 | 0.9171 | 0.0 | 19.0 |
146
+ | 1.1823 | 94.0 | 752 | 0.9170 | 0.0 | 19.0 |
147
+ | 1.2293 | 95.0 | 760 | 0.9166 | 0.0 | 19.0 |
148
+ | 1.2293 | 96.0 | 768 | 0.9162 | 0.0 | 19.0 |
149
+ | 1.2154 | 97.0 | 776 | 0.9160 | 0.0 | 19.0 |
150
+ | 1.1625 | 98.0 | 784 | 0.9158 | 0.0 | 19.0 |
151
+ | 1.1625 | 99.0 | 792 | 0.9156 | 0.0 | 19.0 |
152
+ | 1.1679 | 100.0 | 800 | 0.9156 | 0.0 | 19.0 |
153
 
154
 
155
  ### Framework versions
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c3945ca7f1b40c2c534d35d7e0e2683d528efce086c61c34664be8f852cc659f
3
  size 242071641
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1fbc59ede370c7bad18825e7410c56d060f3c5e5c53ea627e6891b6f505c7c38
3
  size 242071641