meoo225 commited on
Commit
47265c7
1 Parent(s): a94959d

End of training

Browse files
README.md CHANGED
@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.2349
20
- - Bleu Score: 79.2125
21
- - Gen Len: 12.7933
22
 
23
  ## Model description
24
 
@@ -38,8 +38,8 @@ More information needed
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 0.0001
41
- - train_batch_size: 8
42
- - eval_batch_size: 8
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
51
  |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
52
- | 0.4686 | 1.0 | 838 | 0.2500 | 77.4621 | 12.8244 |
53
- | 0.1722 | 2.0 | 1676 | 0.2120 | 78.5608 | 12.7933 |
54
- | 0.0703 | 3.0 | 2514 | 0.2349 | 79.2125 | 12.7933 |
55
 
56
 
57
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [VietAI/vit5-base](https://huggingface.co/VietAI/vit5-base) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.2178
20
+ - Bleu Score: 78.7502
21
+ - Gen Len: 12.7826
22
 
23
  ## Model description
24
 
 
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 0.0001
41
+ - train_batch_size: 16
42
+ - eval_batch_size: 16
43
  - seed: 42
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss | Bleu Score | Gen Len |
51
  |:-------------:|:-----:|:----:|:---------------:|:----------:|:-------:|
52
+ | 0.4635 | 1.0 | 419 | 0.2255 | 77.2861 | 12.7969 |
53
+ | 0.166 | 2.0 | 838 | 0.2026 | 78.3051 | 12.8041 |
54
+ | 0.0752 | 3.0 | 1257 | 0.2178 | 78.7502 | 12.7826 |
55
 
56
 
57
  ### Framework versions
logs/events.out.tfevents.1727299711.b656d3ec1883.4091.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:04d62fe30d08622905366060894d4c8842a03b20119bc924a7e467a248d6029a
3
- size 6358
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef116e684e6a2110b60a31b2317f78e3ca12c3c752b7533d0633555f02e885da
3
+ size 7299
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0471cd61a7a8fde7d6309a119ac67538db2459f640cf031e0168bf565709fb21
3
  size 903834408
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a3d56343698946c69ec8dc1217225d632d5bf35cd2f709a12aa3a7f2ae15623a
3
  size 903834408