nlparabic
/

res_nw_dj

Generated from Trainer

Model card Files Files and versions Community

nlparabic commited on Aug 31

Commit

d9e2495

•

1 Parent(s): 90c3acb

Model save

Files changed (2) hide show

README.md +18 -11
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [riotu-lab/ArabianGPT-01B](https://huggingface.co/riotu-lab/ArabianGPT-01B) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6499
-- Bleu: 0.2518
-- Rouge1: 0.5702
-- Rouge2: 0.3118
-- Rougel: 0.5677
 ## Model description
@@ -48,15 +48,22 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 3.0
 ### Training results
-| Training Loss | Epoch | Step | Bleu   | Validation Loss | Rouge1 | Rouge2 | Rougel |
-|:-------------:|:-----:|:----:|:------:|:---------------:|:------:|:------:|:------:|
-| 1.2513        | 1.0   | 2703 | 0.2201 | 0.7112          | 0.5172 | 0.2533 | 0.5148 |
-| 0.6462        | 2.0   | 5406 | 0.2451 | 0.6569          | 0.5579 | 0.2974 | 0.5553 |
-| 0.5673        | 3.0   | 8109 | 0.2518 | 0.6499          | 0.5702 | 0.3118 | 0.5677 |
 ### Framework versions

 This model is a fine-tuned version of [riotu-lab/ArabianGPT-01B](https://huggingface.co/riotu-lab/ArabianGPT-01B) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6658
+- Bleu: 0.2770
+- Rouge1: 0.6173
+- Rouge2: 0.3729
+- Rougel: 0.6153
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 20.0
 ### Training results
+| Training Loss | Epoch | Step  | Bleu   | Validation Loss | Rouge1 | Rouge2 | Rougel |
+|:-------------:|:-----:|:-----:|:------:|:---------------:|:------:|:------:|:------:|
+| 1.2513        | 1.0   | 2703  | 0.2201 | 0.7112          | 0.5172 | 0.2533 | 0.5148 |
+| 0.6462        | 2.0   | 5406  | 0.2451 | 0.6569          | 0.5579 | 0.2974 | 0.5553 |
+| 0.5673        | 3.0   | 8109  | 0.2518 | 0.6499          | 0.5702 | 0.3118 | 0.5677 |
+| 0.556         | 4.0   | 10812 | 0.6330 | 0.2598          | 0.5829 | 0.3262 | 0.5807 |
+| 0.4786        | 5.0   | 13515 | 0.6266 | 0.2679          | 0.5977 | 0.3443 | 0.5959 |
+| 0.4123        | 6.0   | 16218 | 0.6303 | 0.2707          | 0.6044 | 0.3548 | 0.6027 |
+| 0.3573        | 7.0   | 18921 | 0.6372 | 0.2739          | 0.6108 | 0.3631 | 0.6088 |
+| 0.3108        | 8.0   | 21624 | 0.6466 | 0.2760          | 0.6130 | 0.3678 | 0.6112 |
+| 0.2719        | 9.0   | 24327 | 0.6551 | 0.2802          | 0.6178 | 0.3727 | 0.6159 |
+| 0.2396        | 10.0  | 27030 | 0.6658 | 0.2770          | 0.6173 | 0.3729 | 0.6153 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3fa37224b6c297bbacc0c91da212ceacbcb5225401b84b2752097c6defd9dbad
 size 539221632

 version https://git-lfs.github.com/spec/v1
+oid sha256:bfe3c622bc3f36c295706ec7e3fd153d28f79268fcf6e11f09f6a77f69068610
 size 539221632