End of training

Browse files

Files changed (5) hide show

README.md +12 -32
model.safetensors +1 -1
runs/Apr01_18-06-47_gweltaz-NUC10i7FNK/events.out.tfevents.1711987611.gweltaz-NUC10i7FNK +2 -2
runs/Apr01_18-54-07_gweltaz-NUC10i7FNK/events.out.tfevents.1711990451.gweltaz-NUC10i7FNK +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,8 +1,8 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
-base_model: distilbert/distilgpt2
 model-index:
 - name: tiny-gpt2-br
   results: []
@@ -13,9 +13,14 @@ should probably proofread and complete it, then remove this comment. -->
 # tiny-gpt2-br
-This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.6213
 ## Model description
@@ -34,39 +39,14 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.001
-- train_batch_size: 16
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- training_steps: 19000
-### Training results
-| Training Loss | Epoch | Step  | Validation Loss |
-|:-------------:|:-----:|:-----:|:---------------:|
-| 5.7274        | 0.21  | 1000  | 4.9308          |
-| 4.7191        | 0.42  | 2000  | 4.5347          |
-| 4.4579        | 0.63  | 3000  | 4.3432          |
-| 4.2769        | 0.84  | 4000  | 4.1893          |
-| 4.1086        | 1.05  | 5000  | 4.0861          |
-| 3.9327        | 1.26  | 6000  | 3.9992          |
-| 3.8812        | 1.47  | 7000  | 3.9216          |
-| 3.8298        | 1.68  | 8000  | 3.8648          |
-| 3.7785        | 1.89  | 9000  | 3.8126          |
-| 3.6099        | 2.1   | 10000 | 3.7931          |
-| 3.471         | 2.31  | 11000 | 3.7539          |
-| 3.4651        | 2.52  | 12000 | 3.7141          |
-| 3.4451        | 2.73  | 13000 | 3.6754          |
-| 3.4251        | 2.94  | 14000 | 3.6327          |
-| 3.1855        | 3.15  | 15000 | 3.6779          |
-| 3.0962        | 3.36  | 16000 | 3.6757          |
-| 3.0971        | 3.56  | 17000 | 3.6437          |
-| 3.0816        | 3.77  | 18000 | 3.6287          |
-| 3.0582        | 3.98  | 19000 | 3.6213          |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: distilbert/distilgpt2
 tags:
 - generated_from_trainer
 model-index:
 - name: tiny-gpt2-br
   results: []
 # tiny-gpt2-br
+This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- eval_loss: 3.2672
+- eval_runtime: 134.3513
+- eval_samples_per_second: 259.023
+- eval_steps_per_second: 16.189
+- epoch: 3.42
+- step: 134000
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0008
+- train_batch_size: 8
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 4
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:46dcb6f9966016e71724c39b390c540dd2d8fd77f4bd9626dd290fb42b8fcf66
 size 327657928

 version https://git-lfs.github.com/spec/v1
+oid sha256:f6721d43c325291c48544c3a1e5ae02183dd14da471bef6ba21b6384c03a7eac
 size 327657928

runs/Apr01_18-06-47_gweltaz-NUC10i7FNK/events.out.tfevents.1711987611.gweltaz-NUC10i7FNK CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c9f1a904bb874ab823568ffc458dafb0b8a520d84cbb016c88b959f3adeef87b
-size 7029

 version https://git-lfs.github.com/spec/v1
+oid sha256:5567e05cfda04d34326806bbdf4513f4371bbb30357fe3e7c72a00d17b0d1a45
+size 7300

runs/Apr01_18-54-07_gweltaz-NUC10i7FNK/events.out.tfevents.1711990451.gweltaz-NUC10i7FNK ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5c5990c73a5a4ecbb8f49db6f31581480c903bf193430f8cc698d7f8dabddead
+size 70540

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:39a8d6f02fb3929cbe6520c32e9bb74308e8f5c45da0791a2253fb8abae27908
 size 4475

 version https://git-lfs.github.com/spec/v1
+oid sha256:7b0acbee13ddf623aa2f948dcc2d177e96c8edd64a2eed6cdc5776481bd85c9e
 size 4475