Saving best model of SciBERT_TwoWayLoss_25K_bs64 to hub

Files changed (5)

README.md +15 -15
all_results.json +5 -5
pytorch_model.bin +1 -1
train_results.json +5 -5
training_args.bin +1 -1

README.md CHANGED

@@ -8,23 +8,23 @@ metrics:
 - recall
 - f1
 model-index:
-- name: SciBERT_twowayloss_25K_bs64
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# SciBERT_twowayloss_25K_bs64
 This model is a fine-tuned version of [allenai/scibert_scivocab_uncased](https://huggingface.co/allenai/scibert_scivocab_uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0158
-- Accuracy: 0.9945
-- Precision: 0.7948
-- Recall: 0.5830
-- F1: 0.6727
-- Hamming: 0.0055
 ## Model description
@@ -44,8 +44,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 64
-- eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -56,11 +56,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy | Precision | Recall | F1     | Hamming |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|:-------:|
-| 0.0332        | 0.16  | 5000  | 0.0283          | 0.9921   | 0.8249    | 0.2410 | 0.3730 | 0.0079  |
-| 0.0195        | 0.32  | 10000 | 0.0186          | 0.9939   | 0.7964    | 0.4983 | 0.6131 | 0.0061  |
-| 0.0173        | 0.47  | 15000 | 0.0168          | 0.9943   | 0.7936    | 0.5587 | 0.6557 | 0.0057  |
-| 0.0165        | 0.63  | 20000 | 0.0161          | 0.9944   | 0.7949    | 0.5782 | 0.6694 | 0.0056  |
-| 0.0161        | 0.79  | 25000 | 0.0158          | 0.9945   | 0.7948    | 0.5830 | 0.6727 | 0.0055  |
 ### Framework versions

 - recall
 - f1
 model-index:
+- name: SciBERT_TwoWayLoss_25K_bs64
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# SciBERT_TwoWayLoss_25K_bs64
 This model is a fine-tuned version of [allenai/scibert_scivocab_uncased](https://huggingface.co/allenai/scibert_scivocab_uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.7117
+- Accuracy: 0.7367
+- Precision: 0.0357
+- Recall: 0.9994
+- F1: 0.0689
+- Hamming: 0.2633
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 192
+- eval_batch_size: 192
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step  | Validation Loss | Accuracy | Precision | Recall | F1     | Hamming |
 |:-------------:|:-----:|:-----:|:---------------:|:--------:|:---------:|:------:|:------:|:-------:|
+| 6.7538        | 0.47  | 5000  | 6.4722          | 0.7208   | 0.0337    | 0.9987 | 0.0652 | 0.2792  |
+| 6.1625        | 0.95  | 10000 | 6.0293          | 0.7311   | 0.0350    | 0.9991 | 0.0676 | 0.2689  |
+| 5.7863        | 1.42  | 15000 | 5.8415          | 0.7362   | 0.0356    | 0.9992 | 0.0688 | 0.2638  |
+| 5.6995        | 1.9   | 20000 | 5.7343          | 0.7366   | 0.0357    | 0.9994 | 0.0689 | 0.2634  |
+| 5.4711        | 2.37  | 25000 | 5.7117          | 0.7367   | 0.0357    | 0.9994 | 0.0689 | 0.2633  |
 ### Framework versions
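
Note on the README.md metrics above: the reported numbers can be cross-checked against each other. The sketch below assumes that F1 is the harmonic mean of the reported precision and recall, and that the Hamming value is one minus the reported accuracy. Both relations are inferred from the figures in the diff; the evaluation code is not part of this commit, so neither is a confirmed definition. The helper name `f1_from` is hypothetical.

```python
def f1_from(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Final evaluation metrics copied from the removed (-) and added (+) README lines.
old = {"accuracy": 0.9945, "precision": 0.7948, "recall": 0.5830,
       "f1": 0.6727, "hamming": 0.0055}
new = {"accuracy": 0.7367, "precision": 0.0357, "recall": 0.9994,
       "f1": 0.0689, "hamming": 0.2633}

for label, m in (("old", old), ("new", new)):
    f1 = f1_from(m["precision"], m["recall"])
    hamming = 1.0 - m["accuracy"]  # assumed relation, inferred from the numbers
    print(f"{label}: F1 recomputed {f1:.4f} vs reported {m['f1']:.4f}; "
          f"1 - accuracy {hamming:.4f} vs reported Hamming {m['hamming']:.4f}")
```

The recomputed values agree with the reported ones up to rounding of the four-decimal inputs, for both the old and the new checkpoint.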

all_results.json CHANGED

@@ -1,7 +1,7 @@
 {
-    "epoch": 0.79,
-    "train_loss": 0.035478990478515625,
-    "train_runtime": 37421.8729,
-    "train_samples_per_second": 42.756,
-    "train_steps_per_second": 0.668
 }

 {
+    "epoch": 2.37,
+    "train_loss": 6.40765580078125,
+    "train_runtime": 40238.8097,
+    "train_samples_per_second": 119.288,
+    "train_steps_per_second": 0.621
 }
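
Note on the throughput figures above: dividing `train_samples_per_second` by `train_steps_per_second` gives the number of samples consumed per optimizer step, which can be compared with the `train_batch_size` values in README.md. The sketch below also estimates the training-set size from the step count, batch size, and epoch fraction. It assumes that step 25000 (the last logged row) is the final step of each run and that no other factor changes the samples per step; the function names are hypothetical.

```python
def samples_per_step(samples_per_second: float, steps_per_second: float) -> float:
    """Effective number of samples processed per optimizer step."""
    return samples_per_second / steps_per_second


def implied_dataset_size(steps: int, batch_size: int, epochs: float) -> float:
    """Training-set size implied by steps * batch_size samples covering `epochs` passes."""
    return steps * batch_size / epochs


# Values copied from the removed (-) and added (+) lines of all_results.json and README.md.
old_bs = samples_per_step(42.756, 0.668)
new_bs = samples_per_step(119.288, 0.621)
print(f"old samples/step: {old_bs:.1f} (README train_batch_size: 64)")
print(f"new samples/step: {new_bs:.1f} (README train_batch_size: 192)")

# Assumes 25000 total steps in both runs, taken from the last row of the results table.
old_n = implied_dataset_size(25000, 64, 0.79)
new_n = implied_dataset_size(25000, 192, 2.37)
print(f"implied training-set size: old {old_n:,.0f}, new {new_n:,.0f}")
```

Under these assumptions both runs point to the same training set of roughly two million examples, so the jump from 0.79 to 2.37 epochs is consistent with the threefold larger batch size at an unchanged step count.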

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:194ef24bb5654deb12d392721c5b011063d67909de835fb505fbfe5ec9bc44c2
 size 440249777

 version https://git-lfs.github.com/spec/v1
+oid sha256:b47e03d0bfd48e03b50aed01a5195ecdbf8d5645c84471f38fe403229b856bda
 size 440249777

train_results.json CHANGED

@@ -1,7 +1,7 @@
 {
-    "epoch": 0.79,
-    "train_loss": 0.035478990478515625,
-    "train_runtime": 37421.8729,
-    "train_samples_per_second": 42.756,
-    "train_steps_per_second": 0.668
 }

 {
+    "epoch": 2.37,
+    "train_loss": 6.40765580078125,
+    "train_runtime": 40238.8097,
+    "train_samples_per_second": 119.288,
+    "train_steps_per_second": 0.621
 }

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:966cfd2427291b363134071b600212dc366ae784066861a7545e04cb453c3d0c
 size 4155

 version https://git-lfs.github.com/spec/v1
+oid sha256:0bed994926f1d60d4ef894b7c02dab3a173005b5db54ee74c2f0e1711cca1f98
 size 4155