fydhfzh commited on
Commit
edf3530
1 Parent(s): ce2ac4f

End of training

Browse files
README.md CHANGED
@@ -20,12 +20,12 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [facebook/hubert-base-ls960](https://huggingface.co/facebook/hubert-base-ls960) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 1.1320
24
- - Accuracy: 0.7724
25
- - Precision: 0.8107
26
- - Recall: 0.7724
27
- - F1: 0.7633
28
- - Binary: 0.8448
29
 
30
  ## Model description
31
 
@@ -45,73 +45,37 @@ More information needed
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 3e-05
48
- - train_batch_size: 32
49
  - eval_batch_size: 32
50
  - seed: 42
51
  - gradient_accumulation_steps: 4
52
- - total_train_batch_size: 128
53
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
54
  - lr_scheduler_type: linear
55
- - num_epochs: 10
56
  - mixed_precision_training: Native AMP
57
 
58
  ### Training results
59
 
60
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 | Binary |
61
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|:------:|
62
- | No log | 0.17 | 50 | 4.2981 | 0.0242 | 0.0171 | 0.0242 | 0.0108 | 0.2760 |
63
- | No log | 0.35 | 100 | 3.9571 | 0.0315 | 0.0038 | 0.0315 | 0.0059 | 0.3133 |
64
- | No log | 0.52 | 150 | 3.7311 | 0.0678 | 0.0243 | 0.0678 | 0.0277 | 0.3424 |
65
- | No log | 0.69 | 200 | 3.5663 | 0.0896 | 0.0557 | 0.0896 | 0.0513 | 0.3593 |
66
- | No log | 0.86 | 250 | 3.4934 | 0.0944 | 0.0408 | 0.0944 | 0.0477 | 0.3545 |
67
- | No log | 1.04 | 300 | 3.3705 | 0.1235 | 0.0864 | 0.1235 | 0.0706 | 0.3748 |
68
- | No log | 1.21 | 350 | 3.2630 | 0.1429 | 0.1042 | 0.1429 | 0.0811 | 0.3906 |
69
- | No log | 1.38 | 400 | 3.1551 | 0.1574 | 0.1413 | 0.1574 | 0.1118 | 0.4029 |
70
- | No log | 1.55 | 450 | 3.0426 | 0.2349 | 0.1580 | 0.2349 | 0.1585 | 0.4593 |
71
- | 3.6339 | 1.73 | 500 | 2.9462 | 0.2542 | 0.1854 | 0.2542 | 0.1806 | 0.4736 |
72
- | 3.6339 | 1.9 | 550 | 2.8439 | 0.2663 | 0.2105 | 0.2663 | 0.2084 | 0.4814 |
73
- | 3.6339 | 2.07 | 600 | 2.7192 | 0.3051 | 0.2797 | 0.3051 | 0.2411 | 0.5092 |
74
- | 3.6339 | 2.24 | 650 | 2.6390 | 0.3293 | 0.3167 | 0.3293 | 0.2650 | 0.5274 |
75
- | 3.6339 | 2.42 | 700 | 2.5491 | 0.3753 | 0.3731 | 0.3753 | 0.3290 | 0.5600 |
76
- | 3.6339 | 2.59 | 750 | 2.4728 | 0.4092 | 0.3891 | 0.4092 | 0.3595 | 0.5823 |
77
- | 3.6339 | 2.76 | 800 | 2.3395 | 0.4431 | 0.4205 | 0.4431 | 0.3917 | 0.6082 |
78
- | 3.6339 | 2.93 | 850 | 2.2685 | 0.4552 | 0.4355 | 0.4552 | 0.4028 | 0.6160 |
79
- | 3.6339 | 3.11 | 900 | 2.1883 | 0.4915 | 0.4680 | 0.4915 | 0.4387 | 0.6414 |
80
- | 3.6339 | 3.28 | 950 | 2.1182 | 0.4843 | 0.5102 | 0.4843 | 0.4440 | 0.6363 |
81
- | 2.6665 | 3.45 | 1000 | 2.0197 | 0.5448 | 0.5629 | 0.5448 | 0.5028 | 0.6804 |
82
- | 2.6665 | 3.62 | 1050 | 1.9782 | 0.5327 | 0.5532 | 0.5327 | 0.4935 | 0.6712 |
83
- | 2.6665 | 3.8 | 1100 | 1.9313 | 0.5593 | 0.5486 | 0.5593 | 0.5156 | 0.6930 |
84
- | 2.6665 | 3.97 | 1150 | 1.8627 | 0.5908 | 0.5893 | 0.5908 | 0.5513 | 0.7119 |
85
- | 2.6665 | 4.14 | 1200 | 1.8169 | 0.5908 | 0.5834 | 0.5908 | 0.5543 | 0.7128 |
86
- | 2.6665 | 4.31 | 1250 | 1.7702 | 0.5835 | 0.5843 | 0.5835 | 0.5487 | 0.7077 |
87
- | 2.6665 | 4.49 | 1300 | 1.7007 | 0.6344 | 0.6857 | 0.6344 | 0.6124 | 0.7438 |
88
- | 2.6665 | 4.66 | 1350 | 1.6638 | 0.6199 | 0.6156 | 0.6199 | 0.5850 | 0.7354 |
89
- | 2.6665 | 4.83 | 1400 | 1.6198 | 0.6368 | 0.6325 | 0.6368 | 0.6004 | 0.7482 |
90
- | 2.6665 | 5.0 | 1450 | 1.5672 | 0.6804 | 0.6888 | 0.6804 | 0.6529 | 0.7753 |
91
- | 2.0909 | 5.18 | 1500 | 1.5308 | 0.6683 | 0.6870 | 0.6683 | 0.6437 | 0.7692 |
92
- | 2.0909 | 5.35 | 1550 | 1.4946 | 0.6877 | 0.6969 | 0.6877 | 0.6632 | 0.7811 |
93
- | 2.0909 | 5.52 | 1600 | 1.4698 | 0.6755 | 0.6767 | 0.6755 | 0.6454 | 0.7743 |
94
- | 2.0909 | 5.69 | 1650 | 1.4228 | 0.6804 | 0.7066 | 0.6804 | 0.6612 | 0.7785 |
95
- | 2.0909 | 5.87 | 1700 | 1.3709 | 0.7312 | 0.7432 | 0.7312 | 0.7128 | 0.8140 |
96
- | 2.0909 | 6.04 | 1750 | 1.3780 | 0.7215 | 0.7356 | 0.7215 | 0.7010 | 0.8082 |
97
- | 2.0909 | 6.21 | 1800 | 1.3291 | 0.7215 | 0.7370 | 0.7215 | 0.7007 | 0.8090 |
98
- | 2.0909 | 6.38 | 1850 | 1.3296 | 0.7191 | 0.7333 | 0.7191 | 0.7028 | 0.8056 |
99
- | 2.0909 | 6.56 | 1900 | 1.3195 | 0.7191 | 0.7584 | 0.7191 | 0.7069 | 0.8048 |
100
- | 2.0909 | 6.73 | 1950 | 1.2939 | 0.7191 | 0.7609 | 0.7191 | 0.7019 | 0.8065 |
101
- | 1.75 | 6.9 | 2000 | 1.2800 | 0.7191 | 0.7353 | 0.7191 | 0.7018 | 0.8065 |
102
- | 1.75 | 7.08 | 2050 | 1.2767 | 0.7094 | 0.7175 | 0.7094 | 0.6920 | 0.7998 |
103
- | 1.75 | 7.25 | 2100 | 1.2280 | 0.7264 | 0.7689 | 0.7264 | 0.7148 | 0.8116 |
104
- | 1.75 | 7.42 | 2150 | 1.2231 | 0.7385 | 0.7585 | 0.7385 | 0.7246 | 0.8201 |
105
- | 1.75 | 7.59 | 2200 | 1.2198 | 0.7385 | 0.7563 | 0.7385 | 0.7248 | 0.8201 |
106
- | 1.75 | 7.77 | 2250 | 1.1782 | 0.7482 | 0.7634 | 0.7482 | 0.7352 | 0.8269 |
107
- | 1.75 | 7.94 | 2300 | 1.1848 | 0.7579 | 0.7900 | 0.7579 | 0.7519 | 0.8337 |
108
- | 1.75 | 8.11 | 2350 | 1.1773 | 0.7579 | 0.7875 | 0.7579 | 0.7484 | 0.8346 |
109
- | 1.75 | 8.28 | 2400 | 1.1752 | 0.7676 | 0.7965 | 0.7676 | 0.7594 | 0.8404 |
110
- | 1.75 | 8.46 | 2450 | 1.1563 | 0.7724 | 0.8048 | 0.7724 | 0.7649 | 0.8438 |
111
- | 1.5635 | 8.63 | 2500 | 1.1320 | 0.7724 | 0.8107 | 0.7724 | 0.7633 | 0.8448 |
112
- | 1.5635 | 8.8 | 2550 | 1.1194 | 0.7700 | 0.8018 | 0.7700 | 0.7601 | 0.8421 |
113
- | 1.5635 | 8.97 | 2600 | 1.1268 | 0.7554 | 0.7756 | 0.7554 | 0.7448 | 0.8329 |
114
- | 1.5635 | 9.15 | 2650 | 1.1176 | 0.7676 | 0.7844 | 0.7676 | 0.7567 | 0.8404 |
115
 
116
 
117
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [facebook/hubert-base-ls960](https://huggingface.co/facebook/hubert-base-ls960) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 2.4637
24
+ - Accuracy: 0.5230
25
+ - Precision: 0.4945
26
+ - Recall: 0.5230
27
+ - F1: 0.4700
28
+ - Binary: 0.6634
29
 
30
  ## Model description
31
 
 
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 3e-05
48
+ - train_batch_size: 64
49
  - eval_batch_size: 32
50
  - seed: 42
51
  - gradient_accumulation_steps: 4
52
+ - total_train_batch_size: 256
53
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
54
  - lr_scheduler_type: linear
55
+ - num_epochs: 30
56
  - mixed_precision_training: Native AMP
57
 
58
  ### Training results
59
 
60
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1 | Binary |
61
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|:------:|
62
+ | No log | 1.72 | 50 | 4.2179 | 0.0484 | 0.0065 | 0.0484 | 0.0105 | 0.3058 |
63
+ | No log | 3.45 | 100 | 3.8319 | 0.1017 | 0.0846 | 0.1017 | 0.0618 | 0.3634 |
64
+ | No log | 5.17 | 150 | 3.5448 | 0.1864 | 0.1327 | 0.1864 | 0.1311 | 0.4274 |
65
+ | No log | 6.9 | 200 | 3.3129 | 0.2470 | 0.2063 | 0.2470 | 0.1855 | 0.4671 |
66
+ | No log | 8.62 | 250 | 3.1207 | 0.3123 | 0.3090 | 0.3123 | 0.2599 | 0.5150 |
67
+ | No log | 10.34 | 300 | 2.9535 | 0.3826 | 0.3524 | 0.3826 | 0.3277 | 0.5644 |
68
+ | No log | 12.07 | 350 | 2.8121 | 0.4310 | 0.3894 | 0.4310 | 0.3695 | 0.5983 |
69
+ | No log | 13.79 | 400 | 2.6726 | 0.4431 | 0.3939 | 0.4431 | 0.3775 | 0.6075 |
70
+ | No log | 15.52 | 450 | 2.5597 | 0.4818 | 0.4413 | 0.4818 | 0.4206 | 0.6370 |
71
+ | 3.4474 | 17.24 | 500 | 2.4637 | 0.5230 | 0.4945 | 0.5230 | 0.4700 | 0.6634 |
72
+ | 3.4474 | 18.97 | 550 | 2.3747 | 0.5400 | 0.5111 | 0.5400 | 0.4920 | 0.6760 |
73
+ | 3.4474 | 20.69 | 600 | 2.3113 | 0.5545 | 0.5212 | 0.5545 | 0.5067 | 0.6872 |
74
+ | 3.4474 | 22.41 | 650 | 2.2492 | 0.5714 | 0.5475 | 0.5714 | 0.5274 | 0.7007 |
75
+ | 3.4474 | 24.14 | 700 | 2.2053 | 0.5738 | 0.5511 | 0.5738 | 0.5336 | 0.7015 |
76
+ | 3.4474 | 25.86 | 750 | 2.1757 | 0.5714 | 0.5477 | 0.5714 | 0.5283 | 0.7015 |
77
+ | 3.4474 | 27.59 | 800 | 2.1491 | 0.5908 | 0.5574 | 0.5908 | 0.5468 | 0.7140 |
78
+ | 3.4474 | 29.31 | 850 | 2.1403 | 0.5932 | 0.5625 | 0.5932 | 0.5506 | 0.7167 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
79
 
80
 
81
  ### Framework versions
runs/Jun13_22-20-06_LAPTOP-1GID9RGH/events.out.tfevents.1718292007.LAPTOP-1GID9RGH.20972.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4f3ac414af9901074d27c92a4216fee74122b7ba93d96f3a99cfe46dad60e884
3
- size 14433
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1facfeb5d750e6b7dd2fe8fcdf5e9301bebc01ac786529567e735f269f7bead9
3
+ size 18441
runs/Jun13_22-20-06_LAPTOP-1GID9RGH/events.out.tfevents.1718292615.LAPTOP-1GID9RGH.20972.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:824e41826fa631f53b1cf06a0d3a82f608c16fdf20a7b8bc424ef43d391cb5b7
3
+ size 610