tiagoblima committed on
Commit 877f461
1 Parent(s): 036bfa1

End of training

Files changed (1)
  1. README.md +11 -17
README.md CHANGED
@@ -15,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->

  # newsdata-bert

- This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.7534
- - Accuracy: 0.8531

  ## Model description

@@ -38,28 +38,22 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 2
  - eval_batch_size: 2
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 1

  ### Training results

- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
- |:-------------:|:------:|:-----:|:---------------:|:--------:|
- | 1.4704 | 0.0859 | 5000 | 1.4487 | 0.6858 |
- | 1.1946 | 0.1718 | 10000 | 1.2197 | 0.7417 |
- | 1.1323 | 0.2577 | 15000 | 0.9984 | 0.7733 |
- | 0.9926 | 0.3436 | 20000 | 1.0195 | 0.7901 |
- | 0.9232 | 0.4295 | 25000 | 0.9879 | 0.8089 |
- | 0.9273 | 0.5155 | 30000 | 0.8956 | 0.8224 |
- | 1.0023 | 0.6014 | 35000 | 0.8435 | 0.8277 |
- | 0.7566 | 0.6873 | 40000 | 0.8668 | 0.8331 |
- | 0.9032 | 0.7732 | 45000 | 0.8221 | 0.8408 |
- | 0.7227 | 0.8591 | 50000 | 0.7653 | 0.8456 |
- | 0.8159 | 0.9450 | 55000 | 0.7534 | 0.8531 |


  ### Framework versions
 

  # newsdata-bert

+ This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on an unknown dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 0.5569
+ - Accuracy: 0.8447

  ## Model description

 

  The following hyperparameters were used during training:
  - learning_rate: 2e-05
+ - train_batch_size: 4
  - eval_batch_size: 2
  - seed: 42
+ - gradient_accumulation_steps: 4
+ - total_train_batch_size: 16
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 1
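
The `total_train_batch_size` of 16 is consistent with the two settings added in this commit: a per-step batch of 4 accumulated over 4 steps. A minimal sketch of that arithmetic follows. It assumes a single training device, which the card does not state; the function name and the `num_devices` parameter are hypothetical and exist only for illustration.

```python
def effective_batch_size(per_device_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Number of examples that contribute to one optimizer step."""
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Values taken from the hyperparameter list; num_devices=1 is an assumption.
total = effective_batch_size(per_device_batch_size=4,
                             gradient_accumulation_steps=4)
print(total)  # 16
```

With gradient accumulation, the optimizer updates once every 4 forward/backward passes, so the step counts in the training results count optimizer steps under this reading, not individual batches of 4.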

  ### Training results

+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
+ |:-------------:|:------:|:----:|:---------------:|:--------:|
+ | 0.8762 | 0.2749 | 2000 | 0.7835 | 0.7864 |
+ | 0.6718 | 0.5498 | 4000 | 0.6341 | 0.8259 |
+ | 0.5852 | 0.8247 | 6000 | 0.5569 | 0.8447 |
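
The Epoch and Step columns can be used as a rough sanity check that the old and new runs covered the same data. The sketch below divides a logged step by its epoch fraction to estimate optimizer steps per epoch, then multiplies by the effective batch size. It assumes a single device, no gradient accumulation in the old run (none is listed for it), and that the logged epoch values are rounded to four decimals, so the results are approximate. The function name is hypothetical.

```python
def estimate_examples_per_epoch(step: int, epoch: float,
                                effective_batch_size: int) -> float:
    """Estimate the training-set size from one logged (step, epoch) pair."""
    steps_per_epoch = step / epoch  # optimizer steps in one full epoch
    return steps_per_epoch * effective_batch_size


# Last logged row of each run; batch sizes come from the hyperparameter lists.
old_run = estimate_examples_per_epoch(step=55000, epoch=0.9450,
                                      effective_batch_size=2)
new_run = estimate_examples_per_epoch(step=6000, epoch=0.8247,
                                      effective_batch_size=16)

print(f"old run: ~{old_run:,.0f} examples per epoch")
print(f"new run: ~{new_run:,.0f} examples per epoch")
```

Under these assumptions both estimates come out near 116,400 examples, which suggests the two runs saw the same training set and differ mainly in batch configuration.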


  ### Framework versions