tiagoblima commited on
Commit
78e1bc6
1 Parent(s): 36fb4f1

End of training

Browse files
Files changed (1) hide show
  1. README.md +11 -17
README.md CHANGED
@@ -15,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  # newsdata-bertimbal
17
 
18
- This model is a fine-tuned version of [neuralmind/bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.7455
21
- - Accuracy: 0.8743
22
 
23
  ## Model description
24
 
@@ -38,28 +38,22 @@ More information needed
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 2e-05
41
- - train_batch_size: 2
42
  - eval_batch_size: 2
43
  - seed: 42
 
 
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: linear
46
  - num_epochs: 1
47
 
48
  ### Training results
49
 
50
- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
51
- |:-------------:|:------:|:-----:|:---------------:|:--------:|
52
- | 0.8448 | 0.0859 | 5000 | 1.1296 | 0.8044 |
53
- | 0.7301 | 0.1718 | 10000 | 1.0056 | 0.8258 |
54
- | 0.6827 | 0.2577 | 15000 | 0.9388 | 0.8464 |
55
- | 0.6221 | 0.3436 | 20000 | 0.9358 | 0.8502 |
56
- | 0.5611 | 0.4295 | 25000 | 0.8983 | 0.8567 |
57
- | 0.5291 | 0.5155 | 30000 | 0.8503 | 0.8575 |
58
- | 0.4202 | 0.6014 | 35000 | 0.8353 | 0.8656 |
59
- | 0.5436 | 0.6873 | 40000 | 0.7476 | 0.8706 |
60
- | 0.4814 | 0.7732 | 45000 | 0.7863 | 0.8669 |
61
- | 0.4853 | 0.8591 | 50000 | 0.7284 | 0.8720 |
62
- | 0.39 | 0.9450 | 55000 | 0.7455 | 0.8743 |
63
 
64
 
65
  ### Framework versions
 
15
 
16
  # newsdata-bertimbal
17
 
18
+ This model is a fine-tuned version of [neuralmind/bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.3387
21
+ - Accuracy: 0.9045
22
 
23
  ## Model description
24
 
 
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 2e-05
41
+ - train_batch_size: 4
42
  - eval_batch_size: 2
43
  - seed: 42
44
+ - gradient_accumulation_steps: 4
45
+ - total_train_batch_size: 16
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
  - num_epochs: 1
49
 
50
  ### Training results
51
 
52
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
53
+ |:-------------:|:------:|:----:|:---------------:|:--------:|
54
+ | 0.5125 | 0.2749 | 2000 | 0.4810 | 0.8749 |
55
+ | 0.3874 | 0.5498 | 4000 | 0.3903 | 0.8938 |
56
+ | 0.3425 | 0.8247 | 6000 | 0.3387 | 0.9045 |
 
 
 
 
 
 
 
 
57
 
58
 
59
  ### Framework versions