mateiaassAI commited on
Commit
7a66900
1 Parent(s): 13943ef

End of training

Browse files
Files changed (4) hide show
  1. README.md +20 -20
  2. config.json +4 -12
  3. model.safetensors +2 -2
  4. training_args.bin +1 -1
README.md CHANGED
@@ -5,7 +5,7 @@ base_model: dumitrescustefan/bert-base-romanian-cased-v1
5
  tags:
6
  - generated_from_trainer
7
  datasets:
8
- - moroco
9
  metrics:
10
  - f1
11
  - accuracy
@@ -18,24 +18,24 @@ model-index:
18
  name: Text Classification
19
  type: text-classification
20
  dataset:
21
- name: moroco
22
- type: moroco
23
- config: moroco
24
- split: validation
25
- args: moroco
26
  metrics:
27
  - name: F1
28
  type: f1
29
- value: 0.8735212436845421
30
  - name: Accuracy
31
  type: accuracy
32
- value: 0.8554298260428982
33
  - name: Precision
34
  type: precision
35
- value: 0.875780196577562
36
  - name: Recall
37
  type: recall
38
- value: 0.8713068696755483
39
  ---
40
 
41
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -43,14 +43,14 @@ should probably proofread and complete it, then remove this comment. -->
43
 
44
  # teacher_laroseda
45
 
46
- This model is a fine-tuned version of [dumitrescustefan/bert-base-romanian-cased-v1](https://huggingface.co/dumitrescustefan/bert-base-romanian-cased-v1) on the moroco dataset.
47
  It achieves the following results on the evaluation set:
48
- - Loss: 0.0937
49
- - F1: 0.8735
50
- - Roc Auc: None
51
- - Accuracy: 0.8554
52
- - Precision: 0.8758
53
- - Recall: 0.8713
54
 
55
  ## Model description
56
 
@@ -69,7 +69,7 @@ More information needed
69
  ### Training hyperparameters
70
 
71
  The following hyperparameters were used during training:
72
- - learning_rate: 2e-05
73
  - train_batch_size: 16
74
  - eval_batch_size: 16
75
  - seed: 42
@@ -81,8 +81,8 @@ The following hyperparameters were used during training:
81
 
82
  | Training Loss | Epoch | Step | Validation Loss | F1 | Roc Auc | Accuracy | Precision | Recall |
83
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|:--------:|:---------:|:------:|
84
- | 0.1161 | 1.0 | 1358 | 0.1005 | 0.8657 | None | 0.8551 | 0.8822 | 0.8532 |
85
- | 0.0807 | 2.0 | 2716 | 0.0937 | 0.8735 | None | 0.8554 | 0.8758 | 0.8713 |
86
 
87
 
88
  ### Framework versions
 
5
  tags:
6
  - generated_from_trainer
7
  datasets:
8
+ - laroseda
9
  metrics:
10
  - f1
11
  - accuracy
 
18
  name: Text Classification
19
  type: text-classification
20
  dataset:
21
+ name: laroseda
22
+ type: laroseda
23
+ config: laroseda
24
+ split: train
25
+ args: laroseda
26
  metrics:
27
  - name: F1
28
  type: f1
29
+ value: 0.9509986768928282
30
  - name: Accuracy
31
  type: accuracy
32
+ value: 0.951
33
  - name: Precision
34
  type: precision
35
+ value: 0.9510144740842682
36
  - name: Recall
37
  type: recall
38
+ value: 0.951
39
  ---
40
 
41
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
43
 
44
  # teacher_laroseda
45
 
46
+ This model is a fine-tuned version of [dumitrescustefan/bert-base-romanian-cased-v1](https://huggingface.co/dumitrescustefan/bert-base-romanian-cased-v1) on the laroseda dataset.
47
  It achieves the following results on the evaluation set:
48
+ - Loss: 0.1746
49
+ - F1: 0.9510
50
+ - Roc Auc: 0.9510
51
+ - Accuracy: 0.951
52
+ - Precision: 0.9510
53
+ - Recall: 0.951
54
 
55
  ## Model description
56
 
 
69
  ### Training hyperparameters
70
 
71
  The following hyperparameters were used during training:
72
+ - learning_rate: 1.7e-05
73
  - train_batch_size: 16
74
  - eval_batch_size: 16
75
  - seed: 42
 
81
 
82
  | Training Loss | Epoch | Step | Validation Loss | F1 | Roc Auc | Accuracy | Precision | Recall |
83
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|:--------:|:---------:|:------:|
84
+ | 0.2117 | 1.0 | 688 | 0.1362 | 0.9450 | 0.9450 | 0.945 | 0.9450 | 0.945 |
85
+ | 0.1154 | 2.0 | 1376 | 0.1746 | 0.9510 | 0.9510 | 0.951 | 0.9510 | 0.951 |
86
 
87
 
88
  ### Framework versions
config.json CHANGED
@@ -9,22 +9,14 @@
9
  "hidden_dropout_prob": 0.1,
10
  "hidden_size": 768,
11
  "id2label": {
12
- "0": "culture",
13
- "1": "finance",
14
- "2": "politics",
15
- "3": "science",
16
- "4": "sports",
17
- "5": "tech"
18
  },
19
  "initializer_range": 0.02,
20
  "intermediate_size": 3072,
21
  "label2id": {
22
- "culture": 0,
23
- "finance": 1,
24
- "politics": 2,
25
- "science": 3,
26
- "sports": 4,
27
- "tech": 5
28
  },
29
  "layer_norm_eps": 1e-12,
30
  "max_position_embeddings": 512,
 
9
  "hidden_dropout_prob": 0.1,
10
  "hidden_size": 768,
11
  "id2label": {
12
+ "0": "Negative",
13
+ "1": "Positive"
 
 
 
 
14
  },
15
  "initializer_range": 0.02,
16
  "intermediate_size": 3072,
17
  "label2id": {
18
+ "Negative": 0,
19
+ "Positive": 1
 
 
 
 
20
  },
21
  "layer_norm_eps": 1e-12,
22
  "max_position_embeddings": 512,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cae10a859983bfc9f55faf666d79cc33ec9460b5256bfe73aa48c9fd86a55ed7
3
- size 497807376
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a7eae8b2ef892a1405c55df905abc1048b34491c60f1d14379bf615f9e6f4350
3
+ size 497795072
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:70411d2e62a23bf5d52f0985f2b8ba20217404797f084cb74ecb0ab78898dd4b
3
  size 5176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6f624c9b23cf5ec02c13ebf01d927c709264dcc7261413aa6e73a382b9401c96
3
  size 5176