staghado commited on
Commit
75164bc
·
verified ·
1 Parent(s): b0fcc47

Training in progress, step 27000

Browse files
README.md CHANGED
@@ -21,14 +21,14 @@ should probably proofread and complete it, then remove this comment. -->
21
 
22
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 0.2860
25
  - Precision: 0.5389
26
- - Recall: 0.3948
27
- - F1: 0.4306
28
- - Accuracy: 0.6824
29
- - Binary Precision: 0.7543
30
- - Binary Recall: 0.4522
31
- - Binary F1: 0.5654
32
  - Binary Accuracy: 0.9373
33
 
34
  ## Model description
@@ -60,15 +60,15 @@ The following hyperparameters were used during training:
60
 
61
  | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy | Binary Precision | Binary Recall | Binary F1 | Binary Accuracy |
62
  |:-------------:|:-------:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|:----------------:|:-------------:|:---------:|:---------------:|
63
- | No log | 0 | 0 | 3.2225 | 0.1006 | 0.1672 | 0.0384 | 0.1250 | 0.0 | 0.0 | 0.0 | 0.9098 |
64
- | 0.3184 | 2.4331 | 1000 | 0.3097 | 0.5223 | 0.3701 | 0.3955 | 0.6607 | 0.7759 | 0.3442 | 0.4769 | 0.9319 |
65
- | 0.3056 | 4.8662 | 2000 | 0.3007 | 0.5272 | 0.3835 | 0.4127 | 0.6692 | 0.7653 | 0.3791 | 0.5071 | 0.9335 |
66
- | 0.3019 | 7.2993 | 3000 | 0.3016 | 0.5252 | 0.3825 | 0.4131 | 0.6719 | 0.7829 | 0.3730 | 0.5052 | 0.9341 |
67
- | 0.2981 | 9.7324 | 4000 | 0.2926 | 0.5413 | 0.3875 | 0.4207 | 0.6754 | 0.7487 | 0.4361 | 0.5511 | 0.9359 |
68
- | 0.2952 | 12.1655 | 5000 | 0.2890 | 0.5352 | 0.3971 | 0.4317 | 0.6797 | 0.7439 | 0.4541 | 0.5639 | 0.9367 |
69
- | 0.2936 | 14.5985 | 6000 | 0.2891 | 0.5332 | 0.3938 | 0.4288 | 0.6825 | 0.7705 | 0.4254 | 0.5482 | 0.9368 |
70
- | 0.2856 | 17.0316 | 7000 | 0.2869 | 0.5434 | 0.3871 | 0.4237 | 0.6838 | 0.7689 | 0.4216 | 0.5446 | 0.9364 |
71
- | 0.2888 | 19.4647 | 8000 | 0.2860 | 0.5389 | 0.3948 | 0.4306 | 0.6824 | 0.7543 | 0.4522 | 0.5654 | 0.9373 |
72
 
73
 
74
  ### Framework versions
 
21
 
22
  This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
23
  It achieves the following results on the evaluation set:
24
+ - Loss: 0.2864
25
  - Precision: 0.5389
26
+ - Recall: 0.3949
27
+ - F1: 0.4305
28
+ - Accuracy: 0.6820
29
+ - Binary Precision: 0.7559
30
+ - Binary Recall: 0.4496
31
+ - Binary F1: 0.5638
32
  - Binary Accuracy: 0.9373
33
 
34
  ## Model description
 
60
 
61
  | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy | Binary Precision | Binary Recall | Binary F1 | Binary Accuracy |
62
  |:-------------:|:-------:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|:----------------:|:-------------:|:---------:|:---------------:|
63
+ | No log | 0 | 0 | 2.4591 | 0.1116 | 0.1645 | 0.0484 | 0.1386 | 0.0 | 0.0 | 0.0 | 0.9098 |
64
+ | 0.3196 | 2.4331 | 1000 | 0.3097 | 0.5201 | 0.3691 | 0.3946 | 0.6587 | 0.7614 | 0.3580 | 0.4870 | 0.9320 |
65
+ | 0.3064 | 4.8662 | 2000 | 0.3067 | 0.5273 | 0.3882 | 0.4154 | 0.6599 | 0.7391 | 0.4375 | 0.5496 | 0.9353 |
66
+ | 0.3088 | 7.2993 | 3000 | 0.2951 | 0.5353 | 0.3833 | 0.4169 | 0.6744 | 0.7656 | 0.4007 | 0.5261 | 0.9349 |
67
+ | 0.2991 | 9.7324 | 4000 | 0.2975 | 0.5421 | 0.3921 | 0.4234 | 0.6699 | 0.7316 | 0.4643 | 0.5681 | 0.9363 |
68
+ | 0.2957 | 12.1655 | 5000 | 0.2920 | 0.5362 | 0.3859 | 0.4207 | 0.6813 | 0.7811 | 0.3953 | 0.5249 | 0.9355 |
69
+ | 0.2932 | 14.5985 | 6000 | 0.2881 | 0.5364 | 0.3946 | 0.4298 | 0.6824 | 0.7591 | 0.4351 | 0.5532 | 0.9366 |
70
+ | 0.2862 | 17.0316 | 7000 | 0.2876 | 0.5411 | 0.3850 | 0.4213 | 0.6829 | 0.7713 | 0.4104 | 0.5358 | 0.9359 |
71
+ | 0.2894 | 19.4647 | 8000 | 0.2864 | 0.5389 | 0.3949 | 0.4305 | 0.6820 | 0.7559 | 0.4496 | 0.5638 | 0.9373 |
72
 
73
 
74
  ### Framework versions
final/model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b5e2cca3f2353e925451f37169815e2c4120982698e1a46f7175079c2cf1d281
3
  size 598436708
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0cac61372edf2d52a95fad570f17d06c42bf18cb6718d712325f226e4ff21789
3
  size 598436708
final/tokenizer.json CHANGED
@@ -1,11 +1,6 @@
1
  {
2
  "version": "1.0",
3
- "truncation": {
4
- "direction": "Right",
5
- "max_length": 512,
6
- "strategy": "LongestFirst",
7
- "stride": 0
8
- },
9
  "padding": null,
10
  "added_tokens": [
11
  {
 
1
  {
2
  "version": "1.0",
3
+ "truncation": null,
 
 
 
 
 
4
  "padding": null,
5
  "added_tokens": [
6
  {
final/training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b05515bc301f3559c8f36e2c4adb77a63a0f061c838426684c2c41e01c6e0a8d
3
  size 5368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4b031af4edaa4724f2593c3de4c4f2ffb1bf3c560c8d8675dc813770ba9de80e
3
  size 5368
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b5e2cca3f2353e925451f37169815e2c4120982698e1a46f7175079c2cf1d281
3
  size 598436708
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:753b300530ce8d28493909576a1422b1ae7bb54b35c212987090bd17593e8b84
3
  size 598436708
tokenizer.json CHANGED
@@ -1,11 +1,6 @@
1
  {
2
  "version": "1.0",
3
- "truncation": {
4
- "direction": "Right",
5
- "max_length": 512,
6
- "strategy": "LongestFirst",
7
- "stride": 0
8
- },
9
  "padding": null,
10
  "added_tokens": [
11
  {
 
1
  {
2
  "version": "1.0",
3
+ "truncation": null,
 
 
 
 
 
4
  "padding": null,
5
  "added_tokens": [
6
  {
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b05515bc301f3559c8f36e2c4adb77a63a0f061c838426684c2c41e01c6e0a8d
3
  size 5368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76036464971171a65720841d1473c71f261c52dba59e7098ec5db7be612a6cb2
3
  size 5368