End of training

Browse files

Files changed (4) hide show

README.md +77 -0
model.safetensors +1 -1
runs/Mar19_12-00-20_9710328935b9/events.out.tfevents.1710849644.9710328935b9.451.11 +2 -2
runs/Mar19_12-00-20_9710328935b9/events.out.tfevents.1710849949.9710328935b9.451.12 +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,77 @@

+---
+license: cc-by-4.0
+base_model: vesteinn/DanskBERT
+tags:
+- generated_from_trainer
+model-index:
+- name: MeMo_BERT-WSD-DanskBERT
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# MeMo_BERT-WSD-DanskBERT
+This model is a fine-tuned version of [vesteinn/DanskBERT](https://huggingface.co/vesteinn/DanskBERT) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.7755
+- F1-score: 0.5209
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 20
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1-score |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|
+| No log        | 1.0   | 61   | 1.4766          | 0.1229   |
+| No log        | 2.0   | 122  | 1.4366          | 0.1229   |
+| No log        | 3.0   | 183  | 1.3636          | 0.2462   |
+| No log        | 4.0   | 244  | 1.2889          | 0.3692   |
+| No log        | 5.0   | 305  | 1.4150          | 0.3786   |
+| No log        | 6.0   | 366  | 1.5581          | 0.3409   |
+| No log        | 7.0   | 427  | 1.6512          | 0.4664   |
+| No log        | 8.0   | 488  | 1.7405          | 0.4661   |
+| 0.9424        | 9.0   | 549  | 1.7755          | 0.5209   |
+| 0.9424        | 10.0  | 610  | 2.4738          | 0.4351   |
+| 0.9424        | 11.0  | 671  | 2.4721          | 0.4858   |
+| 0.9424        | 12.0  | 732  | 2.9449          | 0.4491   |
+| 0.9424        | 13.0  | 793  | 2.8346          | 0.4528   |
+| 0.9424        | 14.0  | 854  | 3.0715          | 0.4845   |
+| 0.9424        | 15.0  | 915  | 3.1416          | 0.4520   |
+| 0.9424        | 16.0  | 976  | 3.0893          | 0.5197   |
+| 0.1197        | 17.0  | 1037 | 3.1668          | 0.4764   |
+| 0.1197        | 18.0  | 1098 | 3.2142          | 0.4656   |
+| 0.1197        | 19.0  | 1159 | 3.2174          | 0.5087   |
+| 0.1197        | 20.0  | 1220 | 3.2239          | 0.5087   |
+### Framework versions
+- Transformers 4.38.2
+- Pytorch 2.2.1+cu121
+- Datasets 2.18.0
+- Tokenizers 0.15.2

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:66bb9f37f5f2cbe455f09da0134adf6036abb1366f73d964d890fd5e399c7b3e
 size 497820256

 version https://git-lfs.github.com/spec/v1
+oid sha256:8336976be9469536b7e4769fd7d8b2e12a32d954e3252c3a9070d5c9e35d7c51
 size 497820256

runs/Mar19_12-00-20_9710328935b9/events.out.tfevents.1710849644.9710328935b9.451.11 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4b15a79f231b3a3248d2d40e969db0eb700f43af813659dc361a6f79a054ba5b
-size 11494

 version https://git-lfs.github.com/spec/v1
+oid sha256:97840f6add9b6cd77743103eed4a9e6278064e269c5e03c4cbda74ee2a10997f
+size 12171

runs/Mar19_12-00-20_9710328935b9/events.out.tfevents.1710849949.9710328935b9.451.12 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9e0eac390c9fbb5065cd5a7f1b73bd3ad41ca2f9f5126708bd5a02b2ad30dac8
+size 411