scenario-kd-scr-ner-full-mdeberta_data-univner_full55

This model is a fine-tuned version of haryoaw/scenario-TCR-NER_data-univner_full. The training dataset is not specified in the card metadata (it is recorded as None). It achieves the following results on the evaluation set:

Loss: 182.5497
Precision: 0.6804
Recall: 0.6154
F1: 0.6463
Accuracy: 0.9657
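
As a quick consistency check, the reported F1 can be recomputed from the reported precision and recall. The sketch below assumes F1 is the usual harmonic mean of the two; the helper name `f1_score` is introduced here for illustration and is not part of the card.

```python
# Reported evaluation metrics, copied from the list above.
precision = 0.6804
recall = 0.6154
reported_f1 = 0.6463


def f1_score(p: float, r: float) -> float:
    """Harmonic mean of precision and recall; returns 0.0 when both are zero."""
    if p + r == 0:
        return 0.0
    return 2 * p * r / (p + r)


computed_f1 = f1_score(precision, recall)
print(f"computed F1 = {computed_f1:.4f}, reported F1 = {reported_f1:.4f}")
```

The two values agree to four decimal places. Accuracy is much higher than F1 here, and the first logged evaluation in the training results shows 0.9241 accuracy with zero precision and recall, so accuracy appears to be dominated by the majority (non-entity) label and F1 is likely the more informative number.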

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 3e-05
train_batch_size: 8
eval_batch_size: 32
seed: 55
gradient_accumulation_steps: 4
total_train_batch_size: 32
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 10
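
The relationship between the batch-size entries, and the shape of a linear schedule, can be sketched in a few lines. This is an illustration under stated assumptions, not the training script: it assumes single-device training and zero warmup steps (the card states neither), and `linear_lr` and `example_total_steps` are hypothetical names chosen for this example.

```python
# Values copied from the hyperparameter list above.
learning_rate = 3e-05
train_batch_size = 8
gradient_accumulation_steps = 4

# Assumption: one device, so the effective batch size is the per-step
# batch size multiplied by the number of accumulation steps.
total_train_batch_size = train_batch_size * gradient_accumulation_steps


def linear_lr(step: int, total_steps: int, base_lr: float = learning_rate) -> float:
    """Linear decay from base_lr to 0 over total_steps, with no warmup (assumed)."""
    remaining = max(0, total_steps - step)
    return base_lr * remaining / total_steps


# Hypothetical schedule length, used only to show the shape of the decay.
example_total_steps = 1000
print("effective batch size:", total_train_batch_size)
for step in (0, 500, 1000):
    print(step, linear_lr(step, example_total_steps))
```

With these values the effective batch size works out to 32, matching the listed total_train_batch_size.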

Training results

Training Loss	Epoch	Step	Validation Loss	Precision	Recall	F1	Accuracy
627.6279	0.2911	500	560.3030	0.0	0.0	0.0	0.9241
529.0723	0.5822	1000	503.1468	0.3145	0.0378	0.0675	0.9253
481.2237	0.8732	1500	462.0725	0.3110	0.0811	0.1286	0.9284
443.6023	1.1643	2000	431.7476	0.4241	0.0822	0.1378	0.9304
413.7332	1.4554	2500	404.1758	0.4897	0.3199	0.3870	0.9448
389.6206	1.7465	3000	381.5862	0.5329	0.3800	0.4437	0.9494
368.3142	2.0375	3500	363.5359	0.5888	0.3769	0.4596	0.9507
349.4665	2.3286	4000	346.6397	0.5410	0.4793	0.5083	0.9539
333.3893	2.6197	4500	331.5422	0.6223	0.4291	0.5079	0.9550
318.8641	2.9108	5000	316.8669	0.5984	0.5612	0.5792	0.9597
303.8825	3.2019	5500	303.3675	0.6190	0.5569	0.5863	0.9608
290.8802	3.4929	6000	291.3924	0.6347	0.5390	0.5830	0.9606
279.9562	3.7840	6500	281.3740	0.6484	0.5403	0.5894	0.9613
268.8530	4.0751	7000	270.4638	0.6513	0.5578	0.6009	0.9615
257.9733	4.3662	7500	260.5476	0.6536	0.5817	0.6156	0.9635
248.9305	4.6573	8000	251.8452	0.6631	0.5926	0.6258	0.9638
240.7242	4.9483	8500	243.8925	0.6587	0.5882	0.6215	0.9633
232.3709	5.2394	9000	236.3189	0.6514	0.6077	0.6288	0.9640
224.6698	5.5305	9500	229.3991	0.6675	0.5722	0.6162	0.9629
218.3664	5.8216	10000	223.3077	0.6788	0.5823	0.6269	0.9639
212.9249	6.1126	10500	217.2704	0.6717	0.6003	0.6340	0.9643
206.6058	6.4037	11000	211.8754	0.6570	0.6226	0.6393	0.9649
201.7220	6.6948	11500	207.1151	0.6680	0.6210	0.6436	0.9650
197.0340	6.9859	12000	202.9470	0.6805	0.6047	0.6403	0.9649
192.5555	7.2770	12500	199.1373	0.6749	0.6130	0.6425	0.9651
189.1607	7.5680	13000	195.9332	0.6605	0.6279	0.6438	0.9652
186.1884	7.8591	13500	193.1577	0.6772	0.6057	0.6395	0.9652
183.2947	8.1502	14000	190.2176	0.6697	0.6318	0.6502	0.9654
180.5764	8.4413	14500	187.9859	0.6970	0.6091	0.6501	0.9657
178.5341	8.7324	15000	186.4189	0.6843	0.5976	0.6380	0.9645
176.7900	9.0234	15500	184.5720	0.6846	0.6198	0.6506	0.9661
175.5280	9.3145	16000	183.8221	0.7059	0.5905	0.6431	0.9650
174.3179	9.6056	16500	182.7365	0.6842	0.6188	0.6498	0.9658
174.2736	9.8967	17000	182.5497	0.6804	0.6154	0.6463	0.9657
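
The tail of the table can be inspected programmatically to compare checkpoints. The sketch below copies the last seven rows (step, epoch, validation loss, F1) and picks the best row by each criterion; the variable names are illustrative, and the steps-per-epoch figure is only an estimate derived from the logged epoch fractions.

```python
# (step, epoch, validation_loss, f1) for the last seven evaluations in the table above.
rows = [
    (14000, 8.1502, 190.2176, 0.6502),
    (14500, 8.4413, 187.9859, 0.6501),
    (15000, 8.7324, 186.4189, 0.6380),
    (15500, 9.0234, 184.5720, 0.6506),
    (16000, 9.3145, 183.8221, 0.6431),
    (16500, 9.6056, 182.7365, 0.6498),
    (17000, 9.8967, 182.5497, 0.6463),
]

best_by_f1 = max(rows, key=lambda r: r[3])    # highest F1
best_by_loss = min(rows, key=lambda r: r[2])  # lowest validation loss

# Rough estimate of optimizer steps per epoch from the final logged row.
final_step, final_epoch = rows[-1][0], rows[-1][1]
steps_per_epoch = final_step / final_epoch

print("best F1:", best_by_f1)
print("lowest validation loss:", best_by_loss)
print(f"estimated steps per epoch: {steps_per_epoch:.1f}")
```

Validation loss is lowest at the final logged step (17000), which is the checkpoint reported at the top of the card, while the highest F1 in the table (0.6506) occurs at step 15500. The difference is small, so it may be within evaluation noise.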

Framework versions

Transformers 4.44.2
PyTorch 2.1.1+cu121
Datasets 2.14.5
Tokenizers 0.19.1
