AraT5v2-base-1024-p-l-akk-en-20240811-231511

This model was trained from scratch; the training dataset was not recorded (the card lists it as None). It achieves the following results on the evaluation set:

  • Loss: 0.4597 (the validation loss at the final logged step, 452500)

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 4e-05
  • train_batch_size: 1
  • eval_batch_size: 1
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 10
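
As a rough illustration of what lr_scheduler_type: linear implies for the learning rate over the run, the sketch below computes a linearly decaying rate in plain Python. It is a minimal sketch and not the training code itself: the function name linear_lr is hypothetical, zero warmup is assumed because no warmup setting is listed, and the total step count is approximated by the last step logged in the training results (452500).

```python
# Hyperparameter copied from the list above.
BASE_LR = 4e-05

# Assumption: total optimizer steps, approximated by the last logged step.
TOTAL_STEPS = 452_500


def linear_lr(step, total_steps=TOTAL_STEPS, base_lr=BASE_LR, warmup_steps=0):
    """Learning rate at `step` under linear warmup followed by linear decay to zero.

    warmup_steps=0 is an assumption; the card does not list a warmup setting.
    """
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Print the rate at the start, at an intermediate step, and at the end.
for step in (0, 130_000, 452_500):
    print(step, linear_lr(step))
```

Under these assumptions the rate starts at 4e-05 and reaches zero at the final step, so later epochs are trained with progressively smaller updates.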

Training results

Training Loss    Epoch    Step    Validation Loss
0.364 0.0552 2500 0.4332
0.3744 0.1105 5000 0.4408
0.3825 0.1657 7500 0.4430
0.384 0.2210 10000 0.4377
0.3751 0.2762 12500 0.4421
0.4055 0.3314 15000 0.4372
0.355 0.3867 17500 0.4352
0.3871 0.4419 20000 0.4376
0.4159 0.4972 22500 0.4335
0.3782 0.5524 25000 0.4295
0.384 0.6077 27500 0.4305
0.3782 0.6629 30000 0.4311
0.3708 0.7181 32500 0.4332
0.3809 0.7734 35000 0.4263
0.3964 0.8286 37500 0.4280
0.3832 0.8839 40000 0.4253
0.4052 0.9391 42500 0.4320
0.4015 0.9943 45000 0.4261
0.352 1.0496 47500 0.4307
0.3456 1.1048 50000 0.4318
0.3726 1.1601 52500 0.4366
0.323 1.2153 55000 0.4357
0.3565 1.2705 57500 0.4285
0.3679 1.3258 60000 0.4329
0.3921 1.3810 62500 0.4257
0.3587 1.4363 65000 0.4248
0.3502 1.4915 67500 0.4283
0.3768 1.5468 70000 0.4283
0.3461 1.6020 72500 0.4226
0.3524 1.6572 75000 0.4238
0.3838 1.7125 77500 0.4220
0.3849 1.7677 80000 0.4213
0.3731 1.8230 82500 0.4184
0.3722 1.8782 85000 0.4212
0.3762 1.9334 87500 0.4179
0.3737 1.9887 90000 0.4229
0.3311 2.0439 92500 0.4277
0.3308 2.0992 95000 0.4245
0.3454 2.1544 97500 0.4258
0.2972 2.2097 100000 0.4362
0.3284 2.2649 102500 0.4290
0.3774 2.3201 105000 0.4302
0.3287 2.3754 107500 0.4250
0.3281 2.4306 110000 0.4219
0.3312 2.4859 112500 0.4249
0.3651 2.5411 115000 0.4222
0.3639 2.5963 117500 0.4243
0.3602 2.6516 120000 0.4187
0.3222 2.7068 122500 0.4256
0.3474 2.7621 125000 0.4204
0.3317 2.8173 127500 0.4246
0.3616 2.8725 130000 0.4148
0.3729 2.9278 132500 0.4191
0.352 2.9830 135000 0.4184
0.2849 3.0383 137500 0.4272
0.3148 3.0935 140000 0.4285
0.3032 3.1488 142500 0.4324
0.3306 3.2040 145000 0.4238
0.3377 3.2592 147500 0.4264
0.3373 3.3145 150000 0.4254
0.3079 3.3697 152500 0.4267
0.3165 3.4250 155000 0.4239
0.3469 3.4802 157500 0.4225
0.3102 3.5354 160000 0.4194
0.3231 3.5907 162500 0.4199
0.3383 3.6459 165000 0.4210
0.3048 3.7012 167500 0.4188
0.3222 3.7564 170000 0.4206
0.3505 3.8116 172500 0.4202
0.3209 3.8669 175000 0.4172
0.3146 3.9221 177500 0.4197
0.3237 3.9774 180000 0.4204
0.3087 4.0326 182500 0.4298
0.2979 4.0879 185000 0.4278
0.3046 4.1431 187500 0.4203
0.3145 4.1983 190000 0.4273
0.3511 4.2536 192500 0.4282
0.3845 4.3088 195000 0.4255
0.2889 4.3641 197500 0.4261
0.2764 4.4193 200000 0.4269
0.3089 4.4745 202500 0.4280
0.2928 4.5298 205000 0.4216
0.2982 4.5850 207500 0.4294
0.3008 4.6403 210000 0.4240
0.2997 4.6955 212500 0.4239
0.2964 4.7508 215000 0.4215
0.2822 4.8060 217500 0.4214
0.3216 4.8612 220000 0.4219
0.2873 4.9165 222500 0.4197
0.314 4.9717 225000 0.4214
0.3212 5.0270 227500 0.4292
0.2883 5.0822 230000 0.4333
0.2828 5.1374 232500 0.4341
0.2498 5.1927 235000 0.4357
0.2823 5.2479 237500 0.4289
0.2775 5.3032 240000 0.4352
0.3022 5.3584 242500 0.4329
0.269 5.4136 245000 0.4336
0.2769 5.4689 247500 0.4291
0.2627 5.5241 250000 0.4328
0.2632 5.5794 252500 0.4298
0.2856 5.6346 255000 0.4338
0.3124 5.6899 257500 0.4288
0.2662 5.7451 260000 0.4280
0.2849 5.8003 262500 0.4303
0.2972 5.8556 265000 0.4253
0.2866 5.9108 267500 0.4252
0.2689 5.9661 270000 0.4204
0.2459 6.0213 272500 0.4355
0.281 6.0765 275000 0.4386
0.29 6.1318 277500 0.4396
0.2587 6.1870 280000 0.4383
0.2892 6.2423 282500 0.4393
0.2761 6.2975 285000 0.4393
0.2796 6.3527 287500 0.4378
0.2586 6.4080 290000 0.4330
0.2397 6.4632 292500 0.4412
0.2823 6.5185 295000 0.4306
0.2903 6.5737 297500 0.4351
0.2675 6.6290 300000 0.4369
0.2949 6.6842 302500 0.4438
0.284 6.7394 305000 0.4361
0.2794 6.7947 307500 0.4304
0.2475 6.8499 310000 0.4399
0.2804 6.9052 312500 0.4317
0.2634 6.9604 315000 0.4359
0.2447 7.0156 317500 0.4418
0.2582 7.0709 320000 0.4471
0.2468 7.1261 322500 0.4492
0.2584 7.1814 325000 0.4436
0.2619 7.2366 327500 0.4444
0.2273 7.2919 330000 0.4458
0.2385 7.3471 332500 0.4434
0.2324 7.4023 335000 0.4470
0.2475 7.4576 337500 0.4475
0.2591 7.5128 340000 0.4456
0.2565 7.5681 342500 0.4451
0.2258 7.6233 345000 0.4424
0.2253 7.6785 347500 0.4444
0.2418 7.7338 350000 0.4470
0.2608 7.7890 352500 0.4465
0.2497 7.8443 355000 0.4472
0.2516 7.8995 357500 0.4446
0.2423 7.9547 360000 0.4426
0.2711 8.0100 362500 0.4470
0.2386 8.0652 365000 0.4530
0.2317 8.1205 367500 0.4550
0.243 8.1757 370000 0.4560
0.2273 8.2310 372500 0.4523
0.2463 8.2862 375000 0.4534
0.2435 8.3414 377500 0.4520
0.2805 8.3967 380000 0.4541
0.2437 8.4519 382500 0.4548
0.2583 8.5072 385000 0.4531
0.2241 8.5624 387500 0.4502
0.2531 8.6176 390000 0.4551
0.2393 8.6729 392500 0.4524
0.2506 8.7281 395000 0.4525
0.2222 8.7834 397500 0.4533
0.251 8.8386 400000 0.4518
0.2331 8.8938 402500 0.4555
0.2312 8.9491 405000 0.4507
0.2399 9.0043 407500 0.4557
0.2267 9.0596 410000 0.4574
0.2336 9.1148 412500 0.4580
0.263 9.1701 415000 0.4567
0.2207 9.2253 417500 0.4589
0.2457 9.2805 420000 0.4624
0.2577 9.3358 422500 0.4583
0.19 9.3910 425000 0.4600
0.2513 9.4463 427500 0.4575
0.2647 9.5015 430000 0.4587
0.2704 9.5567 432500 0.4577
0.2397 9.6120 435000 0.4592
0.2436 9.6672 437500 0.4601
0.2595 9.7225 440000 0.4591
0.2617 9.7777 442500 0.4595
0.231 9.8330 445000 0.4604
0.2375 9.8882 447500 0.4594
0.2295 9.9434 450000 0.4597
0.2289 9.9987 452500 0.4597
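
The log above is easier to read with a small helper that picks out the step with the lowest validation loss. The following is a minimal sketch, assuming the rows are available as whitespace-separated text in the column order shown; the names LogRow and parse_rows are hypothetical, and SAMPLE contains only five rows copied from the table, not the full log.

```python
from typing import List, NamedTuple


class LogRow(NamedTuple):
    train_loss: float
    epoch: float
    step: int
    val_loss: float


def parse_rows(text: str) -> List[LogRow]:
    """Parse 'train_loss epoch step val_loss' lines, skipping anything else."""
    rows = []
    for line in text.strip().splitlines():
        parts = line.split()
        if len(parts) != 4:
            continue  # e.g. the header line, which has more than four words
        try:
            rows.append(
                LogRow(float(parts[0]), float(parts[1]), int(parts[2]), float(parts[3]))
            )
        except ValueError:
            continue  # not a numeric row
    return rows


# Five rows copied verbatim from the table above (a sample, not the full log).
SAMPLE = """
Training Loss Epoch Step Validation Loss
0.364 0.0552 2500 0.4332
0.3616 2.8725 130000 0.4148
0.3209 3.8669 175000 0.4172
0.2689 5.9661 270000 0.4204
0.2289 9.9987 452500 0.4597
"""

rows = parse_rows(SAMPLE)
best = min(rows, key=lambda r: r.val_loss)  # row with the lowest validation loss
last = rows[-1]
print("best step:", best.step, "validation loss:", best.val_loss)
print("final gap (validation - training):", round(last.val_loss - last.train_loss, 4))
```

In the full table the lowest validation loss is 0.4148 at step 130000 (epoch 2.8725). From roughly epoch 5 onward the training loss keeps falling while the validation loss rises to 0.4597, a pattern that usually indicates overfitting.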

Framework versions

  • Transformers 4.44.0.dev0
  • Pytorch 2.5.0.dev20240625
  • Datasets 2.20.0
  • Tokenizers 0.19.1
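
To check a local environment against the versions listed above, a comparison along the following lines may help. It is a sketch under assumptions: compare_versions is a hypothetical helper, and the distribution names (transformers, torch, datasets, tokenizers) are assumed to be the usual PyPI names for these libraries. Only the standard-library importlib.metadata module is used.

```python
from importlib import metadata

# Versions copied from the list above; the distribution names are assumptions.
CARD_VERSIONS = {
    "transformers": "4.44.0.dev0",
    "torch": "2.5.0.dev20240625",
    "datasets": "2.20.0",
    "tokenizers": "0.19.1",
}


def compare_versions(expected):
    """Return {name: (wanted, installed_or_None, matches)} for each package."""
    report = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed in this environment
        # Ignore a local build suffix such as "+cu121" when comparing.
        matches = installed is not None and installed.split("+")[0] == wanted
        report[name] = (wanted, installed, matches)
    return report


for name, (wanted, installed, ok) in compare_versions(CARD_VERSIONS).items():
    status = "ok" if ok else "differs"
    print(f"{name}: card={wanted} installed={installed} ({status})")
```

The Transformers and Pytorch versions listed are development builds, so an exact match may not be installable from a regular package index; a nearby stable release is a reasonable substitute to try, though results are not guaranteed to be identical.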

Model size

  • Parameters: 394M
  • Tensor type: F32
  • Weights format: Safetensors