GQA_RoBERTa_legal_SQuAD_complete_augmented_1000

This model was trained from scratch on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.2040

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 128
  • eval_batch_size: 32
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 1000

Training results

Training Loss Epoch Step Validation Loss
No log 1.0 4 3.7757
No log 2.0 8 3.1210
No log 3.0 12 2.7424
No log 4.0 16 2.3990
No log 5.0 20 2.0583
No log 6.0 24 1.9699
No log 7.0 28 1.6942
No log 8.0 32 1.5022
No log 9.0 36 1.4585
No log 10.0 40 1.1937
No log 11.0 44 1.1496
No log 12.0 48 0.9856
No log 13.0 52 0.9389
No log 14.0 56 0.9621
No log 15.0 60 0.8580
No log 16.0 64 0.8093
No log 17.0 68 0.7783
No log 18.0 72 0.7656
No log 19.0 76 0.7793
No log 20.0 80 0.7327
No log 21.0 84 0.7109
No log 22.0 88 0.7120
No log 23.0 92 0.7099
No log 24.0 96 0.7191
No log 25.0 100 0.7350
No log 26.0 104 0.7634
No log 27.0 108 0.7498
No log 28.0 112 0.7353
No log 29.0 116 0.7319
No log 30.0 120 0.7603
No log 31.0 124 0.7701
No log 32.0 128 0.7818
No log 33.0 132 0.7904
No log 34.0 136 0.7580
No log 35.0 140 0.7640
No log 36.0 144 0.7558
No log 37.0 148 0.7470
No log 38.0 152 0.7730
No log 39.0 156 0.7450
No log 40.0 160 0.7516
No log 41.0 164 0.7475
No log 42.0 168 0.7306
No log 43.0 172 0.7488
No log 44.0 176 0.7604
No log 45.0 180 0.8035
No log 46.0 184 0.7837
No log 47.0 188 0.7307
No log 48.0 192 0.6987
No log 49.0 196 0.7281
No log 50.0 200 0.7453
No log 51.0 204 0.7811
No log 52.0 208 0.7951
No log 53.0 212 0.7833
No log 54.0 216 0.7961
No log 55.0 220 0.8255
No log 56.0 224 0.8038
No log 57.0 228 0.8384
No log 58.0 232 0.8412
No log 59.0 236 0.8206
No log 60.0 240 0.8224
No log 61.0 244 0.8638
No log 62.0 248 0.9014
No log 63.0 252 0.9255
No log 64.0 256 0.9019
No log 65.0 260 0.8741
No log 66.0 264 0.8442
No log 67.0 268 0.8526
No log 68.0 272 0.8702
No log 69.0 276 0.9321
No log 70.0 280 0.9450
No log 71.0 284 0.8868
No log 72.0 288 0.8622
No log 73.0 292 0.8586
No log 74.0 296 0.8935
No log 75.0 300 0.9010
No log 76.0 304 0.8703
No log 77.0 308 0.8726
No log 78.0 312 0.9113
No log 79.0 316 0.9175
No log 80.0 320 0.9173
No log 81.0 324 0.9550
No log 82.0 328 0.9649
No log 83.0 332 0.9917
No log 84.0 336 0.9783
No log 85.0 340 0.9558
No log 86.0 344 0.9425
No log 87.0 348 0.9323
No log 88.0 352 0.9471
No log 89.0 356 0.9749
No log 90.0 360 0.9638
No log 91.0 364 0.9881
No log 92.0 368 0.9697
No log 93.0 372 0.9189
No log 94.0 376 0.9036
No log 95.0 380 0.8745
No log 96.0 384 0.8811
No log 97.0 388 0.8967
No log 98.0 392 0.9032
No log 99.0 396 0.9201
No log 100.0 400 0.9524
No log 101.0 404 0.9983
No log 102.0 408 0.9742
No log 103.0 412 0.9834
No log 104.0 416 0.9480
No log 105.0 420 0.9367
No log 106.0 424 0.9340
No log 107.0 428 0.9454
No log 108.0 432 0.9553
No log 109.0 436 0.9694
No log 110.0 440 0.9696
No log 111.0 444 0.9280
No log 112.0 448 0.9166
No log 113.0 452 0.9406
No log 114.0 456 0.9372
No log 115.0 460 0.9147
No log 116.0 464 0.9267
No log 117.0 468 0.9665
No log 118.0 472 1.0231
No log 119.0 476 1.0291
No log 120.0 480 0.9973
No log 121.0 484 0.9516
No log 122.0 488 0.9134
No log 123.0 492 0.8852
No log 124.0 496 0.8535
0.9595 125.0 500 0.9003
0.9595 126.0 504 0.9523
0.9595 127.0 508 0.9925
0.9595 128.0 512 0.9736
0.9595 129.0 516 0.9584
0.9595 130.0 520 0.9625
0.9595 131.0 524 0.9533
0.9595 132.0 528 0.9774
0.9595 133.0 532 0.9898
0.9595 134.0 536 0.9657
0.9595 135.0 540 0.9627
0.9595 136.0 544 1.0049
0.9595 137.0 548 1.0241
0.9595 138.0 552 1.0184
0.9595 139.0 556 1.0387
0.9595 140.0 560 1.0528
0.9595 141.0 564 1.0510
0.9595 142.0 568 1.0153
0.9595 143.0 572 0.9628
0.9595 144.0 576 0.9999
0.9595 145.0 580 1.0139
0.9595 146.0 584 1.0149
0.9595 147.0 588 1.0016
0.9595 148.0 592 0.9516
0.9595 149.0 596 0.9290
0.9595 150.0 600 0.9084
0.9595 151.0 604 0.8736
0.9595 152.0 608 0.8832
0.9595 153.0 612 0.9093
0.9595 154.0 616 0.9489
0.9595 155.0 620 0.9548
0.9595 156.0 624 0.8944
0.9595 157.0 628 0.8681
0.9595 158.0 632 0.8733
0.9595 159.0 636 0.8852
0.9595 160.0 640 0.9133
0.9595 161.0 644 0.8900
0.9595 162.0 648 0.8863
0.9595 163.0 652 0.8928
0.9595 164.0 656 0.8959
0.9595 165.0 660 0.9163
0.9595 166.0 664 0.9739
0.9595 167.0 668 1.0204
0.9595 168.0 672 1.0059
0.9595 169.0 676 0.9578
0.9595 170.0 680 0.9313
0.9595 171.0 684 0.9084
0.9595 172.0 688 0.9836
0.9595 173.0 692 1.0601
0.9595 174.0 696 1.0884
0.9595 175.0 700 1.0779
0.9595 176.0 704 1.0599
0.9595 177.0 708 1.0422
0.9595 178.0 712 1.0271
0.9595 179.0 716 1.0100
0.9595 180.0 720 0.9945
0.9595 181.0 724 1.0018
0.9595 182.0 728 1.0234
0.9595 183.0 732 1.0380
0.9595 184.0 736 1.0525
0.9595 185.0 740 1.0420
0.9595 186.0 744 1.0325
0.9595 187.0 748 1.0125
0.9595 188.0 752 0.9891
0.9595 189.0 756 0.9515
0.9595 190.0 760 0.9495
0.9595 191.0 764 0.9642
0.9595 192.0 768 0.9876
0.9595 193.0 772 0.9985
0.9595 194.0 776 1.0227
0.9595 195.0 780 1.0730
0.9595 196.0 784 1.0871
0.9595 197.0 788 1.0918
0.9595 198.0 792 1.1092
0.9595 199.0 796 1.0989
0.9595 200.0 800 1.0992
0.9595 201.0 804 1.1034
0.9595 202.0 808 1.0881
0.9595 203.0 812 1.0707
0.9595 204.0 816 1.0777
0.9595 205.0 820 1.0758
0.9595 206.0 824 1.0684
0.9595 207.0 828 1.0629
0.9595 208.0 832 1.0659
0.9595 209.0 836 1.0585
0.9595 210.0 840 1.0132
0.9595 211.0 844 0.9791
0.9595 212.0 848 0.9761
0.9595 213.0 852 1.0348
0.9595 214.0 856 1.0910
0.9595 215.0 860 1.1354
0.9595 216.0 864 1.1348
0.9595 217.0 868 1.0884
0.9595 218.0 872 1.0430
0.9595 219.0 876 1.0202
0.9595 220.0 880 1.0097
0.9595 221.0 884 1.0151
0.9595 222.0 888 1.0096
0.9595 223.0 892 1.0302
0.9595 224.0 896 1.0635
0.9595 225.0 900 1.0611
0.9595 226.0 904 1.0548
0.9595 227.0 908 1.1173
0.9595 228.0 912 1.1561
0.9595 229.0 916 1.1550
0.9595 230.0 920 1.0254
0.9595 231.0 924 0.9364
0.9595 232.0 928 0.9316
0.9595 233.0 932 0.9717
0.9595 234.0 936 1.0406
0.9595 235.0 940 1.0643
0.9595 236.0 944 1.1092
0.9595 237.0 948 1.1197
0.9595 238.0 952 1.1270
0.9595 239.0 956 1.1300
0.9595 240.0 960 1.0921
0.9595 241.0 964 1.0446
0.9595 242.0 968 1.0234
0.9595 243.0 972 1.0067
0.9595 244.0 976 1.0324
0.9595 245.0 980 1.0434
0.9595 246.0 984 1.0502
0.9595 247.0 988 1.0618
0.9595 248.0 992 1.1352
0.9595 249.0 996 1.1672
0.4061 250.0 1000 1.1700
0.4061 251.0 1004 1.1416
0.4061 252.0 1008 1.1198
0.4061 253.0 1012 1.1226
0.4061 254.0 1016 1.1220
0.4061 255.0 1020 1.1317
0.4061 256.0 1024 1.1390
0.4061 257.0 1028 1.1069
0.4061 258.0 1032 1.0700
0.4061 259.0 1036 1.0657
0.4061 260.0 1040 1.0839
0.4061 261.0 1044 1.1030
0.4061 262.0 1048 1.1005
0.4061 263.0 1052 1.0882
0.4061 264.0 1056 1.0740
0.4061 265.0 1060 1.0710
0.4061 266.0 1064 1.0775
0.4061 267.0 1068 1.0908
0.4061 268.0 1072 1.1077
0.4061 269.0 1076 1.1204
0.4061 270.0 1080 1.1259
0.4061 271.0 1084 1.1208
0.4061 272.0 1088 1.1004
0.4061 273.0 1092 1.0761
0.4061 274.0 1096 1.0683
0.4061 275.0 1100 1.0663
0.4061 276.0 1104 1.0627
0.4061 277.0 1108 1.1069
0.4061 278.0 1112 1.1032
0.4061 279.0 1116 1.0401
0.4061 280.0 1120 1.0408
0.4061 281.0 1124 1.1004
0.4061 282.0 1128 1.1623
0.4061 283.0 1132 1.1512
0.4061 284.0 1136 1.1242
0.4061 285.0 1140 1.0919
0.4061 286.0 1144 1.0818
0.4061 287.0 1148 1.0703
0.4061 288.0 1152 1.0501
0.4061 289.0 1156 1.0347
0.4061 290.0 1160 1.0299
0.4061 291.0 1164 1.0641
0.4061 292.0 1168 1.0679
0.4061 293.0 1172 1.0680
0.4061 294.0 1176 1.1041
0.4061 295.0 1180 1.1802
0.4061 296.0 1184 1.1971
0.4061 297.0 1188 1.1793
0.4061 298.0 1192 1.1459
0.4061 299.0 1196 1.1035
0.4061 300.0 1200 1.0577
0.4061 301.0 1204 1.0544
0.4061 302.0 1208 1.0737
0.4061 303.0 1212 1.0819
0.4061 304.0 1216 1.0899
0.4061 305.0 1220 1.0885
0.4061 306.0 1224 1.0755
0.4061 307.0 1228 1.0139
0.4061 308.0 1232 0.9849
0.4061 309.0 1236 0.9781
0.4061 310.0 1240 0.9953
0.4061 311.0 1244 1.0138
0.4061 312.0 1248 1.0119
0.4061 313.0 1252 1.0704
0.4061 314.0 1256 1.1161
0.4061 315.0 1260 1.1500
0.4061 316.0 1264 1.1862
0.4061 317.0 1268 1.1833
0.4061 318.0 1272 1.1706
0.4061 319.0 1276 1.1517
0.4061 320.0 1280 1.1309
0.4061 321.0 1284 1.0936
0.4061 322.0 1288 1.0957
0.4061 323.0 1292 1.1080
0.4061 324.0 1296 1.1087
0.4061 325.0 1300 1.1314
0.4061 326.0 1304 1.1757
0.4061 327.0 1308 1.1896
0.4061 328.0 1312 1.1742
0.4061 329.0 1316 1.1661
0.4061 330.0 1320 1.1675
0.4061 331.0 1324 1.1691
0.4061 332.0 1328 1.1715
0.4061 333.0 1332 1.1513
0.4061 334.0 1336 1.1347
0.4061 335.0 1340 1.1386
0.4061 336.0 1344 1.1587
0.4061 337.0 1348 1.1739
0.4061 338.0 1352 1.1790
0.4061 339.0 1356 1.1615
0.4061 340.0 1360 1.1484
0.4061 341.0 1364 1.1376
0.4061 342.0 1368 1.1258
0.4061 343.0 1372 1.1142
0.4061 344.0 1376 1.1062
0.4061 345.0 1380 1.0986
0.4061 346.0 1384 1.0905
0.4061 347.0 1388 1.0776
0.4061 348.0 1392 1.0687
0.4061 349.0 1396 1.0865
0.4061 350.0 1400 1.0822
0.4061 351.0 1404 1.0831
0.4061 352.0 1408 1.0914
0.4061 353.0 1412 1.1018
0.4061 354.0 1416 1.1078
0.4061 355.0 1420 1.1190
0.4061 356.0 1424 1.1374
0.4061 357.0 1428 1.1534
0.4061 358.0 1432 1.2011
0.4061 359.0 1436 1.2166
0.4061 360.0 1440 1.2168
0.4061 361.0 1444 1.2144
0.4061 362.0 1448 1.1989
0.4061 363.0 1452 1.1832
0.4061 364.0 1456 1.1531
0.4061 365.0 1460 1.1422
0.4061 366.0 1464 1.1279
0.4061 367.0 1468 1.1210
0.4061 368.0 1472 1.1114
0.4061 369.0 1476 1.1034
0.4061 370.0 1480 1.0998
0.4061 371.0 1484 1.1009
0.4061 372.0 1488 1.1048
0.4061 373.0 1492 1.1002
0.4061 374.0 1496 1.0920
0.4027 375.0 1500 1.0851
0.4027 376.0 1504 1.0787
0.4027 377.0 1508 1.0733
0.4027 378.0 1512 1.0695
0.4027 379.0 1516 1.0686
0.4027 380.0 1520 1.0687
0.4027 381.0 1524 1.0757
0.4027 382.0 1528 1.1245
0.4027 383.0 1532 1.1659
0.4027 384.0 1536 1.1729
0.4027 385.0 1540 1.1401
0.4027 386.0 1544 1.1316
0.4027 387.0 1548 1.1445
0.4027 388.0 1552 1.1504
0.4027 389.0 1556 1.1461
0.4027 390.0 1560 1.1450
0.4027 391.0 1564 1.1428
0.4027 392.0 1568 1.1392
0.4027 393.0 1572 1.1304
0.4027 394.0 1576 1.1038
0.4027 395.0 1580 1.0931
0.4027 396.0 1584 1.0837
0.4027 397.0 1588 1.0824
0.4027 398.0 1592 1.0808
0.4027 399.0 1596 1.0819
0.4027 400.0 1600 1.0794
0.4027 401.0 1604 1.0887
0.4027 402.0 1608 1.0771
0.4027 403.0 1612 1.1094
0.4027 404.0 1616 1.1436
0.4027 405.0 1620 1.1654
0.4027 406.0 1624 1.1661
0.4027 407.0 1628 1.1561
0.4027 408.0 1632 1.1425
0.4027 409.0 1636 1.1329
0.4027 410.0 1640 1.1031
0.4027 411.0 1644 1.0969
0.4027 412.0 1648 1.1374
0.4027 413.0 1652 1.2151
0.4027 414.0 1656 1.2531
0.4027 415.0 1660 1.2576
0.4027 416.0 1664 1.2520
0.4027 417.0 1668 1.2261
0.4027 418.0 1672 1.1952
0.4027 419.0 1676 1.1627
0.4027 420.0 1680 1.1412
0.4027 421.0 1684 1.1316
0.4027 422.0 1688 1.1335
0.4027 423.0 1692 1.1366
0.4027 424.0 1696 1.1405
0.4027 425.0 1700 1.1503
0.4027 426.0 1704 1.1579
0.4027 427.0 1708 1.1629
0.4027 428.0 1712 1.1647
0.4027 429.0 1716 1.1752
0.4027 430.0 1720 1.2149
0.4027 431.0 1724 1.2361
0.4027 432.0 1728 1.2406
0.4027 433.0 1732 1.2271
0.4027 434.0 1736 1.2130
0.4027 435.0 1740 1.2011
0.4027 436.0 1744 1.1930
0.4027 437.0 1748 1.1895
0.4027 438.0 1752 1.1903
0.4027 439.0 1756 1.1907
0.4027 440.0 1760 1.1871
0.4027 441.0 1764 1.1850
0.4027 442.0 1768 1.1835
0.4027 443.0 1772 1.1841
0.4027 444.0 1776 1.1790
0.4027 445.0 1780 1.1860
0.4027 446.0 1784 1.1998
0.4027 447.0 1788 1.2106
0.4027 448.0 1792 1.2091
0.4027 449.0 1796 1.2059
0.4027 450.0 1800 1.2032
0.4027 451.0 1804 1.2225
0.4027 452.0 1808 1.2336
0.4027 453.0 1812 1.2409
0.4027 454.0 1816 1.2450
0.4027 455.0 1820 1.2479
0.4027 456.0 1824 1.2373
0.4027 457.0 1828 1.2258
0.4027 458.0 1832 1.2178
0.4027 459.0 1836 1.2142
0.4027 460.0 1840 1.2237
0.4027 461.0 1844 1.2365
0.4027 462.0 1848 1.2448
0.4027 463.0 1852 1.2462
0.4027 464.0 1856 1.2458
0.4027 465.0 1860 1.2426
0.4027 466.0 1864 1.2366
0.4027 467.0 1868 1.2280
0.4027 468.0 1872 1.2097
0.4027 469.0 1876 1.1996
0.4027 470.0 1880 1.1970
0.4027 471.0 1884 1.1946
0.4027 472.0 1888 1.1921
0.4027 473.0 1892 1.1885
0.4027 474.0 1896 1.1959
0.4027 475.0 1900 1.2028
0.4027 476.0 1904 1.2091
0.4027 477.0 1908 1.2131
0.4027 478.0 1912 1.2149
0.4027 479.0 1916 1.2142
0.4027 480.0 1920 1.2106
0.4027 481.0 1924 1.2185
0.4027 482.0 1928 1.2249
0.4027 483.0 1932 1.2221
0.4027 484.0 1936 1.2240
0.4027 485.0 1940 1.2291
0.4027 486.0 1944 1.2215
0.4027 487.0 1948 1.2306
0.4027 488.0 1952 1.2364
0.4027 489.0 1956 1.2394
0.4027 490.0 1960 1.2425
0.4027 491.0 1964 1.2441
0.4027 492.0 1968 1.2484
0.4027 493.0 1972 1.2533
0.4027 494.0 1976 1.2587
0.4027 495.0 1980 1.2861
0.4027 496.0 1984 1.3230
0.4027 497.0 1988 1.3310
0.4027 498.0 1992 1.3040
0.4027 499.0 1996 1.2828
0.4015 500.0 2000 1.2658
0.4015 501.0 2004 1.2563
0.4015 502.0 2008 1.2468
0.4015 503.0 2012 1.2381
0.4015 504.0 2016 1.2305
0.4015 505.0 2020 1.2271
0.4015 506.0 2024 1.2447
0.4015 507.0 2028 1.2642
0.4015 508.0 2032 1.2743
0.4015 509.0 2036 1.2797
0.4015 510.0 2040 1.2839
0.4015 511.0 2044 1.2645
0.4015 512.0 2048 1.2411
0.4015 513.0 2052 1.2261
0.4015 514.0 2056 1.2141
0.4015 515.0 2060 1.2026
0.4015 516.0 2064 1.1991
0.4015 517.0 2068 1.2004
0.4015 518.0 2072 1.1927
0.4015 519.0 2076 1.2065
0.4015 520.0 2080 1.1876
0.4015 521.0 2084 1.1670
0.4015 522.0 2088 1.2298
0.4015 523.0 2092 1.2412
0.4015 524.0 2096 1.2469
0.4015 525.0 2100 1.2639
0.4015 526.0 2104 1.2845
0.4015 527.0 2108 1.2928
0.4015 528.0 2112 1.2928
0.4015 529.0 2116 1.2901
0.4015 530.0 2120 1.2863
0.4015 531.0 2124 1.2819
0.4015 532.0 2128 1.2756
0.4015 533.0 2132 1.2602
0.4015 534.0 2136 1.2220
0.4015 535.0 2140 1.1909
0.4015 536.0 2144 1.1784
0.4015 537.0 2148 1.1824
0.4015 538.0 2152 1.1839
0.4015 539.0 2156 1.1836
0.4015 540.0 2160 1.1816
0.4015 541.0 2164 1.1767
0.4015 542.0 2168 1.1693
0.4015 543.0 2172 1.1573
0.4015 544.0 2176 1.1424
0.4015 545.0 2180 1.1312
0.4015 546.0 2184 1.1262
0.4015 547.0 2188 1.1330
0.4015 548.0 2192 1.1370
0.4015 549.0 2196 1.1386
0.4015 550.0 2200 1.1450
0.4015 551.0 2204 1.1489
0.4015 552.0 2208 1.1465
0.4015 553.0 2212 1.1458
0.4015 554.0 2216 1.1438
0.4015 555.0 2220 1.1405
0.4015 556.0 2224 1.1413
0.4015 557.0 2228 1.1443
0.4015 558.0 2232 1.1478
0.4015 559.0 2236 1.1519
0.4015 560.0 2240 1.1579
0.4015 561.0 2244 1.1543
0.4015 562.0 2248 1.1479
0.4015 563.0 2252 1.1474
0.4015 564.0 2256 1.1388
0.4015 565.0 2260 1.1312
0.4015 566.0 2264 1.1319
0.4015 567.0 2268 1.1345
0.4015 568.0 2272 1.1379
0.4015 569.0 2276 1.1343
0.4015 570.0 2280 1.1312
0.4015 571.0 2284 1.1294
0.4015 572.0 2288 1.1286
0.4015 573.0 2292 1.1313
0.4015 574.0 2296 1.1344
0.4015 575.0 2300 1.1408
0.4015 576.0 2304 1.1502
0.4015 577.0 2308 1.1605
0.4015 578.0 2312 1.1661
0.4015 579.0 2316 1.1772
0.4015 580.0 2320 1.1835
0.4015 581.0 2324 1.1882
0.4015 582.0 2328 1.1931
0.4015 583.0 2332 1.1966
0.4015 584.0 2336 1.1995
0.4015 585.0 2340 1.1999
0.4015 586.0 2344 1.1976
0.4015 587.0 2348 1.2158
0.4015 588.0 2352 1.2351
0.4015 589.0 2356 1.2386
0.4015 590.0 2360 1.2322
0.4015 591.0 2364 1.2268
0.4015 592.0 2368 1.2168
0.4015 593.0 2372 1.2058
0.4015 594.0 2376 1.1940
0.4015 595.0 2380 1.1846
0.4015 596.0 2384 1.1756
0.4015 597.0 2388 1.1728
0.4015 598.0 2392 1.1731
0.4015 599.0 2396 1.1747
0.4015 600.0 2400 1.1754
0.4015 601.0 2404 1.1738
0.4015 602.0 2408 1.1766
0.4015 603.0 2412 1.1779
0.4015 604.0 2416 1.1781
0.4015 605.0 2420 1.1755
0.4015 606.0 2424 1.1726
0.4015 607.0 2428 1.1691
0.4015 608.0 2432 1.1652
0.4015 609.0 2436 1.1594
0.4015 610.0 2440 1.1497
0.4015 611.0 2444 1.1450
0.4015 612.0 2448 1.1467
0.4015 613.0 2452 1.1463
0.4015 614.0 2456 1.1456
0.4015 615.0 2460 1.1613
0.4015 616.0 2464 1.1746
0.4015 617.0 2468 1.1846
0.4015 618.0 2472 1.1864
0.4015 619.0 2476 1.1849
0.4015 620.0 2480 1.1839
0.4015 621.0 2484 1.1802
0.4015 622.0 2488 1.1759
0.4015 623.0 2492 1.1711
0.4015 624.0 2496 1.1654
0.4009 625.0 2500 1.1607
0.4009 626.0 2504 1.1558
0.4009 627.0 2508 1.1530
0.4009 628.0 2512 1.1523
0.4009 629.0 2516 1.1515
0.4009 630.0 2520 1.1477
0.4009 631.0 2524 1.1447
0.4009 632.0 2528 1.1449
0.4009 633.0 2532 1.1450
0.4009 634.0 2536 1.1520
0.4009 635.0 2540 1.1594
0.4009 636.0 2544 1.1627
0.4009 637.0 2548 1.1648
0.4009 638.0 2552 1.1668
0.4009 639.0 2556 1.1679
0.4009 640.0 2560 1.1674
0.4009 641.0 2564 1.1629
0.4009 642.0 2568 1.1590
0.4009 643.0 2572 1.1572
0.4009 644.0 2576 1.1574
0.4009 645.0 2580 1.1560
0.4009 646.0 2584 1.1547
0.4009 647.0 2588 1.1626
0.4009 648.0 2592 1.1698
0.4009 649.0 2596 1.1810
0.4009 650.0 2600 1.1890
0.4009 651.0 2604 1.1906
0.4009 652.0 2608 1.1845
0.4009 653.0 2612 1.1802
0.4009 654.0 2616 1.1777
0.4009 655.0 2620 1.1755
0.4009 656.0 2624 1.1743
0.4009 657.0 2628 1.1838
0.4009 658.0 2632 1.1907
0.4009 659.0 2636 1.1953
0.4009 660.0 2640 1.2169
0.4009 661.0 2644 1.2343
0.4009 662.0 2648 1.2517
0.4009 663.0 2652 1.2641
0.4009 664.0 2656 1.2559
0.4009 665.0 2660 1.2292
0.4009 666.0 2664 1.2040
0.4009 667.0 2668 1.1851
0.4009 668.0 2672 1.1710
0.4009 669.0 2676 1.1577
0.4009 670.0 2680 1.1502
0.4009 671.0 2684 1.1591
0.4009 672.0 2688 1.1709
0.4009 673.0 2692 1.1813
0.4009 674.0 2696 1.1893
0.4009 675.0 2700 1.1942
0.4009 676.0 2704 1.1949
0.4009 677.0 2708 1.1814
0.4009 678.0 2712 1.1825
0.4009 679.0 2716 1.1880
0.4009 680.0 2720 1.1829
0.4009 681.0 2724 1.1667
0.4009 682.0 2728 1.1637
0.4009 683.0 2732 1.1631
0.4009 684.0 2736 1.1605
0.4009 685.0 2740 1.1599
0.4009 686.0 2744 1.1571
0.4009 687.0 2748 1.1528
0.4009 688.0 2752 1.1541
0.4009 689.0 2756 1.1628
0.4009 690.0 2760 1.1750
0.4009 691.0 2764 1.1855
0.4009 692.0 2768 1.1928
0.4009 693.0 2772 1.1962
0.4009 694.0 2776 1.1970
0.4009 695.0 2780 1.1976
0.4009 696.0 2784 1.1929
0.4009 697.0 2788 1.1959
0.4009 698.0 2792 1.2003
0.4009 699.0 2796 1.2046
0.4009 700.0 2800 1.2084
0.4009 701.0 2804 1.2097
0.4009 702.0 2808 1.2109
0.4009 703.0 2812 1.2124
0.4009 704.0 2816 1.2159
0.4009 705.0 2820 1.2190
0.4009 706.0 2824 1.2203
0.4009 707.0 2828 1.2186
0.4009 708.0 2832 1.2156
0.4009 709.0 2836 1.2086
0.4009 710.0 2840 1.2024
0.4009 711.0 2844 1.1998
0.4009 712.0 2848 1.1986
0.4009 713.0 2852 1.1981
0.4009 714.0 2856 1.2001
0.4009 715.0 2860 1.2019
0.4009 716.0 2864 1.2038
0.4009 717.0 2868 1.2051
0.4009 718.0 2872 1.1869
0.4009 719.0 2876 1.1780
0.4009 720.0 2880 1.1821
0.4009 721.0 2884 1.1875
0.4009 722.0 2888 1.1881
0.4009 723.0 2892 1.1867
0.4009 724.0 2896 1.1862
0.4009 725.0 2900 1.1858
0.4009 726.0 2904 1.1841
0.4009 727.0 2908 1.1803
0.4009 728.0 2912 1.1781
0.4009 729.0 2916 1.1751
0.4009 730.0 2920 1.1735
0.4009 731.0 2924 1.1709
0.4009 732.0 2928 1.1676
0.4009 733.0 2932 1.1643
0.4009 734.0 2936 1.1640
0.4009 735.0 2940 1.1636
0.4009 736.0 2944 1.1596
0.4009 737.0 2948 1.1704
0.4009 738.0 2952 1.1773
0.4009 739.0 2956 1.1814
0.4009 740.0 2960 1.1891
0.4009 741.0 2964 1.1954
0.4009 742.0 2968 1.2006
0.4009 743.0 2972 1.1996
0.4009 744.0 2976 1.1986
0.4009 745.0 2980 1.1979
0.4009 746.0 2984 1.1958
0.4009 747.0 2988 1.1947
0.4009 748.0 2992 1.1930
0.4009 749.0 2996 1.1894
0.4006 750.0 3000 1.1871
0.4006 751.0 3004 1.1853
0.4006 752.0 3008 1.1854
0.4006 753.0 3012 1.1866
0.4006 754.0 3016 1.1901
0.4006 755.0 3020 1.1924
0.4006 756.0 3024 1.1946
0.4006 757.0 3028 1.2176
0.4006 758.0 3032 1.2392
0.4006 759.0 3036 1.2502
0.4006 760.0 3040 1.2617
0.4006 761.0 3044 1.2924
0.4006 762.0 3048 1.3111
0.4006 763.0 3052 1.3042
0.4006 764.0 3056 1.2828
0.4006 765.0 3060 1.2628
0.4006 766.0 3064 1.2553
0.4006 767.0 3068 1.2600
0.4006 768.0 3072 1.2645
0.4006 769.0 3076 1.2678
0.4006 770.0 3080 1.2706
0.4006 771.0 3084 1.2620
0.4006 772.0 3088 1.2547
0.4006 773.0 3092 1.2503
0.4006 774.0 3096 1.2459
0.4006 775.0 3100 1.2452
0.4006 776.0 3104 1.2442
0.4006 777.0 3108 1.2393
0.4006 778.0 3112 1.2328
0.4006 779.0 3116 1.2249
0.4006 780.0 3120 1.2223
0.4006 781.0 3124 1.2302
0.4006 782.0 3128 1.2334
0.4006 783.0 3132 1.2332
0.4006 784.0 3136 1.2326
0.4006 785.0 3140 1.2330
0.4006 786.0 3144 1.2281
0.4006 787.0 3148 1.2294
0.4006 788.0 3152 1.2327
0.4006 789.0 3156 1.2408
0.4006 790.0 3160 1.2459
0.4006 791.0 3164 1.2488
0.4006 792.0 3168 1.2509
0.4006 793.0 3172 1.2510
0.4006 794.0 3176 1.2514
0.4006 795.0 3180 1.2491
0.4006 796.0 3184 1.2476
0.4006 797.0 3188 1.2470
0.4006 798.0 3192 1.2470
0.4006 799.0 3196 1.2464
0.4006 800.0 3200 1.2468
0.4006 801.0 3204 1.2460
0.4006 802.0 3208 1.2425
0.4006 803.0 3212 1.2415
0.4006 804.0 3216 1.2416
0.4006 805.0 3220 1.2420
0.4006 806.0 3224 1.2442
0.4006 807.0 3228 1.2465
0.4006 808.0 3232 1.2481
0.4006 809.0 3236 1.2477
0.4006 810.0 3240 1.2468
0.4006 811.0 3244 1.2467
0.4006 812.0 3248 1.2471
0.4006 813.0 3252 1.2486
0.4006 814.0 3256 1.2484
0.4006 815.0 3260 1.2484
0.4006 816.0 3264 1.2477
0.4006 817.0 3268 1.2545
0.4006 818.0 3272 1.2622
0.4006 819.0 3276 1.2672
0.4006 820.0 3280 1.2704
0.4006 821.0 3284 1.2719
0.4006 822.0 3288 1.2710
0.4006 823.0 3292 1.2697
0.4006 824.0 3296 1.2671
0.4006 825.0 3300 1.2717
0.4006 826.0 3304 1.2763
0.4006 827.0 3308 1.2774
0.4006 828.0 3312 1.2773
0.4006 829.0 3316 1.2765
0.4006 830.0 3320 1.2767
0.4006 831.0 3324 1.2760
0.4006 832.0 3328 1.2755
0.4006 833.0 3332 1.2742
0.4006 834.0 3336 1.2732
0.4006 835.0 3340 1.2681
0.4006 836.0 3344 1.2624
0.4006 837.0 3348 1.2577
0.4006 838.0 3352 1.2530
0.4006 839.0 3356 1.2488
0.4006 840.0 3360 1.2455
0.4006 841.0 3364 1.2440
0.4006 842.0 3368 1.2459
0.4006 843.0 3372 1.2487
0.4006 844.0 3376 1.2498
0.4006 845.0 3380 1.2504
0.4006 846.0 3384 1.2476
0.4006 847.0 3388 1.2446
0.4006 848.0 3392 1.2400
0.4006 849.0 3396 1.2353
0.4006 850.0 3400 1.2298
0.4006 851.0 3404 1.2246
0.4006 852.0 3408 1.2207
0.4006 853.0 3412 1.2129
0.4006 854.0 3416 1.2030
0.4006 855.0 3420 1.1937
0.4006 856.0 3424 1.1898
0.4006 857.0 3428 1.1907
0.4006 858.0 3432 1.1910
0.4006 859.0 3436 1.1919
0.4006 860.0 3440 1.1920
0.4006 861.0 3444 1.1923
0.4006 862.0 3448 1.1927
0.4006 863.0 3452 1.1933
0.4006 864.0 3456 1.1934
0.4006 865.0 3460 1.1937
0.4006 866.0 3464 1.1936
0.4006 867.0 3468 1.1932
0.4006 868.0 3472 1.1926
0.4006 869.0 3476 1.1917
0.4006 870.0 3480 1.1899
0.4006 871.0 3484 1.1884
0.4006 872.0 3488 1.1858
0.4006 873.0 3492 1.1842
0.4006 874.0 3496 1.1835
0.4 875.0 3500 1.1836
0.4 876.0 3504 1.1845
0.4 877.0 3508 1.1867
0.4 878.0 3512 1.1902
0.4 879.0 3516 1.1945
0.4 880.0 3520 1.1972
0.4 881.0 3524 1.1996
0.4 882.0 3528 1.2025
0.4 883.0 3532 1.2048
0.4 884.0 3536 1.2061
0.4 885.0 3540 1.2076
0.4 886.0 3544 1.2078
0.4 887.0 3548 1.2093
0.4 888.0 3552 1.2160
0.4 889.0 3556 1.2185
0.4 890.0 3560 1.2167
0.4 891.0 3564 1.2196
0.4 892.0 3568 1.2207
0.4 893.0 3572 1.2203
0.4 894.0 3576 1.2191
0.4 895.0 3580 1.2181
0.4 896.0 3584 1.2176
0.4 897.0 3588 1.2169
0.4 898.0 3592 1.2157
0.4 899.0 3596 1.2177
0.4 900.0 3600 1.2208
0.4 901.0 3604 1.2232
0.4 902.0 3608 1.2245
0.4 903.0 3612 1.2242
0.4 904.0 3616 1.2231
0.4 905.0 3620 1.2219
0.4 906.0 3624 1.2211
0.4 907.0 3628 1.2215
0.4 908.0 3632 1.2216
0.4 909.0 3636 1.2204
0.4 910.0 3640 1.2193
0.4 911.0 3644 1.2182
0.4 912.0 3648 1.2165
0.4 913.0 3652 1.2148
0.4 914.0 3656 1.2128
0.4 915.0 3660 1.2120
0.4 916.0 3664 1.2113
0.4 917.0 3668 1.2111
0.4 918.0 3672 1.2114
0.4 919.0 3676 1.2117
0.4 920.0 3680 1.2108
0.4 921.0 3684 1.2107
0.4 922.0 3688 1.2097
0.4 923.0 3692 1.2084
0.4 924.0 3696 1.2072
0.4 925.0 3700 1.2063
0.4 926.0 3704 1.2060
0.4 927.0 3708 1.2055
0.4 928.0 3712 1.2053
0.4 929.0 3716 1.2053
0.4 930.0 3720 1.2055
0.4 931.0 3724 1.2061
0.4 932.0 3728 1.2091
0.4 933.0 3732 1.2121
0.4 934.0 3736 1.2141
0.4 935.0 3740 1.2150
0.4 936.0 3744 1.2152
0.4 937.0 3748 1.2153
0.4 938.0 3752 1.2153
0.4 939.0 3756 1.2150
0.4 940.0 3760 1.2153
0.4 941.0 3764 1.2154
0.4 942.0 3768 1.2156
0.4 943.0 3772 1.2156
0.4 944.0 3776 1.2144
0.4 945.0 3780 1.2107
0.4 946.0 3784 1.2078
0.4 947.0 3788 1.2060
0.4 948.0 3792 1.2047
0.4 949.0 3796 1.2026
0.4 950.0 3800 1.2003
0.4 951.0 3804 1.1986
0.4 952.0 3808 1.1975
0.4 953.0 3812 1.1969
0.4 954.0 3816 1.1958
0.4 955.0 3820 1.1946
0.4 956.0 3824 1.1937
0.4 957.0 3828 1.1928
0.4 958.0 3832 1.1928
0.4 959.0 3836 1.1928
0.4 960.0 3840 1.1933
0.4 961.0 3844 1.1939
0.4 962.0 3848 1.1942
0.4 963.0 3852 1.1947
0.4 964.0 3856 1.1954
0.4 965.0 3860 1.1961
0.4 966.0 3864 1.1966
0.4 967.0 3868 1.1985
0.4 968.0 3872 1.2002
0.4 969.0 3876 1.2015
0.4 970.0 3880 1.2035
0.4 971.0 3884 1.2047
0.4 972.0 3888 1.2050
0.4 973.0 3892 1.2057
0.4 974.0 3896 1.2064
0.4 975.0 3900 1.2068
0.4 976.0 3904 1.2067
0.4 977.0 3908 1.2067
0.4 978.0 3912 1.2065
0.4 979.0 3916 1.2063
0.4 980.0 3920 1.2060
0.4 981.0 3924 1.2059
0.4 982.0 3928 1.2059
0.4 983.0 3932 1.2059
0.4 984.0 3936 1.2060
0.4 985.0 3940 1.2060
0.4 986.0 3944 1.2059
0.4 987.0 3948 1.2059
0.4 988.0 3952 1.2059
0.4 989.0 3956 1.2059
0.4 990.0 3960 1.2059
0.4 991.0 3964 1.2060
0.4 992.0 3968 1.2060
0.4 993.0 3972 1.2060
0.4 994.0 3976 1.2054
0.4 995.0 3980 1.2047
0.4 996.0 3984 1.2043
0.4 997.0 3988 1.2041
0.4 998.0 3992 1.2040
0.4 999.0 3996 1.2039
0.4009 1000.0 4000 1.2040

Framework versions

  • Transformers 4.36.2
  • Pytorch 2.1.2+cu121
  • Datasets 2.14.7
  • Tokenizers 0.15.0
Downloads last month
15
Safetensors
Model size
124M params
Tensor type
F32
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.