File size: 10,241 Bytes
0d0f91a
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
---
license: apache-2.0
base_model: t5-small
tags:
- generated_from_trainer
model-index:
- name: text_shortening_model_v75
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# text_shortening_model_v75

This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
It achieves the following results on the evaluation set:
- Loss: 1.2113
- Bert precision: 0.8889
- Bert recall: 0.8883
- Bert f1-score: 0.8881
- Average word count: 6.8466
- Max word count: 15
- Min word count: 1
- Average token count: 10.892
- % shortened texts with length > 12: 1.9632

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 1e-05
- train_batch_size: 64
- eval_batch_size: 64
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 40

### Training results

| Training Loss | Epoch | Step | Validation Loss | Bert precision | Bert recall | Bert f1-score | Average word count | Max word count | Min word count | Average token count | % shortened texts with length > 12 |
|:-------------:|:-----:|:----:|:---------------:|:--------------:|:-----------:|:-------------:|:------------------:|:--------------:|:--------------:|:-------------------:|:----------------------------------:|
| 2.4857        | 1.0   | 30   | 1.9604          | 0.8298         | 0.8444      | 0.8359        | 9.1436             | 19             | 1              | 13.7337             | 14.2331                            |
| 2.1772        | 2.0   | 60   | 1.7312          | 0.8337         | 0.839       | 0.8349        | 8.1264             | 19             | 1              | 12.3264             | 10.5521                            |
| 1.9897        | 3.0   | 90   | 1.6036          | 0.8513         | 0.8528      | 0.8508        | 7.6528             | 19             | 1              | 11.8748             | 8.3436                             |
| 1.8748        | 4.0   | 120  | 1.5274          | 0.8616         | 0.8583      | 0.8589        | 7.1988             | 17             | 1              | 11.4368             | 6.0123                             |
| 1.7948        | 5.0   | 150  | 1.4678          | 0.8709         | 0.8669      | 0.868         | 7.0086             | 17             | 1              | 11.1914             | 4.4172                             |
| 1.7436        | 6.0   | 180  | 1.4245          | 0.8763         | 0.8726      | 0.8737        | 6.9681             | 16             | 1              | 11.1387             | 3.8037                             |
| 1.6914        | 7.0   | 210  | 1.3948          | 0.8808         | 0.8792      | 0.8793        | 6.9706             | 18             | 1              | 11.0773             | 3.9264                             |
| 1.6484        | 8.0   | 240  | 1.3716          | 0.8846         | 0.8814      | 0.8824        | 6.789              | 15             | 2              | 10.8687             | 2.9448                             |
| 1.6177        | 9.0   | 270  | 1.3534          | 0.8858         | 0.8827      | 0.8836        | 6.8294             | 16             | 2              | 10.8712             | 3.0675                             |
| 1.6034        | 10.0  | 300  | 1.3371          | 0.8854         | 0.8826      | 0.8834        | 6.8528             | 16             | 2              | 10.865              | 2.9448                             |
| 1.5696        | 11.0  | 330  | 1.3237          | 0.8863         | 0.8842      | 0.8847        | 6.8393             | 16             | 2              | 10.8577             | 2.6994                             |
| 1.5474        | 12.0  | 360  | 1.3115          | 0.8874         | 0.8844      | 0.8853        | 6.7669             | 16             | 2              | 10.7742             | 2.5767                             |
| 1.5354        | 13.0  | 390  | 1.3011          | 0.8867         | 0.8836      | 0.8846        | 6.7607             | 16             | 2              | 10.7644             | 2.3313                             |
| 1.5173        | 14.0  | 420  | 1.2916          | 0.8872         | 0.8834      | 0.8847        | 6.7067             | 16             | 2              | 10.7117             | 2.0859                             |
| 1.5061        | 15.0  | 450  | 1.2822          | 0.8873         | 0.8833      | 0.8848        | 6.6969             | 16             | 2              | 10.6945             | 1.9632                             |
| 1.4861        | 16.0  | 480  | 1.2742          | 0.8882         | 0.8846      | 0.8858        | 6.692              | 16             | 2              | 10.7043             | 1.5951                             |
| 1.4793        | 17.0  | 510  | 1.2673          | 0.8881         | 0.8848      | 0.8859        | 6.719              | 16             | 1              | 10.7325             | 1.9632                             |
| 1.4736        | 18.0  | 540  | 1.2621          | 0.8888         | 0.8856      | 0.8867        | 6.7399             | 16             | 1              | 10.7571             | 1.9632                             |
| 1.4592        | 19.0  | 570  | 1.2563          | 0.8889         | 0.8863      | 0.8871        | 6.7497             | 16             | 1              | 10.7755             | 1.9632                             |
| 1.459         | 20.0  | 600  | 1.2514          | 0.8885         | 0.8863      | 0.8868        | 6.773              | 16             | 1              | 10.7902             | 1.9632                             |
| 1.4446        | 21.0  | 630  | 1.2472          | 0.8883         | 0.8859      | 0.8865        | 6.7571             | 16             | 1              | 10.7546             | 1.8405                             |
| 1.4324        | 22.0  | 660  | 1.2431          | 0.888          | 0.8864      | 0.8866        | 6.7779             | 16             | 1              | 10.7853             | 1.8405                             |
| 1.431         | 23.0  | 690  | 1.2396          | 0.8881         | 0.8866      | 0.8868        | 6.7828             | 16             | 1              | 10.8098             | 1.8405                             |
| 1.4233        | 24.0  | 720  | 1.2358          | 0.8885         | 0.8869      | 0.8872        | 6.784              | 16             | 1              | 10.8123             | 1.9632                             |
| 1.4218        | 25.0  | 750  | 1.2322          | 0.8887         | 0.8874      | 0.8875        | 6.8135             | 16             | 1              | 10.8417             | 1.8405                             |
| 1.4086        | 26.0  | 780  | 1.2295          | 0.8885         | 0.8878      | 0.8876        | 6.8356             | 16             | 1              | 10.8982             | 1.9632                             |
| 1.4104        | 27.0  | 810  | 1.2267          | 0.8883         | 0.8877      | 0.8875        | 6.8491             | 16             | 1              | 10.9166             | 1.9632                             |
| 1.4046        | 28.0  | 840  | 1.2242          | 0.888          | 0.8877      | 0.8873        | 6.8577             | 16             | 1              | 10.9411             | 1.9632                             |
| 1.4034        | 29.0  | 870  | 1.2222          | 0.8882         | 0.8881      | 0.8876        | 6.8626             | 16             | 1              | 10.9436             | 1.9632                             |
| 1.3942        | 30.0  | 900  | 1.2204          | 0.8883         | 0.8881      | 0.8877        | 6.8577             | 16             | 1              | 10.935              | 2.0859                             |
| 1.3909        | 31.0  | 930  | 1.2182          | 0.8885         | 0.8881      | 0.8878        | 6.8368             | 15             | 1              | 10.908              | 1.8405                             |
| 1.385         | 32.0  | 960  | 1.2167          | 0.8889         | 0.8884      | 0.8882        | 6.838              | 15             | 1              | 10.9006             | 1.8405                             |
| 1.3833        | 33.0  | 990  | 1.2149          | 0.889          | 0.8884      | 0.8882        | 6.8368             | 15             | 1              | 10.8945             | 1.8405                             |
| 1.3831        | 34.0  | 1020 | 1.2139          | 0.8891         | 0.8885      | 0.8883        | 6.8454             | 15             | 1              | 10.9018             | 1.8405                             |
| 1.3811        | 35.0  | 1050 | 1.2129          | 0.8891         | 0.8884      | 0.8882        | 6.8356             | 15             | 1              | 10.8908             | 1.8405                             |
| 1.3869        | 36.0  | 1080 | 1.2124          | 0.8891         | 0.8883      | 0.8881        | 6.8294             | 15             | 1              | 10.8785             | 1.8405                             |
| 1.3696        | 37.0  | 1110 | 1.2120          | 0.889          | 0.8881      | 0.8881        | 6.8233             | 15             | 1              | 10.8663             | 1.8405                             |
| 1.3791        | 38.0  | 1140 | 1.2116          | 0.8889         | 0.8881      | 0.888         | 6.8307             | 15             | 1              | 10.8748             | 1.8405                             |
| 1.3755        | 39.0  | 1170 | 1.2113          | 0.8889         | 0.8881      | 0.888         | 6.8331             | 15             | 1              | 10.8773             | 1.8405                             |
| 1.3668        | 40.0  | 1200 | 1.2113          | 0.8889         | 0.8883      | 0.8881        | 6.8466             | 15             | 1              | 10.892              | 1.9632                             |


### Framework versions

- Transformers 4.33.1
- Pytorch 2.0.1+cu118
- Datasets 2.14.5
- Tokenizers 0.13.3