hftest2242
/

my_awesome_billsum_model

@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Rouge1
       type: rouge
-      value: 0.1404
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,11 +32,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the billsum dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.4980
-- Rouge1: 0.1404
-- Rouge2: 0.052
-- Rougel: 0.1172
-- Rougelsum: 0.1176
 - Gen Len: 19.0
 ## Model description
@@ -57,8 +57,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -68,10 +68,10 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| No log        | 1.0   | 62   | 2.7839          | 0.1277 | 0.0381 | 0.1076 | 0.1081    | 19.0    |
-| No log        | 2.0   | 124  | 2.5780          | 0.1333 | 0.0433 | 0.1114 | 0.1118    | 19.0    |
-| No log        | 3.0   | 186  | 2.5132          | 0.1394 | 0.0518 | 0.1168 | 0.1171    | 19.0    |
-| No log        | 4.0   | 248  | 2.4980          | 0.1404 | 0.052  | 0.1172 | 0.1176    | 19.0    |
 ### Framework versions

     metrics:
     - name: Rouge1
       type: rouge
+      value: 0.1993
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the billsum dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.3574
+- Rouge1: 0.1993
+- Rouge2: 0.1009
+- Rougel: 0.1702
+- Rougelsum: 0.1704
 - Gen Len: 19.0
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| No log        | 1.0   | 248  | 2.5145          | 0.1394 | 0.0522 | 0.1171 | 0.1172    | 19.0    |
+| No log        | 2.0   | 496  | 2.4057          | 0.1935 | 0.0951 | 0.1642 | 0.1644    | 19.0    |
+| 2.8828        | 3.0   | 744  | 2.3667          | 0.2004 | 0.1024 | 0.1716 | 0.1717    | 19.0    |
+| 2.8828        | 4.0   | 992  | 2.3574          | 0.1993 | 0.1009 | 0.1702 | 0.1704    | 19.0    |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6c30e0b92fecf1d900f8564df0d5302316116794fa47da061ec8d6bc62c92c41
 size 242071641

 version https://git-lfs.github.com/spec/v1
+oid sha256:3605c851bb0fb9b54ea52661a343d0e3d07dc91f480d300085231e53c1182042
 size 242071641