Marcoz
/

lora

Generated from Trainer

Model card Files Files and versions Community

Marcoz commited on Aug 2

Commit

9211e7f

•

1 Parent(s): f14a462

End of training

Files changed (3) hide show

README.md +8 -8
adapter_config.json +3 -3
adapter_model.safetensors +1 -1

README.md CHANGED Viewed

@@ -1,8 +1,6 @@
 ---
 base_model: google-t5/t5-small
 library_name: peft
-metrics:
-- accuracy
 tags:
 - generated_from_trainer
 model-index:
@@ -13,13 +11,15 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/zhuangc19/cs5740-sp24-assignment-4-Marcozc19/runs/qpe1e9fe)
 # ft-t5-small-on-airlineDB
 This model is a fine-tuned version of [google t5-small](https://huggingface.co/google t5-small) on the custom airlineDB dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1025
-- Accuracy: 0.0064
 ## Model description
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.1773        | 1.0   | 1057 | 0.1025          | 0.0064   |
 ### Framework versions

 ---
 base_model: google-t5/t5-small
 library_name: peft
 tags:
 - generated_from_trainer
 model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/zhuangc19/cs5740-sp24-assignment-4-Marcozc19/runs/gcggsyfe)
 # ft-t5-small-on-airlineDB
 This model is a fine-tuned version of [google t5-small](https://huggingface.co/google t5-small) on the custom airlineDB dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0992
+- Sql Em: 0.0815
+- Record Em: 0.1996
+- Record F1: 0.1996
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Sql Em | Record Em | Record F1 |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:---------:|:---------:|
+| 0.1751        | 1.0   | 1057 | 0.0992          | 0.0815 | 0.1996    | 0.1996    |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,10 +20,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "k",
-    "o",
     "q",
-    "v"
   ],
   "task_type": "SEQ_2_SEQ_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v",
     "q",
+    "o",
+    "k"
   ],
   "task_type": "SEQ_2_SEQ_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9a3147f6cf51ca9d0cf3ad00f3876686cc3900f491598ae9bb0e410f93b1dcaa
 size 1199384

 version https://git-lfs.github.com/spec/v1
+oid sha256:48d9f0fb6522c712386084e5fcc92e9b452e3025037c44e995ffe2f61b6f0308
 size 1199384