Maelstrome committed
Commit 5202b39 • Parent(s): 577d99b

End of training

Browse files:
- README.md +8 -13
- adapter_config.json +5 -5
- adapter_model.safetensors +1 -1
- training_args.bin +1 -1
README.md CHANGED

@@ -13,23 +13,24 @@ model-index:
   results: []
 ---
 
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+
 # gemma-2b-storytelling
 
 This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on the generator dataset.
-It achieves the following results on the evaluation set:
-- Loss: nan
 
 ## Model description
 
-
+More information needed
 
 ## Intended uses & limitations
 
-
+More information needed
 
 ## Training and evaluation data
 
-
+More information needed
 
 ## Training procedure
 
@@ -42,21 +43,15 @@ The following hyperparameters were used during training:
 - seed: 42
 - gradient_accumulation_steps: 8
 - total_train_batch_size: 32
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.05
 - training_steps: 154
 
-### Training results
-
-| Training Loss    | Epoch  | Step | Validation Loss |
-|:----------------:|:------:|:----:|:---------------:|
-| 1454737970954.24 | 0.9164 | 100  | nan             |
-
 ### Framework versions
 
 - PEFT 0.10.0
 - Transformers 4.40.1
 - Pytorch 2.2.2+cu121
 - Datasets 2.19.0
-- Tokenizers 0.19.1
+- Tokenizers 0.19.1
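The hyperparameter list above maps directly onto `transformers.TrainingArguments`. Below is a minimal sketch of the equivalent setup; the learning rate and per-device batch size are not visible in this hunk, so those two values are placeholder assumptions (a per-device batch of 4 on a single GPU times 8 accumulation steps gives the listed total of 32):

```python
# Minimal sketch, not the author's exact invocation.
# learning_rate and per_device_train_batch_size are assumptions; everything
# else mirrors the hyperparameters listed in the model card.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="gemma-2b-storytelling",
    seed=42,
    per_device_train_batch_size=4,   # assumed: 4 * 8 accumulation = 32 total
    gradient_accumulation_steps=8,
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    warmup_ratio=0.05,
    max_steps=154,                   # "training_steps: 154" in the card
    learning_rate=2e-4,              # placeholder: not shown in this diff
)
```

The removed results table (training loss around 1.45e12 and a `nan` validation loss at step 100) indicates the run diverged; if reproducing, gradient clipping via `max_grad_norm` or a lower learning rate would be the usual first adjustments.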
adapter_config.json CHANGED

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "gate_proj",
+    "o_proj",
     "v_proj",
-    "q_proj",
-    "down_proj",
     "up_proj",
-    "gate_proj",
-    "o_proj",
-    "k_proj"
+    "down_proj",
+    "q_proj",
+    "k_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
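The `target_modules` list applies LoRA to every attention projection (`q_proj`, `k_proj`, `v_proj`, `o_proj`) and every MLP projection (`gate_proj`, `up_proj`, `down_proj`) in Gemma's decoder blocks. A sketch of the corresponding `peft.LoraConfig` follows; rank, alpha, and dropout are not visible in this hunk, so those values are placeholders:

```python
# Sketch of the LoraConfig implied by the updated adapter_config.json.
# r, lora_alpha, and lora_dropout are placeholder assumptions.
from peft import LoraConfig

lora_config = LoraConfig(
    task_type="CAUSAL_LM",
    target_modules=[
        "gate_proj", "o_proj", "v_proj", "up_proj",
        "down_proj", "q_proj", "k_proj",
    ],
    r=16,               # placeholder: rank not shown in this hunk
    lora_alpha=32,      # placeholder
    lora_dropout=0.05,  # placeholder
    use_dora=False,     # matches "use_dora": false above
)
```

PEFT converts a `target_modules` list to a set internally, which is likely why the JSON ordering changed here even though the module set did not.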
adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:fd8288e0c2887913ec48ae015aaf5524afb0102d4347650cbd097232b8250fcc
 size 156926880
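`adapter_model.safetensors` holds only the LoRA adapter weights (about 157 MB), not a merged copy of the base model, so inference requires loading the adapter on top of `google/gemma-2b`. A sketch follows; the adapter repo id is inferred from the committer and model name, so treat it as an assumption:

```python
# Sketch: apply the LoRA adapter to the base model and generate.
# The adapter repo id is an assumption (committer name + model name).
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2b", torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base, "Maelstrome/gemma-2b-storytelling")
tokenizer = AutoTokenizer.from_pretrained("google/gemma-2b")

inputs = tokenizer("Once upon a time", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=64)[0]))
```

Given the `nan` evaluation loss recorded above, outputs from this checkpoint may be degenerate.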
training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:6fe5c200ad09b1c5099fc17a8d8e79cf6638956c5bd4e9a7c17b62099f41039e
 size 4984
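Both binary files are stored as Git LFS pointers: the three lines shown (spec version, `oid sha256:`, byte size) are the entire pointer, and the hash identifies the actual blob on the LFS server. A sketch of verifying a downloaded file against its pointer, using the oid and size from this diff:

```python
# Sketch: check a downloaded LFS file against its pointer's sha256 oid.
# Expected values are copied from the training_args.bin pointer above.
import hashlib
import os

def sha256_of(path: str) -> str:
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()

path = "training_args.bin"
assert os.path.getsize(path) == 4984
assert sha256_of(path) == (
    "6fe5c200ad09b1c5099fc17a8d8e79cf6638956c5bd4e9a7c17b62099f41039e"
)
```

`training_args.bin` itself is the pickled `TrainingArguments` object the `Trainer` saves alongside checkpoints; with the PyTorch 2.2.2 listed in the card, `torch.load("training_args.bin")` recovers it given a matching `transformers` install.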