robinsmits
/

Mistral-Instruct-7B-v0.2-ChatAlpacaV2

@@ -2,7 +2,6 @@
 library_name: peft
 tags:
 - generated_from_trainer
-- unsloth
 base_model: unsloth/mistral-7b-instruct-v0.2-bnb-4bit
 model-index:
 - name: Mistral-Instruct-7B-v0.2-ChatAlpacaV2
@@ -16,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [unsloth/mistral-7b-instruct-v0.2-bnb-4bit](https://huggingface.co/unsloth/mistral-7b-instruct-v0.2-bnb-4bit) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8225
 ## Model description
@@ -44,22 +43,17 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.05
-- num_epochs: 2
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.8873        | 0.2   | 120  | 0.8803          |
-| 0.8507        | 0.39  | 240  | 0.8559          |
-| 0.8502        | 0.59  | 360  | 0.8462          |
-| 0.8275        | 0.78  | 480  | 0.8406          |
-| 0.8452        | 0.98  | 600  | 0.8299          |
-| 0.8296        | 1.18  | 720  | 0.8259          |
-| 0.8243        | 1.37  | 840  | 0.8242          |
-| 0.8133        | 1.57  | 960  | 0.8232          |
-| 0.8265        | 1.76  | 1080 | 0.8227          |
-| 0.8194        | 1.96  | 1200 | 0.8225          |
 ### Framework versions

 library_name: peft
 tags:
 - generated_from_trainer
 base_model: unsloth/mistral-7b-instruct-v0.2-bnb-4bit
 model-index:
 - name: Mistral-Instruct-7B-v0.2-ChatAlpacaV2
 This model is a fine-tuned version of [unsloth/mistral-7b-instruct-v0.2-bnb-4bit](https://huggingface.co/unsloth/mistral-7b-instruct-v0.2-bnb-4bit) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8439
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.05
+- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.8801        | 0.2   | 120  | 0.8756          |
+| 0.8498        | 0.39  | 240  | 0.8553          |
+| 0.8515        | 0.59  | 360  | 0.8475          |
+| 0.8313        | 0.78  | 480  | 0.8445          |
+| 0.857         | 0.98  | 600  | 0.8439          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1825bf7e59f15457790daedb498e009e48b0a7fd310d3ec77ef0c979574a0a4f
 size 109086416

 version https://git-lfs.github.com/spec/v1
+oid sha256:8667c5cd26654a97cf4eb14d65c636d9424c0d59d6c0f323be9aae34606e073a
 size 109086416

runs/Feb11_15-29-18_DS10/events.out.tfevents.1707661759.DS10.684074.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bc8ea7d34f478d184aca9f8956a66ede70fb33d05a61fd1ecc71d520baf09790
-size 9406

 version https://git-lfs.github.com/spec/v1
+oid sha256:8a97969852f73e1a8dc0c228d6855acfdbb2eb4d92776b52ac48621d9419d956
+size 9760