AvaniSharma committed · Commit 149139e · Parent: 2dc3c78
Update Readme

README.md CHANGED
@@ -21,11 +21,8 @@ and providing in quantization config when loading pretrained model
 - Using LoRA we add small low-rank weight matrices whose parameters are trained while the LLM's parameters are frozen.
 After finetuning is over, we merge the weights of these low-rank matrices with the LLM's weights to obtain the new finetuned weights.
 This makes the finetuning process faster and memory efficient.
-
-
-
-
-
+- We train an SFT (Supervised Fine-Tuning) trainer using the LoRA parameters and the training hyperparameters listed under the *Training Hyperparameters*
+section to finetune the base model.
 
 - **Developed by:** Avani Sharma
 - **Model type:** LLM
@@ -69,13 +66,6 @@ And following Hyperparameters for training
 report_to="wandb"
 ```
 
-
-## Evaluation
-
-
-
-
-
 ### Compute Infrastructure
 
 Kaggle
@@ -88,9 +78,6 @@ Kaggle GPU T4x2
 
 Kaggle Notebook
 
-
-
-
 ### Framework versions
 
 - PEFT 0.7.1
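The added line about training an SFT trainer with LoRA parameters matches the common PEFT + TRL pattern. The fragment below is an assumption-laden outline of that pattern, not the author's actual setup: only `report_to="wandb"` and PEFT 0.7.1 are confirmed by the card, and `base_model`, `train_dataset`, and every hyperparameter value are placeholders.

```python
# Illustrative PEFT + TRL outline; all values are assumptions, not the card's.
from peft import LoraConfig
from transformers import TrainingArguments
from trl import SFTTrainer

lora_config = LoraConfig(
    r=16,                    # assumed rank of the low-rank matrices
    lora_alpha=32,           # assumed scaling factor
    lora_dropout=0.05,       # assumed dropout on the adapter path
    task_type="CAUSAL_LM",
)

training_args = TrainingArguments(
    output_dir="outputs",    # assumed path
    report_to="wandb",       # confirmed by the card's hyperparameters
)

trainer = SFTTrainer(
    model=base_model,            # placeholder: the quantized base model
    train_dataset=train_dataset, # placeholder: the finetuning dataset
    peft_config=lora_config,     # SFTTrainer wraps the model with LoRA adapters
    args=training_args,
)
trainer.train()  # afterwards, the adapters can be merged into the base weights
```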
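The LoRA mechanics the README describes (train small low-rank matrices against a frozen base weight, then merge them back in) can be sketched numerically. All names, dimensions, and values below are illustrative, not taken from the model card:

```python
import numpy as np

# Sketch of the LoRA idea: the pretrained weight W is frozen, and only two
# small low-rank factors B (d_out x r) and A (r x d_in) are trained.
rng = np.random.default_rng(0)
d_out, d_in, r = 8, 8, 2                # r << d_out, d_in: the low-rank bottleneck

W = rng.standard_normal((d_out, d_in))  # frozen pretrained weight
A = rng.standard_normal((r, d_in))      # stand-in for a trained LoRA factor
B = rng.standard_normal((d_out, r))     # stand-in for a trained LoRA factor

# During finetuning the adapted layer computes W x + B A x.
x = rng.standard_normal(d_in)
y_finetuned = W @ x + B @ (A @ x)

# After finetuning, the low-rank update is merged into the base weights,
# so inference needs no extra matrices:
W_merged = W + B @ A
y_merged = W_merged @ x

assert np.allclose(y_finetuned, y_merged)   # merged weights are equivalent

# Far fewer trainable parameters than full finetuning:
print(A.size + B.size, "trainable vs", W.size, "frozen")  # 32 vs 64
```

This is why LoRA finetuning is faster and more memory efficient: only `A` and `B` receive gradients, and the merge step makes the finetuned model as cheap to serve as the original.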