Update README.md
README.md CHANGED
@@ -1,9 +1,8 @@
 ---
-base_model:
+base_model: Qwen2.5-3B-Instruct
 tags:
 - text-generation-inference
 - transformers
-- unsloth
 - qwen2
 - trl
 - sft
@@ -53,5 +52,8 @@ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 ```
 
 # Notes:
--
+- Suited to small datasets with narrow content that the model already knows well, where you don't want the model to forget that knowledge during fine-tuning.
 - Fine-tuned with LoRA: rank = 16, alpha = 32, 1 epoch
+
+# Improvement
+- Increasing the rank can help the model better fit the structure of your data.