HuggingFaceTB/SmolVLM-Instruct-DPO

Image-Text-to-Text


kashif (HF staff) committed 16 days ago

Commit a52815c (1 parent: cdc9683)

Update README.md

Files changed (1)

README.md +3 -1


@@ -98,6 +98,8 @@ print(generated_texts[0])
 ### Training Procedure
+See detailed blog on preference tuning VLLMs [here](https://huggingface.co/blog/dpo_vlm).
 ```bash
 accelerate launch  --config_file examples/accelerate_configs/multi_gpu.yaml \
   examples/scripts/dpo_vlm.py \
@@ -110,7 +112,7 @@ accelerate launch  --config_file examples/accelerate_configs/multi_gpu.yaml \
   --bf16 \
   --torch_dtype bfloat16 \
   --use_peft \
-  --lora_target_modules=all-linear exit
+  --lora_target_modules=all-linear
 ```
 ### Framework versions