kashif HF staff committed on
Commit a52815c
1 Parent(s): cdc9683

Update README.md

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -98,6 +98,8 @@ print(generated_texts[0])

  ### Training Procedure

+ See detailed blog on preference tuning VLLMs [here](https://huggingface.co/blog/dpo_vlm).
+
  ```bash
  accelerate launch --config_file examples/accelerate_configs/multi_gpu.yaml \
  examples/scripts/dpo_vlm.py \
@@ -110,7 +112,7 @@ accelerate launch --config_file examples/accelerate_configs/multi_gpu.yaml \
  --bf16 \
  --torch_dtype bfloat16 \
  --use_peft \
- --lora_target_modules=all-linear exit
+ --lora_target_modules=all-linear
  ```

  ### Framework versions