Orion-zhen commited on
Commit
3f1a6c0
1 Parent(s): 9bd3fb7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -3
README.md CHANGED
@@ -1,3 +1,34 @@
1
- ---
2
- license: gpl-3.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: gpl-3.0
3
+ datasets:
4
+ - NobodyExistsOnTheInternet/ToxicQAFinal
5
+ - anthracite-org/kalo-opus-instruct-22k-no-refusal
6
+ - Orion-zhen/dpo-toxic-zh
7
+ - unalignment/toxic-dpo-v0.2
8
+ - Crystalcareai/Intel-DPO-Pairs-Norefusals
9
+ language:
10
+ - zh
11
+ - en
12
+ base_model:
13
+ - Qwen/Qwen2.5-7B-Instruct
14
+ pipeline_tag: text-generation
15
+ tags:
16
+ - qwen
17
+ - uncensored
18
+ ---
19
+
20
+ # Qwen2.5-7B-Instruct-Uncensored
21
+
22
+ This model is an uncensored fine-tune version of Qwen2.5-7B-Instruct. However, I can still notice that though uncensored, the model fails to generate detailed descriptions on certain extreme scenarios, which might be associated with deletion on some pretrain datasets in Qwen's pretraining stage.
23
+
24
+ ## Traning details
25
+
26
+ I used SFT + DPO to ensure uncensorment as well as trying to maintain original model's capabilities.
27
+
28
+ - SFT:
29
+ - NobodyExistsOnTheInternet/ToxicQAFinal
30
+ - anthracite-org/kalo-opus-instruct-22k-no-refusal
31
+ - DPO:
32
+ - Orion-zhen/dpo-toxic-zh
33
+ - unalignment/toxic-dpo-v0.2
34
+ - Crystalcareai/Intel-DPO-Pairs-Norefusals