3 2

Jack Zhang

jackzhang

http://jackz.io/

AI & ML interests

None yet

Recent Activity

authored a paper about 2 months ago

Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data

authored a paper about 2 months ago

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

upvoted a paper about 2 months ago

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

View all activity

Organizations

jackzhang's activity

authored 2 papers about 2 months ago

Verifiable by Design: Aligning Language Models to Quote from Pre-Training Data

Paper • 2404.03862 • Published Apr 5

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

Paper • 2410.08968 • Published Oct 11 • 12

upvoted a paper about 2 months ago

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

Paper • 2410.08968 • Published Oct 11 • 12

commented a paper about 2 months ago

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

Paper • 2410.08968 • Published Oct 11 • 12 •

authored a paper about 2 months ago

RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

Paper • 2410.01044 • Published Oct 1 • 34

upvoted a paper 2 months ago

RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

Paper • 2410.01044 • Published Oct 1 • 34

updated a model 2 months ago

jackzhang/llama3.1-8b-instruct-SFT-V5-bt_wg-addr_imp-DPO_131676

Updated Sep 27 • 105

New activity in RLHFlow/LLaMA3-SFT 2 months ago

LLaMA3.1-SFT

#3 opened 3 months ago by

jackzhang

updated 4 datasets 3 months ago

updated 2 datasets 4 months ago

jackzhang/V2-given_sys-ah-train-no_em

Viewer • Updated Aug 6 • 61.1k • 33

jackzhang/bt_multi_4-V1-given_sys_combine-test

Viewer • Updated Aug 5 • 3.45k • 32

New activity in mistralai/Mistral-Nemo-Instruct-2407 4 months ago

chat_template seems like it was converted incorrectly.

#10 opened 5 months ago by

xzuyn

updated 5 datasets 4 months ago

jackzhang/BeaverTails-dedupprompt_model-gpt-4o_harmful_cat_judge_clustercat_cot-improved

Viewer • Updated Jul 29 • 34.2k • 53

jackzhang/BeaverTails-dedupprompt_model-gpt-4-32k_harmful_cat_clustercat_cot-improved

Viewer • Updated Jul 29 • 34.2k • 37

jackzhang/BeaverTails-dedupprompt_model-gpt-4o_harmful_cat_judge_clustercat

Viewer • Updated Jul 29 • 34.2k • 36 • 2

jackzhang/train-llama3_safegen-bt_helpgen-mixed_mode

Viewer • Updated Jul 23 • 30.8k • 39

jackzhang/eval-llama3_safegen-bt_helpgen-mixed_mode

Viewer • Updated Jul 23 • 1.6k • 35