Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3
2
172
Jian Hu
chuyi777
Follow
jovanywang's profile picture
1 follower
·
1 following
https://hujian.website
hijkzzz
AI & ML interests
Reinforcement Learning
Recent Activity
updated
a model
about 17 hours ago
OpenRLHF/Llama-3-8b-rm-mixture
updated
a model
about 17 hours ago
OpenRLHF/Llama-2-7b-rm-anthropic_hh-lmsys-oasst-webgpt
updated
a model
about 17 hours ago
OpenRLHF/Llama-3-8b-rm-700k
View all activity
Organizations
chuyi777
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
27 days ago
O1-OPEN/OpenO1-LLama-8B-v0.1
Updated
Oct 8
•
10.1k
•
10
liked
3 models
about 1 month ago
AI-MO/NuminaMath-7B-TIR
Text Generation
•
Updated
Aug 14
•
3.85k
•
321
Nexusflow/Athene-70B
Text Generation
•
Updated
16 days ago
•
8.31k
•
191
peiyi9979/mistral-7b-sft
Text Generation
•
Updated
Jan 15
•
2.62k
•
7
liked
2 datasets
about 2 months ago
nvidia/HelpSteer2
Viewer
•
Updated
Oct 15
•
21.4k
•
5.84k
•
376
GAIR/o1-journey
Viewer
•
Updated
Oct 16
•
327
•
1.09k
•
92
liked
2 models
about 2 months ago
peiyi9979/math-shepherd-mistral-7b-rl
Text Generation
•
Updated
Jan 15
•
974
•
5
peiyi9979/math-shepherd-mistral-7b-prm
Text Generation
•
Updated
Jan 15
•
27.4k
•
32
liked
a dataset
2 months ago
peiyi9979/Math-Shepherd
Viewer
•
Updated
Jan 3
•
445k
•
644
•
74
liked
a model
2 months ago
Qwen/Qwen2.5-Math-RM-72B
Text Classification
•
Updated
about 1 month ago
•
9.97k
•
56
liked
a dataset
3 months ago
Skywork/Skywork-Reward-Preference-80K-v0.1
Viewer
•
Updated
Oct 25
•
82k
•
649
•
38
liked
3 models
3 months ago
ai21labs/AI21-Jamba-1.5-Large
Text Generation
•
Updated
Sep 17
•
3.25k
•
198
microsoft/Phi-3.5-vision-instruct
Image-Text-to-Text
•
Updated
Sep 26
•
973k
•
590
microsoft/Phi-3.5-mini-instruct
Text Generation
•
Updated
Sep 18
•
487k
•
•
661
liked
a dataset
4 months ago
Birchlabs/openai-prm800k-stepwise-critic
Viewer
•
Updated
Jun 3, 2023
•
1.09M
•
346
•
43
liked
a model
5 months ago
mistralai/Codestral-22B-v0.1
Text Generation
•
Updated
Jul 31
•
13.1k
•
1.15k
liked
a model
6 months ago
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
Updated
Oct 14
•
17.2k
•
48
liked
a dataset
6 months ago
RLHFlow/prompt-collection-v0.1
Viewer
•
Updated
May 8
•
179k
•
40
•
8
liked
a model
6 months ago
RLHFlow/pair-preference-model-LLaMA3-8B
Text Generation
•
Updated
Oct 14
•
1.92k
•
36
liked
a dataset
6 months ago
weqweasdas/preference_dataset_mixture2_and_safe_pku
Viewer
•
Updated
Apr 29
•
555k
•
48
•
9
Load more