weishen

fakerbaby

AI & ML interests

NLP, alignment, LLM

fakerbaby's activity

upvoted a collection 1 day ago

Medical QA Datasets

A collection of medical question answering (QA) datasets • 20 items • Updated 6 days ago • 25

liked 2 datasets 26 days ago

yingyingzhang/metamath-qwen2-math

Viewer • Updated Oct 1 • 467k • 239 • 14

nvidia/OpenMathInstruct-2

Viewer • Updated 5 days ago • 22M • 10.9k • 115

liked 3 datasets about 1 month ago

KbsdJames/Omni-MATH

Viewer • Updated Oct 12 • 4.43k • 872 • 58

Skywork/Skywork-Reward-Preference-80K-v0.2

Viewer • Updated Oct 25 • 77k • 1.36k • 21

AI-MO/aimo-validation-aime

Viewer • Updated Jul 10 • 90 • 744 • 13

Reacted to onekq's post with 👍 2 months ago

Here is my latest study on OpenAI🍓o1🍓.
A Case Study of Web App Coding with OpenAI Reasoning Models (2409.13773)

I wrote an easy-to-read blog post to explain the findings.
https://huggingface.co/blog/onekq/daily-software-engineering-work-reasoning-models

INSTRUCTION FOLLOWING is the key.

100% instruction following + Reasoning = new SOTA

But if the model misses or misunderstands one instruction, it can perform far worse than non-reasoning models.

upvoted a collection 2 months ago

Infinity Instruct

16 items • Updated Oct 24 • 6

liked 3 datasets 2 months ago

Magpie-Align/MagpieLM-SFT-Data-v0.1

Viewer • Updated Sep 18 • 550k • 179 • 15

MARIO-Math-Reasoning/Gaokao2023-Math-En

Viewer • Updated Jun 1 • 385 • 35 • 5

hfl/stem_zh_instruction

Viewer • Updated May 13 • 256k • 163 • 22

liked a Space 2 months ago

Qwen2.5

liked a Space 3 months ago

Chat-with-OpenAI-o1

upvoted a collection 3 months ago

DeepSeekCoder-V2

6 items • Updated Sep 5 • 83

liked a Space 3 months ago

Big Code Models Leaderboard

liked 3 datasets 3 months ago

BAAI/TACO

Updated Jun 19 • 1.22k • 71

BAAI/Infinity-Preference

Viewer • Updated Aug 30 • 59.4k • 169 • 62

argilla/magpie-ultra-v0.1

Viewer • Updated 5 days ago • 50k • 345 • 216

liked 2 datasets 4 months ago

AI-MO/NuminaMath-CoT

Viewer • Updated 6 days ago • 860k • 2.48k • 248

liwu/MNBVC

Updated Aug 23 • 21.1k • 489