3 6 4

czl

Lin1557

AI & ML interests

None yet

Recent Activity

commented a paper 1 day ago

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

upvoted a paper 1 day ago

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

upvoted a paper about 1 month ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

View all activity

Organizations

Lin1557's activity

commented a paper 1 day ago

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published 2 days ago • 40 •

upvoted a paper 1 day ago

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published 2 days ago • 40

upvoted a paper about 1 month ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 56

commented a paper about 1 month ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 56 •

authored a paper about 1 month ago

CriticBench: Benchmarking LLMs for Critique-Correct Reasoning

Paper • 2402.14809 • Published Feb 22, 2024 • 3

upvoted a paper about 1 month ago

CriticBench: Benchmarking LLMs for Critique-Correct Reasoning

Paper • 2402.14809 • Published Feb 22, 2024 • 3

liked a dataset about 2 months ago

Randolphzeng/Mr-GSM8K

Viewer • Updated Jan 7, 2024 • 3k • 66 • 8

upvoted a paper about 2 months ago

Large Language Models Can Self-Improve in Long-context Reasoning

Paper • 2411.08147 • Published Nov 12, 2024 • 63

liked a dataset 2 months ago

xinlai/Math-Step-DPO-10K

Viewer • Updated Jul 4, 2024 • 10.8k • 661 • 46

upvoted a paper 3 months ago

A Survey on the Honesty of Large Language Models

Paper • 2409.18786 • Published Sep 27, 2024 • 32

liked a Space 6 months ago

Running

312

📐

Reward Bench Leaderboard

upvoted a paper 7 months ago

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Paper • 2406.09961 • Published Jun 14, 2024 • 55

liked a dataset 8 months ago

llm-agents/CriticBench

Viewer • Updated Feb 23, 2024 • 3.83k • 128 • 10

New activity in philschmid/shepherd-2-hf-int4 12 months ago

Upload adapter_config.json

#2 opened 12 months ago by

Lin1557