-
Scaling LLM Inference with Optimized Sample Compute Allocation
Paper • 2410.22480 • Published -
Test-time Computing: from System-1 Thinking to System-2 Thinking
Paper • 2501.02497 • Published • 36 -
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective
Paper • 2412.14135 • Published -
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though
Paper • 2501.04682 • Published • 78
Alex Ngai
alexngai
AI & ML interests
None yet
Recent Activity
updated
a collection
about 8 hours ago
Test-Time Compute/Optimal Scaling
updated
a collection
1 day ago
Self-Improving Agents
updated
a collection
1 day ago
Self-Improving Agents
Organizations
Collections
8
models
None public yet
datasets
None public yet