Test-Time Compute/Optimal Scaling - a alexngai Collection

updated about 8 hours ago

Scaling LLM Inference with Optimized Sample Compute Allocation

Paper • 2410.22480 • Published Oct 29, 2024

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published 10 days ago • 36

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective

Paper • 2412.14135 • Published 28 days ago

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Paper • 2501.04682 • Published 7 days ago • 78

O1 Replication Journey: A Strategic Progress Report -- Part 1

Paper • 2410.18982 • Published Oct 8, 2024 • 3

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 4 days ago • 19