arxiv:2411.04282
Liu
Zuxin
AI & ML interests
Reinforcement learning, imitation learning
Recent Activity
authored
a paper
18 days ago
Language Models are Hidden Reasoners: Unlocking Latent Reasoning
Capabilities via Self-Rewarding
Organizations
Papers
10
models
None public yet
datasets
None public yet