arxiv:2412.11605
Yida Lu
lrxl
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Agent-SafetyBench: Evaluating the Safety of LLM Agents
authored a paper 9 days ago
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Organizations
None yet
Papers
2
Models
None public yet