Agentic-ly agentic - a bfuzzy1 Collection

bfuzzy1 's Collections

Agents

Agentic-ly agentic

Generation Nation

Don't hate - evaluate

Nifty

Agentic-ly agentic

updated Oct 17

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15 • 38
On the limits of agency in agent-based models

Paper • 2409.10568 • Published Sep 14 • 12
On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16 • 12
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12 • 66
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12 • 43
Agent Workflow Memory

Paper • 2409.07429 • Published Sep 11 • 27
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance

Paper • 2409.04593 • Published Sep 6 • 23
Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135
Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25 • 59
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models

Paper • 2410.11710 • Published Oct 15 • 18