-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 143 -
Orion-14B: Open-source Multilingual Large Language Models
Paper • 2401.12246 • Published • 11 -
MambaByte: Token-free Selective State Space Model
Paper • 2401.13660 • Published • 50 -
MM-LLMs: Recent Advances in MultiModal Large Language Models
Paper • 2401.13601 • Published • 44
Collections
Discover the best community collections!
Collections including paper arxiv:2409.00729
-
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Paper • 2408.15545 • Published • 34 -
Controllable Text Generation for Large Language Models: A Survey
Paper • 2408.12599 • Published • 62 -
To Code, or Not To Code? Exploring Impact of Code in Pre-training
Paper • 2408.10914 • Published • 40 -
Automated Design of Agentic Systems
Paper • 2408.08435 • Published • 38
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 54 -
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Paper • 2403.12968 • Published • 24 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 67 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper • 2403.09629 • Published • 72
-
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics
Paper • 2410.21272 • Published • 1 -
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders
Paper • 2410.20526 • Published • 1 -
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
Paper • 2410.15999 • Published • 17 -
Decomposing The Dark Matter of Sparse Autoencoders
Paper • 2410.14670 • Published • 1
-
A Bi-Step Grounding Paradigm for Large Language Models in Recommendation Systems
Paper • 2308.08434 • Published • 1 -
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning
Paper • 2302.02662 • Published • 1 -
Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning
Paper • 2309.01352 • Published • 1 -
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies
Paper • 2308.03188 • Published • 2
-
TRAMS: Training-free Memory Selection for Long-range Language Modeling
Paper • 2310.15494 • Published • 1 -
A Long Way to Go: Investigating Length Correlations in RLHF
Paper • 2310.03716 • Published • 9 -
YaRN: Efficient Context Window Extension of Large Language Models
Paper • 2309.00071 • Published • 65 -
Giraffe: Adventures in Expanding Context Lengths in LLMs
Paper • 2308.10882 • Published • 1