Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2409.00729

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 143
Orion-14B: Open-source Multilingual Large Language Models

Paper • 2401.12246 • Published Jan 20 • 11
MambaByte: Token-free Selective State Space Model

Paper • 2401.13660 • Published Jan 24 • 50
MM-LLMs: Recent Advances in MultiModal Large Language Models

Paper • 2401.13601 • Published Jan 24 • 44

SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published Aug 28 • 34
Controllable Text Generation for Large Language Models: A Survey

Paper • 2408.12599 • Published Aug 22 • 62
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20 • 40
Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15 • 38

community-datasets/doqa

Updated Jan 18 • 91 • 1
metaeval/reclor

Viewer • Updated May 31, 2023 • 5.14k • 478 • 9
community-datasets/so_stacksample

Updated Jan 18 • 71 • 4
community-datasets/yahoo_answers_topics

Viewer • Updated Jun 24 • 1.46M • 1.2k • 52

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Paper • 2403.09029 • Published Mar 14 • 54
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression

Paper • 2403.12968 • Published Mar 19 • 24
RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15 • 67
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14 • 72

🔍 Daily Picks in Interpretability & Analysis of LMs

Outstanding research in interpretability and evaluation of language models, summarized

Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics

Paper • 2410.21272 • Published 12 days ago • 1
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Paper • 2410.20526 • Published 13 days ago • 1
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published 20 days ago • 17
Decomposing The Dark Matter of Sparse Autoencoders

Paper • 2410.14670 • Published 22 days ago • 1

A Bi-Step Grounding Paradigm for Large Language Models in Recommendation Systems

Paper • 2308.08434 • Published Aug 16, 2023 • 1
Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

Paper • 2302.02662 • Published Feb 6, 2023 • 1
Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill Learning

Paper • 2309.01352 • Published Sep 4, 2023 • 1
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

Paper • 2308.03188 • Published Aug 6, 2023 • 2

TRAMS: Training-free Memory Selection for Long-range Language Modeling

Paper • 2310.15494 • Published Oct 24, 2023 • 1
A Long Way to Go: Investigating Length Correlations in RLHF

Paper • 2310.03716 • Published Oct 5, 2023 • 9
YaRN: Efficient Context Window Extension of Large Language Models

Paper • 2309.00071 • Published Aug 31, 2023 • 65
Giraffe: Adventures in Expanding Context Lengths in LLMs

Paper • 2308.10882 • Published Aug 21, 2023 • 1

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs