Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2410.01201

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 46

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 46
MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms

Paper • 2410.18977 • Published 14 days ago • 13

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 46

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 46

RNNs as alternative

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 46

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 46

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2 • 46
MM-Lego: Modular Biomedical Multimodal Models with Minimal Fine-Tuning

Paper • 2405.19950 • Published May 30 • 1

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28 • 83
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3 • 80
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers

Paper • 2408.06195 • Published Aug 12 • 61
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Paper • 2405.06682 • Published May 5 • 3

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26 • 78
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 104
ReFT: Representation Finetuning for Language Models

Paper • 2404.03592 • Published Apr 4 • 89
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Paper • 2404.03715 • Published Apr 4 • 60

Locutusque/arc-cot

Viewer • Updated Mar 13 • 1.07k • 74 • 20
microsoft/orca-math-word-problems-200k

Viewer • Updated Mar 4 • 200k • 1.79k • 414
gretelai/synthetic_text_to_sql

Viewer • Updated May 10 • 106k • 2.64k • 420
Beehzod/uzbek_speech_data

Viewer • Updated Aug 1 • 407 • 73

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs