-
openchat/openchat-3.5-1210
Text Generation • Updated • 9.5k • 276 -
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Paper • 2401.04081 • Published • 70 -
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 72 -
Babelscape/rebel-large
Text2Text Generation • Updated • 14.6k • 211
Collections
Discover the best community collections!
Collections including paper arxiv:2412.08905