MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 1 day ago • 195 • 1
Scaling Laws for Linear Complexity Language Models Paper • 2406.16690 • Published Jun 24, 2024 • 23 • 4
Scaling Laws for Linear Complexity Language Models Paper • 2406.16690 • Published Jun 24, 2024 • 23 • 4