Nlp - a anirbandas Collection

anirbandas 's Collections

Nlp

Nlp

updated Apr 10

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

Paper • 2404.05567 • Published Apr 8 • 10