Universal Language Model Fine-tuning for Text Classification Paper • 1801.06146 • Published Jan 18, 2018
Exploiting Similarities among Languages for Machine Translation Paper • 1309.4168 • Published Sep 17, 2013
Theory, Analysis, and Best Practices for Sigmoid Self-Attention Paper • 2409.04431 • Published Sep 6, 2024