Article • Releasing the largest multilingual open pretraining dataset • By Pclanglais • Nov 13, 2024
Article • A failed experiment: Infini-Attention, and why we should keep trying? • Aug 14, 2024
Article • Welcome FalconMamba: The first strong attention-free 7B model • Aug 12, 2024
Paper • TransformerFAM: Feedback attention is working memory • 2404.09173 • Published Apr 14, 2024
Article • Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent • Apr 22, 2024
Paper • Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks • 2402.04248 • Published Feb 6, 2024
Paper • Large Language Models as Generalizable Policies for Embodied Tasks • 2310.17722 • Published Oct 26, 2023
Paper • Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization • 2308.02151 • Published Aug 4, 2023