Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Articles

Universal Assisted Generation: Faster Decoding with Any Assistant Model

about 1 month ago

Faster Assisted Generation with Dynamic Speculation

Llama can now see and run on your device - welcome Llama 3.2

FineVideo: behind the scenes

How NuminaMath Won the 1st AIMO Progress Prize

Welcome Gemma 2 - Google's new open LLM

Constitutional AI with Open LLMs

Preference Tuning LLMs with Direct Preference Optimization Methods

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

Fine-tuning Llama 2 70B using PyTorch FSDP

Code Llama: Llama 2 learns to code

Llama 2 is here - get it on Hugging Face

Can foundation models label data like humans?

The Falcon has landed in the Hugging Face ecosystem

Creating a Coding Assistant with StarCoder

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Red-Teaming Large Language Models

Diffusion Models Live Event

Very Large Language Models and How to Evaluate Them

SetFit: Efficient Few-Shot Learning Without Prompts

Announcing Evaluation on the Hub

Organizations

lewtun's activity

upvoted a collection 27 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated about 10 hours ago • 181

upvoted a paper 30 days ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25 • 79

upvoted a paper about 1 month ago

AutoTrain: No-code training for state-of-the-art models

Paper • 2410.15735 • Published Oct 21 • 57

upvoted a paper about 2 months ago

Falcon Mamba: The First Competitive Attention-free 7B Language Model

Paper • 2410.05355 • Published Oct 7 • 29

upvoted an article about 2 months ago

Article

Faster Assisted Generation with Dynamic Speculation

Oct 8

• 34

upvoted a collection about 2 months ago

Critique-out-Loud Reward Models

Paper: https://arxiv.org/abs/2408.11791 | Code: https://github.com/zankner/CLoud • 7 items • Updated Sep 5 • 3

upvoted a paper 2 months ago

Style over Substance: Failure Modes of LLM Judges in Alignment Benchmarking

Paper • 2409.15268 • Published Sep 23 • 12

upvoted a paper 3 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 121

upvoted an article 3 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19

• 73

upvoted a paper 3 months ago

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13 • 20

upvoted an article 3 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

Aug 14

• 50

upvoted a paper 4 months ago

Instruction-Following Evaluation for Large Language Models

Paper • 2311.07911 • Published Nov 14, 2023 • 19

upvoted an article 4 months ago

Article

Tool Use, Unified

Aug 12

• 64

upvoted 2 collections 4 months ago

🍃 MINT-1T

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24 • 54

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21 • 64

upvoted an article 4 months ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18

• 68

upvoted a paper 4 months ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15 • 156

upvoted 2 articles 5 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11

• 104

Article

Preference Optimization for Vision Language Models

Jul 10

• 46

upvoted an article 6 months ago

Article

Putting RL back in RLHF

Jun 12

• 62