Gabriele Sarti

gsarti

https://gsarti.com

AI & ML interests

Interpretability for generative language models

Recent Activity

liked a model about 6 hours ago

Qwen/QwQ-32B-Preview

liked a Space about 6 hours ago

Qwen/QwQ-32B-preview

upvoted a collection about 8 hours ago

NLI Eval Datasets

gsarti's activity

upvoted 2 collections about 8 hours ago

NLI Eval Datasets

A curated collection of NLI evaluation datasets. Each dataset is exactly as originally proposed • 19 items • Updated 15 days ago • 3

🇮🇹👓 LLaVA-NDiNO

HF Collection for the models of the paper "LLaVA-NDiNO: Empowering LLMs with Multimodality for the Italian Language" • 7 items • Updated Oct 20 • 2

upvoted a paper about 18 hours ago

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published 1 day ago • 49

upvoted a collection 1 day ago

SmolVLM

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 1 day ago • 18

upvoted an article 3 days ago

Halo: Open Source Health Tracking with Wearables

Article • Published 8 days ago • 83

upvoted 3 papers 6 days ago

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published 6 days ago • 9

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Paper • 2411.12580 • Published 8 days ago • 2

Controllable Context Sensitivity and the Knob Behind It

Paper • 2411.07404 • Published 16 days ago • 1

upvoted 2 papers 10 days ago

Features that Make a Difference: Leveraging Gradients for Improved Dictionary Learning

Paper • 2411.10397 • Published 12 days ago • 1

Counterfactual Generation from Language Models

Paper • 2411.07180 • Published 16 days ago • 5

upvoted a collection 27 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated about 12 hours ago • 181

upvoted 3 papers 29 days ago

The Geometry of Concepts: Sparse Autoencoder Feature Structure

Paper • 2410.19750 • Published Oct 10 • 1

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Paper • 2410.20526 • Published Oct 27 • 1

Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics

Paper • 2410.21272 • Published about 1 month ago • 1

upvoted 4 papers about 1 month ago

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21 • 19

Automatically Interpreting Millions of Features in Large Language Models

Paper • 2410.13928 • Published Oct 17 • 1

Decomposing The Dark Matter of Sparse Autoencoders

Paper • 2410.14670 • Published Oct 18 • 1

How Do Multilingual Models Remember? Investigating Multilingual Factual Recall Mechanisms

Paper • 2410.14387 • Published Oct 18 • 1

upvoted 2 papers about 2 months ago

Towards Interpreting Visual Information Processing in Vision-Language Models

Paper • 2410.07149 • Published Oct 9 • 1

What Matters for Model Merging at Scale?

Paper • 2410.03617 • Published Oct 4 • 8