1 4 2

Jonathan Lorraine

lorraine2

https://www.cs.toronto.edu/~lorraine/

AI & ML interests

machine learning, computer vision, generative AI

Recent Activity

posted an update about 9 hours ago

New NeurIPS paper: “Training Data Attribution via Approximate Unrolling” Ever wondered how individual data points influence AI decisions? 🤔 We explore how specific training data pieces affect machine learning models' behavior, which can be crucial for making AI systems more transparent, trustworthy, and fair. Our method, SOURCE, bridges the gap between implicit differentiation and unrolling approaches, combining computational efficiency with flexibility making it suitable for non-converged models and multi-stage training pipelines. 📄 Full paper: https://openreview.net/pdf?id=3NaqGg92KZ Juhan Bae led along with Wu Lin and Roger Grosse. Supported by the University of Toronto, Vector Institute, NVIDIA, and Anthropic

liked a model 9 days ago

Zhengyi/LLaMA-Mesh

liked a Space 9 days ago

Zhengyi/LLaMA-Mesh

View all activity

Organizations

Posts 5

Post

134

New NeurIPS paper: “Training Data Attribution via Approximate Unrolling”

Ever wondered how individual data points influence AI decisions? 🤔 We explore how specific training data pieces affect machine learning models' behavior, which can be crucial for making AI systems more transparent, trustworthy, and fair.

Our method, SOURCE, bridges the gap between implicit differentiation and unrolling approaches, combining computational efficiency with flexibility making it suitable for non-converged models and multi-stage training pipelines.

📄 Full paper: https://openreview.net/pdf?id=3NaqGg92KZ

Juhan Bae led along with Wu Lin and Roger Grosse.

Supported by the University of Toronto, Vector Institute, NVIDIA, and Anthropic

Post

561

🚨 Code now available for "Using Large Language Models for Hyperparameter Optimization" at https://github.com/michaelrzhang/LLM-HyperOpt 🚨

TLDR: You can just ask LLMs which hyperparameters to use, and it works pretty well! You can even directly optimize your model’s code as a hyperparameter with this.

Check out the paper at https://arxiv.org/abs/2312.04528 - with Michael Zhang, Nishkrit Desai, Juhan Bae, and Jimmy Ba

View all posts

Papers 18

models

None public yet

datasets

None public yet