Sayak Paul's picture

Sayak Paul

sayakpaul

·

https://sayak.dev

AI & ML interests

Diffusion models, representation learning

Recent Activity

New activity about 21 hours ago

sayakpaul/mochi-lora:The uploaded videos are completely broken.

upvoted an article 1 day ago

Let’s make a generation of amazing image generation models

New activity 1 day ago

Spawning/PD12M:Okay to host the downloaded images?

View all activity

Articles

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Memory-efficient Diffusion Transformers with Quanto and Diffusers

🧨 Diffusers welcomes Stable Diffusion 3

🤗 PEFT welcomes new merging methods

Welcome aMUSEd: Efficient Text-to-Image Generation

SDXL in 4 steps with Latent Consistency LoRAs

Personal Copilot: Train Your Own Coding Assistant

Exploring simple optimizations for SDXL

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Efficient Controllable Generation for SDXL with T2I-Adapters

Happy 1st anniversary 🤗 Diffusers!

Optimizing Stable Diffusion for Intel CPUs with NNCF and 🤗 Optimum

Instruction-tuning Stable Diffusion with InstructPix2Pix

Training a language model with 🤗 Transformers using TensorFlow and TPUs

ControlNet in Diffusers 🧨

🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware

A Dive into Pretraining Strategies for Vision-Language Models

The State of Computer Vision at Hugging Face 🤗

Using LoRA for Efficient Stable Diffusion Fine-Tuning

Image Similarity with Hugging Face Datasets and Transformers

Deploying 🤗 ViT on Vertex AI

Deploying 🤗 ViT on Kubernetes with TF Serving

Deploying TensorFlow Vision Models in Hugging Face with TF Serving

Organizations

Posts 13

Post

2332

It's been a while we shipped native quantization support in diffusers 🧨

We currently support bistandbytes as the official backend but using others like torchao is already very simple.

This post is just a reminder of what's possible:

1. Loading a model with a quantization config
2. Saving a model with quantization config
3. Loading a pre-quantized model
4. enable_model_cpu_offload()
5. Training and loading LoRAs into quantized checkpoints

Docs:
https://huggingface.co/docs/diffusers/main/en/quantization/bitsandbytes

Post

2697

Did some little experimentation to resize pre-trained LoRAs on Flux. I explored two themes:

* Decrease the rank of a LoRA
* Increase the rank of a LoRA

The first one is helpful in reducing memory requirements if the LoRA is of a high rank, while the second one is merely an experiment. Another implication of this study is in the unification of LoRA ranks when you would like to torch.compile() them.

Check it out here:
sayakpaul/flux-lora-resizing

Collections 2

Papers 11

arxiv:2408.13467

arxiv:2406.06424

arxiv:2404.01197

arxiv:2402.17412

spaces 19

Demo Docker Gradio

Diffusers Docs QA Chatbot

Ask questions to the Diffusers documentation.

Convert Kerascv SD to Diffusers

Inpainting Tool

Generate Custom Pokemons with Stable Diffusion

Evaluate StableDiffusionPipeline with Different Schedulers

models 56

sayakpaul/bnb-single-file-checkpoint-from-civitai

Updated 1 day ago

sayakpaul/mochi-lora

Text-to-Video • Updated 2 days ago • 47 • 2

sayakpaul/FLUX.1-Canny-dev-nf4

Updated 4 days ago • 1

sayakpaul/FLUX.1-Depth-dev-nf4

Updated 4 days ago • 1

sayakpaul/FLUX.1-Fill-dev-nf4

Updated 4 days ago • 6

sayakpaul/flux.1-dev-int8-aot-compiled

Updated 28 days ago • 2

sayakpaul/sd35-large-nf4

Text-to-Image • Updated Oct 27 • 5

sayakpaul/yarn_art_lora_flux_nf4

Text-to-Image • Updated Oct 21 • 63 •

sayakpaul/FLUX.1-merged

Text-to-Image • Updated Oct 8 • 1.17k • 186

sayakpaul/tiny-sd-pipeline-with-lora

Text-to-Image • Updated Sep 28 • 6 • 1

datasets 26

sayakpaul/pick-a-pic-v2-unique-prompts

Viewer • Updated 18 days ago • 59k • 151

sayakpaul/sample-datasets

Viewer • Updated 28 days ago • 6 • 25.5k • 1

sayakpaul/poses-controlnet-dataset

Viewer • Updated Aug 29 • 496 • 64 • 5

sayakpaul/torchao-diffusers

Updated Aug 28 • 123

sayakpaul/pickapic_v2_webdataset

Viewer • Updated Apr 4 • 8.7k • 36.1k

sayakpaul/generated-gemini-responses

Viewer • Updated Apr 1 • 115 • 40

sayakpaul/no_robots_only_coding

Viewer • Updated Mar 20 • 350 • 52 • 1

sayakpaul/diffusers-qa-chatbot-artifacts

Viewer • Updated Mar 9 • 265k • 257 • 1

sayakpaul/coco-30-val-2014

Viewer • Updated Feb 5 • 30k • 1.65k • 6

sayakpaul/drawbench-sdxl-refiner

Viewer • Updated Oct 21, 2023 • 200 • 50 • 1