Kashif Rasul

kashif

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

kashif's activity

liked a dataset 1 day ago

ylecun/mnist

Viewer • Updated Aug 8 • 70k • 38.4k • 132

updated a model 1 day ago

HuggingFaceTB/SmolVLM-Instruct-DPO

Image-Text-to-Text • Updated 1 day ago • 53 • 8

liked a model 4 days ago

apple/coreml-mobileclip

Updated 9 days ago • 256 • 28

liked a model 6 days ago

apple/aimv2-large-patch14-448

Image Feature Extraction • Updated 6 days ago • 228 • 1

liked a dataset 7 days ago

Maple728/Time-300B

Preview • Updated Oct 22 • 3.23k • 10

liked a Space 16 days ago

🥇 GIFT Eval

Running • GIFT-Eval: A Benchmark for General Time Series Forecasting

liked a model about 1 month ago

jimmycarter/LibreFLUX

Text-to-Image • Updated Oct 24 • 1.5k • 147

upvoted a paper about 2 months ago

A Rate-Distortion View of Uncertainty Quantification

Paper • 2406.10775 • Published Jun 16 • 1

updated a dataset 2 months ago

kashif/chronos-preference

Preview • Updated Sep 26 • 42

upvoted a paper 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

updated 3 models 3 months ago

kashif/gkd-model

Updated Sep 8 • 3

kashif/pythia-1b-deduped-tldr-xpo

Updated Sep 7 • 6

kashif/pythia-1b-deduped-tldr-online-dpo

Updated Sep 6

upvoted a paper 3 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7 • 7

upvoted a collection 3 months ago

Power-LM

Collection

Dense & MoE LLMs trained with power learning rate scheduler. • 4 items • Updated Oct 17 • 15

upvoted a paper 3 months ago

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13 • 20

upvoted a paper 4 months ago

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 116

commented on 2 papers 4 months ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 16

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25 • 31

upvoted a paper 4 months ago

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Paper • 2405.21046 • Published May 31 • 3

Articles

How NuminaMath Won the 1st AIMO Progress Prize

Preference Optimization for Vision Language Models

🧨 Diffusers welcomes Stable Diffusion 3

Patch Time Series Transformer in Hugging Face

Constitutional AI with Open LLMs

PatchTSMixer in HuggingFace

Preference Tuning LLMs with Direct Preference Optimization Methods

Finetune Stable Diffusion Models with DDPO via TRL

Introducing Würstchen: Fast Diffusion for Image Generation

Fine-tune Llama 2 with DPO

Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Multivariate Probabilistic Time Series Forecasting with Informer

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Probabilistic Time Series Forecasting with 🤗 Transformers

The Annotated Diffusion Model
