6 129 48

rotem israeli

irotem98

https://rotem154154.github.io

rotem154154

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Efficient Long Video Tokenization via Coordinated-based Patch Reconstruction

liked a model 4 days ago

Efficient-Large-Model/Sana_1600M_1024px

liked a model 4 days ago

Efficient-Large-Model/Sana_1600M_512px

View all activity

Organizations

None yet

irotem98's activity

upvoted a paper 3 days ago

Efficient Long Video Tokenization via Coordinated-based Patch Reconstruction

Paper • 2411.14762 • Published 6 days ago • 10

liked 2 models 4 days ago

Efficient-Large-Model/Sana_1600M_1024px

Text-to-Image • Updated 7 days ago • 78

Efficient-Large-Model/Sana_1600M_512px

Text-to-Image • Updated 7 days ago • 31

liked a Space 11 days ago

Running

🏆

The timm Leaderboard

upvoted 6 papers 14 days ago

EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation

Paper • 2411.08380 • Published 15 days ago • 25

SAMPart3D: Segment Any Part in 3D Objects

Paper • 2411.07184 • Published 16 days ago • 26

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published 15 days ago • 26

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published 16 days ago • 21

Scaling Properties of Diffusion Models for Perceptual Tasks

Paper • 2411.08034 • Published 15 days ago • 13

Wavelet Latent Diffusion (Wala): Billion-Parameter 3D Generative Model with Compact Wavelet Encodings

Paper • 2411.08017 • Published 15 days ago • 11

upvoted 2 papers 23 days ago

GPT or BERT: why not both?

Paper • 2410.24159 • Published 28 days ago • 13

Randomized Autoregressive Visual Generation

Paper • 2411.00776 • Published 26 days ago • 17

upvoted an article 26 days ago

Article

Trick or ResNet Treat

•

27 days ago

• 3

upvoted a paper 26 days ago

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Paper • 2410.22366 • Published about 1 month ago • 75

liked a dataset 27 days ago

visual-layer/imagenet-1k-vl-enriched

Viewer • Updated Sep 16 • 1.33M • 2.21k • 14

liked a model 27 days ago

HuggingFaceTB/SmolLM2-135M-Instruct

Text Generation • Updated 2 days ago • 21.4k • 60

upvoted a collection 27 days ago

SmolLM2

Collection

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated about 15 hours ago • 181

liked a model 27 days ago

FoundationVision/var

Updated Apr 23 • 58

upvoted 2 papers 30 days ago

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25 • 79

A Survey of Small Language Models

Paper • 2410.20011 • Published Oct 25 • 38