5 10 11

Yuanshi

AI & ML interests

Reinforcement Learning; Large Language Model; Multimodality; AI Infrastructure;

Recent Activity

upvoted a paper about 19 hours ago

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

New activity about 22 hours ago

Yuanshi/OminiControl:deactivate server side rendering to avoid css breaks on mobile

upvoted a paper 2 days ago

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

View all activity

Organizations

None yet

Yuanshi's activity

upvoted a paper about 19 hours ago

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published 1 day ago • 46

New activity in Yuanshi/OminiControl about 22 hours ago

deactivate server side rendering to avoid css breaks on mobile

#1 opened 1 day ago by

fffiloni

upvoted a paper 2 days ago

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

Paper • 2411.15466 • Published 5 days ago • 32

New activity in Yuanshi/OminiControl 2 days ago

Add model card

#2 opened 2 days ago by

nielsr

upvoted a paper 2 days ago

Style-Friendly SNR Sampler for Style-Driven Generation

Paper • 2411.14793 • Published 6 days ago • 34

upvoted a paper 3 days ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published 5 days ago • 38

liked a model 3 days ago

Yuanshi/OminiControl

Image-to-Image • Updated 2 days ago • 29

commented a paper 3 days ago

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published 5 days ago • 38 •

liked a Space 3 days ago

Running on L40S

108

🌍

OminiControl

updated a Space 3 days ago

Running on L40S

108

🌍

OminiControl

updated a model 4 days ago

Yuanshi/OminiControl

Image-to-Image • Updated 2 days ago • 29

liked a dataset 15 days ago

jackyhate/text-to-image-2M

Viewer • Updated Sep 22 • 649k • 5.06k • 38

liked 2 models about 1 month ago

LiheYoung/depth-anything-small-hf

Depth Estimation • Updated Jan 25 • 83.8k • 27

Zigeng/SlimSAM-uniform-50

Mask Generation • Updated 20 days ago • 4.11k • 12

upvoted a paper about 2 months ago

Attention Prompting on Image for Large Vision-Language Models

Paper • 2409.17143 • Published Sep 25 • 7

liked 2 models 2 months ago

Yuanshi/LinFusion-2-1

Text-to-Image • Updated Sep 13 • 17 • 3

Yuanshi/LinFusion-XL

Text-to-Image • Updated Sep 13 • 61 • 4

updated 3 models 3 months ago