Ziyang Luo

Ziyang

https://chiyeunglaw.github.io/

ChiYeungLaw

AI & ML interests

LLMs, Multimodal ML

Ziyang's activity

upvoted an article about 2 months ago

Article

The Annotated Diffusion Model

Jun 7, 2022

• 101

upvoted a paper about 2 months ago

Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution

Paper • 2310.16834 • Published Oct 25, 2023 • 4

upvoted an article 4 months ago

Article

The Rise of Agentic Data Generation

Jul 15

• 75

upvoted a paper 5 months ago

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Paper • 2406.07476 • Published Jun 11 • 32

upvoted a collection 5 months ago

From screenshots to HTML

Collection

WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. • 4 items • Updated Apr 15 • 18

upvoted a paper 6 months ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11 • 46

upvoted a paper 7 months ago

MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems

Paper • 2404.09486 • Published Apr 15 • 1

upvoted a paper 9 months ago

aMUSEd: An Open MUSE Reproduction

Paper • 2401.01808 • Published Jan 3 • 28

upvoted a paper 10 months ago

GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse

Paper • 2401.01523 • Published Jan 3 • 1

upvoted 2 papers 12 months ago

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 183

LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval

Paper • 2302.02908 • Published Feb 6, 2023 • 1

upvoted a paper over 1 year ago

Demystifying GPT Self-Repair for Code Generation

Paper • 2306.09896 • Published Jun 16, 2023 • 19