kcz's picture

kcz

kcz358

·

kcz358

AI & ML interests

None yet

Recent Activity

updated a dataset 5 days ago

lmms-lab/sae-sample-cache-dataset

updated a model 5 days ago

lmms-lab/llama3-llava-next-8b-hf-sae-131k

authored a paper 5 days ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

View all activity

Organizations

kcz358's activity

upvoted a paper 6 days ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published 8 days ago • 13

upvoted a collection 6 days ago

Multimodal-SAE

The collection of the sae that hooked on llava • 4 items • Updated 5 days ago • 2

upvoted a paper about 1 month ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17 • 74

upvoted a paper about 2 months ago

LLaVA-Critic: Learning to Evaluate Multimodal Models

Paper • 2410.02712 • Published Oct 3 • 34

upvoted a collection about 2 months ago

LLaVA-Critic

as a general evaluator for assessing model performance • 6 items • Updated Oct 6 • 8

upvoted a paper about 2 months ago

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3 • 37

upvoted a collection 2 months ago

LLaVA-Video

Models focus on video understanding (previously known as LLaVA-NeXT-Video). • 6 items • Updated Oct 5 • 55

upvoted 2 collections 3 months ago

LLaVA-Onevision

LLaVa_Onevision models for single-image, multi-image, and video scenarios • 9 items • Updated Sep 18 • 12

LongVA

Long Context Transfer From Text To Vision: https://lmms-lab.github.io/posts/longva/ • 5 items • Updated Oct 4 • 13

upvoted a collection 4 months ago

LLaVA-OneVision

a model good at arbitrary types of visual input • 15 items • Updated Oct 5 • 20

upvoted a paper 4 months ago

LLaVA-OneVision: Easy Visual Task Transfer

Paper • 2408.03326 • Published Aug 6 • 59

upvoted 2 papers 5 months ago

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models

Paper • 2407.12772 • Published Jul 17 • 33

Long Context Transfer from Language to Vision

Paper • 2406.16852 • Published Jun 24 • 32

upvoted a collection 7 months ago

LLaVA-NeXT

Some powerful image models. • 10 items • Updated Oct 14 • 2

upvoted a collection 8 months ago

LMMs-Eval

Dataset Collection of LMMs-Eval • 36 items • Updated Oct 4 • 25