Crocy Cheng

zhycheng4ai

AI & ML interests

None yet

Recent Activity

liked a model 5 days ago

Qwen/Qwen2.5-72B-Instruct

liked a model 5 days ago

NexaAIDev/Qwen2-Audio-7B-GGUF

liked a model 16 days ago

NexaAIDev/omnivision-968M

Organizations

None yet

zhycheng4ai's activity

liked 2 models 5 days ago

Qwen/Qwen2.5-72B-Instruct

Text Generation • Updated Sep 25 • 447k • 523

NexaAIDev/Qwen2-Audio-7B-GGUF

Audio-Text-to-Text • Updated 5 days ago • 9.14k • 87

liked 3 models 16 days ago

NexaAIDev/omnivision-968M

Updated 2 days ago • 10k • 431

OuteAI/OuteTTS-0.1-350M

Text-to-Speech • Updated 3 days ago • 10k • 290

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16 • 1.36M • 6.82k

liked a model 20 days ago

NexaAIDev/Octopus-v2

Text Generation • Updated May 21 • 761 • 860

upvoted 2 papers 20 days ago

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Paper • 2311.05437 • Published Nov 9, 2023 • 48

CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Paper • 2406.18521 • Published Jun 26 • 28

liked a model 20 days ago

liuhaotian/llava-v1.6-vicuna-7b

Image-Text-to-Text • Updated May 9 • 51.3k • 99

upvoted 4 papers 20 days ago

MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding

Paper • 2110.08518 • Published Oct 16, 2021 • 1

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11 • 46

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Paper • 2406.10118 • Published Jun 14 • 30

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents

Paper • 2410.03450 • Published Oct 4 • 36

liked a model 20 days ago

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • Updated Sep 26 • 973k • 590

upvoted a paper 20 days ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 136

liked a model 20 days ago

google/gemma-2-2b-it

Text Generation • Updated Aug 27 • 900k • 706

upvoted a paper 20 days ago

Datasets: A Community Library for Natural Language Processing

Paper • 2109.02846 • Published Sep 7, 2021 • 10

liked a model 20 days ago

meta-llama/Llama-3.2-1B

Text Generation • Updated Oct 24 • 1.58M • 1.08k