3 62 1

ltl

2793145003

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

upvoted a paper about 1 month ago

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

upvoted a paper about 1 month ago

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

View all activity

Organizations

None yet

ltl's activity

upvoted 4 papers about 1 month ago

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8 • 14

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4 • 35

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published Oct 30 • 46

ReferEverything: Towards Segmenting Everything We Can Speak of in Videos

Paper • 2410.23287 • Published Oct 30 • 18

upvoted 8 papers about 2 months ago

D-FINE: Redefine Regression Task in DETRs as Fine-grained Distribution Refinement

Paper • 2410.13842 • Published Oct 17 • 1

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25 • 103

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Paper • 2410.10306 • Published Oct 14 • 53

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10 • 26

upvoted 3 papers 2 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 166

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27 • 92

Making Text Embedders Few-Shot Learners

Paper • 2409.15700 • Published Sep 24 • 29

upvoted a paper 3 months ago

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4 • 89

upvoted 4 papers 4 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8 • 155

Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

Paper • 2408.02657 • Published Aug 5 • 33

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3 • 79

Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion

Paper • 2408.00458 • Published Aug 1 • 10