Vince's picture

637 49

Vince

bolerovt

·

bolerovt

AI & ML interests

None yet

Organizations

None yet

bolerovt's activity

upvoted 14 papers 4 days ago

LLaMo: Large Language Model-based Molecular Graph Assistant

Paper • 2411.00871 • Published 10 days ago • 19

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Paper • 2410.20280 • Published 14 days ago • 21

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

Paper • 2410.20424 • Published 14 days ago • 36

CLEAR: Character Unlearning in Textual and Visual Modalities

Paper • 2410.18057 • Published 17 days ago • 197

CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Paper • 2410.23090 • Published 11 days ago • 52

A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks

Paper • 2410.22391 • Published 11 days ago • 21

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Paper • 2410.23218 • Published 10 days ago • 43

Personalization of Large Language Models: A Survey

Paper • 2411.00027 • Published 12 days ago • 28

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Paper • 2411.02265 • Published 5 days ago • 22

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published 5 days ago • 32

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published 5 days ago • 44

AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents

Paper • 2410.24024 • Published 10 days ago • 45

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Paper • 2411.02359 • Published 5 days ago • 12

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published 5 days ago • 52

upvoted 6 papers 11 days ago

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Paper • 2410.10139 • Published 27 days ago • 50

LVD-2M: A Long-take Video Dataset with Temporally Dense Captions

Paper • 2410.10816 • Published 26 days ago • 19

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Paper • 2410.16256 • Published 19 days ago • 58

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published 19 days ago • 65

LOGO -- Long cOntext aliGnment via efficient preference Optimization

Paper • 2410.18533 • Published 17 days ago • 42

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published 19 days ago • 53