- ShowUI: One Vision-Language-Action Model for GUI Visual Agent (arXiv:2411.17465, published Nov 2024, 60 upvotes)
- O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? (arXiv:2411.16489, published Nov 2024, 28 upvotes)
- BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games (arXiv:2411.13543, published Nov 2024, 17 upvotes)
- RedPajama: an Open Dataset for Training Large Language Models (arXiv:2411.12372, published Nov 2024, 47 upvotes)
- LLaVA-o1: Let Vision Language Models Reason Step-by-Step (arXiv:2411.10440, published Nov 2024, 106 upvotes)
- Sharingan: Extract User Action Sequence from Desktop Recordings (arXiv:2411.08768, published Nov 2024, 9 upvotes)
- Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination (arXiv:2411.03823, published Nov 2024, 43 upvotes)
- Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning (arXiv:2410.21845, published Oct 29, 2024, 11 upvotes)
- Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Dataset (arXiv:2410.22325, published Oct 29, 2024, 9 upvotes)
- MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark (arXiv:2410.19168, published Oct 24, 2024, 19 upvotes)
- ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting (arXiv:2410.17856, published Oct 23, 2024, 49 upvotes)
- PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction (arXiv:2410.17247, published Oct 22, 2024, 43 upvotes)
- JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation (arXiv:2410.17250, published Oct 22, 2024, 12 upvotes)
- Agent-to-Sim: Learning Interactive Behavior Models from Casual Longitudinal Videos (arXiv:2410.16259, published Oct 21, 2024, 5 upvotes)