marinaretik

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes

upvoted a paper 1 day ago

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

upvoted a paper 1 day ago

Large Language Model-Brained GUI Agents: A Survey

Organizations

None yet

marinaretik's activity

upvoted 4 papers 1 day ago

upvoted 3 papers 5 days ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published 8 days ago • 54

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published 6 days ago • 28

MH-MoE: Multi-Head Mixture-of-Experts

Paper • 2411.16205 • Published 6 days ago • 21

upvoted 5 papers 7 days ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published 9 days ago • 19

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Paper • 2411.14199 • Published 10 days ago • 25

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published 9 days ago • 52

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published 10 days ago • 37

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published 15 days ago • 61

upvoted 3 papers 10 days ago

Generative World Explorer

Paper • 2411.11844 • Published 12 days ago • 67

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

Paper • 2411.11909 • Published 14 days ago • 20

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published 14 days ago • 47

upvoted 5 papers 13 days ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published 15 days ago • 106

GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation

Paper • 2411.08033 • Published 18 days ago • 21

Number it: Temporal Grounding Videos like Flipping Manga

Paper • 2411.10332 • Published 15 days ago • 12

Xmodel-1.5: An 1B-scale Multilingual LLM

Paper • 2411.10083 • Published 16 days ago • 14

The Dawn of GUI Agent: A Preliminary Case Study with Claude 3.5 Computer Use

Paper • 2411.10323 • Published 15 days ago • 27