Yongxin Guo's picture

Yongxin Guo

Yongxin-Guo

·

https://gyxxyg.github.io/yongxinguo/

gyxxyg

AI & ML interests

None yet

Organizations

Yongxin-Guo's activity

upvoted a paper 10 days ago

GPT-4o System Card

Paper • 2410.21276 • Published 13 days ago • 76

upvoted 2 papers 11 days ago

Can Knowledge Editing Really Correct Hallucinations?

Paper • 2410.16251 • Published 17 days ago • 53

LOGO -- Long cOntext aliGnment via efficient preference Optimization

Paper • 2410.18533 • Published 15 days ago • 42

upvoted a paper 19 days ago

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Paper • 2410.13848 • Published 21 days ago • 27

upvoted a paper 21 days ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published 21 days ago • 74

upvoted a paper 25 days ago

Baichuan-Omni Technical Report

Paper • 2410.08565 • Published 28 days ago • 82

upvoted a paper 29 days ago

TRACE: Temporal Grounding Video LLM via Causal Event Modeling

Paper • 2410.05643 • Published about 1 month ago • 8

upvoted a collection 29 days ago

TRACE

TRACE: Temporal Grounding Video LLM via Casual Event Modeling • 10 items • Updated 7 days ago • 1

upvoted a paper 2 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3 • 77

upvoted 4 papers 3 months ago

Layerwise Recurrent Router for Mixture-of-Experts

Paper • 2408.06793 • Published Aug 13 • 30

MovieSum: An Abstractive Summarization Dataset for Movie Screenplays

Paper • 2408.06281 • Published Aug 12 • 9

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31 • 105

SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1 • 106

upvoted 3 papers 4 months ago

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Paper • 2407.15841 • Published Jul 22 • 39

LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages

Paper • 2407.05975 • Published Jul 8 • 34

Unveiling Encoder-Free Vision-Language Models

Paper • 2406.11832 • Published Jun 17 • 49

upvoted a collection 5 months ago

DynMoE Family

DynMoE model checkpoints and paper on huggingface • 4 items • Updated Aug 19 • 3

upvoted 2 papers 5 months ago

VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Paper • 2405.13382 • Published May 22 • 1

Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

Paper • 2405.14297 • Published May 23 • 2