Tianyi Zhou

zhoutianyi

https://tianyizhou.github.io/

AI & ML interests

ML, NLP, RL, Multi-modality

Recent Activity

authored a paper 8 days ago

GUI Agents: A Survey

upvoted a paper 9 days ago

GUI Agents: A Survey

authored a paper 22 days ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

zhoutianyi's activity

upvoted a paper 9 days ago

GUI Agents: A Survey

Paper • 2412.13501 • Published 10 days ago • 22

upvoted a paper 22 days ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published 22 days ago • 56

upvoted 2 papers about 2 months ago

DynaSaur: Large Language Agents Beyond Predefined Actions

Paper • 2411.01747 • Published Nov 4 • 18

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Paper • 2410.23743 • Published Oct 31 • 59

upvoted 3 papers 2 months ago

Diffusion Curriculum: Synthetic-to-Real Generative Curriculum Learning via Image-Guided Diffusion

Paper • 2410.13674 • Published Oct 17 • 15

BenTo: Benchmark Task Reduction with In-Context Transferability

Paper • 2410.13804 • Published Oct 17 • 19

Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free

Paper • 2410.10814 • Published Oct 14 • 48

upvoted 2 papers 3 months ago

WALL-E: World Alignment by Rule Learning Improves World Model-based LLM Agents

Paper • 2410.07484 • Published Oct 9 • 48

Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA

Paper • 2410.06524 • Published Oct 9 • 4

upvoted a paper 6 months ago

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

Paper • 2406.10900 • Published Jun 16 • 11

upvoted a paper 11 months ago

ODIN: Disentangled Reward Mitigates Hacking in RLHF

Paper • 2402.07319 • Published Feb 11 • 13

upvoted a paper 12 months ago

TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10 • 66

upvoted 3 papers about 1 year ago

Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld

Paper • 2311.16714 • Published Nov 28, 2023 • 1

Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning

Paper • 2310.11716 • Published Oct 18, 2023 • 5

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Paper • 2310.14566 • Published Oct 23, 2023 • 25

upvoted 3 papers over 1 year ago

Diffusion Models Beat GANs on Image Classification

Paper • 2307.08702 • Published Jul 17, 2023 • 17

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 22

InstructZero: Efficient Instruction Optimization for Black-Box Large Language Models

Paper • 2306.03082 • Published Jun 5, 2023 • 5