PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides
Abstract
Automatically generating presentations from documents is a challenging task that requires balancing content quality, visual design, and structural coherence. Existing methods primarily focus on improving and evaluating the content quality in isolation, often overlooking visual design and structural coherence, which limits their practical applicability. To address these limitations, we propose PPTAgent, which comprehensively improves presentation generation through a two-stage, edit-based approach inspired by human workflows. PPTAgent first analyzes reference presentations to understand their structural patterns and content schemas, then drafts outlines and generates slides through code actions to ensure consistency and alignment. To comprehensively evaluate the quality of generated presentations, we further introduce PPTEval, an evaluation framework that assesses presentations across three dimensions: Content, Design, and Coherence. Experiments show that PPTAgent significantly outperforms traditional automatic presentation generation methods across all three dimensions. The code and data are available at https://github.com/icip-cas/PPTAgent.
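The two-stage workflow described above (analyze reference presentations for structural patterns and content schemas, then generate new slides by editing against those schemas) can be sketched as follows. This is a minimal illustrative sketch, not the actual PPTAgent implementation: the class and function names (`Slide`, `analyze_reference`, `generate_slides`) are hypothetical, and the real system drives LLM-generated code actions over presentation files rather than simple dictionary fills.

```python
from dataclasses import dataclass

@dataclass
class Slide:
    layout: str
    content: dict  # element name -> text, e.g. {"title": "...", "bullets": "..."}

def analyze_reference(reference_slides):
    """Stage I (sketch): group reference slides by layout and collect a
    content schema (the set of element names) for each layout."""
    schemas = {}
    for slide in reference_slides:
        schemas.setdefault(slide.layout, set()).update(slide.content.keys())
    return schemas

def generate_slides(outline, schemas):
    """Stage II (sketch): for each outline item, select a reference layout
    and 'edit' it by filling its schema with the drafted content, so every
    generated slide stays consistent with the reference structure."""
    slides = []
    for section in outline:
        schema = schemas.get(section["layout"], set())
        filled = {key: section["content"].get(key, "") for key in schema}
        slides.append(Slide(layout=section["layout"], content=filled))
    return slides
```

In this sketch, generation never creates slide structure from scratch; it only fills schemas extracted from the reference deck, which mirrors the edit-based design the paper credits for structural coherence.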
Community
Hi, everyone!
We propose PPTAgent, a system for automatically generating presentations from documents. It follows a two-stage process inspired by how people create slides, producing high-quality content, clear structure, and visually appealing design. To evaluate the generated presentations, we also introduce PPTEval, a framework that measures presentation quality in terms of content, design, and coherence.
GitHub: https://github.com/icip-cas/PPTAgent · Dataset: https://huggingface.co/datasets/Forceless/Zenodo10K
This is an automated message from the Librarian Bot. The following similar papers were recommended by the Semantic Scholar API:
- Multi-LLM Collaborative Caption Generation in Scientific Documents (2025)
- From Words to Structured Visuals: A Benchmark and Framework for Text-to-Diagram Generation and Editing (2024)
- AutoPresent: Designing Structured Visuals from Scratch (2025)
- ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges (2024)
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation (2024)
- TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization (2024)
- VISA: Retrieval Augmented Generation with Visual Source Attribution (2024)
News:
We've released the code for our workflow and UI at https://github.com/icip-cas/PPTAgent
Dataset: https://huggingface.co/datasets/Forceless/Zenodo10K