metadata

title: Audio Abstract42
emoji: 😻
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.7.1
app_file: app.py
pinned: false

PDF Audio Summarizer

This application summarizes PDF documents and converts the summary to audio.

How it works

The core logic is in the audio_pdf function. It:

Extracts raw text from the uploaded PDF using PyPDF2
Summarizes the text using LED-Based Summarization Model from HuggingFace Transformers. This uses AutoTokenizer and AutoModelForSeq2SeqLM to load the model and generate a summary
Converts the text summary to an audio file using gTTS (Google Text-to-Speech)

The summary and audio file are returned and displayed in the Gradio web interface.

The interface is created using Gradio. The key components are:

The interface is launched via iface.launch()

Additional dependencies: