pdf to podcast

Overview

This project provides a tool to convert any PDF document into a podcast episode! Using Google's Gemini for dialogue generation and OpenAI's text-to-speech models, this tool processes the content of a PDF, generates a natural dialogue suitable for an audio podcast, and outputs it as an MP3 file.
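
The flow described above (PDF text in, dialogue out, audio per line, one file at the end) can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: `DialogueLine`, `build_podcast`, and the two `fake_*` helpers are hypothetical names, the Gemini and OpenAI calls are replaced with offline stand-ins, and appending audio segments end to end is an assumption about how the final file is assembled.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class DialogueLine:
    speaker: str  # hypothetical speaker label, e.g. "host" or "guest"
    text: str     # what that speaker says


def build_podcast(
    pdf_text: str,
    generate_dialogue: Callable[[str], List[DialogueLine]],
    synthesize: Callable[[DialogueLine], bytes],
) -> bytes:
    """Turn extracted PDF text into a single audio byte string.

    generate_dialogue stands in for the LLM step and synthesize for the
    text-to-speech step; both are injected so the flow can run offline.
    """
    audio = b""
    for line in generate_dialogue(pdf_text):
        # Assumption: segments are simply appended in dialogue order.
        audio += synthesize(line)
    return audio


# Offline stand-ins (hypothetical); a real run would call the model APIs.
def fake_generate_dialogue(pdf_text: str) -> List[DialogueLine]:
    return [
        DialogueLine("host", f"Today we look at: {pdf_text}"),
        DialogueLine("guest", "Sounds interesting."),
    ]


def fake_synthesize(line: DialogueLine) -> bytes:
    # Returns tagged text instead of real audio so the result is inspectable.
    return f"[{line.speaker}] {line.text}\n".encode("utf-8")


episode = build_podcast("a paper on sorting", fake_generate_dialogue, fake_synthesize)
```

Passing the two model steps in as callables keeps the orchestration separate from any particular provider, which also makes it easy to test without API keys.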

Features

Convert PDF to podcast: Upload a PDF and convert its content into a podcast dialogue.
AI-Powered Dialogue: Uses Google's Gemini LLM to create engaging, natural conversations.
High-Quality Audio: Leverages OpenAI's text-to-speech for lifelike voices.
User-friendly Interface: Simple interface using Gradio for easy interaction.

Installation

To set up the project, follow these steps:

Clone the repository:

```
git clone https://github.com/knowsuchagency/pdf-to-podcast.git
cd pdf-to-podcast
```

Install dependencies:
```
uv sync
```

Usage

Set up API Key(s):

You'll need an OpenAI API key, which you can either enter in the interface or set as the OPENAI_API_KEY environment variable.

Run the application:
```
python main.py
```
This will launch a Gradio interface in your web browser.

Upload a PDF: Upload the PDF document you want to convert into a podcast.
Enter OpenAI API Key: Provide your OpenAI API key in the designated textbox, unless OPENAI_API_KEY is already set in your environment.
Generate Audio: Click the button to start the conversion process. The output will be an MP3 file containing the podcast dialogue.
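
Since the key can come from either the textbox or the OPENAI_API_KEY environment variable, the lookup might work along these lines. This is only a sketch under stated assumptions: `resolve_openai_api_key` is a hypothetical helper, and giving the textbox value precedence over the environment variable is a guess, not something the project documents.

```python
import os
from typing import Mapping, Optional


def resolve_openai_api_key(
    textbox_value: Optional[str],
    env: Mapping[str, str] = os.environ,
) -> str:
    """Pick the API key from the interface value or the environment.

    Assumption: a non-empty textbox value wins over OPENAI_API_KEY.
    """
    key = (textbox_value or "").strip()
    if not key:
        # Fall back to the environment variable named in the usage steps.
        key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise ValueError(
            "No OpenAI API key found: enter one in the interface "
            "or set the OPENAI_API_KEY environment variable."
        )
    return key
```

Failing early with a clear message when neither source provides a key avoids starting a conversion that would only fail at the text-to-speech step.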