📢 Podcast Summaries Project

🚀 Overview

This project aims to generate structured PDF reports from podcast interviews, highlighting key takeaways, quotes, and insights. The goal is to create shareable and accessible summaries for a broader audience.

🔹 Features (Planned)

Summarization using LLMs
Search & Retrieval
PDF Report Generation
Web UI (Streamlit) for user interaction

📌 Getting Started

1️⃣ Setup the Project

Clone the repository:

git clone https://github.com/DataTalksClub/podcast-summary-generation.git
cd podcast-summaries

Install dependencies:
```
pip install -r requirements.txt
```

Before running the application, ensure you have your OPEN_API_KEY and GROK_API_KEY configured.

You can do this using either of the following methods:

Option 1: Use a `.env` File

Create a .env file in the root of your project with the following content:

OPEN_API_KEY = sample-value-here
GROK_API_KEY = sample-value-here

Option 2: Use a `secrets.toml` File

Create a file named secrets.toml inside the .streamlit folder with the following content:

OPEN_API_KEY = sample-value-here
GROK_API_KEY = sample-value-here

2️⃣ Run the Project (Development Mode)

To run the application, do the following:

# Start the backend services (if needed)
docker-compose up -d

# Generate a podcast summary using the OpenAI API key.
 python main_openai.py  --input episode.md --output episode_summary_openai.md

# Run the Streamlit application
python run_streamlit_app.py

🔄 Workflow Pipeline

LLM Processing → Summarization, Extracting Key Insights
Storage & Retrieval → Search Engine (ElasticSearch/In-memory DB)
PDF Generation → Formatted Report
Web UI → User Interaction & Downloads

🏗️ Contribution Guidelines

Open an issue before working on any feature.
Use feature branches for development.
Submit PRs with at least 2 approvals before merging.

📚 Resources

🚀 Let's build something great together! 🎙️📄

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github		.github
docs		docs
llms		llms
pipeline		pipeline
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
docker-compose.yml		docker-compose.yml
episode.md		episode.md
episode_summary.md		episode_summary.md
episode_summary_openai.md		episode_summary_openai.md
how_to_run.md		how_to_run.md
main.py		main.py
main_openai.py		main_openai.py
requirements.txt		requirements.txt
run_streamlit_app.py		run_streamlit_app.py
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

📢 Podcast Summaries Project

🚀 Overview

🔹 Features (Planned)

📌 Getting Started

1️⃣ Setup the Project

Option 1: Use a `.env` File

Option 2: Use a `secrets.toml` File

2️⃣ Run the Project (Development Mode)

🔄 Workflow Pipeline

🏗️ Contribution Guidelines

📚 Resources

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 5

Uh oh!

Languages

License

DataTalksClub/podcast-summary-generation

Folders and files

Latest commit

History

Repository files navigation

📢 Podcast Summaries Project

🚀 Overview

🔹 Features (Planned)

📌 Getting Started

1️⃣ Setup the Project

Option 1: Use a .env File

Option 2: Use a secrets.toml File

2️⃣ Run the Project (Development Mode)

🔄 Workflow Pipeline

🏗️ Contribution Guidelines

📚 Resources

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 5

Uh oh!

Languages

Option 1: Use a `.env` File

Option 2: Use a `secrets.toml` File

Packages