🩺 Medical Chatbot

An intelligent medical assistant chatbot powered by AI that provides accurate medical information using Retrieval-Augmented Generation (RAG) with LangChain, Pinecone, and Groq.

📋 Table of Contents

Overview
Features
Tech Stack
Architecture
Installation
Configuration
Usage
Project Structure
API Endpoints

🎯 Overview

This Medical Chatbot uses state-of-the-art natural language processing to answer medical questions by retrieving relevant information from a curated knowledge base of medical documents. The system combines semantic search with large language models to provide accurate, context-aware responses.

✨ Features

🤖 AI-Powered Responses: Uses Groq's Llama 3.3 70B model for intelligent answers
🔍 Semantic Search: Leverages Pinecone vector database for fast, accurate document retrieval
📚 PDF Knowledge Base: Processes medical PDFs to build a comprehensive knowledge base
💬 Interactive Web Interface: User-friendly chat interface built with Flask
🎯 Context-Aware: Retrieves top 3 most relevant documents for each query
⚡ Fast Response Time: Optimized for quick inference with Groq API

🛠️ Tech Stack

Backend

Python 3.10+
Flask: Web framework
LangChain: LLM orchestration framework
LangChain Groq: Groq integration for LLM inference
LangChain Pinecone: Vector store integration
LangChain HuggingFace: Embeddings model

AI/ML

Groq API: Fast LLM inference (Llama 3.3 70B Versatile)
HuggingFace: Embeddings (sentence-transformers/all-MiniLM-L6-v2)
Pinecone: Vector database for semantic search

Data Processing

PyPDF: PDF parsing
RecursiveCharacterTextSplitter: Text chunking

🏗️ Architecture

User Query
    ↓
Flask Web App
    ↓
LangChain RAG Pipeline
    ↓
    ├─→ HuggingFace Embeddings (384-dim vectors)
    ↓
Pinecone Vector Store (Similarity Search)
    ↓
Top 3 Relevant Documents
    ↓
Groq LLM (Llama 3.3 70B)
    ↓
Generated Response
    ↓
User Interface

📦 Installation

Prerequisites

Python 3.10 or higher
Anaconda/Miniconda (recommended)
Pinecone account
Groq API account

Step 1: Clone the Repository

git clone https://github.com/nadamankai/Medical-Chatbot.git
cd Medical-Chatbot

Step 2: Create Virtual Environment

conda create -n medibot python=3.10 -y
conda activate medibot

Step 3: Install Dependencies

pip install -r requirements.txt

⚙️ Configuration

Step 1: Create Environment File

Create a .env file in the root directory:

PINECONE_API_KEY=your_pinecone_api_key_here
GROQ_API_KEY=your_groq_api_key_here

Step 2: Get API Keys

Pinecone API Key

Go to https://www.pinecone.io/
Sign up for a free account
Navigate to API Keys section
Copy your API key

Groq API Key

Go to https://console.groq.com/
Sign up for a free account
Navigate to API Keys section
Create a new API key
Copy your API key (starts with gsk_)

Step 3: Prepare Your Data

Place your medical PDF files in the data/ directory
Run the indexing script to create embeddings:

python store_index.py

This will:

Load all PDFs from the data/ directory
Split documents into chunks (500 chars with 20 char overlap)
Generate embeddings using HuggingFace model
Store vectors in Pinecone index named medical-bot

🚀 Usage

Start the Application

python app.py

The application will start on http://localhost:8080

Using the Chatbot

Open your browser and navigate to http://localhost:8080
Type your medical question in the chat interface
Press Enter or click Send
Wait for the AI-generated response

Example Queries

"What is diabetes?"
"What are the symptoms of hypertension?"
"How is acne treated?"
"What causes anemia?"

📁 Project Structure

Medical-Chatbot/
├── app.py                      # Main Flask application
├── store_index.py              # Script to create Pinecone index
├── requirements.txt            # Python dependencies
├── setup.py                    # Package setup file
├── .env                        # Environment variables (not in repo)
├── README.md                   # Project documentation
│
├── data/                       # Medical PDF documents
│   └── *.pdf
│
├── src/                        # Source code modules
│   ├── __init__.py
│   ├── helper.py               # Helper functions for data processing
│   └── prompt_template.py     # System prompt for the chatbot
│
├── templates/                  # HTML templates
│   └── chat.html               # Chat interface
│
└── research/                   # Jupyter notebooks for experimentation
    └── trials.ipynb

🔌 API Endpoints

`GET /`

Renders the main chat interface.

Response: HTML page

`POST /get`

Handles chat messages and returns bot responses.

Request:

{
  "msg": "What is diabetes?"
}

Response:

Diabetes is a chronic condition characterized by high blood sugar levels...

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.idea		.idea
anaconda_projects/db		anaconda_projects/db
assets		assets
data		data
medical_chatbot.egg-info		medical_chatbot.egg-info
research		research
src		src
templates		templates
.env.template		.env.template
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
setup.py		setup.py
store_index.py		store_index.py
template.sh		template.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🩺 Medical Chatbot

📋 Table of Contents

🎯 Overview

✨ Features

🛠️ Tech Stack

Backend

AI/ML

Data Processing

🏗️ Architecture

📦 Installation

Prerequisites

Step 1: Clone the Repository

Step 2: Create Virtual Environment

Step 3: Install Dependencies

⚙️ Configuration

Step 1: Create Environment File

Step 2: Get API Keys

Pinecone API Key

Groq API Key

Step 3: Prepare Your Data

🚀 Usage

Start the Application

Using the Chatbot

Example Queries

📁 Project Structure

🔌 API Endpoints

`GET /`

`POST /get`

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🩺 Medical Chatbot

📋 Table of Contents

🎯 Overview

✨ Features

🛠️ Tech Stack

Backend

AI/ML

Data Processing

🏗️ Architecture

📦 Installation

Prerequisites

Step 1: Clone the Repository

Step 2: Create Virtual Environment

Step 3: Install Dependencies

⚙️ Configuration

Step 1: Create Environment File

Step 2: Get API Keys

Pinecone API Key

Groq API Key

Step 3: Prepare Your Data

🚀 Usage

Start the Application

Using the Chatbot

Example Queries

📁 Project Structure

🔌 API Endpoints

GET /

POST /get

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`GET /`

`POST /get`

Packages