MedAid 🩺
A Bi-Modal Multi-Agent Medical Assistant

Overview

MedAid is an AI-powered assistant designed to assist with medical diagnosis, research, and patient interactions.

This project integrates LLMs, CV Models, RAG, Web Search and Human-in-the-loop validation for AI based medical diagnosis and research.

Flow Chart

Features

Modular Multi-Agent System : Specialized agents for diagnosis, retrieval, reasoning, and image analysis.
Agentic RAG Pipeline :
- PDF parsing via Docling (text, tables, images)
- Structural-aware semantic chunking
- Domain-specific query expansion
- Hybrid search with BM25 + dense vectors (Qdrant)
- Cross-encoder reranking for relevance
- Guardrails and source linking
- Confidence-based switch to Web Search to reduce hallucinations
Medical Imaging Module :
- COVID-19 chest X-ray classification
- Skin lesion segmentation
Live Research Agent : Real-time retrieval of current medical literature.
Confidence Scoring : Log-probability–based accuracy verification.
Voice Interface : Speech-to-text and TTS via Eleven Labs API.
Expert Review : Human-in-the-loop validation by medical professionals.
Safety Filters : Robust I/O guardrails for trustworthy and ethical responses.
User-Friendly UI : Accessible design tailored for medical practitioners.

Technology Used

Component	Technologies
Backend	FastAPI
Agent Orchestration	LangGraph
Document Parsing	Docling
Knowledge Storage	Qdrant Vector Database
Medical Image Analysis	Computer Vision Models
	• Chest X-Ray: Image Classification (PyTorch)
	• Skin Lesion: Semantic Segmentation (PyTorch)
Guardrails	LangChain
Speech Processing	Eleven Labs API
Frontend	HTML, CSS, JavaScript

Project Setup

1️⃣ Clone the Repository

git clone https://github.com/vedprakashnautiyal/MedAid.git
cd MedAid

2️⃣ Create Environment File

Create a .env file in the root directory and add API keys or other environment variables:

# Speech API Key 
ELEVEN_LABS_API_KEY=

# Web Search API Key
TAVILY_API_KEY=

# Hugging Face Token (For ReRanker Model -  "ms-marco-TinyBERT-L-6" )
HUGGINGFACE_TOKEN=

# For Gemini API (Can use other LLMs like Ollama Based or OpenAI but need code modification)
GOOGLE_API_KEY=

3️⃣ Create & Activate Virtual Environment

python -m venv .medaid
source .medaid/bin/activate  # For Mac/Linux
.medaid\Scripts\activate     # For Windows

4️⃣ Install Dependencies

[NOTE] ffmpeg is required for speech service to work.

winget install ffmpeg

pip install -r requirements.txt

5️⃣ Ingest Data into Vector DB

To ingest one document at a time:

python ingest_rag_data.py --file ./data/raw/brain_tumors_ucni.pdf

To ingest multiple documents from a directory:

python ingest_rag_data.py --dir ./data/raw

6️⃣ Run the Project

python app.py

The application will be available at: http://localhost:8000

🔝 Return

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
agents		agents
assets		assets
data		data
templates		templates
uploads		uploads
README.md		README.md
References.md		References.md
app.py		app.py
config.py		config.py
ingest_rag_data.py		ingest_rag_data.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MedAid 🩺
A Bi-Modal Multi-Agent Medical Assistant

A Bi-Modal Multi-Agent Medical Assistant

Table of Contents

Overview

Flow Chart

Features

Technology Used

Project Setup

1️⃣ Clone the Repository

2️⃣ Create Environment File

3️⃣ Create & Activate Virtual Environment

4️⃣ Install Dependencies

5️⃣ Ingest Data into Vector DB

6️⃣ Run the Project

About

Uh oh!

Languages

vedprakashnautiyal/MedAid

Folders and files

Latest commit

History

Repository files navigation

MedAid 🩺 A Bi-Modal Multi-Agent Medical Assistant

A Bi-Modal Multi-Agent Medical Assistant

Table of Contents

Overview

Flow Chart

Features

Technology Used

Project Setup

1️⃣ Clone the Repository

2️⃣ Create Environment File

3️⃣ Create & Activate Virtual Environment

4️⃣ Install Dependencies

5️⃣ Ingest Data into Vector DB

6️⃣ Run the Project

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages

MedAid 🩺
A Bi-Modal Multi-Agent Medical Assistant