This project is an end-to-end system for querying PDF documents using a chatbot interface. It combines a modern Next.js frontend with Python microservices for document processing, retrieval, and LLM-based question answering. The system is designed for extensibility and performance, supporting scalable document ingestion and semantic search.
- Server Components in Next.js: Efficient rendering and data fetching (MDN: Server Components).
- TypeScript for Type Safety: Ensures robust code and easier refactoring (MDN: TypeScript).
- Custom Hooks and Utility Functions: Modularizes logic for reusability.
- Python Microservices: Decouples document processing and QA logic for scalability.
- Chroma Vector Database: Enables fast semantic search over document embeddings (ChromaDB).
- PDF Parsing and Embedding: Processes and indexes PDF content for retrieval.
- LLM Integration: Uses language models for natural language question answering.
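The retrieval step behind the semantic search works by comparing vector embeddings. As a toy illustration of what ChromaDB does under the hood (the three-dimensional vectors below are made up for the example; a real system uses model-generated embeddings):

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; 1.0 means identical direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Toy "embeddings" for indexed chunks (illustrative values only).
index = {
    "chunk about contract law": [0.9, 0.1, 0.0],
    "chunk about criminal procedure": [0.1, 0.9, 0.2],
}

def search(query_vec, index, top_k=1):
    """Return the top_k chunk ids ranked by cosine similarity to the query."""
    ranked = sorted(index, key=lambda cid: cosine_similarity(query_vec, index[cid]), reverse=True)
    return ranked[:top_k]

print(search([0.8, 0.2, 0.1], index))  # the contract-law chunk is nearest
```

The retrieved chunks are then passed to the LLM as context for answering the question.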
- Next.js (React framework)
- ChromaDB (vector database)
- LangChain (LLM orchestration)
- FastAPI (Python web framework)
- PyPDF2 (PDF parsing)
- Tailwind CSS (utility-first CSS)
- Vercel (deployment platform)
- PostCSS (CSS processing)
- ESLint (code linting)
- TypeScript (typed JavaScript)
- React (UI library)
```
.
├── pdf-qa-chatbot/
│   ├── public/
│   └── src/
│       ├── app/
│       └── lib/
└── Python_Microservices_Be/
    ├── chroma_db_legal/
    └── uploads/
```
- pdf-qa-chatbot/public/: Contains SVG assets for UI.
- pdf-qa-chatbot/src/app/: Next.js app directory, includes global styles and layout.
- pdf-qa-chatbot/src/lib/: Utility functions for frontend logic.
- Python_Microservices_Be/chroma_db_legal/: ChromaDB vector store and metadata.
- Python_Microservices_Be/uploads/: Uploaded PDF documents for processing.
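Before an uploaded PDF can be indexed, its extracted text is typically split into overlapping chunks. The repository's actual splitter is not shown in this README; a minimal sketch of fixed-size chunking with overlap (the 200/50 sizes are illustrative, not taken from the code) might look like:

```python
def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into chunks of `size` characters, each overlapping the previous by `overlap`."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than chunk size")
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap  # advance by the non-overlapping portion
    return chunks

pages = "A" * 450  # stand-in for text extracted from a PDF (e.g. via PyPDF2)
print(len(chunk_text(pages)))  # 450 chars at step 150 -> 3 chunks
```

Overlap preserves context across chunk boundaries, which improves retrieval quality for answers that span two chunks.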
No custom fonts are used; the app relies on system or default web fonts.
- Install dependencies:

  ```bash
  cd pdf-qa-chatbot
  npm install
  ```

- Run the development server:

  ```bash
  npm run dev
  ```

  The app will be available at http://localhost:3000.
- Create a virtual environment:

  ```bash
  python -m venv venv
  ```

- Activate the environment:
  - On Windows:

    ```bash
    venv\Scripts\activate
    ```

  - On macOS/Linux:

    ```bash
    source venv/bin/activate
    ```

- Install dependencies:

  ```bash
  pip install -r Python_Microservices_Be/requirements.txt
  ```
- Install Ollama Desktop:
  - Download and install from Ollama Desktop.
- Download the GGUF model from Hugging Face:
  - Visit Indian-LegalBot-Llama-3.1-8B-GGUF.
  - Download the recommended version: Q4_K_S or Q4_K_M.
- Convert the GGUF model for Ollama compatibility:
  - Create a Modfile in your model directory with the following content:

    ```
    FROM ./Indian-LegalBot-Llama-3.1-8B-Q4_K_S.gguf
    PARAMETER stop "<|eot_id|>"
    ```

  - Replace the filename with your downloaded GGUF file.
  - Build the model for Ollama:

    ```bash
    ollama create indian-legalbot -f Modfile
    ```

  - The model is now available for use with Ollama.
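Once the model is built, you can sanity-check it directly over Ollama's local REST API (it listens on port 11434 by default). A minimal sketch, assuming the model was created as `indian-legalbot` as above:

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint

def build_payload(question: str, model: str = "indian-legalbot") -> dict:
    """Request body for Ollama's /api/generate endpoint (stream=False -> a single JSON reply)."""
    return {"model": model, "prompt": question, "stream": False}

def ask_ollama(question: str) -> str:
    """Send a prompt to the locally running Ollama server and return the response text."""
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(build_payload(question)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

# ask_ollama("What is a writ petition?")  # requires the Ollama server to be running
```

This is only a smoke test; in the actual system the microservices talk to Ollama for you.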
- Update the Python microservices configuration:
  - Open Python_Microservices_Be/config.py.
  - Set the model name:

    ```python
    LLM_MODEL_NAME = "indian-legalbot"
    ```

  - This ensures your microservices use the correct Ollama model.
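The full contents of config.py are not shown in this README; a sketch of what such a module might contain follows. Only `LLM_MODEL_NAME` is confirmed above; the other two values are hypothetical placeholders named after the repository's directories.

```python
# Python_Microservices_Be/config.py (sketch; only LLM_MODEL_NAME is confirmed by this README)
LLM_MODEL_NAME = "indian-legalbot"     # Ollama model built in the previous step

# Hypothetical extras a service like this often needs -- adjust to the real file:
CHROMA_PERSIST_DIR = "chroma_db_legal" # named after the repo's vector-store directory
UPLOAD_DIR = "uploads"                 # named after the repo's upload directory
```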
...
- Navigate to the frontend directory:

  ```bash
  cd pdf-qa-chatbot
  ```

- Install dependencies:

  ```bash
  npm install
  ```

- Start the development server:

  ```bash
  npm run dev
  ```

  The application will be available at http://localhost:3000 ...
...
- Activate your virtual environment:
  - On Windows:

    ```bash
    venv\Scripts\activate
    ```

  - On macOS/Linux:

    ```bash
    source venv/bin/activate
    ```

- Start the microservices:

  ```bash
  flask run --port=5001
  ```
BODY Example:

```json
{
  "collection_name": "legal_case_a1b2c3d4e5",
  "message": "File 'sample_case.pdf' processed successfully."
}
```

BODY Example:

```json
{
  "question": "What is the main subject of this document?",
  "collection_name": "legal_case_a1b2c3d4e5"
}
```

BODY Example:

```json
{
  "question": "Explain the concept of 'audi alteram partem' in Indian law."
}
```

...