A private, offline Q&A system for your documents, powered by FAISS, LangChain, and GPT-4 (or local LLaMA).
- PDF ingestion: Chunk, embed, and index any PDF corpus.
- Vector search: Ultra-fast retrieval over FAISS.
- Flexible inference: Use OpenAI GPT-4 API or spin up a local LLaMA-based model.
- Customizable retrieval: Tweak chunk size, overlap, and top-k retrieval.
- Incremental indexing: Add new PDFs without re-indexing everything.
- Streamlit UI: Fully interactive demo with ingestion, uploads, and Q&A.
- Automated tests: Pytest suite covers ingestion, caching, retrieval, and end-to-end QA.
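Under the hood, ingestion boils down to load → chunk → embed → index. The sketch below shows one way to wire that up with LangChain and FAISS; the repo's actual `src.ingestion` may differ, and the directory names (`data/pdfs`, `data/cache`) and the `pdf_chunks` helper are illustrative assumptions.

```python
"""Sketch of the ingestion flow (illustrative, not the repo's actual code)."""
from pathlib import Path

from langchain_community.document_loaders import PyPDFLoader
from langchain_community.vectorstores import FAISS
from langchain_openai import OpenAIEmbeddings
from langchain_text_splitters import RecursiveCharacterTextSplitter

PDF_DIR = Path("data/pdfs")   # assumed input folder
INDEX_DIR = "data/cache"      # assumed index location

# chunk_size and chunk_overlap are the retrieval knobs listed above.
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)

def pdf_chunks(pdf_path: Path):
    """Load one PDF and split it into overlapping chunks."""
    return splitter.split_documents(PyPDFLoader(str(pdf_path)).load())

embeddings = OpenAIEmbeddings()
pdfs = sorted(PDF_DIR.glob("*.pdf"))

# Build the index from the first PDF, then add the rest incrementally --
# the same add_documents() call is what makes "add new PDFs without
# re-indexing everything" possible later on.
index = FAISS.from_documents(pdf_chunks(pdfs[0]), embeddings)
for pdf in pdfs[1:]:
    index.add_documents(pdf_chunks(pdf))
index.save_local(INDEX_DIR)
```

Splitting with overlap keeps sentences that straddle a chunk boundary retrievable from either side.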
## 1. Clone

```bash
git clone git@github.com:you/RAG-Genius.git
cd RAG-Genius
```
## 2. Configure

Copy your PDFs into `data/pdfs/`, then create a `.env` file with your OpenAI key (if using GPT-4):

```bash
OPENAI_API_KEY=sk-...
```
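At runtime the key only has to be present in the process environment. If the repo follows the common python-dotenv pattern (an assumption, not confirmed by the code), loading it looks like:

```python
# Load .env into the environment before any OpenAI client is created.
import os

from dotenv import load_dotenv

load_dotenv()  # reads .env from the current working directory
assert os.getenv("OPENAI_API_KEY"), "OPENAI_API_KEY missing; check your .env"
```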
## 3. Create your environment file

See `env_sample` for reference. Make sure your key stays out of version control before publishing the repo as open source!
## 4. Install & run locally

```bash
# Create virtualenv and install
python -m venv venv
source venv/bin/activate       # macOS/Linux
.\venv\Scripts\Activate.ps1    # Windows PowerShell
pip install --upgrade pip
pip install -r requirements.txt

# Ingest your PDFs
python -m src.ingestion

# Start the Streamlit UI
python -m streamlit run app/streamlit_app.py
```
Visit http://localhost:8501 to ask questions against your documents.
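If you want to query the index without the UI, the round trip is: retrieve the top-k chunks, then generate an answer over them. A minimal sketch, assuming the LangChain FAISS store saved above and GPT-4 via `langchain_openai` (the repo's `src` modules presumably wrap this more cleanly):

```python
# Illustrative retrieve-then-answer loop over the saved FAISS index.
from langchain_community.vectorstores import FAISS
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

embeddings = OpenAIEmbeddings()
index = FAISS.load_local(
    "data/cache", embeddings, allow_dangerous_deserialization=True
)

question = "What does the corpus say about incremental indexing?"
chunks = index.similarity_search(question, k=4)  # top-k retrieval
context = "\n\n".join(c.page_content for c in chunks)

llm = ChatOpenAI(model="gpt-4")  # swap in a local LLaMA wrapper if offline
answer = llm.invoke(
    f"Answer using only this context:\n{context}\n\nQuestion: {question}"
)
print(answer.content)
print("Sources:", sorted({c.metadata.get("source", "?") for c in chunks}))
```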
## Project layout

- `/src`: ingestion, vector store, retrieval, and inference logic
- `/app`: Streamlit demo
- `/tests`: unit tests for each module
- `Dockerfile` + `requirements.txt` for containerized deployment
## Automated Tests

The pytest suite spins up isolated temp folders, ingests a sample PDF, and verifies that:

- Ingestion writes FAISS index files.
- CacheManager supports incremental adds.
- Retriever returns relevant chunks.
- Inference produces answers with source documents.
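For a flavor of what such a test looks like, here is a hypothetical ingestion test; `src.ingestion.ingest` and its signature are assumptions, so adapt the names to the actual module:

```python
# Hypothetical test: ingest into a temp folder, then check FAISS artifacts.
from pathlib import Path

from src.ingestion import ingest  # assumed entry point


def test_ingestion_writes_faiss_files(tmp_path: Path):
    index_dir = tmp_path / "cache"
    ingest(pdf_dir=Path("tests/fixtures"), index_dir=index_dir)
    # FAISS.save_local() writes an index.faiss / index.pkl pair.
    assert (index_dir / "index.faiss").exists()
    assert (index_dir / "index.pkl").exists()
```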
Run them with:
```bash
pytest -q
```

## Docker

```bash
docker build -t rag-genius .
docker run -p 8501:8501 --env-file .env rag-genius
```
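If you need a reference point for the Dockerfile, a minimal sketch (assuming a slim Python base image and the default Streamlit port; the repo's actual Dockerfile may differ):

```dockerfile
# Minimal sketch -- not necessarily the repo's actual Dockerfile.
FROM python:3.11-slim
WORKDIR /app

COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

COPY . .
EXPOSE 8501

# Bind to 0.0.0.0 so the app is reachable from outside the container.
CMD ["python", "-m", "streamlit", "run", "app/streamlit_app.py", "--server.address=0.0.0.0"]
```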
## Git Ignore

```gitignore
# Secrets & env
.env

# FAISS index cache
/data/cache/

# Raw PDFs (if you want to keep them local)
/data/pdfs/

# Python caches
__pycache__/
*.py[cod]

# Virtualenv
venv/
```

Enjoy!

