The Customer Complaint Intelligence System is a GenAI-powered application that analyzes, clusters, and extracts insights from customer complaints. By using advanced NLP techniques like embeddings and Large Language Models (LLMs), it transforms raw text into actionable intelligence.
Live Demo: Click Here to View App
- Automated Cleaning: Preprocesses raw complaint text automatically
- Semantic Clustering: Groups similar complaints using sentence embeddings and KMeans
- AI Summarization: Generates concise summaries and categories using Gemini or Groq
- Smart Search: Find specific complaints using natural language queries (RAG)
- Interactive Dashboard: Explore data through a user-friendly Streamlit interface
- GPU Acceleration: Auto-detects CUDA for faster embedding generation
- Python (Logic & Data Processing)
- Streamlit (User Interface)
- Sentence Transformers (Embeddings with GPU support)
- FAISS (Vector Database)
- Gemini API / Groq API (LLM Intelligence)
Clone the repository and install the required packages:
git clone https://github.com/AswaniSahoo/complaint-intelligence-system.git
cd complaint-intelligence-system
pip install -r requirements.txtCreate a .env file in the root directory and add your API key:
GEMINI_API_KEY="your-gemini-api-key"Process the raw data:
python run_pipeline.pystreamlit run app/app.pyOpen your browser to http://localhost:8501.
DATA PROCESSING PIPELINE
+------------------------------------------------------------------+
| |
| Raw CSV --> Preprocess --> Embeddings --> Clustering --> LLM |
| (CFPB) (clean text) (MiniLM/GPU) (KMeans) (Gemini) |
| |
+------------------------------------------------------------------+
STREAMLIT DASHBOARD
+------------------------------------------------------------------+
| |
| Overview ---> Clusters ---> Viewer ---> Ask AI (RAG) |
| (metrics) (drilldown) (filter) (FAISS search) |
| |
+------------------------------------------------------------------+
Contributions are welcome! Please see CONTRIBUTING.md for details.
This project is licensed under the MIT License.
Aswani Sahoo
AI & ML Engineer
- CFPB Consumer Complaint Database for the dataset
- Sentence Transformers for embeddings
- Google Gemini for LLM summarization


