π Dual Degree (B.Tech + M.Tech) in Metallurgical & Materials Engineering @ IIT Kharagpur (2020-2025)
π AI Developer Intern @ ModelsLab | Computer Vision Intern @ Dewinter Optical | ML Intern @ TheAware.AI
π± Skills: TTS Model Training | Audio/Image Processing | LLM Finetuning | Server Development & Deployment
π Projects: AI Dubbing API | XTTS V2 Finetuning | Vision-based Blood Cell Segmentation | Text-based Question Generation
π€ Co-founder: SPARK4AI - Collaborative AI Research Society at IIT Kharagpur
βοΈ Current Focus: Building Generative AI, Speech AI, and Advanced Machine Learning Systems
π« Reach me at: [email protected]
I co-founded SPARK4AI at IIT Kharagpur to ignite a culture of collaborative research and innovation in Artificial Intelligence.
Our mission is to provide a platform for students and researchers to work together on impactful AI projects, focusing on real-world applications.
Follow my discord server to be a part of my initiative - https://discord.gg/QMcKPdYQ.
Vision:
To establish a world-class research lab in India dedicated to advancing research, innovation, and development in Artificial Intelligence.
I aspire to contribute towards strengthening India's AI ecosystem, nurturing local talent, and building technology solutions that can positively transform industries and society.
- Languages: Python | C++ | SQL | ReactJS
- Libraries/Frameworks: PyTorch | HuggingFace | OpenCV | FastAPI | Celery | Numpy | Librosa
- Tools: Docker | Git | GitHub | Jupyter Notebook | Android Studio | Google Cloud Platform | Redis | MySQL
- Domains: Speech Synthesis (TTS), Computer Vision, NLP, Server APIs, Generative AI
- XTTS V2 Finetuning: Finetuned multilingual TTS models on 14k+ hrs speaker-annotated data
- AI Dubbing API: Developed transcription β diarization β translation β TTS β audio alignment pipeline
- WBC Classification: Built WBC type classifier (97% accuracy) using custom YOLOv5 model on 12k cell images
- Smart Doc Scanner App: Built Android App to extract text from real-time videos using Optical Flow & OCR
- Kaggle GNSS Challenge: Predicted vehicle coordinates with advanced filtering and geospatial transformations
- π Co-Founder: SPARK4AI @ IIT Kharagpur (Led 15+ GenAI research projects, hosted national hackathons)
- π― Hackathon Co-organizer: AI4ICPS AI Hackathon @ IIT Kharagpur (Curated datasets and panelists)
- π‘ Head: National Students' Space Challenge 2022 Sponsorship and Marketing (IIT KGP)

