π Hyderabad, India Β |Β π» Data Scientist & Data Analyst Β |Β
A maths nerd turned Data Analyst β I live and breathe data. Turning raw numbers into meaningful insights is what gets me out of bed every morning.
I'm currently deep-diving into AI at the intersection of Biology/Agriculture π§¬, exploring how machine learning can unlock new frontiers in the life sciences. I'm actively open to opportunities at the AI Γ Biology space.
Beyond work, I love building projects and tools that scratch my own itch β if it makes my life even a little better, it's worth building. When I step away from the screen, you'll find me with a camera in hand, capturing emotions and untold stories in still frames π·.
π Python
Pandas NumPy Matplotlib Seaborn Scikit-Learn SpaCy TensorFlow Pytorch
ποΈ SQL
SQL Commands Joins CTE Stored Procedures Window functions Views
π Analytics
Excel Data Preprocessing Data Analysis Statistical Modelling Predictive Modeling Machine Learning Deep Learning
βοΈ Data Engineering
Microsoft Fabric Databricks Data Engineering & Transformation Data Warehousing & Querying Data Ingestion PySpark
π Visualization Tools
Power BI Β |Β Tableau
βοΈ Cloud Computing
AWS GCP
A growing collection of fundamental analysis of listed Indian companies, powered by LLMs.
- π Covers economic, industry, and company-level analysis
- π Sources: Annual reports (statutory sections), MoneyControl financials
- π€ Tools: Gemini models + custom MD&A summarizer
- π Updated monthly
Skills used: fundamental analysis llms pdf parsing llm api keys
A machine learning project to predict credit risk for loan applicants using historical financial data.
Skills used: Python Pandas Scikit-Learn Machine Learning
Predicts domestic electricity consumption patterns for TGNPDCL using time-series forecasting and ML models.
Skills used: Python Pandas Machine Learning Time Series
Extracts and summarizes various sections of RHP/DRHP documents (IPO filings) using Google Gemini AI.
Skills used: Python Google Gemini AI LLM NLP PDF Parsing
Extracts and summarizes Management Discussion & Analysis sections from annual reports using Google Gemini AI.
Skills used: Python Google Gemini AI LLM NLP PDF parsing
A Python-based revenue cycle intelligence tool for healthcare organizations to detect anomalies and protect revenue.
Skills used: Python Machine Learning Data Analysis Streamlit
Explores chemical compound spaces using cheminformatics techniques for drug discovery applications.
Skills used: Python Cheminformatics RDKit Data Visualization
Generates summaries and transcripts from YouTube videos using AI.
Skills used: Python LLM NLP YouTube API
Predicts whether a bank customer is likely to churn using an Artificial Neural Network trained on customer attributes such as geography, gender, age, balance, tenure, and salary. Skills used:
PythonPandasNumPyTensorFlowKerasANNData PreprocessingFeature Engineering
Built a machine learning pipeline to recognize handwritten digits by combining Principal Component Analysis (PCA) for dimensionality reduction with classification models for prediction. The project focuses on reducing high-dimensional image features, improving training efficiency, and comparing model performance for accurate digit classification.
Skills used: Python Pandas NumPy Matplotlib Seaborn Scikit-Learn PCA Classification
Predicts loan approval outcomes using a 614-record dataset with applicant demographics and financial features. Includes EDA, missing value treatment, and comparison of multiple classification models β Logistic Regression, KNN, Decision Tree, and Random Forest β achieving ~78% accuracy.
Skills used: Python Pandas NumPy Matplotlib Seaborn Scikit-Learn Classification Models EDA
GitHub
Visualization projects
Skills used: Power BI
A rule-based chatbot built to handle movie booking conversations.
Skills used: Python NLP Chatbot Design