This project utilizes a Support Vector Machine (SVM) classification model to predict customer eligibility for banking credit cards. It employs various data preprocessing techniques, visualizations, and model optimization strategies to enhance prediction accuracy.
- Accurately classify customers based on their eligibility for credit cards.
- Explore and visualize data relationships to understand customer attributes influencing eligibility.
- Optimize the SVM model using hyperparameter tuning and handling data imbalance effectively.
- Data exploration and cleaning
- Identifying key features and their impact
- Correlation analysis using Heatmap
- Understanding feature relationships and data distribution
- Standardization to ensure optimal model performance
- SVM (Support Vector Machine) classification
- Searching optimal hyperparameters through sampling techniques to improve model accuracy
- Implementation of SMOTE (Synthetic Minority Oversampling Technique)
- Addressing data imbalance to enhance predictive performance
- Further refining the SVM model post-balancing for improved accuracy
To run this project locally:
-
Clone the repository:
git clone https://github.com/azizzoaib786/banking-dataset-prediction-svm.git
-
Install the required dependencies:
pip install -r requirements.txt
-
Open and run the notebook:
- Open
banking-dataset-prediction-svm.ipynbin Jupyter Notebook or JupyterLab. - Execute all cells (
Run All).
- Open
- Python (3.7 or higher recommended)
- scikit-learn
- pandas
- numpy
- matplotlib
- seaborn
- imbalanced-learn
- Jupyter Notebook/JupyterLab
Contributions are encouraged! Feel free to fork the repository, make improvements, and submit a pull request.
- Email: [email protected]
This project is licensed under the MIT License.