Skip to content
Tarik Salay edited this page Sep 28, 2019 · 2 revisions

This project discusses classification. Such as:
b. Classification algorithm
c. Scikit learn
d. Advanced concept related to machine learning algorithm like overfitting, underfitting, cross validation, evaluation for clustering methods

Use Case Description:
k-nearest neighbor classifier

Programming elements:
Classification

Source Code:
https://umkc.box.com/s/cra3wkg3y7c4yolhrler01tq5d42mc1j

Data Set:
URL: https://umkc.box.com/s/ea6wn1cidukan67t02j60nmp1ljln3kd
Dataset description: https://www.kaggle.com/uciml/glass/downloads/glass-classification.zip/1
The name of target Column is Type
1. find the correlation between ‘survived’ (target column) and ‘sex’ column for the Titanic use case in class.

2. Implement Naïve Bayes method using scikit-learn library
Use dataset available in https://umkc.box.com/s/ea6wn1cidukan67t02j60nmp1ljln3kd
Use train_test_split to create training and testing part
Evaluate the model on testing part using score and
classification_report(y_true, y_pred)

3. Implement linear SVM method using scikit library
Use the same dataset above
Use train_test_split to create training and testing part
Evaluate the model on testing part using score and
classification_report(y_true, y_pred)

Clone this wiki locally