This project involves forecasting whether an audiobook user will be able to retain or churn from a vendor company. Data comes from an unspecified audiobook vendor. This repository contains a well-commented jupyter notebook that walks along the path of prediction process. This involves 5 stepe:
- Introduction
- Data Exploration
- Data Cleaning and Feature Engineering
- Model Building
- Results and Discussion
In the modelling phase a number of classic and modern models are run and explored. The tree based ensemble models seem to perform better in terms of model performance metrics.