elch10/instacart-market-basket


  1. Install MySQL Server and the MySQL JDBC connector. Start the server and fill in sql_cfg.json so that Spark can connect to MySQL.
  2. Install the dependencies:
conda env create -f environment.yml
  3. Run loadToSQL.py. It first loads the Instacart data to a local path, then reads it into Spark DataFrames, which Spark writes to the MySQL server.
  4. Read the analysis in sql.md. It contains the SQL queries and an analysis of the tables that checks several hypotheses. All visualizations were made with Grafana.
  5. Open spark.ipynb for the analysis done with Spark.
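
The connection and load steps above can be sketched roughly as follows. This is not the code in loadToSQL.py, only a minimal illustration of writing a Spark DataFrame to MySQL over JDBC. The config keys (`host`, `port`, `database`, `user`, `password`) and the helper names `load_sql_config`, `jdbc_options`, and `write_csv_to_mysql` are assumptions made for this example; the real sql_cfg.json may be laid out differently.

```python
import json
from pathlib import Path


def load_sql_config(path="sql_cfg.json"):
    """Read MySQL connection settings from a JSON file."""
    return json.loads(Path(path).read_text())


def jdbc_options(cfg, table):
    """Build options for Spark's JDBC writer from a config dict.

    The keys host, port, database, user and password are assumed for
    this sketch; the real sql_cfg.json may use different names.
    """
    port = cfg.get("port", 3306)  # 3306 is the default MySQL port
    url = f"jdbc:mysql://{cfg['host']}:{port}/{cfg['database']}"
    return {
        "url": url,
        "dbtable": table,
        "user": cfg["user"],
        "password": cfg["password"],
        # Driver class name used by MySQL Connector/J 8 and later
        "driver": "com.mysql.cj.jdbc.Driver",
    }


def write_csv_to_mysql(spark, csv_path, cfg, table):
    """Load one CSV into a Spark DataFrame and write it to MySQL over JDBC."""
    df = spark.read.csv(csv_path, header=True, inferSchema=True)
    (
        df.write.format("jdbc")
        .options(**jdbc_options(cfg, table))
        .mode("overwrite")  # replaces the table if it already exists
        .save()
    )
```

To use it, create a `SparkSession` with the MySQL Connector/J jar on its classpath (for example through the `spark.jars` setting), then call `write_csv_to_mysql(spark, "orders.csv", load_sql_config(), "orders")`. The file and table names here are placeholders.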

About

Analyzing Instacart market basket data using PySpark and MySQL
