Using the sample dataset, you are required to present and apply based on your understanding on data lifecycling:
- Data Collection
- ETL
- Data Storage
- Data Categorization
- Data Visualization
Your City Mayor has contacted you to build a dashboard that analyzes the general data of the city.
- You can use any form of data storage (mysql, sqlite, postgresql, mongodb, etc), either offline, on-premise or cloud-based.
- You can use any programming language of your choice for the data operations, and present your data visualization on a simple HTML dashboard (no login module needed) with suitable visualization tool. Dashboard design is not graded, minimally to be presentable for grading of data visualization.
- Your data import should be done via code and show clearly your data cleaning considerations.
- Start off with
population.csv, your data has to be built upon district & neighborhood columns.
- Year 2017 Metrics
- On overall, show population breakdown % by districts, unemployment % by districts, and accident breakdown % by district, immigrant breakdown by nationality.
- Based on district, plot data visualization metrics to show population, births, deaths, unemployment, immigrants vs emigrants
- Plot a visualization on life expectancy over time of each districts in comparison
- Plot visualizations on accident distribution by month, accident distribution by day.
- Plot a Google Map visualization to show the bus stops in the city.
- Provide necessary information for us to access your code (eg. Github) and database (eg. db access credential). Instructions on environment setup prior to running of application should be included as well.
- Submission deadline is included in the email.