In this repository you will work through an example of how to clean data. Since data cleaning is one part of an exploratory data analysis (EDA), the included notebook will help you with your first project.
Please make sure you have forked the repo and set up a new virtual environment. The included requirements file lists all libraries and dependencies needed to run the data cleaning notebook. You can create the environment and install the requirements with the following commands:
- On macOS or Linux, create the virtual environment and install the required packages with the following commands:

      pyenv local 3.11.3
      python -m venv .venv
      source .venv/bin/activate
      pip install --upgrade pip
      pip install -r requirements.txt
- On Windows, create the virtual environment and install the required packages with the following commands.

  For the PowerShell CLI:

      pyenv local 3.11.3
      python -m venv .venv
      .venv\Scripts\Activate.ps1
      python -m pip install --upgrade pip
      pip install -r requirements.txt

  For the Git-Bash CLI:

      pyenv local 3.11.3
      python -m venv .venv
      source .venv/Scripts/activate
      python -m pip install --upgrade pip
      pip install -r requirements.txt

  Note: If you encounter an error when trying to run pip install --upgrade pip, try using the following command:

      python.exe -m pip install --upgrade pip
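Once the setup commands have run, it can help to confirm that the virtual environment is actually active before opening the notebook. The following is a minimal sketch, not part of this repository: the helper names `in_virtualenv` and `installed_version` are hypothetical, and `pip` is checked only as an example because the setup commands upgrade it. It assumes an environment created with `python -m venv`.

```python
import sys
from importlib import metadata


def in_virtualenv():
    """Return True when the interpreter runs inside a venv-style virtual environment."""
    # Inside a venv, sys.prefix points at the environment, while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


def installed_version(package):
    """Return the installed version string of a package, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    print("Python:", sys.version.split()[0])
    print("Inside virtual environment:", in_virtualenv())
    # "pip" is only an example; substitute any package listed in requirements.txt.
    print("pip:", installed_version("pip"))
```

If the second line prints `False`, the activation step was probably skipped in the current terminal session.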
The dataset for the notebook is stored in the data.zip archive. To unzip it directly in the terminal, run:

    unzip data.zip
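The `unzip` command may not be available in every shell, for example in PowerShell. As a possible alternative, the archive can be extracted with Python's standard `zipfile` module. This is a sketch under assumptions: the function name `extract_archive` is hypothetical and not part of this repository, and the script assumes it is run from the repository root, where data.zip is located.

```python
import zipfile
from pathlib import Path


def extract_archive(archive_path, target_dir="."):
    """Extract every member of a zip archive into target_dir and return their names."""
    with zipfile.ZipFile(Path(archive_path)) as zf:
        zf.extractall(target_dir)
        return zf.namelist()


if __name__ == "__main__":
    # "data.zip" is the archive named in this README.
    if Path("data.zip").exists():
        for name in extract_archive("data.zip"):
            print(name)
    else:
        print("data.zip not found in the current directory")
```

Running the script from the activated environment lists each extracted file, which also shows what the archive contains before you open the notebook.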