Skip to content

DTS1234/Programming-for-data-science-UPM-course-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Programming for data science

This project is the implementation of the final project for UPM Programming for Data Science course.

The project contains three directories:

  • Analysis - this part covers data analysis part and it contains only Portfolio_data_analysis.ipynb notebook file
  • Portfolio - here we covered the data generation part, it contains portfolio.py, portfolio_performance.py and test for portfolio performance, as well two csv files generated after running portfolio_performance.py file.
  • Scraping - this directory contains data_scraping.py file with the scrip for the first part of the assigment as well as chromedriver.exe required to run selenium code. More than that after executing data_scraping.py there should appear csv files with the data scraped.

In order to run the code it's needed to run the following python files:

  1. data_scraping.py to scrape the data
  2. portfolio_performance.py for the data generation part

Finally, to view the data analysis notebook, run the Portfolio_data_analysis.ipynb file in jupyter notebook.

The following anaconda environment with python 3.10 version, and the following packages installed is required:

Package Version


jsonschema 4.4.0 jupyter-client 7.1.2 jupyter-core 4.9.2 jupyterlab-pygments 0.1.2 lxml 4.8.0 matplotlib 3.5.1 matplotlib-inline 0.1.2 notebook 6.4.8 numpy 1.21.5 pandas 1.4.1 pip 21.2.4 pipenv 2021.5.29 scipy 1.7.3 seaborn 0.11.2 selenium 3.141.0 virtualenv 20.4.7 webdriver-manager 3.5.4

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •