Skip to content

samarthk/OpenSF-Apache-Spark

 
 

Repository files navigation

OpenSF-Apache-Spark

Spark Logo + SF Open Data Logo

Exploring the City of San Francisco public data with Apache Spark 2.0

Fireworks

The SF OpenData project was launched in 2009 and contains hundreds of datasets from the city and county of San Francisco. Open government data has the potential to increase the quality of life for residents, create more efficient government services, better public decisions, and even new local businesses and services.

APACHE SPARK:

Spark is a unified processing engine that can analyze big data using SQL, machine learning, graph processing or real time stream analysis:

Spark Engines

Spark Goal

About

Exploring the City of San Francisco public data with Apache Spark 2.0

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 54.6%
  • Python 45.4%