This repo tries to replicate databricks runtime, plus feature-rich jupyter/docker-stacks.
- (Almost) Everything in the
jupyter/all-spark-notebookand (eventually)jupyter/r-notebookimages, and their ancestor images. (Inheritance tree) delta-lakeanddelta-spark
- All additional packages from
jupyter/r-notebook. - All additional packages that are on top of Databricks runtime dependencies tree (10.3 ML GPU runtime)
xgboostand Spark distribution ofxgboost(Waiting for this PR)hyperopt
Check .env.template for environment variables template, or modify and copy these lines
echo "JUPYTER_PATH=<path-to-notebook-directory>" > .env
echo "NB_UID=`id -u`" >> .env
echo "NB_GID=`id -g`" >> .envGet path-to-notebook-directory using pwd in the notebook directory
docker-compose up -d