The Modern ML Monitoring Mess: Failure Modes in Extending Prometheus

Accompanying blog post here.

This WIP project (Monitoring Extension) aims to benchmark open-source ML monitoring tools. Tools in the benchmark include:

Prometheus

ML Task and Pipeline Architecture

Data source

We use the NYC taxicab data, which has been migrated from a bucket of flat files to an AWS RDS instance via the TTB project.

Feedback lag

To simulate lag that a real-world system might experience, we inject a delay sampled from a Gaussian distribution. (TODO: shreyashankar)
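As a rough illustration of the idea above, the sketch below draws a delay from a Gaussian and adds it to a prediction timestamp to get the time at which the label would arrive. The function names, the mean and standard deviation, and the clipping of negative draws to zero are all assumptions made for this example, not values taken from the project.

```python
import random
from datetime import datetime, timedelta


def sample_feedback_delay(rng, mean_s=3600.0, std_s=900.0):
    """Draw a feedback delay from a Gaussian, clipped so it is never negative.

    mean_s and std_s are hypothetical defaults (one hour, fifteen minutes).
    """
    seconds = rng.gauss(mean_s, std_s)
    # A Gaussian can produce negative values; feedback cannot arrive
    # before the prediction, so clip at zero (an assumption of this sketch).
    return timedelta(seconds=max(0.0, seconds))


def feedback_arrival_time(prediction_time, rng):
    """Return the simulated time at which feedback for a prediction arrives."""
    return prediction_time + sample_feedback_delay(rng)


rng = random.Random(0)  # seeded so the simulation is repeatable
predicted_at = datetime(2020, 2, 1, 12, 0, 0)
label_at = feedback_arrival_time(predicted_at, rng)
```

Passing in a seeded `random.Random` instance keeps runs repeatable, which helps when comparing monitoring tools on the same simulated lag.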

Pipeline

The ML task is to predict whether a passenger will give a taxi driver a sizeable tip (10% or more). Pipeline components are defined in the components folder and are called in train.py to train a model on Jan 2020 data. The inference code is in inference/main.py, which runs the model on data in 2-day increments from Feb 1 2020 to May 31 2020.
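The label definition and the 2-day inference schedule described above can be sketched as follows. This is not the code in inference/main.py; the helper names are hypothetical, and the sketch assumes the 10% threshold is measured against the fare amount and that May 31 is included in the final window.

```python
from datetime import date, timedelta


def is_high_tip(fare_amount, tip_amount, threshold=0.10):
    """Label a ride as a sizeable tip when the tip is at least 10% of the fare.

    Measuring the tip against the fare is an assumption of this sketch.
    """
    if fare_amount <= 0:
        return False
    return tip_amount >= threshold * fare_amount


def inference_windows(start, end_inclusive, step_days=2):
    """Split [start, end_inclusive] into half-open windows of step_days days."""
    stop = end_inclusive + timedelta(days=1)  # exclusive upper bound
    windows = []
    cursor = start
    while cursor < stop:
        # The last window is shortened if the range is not a multiple of step_days.
        window_end = min(cursor + timedelta(days=step_days), stop)
        windows.append((cursor, window_end))
        cursor = window_end
    return windows


windows = inference_windows(date(2020, 2, 1), date(2020, 5, 31))
```

Each tuple is a `(start, end)` pair with an exclusive end, so consecutive windows never overlap.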

Prometheus Extension

We use two Gauge metrics, one for model outputs and one for feedback, and aggregate them in PromQL to compute accuracy. Both metrics are defined in lib/prometheus_ml_ext.py. Read the accompanying blog post for more details.
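To make the aggregation concrete, here is a plain-Python sketch of what joining an output series with a feedback series and computing accuracy amounts to. It does not use Prometheus or reproduce lib/prometheus_ml_ext.py; the dictionaries stand in for the two gauges, and the prediction identifiers, the 0.5 decision threshold, and the function name are assumptions made for illustration.

```python
def accuracy_from_gauges(outputs, feedback, threshold=0.5):
    """Compute accuracy over predictions that have received feedback.

    outputs:  maps a prediction id to the model's predicted probability.
    feedback: maps a prediction id to the true label (0 or 1).
    Both mappings stand in for the two gauges; ids are hypothetical.
    """
    # Only predictions whose feedback has arrived can be scored.
    matched = [key for key in outputs if key in feedback]
    if not matched:
        return None  # no feedback yet, so accuracy is undefined
    correct = sum(
        1 for key in matched
        if (outputs[key] >= threshold) == bool(feedback[key])
    )
    return correct / len(matched)


outputs = {"a": 0.9, "b": 0.2, "c": 0.7, "d": 0.6}
feedback = {"a": 1, "b": 0, "c": 0}  # "d" has no feedback yet
accuracy = accuracy_from_gauges(outputs, feedback)
```

Prediction "d" is left out of the denominator because its feedback has not arrived, which is the situation the simulated feedback lag creates.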

