A framework for benchmarking MongoDB-compatible databases using Locust. Compare performance across different database engines (MongoDB, Atlas, Azure DocumentDB, AWS DocumentDB) and deployment configurations (single-node, sharded, replicated).
```
documentdb-benchmarks/
├── benchmark_runner/            # Run benchmarks and collect metrics
│   ├── runner.py                # Main runner - orchestrates Locust execution
│   ├── config.py                # YAML config loading + CLI argument parsing
│   ├── base_benchmark.py        # MongoUser base class for all benchmarks
│   ├── data_generators/         # Shared document generators (used by all benchmarks)
│   │   └── document_256byte.py  # ~256-byte documents with standard field schema
│   └── benchmarks/              # Individual benchmark definitions
│       ├── insert/              # Insert (write) performance variants
│       └── count/               # Count/aggregation performance variants
├── benchmark_analyzer/          # Analyze and compare results across runs
│   ├── analyzer.py              # CLI for analysis and comparison
│   ├── report_loader.py         # Load Locust CSV + metadata files
│   ├── comparator.py            # Compare runs across scenarios or databases
│   └── report_generator.py      # Generate console/HTML/CSV reports
├── config/                      # Example configuration files
│   ├── insert/                  # Insert benchmark configs
│   └── count/                   # Count/aggregation benchmark configs
├── pyproject.toml
└── README.md
```
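`document_256byte.py` supplies the documents the write benchmarks insert. As a rough illustration of the idea, here is a minimal sketch of a fixed-size document generator — the field names and padding strategy below are assumptions for illustration, not the module's actual schema:

```python
import json
import random
import string
import time

TARGET_SIZE = 256  # approximate serialized size in bytes


def generate_document(seq: int) -> dict:
    """Build a document whose JSON serialization is ~256 bytes.

    Field names here are illustrative; the real generator defines
    its own standard field schema.
    """
    doc = {
        "seq": seq,
        "created_at": time.time(),
        "user_id": "".join(random.choices(string.ascii_lowercase, k=12)),
        "payload": "",
    }
    # Pad the payload so the serialized document lands on the target size.
    base = len(json.dumps(doc).encode("utf-8"))
    doc["payload"] = "x" * max(0, TARGET_SIZE - base)
    return doc


print(len(json.dumps(generate_document(1)).encode("utf-8")))  # → 256
```

Padding against the measured base size keeps the document size stable even though the timestamp and random fields vary in length.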
```shell
# Clone and install
cd documentdb-benchmarks
pip install -e '.[dev]'

# Enable the pre-commit hook to prevent accidental credential commits
git config core.hooksPath .githooks
```

Dev container users: both steps above run automatically via `postCreateCommand`; no manual setup is needed.
```shell
# Using a config file (recommended)
python -m benchmark_runner --config config/insert/insert_no_index.yaml \
    --database-engine mongodb

# With CLI overrides
python -m benchmark_runner --config config/insert/insert_unique_index.yaml \
    --database-engine mongodb \
    --mongodb-url "mongodb://myhost:27017" \
    --users 20 \
    --run-time 120s

# Using the installed entry point
bench-run --config config/insert/insert_no_index.yaml --database-engine mongodb
```

```shell
# View comparison on console
python -m benchmark_analyzer.analyzer --results-dir results/insert

# List all discovered runs
python -m benchmark_analyzer.analyzer --results-dir results/ --list-runs

# Compare across database engines
python -m benchmark_analyzer.analyzer --results-dir results/insert \
    --group-by database_engine --output insert_comparison.html

# Compare across configurations
python -m benchmark_analyzer.analyzer --results-dir results/insert \
    --group-by run_label --output config_comparison.html

# Export as CSV for spreadsheet analysis
python -m benchmark_analyzer.analyzer --results-dir results/ \
    --format csv --output comparison.csv

# Using the installed entry point
bench-analyze --results-dir results/insert --output report.html
```

See CONTRIBUTING.md for a full guide on writing new benchmarks, the MongoUser base class API, running tests, and code style guidelines.
Each benchmark run generates:
| File | Description |
|---|---|
| `{prefix}_stats.csv` | Summary statistics per operation |
| `{prefix}_stats_history.csv` | Time-series statistics (every 5s) |
| `{prefix}_failures.csv` | Failure details |
| `{prefix}_report.md` | Markdown report |
| `{prefix}_metadata.json` | Run configuration and metadata (used by analyzer) |
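The metadata file is what lets the analyzer group and label runs. A hypothetical example — the field names mirror the runner's configuration options, but the exact schema is defined by `report_loader.py`:

```json
{
  "benchmark_name": "insert_no_index",
  "database_engine": "mongodb",
  "run_label": "MongoDB 7.0",
  "users": 10,
  "spawn_rate": 5,
  "run_time": "60s"
}
```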
- Define benchmark once — write a benchmark module and base config
- Run against each target — execute with different connection strings and labels:
  ```shell
  # MongoDB
  bench-run -c config/insert/insert_unique_index.yaml \
      --mongodb-url "mongodb://mongo:27017" \
      --database-engine mongodb --run-label "MongoDB 7.0"

  # Azure DocumentDB
  bench-run -c config/insert/insert_unique_index.yaml \
      --mongodb-url "mongodb://azure:10255/?ssl=true" \
      --database-engine azure-documentdb --run-label "Azure DocumentDB"

  # Atlas
  bench-run -c config/insert/insert_unique_index.yaml \
      --mongodb-url "mongodb+srv://atlas.example.net" \
      --database-engine atlas --run-label "Atlas M10"
  ```
- Compare results — generate a unified comparison report:
  ```shell
  bench-analyze -d results/insert --group-by database_engine -o comparison.md
  ```
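The per-target step lends itself to a small wrapper script. A sketch along these lines — the engine names, URLs, and labels are placeholders, and the helper function is hypothetical:

```shell
#!/bin/sh
# Run the same benchmark config against several targets.
CONFIG=config/insert/insert_unique_index.yaml

run_target() {
    engine=$1; url=$2; label=$3
    # Echoes the command for a dry run; drop "echo" to execute for real.
    echo bench-run -c "$CONFIG" \
        --mongodb-url "$url" \
        --database-engine "$engine" --run-label "$label"
}

run_target mongodb          "mongodb://mongo:27017"           "MongoDB 7.0"
run_target azure-documentdb "mongodb://azure:10255/?ssl=true" "Azure DocumentDB"
```

Because `--run-label` differs per invocation, the analyzer can later group the results by either label or engine.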
Both deployment scripts read a shared deploy/pipeline.config file that defines
database engines, benchmark configs, and environment-specific settings.
Runs benchmarks in Docker containers on the local machine:

```shell
./deploy/run-local.sh deploy/pipeline.config
```

Runs benchmarks serverlessly in ACI. Requires the Azure CLI (`az login`):

```shell
./deploy/run-aci.sh deploy/pipeline.config
```

Configure database engines and benchmarks in `deploy/pipeline.config`:

```ini
# Global settings
cpu=2
memory=4g
results_dir=./results

# Locust concurrency overrides (optional).
# When set, these override the per-benchmark YAML config values.
# users=10
# spawn_rate=5
# run_time=60s

# Docker-specific
[docker]
network=auto

# ACI-specific
[aci]
resource_group=benchmarks-rg
location=eastus

# Database engines to benchmark against
[database_engines]
mongodb=mongodb://mongodb:27017
# atlas=mongodb+srv://user:pass@cluster.mongodb.net

# Benchmarks to run (one per line — config/ is prepended automatically)
[benchmarks]
insert/insert_no_index.yaml
insert/insert_unique_index.yaml
```

Results are organized by engine under a timestamped run directory:
`./results/YYYYMMDD-NNN/<engine_name>/`.
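For example, a pipeline run against two engines might produce a layout like the following (the date, run number, and file prefix are illustrative):

```
results/
└── 20240101-001/
    ├── mongodb/
    │   ├── insert_no_index_stats.csv
    │   ├── insert_no_index_metadata.json
    │   └── ...
    └── atlas/
        └── ...
```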
| Field | CLI Flag | Default | Description |
|---|---|---|---|
| `mongodb_url` | `--mongodb-url` | `mongodb://localhost:27017` | Connection string |
| `database` | `--database` | `benchmark_db` | Database name |
| `collection` | `--collection` | `benchmark_collection` | Collection name |
| `benchmark_name` | `--benchmark-name` | (required) | Name for this benchmark |
| `benchmark_module` | `--benchmark-module` | (required) | Python module (e.g. `benchmarks.insert_benchmark`) |
| `run_label` | `--run-label` | (from engine) | Label for grouping results |
| `database_engine` | `--database-engine` | (required) | Engine identifier (e.g. `mongodb`, `atlas`, `azure-documentdb`) |
| `users` | `--users` / `-u` | `10` | Concurrent Locust users |
| `spawn_rate` | `--spawn-rate` / `-r` | `5` | Users spawned per second |
| `run_time` | `--run-time` / `-t` | `60s` | Test duration (`60s`, `5m`, `1h`) |
| `output_dir` | `--output-dir` / `-o` | `results` | Output directory |
| `workload_params` | (config only) | `{}` | Benchmark-specific parameters |
| `imports` | (config only) | (none) | Parent config file (relative path); values are deep-merged |
Each benchmark category defines its own workload_params in its base YAML config
(e.g. config/insert/insert_base.yaml, config/count/count_base.yaml).
Refer to the base config and the benchmark module docstrings for the full list of
available parameters and defaults.
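As an illustration, a derived config might pull in its category's base file via `imports` and override only a few values. The top-level keys below come from the configuration reference above, but the `workload_params` entries are hypothetical — consult the base config for the real ones:

```yaml
# config/insert/insert_unique_index.yaml (illustrative sketch)
imports: insert_base.yaml   # values below are deep-merged over the base

benchmark_name: insert_unique_index
benchmark_module: benchmarks.insert_benchmark

users: 10
spawn_rate: 5
run_time: 60s

workload_params:
  # Benchmark-specific; these keys are placeholders, not the actual schema.
  unique_index: true
```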