GitHub - eval-protocol/digital_store_app: Mock digitial store app based on chinooks database

Digital Store App (Eval Protocol + MCP)

Build and test a database-aware storefront assistant using Eval Protocol and a Postgres MCP server. This repo follows a test-driven agent development workflow.

For background and a walkthrough, see the blog post: Test-Driven Agent Development with Eval Protocol.

Requirements

Python 3.10+
Docker (for Postgres + MCP server)
uv (recommended) or pip

Setup

Create and activate a virtual environment

uv venv .venv
source .venv/bin/activate

Install in editable mode

uv pip install -e .

(Optional) Start Postgres + MCP server

# Starts Postgres (Chinook) and the Postgres MCP server
docker compose up -d

Running tests

Fast local tests (no external model calls):

pytest -q

Full MCP/agent integration tests (require Docker up and a model key):

export RUN_MCP_EVAL=1
export FIREWORKS_API_KEY=your_fireworks_api_key
pytest -q

Run a single test with a summary line printed:

EP_PRINT_SUMMARY=1 pytest tests/pytest/test_storefront_agent_eval.py::test_storefront_agent_browse -q

Emit a JSON summary artifact for CI:

EP_SUMMARY_JSON=artifacts/ pytest -q
# writes JSON files under ./artifacts/

Useful environment variables

RUN_MCP_EVAL=1: enable MCP/agent integration test suite
FIREWORKS_API_KEY: API key for Fireworks models used in agent tests
EP_PRINT_SUMMARY=1: print a concise summary line to stdout
EP_SUMMARY_JSON=: write machine-readable summary JSON(s)
EP_MAX_DATASET_ROWS=<N|none>: clamp dataset/messages length per run

Services

docker-compose.yml defines:
- db: Postgres 16 with Chinook schema/data
- mcp: Postgres MCP server exposing tools (e.g., execute_sql) on port 8010

Project structure

tests/pytest/: evaluation tests (batch and pointwise)
prompts/: system prompt(s)
external/: third-party assets (Chinook database SQL, MCP server repo)
scripts/: helper scripts (MCP proxy, etc.)

Troubleshooting

Editable install errors about "Multiple top-level packages" were resolved by explicitly disabling package discovery in pyproject.toml.
If MCP tests fail to connect, ensure docker compose ps shows both db and mcp healthy.
The agent tests hit real models—credentials and network access are required.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
docker-entrypoint-initdb.d		docker-entrypoint-initdb.d
external		external
prompts		prompts
scripts		scripts
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
mcp.json		mcp.json
project.md		project.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Digital Store App (Eval Protocol + MCP)

Requirements

Setup

Running tests

Useful environment variables

Services

Project structure

Troubleshooting

About

Uh oh!

Releases

Packages

Languages

License

eval-protocol/digital_store_app

Folders and files

Latest commit

History

Repository files navigation

Digital Store App (Eval Protocol + MCP)

Requirements

Setup

Running tests

Useful environment variables

Services

Project structure

Troubleshooting

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages