feat(demo): add compute mesh observability demo with benchmarking and execution tracing by Avi-47 · Pull Request #1248 · mofa-org/mofa

Avi-47 · 2026-03-15T05:13:07Z

Summary

This PR is the third phase of #952 which introduces an end-to-end demo for the MoFA Compute Mesh that showcases:
• routing behavior across backends
• latency benchmarking
• execution trace visualization
• architecture documentation

It consolidates functionality previously implemented across three demo PRs:

demo(compute-mesh): add latency benchmark to local_compute_mesh_demo #1073 — latency benchmarking
demo(compute-mesh): add execution trace visualization to local_compute_mesh_demo #1077 — execution trace visualization
docs(compute-mesh): add architecture diagram and execution flow documentation for local_compute_mesh_demo #1087 — architecture documentation

The result is a single runnable example demonstrating how requests flow through the compute mesh pipeline while exposing performance metrics and execution traces.

No core framework logic is modified.
All changes are confined to the example/demo layer.

Motivation

While the compute mesh infrastructure exists in MoFA, contributors and new developers currently lack a simple way to see how the system behaves end-to-end.

Specifically it is difficult to observe:

• how routing policies select backends
• how inference requests move through the pipeline
• how token streaming behaves
• how latency differs across routing strategies

This demo addresses those gaps by providing a runnable example that makes the entire pipeline visible.

The example demonstrates how workflow execution, routing, backend selection, streaming, and metrics collection interact in a single execution flow.
This demo provides a reference implementation for the Compute Mesh architecture and helps contributors understand how routing, inference, and observability work together in practice.

Features Implemented

1. Latency Benchmarking

The demo collects real-time metrics during inference execution.

The following metrics are reported:

latency_ms
total time from request start to completion

time_to_first_token_ms
time until the first token appears

tokens_streamed
number of tokens produced

tokens_per_second
token generation throughput

total_time_ms
total duration of token streaming

These metrics make it easy to compare routing strategies such as:

LocalFirstWithCloudFallback
LocalOnly
CloudOnly

2. Execution Trace Visualization

The demo adds execution tracing so developers can observe how requests move through the compute mesh pipeline.

Trace events include:

workflow.start
router.policy
router.backend_selection
inference.start
streaming.tokens
metrics.latency_ms
workflow.complete

The trace output makes the internal execution flow visible and can optionally be exported as JSON for external observability tools.

3. Architecture Documentation

The demo now includes detailed documentation explaining the compute mesh architecture and execution lifecycle.

The documentation provides:

• a visual pipeline overview
• explanation of routing policies
• execution lifecycle stages
• example trace output
• walkthrough of how requests travel through the system

This makes the compute mesh easier to understand for new contributors.

Architecture Overview

Demo Walkthrough

Running the Demo

cargo run -p local_compute_mesh_demo --manifest-path examples/Cargo.toml -- "Explain photosynthesis"

Example Output

[workflow] executing step: generate_response
[router] policy: LocalFirstWithCloudFallback
[router] selected backend: local
[stream] This
[stream] is
...
[metrics] latency_ms = 365
[metrics] time_to_first_token_ms = 0
[metrics] tokens_streamed = 10
[metrics] tokens_per_second = 27.4
[metrics] total_time_ms = 365

==== Compute Mesh Execution Trace ====

[trace] workflow.start
[trace] router.policy = LocalFirstWithCloudFallback
[trace] router.backend_selection = local
[trace] inference.start
[trace] streaming.tokens = token_1
...
[trace] metrics.latency_ms = 365
[trace] workflow.complete

Testing Instructions

Build the demo:

cargo build -p local_compute_mesh_demo --manifest-path examples/Cargo.toml

Run the demo:

cargo run -p local_compute_mesh_demo --manifest-path examples/Cargo.toml -- "Explain photosynthesis"

Verify metrics output shows:
- latency_ms
- time_to_first_token_ms
- tokens_streamed
- tokens_per_second
- total_time_ms
Verify trace output shows:
- workflow.start
- router.policy
- router.backend_selection
- inference.start
- streaming.tokens
- metrics.latency_ms
- workflow.complete

Example Output

Performance Metrics

backend: local
latency_ms: 365
time_to_first_token_ms: 0
tokens_streamed: 10
tokens_per_second: 27.4
total_time_ms: 365

Execution Trace (JSON)

{
  "request_id": "uuid-here",
  "stages": [
    {"stage": "workflow.start", "timestamp_ms": 1700000000000},
    {"stage": "router.policy", "detail": "LocalFirstWithCloudFallback", "timestamp_ms": 1700000000005},
    {"stage": "router.backend_selection", "detail": "local", "timestamp_ms": 1700000000010},
    {"stage": "inference.start", "timestamp_ms": 1700000000015},
    {"stage": "streaming.tokens", "detail": "token_1", "timestamp_ms": 1700000000020},
    {"stage": "metrics.latency_ms", "detail": "365", "timestamp_ms": 1700000000365},
    {"stage": "workflow.complete", "timestamp_ms": 1700000000370}
  ]
}

Screenshots

Breaking Changes

None. This is a new demo package that doesn't affect existing functionality.

Checklist

Demo builds successfully
Demo runs with example prompt
Latency benchmarking implemented with all required metrics
Execution trace visualization implemented with all required events
Architecture documentation with diagrams
No core framework changes (only demo files)

Files Changed

examples/Cargo.toml                          # Added demo to workspace
examples/local_compute_mesh_demo/Cargo.toml  # New demo package
examples/local_compute_mesh_demo/README.md   # Architecture documentation
examples/local_compute_mesh_demo/src/main.rs # Demo implementation
examples/local_compute_mesh_demo/workflow.yaml # Demo workflow config

… tracing

Avi-47 · 2026-03-15T05:46:56Z

Hi @lijingrs and @BH3GEI,
Just a quick ping when you have time.
This PR adds observability to the compute mesh demo, including:

latency benchmarking
execution trace visualization
architecture documentation
Together with feat(compute-mesh): end-to-end local compute mesh demo pipeline #1233 and feat(compute-mesh): routing policies and streaming support for demo #1234 it completes a small end-to-end demo showing routing, streaming, and observability for the compute mesh.
All changes are confined to the example layer and CI checks are passing.
Happy to refine anything if maintainers prefer a different structure. Thanks!

PrinceGautam2106 · 2026-03-15T08:56:08Z

/assign

feat(demo): add compute mesh observability demo with benchmarking and…

d7a5ec6

… tracing

Avi-47 marked this pull request as ready for review March 15, 2026 05:14

Avi-47 force-pushed the demo/compute-mesh-observability branch from 72756a8 to d7a5ec6 Compare March 22, 2026 10:01

Merge branch 'main' into demo/compute-mesh-observability

94599d4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(demo): add compute mesh observability demo with benchmarking and execution tracing#1248

feat(demo): add compute mesh observability demo with benchmarking and execution tracing#1248
Avi-47 wants to merge 2 commits intomofa-org:mainfrom
Avi-47:demo/compute-mesh-observability

Avi-47 commented Mar 15, 2026 •

edited

Loading

Uh oh!

Avi-47 commented Mar 15, 2026

Uh oh!

PrinceGautam2106 commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Avi-47 commented Mar 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Features Implemented

1. Latency Benchmarking

2. Execution Trace Visualization

3. Architecture Documentation

Architecture Overview

Demo Walkthrough

Running the Demo

Example Output

Testing Instructions

Example Output

Performance Metrics

Execution Trace (JSON)

Screenshots

Breaking Changes

Checklist

Files Changed

Uh oh!

Avi-47 commented Mar 15, 2026

Uh oh!

PrinceGautam2106 commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Avi-47 commented Mar 15, 2026 •

edited

Loading