feat: add task for in-cluster load test #4007

RodrigoVillar · 2025-06-10T18:57:38Z

Why this should be merged

This PR adds a task allowing for running load tests within a local kind cluster.

How this works

The task does the following:

Start a kind cluster if one has not been started yet
Create an image that runs the load test and deploys it to a local registry
Create a pod manifest for the load test and deploys the pod to the kind cluster

How this was tested

The load test was run with task test-load-kind-cluster and passed.

Need to be documented in RELEASES.md?

N/A

Taskfile.yml

maru-ava · 2025-06-11T18:16:00Z

Taskfile.yml

@@ -244,6 +244,10 @@ tasks:
    cmds:
      - cmd: go run ./tests/load/c/main --runtime=kube --kube-use-exclusive-scheduling {{.CLI_ARGS}}

+  test-load-kind-cluster:


Maybe choose a name more reflective of the test running inside the cluster? 'kind cluster' isn't very specific.

Done: d9fdb87

maru-ava · 2025-06-11T18:20:12Z

scripts/build_image.sh

@@ -10,6 +10,7 @@ set -euo pipefail
 # DOCKER_IMAGE=avaplatform/avalanchego ./scripts/build_image.sh                       # Build and push multi-arch image to docker hub
 # DOCKER_IMAGE=localhost:5001/avalanchego ./scripts/build_image.sh                    # Build and push multi-arch image to private registry
 # DOCKER_IMAGE=localhost:5001/avalanchego FORCE_TAG_LATEST=1 ./scripts/build_image.sh # Build and push image to private registry with tag `latest`
+# DOCKERFILE="./Dockerfile" ./scripts/build_image.sh                                    # Build image with a custom Dockerfile


(No action required) Why is it desirable to customize this build script instead of following the example of scripts/build_bootstrap_monitor_image.sh?

Note that a compiled binary is suggested rather than using 'go run' at runtime.

maru-ava · 2025-06-11T18:21:10Z

scripts/tests.load.kind.sh

+fi
+
+# Start kind cluster
+./scripts/start_kind_cluster.sh


Suggest passing arguments as per the example of other kind-using scripts.

Done: 387e4ef

maru-ava · 2025-06-11T18:22:18Z

scripts/tests.load.kind.sh

+metadata:
+  name: load-test
+  namespace: tmpnet
+rules:


(No action required) How did you arrive at these permissions?

maru-ava · 2025-06-11T18:39:50Z

tests/load/prometheus.go

@@ -101,5 +101,9 @@ func (s *MetricsServer) GenerateMonitoringConfig(monitoringLabels map[string]str
 		return "", err
 	}

+	if err := os.MkdirAll(filepath.Dir(collectorFilePath), 0o755); err != nil {


While this addition might avoid an error, the fact that the path does not exist is a symptom of a larger problem: collection is not being configured in the pod. Given the requirement to label test workload metrics with network uuid, which isn't known at the time of pod deployment, I think deployment of local prometheus collector would be suggested so that tmpnet configure it. That would mean setting the collector credentials to the pod - easy enough - but also ensuring the availability of a compatible version of prometheus so that tmpnet could start it.

Maybe coordinate with Elvis to see what the timeline is for getting ARC online? Other than as a learning exercise, I'm less convinced of the wisdom of supporting pod-based workloads if it requires not just publishing an image and that image being complex to build. CI-launched tests won't need to publish images, and don't need extra work to support workload monitoring. Local iteration would likely be easier to support via enabling external access to nodes via a proxy instead of forwarding.

github-actions · 2025-07-13T00:00:36Z

This PR has become stale because it has been open for 30 days with no activity. Adding the lifecycle/frozen label will cause this PR to ignore lifecycle events.

RodrigoVillar self-assigned this Jun 10, 2025

RodrigoVillar added the testing This primarily focuses on testing label Jun 10, 2025

github-project-automation bot added this to avalanchego Jun 10, 2025

Base automatically changed from in-cluster-test-fix to master June 11, 2025 11:08

RodrigoVillar added 8 commits June 11, 2025 08:13

chore: tmpnetctl

f58ff3a

fix: tmpnetctl

01248dc

docs: register

62c0ec9

doc: docstring => comment

bb4b582

chore: reduce diff

7f9bc71

feat: add task for in-cluster load test

ae297ad

chore: lint

4f63437

docs: update build_image.sh

8c66465

RodrigoVillar force-pushed the add-kind-cluster-task branch from e53c1ad to 8c66465 Compare June 11, 2025 12:14

RodrigoVillar commented Jun 11, 2025

View reviewed changes

Taskfile.yml Show resolved Hide resolved

RodrigoVillar marked this pull request as ready for review June 11, 2025 12:45

RodrigoVillar requested a review from maru-ava as a code owner June 11, 2025 12:45

RodrigoVillar requested a review from Elvis339 June 11, 2025 12:45

maru-ava reviewed Jun 11, 2025

View reviewed changes

RodrigoVillar added 2 commits June 11, 2025 15:47

chore: pass args to kind cluster

387e4ef

style: rename task

d9fdb87

RodrigoVillar mentioned this pull request Jun 25, 2025

[tmpnet] Run kube load test under a service account to validate RBAC #4030

Merged

github-actions bot added the lifecycle/stale label Jul 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add task for in-cluster load test #4007

feat: add task for in-cluster load test #4007

RodrigoVillar commented Jun 10, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

maru-ava Jun 11, 2025

Uh oh!

RodrigoVillar Jun 11, 2025

Uh oh!

maru-ava Jun 11, 2025

Uh oh!

maru-ava Jun 11, 2025

Uh oh!

RodrigoVillar Jun 11, 2025

Uh oh!

maru-ava Jun 11, 2025

Uh oh!

maru-ava Jun 11, 2025

Uh oh!

github-actions bot commented Jul 13, 2025

Uh oh!

Uh oh!

feat: add task for in-cluster load test #4007

Are you sure you want to change the base?

feat: add task for in-cluster load test #4007

Conversation

RodrigoVillar commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why this should be merged

How this works

How this was tested

Need to be documented in RELEASES.md?

Uh oh!

Uh oh!

Uh oh!

maru-ava Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

RodrigoVillar Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

maru-ava Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

maru-ava Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

RodrigoVillar Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

maru-ava Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

maru-ava Jun 11, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Jul 13, 2025

Uh oh!

Uh oh!

RodrigoVillar commented Jun 10, 2025 •

edited

Loading