Add ROCm libraries benchmark tests #2261
base: main
Conversation
Utilities with automated system detection, results collection with API integration, and performance tracking with LKG comparison for ROCm libraries.

Features:
- Benchmark execution with system auto-detection
- Local JSON storage and API submission for results
- LKG (Last Known Good) comparison
- Modular utilities: config, system detection, and results handling
- Comprehensive logging with file rotation
- Environment variable support for configuration

Signed-off-by: Lenine Ajagappane <[email protected]>
geomin12
left a comment
Initial round of comments. There's a lot of code here that may have been needed before, but may not be needed now?
Also, a few architecture questions I have as well. Happy to chat about this via Teams or here!
```python
"hipblaslt_bench": {
    "job_name": "hipblaslt_bench",
    "fetch_artifact_args": "--blas --tests",
    "timeout_minutes": 60,
    "test_script": f"python {_get_script_path('test_hipblaslt_benchmark.py')}",
    # TODO(lajagapp): Add windows test
    "platform": ["linux"],
    "total_shards": 1,
},
```
This may be an issue, as this will get run during each PR, push to main, and scheduled run, probably resulting in long queue times and machine shortages.
I would imagine we want to add benchmark tests as a separate (perhaps nightly) run on separate machines? Is this the case? How frequently do we want to run these, and how long do they take?
```python
"fetch_artifact_args": "--blas --tests",
"timeout_minutes": 60,
"test_script": f"python {_get_script_path('test_hipblaslt_benchmark.py')}",
# TODO(lajagapp): Add windows test
```
For the "Add windows test" TODO, can we open a GitHub issue and link it here, so we can keep track?
```python
ACTIVATION_TYPE = "none"

# Load benchmark configuration
config_file = SCRIPT_DIR.parent / 'configs/benchmarks/hipblaslt.json'
```
```suggestion
config_file = SCRIPT_DIR.parent / 'configs' / 'benchmarks' / 'hipblaslt.json'
```
This will make it compatible with Windows machines.
```python
# Compare with LKG
log.info("Comparing results with LKG")
final_table = client.compare_results(test_name=BENCHMARK_NAME, table=table)
log.info(f"\n{final_table}")
```
You can also use `gha_append_step_summary` to append this table to the workflow run summary.
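For reference, a minimal sketch of what such a helper could look like, assuming it appends Markdown to the file pointed at by `GITHUB_STEP_SUMMARY` (this is a hypothetical stand-in; the repo's actual `gha_append_step_summary` may differ):

```python
import os


def append_step_summary(markdown: str) -> None:
    """Append Markdown to the GitHub Actions step summary, if available.

    Hypothetical helper for illustration; GitHub Actions exposes the
    summary file path via the GITHUB_STEP_SUMMARY environment variable.
    """
    summary_path = os.environ.get("GITHUB_STEP_SUMMARY")
    if not summary_path:
        return  # not running under GitHub Actions; nothing to do
    with open(summary_path, "a", encoding="utf-8") as f:
        f.write(markdown + "\n")
```

Writing the LKG comparison table here would make it visible directly on the workflow run page, without digging through logs.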
```python
NUM_ITERATIONS = 20  # Number of benchmark iterations

# Load benchmark configuration
config_file = SCRIPT_DIR.parent / 'configs/benchmarks/rocfft.json'
```
Same comment as above regarding the path.
```python
"""
self.api_url = api_url.rstrip('/')
self.fallback_url = fallback_url.rstrip('/') if fallback_url else None
self.api_key = api_key
```
Secrets in GitHub Actions are a bit tricky, particularly for forked PRs (forked PRs cannot pull data from secrets).
I personally think this should be run on a nightly basis, as I would imagine it takes quite a while to run (and there, it can pull secrets).
Happy to chat about the architecture here.
```python
"""
status_code = response.status_code

if status_code == 401:
```
Instead of these elongated error messages, could we just print out the error message given by the API, along with the status code?
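As a sketch of that idea (the `"message"` field name is an assumption about the API's error body, not something confirmed by this PR):

```python
import json


def format_api_error(status_code: int, body: str) -> str:
    """Summarize an API error as 'status code: server-provided message'.

    Prefers a 'message' field when the response body is JSON (field name
    is an assumption); falls back to the raw body text otherwise.
    """
    try:
        parsed = json.loads(body)
        detail = parsed.get("message", body) if isinstance(parsed, dict) else body
    except ValueError:  # body was not valid JSON
        detail = body
    return f"API error {status_code}: {detail}"
```

This keeps the log output to one line per failure while still surfacing whatever the server actually said.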
```python
Returns:
    Complete results payload for API submission
"""
# Build BM config
```
Can we clarify what "BM" stands for?
```python
return payload


def validate_payload(payload: Dict[str, Any]) -> bool:
```
We may be able to use some Python libraries to validate the payload :)
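Since `jsonschema` is already in requirements.txt, one way this could look (the schema fields below are illustrative assumptions, not the actual payload format):

```python
from jsonschema import ValidationError, validate

# Illustrative schema; the real payload's required fields are assumptions here.
PAYLOAD_SCHEMA = {
    "type": "object",
    "required": ["test_name", "results"],
    "properties": {
        "test_name": {"type": "string"},
        "results": {"type": "array", "items": {"type": "object"}},
    },
}


def validate_payload(payload: dict) -> bool:
    """Return True if the payload matches the expected schema."""
    try:
        validate(instance=payload, schema=PAYLOAD_SCHEMA)
        return True
    except ValidationError:
        return False
```

A declarative schema replaces hand-rolled field checks and gives precise error messages for free via `ValidationError`.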
```
prettytable>=3.0.0
requests>=2.28.0
jsonschema>=4.0.0
packaging>=21.0
```
Can we make the versions exact (pinned)?
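Pinned, the file would use `==` instead of `>=`; for example (version numbers below are illustrative, not verified against what CI currently uses):

```
prettytable==3.10.0
requests==2.32.3
jsonschema==4.23.0
packaging==24.1
```

Exact pins make CI runs reproducible; the trade-off is that dependency updates then require an explicit bump.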
Motivation
Add reusable test utilities for ROCm benchmark automation (ROCfft, ROCrand, and more) with standardized system detection, results collection, and LKG performance tracking.
Technical Details
Modular utility package:
Test Plan
Tested benchmark execution, system detection, API submission, and LKG comparison.
Test Result
Benchmarks execute successfully
System detection, API upload, and LKG comparison working correctly
Submission Checklist