Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
33 changes: 22 additions & 11 deletions projects/rocprofiler-compute/CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,39 +2,50 @@

Full documentation for ROCm Compute Profiler is available at [https://rocm.docs.amd.com/projects/rocprofiler-compute/en/latest/](https://rocm.docs.amd.com/projects/rocprofiler-compute/en/latest/).

## Unreleased
## ROCm Compute Profiler 3.4.0 for ROCm 7.2.0

### Added

* Add `--list-blocks <arch>` option to general options to list available IP blocks on specified arch (similar to `--list-metrics`), cannot be used with `--block`.
* `--list-blocks <arch>` option to general options. It lists the available IP blocks on the specified arch (similar to `--list-metrics`), however cannot be used with `--block`.

* Added `config_delta/gfx950_diff.yaml` to analysis config yamls to track the revision between a gfx9 architecture against the latest supported architecture gfx950
* `config_delta/gfx950_diff.yaml` to analysis config YAMLs to track the revision between the gfx9xx GPUs against the latest supported gfx950 GPUs.

* Analysis db features
* Add support for per kernel metrics analysis.
* Add support for dispatch timeline analysis.
* Show duration as median in addition to mean in kernel view.
* Adds support for per kernel metrics analysis.
* Adds support for dispatch timeline analysis.
* Shows duration as median in addition to mean in kernel view.

* Implement AMDGPU driver info and GPU VRAM attributes in system info. section of analysis report.

* Added `CU Utilization` metric to display the percentage of CUs utilized during kernel execution.

### Changed

* `-b/--block` accepts block alias(es) (See block aliases using command-line option `--list-blocks <arch>`).
* `-b/--block` accepts block alias(es). See block aliases using command-line option `--list-blocks <arch>`.

* analysis configs yamls are now managed with the new config management workflow in `tools/config_management/`
* Analysis configs YAMLs are now managed with the new config management workflow in `tools/config_management/`.

* `amdsmi` python API is used instead of `amd-smi` CLI to query GPU specifications.

* Empty cells replaced with `N/A` for unavailable metrics in analysis.


### Deprecated

* `Active CUs` metric has been deprecated and replaced by `CU Utilization`.

### Removed
* Removed `database` mode from `rocprofiler-compute`. This is to move our focus from grafana
and mongodb integration to other visualization methods such as:
* Analysis DB based Visualizer (upcoming)

* Removed `database` mode from ROCm Compute Profiler in favor of other visualization methods, rather than Grafana and MongoDB integration, such as the upcoming Analysis DB-based Visualizer.
* Plotly server based standalone GUI
* Commandline based Textual User Interface

### Optimized

### Resolved issues

* Fixed sL1D metric values showing up as N/A in memory chart diagram

### Known issues

### Upcoming changes
Expand Down
27 changes: 21 additions & 6 deletions projects/rocprofiler-compute/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -283,10 +283,19 @@ add_test(
)

add_test(
NAME test_profile_roofline
NAME test_profile_roofline_1
COMMAND
${Python3_EXECUTABLE} -m pytest -m roofline
--junitxml=tests/test_profile_roofline.xml ${COV_OPTION}
${Python3_EXECUTABLE} -m pytest -m roofline_1
--junitxml=tests/test_profile_roofline_1.xml ${COV_OPTION}
${PROJECT_SOURCE_DIR}/tests/test_profile_general.py
WORKING_DIRECTORY ${PROJECT_SOURCE_DIR}
)

add_test(
NAME test_profile_roofline_2
COMMAND
${Python3_EXECUTABLE} -m pytest -m roofline_2
--junitxml=tests/test_profile_roofline_2.xml ${COV_OPTION}
${PROJECT_SOURCE_DIR}/tests/test_profile_general.py
WORKING_DIRECTORY ${PROJECT_SOURCE_DIR}
)
Expand Down Expand Up @@ -335,8 +344,13 @@ set_tests_properties(
test_profile_sort
test_profile_misc
test_profile_path
test_profile_roofline
PROPERTIES LABELS "profile" RESOURCE_GROUPS gpus:1
test_profile_roofline_1
test_profile_roofline_2
test_profile_section
test_profile_pc_sampling
test_profile_sets_func
test_profile_live_attach_detach
PROPERTIES LABELS "profile" RESOURCE_GROUPS gpus:1 TIMEOUT 1800
)

# ---------------------------
Expand Down Expand Up @@ -437,7 +451,8 @@ if(${ENABLE_COVERAGE})
test_profile_sort
test_profile_misc
test_profile_path
test_profile_roofline
test_profile_roofline_1
test_profile_roofline_2
test_profile_section
test_profile_sets_func
test_analyze_commands
Expand Down
2 changes: 1 addition & 1 deletion projects/rocprofiler-compute/VERSION
Original file line number Diff line number Diff line change
@@ -1 +1 @@
3.3.1
3.4.0
Loading
Loading