Skip to content

Conversation

@elezar
Copy link
Member

@elezar elezar commented Sep 26, 2025

This change bumps the github.com/NVIDIA/go-nvlib dependency from v0.8.0 to v0.8.1 to ensures that nvidia.com/gpu.coherent CDI specs are only generated for devices with an addressing mode of ATS.

It also disables the functionality for splitting generated CDI specifications based on device coherence by default. This was added in #1247, but due to discussions around whether coherence is an property that should be exposed, we are disabling this by default.

Note that users can opt in to the feature by running the nvidia-ctk cdi generate with the --feature-flag=enable-coherent-annotations command line flag. Alternatively the nvidia-ctk cdi generate command can be run with the NVIDIA_CTK_CDI_GENERATE_FEATURE_FLAGS enviroment set to include "enable-coherent-annotations" (in a comma-separated list).

The same feature flag can be passed to the nvcdi API when constructing a CDI spec generator.

@elezar elezar changed the title Bump GitHub.com/nvidia/go nvlib 0.8.1 Restrict nvidia.com/gpu.coherent devices to ATS addressing mode Sep 26, 2025
This change disables the functionality for splitting
generated CDI specifications based on device coherence by default.
This was added in NVIDIA#1247, but due to discussions around whether
coherence is an property that should be exposed, we are disabling
this by default.

Note that users can opt in to the feature by running the
`nvidia-ctk cdi generate` with the `--feature-flag=enable-coherent-annotations`
command line flag. Alternatively the `nvidia-ctk cdi generate` command can
be run with the `NVIDIA_CTK_CDI_GENERATE_FEATURE_FLAGS` enviroment set to
include "enable-coherent-annotations" (in a comma-separated list).

Signed-off-by: Evan Lezar <[email protected]>
@elezar elezar changed the title Restrict nvidia.com/gpu.coherent devices to ATS addressing mode Disable generation of nvidia.com/gpu.[non]coherent CDI specs by default Sep 26, 2025
@elezar elezar added this to the v1.18.0 milestone Sep 26, 2025
Copy link
Contributor

@klueska klueska left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What was the primary motivation for the change in semantics?

@elezar
Copy link
Member Author

elezar commented Sep 26, 2025

It also disables the functionality for splitting generated CDI specifications based on device coherence by default. This was added in #1247, but due to discussions around whether coherence is an property that should be exposed, we are disabling this by default.

As noted in the description:

due to discussions around whether coherence is an property that should be exposed, we are disabling this by default.

With regards to what constitutes coherence, it was called out after the initial implementation that HMM addressing mode is not actually coherent. This was initially included due to an incorrect requirement in the original request.

@elezar elezar merged commit f5b4c15 into NVIDIA:main Sep 26, 2025
16 checks passed
@elezar elezar deleted the bump-github.com/NVIDIA/go-nvlib-0.8.1 branch September 26, 2025 10:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants