Skip to content

Conversation

@xiongzubiao
Copy link

Closes #1015

@copy-pr-bot
Copy link

copy-pr-bot bot commented Jan 9, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@xiongzubiao xiongzubiao force-pushed the uuid branch 2 times, most recently from eb8fdec to 2ad1041 Compare January 10, 2025 01:49
@elezar
Copy link
Member

elezar commented Jan 10, 2025

@xiongzubiao could you please provide information on how these labels will be used?

@xiongzubiao
Copy link
Author

xiongzubiao commented Jan 10, 2025

@xiongzubiao could you please provide information on how these labels will be used?

@elezar, we want to provide some sort of visualization to user. User can click each GPU to check its properties, status, and metrics. The device UUID is the natural choice for indexing. There are other ways to get UUID, but it is most straightforward to get it from node labels, because it is a part of node properties.

There is another use case mentioned in #1015: scheduling pod to a specific GPU using node label matching.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add gpu uuids to node labels

3 participants