Motivation: For development use cases, we would like to bill for GPU usage based on percent utilization.
There's a DCGM query that returns GPU utilization for every GPU in the cluster, along with metadata that correlates each GPU with a pod. I need to prototype something that shows percent utilization per namespace.
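
As a starting point, here is a minimal sketch of the aggregation step. It assumes the query result has the shape of a Prometheus instant vector (a list of samples, each with a `metric` label map and a `[timestamp, value]` pair) and that the pod metadata includes a `namespace` label. Both are assumptions about the setup, and the function name `namespace_utilization` and the sample data are made up for illustration.

```python
from collections import defaultdict


def namespace_utilization(samples, namespace_label="namespace"):
    """Return the mean GPU utilization (percent) per namespace.

    `samples` is assumed to look like the `result` list of a Prometheus
    instant query: [{"metric": {...labels...}, "value": [ts, "42"]}, ...].
    The `namespace` label name is an assumption about the DCGM metadata.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for sample in samples:
        ns = sample["metric"].get(namespace_label)
        if not ns:
            # GPU is not mapped to a pod, so there is nothing to bill.
            continue
        _timestamp, value = sample["value"]
        totals[ns] += float(value)  # values arrive as strings
        counts[ns] += 1
    return {ns: totals[ns] / counts[ns] for ns in totals}


# Hypothetical samples: two GPUs in team-a, one in team-b, one idle GPU.
example = [
    {"metric": {"gpu": "0", "namespace": "team-a", "pod": "train-0"}, "value": [1700000000, "80"]},
    {"metric": {"gpu": "1", "namespace": "team-a", "pod": "train-1"}, "value": [1700000000, "40"]},
    {"metric": {"gpu": "2", "namespace": "team-b", "pod": "notebook"}, "value": [1700000000, "10"]},
    {"metric": {"gpu": "3"}, "value": [1700000000, "0"]},
]

for ns, pct in sorted(namespace_utilization(example).items()):
    print(f"{ns}: {pct:.1f}%")
```

This averages a single point-in-time reading across the GPUs attached to each namespace. For billing, the same grouping would likely need to be applied over a time window rather than to one sample.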