Setting CUDA_MPS_PINNED_DEVICE_MEM_LIMIT before starting the MPS control daemon on the host helps enforce the memory limits of the MPS partition. Should CUDA_MPS_PINNED_DEVICE_MEM_LIMIT also be set in the MPS control daemon template?
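
For reference, a minimal sketch of the host-side setup I have in mind. The device index (0) and the 1G limit are placeholder values chosen for illustration, not values from the template, and the PATH check is only there so the snippet is harmless on a machine without MPS installed:

```shell
# Assumed values: limit device 0 to 1G of pinned device memory.
# The variable takes a comma-separated list of <device>=<limit> pairs.
export CUDA_MPS_PINNED_DEVICE_MEM_LIMIT="0=1G"

# Start the control daemon in the background so it picks up the limit
# as the default for clients; skip this step if the binary is not on PATH.
if command -v nvidia-cuda-mps-control >/dev/null 2>&1; then
    nvidia-cuda-mps-control -d
fi
```

As far as I understand, the value has to be exported before the daemon starts to act as the default for all clients, which is why I am asking about the template.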
/cc @klueska