-
Notifications
You must be signed in to change notification settings - Fork 899
Open
Labels
kind/featureCategorizes issue or PR as related to a new feature.Categorizes issue or PR as related to a new feature.lifecycle/rottenDenotes an issue or PR that has aged beyond stale and will be auto-closed.Denotes an issue or PR that has aged beyond stale and will be auto-closed.priority/backlogHigher priority than priority/awaiting-more-evidence.Higher priority than priority/awaiting-more-evidence.sig/k8s-infraCategorizes an issue or PR as relevant to SIG K8s Infra.Categorizes an issue or PR as relevant to SIG K8s Infra.sig/nodeCategorizes an issue or PR as relevant to SIG Node.Categorizes an issue or PR as relevant to SIG Node.
Description
sig-node owns a set of features related to exposing and using hardware details which require some hardware features to exercise the code. Examples are exclusive CPU allocation (cpumanager), device allocation (device manager), NUMA alignment (topology manager), NUMA alignment considering distances between NUMA zones (topology manager).
Note: some requirement overlap. Easy example: a powerful high end (at time of writing) server CPU can have at the same time multi core count, exposing multiple NUMA nodes, and have split L3, satisfying in one go all cpumanager requirements
Hardware requirements, driven by feature, rationale
- cpumanager (GA): x86_64, arm: machine with at least 4 cores exposed, better like 16 or more. We need more cores to be able to run different set of tests in different scenarios. 4 cores is the minimum to run basic test about the feature. We already got machines with 4 cores
- topology manager (GA): x86_64, arm: machine with 2 or more NUMA nodes. We need to align on NUMA nodes, so we need 2 or more nodes to begin with
- topology manager (KEP Improved multi-numa alignment in Topology Manager enhancements#3545): x86_64, arm: machine with 4 or more NUMA nodes. We need to consider NUMA distances in allocation, so we need 4 or more nodes with different distances between each other
- topology manager (KEP KEP-4622: Add a TopologyManager policy option for MaxAllowableNUMANodes enhancements#4622): arm only?: machine with 9 or more NUMA nodes (!) like grace gpus
- cpumanager (KEP KEP-4800: Split UnCoreCache awareness enhancements#4810): x86_64, arm: a machine with split (non-uniform) L3 cache, like epyc cpus
- cpumanager (KEP Add CPUManager policy option to align CPUs by Socket instead of by NUMA node enhancements#3327): x86_64, arm: a machine with multiple CPU sockets, in order to exercise alignment by socket.
- devicemanager (GA): x86_64, arm: hardware devices controlled by device plugins. Most common usecases are SRIOV cards and GPUs. One device is the bare minimum, we would like 2 or more to be able to use them also in the topology manager tests as deciding factor (see topology manager in this list)
- memorymanager (Beta, GA graduating): x86_64, arm: totally overlaps with topology manager requirements, listed here for the sake of completeness.
this list will be updated after more review of the ongoing sig-node features
PiotrProkop and esotsal
Metadata
Metadata
Assignees
Labels
kind/featureCategorizes issue or PR as related to a new feature.Categorizes issue or PR as related to a new feature.lifecycle/rottenDenotes an issue or PR that has aged beyond stale and will be auto-closed.Denotes an issue or PR that has aged beyond stale and will be auto-closed.priority/backlogHigher priority than priority/awaiting-more-evidence.Higher priority than priority/awaiting-more-evidence.sig/k8s-infraCategorizes an issue or PR as relevant to SIG K8s Infra.Categorizes an issue or PR as relevant to SIG K8s Infra.sig/nodeCategorizes an issue or PR as relevant to SIG Node.Categorizes an issue or PR as relevant to SIG Node.
Type
Projects
Status
Backlog