Commit d5fbf96

Merge pull request Mellanox#281 from almaslennikov/dgx-nic-op-feature-flag
fix: add a note on DGX's FW_RESET_AFTER_CONFIG_UPDATE env var feature…
2 parents 21c8988 + b5989c7 commit d5fbf96

1 file changed: +8 -0 lines changed

docs/nic-conf-operator/nic-fw-configuration.rst

Lines changed: 8 additions & 0 deletions
@@ -83,6 +83,9 @@ Install the NIC Configuration Operator and observe NIC devices in the cluster
 After deploying the NFS server and NFS CSI driver, the `storage class <https://github.com/kubernetes-csi/csi-driver-nfs/blob/master/deploy/example/storageclass-nfs.yaml>`_ should become available in the cluster. The name of the storage class should then be passed when configuring the NIC Configuration Operator.
 To disable the Firmware upgrade and validation logic, do not define the ``nicFirmwareStorage`` section in the NicClusterPolicy CR.

+.. note::
+On some DGX servers, the configuration update is not successfully applied after the warm reboot. In this case, it is recommended to explicitly reset the NIC's Firmware before the reboot and after updating its non-volatile configuration. This can be achieved by specifying the ``FW_RESET_AFTER_CONFIG_UPDATE`` environment variable in the NicClusterPolicy CR. Please see the commented section in the example below.
+
 First install the Network Operator helm chart with the Maintenance Operator enabled and deploy a NIC Cluster Policy CRD with NIC Configuration Operator and DOCA-OFED Driver enabled:

 ``values.yaml``:
@@ -111,6 +114,11 @@ First install the Network Operator helm chart with the Maintenance Operator enab
 image: nic-configuration-operator-daemon
 repository: |nic-configuration-operator-repository|
 version: |nic-configuration-operator-version|
+# Uncomment to explicitly reset the NIC's Firmware before the reboot and after updating its non-volatile configuration.
+# Might be required on DGX servers where configuration update is not successfully applied after the warm reboot.
+# env:
+# - name: "FW_RESET_AFTER_CONFIG_UPDATE"
+# value: "true"
 nicFirmwareStorage:
 create: true
 pvcName: nic-fw-storage-pvc
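
For reference, the commented block added in this hunk might look as follows once uncommented. This is only a sketch: the diff view above does not preserve indentation, so the nesting shown here and the ``configurationDaemon`` parent key are assumptions rather than something taken from the commit. Only the ``image``, ``env``, ``name`` and ``value`` entries come from the diff itself.

```yaml
# Sketch only. The parent key name and the indentation are assumptions;
# check them against the actual values.yaml in the documentation.
configurationDaemon:
  image: nic-configuration-operator-daemon
  # repository and version stay as they are in the original values.yaml
  env:
    # Requests an explicit NIC firmware reset after the non-volatile
    # configuration update and before the reboot (per the note in this commit).
    - name: "FW_RESET_AFTER_CONFIG_UPDATE"
      value: "true"
```

The value is quoted because environment variable values are passed as strings; an unquoted ``true`` would be parsed by YAML as a boolean.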
