mstflint v4.33 unable to query a device in kernel lockdown #1371

@markhaywood

Querying an mlx5_0 device while the system is in kernel lockdown fails with mstflint v4.33:

$ ibdev2netdev -v
0000:af:00.0 bnxt_re0 (MT5847 - Broadcom ) Oracle Dual Port 25Gb Ethernet Adapter fw 214.2.255.0 port 1 (DOWN ) ==> ens1f0np0 (Down)
0000:af:00.1 bnxt_re1 (MT5847 - Broadcom ) Oracle Dual Port 25Gb Ethernet Adapter fw 214.2.255.0 port 1 (DOWN ) ==> ens1f1np1 (Down)
0000:3b:00.0 mlx5_0 (MT4121 - 7359059 ) CX556A - ConnectX-5 QSFP28 -Aux[1] fw 16.35.4506 port 1 (ACTIVE) ==> ens2f0np0 (Up)
0000:3b:00.1 mlx5_1 (MT4121 - 7359059 ) CX556A - ConnectX-5 QSFP28 -Aux[1] fw 16.35.4506 port 1 (ACTIVE) ==> ens2f1np1 (Up)

$ mstconfig -d mlx5_0 query
-E- Failed to open the device

$ mstconfig -d 3b:00.0 query
-E- Failed to identify the device

The same queries work fine with mstflint 4.32:

$ mstconfig -d mlx5_0 query

Device #1:

Device type: ConnectX5
Name: 7359059_OCI_SD_Ax_Bx
Description: ConnectX-5 Ex VPI adapter card; EDR IB (100Gb/s) and 100GbE; dual-port QSFP28; PCIe4.0 x16; tall bracket; ROHS R6
Device: mlx5_0

Configurations: Next Boot
FLEX_PARSER_PROFILE_ENABLE 0

  ...

    SAFE_MODE_ENABLE                            True(1) 

$ mstconfig -d 3b:00.0 query

Device #1:

Device type: ConnectX5
Name: 7359059_OCI_SD_Ax_Bx
Description: ConnectX-5 Ex VPI adapter card; EDR IB (100Gb/s) and 100GbE; dual-port QSFP28; PCIe4.0 x16; tall bracket; ROHS R6
Device: mlx5_0

Configurations: Next Boot
FLEX_PARSER_PROFILE_ENABLE 0

    ...

    SAFE_MODE_ENABLE                            True(1) 
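
For anyone reproducing this, the lockdown state can be confirmed from securityfs before running the queries. The following is a minimal sketch, assuming the kernel exposes `/sys/kernel/security/lockdown` in its usual format, where the active mode is the one in square brackets (for example `none [integrity] confidentiality`). The function name `active_lockdown_mode` is only illustrative and is not part of mstflint.

```shell
# Illustrative helper (not part of mstflint): print the active lockdown
# mode from a line in the format used by /sys/kernel/security/lockdown,
# e.g. "none [integrity] confidentiality". The active mode is the one
# enclosed in square brackets.
active_lockdown_mode() {
    printf '%s\n' "$1" | sed -n 's/.*\[\([a-z]*\)\].*/\1/p'
}

# Read the real file only if it is present and readable; it is absent
# on kernels built without the lockdown LSM.
lockdown_file=/sys/kernel/security/lockdown
if [ -r "$lockdown_file" ]; then
    active_lockdown_mode "$(cat "$lockdown_file")"
fi
```

A result of `integrity` or `confidentiality` means lockdown is active; `none` means it is not.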

After some debugging, it appears to me that this was broken by patches to mtcr_parse_name() in mtcr_ul/mtcr_ul_com.c, such as commit 3b1d749.
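
The two `-d` arguments used above are different name forms (an RDMA device name and a PCI address), and both fail on 4.33. When checking which form a given parsing change affects, a sketch like the one below may help tell them apart and map one to the other. It is a hypothetical helper for testing, not mstflint's actual parsing logic: the function names and the patterns are assumptions. The only system interface it relies on is the `/sys/class/infiniband/<name>/device` symlink.

```shell
# Hypothetical helpers for testing; NOT how mtcr_parse_name() works.

# Classify a device argument as a PCI address ([domain:]bus:dev.fn),
# an mlx-style RDMA device name, or unknown.
classify_device_arg() {
    if printf '%s\n' "$1" | grep -Eq '^([0-9a-fA-F]{4}:)?[0-9a-fA-F]{2}:[0-9a-fA-F]{2}\.[0-7]$'; then
        echo pci-bdf
    elif printf '%s\n' "$1" | grep -Eq '^mlx[0-9]+_[0-9]+$'; then
        echo rdma-name
    else
        echo unknown
    fi
}

# Resolve an RDMA device name to its PCI address through sysfs.
# Returns non-zero if the device is not present on this system.
rdma_to_pci() {
    link="/sys/class/infiniband/$1/device"
    if [ -e "$link" ]; then
        basename "$(readlink -f "$link")"
    else
        return 1
    fi
}
```

For example, `classify_device_arg mlx5_0` and `classify_device_arg 3b:00.0` distinguish the two forms from the report, and `rdma_to_pci mlx5_0` gives the PCI address to retry the query with.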
