unable to import fbgemm_gpu #4754

@vkuzo

Description

Hi folks! When I install fbgemm-gpu-genai from pip, I am unable to import the library. Neither the stable nor the nightly version works for me. Repro:

(pytorch) [[email protected] ~/local/ao (20250821_float8_tensor_fix)]$ with-proxy pip install --pre fbgemm-gpu-genai --index-url https://download.pytorch.org/whl/nightly/cu128/
Looking in indexes: https://download.pytorch.org/whl/nightly/cu128/
Collecting fbgemm-gpu-genai
  Using cached https://download.pytorch.org/whl/nightly/cu128/fbgemm_gpu_genai-2025.8.20%2Bcu128-cp311-cp311-manylinux_2_28_x86_64.whl.metadata (2.7 kB)
Requirement already satisfied: numpy in /home/vasiliy/.conda/envs/pytorch/lib/python3.11/site-packages (from fbgemm-gpu-genai) (2.2.3)
Using cached https://download.pytorch.org/whl/nightly/cu128/fbgemm_gpu_genai-2025.8.20%2Bcu128-cp311-cp311-manylinux_2_28_x86_64.whl (32.2 MB)
Installing collected packages: fbgemm-gpu-genai
Successfully installed fbgemm-gpu-genai-2025.8.20+cu128
(pytorch) [[email protected] ~/local/ao (20250821_float8_tensor_fix)]$ python
Python 3.11.0 (main, Mar  1 2023, 18:26:19) [GCC 11.2.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import fbgemm_gpu
ERROR:root:Could not load the library 'experimental/gen_ai/fbgemm_gpu_experimental_gen_ai.so'!


Could not load this library: /home/vasiliy/.conda/envs/pytorch/lib/python3.11/site-packages/fbgemm_gpu/experimental/gen_ai/fbgemm_gpu_experimental_gen_ai.so



Traceback (most recent call last):
  File "/data/users/vasiliy/pytorch/torch/_ops.py", line 1487, in load_library
    ctypes.CDLL(path)
  File "/home/vasiliy/.conda/envs/pytorch/lib/python3.11/ctypes/__init__.py", line 376, in __init__
    self._handle = _dlopen(self._name, mode)
                   ^^^^^^^^^^^^^^^^^^^^^^^^^
OSError: /home/vasiliy/.conda/envs/pytorch/lib/python3.11/site-packages/fbgemm_gpu/experimental/gen_ai/fbgemm_gpu_experimental_gen_ai.so: undefined symbol: _ZNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEE15_M_replace_coldEPcmPKcmm

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/vasiliy/.conda/envs/pytorch/lib/python3.11/site-packages/fbgemm_gpu/__init__.py", line 90, in <module>
    _load_library(f"{library}.so", __variant__ == "docs")
  File "/home/vasiliy/.conda/envs/pytorch/lib/python3.11/site-packages/fbgemm_gpu/__init__.py", line 22, in _load_library
    raise error
  File "/home/vasiliy/.conda/envs/pytorch/lib/python3.11/site-packages/fbgemm_gpu/__init__.py", line 17, in _load_library
    torch.ops.load_library(os.path.join(os.path.dirname(__file__), filename))
  File "/data/users/vasiliy/pytorch/torch/_ops.py", line 1489, in load_library
    raise OSError(f"Could not load this library: {path}") from e
OSError: Could not load this library: /home/vasiliy/.conda/envs/pytorch/lib/python3.11/site-packages/fbgemm_gpu/experimental/gen_ai/fbgemm_gpu_experimental_gen_ai.so
>>> 

I am on an H100 with PyTorch built from source.
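
The undefined symbol in the first traceback demangles to `std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char>>::_M_replace_cold(char*, unsigned long, char const*, unsigned long, unsigned long)`, which is a libstdc++ internal. One plausible cause, which this report does not confirm, is that the libstdc++ loaded into the Python process is older than the one the wheel was built against. A minimal sketch for checking a given libstdc++ file is below. It is a heuristic byte scan rather than a real ELF parser, and the function names (`glibcxx_versions`, `mentions_symbol`, `inspect_libstdcxx`) are hypothetical helpers, not part of fbgemm_gpu or PyTorch.

```python
import re
from pathlib import Path

# Mangled name copied verbatim from the OSError in the traceback above.
MISSING_SYMBOL = (
    "_ZNSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEE"
    "15_M_replace_coldEPcmPKcmm"
)


def glibcxx_versions(blob: bytes) -> list:
    """Return the sorted GLIBCXX_x.y[.z] version tags found in raw library bytes."""
    found = {
        tuple(int(part) for part in match.group(1).split(b"."))
        for match in re.finditer(rb"GLIBCXX_(\d+(?:\.\d+)*)", blob)
    }
    return sorted(found)


def mentions_symbol(blob: bytes, symbol: str = MISSING_SYMBOL) -> bool:
    """Heuristic: True if the mangled name appears anywhere in the bytes."""
    return symbol.encode("ascii") in blob


def inspect_libstdcxx(path: str) -> dict:
    """Summarise one libstdc++ file; `path` must be supplied by the caller."""
    blob = Path(path).read_bytes()
    versions = glibcxx_versions(blob)
    return {
        "newest_glibcxx": versions[-1] if versions else None,
        "has_symbol": mentions_symbol(blob),
    }
```

Calling `inspect_libstdcxx` on the `libstdc++.so.6` that the interpreter actually loads (for a conda environment this is often under the environment's `lib/` directory, but that location is an assumption here) shows whether the file mentions the symbol at all. If `has_symbol` is `False`, the mismatch hypothesis becomes more likely; if it is `True`, the cause probably lies elsewhere.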
