Support Iluvatar CoreX #8585

Open · honglyua-il wants to merge 2 commits into master from iluvatar_support

Conversation

@honglyua-il commented on Jun 19, 2025

Closes #8584.

This PR was validated on Iluvatar CoreX GPUs. The Iluvatar CoreX Toolkit must be installed first; then run:

# Install dependencies
pip install -r requirements.txt
# run
python3 main.py --disable-cuda-malloc

We used the sd_xl_base_1.0 model; the default workflow's results are shown below:

root@848fa421ea4c:~/ComfyUI# python3 main.py --disable-cuda-malloc --listen 0.0.0.0
Checkpoint files will always be loaded safely.
Total VRAM 32716 MB, total RAM 515630 MB
pytorch version: 2.4.1
/usr/local/corex-4.2.0/lib64/python3/dist-packages/xformers/ops/swiglu_op.py:107: FutureWarning: `torch.cuda.amp.custom_fwd(args...)` is deprecated. Please use `torch.amp.custom_fwd(args..., device_type='cuda')` instead.
  def forward(cls, ctx, x, w1, b1, w2, b2, w3, b3):
/usr/local/corex-4.2.0/lib64/python3/dist-packages/xformers/ops/swiglu_op.py:128: FutureWarning: `torch.cuda.amp.custom_bwd(args...)` is deprecated. Please use `torch.amp.custom_bwd(args..., device_type='cuda')` instead.
  def backward(cls, ctx, dx5):
xformers version: 0.0.26.post1
Set vram state to: NORMAL_VRAM
Device: cuda:0 Iluvatar BI-V150 : native
Using pytorch attention
Python version: 3.10.12 (main, Nov 29 2024, 18:13:52) [GCC 9.4.0]
ComfyUI version: 0.3.41
ComfyUI frontend version: 1.22.2
[Prompt Server] web root: /usr/local/lib/python3.10/site-packages/comfyui_frontend_package/static
/usr/local/corex-4.2.0/lib64/python3/dist-packages/flash_attn/ops/fused_dense.py:30: FutureWarning: `torch.cuda.amp.custom_fwd(args...)` is deprecated. Please use `torch.amp.custom_fwd(args..., device_type='cuda')` instead.
  def forward(
/usr/local/corex-4.2.0/lib64/python3/dist-packages/flash_attn/ops/fused_dense.py:71: FutureWarning: `torch.cuda.amp.custom_bwd(args...)` is deprecated. Please use `torch.amp.custom_bwd(args..., device_type='cuda')` instead.
  def backward(ctx, grad_output, *args):

Import times for custom nodes:
   0.0 seconds: /root/ComfyUI/custom_nodes/websocket_image_save.py

Context impl SQLiteImpl.
Will assume non-transactional DDL.
/usr/local/lib/python3.10/site-packages/alembic/config.py:564: DeprecationWarning: No path_separator found in configuration; falling back to legacy splitting on spaces, commas, and colons for prepend_sys_path.  Consider adding path_separator=os to Alembic config.
  util.warn_deprecated(
No target revision found.
/usr/local/corex-4.2.0/lib64/python3/dist-packages/aiohttp/web_urldispatcher.py:202: DeprecationWarning: Bare functions are deprecated, use async ones
  warnings.warn(
Starting server

To see the GUI go to: http://0.0.0.0:8188
got prompt
model weight dtype torch.float16, manual cast: None
model_type EPS
Using pytorch attention in VAE
Using pytorch attention in VAE
VAE load device: cuda:0, offload device: cpu, dtype: torch.bfloat16
CLIP/text encoder model load device: cuda:0, offload device: cpu, current: cpu, dtype: torch.float16
Requested to load SDXLClipModel
loaded completely 31430.74140625 1560.802734375 True
/root/ComfyUI/comfy/ldm/modules/attention.py:451: UserWarning: Optional attn_mask_ param is not recommended to use. For better performance,1.Assuming causal attention masking, 'is_causal' parameter can be selected.2.Assuming alibi attention masking, 'PT_SDPA_USE_ALIBI_MASK' env can be selected. (Triggered internally at /home/corex/sw_home/apps/pytorch/aten/src/ATen/native/transformers/cuda/flash_attn/flash_api.cpp:1769.)
  out = torch.nn.functional.scaled_dot_product_attention(q, k, v, attn_mask=mask, dropout_p=0.0, is_causal=False)
Requested to load SDXL
loaded completely 29856.74970703125 4897.0483474731445 True
100%|███████████████████████████████████████████| 20/20 [00:02<00:00,  7.18it/s]
Requested to load AutoencoderKL
loaded completely 24619.5859375 159.55708122253418 True
Prompt executed in 5.58 seconds

[image: output of the default SDXL workflow]

@honglyua-il force-pushed the iluvatar_support branch 2 times, most recently from 46d9466 to da50a8e on June 23, 2025 05:57
@honglyua-il force-pushed the iluvatar_support branch 4 times, most recently from fa8a063 to c795d23 on June 30, 2025 01:56
@honglyua-il (Author) commented on Jun 30, 2025

Hello @comfyanonymous, is this PR still under review? Let me know if there is anything else I need to do.

Based on community feedback, many users want to run ComfyUI on Iluvatar CoreX GPUs. I submitted this PR and hope it can be merged as soon as possible. Thanks!

I have rebased onto the latest master and tested it again; the results show it works well. The test logs and images are shown below.

If you would rather we not modify cuda_malloc.py to adapt it to Iluvatar CoreX GPUs, we can also revert c795d23 and launch ComfyUI by running python main.py --disable-cuda-malloc.

PTAL, Thanks!

root@666c5f0762e9:~/ComfyUI# python3 main.py --listen 0.0.0.0
Checkpoint files will always be loaded safely.
Total VRAM 32716 MB, total RAM 515630 MB
pytorch version: 2.4.1
Set vram state to: NORMAL_VRAM
Device: cuda:0 Iluvatar BI-V150 : native
Using pytorch attention
Python version: 3.10.18 (main, Jun 11 2025, 16:28:51) [GCC 9.4.0]
ComfyUI version: 0.3.43
ComfyUI frontend version: 1.23.4
[Prompt Server] web root: /usr/local/lib/python3.10/site-packages/comfyui_frontend_package/static

Import times for custom nodes:
   0.0 seconds: /root/ComfyUI/custom_nodes/websocket_image_save.py

Context impl SQLiteImpl.
Will assume non-transactional DDL.
/usr/local/lib/python3.10/site-packages/alembic/config.py:577: DeprecationWarning: No path_separator found in configuration; falling back to legacy splitting on spaces, commas, and colons for prepend_sys_path.  Consider adding path_separator=os to Alembic config.
  util.warn_deprecated(
No target revision found.
/usr/local/lib/python3.10/site-packages/aiohttp/web_urldispatcher.py:204: DeprecationWarning: Bare functions are deprecated, use async ones
  warnings.warn(
Starting server

To see the GUI go to: http://0.0.0.0:8188
got prompt
model weight dtype torch.float16, manual cast: None
model_type EPS
Using pytorch attention in VAE
Using pytorch attention in VAE
VAE load device: cuda:0, offload device: cpu, dtype: torch.bfloat16
CLIP/text encoder model load device: cuda:0, offload device: cpu, current: cpu, dtype: torch.float16
Requested to load SDXLClipModel
loaded completely 31476.7765625 1560.802734375 True
/root/ComfyUI/comfy/ldm/modules/attention.py:451: UserWarning: Optional attn_mask_ param is not recommended to use. For better performance,1.Assuming causal attention masking, 'is_causal' parameter can be selected.2.Assuming alibi attention masking, 'PT_SDPA_USE_ALIBI_MASK' env can be selected. (Triggered internally at /home/corex/sw_home/apps/pytorch/aten/src/ATen/native/transformers/cuda/flash_attn/flash_api.cpp:1599.)
  out = torch.nn.functional.scaled_dot_product_attention(q, k, v, attn_mask=mask, dropout_p=0.0, is_causal=False)
Requested to load SDXL
loaded completely 29902.75751953125 4897.0483474731445 True
100%|███████████████████████████████████████████| 20/20 [00:02<00:00,  6.79it/s]
Requested to load AutoencoderKL
loaded completely 24661.50390625 159.55708122253418 True
Prompt executed in 5.98 seconds

[image: output of the default SDXL workflow]

cuda_malloc.py Outdated
@@ -50,7 +50,33 @@ def enum_display_devices():
"GeForce GTX 1650", "GeForce GTX 1630", "Tesla M4", "Tesla M6", "Tesla M10", "Tesla M40", "Tesla M60"
}

def _load_torch_submodule(filename):
"""Helper to load and check a submodule from torch's installation"""
@comfyanonymous (Owner) commented:
Instead of doing this can't you just check if the computer has an iluvatar device?

@honglyua-il (Author) replied:

Hi @comfyanonymous, thank you for your time and support.

We’ve looked into the issue and would like to propose two possible approaches to address it:

  1. Modify cuda_malloc.py to detect Iluvatar GPU names by using subprocess.check_output(['ixsmi', '-L']), following a pattern similar to how NVIDIA GPUs are currently detected (see the sketch after this list).
  2. As an alternative, we could leave cuda_malloc.py unchanged and instead update the README to include instructions for launching ComfyUI with the command: python main.py --disable-cuda-malloc.
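
For concreteness, a minimal sketch of option 1 follows. This is only an illustration of the idea, not code from this PR: it assumes the ixsmi CLI from the CoreX Toolkit is on PATH and that ixsmi -L prints one line per device containing the product name (e.g. "Iluvatar BI-V150"), analogous to nvidia-smi -L.

import subprocess

def get_iluvatar_gpu_names():
    """Return the device lines reported by `ixsmi -L`, or an empty list when the
    tool is missing or fails (i.e. no Iluvatar GPU is usable on this machine)."""
    try:
        out = subprocess.check_output(["ixsmi", "-L"], text=True)
    except (OSError, subprocess.CalledProcessError):
        return []
    return [line.strip() for line in out.splitlines() if line.strip()]

def has_iluvatar_device():
    # Assumed line format: "GPU 0: Iluvatar BI-V150 (UUID: ...)"
    return any("Iluvatar" in name for name in get_iluvatar_gpu_names())

cuda_malloc.py could then fall back to the default allocator whenever has_iluvatar_device() returns True, similar in spirit to how the set of GPU names shown in the hunk above is used.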

Both options seem viable, but we’d really appreciate your thoughts on which direction would be more suitable for the project.

Thanks again for your guidance.

@honglyua-il force-pushed the iluvatar_support branch 2 times, most recently from 4b6d9a5 to dda8034 on July 11, 2025 03:00
@honglyua-il (Author) commented:
Hello @comfyanonymous, we have updated the README to include instructions for launching ComfyUI with the command python main.py --disable-cuda-malloc. PTAL, thanks!

@honglyua-il force-pushed the iluvatar_support branch 2 times, most recently from 46ebfd4 to b0ac606 on July 18, 2025 02:42
@honglyua-il (Author) commented:
@comfyanonymous @ltdrdata PTAL.

@honglyua-il (Author) commented:
@comfyanonymous we have rebased onto the new master, and thanks to #9031 we no longer need to add --disable-cuda-malloc to start ComfyUI.

PTAL, thanks!

Successfully merging this pull request may close these issues:
[Feature Request] Request for Iluvatar Corex GPU support (#8584)