Migrating Existing CUDA Projects to torchada

This guide shows how to migrate an existing PyTorch CUDA project to use torchada, making it compatible with multiple GPU platforms (CUDA and MUSA).

Quick Migration

Step 1: Install torchada

pip install torchada

Step 2: Add One Import

Before:

import torch
from torch.utils.cpp_extension import CUDAExtension, BuildExtension, CUDA_HOME

After:

import torchada  # Just add this one line!
import torch
from torch.utils.cpp_extension import CUDAExtension, BuildExtension, CUDA_HOME

That's it! No other code changes needed. Your existing torch.cuda.* code and torch.utils.cpp_extension imports work on all supported platforms.

Important Note on GPU Detection

torch.cuda.is_available() returns False on the MUSA platform by design, so that downstream projects can reliably detect which platform they are running on. Define a helper like this instead:

import torch

def has_gpu():
    """Check if any GPU (CUDA or MUSA) is available."""
    if torch.cuda.is_available():
        return True
    if hasattr(torch, "musa") and torch.musa.is_available():
        return True
    return False

For platform detection (choosing platform-specific code paths), use torch.version:

def _is_cuda():
    return torch.version.cuda is not None

def _is_musa():
    return hasattr(torch.version, 'musa') and torch.version.musa is not None
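
These helpers compose naturally into a single dispatch point. current_platform() below is a hypothetical wrapper built on them, not part of torchada:

def current_platform():
    """Return a short platform tag ("cuda", "musa", or "cpu") for dispatch."""
    if _is_cuda():
        return "cuda"
    if _is_musa():
        return "musa"
    return "cpu"

print(f"Running on: {current_platform()}")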

Detailed Migration Examples

Example 1: Basic GPU Usage

Before:

import torch

if torch.cuda.is_available():
    device = torch.device("cuda")
    tensor = torch.randn(100, 100).cuda()
    model = MyModel().cuda()

After:

import torchada  # Add this line at the top
import torch

# Use has_gpu() from above instead of torch.cuda.is_available()
if has_gpu():
    device = torch.device("cuda")  # Works on MUSA too!
    tensor = torch.randn(100, 100).cuda()  # Moves to MUSA on MUSA platform
    model = MyModel().cuda()

Example 2: torch.cuda APIs

All standard torch.cuda APIs work after importing torchada:

import torchada  # noqa: F401 - Import first
import torch

if has_gpu():
    torch.cuda.set_device(0)
    print(f"Using: {torch.cuda.get_device_name()}")
    print(f"Memory: {torch.cuda.memory_allocated() / 1024**2:.2f} MB")
    torch.cuda.synchronize()

Example 3: Mixed Precision Training

Before:

from torch.cuda.amp import autocast, GradScaler

After:

import torchada  # Add this line
from torch.cuda.amp import autocast, GradScaler  # Same import works!

Or use the newer API:

import torchada
import torch

with torch.amp.autocast(device_type='cuda'):
    output = model(inputs)
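
Either form composes with GradScaler in the usual way. The loop below is a minimal sketch reusing the torch.cuda.amp imports shown above; model, optimizer, loss_fn, and loader are assumed to be defined elsewhere:

scaler = GradScaler()

for inputs, targets in loader:
    optimizer.zero_grad()
    with autocast():                       # forward pass in mixed precision
        loss = loss_fn(model(inputs), targets)
    scaler.scale(loss).backward()          # scale loss to avoid fp16 gradient underflow
    scaler.step(optimizer)                 # unscale gradients, then step the optimizer
    scaler.update()                        # adjust the scale factor for the next iteration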

Example 4: Building Extensions (setup.py)

Before:

from setuptools import setup
from torch.utils.cpp_extension import CUDAExtension, BuildExtension, CUDA_HOME

setup(
    name="my_extension",
    ext_modules=[
        CUDAExtension(
            name="my_extension",
            sources=["my_extension.cpp", "my_kernel.cu"],
        ),
    ],
    cmdclass={"build_ext": BuildExtension},
)

After:

import torchada  # Add this line at the top
from setuptools import setup
from torch.utils.cpp_extension import CUDAExtension, BuildExtension, CUDA_HOME

# Exactly the same setup code works on all supported platforms!
setup(
    name="my_extension",
    ext_modules=[
        CUDAExtension(
            name="my_extension",
            sources=["my_extension.cpp", "my_kernel.cu"],
        ),
    ],
    cmdclass={"build_ext": BuildExtension},
)

Example 5: Distributed Training

Before:

import torch.distributed as dist
dist.init_process_group(backend='nccl')

After:

import torchada  # Add this line
import torch.distributed as dist
dist.init_process_group(backend='nccl')  # Works on all supported platforms
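
A fuller per-rank setup under a launcher such as torchrun might look like the following sketch (LOCAL_RANK and the rendezvous variables are set by the launcher):

import os

import torchada  # noqa: F401 - patch first
import torch
import torch.distributed as dist

local_rank = int(os.environ["LOCAL_RANK"])
torch.cuda.set_device(local_rank)          # selects the MUSA device on MUSA
dist.init_process_group(backend="nccl")    # mapped to the platform's collective backend
print(f"rank {dist.get_rank()}/{dist.get_world_size()} ready")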

Example 6: CUDA Graphs

Before:

g = torch.cuda.CUDAGraph()
with torch.cuda.graph(g):
    y = model(x)

After:

import torchada  # Add this line
import torch

g = torch.cuda.CUDAGraph()
with torch.cuda.graph(g):
    y = model(x)
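
In practice, capture is preceded by a few warmup iterations on a side stream, and inputs are updated in place before each replay. A sketch of that standard pattern, assuming model is already on the GPU and takes a single fixed-shape tensor:

import torchada  # noqa: F401
import torch

static_x = torch.randn(8, 64, device="cuda")

# Warm up on a side stream before capture.
s = torch.cuda.Stream()
s.wait_stream(torch.cuda.current_stream())
with torch.cuda.stream(s):
    for _ in range(3):
        model(static_x)
torch.cuda.current_stream().wait_stream(s)

g = torch.cuda.CUDAGraph()
with torch.cuda.graph(g):
    static_y = model(static_x)

# Replay: copy new data into the captured input tensor, then rerun the graph.
static_x.copy_(torch.randn(8, 64, device="cuda"))
g.replay()                                 # static_y now holds the new output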

What Happens Under the Hood

When you import torchada, it:

  1. Detects the platform: Identifies the available GPU hardware
  2. Patches PyTorch modules: Makes torch.cuda and torch.utils.cpp_extension work transparently
  3. Translates device strings: "cuda" device strings work on any supported platform
  4. Maps backends: "nccl" backend works on all supported platforms
  5. Converts symbols: CUDA API calls in extensions are mapped to platform equivalents
  6. Handles compilation: .cu files are compiled with the appropriate compiler
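
To make the patching step concrete, the toy shim below shows how a function can be redirected at import time. This is a conceptual sketch, not torchada's actual implementation; it assumes a torch.musa module that mirrors the torch.cuda API:

import torch

def _apply_patches():
    """Toy illustration: point selected torch.cuda functions at torch.musa."""
    if not hasattr(torch, "musa"):
        return  # nothing to patch on CUDA/CPU platforms
    torch.cuda.synchronize = torch.musa.synchronize  # assumed mirror API
    torch.cuda.set_device = torch.musa.set_device

_apply_patches()  # torchada runs its (far more complete) version on import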

Environment Variables

You can force a specific platform:

export TORCHADA_PLATFORM=cuda  # or cpu
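
Because torchada detects the platform at import time, the variable must be set before the import, either in the shell as above or at the very top of the entry script:

import os

os.environ["TORCHADA_PLATFORM"] = "cpu"  # must be set before the import below
import torchada  # noqa: F401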

Common Patterns in Popular Projects

vLLM-style setup.py

# Just add this at the top of setup.py
import torchada

# Keep all your existing imports unchanged
from torch.utils.cpp_extension import CUDAExtension, BuildExtension

SGLang-style code

# Just add this at the top
import torchada

# All existing torch.cuda code works unchanged
import torch.cuda
torch.cuda.synchronize()

Tips for Migration

  1. Import torchada first: Always import torchada before torch to ensure patches are applied
  2. Replace GPU checks: Replace torch.cuda.is_available() with a has_gpu() helper (see above)
  3. Keep standard imports: Use from torch.utils.cpp_extension import ... (not from torchada)
  4. Keep "cuda" strings: No need to change device strings - torchada handles platform differences
  5. Test your code: Verify your code works correctly after adding the torchada import (a minimal smoke test follows this list)
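
For the last tip, a tiny smoke test like this is often enough to confirm the patch took effect (a sketch, reusing the has_gpu() helper defined earlier):

import torchada  # noqa: F401 - must come first
import torch

assert has_gpu(), "no CUDA or MUSA device detected"
x = torch.ones(4, device="cuda")   # "cuda" works on MUSA too
torch.cuda.synchronize()
print("smoke test OK:", x.sum().item())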

Why torch.cuda.is_available() Returns False on MUSA

By design, torch.cuda.is_available() is NOT redirected to torch.musa.is_available(). This lets downstream projects (such as SGLang and vLLM) reliably detect the platform using patterns like:

import torch

if torch.version.cuda is not None:
    platform = "cuda"   # CUDA platform
elif hasattr(torch.version, 'musa') and torch.version.musa is not None:
    platform = "musa"   # MUSA platform

Use torch.version.cuda and torch.version.musa for platform detection.