trvachov/causal-conv1d

Fork explanation

This is a fork of https://github.com/Dao-AILab/causal-conv1d with a patched version of Dao-AILab#45 applied on top.

This fork is pulled into the BioNeMo Framework to enable tentative Blackwell support (see https://github.com/NVIDIA/bionemo-framework/blob/main/Dockerfile).

Once the upstream repository enables Blackwell support, this fork will be removed.

Causal depthwise conv1d in CUDA with a PyTorch interface

Features:

  • Supports fp32, fp16, and bf16.
  • Kernel sizes 2, 3, and 4.

How to use

from causal_conv1d import causal_conv1d_fn
def causal_conv1d_fn(x, weight, bias=None, activation=None):
    """
    x: (batch, dim, seqlen)
    weight: (dim, width)
    bias: (dim,)
    activation: either None or "silu" or "swish"

    out: (batch, dim, seqlen)
    """

Equivalent to:

import torch.nn.functional as F

batch, dim, seqlen = x.shape
width = weight.shape[1]
F.conv1d(x, weight.unsqueeze(1), bias, padding=width - 1, groups=dim)[..., :seqlen]
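To make the semantics concrete without requiring PyTorch or CUDA, here is a minimal pure-Python sketch of what the causal depthwise convolution computes. `causal_conv1d_ref` is a hypothetical name chosen for illustration; it follows the shapes in the docstring above (`x`: (batch, dim, seqlen), `weight`: (dim, width), `bias`: (dim,)) and is not part of the package's API.

```python
def causal_conv1d_ref(x, weight, bias=None):
    """Reference sketch of causal depthwise conv1d using nested lists.

    Each channel d is convolved only with its own filter weight[d]
    (depthwise), and output position t sees only inputs at positions <= t
    (causal, i.e. the input is implicitly left-padded with width-1 zeros).
    """
    batch = len(x)
    dim = len(weight)
    width = len(weight[0])
    seqlen = len(x[0][0])
    out = [[[0.0] * seqlen for _ in range(dim)] for _ in range(batch)]
    for b in range(batch):
        for d in range(dim):
            for t in range(seqlen):
                acc = bias[d] if bias is not None else 0.0
                for k in range(width):
                    # src < 0 corresponds to the zero left-padding.
                    src = t - (width - 1) + k
                    if src >= 0:
                        acc += weight[d][k] * x[b][d][src]
                out[b][d][t] = acc
    return out
```

For example, with one batch, one channel, input `[1, 2, 3]` and kernel `[1, 1]` (width 2), the output is `[1, 3, 5]`: each position sums itself and its left neighbor, with position 0 seeing a zero to its left.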

Additional Prerequisites for AMD cards

Patching ROCm

If you are on ROCm 6.0, run the following steps to avoid errors during compilation. This is not required for ROCm 6.1 onwards.

  1. Locate your ROCm installation directory. It is typically /opt/rocm/, but may vary depending on your installation.

  2. Apply the patch. Run with sudo if you encounter permission issues.

     patch /opt/rocm/include/hip/amd_detail/amd_hip_bf16.h < rocm_patch/rocm6_0.patch 
