Releases · facebookresearch/xformers
v0.0.33.post1
- Fixed wheel upload to PyPI
- Support for PyTorch 2.9
Added
- cutlass FMHA op for Blackwell GPUs (a minimal call sketch follows these notes)
- Support for the flash-attention package up to 2.8.3
- Exposed the FA3 deterministic mode
- Forward+backward pass overlap, for DeepSeek-like comms/compute overlapping
Improved
- merge_attentions now supports irregular head dimensions
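The new backends are reached through the usual fMHA entry point rather than a dedicated API. Below is a minimal sketch, not taken from the release notes: shapes and dtype are illustrative, and which backend (e.g. the new cutlass op on Blackwell) gets picked is decided by xFormers' dispatcher.

```python
# Minimal sketch: the standard xFormers fMHA entry point. On supported hardware
# it dispatches to a backend such as the new cutlass FMHA op (assumption: the
# exact dispatch rules are not described in these notes).
import torch
import xformers.ops as xops

B, M, H, K = 2, 1024, 8, 128  # batch, sequence length, heads, head dim (illustrative)
q = torch.randn(B, M, H, K, device="cuda", dtype=torch.bfloat16)
k = torch.randn(B, M, H, K, device="cuda", dtype=torch.bfloat16)
v = torch.randn(B, M, H, K, device="cuda", dtype=torch.bfloat16)

out = xops.memory_efficient_attention(q, k, v)  # xFormers auto-selects a backend
```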
v0.0.32.post2
Added a ROCm 6.4 build
v0.0.32.post1
Fixed Windows wheel build timeouts by building with `MAX_JOBS=3` and updating `wheels_build.yml` (#1309)
v0.0.32: Wheels for PyTorch 2.8.0
Pre-built binary wheels are available for PyTorch 2.8.0.
Added
- Support for the flash-attention package up to 2.8.2
- Speed improvements to `python -m xformers.profiler.find_slowest`
Removed
- Removed the autograd backward pass for merge_attentions, as it was easy to use incorrectly.
- Attention biases are no longer `torch.Tensor` subclasses. This is no longer necessary for torch.compile to work, and it was adding complexity (see the sketch below).
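As a rough sketch of what this means in practice, assuming the existing `LowerTriangularMask` bias class and illustrative shapes: bias objects are still passed to `memory_efficient_attention` as before, and the call can live inside a function compiled with torch.compile.

```python
# Minimal sketch, assuming the existing LowerTriangularMask attn_bias class:
# bias objects are plain Python objects (no longer torch.Tensor subclasses)
# and still work when the calling function is compiled with torch.compile.
import torch
import xformers.ops as xops
from xformers.ops.fmha.attn_bias import LowerTriangularMask

q = torch.randn(1, 512, 8, 64, device="cuda", dtype=torch.float16)
k = torch.randn(1, 512, 8, 64, device="cuda", dtype=torch.float16)
v = torch.randn(1, 512, 8, 64, device="cuda", dtype=torch.float16)

def causal_attention(q, k, v):
    # The bias object describes a causal (lower-triangular) masking pattern.
    return xops.memory_efficient_attention(q, k, v, attn_bias=LowerTriangularMask())

out = torch.compile(causal_attention)(q, k, v)
```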
`v0.0.31.post1` Fixed wheels for Windows
Removed the merge_attentions backward pass (fairinternal/xformers#1402).
v0.0.31 - PyTorch 2.7.1, Flash3 on Windows, and dropping V100 support
[0.0.31] - 2025-06-25
Pre-built binary wheels are available for PyTorch 2.7.1.
Added
- xFormers wheels are now Python-version agnostic: the same wheel can be used for Python 3.9, 3.10, ..., 3.13
- Added support for Flash-Attention 3 on Ampere GPUs (see the backend-selection sketch after these notes)
Removed
- We will no longer support V100 or older GPUs, following PyTorch (pytorch/pytorch#147607)
- Deprecated support for building Flash-Attention 2 as part of xFormers. For Ampere GPUs, we now use Flash-Attention 3 on Windows, and Flash-Attention 2 can still be used through PyTorch on Linux.
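For reference, a hedged sketch of selecting the Flash-Attention 3 backend explicitly instead of relying on auto-selection. The `xformers.ops.fmha.flash3` module path and the `FwOp`/`BwOp` names are assumptions based on how other backends are exposed; shapes are illustrative.

```python
# Sketch only: force the Flash-Attention 3 backend via the op= argument.
# The flash3 module path and FwOp/BwOp names are assumptions; if FA3 is not
# available for the current GPU, dispatch will raise an error.
import torch
import xformers.ops as xops
from xformers.ops import fmha

q = torch.randn(4, 2048, 16, 128, device="cuda", dtype=torch.bfloat16)
k = torch.randn(4, 2048, 16, 128, device="cuda", dtype=torch.bfloat16)
v = torch.randn(4, 2048, 16, 128, device="cuda", dtype=torch.bfloat16)

out = xops.memory_efficient_attention(
    q, k, v,
    op=(fmha.flash3.FwOp, fmha.flash3.BwOp),  # explicit backend selection (assumed names)
)
```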
`v0.0.30` - build for PyTorch 2.7.0
Pre-built binary wheels are available for PyTorch 2.7.0. Following PyTorch, we build wheels for CUDA 11.8, 12.6, and 12.8 only (we no longer build for CUDA 12.4).
xFormers now requires PyTorch >= 2.7
Added
- [fMHA] Added support for local attention on the Flash3 backend (H100); see the sketch at the end of these notes
- [fMHA] Added a new paged gappy attention bias
Improved
- [fMHA] The FlashAttention3 backend now ships with more head dimensions to support MLA, and with a FLOPs formula, in order to be compatible with PyTorch's partitioner-based automatic activation checkpointing
- The fused operators for sequence parallelism were migrated to PyTorch's SymmetricMemory
- The profiler now prefixes trace filenames with the process rank when doing distributed training
Removed
- Removed documentation for legacy unmaintained components
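A hedged sketch of the local-attention feature mentioned above. The `LocalAttentionFromBottomRightMask` class and its `window_left`/`window_right` parameters are assumptions about the attn_bias API, and the window sizes and shapes are illustrative.

```python
# Sketch only (bias class name and parameters are assumptions): sliding-window
# attention expressed as an attn_bias object, which the Flash3 backend can
# serve on H100 according to these notes.
import torch
import xformers.ops as xops
from xformers.ops.fmha.attn_bias import LocalAttentionFromBottomRightMask

q = torch.randn(1, 4096, 8, 128, device="cuda", dtype=torch.bfloat16)
k = torch.randn(1, 4096, 8, 128, device="cuda", dtype=torch.bfloat16)
v = torch.randn(1, 4096, 8, 128, device="cuda", dtype=torch.bfloat16)

# Each query attends to the 256 keys to its left and none to its right.
bias = LocalAttentionFromBottomRightMask(window_left=256, window_right=0)
out = xops.memory_efficient_attention(q, k, v, attn_bias=bias)
```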
`v0.0.29.post3` Fix CUDA 12.6 builds on Windows
Fixed missing builds for CUDA 12.6 on Windows.
`v0.0.29.post2` - build for PyTorch 2.6.0
Pre-built binary wheels are available for PyTorch 2.6.0. Following PyTorch, we build wheels for CUDA 11.8, 12.4, and 12.6 only (we no longer build for CUDA 12.1).
xFormers now requires PyTorch >= 2.6