fix(simd): umasked AVX2 load #239

ahuber21 · 2025-12-09T10:14:00Z

The previous code always performed a full width load on the provided data. In ragged-epilogue scenarios, where we request a masked load, this resulted in SEGV errors in certain runs with address sanitizer.

    if (i < count.size()) {
        auto mask = create_mask<simd_width>(count);
        s0 = op.accumulate(mask, s0, op.load_a(mask, a + i), op.load_b(mask, b + i));
    }

Why wasn't this caught sooner?

The OS only triggers a segmentation fault if a read accesses an unmapped memory page. Since memory protection (typically) operates at a 4KB page granularity, reading past the end of a buffer is "safe" from the OS's perspective unless the overflow happens to cross exactly into an unmapped page.

Why is ASan catching it sporadically?

Since our underlying object storage is std::vector, ASan detection requires two specific conditions to align:

No Spare Capacity: The vector's size() must equal its capacity(). If there is spare capacity, the unsafe load simply reads valid (though uninitialized) memory owned by the vector.
Alignment & Redzones: The underlying heap allocation must be sized and aligned such that the full-width SIMD read (e.g., 32 bytes) actually crosses the allocation boundary into the ASan redzone. If the allocator adds padding for alignment, the read might land in that valid padding instead.

.github/workflows/asan.yml

This reverts commit 528ff19.

This reverts commit 3ba1fd8.

ahuber21 · 2025-12-10T16:32:02Z

@copilot summarize the changes in this PR

Copilot · 2025-12-10T16:32:11Z

@ahuber21 I've opened a new pull request, #242, to work on those changes. Once the pull request is ready, I'll request review from you.

Copilot

Pull request overview

This PR fixes a critical SIMD memory safety bug in AVX2 masked load operations that could cause segmentation faults when reading beyond allocated memory boundaries. The fix ensures masked loads respect buffer boundaries in ragged-epilogue scenarios.

Key Changes:

Fixed AVX2 masked load implementation to prevent out-of-bounds reads
Added comprehensive ASan-detected regression tests for distance computations
Enhanced CI with dedicated AddressSanitizer build configuration

Reviewed changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
tests/svs/lib/avx_detection.cpp	Added test for runtime AVX flag patching mechanism
tests/svs/core/distance.cpp	Added ASan regression tests for distance computation with ragged epilogues
tests/CMakeLists.txt	Updated Catch2 to v3.11.0 and improved test discovery with tag-based labels
.github/workflows/build-linux.yml	Added clang++-18 ASan build configuration with leak detection disabled
tests/svs/index/vamana/multi.cpp	Tagged test as long-running to exclude from ASan builds
tests/svs/index/vamana/index.cpp	Tagged test as long-running to exclude from ASan builds
tests/svs/index/inverted/memory_based.cpp	Tagged test as long-running to exclude from ASan builds
tests/svs/index/inverted/clustering.cpp	Tagged test as long-running to exclude from ASan builds

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tests/svs/lib/avx_detection.cpp

tests/svs/core/distance.cpp

.github/workflows/build-linux.yml

This reverts commit f856a96.

Co-authored-by: Copilot <[email protected]>

ibhati

The new workflow is added in the existing workflows, please make sure the earlier flow and options did not change.

ibhati · 2025-12-10T17:25:41Z

tests/svs/index/inverted/clustering.cpp

 } // namespace

-CATCH_TEST_CASE("Random Clustering - End to End", "[inverted][random_clustering]") {
+CATCH_TEST_CASE("Random Clustering - End to End", "[long][inverted][random_clustering]") {


What is this [long]? Why is this required?

Sorry, had a comment that I didn't post. I added a new tag [long] that marks long-running tests. They are skipped in the debug asan build.

ScalableVectorSearch/.github/workflows/build-linux.yml

Lines 63 to 64 in 63e58cd

# skip longer-running tests

ctest_args: "-LE long"

ethanglaser

Do we know how long the ASan run takes if completing successfully?

.github/workflows/build-linux.yml

…kip (#241) ASan will be added in #239. It flags `SearchBuffer::can_skip`. Here we reorder the logic to check `full()` before accessing `back()`. Accessing `back()` on an empty buffer caused an index underflow (`SIZE_MAX`).

ahuber21 · 2025-12-11T10:24:50Z

Do we know how long the ASan run takes if completing successfully?

@ethanglaser about 10 minutes total, < 5 mins of which is testing.

ahuber21 · 2025-12-11T10:29:39Z

The new workflow is added in the existing workflows, please make sure the earlier flow and options did not change.

@ibhati the only change is that the existing steps are executing three more tests (with negligible runtime).

fix(simd): umasked AVX2 load

ef180f7

ahuber21 requested a review from ibhati as a code owner December 9, 2025 10:14

remove L2Impl specific test

c1705f5

ahuber21 requested review from mihaic and yuejiaointel as code owners December 9, 2025 14:45

add asan yml

05dce8f

ahuber21 requested review from ethanglaser and homksei as code owners December 9, 2025 15:04

ahuber21 added 2 commits December 9, 2025 07:07

fix naming and clang version

ecf5b28

copyright

e5d46aa

ahuber21 removed the request for review from homksei December 9, 2025 15:09

typo

f271dc9

ibhati reviewed Dec 9, 2025

View reviewed changes

.github/workflows/asan.yml Outdated Show resolved Hide resolved

ibhati reviewed Dec 9, 2025

View reviewed changes

.github/workflows/asan.yml Outdated Show resolved Hide resolved

comments on workflow file

2f8dd96

ibhati approved these changes Dec 9, 2025

View reviewed changes

ahuber21 added 6 commits December 10, 2025 00:24

chore(ci): streamline asan & linux workflows

4046e99

fixup: undo format

9bc5313

refactor: use CATCH_TEMPLATE_TEST_CASE

2a43709

fix(ci): remove unused cmake option

1e74a55

fix(ci): skip integration testsin debug asan build

dbe4ae8

feat(ci): add new tag 'long' that's skipped for asan

2b00b78

ahuber21 requested a review from ibhati December 10, 2025 09:50

ahuber21 added 7 commits December 10, 2025 05:53

update catch2 and use ADD_TAGS_AS_LABELS ADD_TAGS_AS_LABELS

6cf9ecd

add more [long] labels

d94e2a7

revert simd_utils.h to trip new asan check in CI

3ba1fd8

add AVX2 L2 calculation back to trigger asan

528ff19

Revert "add AVX2 L2 calculation back to trigger asan"

99dbac3

This reverts commit 528ff19.

Revert "revert simd_utils.h to trip new asan check in CI"

055214f

This reverts commit 3ba1fd8.

run all ISA paths in test

478c0dc

keep avx_runtime_flags const by using const_cast in tests

c205443

ahuber21 mentioned this pull request Dec 10, 2025

fix(search-buffer): ASan heap-buffer-underflow in SearchBuffer::can_skip #241

Merged

ahuber21 added 4 commits December 10, 2025 08:13

fix false positive failure for skipped tests

de0bfac

fix: only modify isa dispatching on x86

be64f34

fixup

f856a96

simplify test

80d1d84

Copilot AI mentioned this pull request Dec 10, 2025

[WIP] Fix umasked AVX2 load to prevent segmentation faults #242

Closed

ahuber21 requested a review from Copilot December 10, 2025 16:33

Copilot AI reviewed Dec 10, 2025

View reviewed changes

tests/svs/lib/avx_detection.cpp Show resolved Hide resolved

tests/svs/core/distance.cpp Show resolved Hide resolved

.github/workflows/build-linux.yml Show resolved Hide resolved

.github/workflows/build-linux.yml Show resolved Hide resolved

ahuber21 and others added 4 commits December 10, 2025 09:01

Revert "fixup"

966d58c

This reverts commit f856a96.

fixup

6fcc214

Include asan in C flags

7d5b6ed

Co-authored-by: Copilot <[email protected]>

fixup

63e58cd

ibhati approved these changes Dec 10, 2025

View reviewed changes

rfsaliev added a commit to RedisAI/VectorSimilarity that referenced this pull request Dec 10, 2025

Apply SVS umasked read fix from intel/ScalableVectorSearch#239

ff23d3a

ethanglaser reviewed Dec 10, 2025

View reviewed changes

.github/workflows/build-linux.yml Outdated Show resolved Hide resolved

.github/workflows/build-linux.yml Outdated Show resolved Hide resolved

ahuber21 and others added 2 commits December 11, 2025 02:01

remove asan_options; remove auto-formatted double-quote change

8b36bef

Merge branch 'main' into dev/fix-unmasked-read

4ee907d

ahuber21 merged commit 724ac33 into main Dec 11, 2025
15 checks passed

ahuber21 deleted the dev/fix-unmasked-read branch December 11, 2025 10:58

fix(simd): umasked AVX2 load #239

fix(simd): umasked AVX2 load #239

Uh oh!

Conversation

ahuber21 commented Dec 9, 2025

Uh oh!

Uh oh!

Uh oh!

ahuber21 commented Dec 10, 2025

Uh oh!

Copilot AI commented Dec 10, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ibhati left a comment

Choose a reason for hiding this comment

Uh oh!

ibhati Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

ahuber21 Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

ethanglaser left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

ahuber21 commented Dec 11, 2025

Uh oh!

ahuber21 commented Dec 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants