GS/SW: Mask color gradients to prevent incorrect clamping. #13519

TJnotJT · 2025-11-11T16:10:53Z

Description of Changes

Mask color gradients before the 32-bit to 16-bit conversion in primitive setup to prevent unwanted clamping.
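As a rough illustration of why masking helps, here is a scalar model of a single lane. It assumes the 32-bit to 16-bit conversion behaves like an unsigned-saturating pack (as x86 `PACKUSDW` does); the thread does not show which pack instruction the setup code actually uses, and `pack_usat16` and `mask_then_pack` are hypothetical names.

```cpp
#include <cassert>
#include <cstdint>

// Scalar model of one lane of an unsigned-saturating 32->16 pack:
// the signed 32-bit input is clamped to [0, 65535].
static uint16_t pack_usat16(int32_t v)
{
	if (v < 0)
		return 0;
	if (v > 0xFFFF)
		return 0xFFFF;
	return static_cast<uint16_t>(v);
}

// Masking first keeps only the low 16 bits, so the value is already in
// [0, 65535] and the pack never has to clamp.
static uint16_t mask_then_pack(int32_t v)
{
	const uint32_t low = static_cast<uint32_t>(v) & 0xFFFFu;
	return pack_usat16(static_cast<int32_t>(low));
}
```

Under that assumption, an oversized gradient such as `0x12345` is clamped to `0xFFFF` by the pack alone, but becomes `0x2345` when masked first.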

Rationale behind Changes

When gradients are too large to fit in the fixed-point format, letting them roll over (wrap around) still produces the correct colors in the scanline renderer. This change makes sure the roll-over happens correctly, instead of the values being clamped, to prevent graphical bugs.
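The roll-over argument is plain modular arithmetic. The sketch below assumes the renderer steps colors in 16-bit lanes with wrapping adds, which this thread does not spell out; all function names are hypothetical. A gradient truncated to its low 16 bits then gives the same 16-bit result as stepping at full width, while a clamped gradient does not.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: keep only the low 16 bits of a gradient (roll-over).
static uint16_t wrap16(int32_t v)
{
	return static_cast<uint16_t>(static_cast<uint32_t>(v) & 0xFFFFu);
}

// Hypothetical helper: clamp a gradient to the signed 16-bit range instead.
static uint16_t clamp16(int32_t v)
{
	if (v > INT16_MAX)
		v = INT16_MAX;
	if (v < INT16_MIN)
		v = INT16_MIN;
	return static_cast<uint16_t>(static_cast<int16_t>(v));
}

// Step a 16-bit accumulator with wrapping adds, as a 16-bit SIMD lane would.
static uint16_t step16(uint16_t start, uint16_t grad, int steps)
{
	uint16_t c = start;
	for (int i = 0; i < steps; i++)
		c = static_cast<uint16_t>(c + grad);
	return c;
}

// Reference: step at full 32-bit width and keep the low 16 bits at the end.
static uint16_t step32(uint16_t start, int32_t grad, int steps)
{
	uint32_t c = start;
	for (int i = 0; i < steps; i++)
		c += static_cast<uint32_t>(grad);
	return static_cast<uint16_t>(c & 0xFFFFu);
}
```

For example, `step16(0x1000, wrap16(0x12345), 7)` equals `step32(0x1000, 0x12345, 7)`, whereas substituting `clamp16` for `wrap16` yields a different value.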

Fixes #6459
Fixes #10210.

Suggested Testing Steps

Test any games with the SW renderer on both SSE and AVX2 builds.

Did you use AI to help find, test, or implement this issue or feature?

Looking up aarch64 instructions.

Credits

Co-authored-by: TellowKrinkle

TJnotJT · 2025-11-11T16:13:26Z

Haven't done a dump run yet so converting to draft.

TellowKrinkle · 2025-11-14T05:16:31Z

Didn't look super hard but I think you're missing save & restore of xmm15 on Windows. Yay for calling conventions being fun.

TJnotJT · 2025-11-14T22:04:01Z

> Didn't look super hard but I think you're missing save & restore of xmm15 on Windows. Yay for calling conventions being fun.

Good point, I had forgotten about this. Just fixed it.
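For context on the xmm15 remark: in the Microsoft x64 calling convention xmm6 through xmm15 are nonvolatile, so generated code that clobbers them must save and restore them, while the System V AMD64 ABI treats every XMM register as caller-saved. A minimal sketch of that rule, with a hypothetical `Abi` enum and helper that are not part of PCSX2:

```cpp
#include <cassert>

// Hypothetical ABI tag, for illustration only.
enum class Abi
{
	WindowsX64,
	SystemV,
};

// True if a function that clobbers xmm<reg> must save and restore it.
// Windows x64: xmm6-xmm15 are nonvolatile.
// System V AMD64: no XMM register is callee-saved.
constexpr bool xmm_is_callee_saved(Abi abi, int reg)
{
	return abi == Abi::WindowsX64 && reg >= 6 && reg <= 15;
}
```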

TellowKrinkle

ARM has non-saturating pack instructions, so no separate masking is needed.

GH doesn't let me suggest edits on existing code, but the entire second section can be replaced with:

			// VectorI r = VectorI(dr * shift[1 + i]);

			armAsm->Fmul(v2.V4S(), v0.V4S(), VRegister(4 + i, kFormat4S));
			armAsm->Fcvtzs(v2.V4S(), v2.V4S());

			// VectorI b = VectorI(db * shift[1 + i]);

			armAsm->Fmul(v3.V4S(), v1.V4S(), VRegister(4 + i, kFormat4S));
			armAsm->Fcvtzs(v3.V4S(), v3.V4S());

			// m_local.d[i].rb = r.trn1_16(b); // Yeah I know this isn't in GSVector since that's mainly targeting x86 for now
			armAsm->Trn1(v2.V8H(), v2.V8H(), v3.V8H());
			armAsm->Str(v2, _local(d[i].rb));
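To see why `Trn1` on `.8H` lanes needs no masking, here is a scalar model of the instruction. `Vec4i`, `Vec8h`, `as_8h` and `trn1_16` are hypothetical names for illustration, not GSVector APIs. TRN1 copies the even-numbered 16-bit lanes of both inputs, which for 32-bit values are the low halves, so the result interleaves the truncated red and blue gradients without any saturation.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using Vec4i = std::array<int32_t, 4>;
using Vec8h = std::array<uint16_t, 8>;

// View a 4x32-bit vector as 8x16-bit lanes (low half of each word first).
static Vec8h as_8h(const Vec4i& v)
{
	Vec8h out{};
	for (int i = 0; i < 4; i++)
	{
		const uint32_t u = static_cast<uint32_t>(v[i]);
		out[2 * i] = static_cast<uint16_t>(u & 0xFFFFu);
		out[2 * i + 1] = static_cast<uint16_t>(u >> 16);
	}
	return out;
}

// Scalar model of TRN1 on .8H: even lanes of n fill the even result lanes,
// even lanes of m fill the odd result lanes.
static Vec8h trn1_16(const Vec8h& n, const Vec8h& m)
{
	Vec8h out{};
	for (int p = 0; p < 4; p++)
	{
		out[2 * p] = n[2 * p];
		out[2 * p + 1] = m[2 * p];
	}
	return out;
}
```

With `r = {0x12345, -1, 0x7FFF, 0x10000}` and `b = {0xABCD, 2, -70000, 0x8000}`, `trn1_16(as_8h(r), as_8h(b))` produces the low 16 bits of each lane, interleaved as r0, b0, r1, b1, and so on.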

pcsx2/GS/Renderers/SW/GSSetupPrimCodeGenerator.arm64.cpp

TJnotJT · 2025-11-15T12:25:22Z

> ARM has non-saturating pack instructions, so no separate masking is needed.
>
> GH doesn't let me suggest edits on existing code, but the entire second section can be replaced with

Sounds good, I made the suggested changes in a new commit so it's clear where it diverged from x64. When you have a chance, let me know if it looks kosher (along with the amended comments).

pcsx2/GS/Renderers/SW/GSSetupPrimCodeGenerator.arm64.cpp

TellowKrinkle

Looks good assuming nothing breaks in the dump run

TJnotJT · 2025-11-18T20:15:11Z

The dump run with the SSE4 build came back clean, so I'm removing draft status. Just to be safe I'll do an AVX2 run also.

Edit: If anyone is able to do an ARM dump run that would be highly appreciated.

JordanTheToaster

Tested a variety of games and all seems fine on Windows with AVX2.

TJnotJT · 2025-11-20T01:25:31Z

Also did an AVX2 dump run - all looks good.

lightningterror

Did an SSE4 dump run and didn't notice any issues (700+ dumps showed diffs with no visual difference, so maybe something could still slip through).

TJnotJT · 2025-11-26T20:44:14Z

> Did an SSE4 dump run and didn't notice any issues (700+ dumps showed diffs with no visual difference, so maybe something could still slip through).

Great, thanks for the testing. Yeah, many dumps will have small differences, but hopefully these are all slight improvements.

TJnotJT requested a review from TellowKrinkle November 11, 2025 16:10

github-actions bot added the GS and GS: Software labels Nov 11, 2025

TJnotJT marked this pull request as draft November 11, 2025 16:13

GS/SW: Mask color gradients to prevent incorrect clamping.

941e6a4

Co-authored-by: TellowKrinkle

TJnotJT force-pushed the gs-sw-gradient-mask branch from 1d3ce85 to 941e6a4 Compare November 14, 2025 22:01

TellowKrinkle reviewed Nov 15, 2025

pcsx2/GS/Renderers/SW/GSSetupPrimCodeGenerator.arm64.cpp (outdated)

TJnotJT force-pushed the gs-sw-gradient-mask branch from 82fe3dd to 6025236 Compare November 15, 2025 12:49

TellowKrinkle reviewed Nov 16, 2025

pcsx2/GS/Renderers/SW/GSSetupPrimCodeGenerator.arm64.cpp (outdated)

GS/SW: Use non-saturating ARM instructions for color gradient setup.

0cf9ea8

This is more efficient on ARM, though the equivalent instructions are not currently used in the x64 JIT and C++ versions of GSVector.

Co-authored-by: TellowKrinkle

TJnotJT force-pushed the gs-sw-gradient-mask branch from 6025236 to 0cf9ea8 Compare November 16, 2025 17:48

TellowKrinkle approved these changes Nov 17, 2025

TJnotJT marked this pull request as ready for review November 18, 2025 20:15

JordanTheToaster approved these changes Nov 19, 2025

JordanTheToaster added this to the Release 2.6 milestone Nov 25, 2025

lightningterror approved these changes Nov 26, 2025

lightningterror merged commit f322dfb into PCSX2:master Nov 26, 2025
12 checks passed
