Improve RD testing to recover overlapped CNVs > 100 kbp #851

kjaisingh · 2025-08-04T14:37:22Z

Description

This PR includes code to improve the depth testing methods in order to capture large CNVs that are overlapped by another CNV of a different SV type but called as two distinct events.

During our DRAGEN-SV benchmarking evaluations, we noticed 3 real CNVs > 100 kbp that were dropped following FilterBatchSites. Upon closer inspection of their characteristics, we came to the conclusion that:

All 3 of these CNVs were overlapped by another distinct CNV of a different SV type, but had far fewer samples called for it relative to the overlapping CNV.
The background depth levels for the dropped CNV were hence distorted.
This resulted in the RD_log_pval values for these CNVs being shockingly low, despite the CNVs clearly appearing to be real events.

This PR introduces the following enhancements to ensure that such CNVs are not dropped:

The singlesampZ test now uses a robust z-score based on median depths rather than means, which is distorted when there is an overlapping event of a different SV type in the background.
twosampPerm test: TBD.

Testing

The following deck has testing results so far.

Pre-Merge Changes Required

Conduct end-to-end testing.
Copy over changes into RdTestV2.R.

…ample test

… package correctly installed

kjaisingh added 30 commits May 23, 2025 16:01

Added print statements to rdtest

046ac3a

Updated to use t-test and mad/median

b4997e3

Implemented ttest

d9a6b14

Implemented median/MAD for singlesample test, Welch's t-test for twos…

806f878

…ample test

Reset other RD testing scripts to original versions

ee18730

Used wilcoxon rank-sum test for twosample case

de90d27

Implemented permutation test with n=1000

d1cbd78

Implemented anderson-darling test

c2513e4

Added ksamples to dockerfile

9c19c9c

Implemented Cramér-von Mises test

ade5bd4

Implemented robust welch's t-test with median/MAD

4cb02c9

Revert to anderson-darling but import ksamples in RdTest now

83c94c6

Use the cramer-von mises test implementation again, but with ksamples…

b42ea0f

… package correctly installed

Implemented robust perm test with median/mad in pclt method

e08a8fd

Correct double application of correction factor

8e0b27e

Correct double application of correction factor pt2

c81eab2

Reverted to AD test

2122624

Implemented CVM test

a209d83

Used mad/median throughout instead of median/std

e988fcb

Use robust z-score V3

cd7407c

Reverted to original implementation + dockerfile

213f933

Simple replacement of mean with median for group calculation

90e1ee7

Include original code for reference

c3b143c

WIP

9a006e9

Merge branch 'main' into kj_rdtest_verbose

7dd4c1b

Merge branch 'main' into kj_rdtest_verbose

970da78

Stop tracking RdTestV0.R

c8a6618

Add single sample robust z-score back to script

4e89474

Accidentally modified RdTestV2.R instead of main version

da5050f

Remove permTS function itself, just keep twosample.pclt override

98a7b45

kjaisingh self-assigned this Aug 4, 2025

kjaisingh added the methods label Aug 4, 2025

kjaisingh added 10 commits August 4, 2025 17:01

Minor formatting change + removed from gitignore

c1246cd

Removed redundant comment

f3d1e2c

Removed redundant comment

c8b3729

Merge branch 'main' into kj_rdtest_recover_overlapped_cnvs

39341b2

Removed overwriting of robust correction

b4d49c9

Reverted to OG implementation of perm test

0164cbe

Implemented sampling of middle 80%

cf54280

Implemented sampling of only those within 6 MAD

7235d2b

Implemented hard cutoffs

7d72f36

Log counts before/after dropping bg samples

4f5b9d3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve RD testing to recover overlapped CNVs > 100 kbp #851

Improve RD testing to recover overlapped CNVs > 100 kbp #851

Uh oh!

kjaisingh commented Aug 4, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Improve RD testing to recover overlapped CNVs > 100 kbp #851

Are you sure you want to change the base?

Improve RD testing to recover overlapped CNVs > 100 kbp #851

Uh oh!

Conversation

kjaisingh commented Aug 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Testing

Pre-Merge Changes Required

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kjaisingh commented Aug 4, 2025 •

edited

Loading