Will it be beneficial to use all genomics bins, instead of most variable feature selection? #121
wangmeijiao asked this question in Q&A
SnapATAC version 1 also filters bins. Some bins, especially those with low counts, may be very noisy. If you don't filter them, you will see a lot of noisy structures in your clustering result.
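To make the low-count filtering idea concrete, here is a minimal pure-Python sketch. It is not SnapATAC's actual implementation or API: the function name `bins_to_keep`, the `min_cells` cutoff, and the toy matrix are all hypothetical and only illustrate dropping bins that are detected in too few cells.

```python
def bins_to_keep(counts, min_cells=2):
    """Return indices of bins detected (count > 0) in at least `min_cells` cells.

    `counts` is a cells x bins matrix given as a list of rows.
    `min_cells` is an illustrative cutoff, not a recommended default.
    """
    n_bins = len(counts[0])
    # For each bin (column), count how many cells have a non-zero value.
    cells_per_bin = [sum(1 for row in counts if row[j] > 0) for j in range(n_bins)]
    return [j for j, n in enumerate(cells_per_bin) if n >= min_cells]


# Toy matrix: 3 cells x 4 bins (made-up numbers for illustration only).
counts = [
    [1, 0, 2, 0],
    [1, 0, 0, 0],
    [0, 0, 1, 1],
]

keep = bins_to_keep(counts, min_cells=2)
# Keep only the selected bin columns in every cell.
filtered = [[row[j] for j in keep] for row in counts]
```

On real data the cutoff would need to be chosen by looking at the coverage distribution of the bins; the value used here is arbitrary.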
-
Hi Kai and all,
Feature selection seems to be quite important for downstream dimension reduction and other steps. If I understood correctly the hints you raised in the discussion thread "#116", both top-ranked variable features (which need tuning) and the iterative selection method can benefit the subsequent dimension reduction step. So my question is: why not just use all genomic bins (after filtering out blacklist regions), as SnapATAC version 1 does, if CPU and memory are not a problem?
Best,
Meijiao