Skip to content

estimated pass threshold with default settings very high (v0.5.0 and v.0.6.1) #611

@AmberVerhasselt

Description

@AmberVerhasselt

Hi!

I ran modkit pileup (v.0.5.0) on one of my ONT samples for 5mC and noticed that the pass threshold with the default settings was put very high. A lot of methylated calls were filtered out giving very low percentages. This is what the logfile shows:
estimated threshold too high 1, using 0.9941406

I also tried runnning it with the newer version (v0.6.1), but got the same issue. This is what the log file shows:
estimated threshold too high 1, using 0.9746094 (maximum explicit canonical probability)

I also ran the sample probs to see what the thresholds and distribution looks like. The threshold was the same as for the pileup for both versions, put at 1:
base percentile threshold
C 10 1
C 50 1
C 90 1

I understood from the documentation that with the default settings, the lowest 10% of probabilities are filtered out. But when i look at the probability distributions of this sample it does not seem to put the pass threshold at the 10th lowest percentile, but a lot higher (see attached picture)? This leads to the majority of data not passing the threshold and being filtered out.

For reference, all my ONT samples get the exact same threshold put. I've played around with the filter settings myself, but there is no best practice for what threshold to put it at (yet) so this is not the best option for me at this moment.

Could you tell me why this happens and how I could fix it?
Image

Metadata

Metadata

Assignees

No one assigned

    Labels

    troubleshootingworkflow and data preparation questions

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions