Hi!
I ran modkit pileup (v.0.5.0) on one of my ONT samples for 5mC and noticed that the pass threshold with the default settings was put very high. A lot of methylated calls were filtered out giving very low percentages. This is what the logfile shows:
estimated threshold too high 1, using 0.9941406
I also tried runnning it with the newer version (v0.6.1), but got the same issue. This is what the log file shows:
estimated threshold too high 1, using 0.9746094 (maximum explicit canonical probability)
I also ran the sample probs to see what the thresholds and distribution looks like. The threshold was the same as for the pileup for both versions, put at 1:
base percentile threshold
C 10 1
C 50 1
C 90 1
I understood from the documentation that with the default settings, the lowest 10% of probabilities are filtered out. But when i look at the probability distributions of this sample it does not seem to put the pass threshold at the 10th lowest percentile, but a lot higher (see attached picture)? This leads to the majority of data not passing the threshold and being filtered out.
For reference, all my ONT samples get the exact same threshold put. I've played around with the filter settings myself, but there is no best practice for what threshold to put it at (yet) so this is not the best option for me at this moment.
Could you tell me why this happens and how I could fix it?

Hi!
I ran modkit pileup (v.0.5.0) on one of my ONT samples for 5mC and noticed that the pass threshold with the default settings was put very high. A lot of methylated calls were filtered out giving very low percentages. This is what the logfile shows:
estimated threshold too high 1, using 0.9941406I also tried runnning it with the newer version (v0.6.1), but got the same issue. This is what the log file shows:
estimated threshold too high 1, using 0.9746094 (maximum explicit canonical probability)I also ran the sample probs to see what the thresholds and distribution looks like. The threshold was the same as for the pileup for both versions, put at 1:
base percentile threshold
C 10 1
C 50 1
C 90 1
I understood from the documentation that with the default settings, the lowest 10% of probabilities are filtered out. But when i look at the probability distributions of this sample it does not seem to put the pass threshold at the 10th lowest percentile, but a lot higher (see attached picture)? This leads to the majority of data not passing the threshold and being filtered out.
For reference, all my ONT samples get the exact same threshold put. I've played around with the filter settings myself, but there is no best practice for what threshold to put it at (yet) so this is not the best option for me at this moment.
Could you tell me why this happens and how I could fix it?
