It takes a 5-column file as input and:
- Writes the penta-nucleotide context and the transcribed/untranscribed strand status of each mutation
- Generates bar plots for the same
- Runs Fisher's exact test to check for transcriptional strand bias (TSB)
row_id  sample      T_MM1  U_MM1     T     U   pvalue        q
1       MMRC006BBM    172    122  3011  2907  1.2e-02  0.08866
7       MMRC023ZBM    250    172  1826  1576  3.4e-02  0.16608
19      PD26403d      190    126  1690  1608  2.6e-03  0.02920
33      PD26411a      219    134  1482  1365  4.4e-04  0.00789
34      PD26411c      189    104  1224  1099  1.4e-04  0.00315
35      PD26411d       90     47   640   603  1.6e-03  0.01997
36      PD26412a      175    123  1475  1354  3.3e-02  0.16608
39      PD26414a      690    498  4901  4586  3.1e-05  0.00093
40      PD26414b      253    143  1489  1375  7.9e-06  0.00070
43      PD26414g      220    147  1376  1287  3.1e-03  0.03062
57      PD26422d      115     74   721   688  1.3e-02  0.08941
58      PD26422e      112     76   736   705  3.0e-02  0.16608
59      PD26422f      119     74   737   697  8.9e-03  0.07206
60      PD26423e      240    120  1427  1178  2.0e-05  0.00090
62      PD26423h      153     93  1162   961  3.0e-02  0.16608
63      PD26424a       99     65  1063  1113  4.6e-03  0.04070
64      PD26424c       84     61  1032  1065  4.8e-02  0.22505
82      V0D57C        146     80   998   880  1.1e-03  0.01666
85      V0D583         67     41   763   737  2.8e-02  0.16608
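
The Fisher's test step can be sketched roughly as follows. This is a minimal, standard-library-only illustration, not the tool's actual code. It assumes each row is tested as the 2x2 table [[T_MM1, U_MM1], [T, U]], and that the q column is a Benjamini-Hochberg adjustment of the p-values; neither is stated above. The function names (fisher_exact_two_sided, benjamini_hochberg) are hypothetical.

```python
from math import exp, lgamma


def _log_choose(n, k):
    """Natural log of the binomial coefficient C(n, k)."""
    return lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more likely than the observed one.
    """
    row1, row2, col1 = a + b, c + d, a + c
    log_denom = _log_choose(row1 + row2, col1)

    def log_prob(x):
        # Log-probability of seeing x in the top-left cell, margins fixed.
        return _log_choose(row1, x) + _log_choose(row2, col1 - x) - log_denom

    log_obs = log_prob(a)
    p = 0.0
    for x in range(max(0, col1 - row2), min(row1, col1) + 1):
        lp = log_prob(x)
        if lp <= log_obs + 1e-7:  # small tolerance for floating-point ties
            p += exp(lp)
    return min(1.0, p)


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    q = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        q[i] = running_min
    return q


# Three rows from the table above: (sample, T_MM1, U_MM1, T, U).
# ASSUMPTION: the 2x2 table is [[T_MM1, U_MM1], [T, U]].
rows = [
    ("MMRC006BBM", 172, 122, 3011, 2907),
    ("PD26414b", 253, 143, 1489, 1375),
    ("PD26424c", 84, 61, 1032, 1065),
]
pvals = [fisher_exact_two_sided(t_mm, u_mm, t, u) for _, t_mm, u_mm, t, u in rows]
for (sample, *_), p, q in zip(rows, pvals, benjamini_hochberg(pvals)):
    print(f"{sample}\t{p:.1e}\t{q:.5f}")
```

The q-values printed by this sketch will not match the q column above: the row_id values suggest the table is a filtered subset, and the adjustment depends on the full set of tested samples.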