Hi,
Thanks for developing deltaBS.
I am having a bit of trouble prepping my data and was hoping you could clarify a few things.
The background: I have 8,000 Klebsiella genomes that I want to generate deltaBS (DBS) values for.
- I have been running hmmsearch with the Gammaproteobacteria eggNOG HMM profiles, but this is taking a very long time. Can you recommend a faster way to generate the HMM bit scores?
- Is there any reason I can't use hmmscan instead of hmmsearch to generate the bit scores? I ask only because it would be quicker for me to screen genes against HMMs than HMMs against genes.
- As I'm using Panaroo output as input, as per the 'deltaBS across a population of bacteria' wiki page, would it be better to build my own HMM for each gene? If so, is that what the buildCustomModels.pl script is for?
- Following on from this, could I use my pangenome protein FASTA file for both the --database and --proteome options?
Thanks