Skip to content

quevan/phlame

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

74 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PHLAME: Novelty-aware intraspecies profiling from metagenomes

PHLAME is a complete pipeline for the creation of intraspecies reference databases and the metagenomic detection of intraspecies clades, their relative frequency, and their estimated Divergence from the reference phylogeny.

The accepted raw inputs to PHLAME are:

  • [1] A species-specific assembled reference genome in .fasta format
  • [2] A collection of whole genome sequences of the same species in .fastq or .fasta format
  • [3] Metagenomic sequencing data in either .fastq or aligned .bam format.

Link to preprint is here.

The data used in the manuscript is available here.

Pre-built reference databases are available in classifiers/

Installation

You can install PHLAME using either pip or conda:

pip install phlame
conda install -c bioconda phlame

PHLAME is in active development. Please make sure you have the latest version installed before running.

Dependencies

  • python >=3.8, <3.13
  • numpy - (tested with v1.20.3)
  • matplotlib - (tested with v3.4.2)
  • pandas - (tested with v1.2.5)
  • biopython - (tested with v1.79)
  • scipy - (tested with v1.6.2)
  • statsmodels - (tested with v0.13.1)
  • ete3 - (tested with v3.1.2)
  • samtools (>=v1.15)
  • bcftools (>=v1.2)

Optional

  • RaXML - (tested with v8.2.13)
  • Additionally, starting with raw sequencing read data will require an aligner (like bowtie2).

Overview

PHLAME uses intraspecies reference databases to profile strain-level diversity of individual species from metagenomic data. PHLAME is novelty-aware and will quantify the proportion of novel diversity in a sample that is not be explained by existing reference genomes. This is made possible through PHLAME's Divergence (DVb) metric, which estimates the point on individual branches of a phylogeny for which novel strains in a sample are inferred to diverge from known references.

alt text

Tutorial

This tutorial uses the small set of files found in example/ and is made to be run inside the example/ directory.

PHLAME manual

0. Conceptual introduction to PHLAME

1. Building a database

2. Classifying metagenome samples

3. Intepreting PHLAME results

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors