Syn_Tool is a deep learning framework designed for analyzing Synechococcus ITS sequences. It integrates identification, classification, and novel clade delineation into a single streamlined workflow. With minimal setup, you can process sequence data and abundance information to gain insights into Synechococcus communities.
- Python Version: Python 3.8 or later
- Dependencies: Listed in
requirements.txt
-
Clone the repository:
git clone https://github.com/Aldred-Wang/Syn_Tool.git cd Syn_Tool -
Install dependencies:
pip install -r requirements.txt
Before running the tool, ensure that the following files are placed inside the model folder:
Syn_Tool_run.pyinput_sequences.fastainput_abundance.txt
Run the following command to start the tool:
python Syn_Tool_run.py -fa input_sequences.fasta -a input_abundance.txtParameters
-faor--fasta: Path to the input FASTA file.-aor--abundance: Path to the abundance file.
Here’s an example of how to use Syn_Tool:
python Syn_Tool_run.py -fa example_sequences.fasta -a example_abundance.txtAfter execution, the following files will be generated:
Syn.fasta: Output file from the identification module containing Synechococcus ITS sequences in FASTA format.Syn_df.csv: Output file from the identification module containing Synechococcus ITS sequences in CSV format.combined_df.csv: Output file from the classification module containing feature vectors of Synechococcus ITS sequences extracted by the CNN model.result.csv: Output file from the novel clade delineation module providing information about Synechococcus novel clades.Syn_Tool_final_result.txt: Final classification file generated by Syn_Tool.
If you use Syn_Tool in your research, please cite:
For questions or issues, please contact: