ProSECFPs

ProSECFPs is a tool for generating representations of protein sequences. This repository provides a Python script for computing ProSECFP descriptors using a predefined environment with all required dependencies.

Installation

To set up the required environment, use the provided YAML file:

conda env create -f prosecfps_env.yml

Then, activate the environment:

conda activate prosecfps_env

Usage

Run the descriptor script with Python, passing the input CSV with -in and the output CSV with -out:

python ProSECFPs.py -in input.csv -out output.csv -nj 1
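If the script needs to be called from other Python code, the command above can be wrapped as in the sketch below. The helper names build_command and run_prosecfps are hypothetical and not part of this repository; the sketch only reuses the -in, -out and -nj flags shown above and assumes ProSECFPs.py is in the current working directory.

```python
import subprocess
import sys


def build_command(input_csv, output_csv, nj=1, script="ProSECFPs.py"):
    """Build the argument list for the command shown in the Usage section.

    Hypothetical helper: only the flags -in, -out and -nj come from the README.
    """
    return [
        sys.executable, script,
        "-in", str(input_csv),
        "-out", str(output_csv),
        "-nj", str(nj),
    ]


def run_prosecfps(input_csv, output_csv, nj=1):
    """Run the script and raise CalledProcessError if it exits with an error."""
    subprocess.run(build_command(input_csv, output_csv, nj), check=True)


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch is safe to try.
    print(" ".join(build_command("input.csv", "output.csv")))
```

Call run_prosecfps("input.csv", "output.csv") from inside the activated prosecfps_env environment to actually run the script.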

Input

The input file must be a CSV with a column named Sequences that holds the protein sequences. A sample CSV file (input.csv) is included in the repository for testing, along with the amino acid descriptor dataset (descriptors.dump) required for computing the ProSECFP representations.
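As an illustration of the expected input layout, the sketch below writes a CSV with a Sequences column and reads it back. It uses only the Python standard library; the two sequences, the file name my_input.csv and both helper functions are made up for the example and are not part of this repository.

```python
import csv
import os
import tempfile


def write_input_csv(path, sequences):
    """Write sequences to a CSV with a single 'Sequences' column."""
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["Sequences"])  # column name required by the README
        for seq in sequences:
            writer.writerow([seq])


def read_sequences(path):
    """Read the 'Sequences' column back, failing early if it is missing."""
    with open(path, newline="") as handle:
        reader = csv.DictReader(handle)
        if "Sequences" not in (reader.fieldnames or []):
            raise ValueError("input CSV has no 'Sequences' column")
        return [row["Sequences"] for row in reader]


# Made-up example sequences, written to a temporary directory.
example_sequences = ["MKTAYIAKQR", "GSHMLEDPVD"]
example_path = os.path.join(tempfile.mkdtemp(), "my_input.csv")
write_input_csv(example_path, example_sequences)
loaded = read_sequences(example_path)
```

Checking for the Sequences column before running the script gives a clearer error than a failure partway through the descriptor computation.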

Output

The script generates a CSV file containing the ProSECFP descriptors for each protein sequence. The descriptors are computed using the C-PSECFP variant, with an iteration radius of 12 and a representation vector length of 1024.
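To check an output file against the stated vector length, its shape can be inspected with a short sketch like the one below. The column layout of the output CSV is not documented here, so the sketch assumes only that the first row is a header; csv_shape is a hypothetical helper and the demo file is made up.

```python
import csv
import os
import tempfile


def csv_shape(path):
    """Return (number of data rows, number of columns), assuming a header row."""
    with open(path, newline="") as handle:
        rows = list(csv.reader(handle))
    if not rows:
        return 0, 0
    header, data = rows[0], rows[1:]
    return len(data), len(header)


# Made-up stand-in for an output file, only to show the helper in use.
demo_path = os.path.join(tempfile.mkdtemp(), "demo_output.csv")
with open(demo_path, "w", newline="") as handle:
    writer = csv.writer(handle)
    writer.writerow(["c0", "c1", "c2"])
    writer.writerow([0, 1, 0])
    writer.writerow([1, 0, 0])

n_rows, n_cols = csv_shape(demo_path)
```

On a real output.csv, the number of data rows should match the number of input sequences, and the column count can be compared with the vector length of 1024, allowing for any extra columns the script may write.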

Dependencies

All necessary dependencies are included in prosecfps_env.yml.
