SLIDER: Super-fast predictor of proteins with Long Intrinsically DisordERed regions

SLIDER webserver

SLIDER predicts whether a given protein sequence has long disordered segment(s), i.e., segment(s) with at least 30 consecutive disordered residues. The predictions is performed in high-throughput manner utilizing logistic regression model that takes amino acid composition, sequence complexity and selected physicochemical properties of amino acids as inputs.

Standalone version

Standalone version of the SLIDER method can be downloaded at SLIDER.tar.gz

References

Upon the usage the users are requested to use the following citation:

Peng Z, Mizianty MJ, Kurgan LA, 2014. Genome-scale prediction of proteins with long intrinsically disordered regions. Proteins: Structure, Function, and Bioinformatics, 82(1): 145-158.

Materials

TRAINING dataset - Dataset used to develop classifier including feature selection and parameterization using 5-fold cross validation.
TEST dataset - Dataset used to perform out-of-sample evaluation of SLIDER.

Line 1: >protein ID
Line 2: protein sequence (one letter amino acid code)
Line 3: disorder annotations, where 0/1/X represents the ordered/disordered/unannotated residue

Help

SLIDER accepts either single or multiple protein sequences and the input is limited to 75 000 protein sequences at the time. The user should submit the protein sequence(s) in FASTA format.

example for 498 proteins from Hemiselmis proteome

Line 1: >protein name (The server will trim protein names to first 12 characters)
Line 2: protein sequence (one letter amino acid code only)

Acknowledgments

We acknowledge, with thanks, that the following software was used as a part of this server:

SEG - Application for the prediction of low complexity regions (LCRs)
AAIndex - Representing the physicochemical properties of amino acids