readme

Read me

The web-server ApoliPred was developed to identify the apolipoproteins based on the sequence information. The analysis of variance was used to seek optimized dipeptide composition. The anticipated overall success rates are 98.4% by using five-fold cross-validation. 96.2% apolipoproteins and 99.3% non-immunoglobulins can be correctly identified. All data can be downloaded from the Data window of this web-server.

Caveat

(1) For each submission, the number of protein sequences is limited at 100 or less;

(2) The input sequences must be in FASTA format; i.e., each protein sequence should start with a greater-than symbol (" > ") in the first column. The words right after the " > " symbol in the single initial line are optional and only used for the purpose of identification and description.

(3) If a query sequence contains any illegal character, the prediction will be stopped.