Read me

The C2Pred web server was developed to identify cell-penetrating peptides (CPPs) from sequence information alone. Analysis of variance (ANOVA) was used to select an optimized set of dipeptide composition features. The anticipated overall success rate is 83.6% under 5-fold cross-validation, with 81.5% of CPPs and 85.6% of non-CPPs correctly identified. All data can be downloaded from the Data window of this web server.
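
To make the feature type mentioned above concrete, the following is a minimal sketch of dipeptide composition as it is commonly defined: the frequency of each of the 400 possible ordered amino-acid pairs, counted over overlapping positions and divided by the number of pairs in the sequence. The function name dipeptide_composition and the restriction to the 20 standard residues are assumptions made for illustration; the code is not taken from C2Pred, and the ANOVA-based feature selection step is not shown.

```python
from itertools import product

# Assumption: only the 20 standard amino-acid letters are considered.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered residue pairs, in a fixed order.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(sequence):
    """Return a 400-element list of dipeptide frequencies (hypothetical helper)."""
    seq = sequence.upper()
    total = len(seq) - 1  # number of overlapping dipeptides
    if total < 1:
        raise ValueError("sequence must contain at least two residues")
    counts = dict.fromkeys(DIPEPTIDES, 0)
    for i in range(total):
        pair = seq[i:i + 2]
        if pair not in counts:
            raise ValueError("non-standard residue in dipeptide %r" % pair)
        counts[pair] += 1
    # Normalize counts so that the frequencies sum to 1.
    return [counts[d] / total for d in DIPEPTIDES]
```

For a peptide of length n the vector is built from n - 1 overlapping pairs, so short peptides yield very sparse vectors, which is one reason a feature selection step is typically applied afterwards.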



(1) Each submission is limited to 100 protein sequences or fewer;

(2) The input sequences must be in FASTA format; i.e., each protein sequence must begin with a header line that has a greater-than symbol (">") in the first column. Any text following the ">" symbol on that header line is optional and is used only for identification and description.

(3) If a query sequence contains any illegal character, the prediction will be stopped.
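
The three rules above can be checked locally before submitting. The following is a minimal sketch, not the server's own code. The names parse_fasta and validate_submission are hypothetical, and the set of legal characters is assumed to be the 20 standard amino-acid letters, since the exact set accepted by the server is not stated here.

```python
# Assumption: legal characters are the 20 standard amino-acid letters.
LEGAL_CHARACTERS = set("ACDEFGHIKLMNPQRSTVWY")
MAX_SEQUENCES = 100  # limit from rule (1)


def parse_fasta(text):
    """Split FASTA text into (header, sequence) pairs (hypothetical helper)."""
    records = []
    header = None
    chunks = []
    for raw_line in text.splitlines():
        line = raw_line.rstrip()
        if not line.strip():
            continue  # skip blank lines
        if line.startswith(">"):  # ">" must be in the first column, rule (2)
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:].strip()
            chunks = []
        else:
            if header is None:
                raise ValueError("sequence data found before a '>' header line")
            chunks.append(line.upper())
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def validate_submission(text):
    """Raise ValueError if the text breaks any of the three rules."""
    records = parse_fasta(text)
    if not records:
        raise ValueError("no FASTA records found")
    if len(records) > MAX_SEQUENCES:
        raise ValueError("more than %d sequences submitted" % MAX_SEQUENCES)
    for header, sequence in records:
        if not sequence:
            raise ValueError("record %r has no sequence" % header)
        illegal = sorted(set(sequence) - LEGAL_CHARACTERS)
        if illegal:  # rule (3)
            raise ValueError("record %r contains illegal characters: %s"
                             % (header, "".join(illegal)))
    return records
```

Under these assumptions, a two-record input such as ">pep1" followed by GRKKRRQRRR and ">pep2" followed by RQIKIWFQNRRMKWKK passes all three checks, whereas a sequence containing a digit or the letter X is rejected.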