The benchmark dataset contains 307 phage proteins which are 99 virion proteins and 208 non-virion proteins. The sequence identity in the dataset is less 40%. Download
Independent data can be available at here.
Close