The benchmark dataset contains 122 enzymes: 54 acidic and 68 alkaline. Sequence identity within the dataset is less than 25%.

The independent dataset contains 20 acidic enzymes and 20 alkaline enzymes. Sequence identity between the training set and the independent set is less than 40%.