New data set filters proteins for more accurate predictions in ConforMine.
The ShiftCrypt training data set was created to train the ConforMine model. This data set includes ShiftCrypt values of proteins and a python script for filtering sequences. The goal was to improve the accuracy of ConforMine by discarding proteins with no valid ShiftCrypt predictions.