Bayesian sequence learning for predicting protein cleavage points
Mayo, M. (2005). Bayesian sequence learning for predicting protein cleavage points. In T.B. Ho, D. Cheung, & H. Liu (Eds), Advances in Knowledge Discovery and Data Mining, 9th Pacific-Asia Conference, PAKDD 2005, Hanoi, Vietnam, May 18-20, 2005, Proceedings (pp. 192-202). Berlin: Springer.
Permanent Research Commons link: http://hdl.handle.net/10289/1390
A challenging problem in data mining is the application of efficient techniques to automatically annotate the vast databases of biological sequence data. This paper describes one such application in this area, to the prediction of the position of signal peptide cleavage points along protein sequences. It is shown that the method, based on Bayesian statistics, is comparable in terms of accuracy to the existing state-of-the-art neural network techniques while providing explanatory information for its predictions.