Title: Protein secondary structure prediction using support vector machines and a codon encoding scheme
Abstract: In this study, we evaluate the performance of a protein secondary structure prediction model using a new amino acid "codon" encoding inspired by genetic codon mappings. The dimensionality of the binary codon encoding is less than that of an orthogonal encoding which requires less computations. Protein secondary structure prediction is an important step for machine learning techniques ultimately applied for protein 3D structure prediction. In the proposed model, one-stage binary support vector machines are employed, and the efficiency of the codon encoding to that of a commonly used orthogonal encoding are compared without incorporating protein evolutionary and structural information for an unbiased comparison. The performance of the classification model is measured according to Q <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">3</sub> and segment overlap (SOV) scores. The scores are compared with those of the prediction methods using an orthogonal encoding and protein sequence profiles. The experimental results indicate higher prediction accuracy based on Q <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">3</sub> SOV scores when sequence profiles are not used. Also, the relative classification scores of the proposed method are comparable with the methods incorporating protein global and evolutionary information. The experimental result implies the encoding scheme is able to integrate the evolutionary information into the prediction model since the encoding is based on genetic codon mappings which are the building blocks of amino acid formations at the primary level of biological processes. The codon encoding is worthwhile to be investigated using more complex learning architectures with the profiles and structural properties of proteins.
Publication Year: 2012
Publication Date: 2012-10-01
Language: en
Type: article
Indexed In: ['crossref']
Access and Citation
Cited By Count: 6
AI Researcher Chatbot
Get quick answers to your questions about the article from our AI researcher chatbot