PCA feature extraction for protein structure prediction

3 Author(s)
J. C. B. Melo ; Phys. & Math. Dept., Fed. Rural Univ. of Pernambuco, Recife, Brazil ; G. D. C. Cavalcanti ; K. S. Guimaraes

The PCA linear transformation method is used for feature extraction in the secondary structure prediction problem. This dimensionality reduction method is applied to PSI-BLAST profiles built on NCBI's Nonredundant Protein database. Different numbers of extracted components are used as input to three artificial neural networks with 30, 35, or 40 nodes in the hidden layer. These classifiers are trained with the RPROP algorithm. To estimate the accuracy of the predictor, sevenfold cross-validation is applied to CB396, a database used previously to evaluate the performance of several predictors. To increase the efficiency of the predictor presented here, the outputs of the classifiers are combined through five simple rules: product, average, voting, minimum, and maximum. This original application of the PCA method yields relevant results. Even with a drastic reduction from 260 to 80 components, the accuracy obtained is at least 1% higher than the best one published for another predictor, CONSENSUS, a combination of four other predictors. With a reduction from 260 to 180 components the performance is even better, achieving a Q3 accuracy of 74.5%. The results point to PCA as a promising method for feature extraction in the secondary structure prediction problem.
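The two generic steps named in the abstract, PCA reduction of a 260-dimensional input and combination of classifier outputs by the five rules, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the data is synthetic, the function names `pca_fit`, `pca_transform`, and `combine` are hypothetical, and random normalized scores stand in for the outputs of the three trained networks over three classes.

```python
import numpy as np


def pca_fit(X, n_components):
    """Return the feature mean and the top principal directions of X (rows are samples)."""
    mean = X.mean(axis=0)
    # SVD of the centered data: the rows of vt are the principal directions,
    # ordered by decreasing singular value.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_components]


def pca_transform(X, mean, components):
    """Project centered samples onto the retained principal directions."""
    return (X - mean) @ components.T


def combine(probs, rule):
    """Combine outputs of shape (n_classifiers, n_samples, n_classes).

    Returns the predicted class index for each sample.
    """
    if rule == "product":
        scores = probs.prod(axis=0)
    elif rule == "average":
        scores = probs.mean(axis=0)
    elif rule == "minimum":
        scores = probs.min(axis=0)
    elif rule == "maximum":
        scores = probs.max(axis=0)
    elif rule == "voting":
        # Each classifier votes for its top class; count the votes per class.
        votes = probs.argmax(axis=2)
        n_classes = probs.shape[2]
        scores = np.stack(
            [(votes == c).sum(axis=0) for c in range(n_classes)], axis=1
        )
    else:
        raise ValueError(f"unknown rule: {rule}")
    # Ties are resolved in favor of the lowest class index (an assumption here).
    return scores.argmax(axis=1)


RULES = ("product", "average", "voting", "minimum", "maximum")

rng = np.random.default_rng(0)

# Synthetic stand-in for the profile-derived inputs: 500 samples, 260 features.
X = rng.normal(size=(500, 260))
mean, components = pca_fit(X, n_components=80)
X_reduced = pca_transform(X, mean, components)

# Synthetic stand-in for three classifiers scoring three classes per sample.
raw = rng.random(size=(3, 500, 3))
probs = raw / raw.sum(axis=2, keepdims=True)
predictions = {rule: combine(probs, rule) for rule in RULES}
```

In a real pipeline the mean and components would be fitted on the training folds only and then applied to the held-out fold, so that cross-validation estimates are not contaminated by test data.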

Published in:

Neural Networks, 2003. Proceedings of the International Joint Conference on (Volume: 4)

Date of Conference:

20-24 July 2003