By Topic

A data mining approach to predict protein secondary structure

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Bingru Yang ; School of Information Engineering University of Science and Technology Beijing, China ; Haifeng Sui ; QuWu ; Lijun Wang

In bioinformatics,proteins are coded by strings, called “primary structures”. Biologists have long enough gathered these primary structures in large databases. Numerous experiments and analyses of primary structures have revealed that the protein primary structure closely correlates with the protein second structure. In this paper, we present a data mining approach based on machine learning techniques to predict protein second structure. Based on majority voting mechanism, the approach combine the predictions of homology analysis classifier, Support vector machine(SVM) classifier and modified Knowledge Discovery in Databases (KDD*) process. They are validated with 2 different datasets. Their predictive accuracy results outperform the best secondary structure predictors by 2.00% on average.

Published in:

2010 International Conference on Computer Application and System Modeling (ICCASM 2010)  (Volume:6 )

Date of Conference:

22-24 Oct. 2010