By Topic

Feature Selection Approach for Quantitative Prediction of Transcriptional Activities

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Anand, A. ; Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore ; Fogei, G. ; E Ke Tang ; Suganthan, P.N.

Protein-DNA interactions play a crucial role in transcriptional regulation and other biological processes. Quantitative predictive models of protein-DNA binding affinities can increase our understanding of molecular interaction and help validate putative transcription factor binding sites or other regulatory features. Such predictive models must take into account context-specific features associated with both DNA and proteins. Given the large complexity associated with such features, here we consider only the contextual features of DNA associated with binding affinity. Two types of features are considered in this paper: 1) features accounting for conformational and physico-chemical properties of nucleotide sequence and 2) another set of features accounting for conservation of evolutionary information in the form of position-specific weight matrices. A feature selection approach, named, leave-one-out sequential forward selection (LOOSFS), is presented. The feature selection method employs leave-one-out cross-validation error of the least square support vector machines (LS-SVM) to estimate the test error of quantitative prediction model. The method is used to identify important features possibly responsible for differences in transcriptional activities of 130 DNA sequences. These sequences were obtained by single base substitutions within promoter of the mouse beta-major globin gene. The selected features and predicted activity values correlate well with experimental results

Published in:

Computational Intelligence and Bioinformatics and Computational Biology, 2006. CIBCB '06. 2006 IEEE Symposium on

Date of Conference:

28-29 Sept. 2006