Home  |   Login  |   Logout  |   Access Information  |   Alerts  |   Purchase History  |   Cart  |   Sitemap  |   Help   
 
Login
BROWSE SEARCH IEEE XPLORE GUIDE SUPPORT
Article Information

Discrete Methods for Association Search and Status Prediction in Genotype Case-Control Studies
Brinza, D.; Zelikovsky, A.
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Volume , Issue , 14-17 Oct. 2007 Page(s):270 - 277
Digital Object Identifier   10.1109/BIBE.2007.4375576
Summary:Recent improvements in high-throughput genotyping technology make possible genome-wide association studies and status prediction (classification) for common complex diseases. This paper addresses three challenges commonly facing such studies: (i) searching an enormous amount of possible gene interactions, (ii) validating reproducibility of associations and (iii) reliably predicting disease status. These challenges have been traditionally addressed in statistics while here we apply computational approaches -optimization and cross-validation. A complex risk factor is modeled as a subset of SNP's with specified alleles and the optimization formulation asks for the one with the maximum odds ratio. When searching for disease associated risk factor, we show that greedy heuristics are much faster and lead to significantly better solutions than exhaustive heuristics in a reasonable amount of time. We propose a novel randomized complimentary greedy search method that is advantageous to the previously best search method. To measure and compare ability of search methods to find reproducible risk factors, we propose to apply a cross-validation scheme usually used for prediction validation. The proposed heuristic association search methods promise better reproducibility than exhaustive searches. We then show that k-fold cross-validation is more reliable than leave-one-out cross-validation for disease status prediction methods since it captures overtraining effect. We have applied known search methods with proposed enhancements as well as status prediction methods (based on these search methods) to real case-control studies for several diseases (Chron's disease, autoimmune disorder, tick-born encephalitis, lung cancer, and rheumatoid arthritis). 2-and 3-fold cross-validations show that the new methods find strongly associated risk factors and reliably predict disease status for considered case-control studies.

» View citation and abstract

IEEE Members

Log in by entering your IEEE Web Account Username and Password.

IEEE Communications Society members: If you subscribe to the IEEE Electronic Periodicals Package or IEEE Electronic Periodicals Package Plus, you must access your subscription at www.comsoc.org.

Users at Subscribing Institutions

Check with your librarian, information professional, or system manager to determine if you need to log in. Please complete the online Technical Support Form if you need assistance.

Already Purchased This Article?

Select the Purchase History link to access the document. You will have 5 Days after purchase to access the Full Text PDF. Please complete the online Technical Support Form if you need assistance.

Guests

• Search and access Abstract records free of charge
Register for table of contents alerts
• Purchase Full Text PDF documents

» Learn more about subscription options or how to become an IEEE Member.

You are not logged in.
LOGIN
Username
Password
GO
» Forgot your password?
Please remember to log out when you have finished your session.
You must log in to access:
• Advanced or Author Search
• CrossRef Search
• AbstractPlus Records
• Full Text PDF
• Full Text HTML
Access this document
» Buy this document now
» Learn more about
» Learn more about
   purchasing articles
   and standards
Learn more about IEEE Subscriptions
Indexed by IEE Inspec
© Copyright 2010 IEEE – All Rights Reserved