By Topic

Multicategory Prediction of Multifactorial Diseases Through Risk Factor Fusion and Rank-Sum Selection

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

4 Author(s)
Phegley, J.W. ; Acad. Affairs & Res., Southern Illinois Univ., Carbondale, IL, USA ; Perkins, K. ; Gupta, L. ; Hughes, L.F.

A generalized strategy is developed to predict the occurrence of a multicategory–multifactorial disease from a set of medical risk factors that are most often used to screen patients for the disease. The prediction problem is formulated as an M -class classification problem. The strategy employs fusion to combine risk factors into a single feature vector, normalization to fuse risk factors which have different formats and ranges, rank-sum ordering for feature selection, discrete Karhunen–Loeve transform-based transformation to facilitate parametric classifier development, and the design of parametric classifiers. Two methods, which differ on how the features are selected, are developed. In the first method, features are selected from a set consisting of linear combinations of all risk factors. In the second method, the features are linear combinations of a preselected subset of the risk factors. The methods are applied to predict the occurrence of Alzheimer's disease (AD) into three classes: Probable-AD, Possible-AD, and Uncertain. It is shown that a classification accuracy of over 71% can be obtained. This result is quite encouraging given that AD is very difficult to clinically diagnose. Higher classification accuracies can be expected for diseases that are not as complex to diagnose as AD. Most importantly, it is concluded that the generalized strategy can not only be applied to the multicategory–multifactorial disease prediction problem but also to other multiclass pattern recognition problems involving diverse information collected from different sources.

Published in:

Systems, Man and Cybernetics, Part A: Systems and Humans, IEEE Transactions on  (Volume:35 ,  Issue: 5 )