Socio economic characterization of student's data using ICA and cluster analysis

Authors: Ghose, U. (Univ. Sch. of Inf. Technol., GGSIP Univ., Delhi, India); Rai, C.S.; Singh, Y.

Data mining is an automated process of discovering knowledge from databases. Different data mining methods search for different kinds of knowledge. Data mining systems induce knowledge from data sets that are huge, noisy (incorrect), incomplete, inconsistent, imprecise (fuzzy), and uncertain. The problem is that existing systems use a limiting attribute-value language to represent the training examples and the induced knowledge. Furthermore, some important patterns are ignored because they are statistically insignificant. Independent component analysis (ICA) can be used as a tool for extracting features from large data sets. Optimizing objective functions such as mutual information, joint entropy, negentropy, and kurtosis leads to iterative algorithms for ICA. This paper takes a new approach to identifying data attributes and to the socio-economic characterization of data, based on ICA and cluster analysis. The approach is based entirely on measured entropies of the system and on minimization of mutual information. The technique has been applied to efficiently extract the independent components, or data attributes, from a large data set. The sample data set is obtained from scanned OMR application forms of candidates applying for various courses at an Indian university that provides educational services to various sections of society. Cluster analysis yields useful results that can be used for the socio-economic characterization of candidates applying for engineering courses at the university. Simulations with this data are presented to demonstrate the approach.
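
As a rough illustration of the pipeline the abstract describes (extract independent components, then cluster them), the following Python sketch uses NumPy only. It is not the authors' implementation: the abstract describes an approach based on measured entropies and minimization of mutual information, whereas this sketch substitutes a FastICA-style fixed-point update with a tanh nonlinearity, followed by plain k-means. The data is synthetic (one two-group source and one uniform source, linearly mixed), and every function name, the mixing matrix `A`, and all parameter values are assumptions made for the example.

```python
import numpy as np

def whiten(X):
    """Center X (n_samples, n_features) and rescale it to identity covariance."""
    Xc = X - X.mean(axis=0)
    eigvals, eigvecs = np.linalg.eigh(np.cov(Xc, rowvar=False))
    return (Xc @ eigvecs) / np.sqrt(eigvals)

def sym_decorrelate(W):
    """Return (W W^T)^(-1/2) W, so the rows of the result are orthonormal."""
    s, u = np.linalg.eigh(W @ W.T)
    return (u / np.sqrt(s)) @ u.T @ W

def fast_ica(Z, n_iter=200, tol=1e-8, seed=0):
    """Symmetric FastICA-style iteration (tanh nonlinearity) on whitened Z."""
    rng = np.random.default_rng(seed)
    n, d = Z.shape
    W = sym_decorrelate(rng.standard_normal((d, d)))
    for _ in range(n_iter):
        Y = Z @ W.T                       # current component estimates
        G = np.tanh(Y)                    # nonlinearity g
        G_prime = 1.0 - G ** 2            # its derivative g'
        # Row-wise update: w <- E[z g(w.z)] - E[g'(w.z)] w
        W_new = sym_decorrelate((G.T @ Z) / n - G_prime.mean(axis=0)[:, None] * W)
        # Converged when every row is parallel (up to sign) to its predecessor.
        change = np.max(np.abs(np.abs(np.einsum("ij,ij->i", W_new, W)) - 1.0))
        W = W_new
        if change < tol:
            break
    return Z @ W.T

def k_means(X, k, n_restarts=5, n_iter=100, seed=0):
    """Lloyd's algorithm with random restarts; keeps the lowest-inertia run."""
    rng = np.random.default_rng(seed)
    best = None
    for _ in range(n_restarts):
        centers = X[rng.choice(len(X), size=k, replace=False)]
        for _ in range(n_iter):
            dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
            labels = dists.argmin(axis=1)
            new_centers = np.array([
                X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
                for j in range(k)
            ])
            if np.allclose(new_centers, centers):
                break
            centers = new_centers
        # Recompute assignments and inertia for the final centers of this run.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        inertia = float((dists[np.arange(len(X)), labels] ** 2).sum())
        if best is None or inertia < best[0]:
            best = (inertia, labels, centers)
    return best[1], best[2]

# Synthetic stand-in for survey-style data (NOT the paper's data set).
rng = np.random.default_rng(42)
n = 2000
group = rng.integers(0, 2, size=n)                       # hidden two-group attribute
s1 = (2.0 * group - 1.0) * 2.0 + 0.3 * rng.standard_normal(n)
s2 = rng.uniform(-1.0, 1.0, size=n)                      # unrelated independent attribute
S = np.column_stack([s1, s2])
A = np.array([[1.0, 0.5], [0.3, 1.0]])                   # assumed mixing matrix
X = S @ A.T                                              # observed, mixed attributes

Z = whiten(X)
Y = fast_ica(Z)                                          # estimated independent components
labels, centers = k_means(Y, k=2)                        # cluster in component space
```

With real form data, the columns of `X` would be numerically encoded attributes; the abstract does not say how the authors encoded them, so that step is left out here. ICA recovers components only up to order, sign, and scale, so any comparison against known sources should allow for permutation and sign flips.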

Published in:
Industrial Informatics (INDIN), 2010 8th IEEE International Conference on

Date of Conference: 13-16 July 2010
