Abstract:
In this paper, we consider the problem of communicating data from distributed sensors for the goal of inference. Two inference problems of linear regression and binary li...Show MoreMetadata
Abstract:
In this paper, we consider the problem of communicating data from distributed sensors for the goal of inference. Two inference problems of linear regression and binary linear classification are investigated. Assuming perfect training of the classifier, an approximation of the problem of minimizing classification error-probability under Gaussianity assumptions leads us to recover Fisher score: a metric that is commonly used for feature selection in machine learning. Further, this allows us to soften the notion of feature selection by assigning a degree of relevance to each feature based on the number of bits assigned to it. This relative relevance is used to obtain numerical results on savings on number of bits acquired and communicated for classification of neural data obtained from Electrocorticography (ECoG) experiments. The results demonstrate that significant savings on costs of communication can be achieved by compressing Big Data at the source.
Published in: 2014 52nd Annual Allerton Conference on Communication, Control, and Computing (Allerton)
Date of Conference: 30 September 2014 - 03 October 2014
Date Added to IEEE Xplore: 02 February 2015
Electronic ISBN:978-1-4799-8009-3