Research and implementation of big data preprocessing system based on Hadoop | IEEE Conference Publication | IEEE Xplore