Automatic metadata extraction and classification of spreadsheet documents based on layout similarity | IEEE Conference Publication | IEEE Xplore