A robust document processing system combining image segmentation with content-based document compression | IEEE Conference Publication | IEEE Xplore