Rule-based middle-level character detection for simplifying Thai document layout analysis | IEEE Conference Publication | IEEE Xplore