Skip to Main Content
This paper reports a new address interpretation system which allows both non-use of postal codes and omission of address elements, such as the omission of county or state when a city name has been given. While lexicon-driven recognizers show good recognition performance when they are input with adequate word images and a lexicon containing correct word strings, it is difficult to design one which would be of practical use when postal codes are not in use and elements of addresses may have been omitted. That is to say, an inadequate design in this area is likely to result in an impractically high erroneous recognition rate. In response to this problem, we propose here an advanced address interpretation system that utilizes both an improved address interpretation method and improved word recognition methods. The improved address interpretation method has been designed to satisfy as completely as possible the need to accommodate non-use of postal codes and the omission of address elements, while the improved word recognition methods have been designed to achieve low erroneous recognition rates in cases in which that need has not been fully satisfied. When we applied our new system to approximately 2,000 actual address images for which that need would be relevant, we achieved a 52% rate of correct outward sorting with only a 0.7% rate of erroneous outward sorting. These rates are good enough for practical applications.