Image Number 7 for United States Patent #7415171.
Systems, methods, and software consistent with the present invention use combinations of two or more letters (xgraphs) or characters, such as three letter combinations (trigraphs), to aid in deciphering ambiguous characters in OCR systems. Embodiments consistent with the invention compare possible xgraphs constructed by substitution for the ambiguous OCR-read character(s) to a reference set of xgraphs. The reference set of xgraphs contains predetermined valid xgraphs, which may be ranked by preference. If a possible xgraph matches a reference xgraph, then the ambiguous character(s) are resolved in favor of the characters in the reference xgraph. Embodiments may also use the context of the ambiguous character string to aid in resolving the ambiguity where more than one possible solution exists.