Resources Contact Us Home
Modified Levenshtein distance algorithm for coding

Image Number 8 for United States Patent #7664343.

Methods and systems of mapping of an optical character recognition (OCR) text string to a code included in a coding dictionary by supplementing the Levenshtein Distance Algorithm (LDA) with additional information in the form of adjustments based on particular character substitutions, insertions and deletions together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string, comparing (120) it with selected text strings from a coding dictionary, computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties, selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances, determining (190) whether a maximum threshold distance is met, and assigning (200) a code associated with the best matching text string to the OCR text string when met, and assigning (210) a null or no code when not met.

  Recently Added Patents
Wearable display device
Real-time RSL monitoring in a web-based application
Network attachment for IMS systems for legacy CS UE with home node B access
Aggregating completion messages in a sideband interface
Encoder that detects positional information of a moving body generating interference fringes that move in opposite directions
Support core for cold shrink tube
  Randomly Featured Patents
Method and apparatus for adapting enforcement of network quality of service policies based on feedback about network conditions
Rule-based stimulation program search
Off feed conveyor for use with woodworking mill machines
Cell-phone holder
Hanging-folder file frame
Radial shaft seal
Method and means for chemically modifying gases or fumes
Cone crushers
Rotational speed measuring system having a circuit for increasing the accuracy thereof
Thermoplastic polycarbonate moulding materials