Resources Contact Us Home
Modified Levenshtein distance algorithm for coding

Image Number 8 for United States Patent #7664343.

Methods and systems of mapping of an optical character recognition (OCR) text string to a code included in a coding dictionary by supplementing the Levenshtein Distance Algorithm (LDA) with additional information in the form of adjustments based on particular character substitutions, insertions and deletions together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string, comparing (120) it with selected text strings from a coding dictionary, computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties, selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances, determining (190) whether a maximum threshold distance is met, and assigning (200) a code associated with the best matching text string to the OCR text string when met, and assigning (210) a null or no code when not met.

  Recently Added Patents
Method for parking or exiting a parking bay and for avoiding a collision of a vehicle, and corresponding assistance systems and vehicle
Projection illumination system for EUV microlithography
Power management method for reducing power of host when turning off main monitor and computer system applying the same
Video reproducing apparatus and video reproducing method
Method and apparatus for cutting high quality internal features and contours
Implantable medical devices including elongated conductor bodies that facilitate device and lead configuration variants
Motion estimation for a video transcoder
  Randomly Featured Patents
System and method for generating self-synchronized launch of last shift capture pulses using on-chip phase locked loop for at-speed scan testing
Power-over-ethernet isolation loss detector
Device and method for ascertaining the temperature in an electrical battery
Membranes and electrochemical cells incorporating such membranes
Low-precious metal/high-rare earth oxide catalysts
Process automation system
Piping joint structure
Dual damascene process
Semiconductor integrated circuit