Resources Contact Us Home
Modified Levenshtein distance algorithm for coding

Image Number 8 for United States Patent #7664343.

Methods and systems of mapping of an optical character recognition (OCR) text string to a code included in a coding dictionary by supplementing the Levenshtein Distance Algorithm (LDA) with additional information in the form of adjustments based on particular character substitutions, insertions and deletions together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string, comparing (120) it with selected text strings from a coding dictionary, computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties, selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances, determining (190) whether a maximum threshold distance is met, and assigning (200) a code associated with the best matching text string to the OCR text string when met, and assigning (210) a null or no code when not met.

  Recently Added Patents
Fluorescent proteins
Method for producing an adhesive fastening element made of plastic
Error correct coding device, error correct coding method, and error correct coding program
Antenna arrangement and antenna housing
Nonvolatile semiconductor memory device and method of manufacturing the same
Level-shift circuit, electro-optical device, and level shift method
Shape memory polymers formed by self-crosslinking of copolymers
  Randomly Featured Patents
Method and apparatus for generating design information, and computer product
Sensitive silicon pin diode fast neutron dosimeter
Channel quality measurement in relay systems
Multiple functionality associated with a computer ON/OFF pushbutton switch
Non-relaxed embedded stressors with solid source extension regions in CMOS devices
Light guide
Apparatus for recording and reproducing digital signals using a helical scan
Utility lighter
Conveyor skirtboard apron
Variable group associativity branch target address cache delivering multiple target addresses per cache line