Resources Contact Us Home
Modified Levenshtein distance algorithm for coding

Image Number 8 for United States Patent #7664343.

Methods and systems of mapping of an optical character recognition (OCR) text string to a code included in a coding dictionary by supplementing the Levenshtein Distance Algorithm (LDA) with additional information in the form of adjustments based on particular character substitutions, insertions and deletions together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string, comparing (120) it with selected text strings from a coding dictionary, computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties, selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances, determining (190) whether a maximum threshold distance is met, and assigning (200) a code associated with the best matching text string to the OCR text string when met, and assigning (210) a null or no code when not met.

  Recently Added Patents
Disk-based storage device having read channel memory that is selectively accessible to disk controller
Equipment to facilitate money transfers into bank accounts
Methods and systems for improved engine speed control during engine starting
Method of manufacturing semiconductor device
System and method for providing definitions
Image forming apparatus capable of timely starting different image formation mode
  Randomly Featured Patents
Modular outward opening solenoid direct fuel injector
High cis diene/phenylbutadiene copolymers prepared using a Ziegler/Natta neodymium catalyst
Method and apparatus for necking can bodies
Hand truck stair crawler assembly
Method of controlling the reception of data
Highly flexible starch-based films
Wild rocket cultivar 40-0801188-B
Automotive antitheft system
Laser irradiation apparatus and method for manufacturing semiconductor device using the laser irradiation apparatus
Flush toilet