Resources Contact Us Home
Modified Levenshtein distance algorithm for coding

Image Number 8 for United States Patent #7664343.

Methods and systems of mapping of an optical character recognition (OCR) text string to a code included in a coding dictionary by supplementing the Levenshtein Distance Algorithm (LDA) with additional information in the form of adjustments based on particular character substitutions, insertions and deletions together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string, comparing (120) it with selected text strings from a coding dictionary, computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties, selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances, determining (190) whether a maximum threshold distance is met, and assigning (200) a code associated with the best matching text string to the OCR text string when met, and assigning (210) a null or no code when not met.

  Recently Added Patents
Adaptive known signal canceller
Beverage container lid
External preparation composition for skin comprising ginseng flower or ginseng seed extracts
Extreme ultraviolet light source device and method for generating extreme ultraviolet light
Specimen preparation device, and control method in specimen preparation device
Toy vehicle housing
Braided boomerang pet toy
  Randomly Featured Patents
Method and apparatus for installing telephone intercom-voice messaging apparatus at doorbell for dwelling
Process for the synthesis of ribonucleic acid (RNA) using a novel deprotection reagent
Electrical switch
Variable aperture
Vehicle seat
Dispensing lottery tickets
System for providing an enhanced immersive display environment
Optical disk recording apparatus and optical recording method
Device for allocating resources in a radiocommunication network
Method for the treatment of waste sludge