Resources Contact Us Home
Modified Levenshtein distance algorithm for coding

Image Number 8 for United States Patent #7664343.

Methods and systems of mapping of an optical character recognition (OCR) text string to a code included in a coding dictionary by supplementing the Levenshtein Distance Algorithm (LDA) with additional information in the form of adjustments based on particular character substitutions, insertions and deletions together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string, comparing (120) it with selected text strings from a coding dictionary, computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties, selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances, determining (190) whether a maximum threshold distance is met, and assigning (200) a code associated with the best matching text string to the OCR text string when met, and assigning (210) a null or no code when not met.

  Recently Added Patents
Nanowire structured photodiode with a surrounding epitaxially grown P or N layer
Apparatus and method of managing radio bearer in wireless communication system
Apparatus and method for adapted deblocking filtering strength
Methods, devices and software applications for facilitating a development of a computer program
Load balancing for parallel tasks
Semiconductor device and method of manufacturing the same
Advanced joint detection in a TD-SCDMA system
  Randomly Featured Patents
Toilet seat handle
Telemetry sensing system for infant care apparatus
Vehicle security alarm
Christmas wreath
Image processing apparatus, image processing method and portable imaging apparatus
Vehicle cargo space liner
Cleaning of a body of liquid
Method and device for connecting two components and an assembly of the components
Cold start strategy for direct injected engines