Resources Contact Us Home
Modified Levenshtein distance algorithm for coding

Image Number 8 for United States Patent #7664343.

Methods and systems of mapping of an optical character recognition (OCR) text string to a code included in a coding dictionary by supplementing the Levenshtein Distance Algorithm (LDA) with additional information in the form of adjustments based on particular character substitutions, insertions and deletions together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string, comparing (120) it with selected text strings from a coding dictionary, computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties, selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances, determining (190) whether a maximum threshold distance is met, and assigning (200) a code associated with the best matching text string to the OCR text string when met, and assigning (210) a null or no code when not met.

  Recently Added Patents
Preamplifier-to-channel communication in a storage device
Electrical installation arrangement
Technique for effectively providing program material in a cable television system
Linerless labels
Rotating device
Washing machine
System and method for operating an electric power converter
  Randomly Featured Patents
Spreading and lap-forming machine
Internal connection system for high power electrochemical cell
Preloading method for preload-adjustable rolling bearing and manufacture of the same
Method of producing succinic acid
Electronic device and camera
Picture processing method and apparatus
Automatic gas shut-off valve
Master cylinder for a vehicle hydraulic braking system
Position detecting method and position detecting device for detecting relative positions of objects having position detecting marks by using separate reference member having alignment marks
Tape recording of video signals