Resources Contact Us Home
Modified Levenshtein distance algorithm for coding

Image Number 8 for United States Patent #7664343.

Methods and systems of mapping of an optical character recognition (OCR) text string to a code included in a coding dictionary by supplementing the Levenshtein Distance Algorithm (LDA) with additional information in the form of adjustments based on particular character substitutions, insertions and deletions together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string, comparing (120) it with selected text strings from a coding dictionary, computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties, selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances, determining (190) whether a maximum threshold distance is met, and assigning (200) a code associated with the best matching text string to the OCR text string when met, and assigning (210) a null or no code when not met.

  Recently Added Patents
Multi-radio coexistence
Flash drive
Adaptive known signal canceller
Systems and methods for updating a data store using a transaction store
Methods of operating non-volatile memory devices during write operation interruption, non-volatile memory devices, memories and electronic systems operating the same
Fuel cell and a method of manufacturing a fuel cell
Inductance element
  Randomly Featured Patents
Water soluble carbon nanotubes
On-chip automatic system for impedance matching in very high speed input-output chip interfacing
Cytokine related to hemolytic anemia and method of use
Drum type washing machine
Method of preparing a self-sealing pneumatic tire
Apparatus for allowing wheeled negotiation of an obstacle
Expandable transformable gutter bracket
Apparatus for cutting winding strips for use in a wound core
Router interface card
Rigid ribbed tray