Resources Contact Us Home
Modified Levenshtein distance algorithm for coding

Image Number 8 for United States Patent #7664343.

Methods and systems of mapping of an optical character recognition (OCR) text string to a code included in a coding dictionary by supplementing the Levenshtein Distance Algorithm (LDA) with additional information in the form of adjustments based on particular character substitutions, insertions and deletions together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string, comparing (120) it with selected text strings from a coding dictionary, computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties, selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances, determining (190) whether a maximum threshold distance is met, and assigning (200) a code associated with the best matching text string to the OCR text string when met, and assigning (210) a null or no code when not met.

  Recently Added Patents
Content distribution system, mobile communication terminal device, and computer readable medium
Headset electronics
Method for providing information of access point selection
Identifying users of remote sessions
Image capturing apparatus, control method thereof, and program
Configurable pitch reducing optical fiber array
Display for gloves
  Randomly Featured Patents
Evaporative emissions canister having an internal insert
Cellular radio routing system
Method and apparatus for securing a suture
Steroids with radical-attracting aromatic substituents, process for the production thereof and pharmaceutical compounds containing the said substances
Process for producing dies
Function specific property nodes for graphical programs
Shape clustering in post optical character recognition processing
Bicycle derailleur
Method of welding continuous rails and apparatus therefor
Method and apparatus for ligating a body part