Modified Levenshtein distance algorithm for coding










Image Number 8 for United States Patent #7664343.

Methods and systems are disclosed for mapping an optical character recognition (OCR) text string to a code included in a coding dictionary. The Levenshtein Distance Algorithm (LDA) is supplemented with additional information in the form of adjustments based on particular character substitutions, insertions and deletions, together with weighting based on multiple alternatives for the OCR text string. In one embodiment, an OCR text string mapping method (100) includes receiving (110) an OCR text string; comparing (120) it with selected text strings from a coding dictionary; computing (130) modified Levenshtein distances associated with the comparisons by determining (140) substitution penalties, determining (150) insertion penalties, determining (160) deletion penalties and combining (170) the penalties; selecting (180) the best matching text string from the coding dictionary based on the modified Levenshtein distances; determining (190) whether a maximum threshold distance is met; and assigning (200) a code associated with the best matching text string to the OCR text string when the threshold is met, or assigning (210) a null or no code when it is not.
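The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the penalty tables (SUB_COST, INS_COST, DEL_COST), their values, the function names, and the sample dictionary are all hypothetical, and "threshold met" is assumed to mean that the best distance is less than or equal to the maximum. The weighting based on multiple OCR alternatives is not modeled here.

```python
from typing import Dict, Optional

# Hypothetical penalty tables (assumptions, not values from the patent).
# Pairs that OCR engines commonly confuse get a reduced substitution cost.
SUB_COST = {("0", "O"): 0.25, ("O", "0"): 0.25,
            ("1", "I"): 0.25, ("I", "1"): 0.25}
INS_COST = {".": 0.5}   # characters cheap to add to the OCR string
DEL_COST = {".": 0.5}   # characters cheap to drop from the OCR string
DEFAULT_COST = 1.0      # the cost of every edit in the plain algorithm


def sub_cost(a: str, b: str) -> float:
    """Substitution penalty for replacing OCR character a with b."""
    if a == b:
        return 0.0
    return SUB_COST.get((a, b), DEFAULT_COST)


def modified_levenshtein(ocr: str, entry: str) -> float:
    """Weighted edit distance from the OCR string to a dictionary entry."""
    # prev[j] is the cost of turning the empty prefix into entry[:j].
    prev = [0.0]
    for ch in entry:
        prev.append(prev[-1] + INS_COST.get(ch, DEFAULT_COST))
    for a in ocr:
        delete_a = DEL_COST.get(a, DEFAULT_COST)
        cur = [prev[0] + delete_a]
        for j, b in enumerate(entry, start=1):
            cur.append(min(
                prev[j] + delete_a,                          # deletion
                cur[j - 1] + INS_COST.get(b, DEFAULT_COST),  # insertion
                prev[j - 1] + sub_cost(a, b),                # substitution
            ))
        prev = cur
    return prev[-1]


def assign_code(ocr: str, dictionary: Dict[str, str],
                max_distance: float) -> Optional[str]:
    """Return the code of the closest entry, or None if it is too far."""
    best = min(dictionary, key=lambda text: modified_levenshtein(ocr, text))
    if modified_levenshtein(ocr, best) <= max_distance:
        return dictionary[best]
    return None


# Hypothetical coding dictionary mapping text strings to codes.
CODES = {"CODE": "A1", "NODE": "B2"}
print(assign_code("C0DE", CODES, 1.0))  # zero read in place of letter O
print(assign_code("XXXX", CODES, 1.0))  # nothing close enough
```

With every penalty left at the default of 1.0 the function reduces to the ordinary Levenshtein distance; the tables only lower the cost of edits that are likely to be OCR errors, so that such strings still map to the intended dictionary entry.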








 
 