Locating Misspellings, Typos, and Approximations




The approximate pattern matcher lets you find information that "looks roughly like" or "sounds like" your query term. To invoke a fuzzy match, precede the word or pattern with the '%' (percent) character.



This pattern matcher locates items by measuring how closely the text matches your query term. Closeness is expressed as a percentage, and by default the matcher finds items that are at least 80% similar to the term. To specify a different percentage, follow the '%' with a two-digit number giving the value you want.
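
As a rough illustration of the syntax just described, the following Python sketch splits a fuzzy term into its percentage and its pattern, then applies the percentage as a cutoff. The names parse_fuzzy_term, similarity_percent, and fuzzy_match are hypothetical, and the similarity measure is Python's difflib.SequenceMatcher, used here only as a stand-in. It is not the measure this pattern matcher uses, so its scores and results will differ from the product's.

```python
import re
from difflib import SequenceMatcher

DEFAULT_THRESHOLD = 80  # default percentage stated in the text


def parse_fuzzy_term(term):
    """Split '%75foo' into (75, 'foo') and '%foo' into (80, 'foo')."""
    m = re.fullmatch(r"%(\d{2})?(.+)", term)
    if m is None:
        raise ValueError("not a fuzzy term: %r" % term)
    threshold = int(m.group(1)) if m.group(1) else DEFAULT_THRESHOLD
    return threshold, m.group(2)


def similarity_percent(a, b):
    """Stand-in similarity score from 0 to 100 (an assumption, see above)."""
    return 100.0 * SequenceMatcher(None, a.lower(), b.lower()).ratio()


def fuzzy_match(term, candidate):
    """True if candidate is at least as similar as the term's percentage."""
    threshold, pattern = parse_fuzzy_term(term)
    return similarity_percent(pattern, candidate) >= threshold


if __name__ == "__main__":
    print(parse_fuzzy_term("%regan"))
    print(parse_fuzzy_term("%75MYPARTNO9045d/6a"))
    print(fuzzy_match("%regan", "Reagan"))
    print(fuzzy_match("%95regan", "Reagan"))
```

With this stand-in measure, raising the percentage makes the match stricter: "Reagan" passes the default 80% cutoff for "%regan" but not a 95% cutoff.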

This method has several advantages over the "soundex" method found in many other tools:

  • Handles character transpositions and omissions.
  • Can be used on non-word items like addresses.
  • Will match foreign language constructions.
  • Finds accidental OCR errors or character insertions.

Examples:

Expression           Will Find
-------------------------------------------------------------------------
ronald %regan        Ronald Raygun, Ronald Re~an, Ronald 8eagan
%75MYPARTNO9045d/6a  Anything at least 75% similar to MYPARTNO9045d/6a
mo* %quadaffi        mohamar kadaffi, Mo Qadaffi, etc...


Notes:
Because fuzzy searching by its nature tends to return more than you wanted, it is a good idea to "anchor" your fuzzy term with another keyword, as in the "ronald %regan" example above.
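
The effect of anchoring can be sketched in a few lines of Python. This is a simplified model under stated assumptions, not the product's query engine: the helper names word_matches and record_matches are hypothetical, plain keywords are compared as whole words without regard to case, and fuzzy terms are scored with difflib.SequenceMatcher as a stand-in similarity measure.

```python
from difflib import SequenceMatcher


def word_matches(term, word, default_threshold=80):
    """Match one query term against one word of text."""
    if not term.startswith("%"):
        # Plain keyword: whole-word, case-insensitive comparison (assumed).
        return term.lower() == word.lower()
    body = term[1:]
    threshold = default_threshold
    # An optional two-digit number after '%' overrides the default.
    if len(body) > 2 and body[:2].isdigit():
        threshold, body = int(body[:2]), body[2:]
    # Stand-in similarity measure, not the product's own.
    ratio = SequenceMatcher(None, body.lower(), word.lower()).ratio()
    return 100 * ratio >= threshold


def record_matches(query, text):
    """True if every query term matches at least one word in the text."""
    words = text.split()
    return all(any(word_matches(t, w) for w in words) for t in query.split())


if __name__ == "__main__":
    texts = ["President Ronald Reagan spoke", "Donald Regan resigned"]
    for text in texts:
        print(text, record_matches("%regan", text),
              record_matches("ronald %regan", text))
```

In this model the unanchored query "%regan" accepts both sample texts, while "ronald %regan" rejects the one that lacks the keyword "ronald".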





Copyright © 1996 Thunderstone Software
Copyright © 2024 Thunderstone Software LLC. All rights reserved.