Quantcast
Channel: Data Preparation & Blending discussions
Viewing all articles
Browse latest Browse all 4999

Fuzzy Matching not matching?

$
0
0

I am trying to match company names. I am having an issue where it is not matching items I would expect the tool to match unless I change the match threshold to be very low. My example is the "PRESIDIO" is only matching to "PRESIDIO NETWORKED SOLUTIONS INC" at a 52% match score. Even if I tell the tool to not to create keys for "NETWORKED" "SOLUTIONS" or "INC" it still only matches at 52%. When I look at the results I can see that it generates the same Match Key for "PRESIDIO" and "PRESIDIO NETWORKED SOLUTIONS INC" but for some reason it does not match. I have tried setting the tool to run "Key Match Only" but when I run that through the rest of my data I get a bunch of bad matches. 

 

I have attached how I have the Fuzzy Match tool set up, my sample data, and the results I am getting. In my results I am trying to match ID 1 to ID2 100 you can see that it has the same Match Key for the first 2 items but no match. 

Does anyone know if there is a way I can get these types of items to match at a reasonable % threshold? Is this just a limitation of the tool where the other words such as "NETWORKED" or "SOLUTIONS" prevent it from matching?

 

Any help would be greatly appreciated! Thank you!

 

 

 

 SampleData.PNGResults.PNGoptionssettings.PNG

 

 

 

 

 


Viewing all articles
Browse latest Browse all 4999

Trending Articles