Hi, I have two sets of data ( Company names ) that I have been able to fuzzy match successfully - however I have the following issues:
- A lot of names are just matched on the basis of common words like 'Corporation', 'International', 'Holding, 'Inc', 'Ltd' etc. I want to eliminate such matches based on such list of words only. How do i do that? If I increase the match threshold - it is eliminating some key returns as well: for eg:
ABC Partners Inc & ABC Inc
- I am also getting multiple matches for a particular company name with different match score - is it possible to only keep the highest match score data?
Any help will be highly appreciated