Eu estava procurando no Fuzzy Lookup como uma forma de destacar essas quase duplicatas
O suplemento de pesquisa difusa para o Excel executa a correspondência difusa de dados textuais no Excel.
Suplemento de pesquisa difusa para o Excel
The Fuzzy Lookup Add-In for Excel was developed by Microsoft Research and performs fuzzy matching of textual data in Microsoft Excel.
It can be used to identify fuzzy duplicate rows within a single table or to fuzzy join similar rows between two different tables. The matching is robust to a wide variety of errors including spelling mistakes, abbreviations, synonyms and added/missing data.
For instance, it might detect that the rows “Mr. Andrew Hill”, “Hill, Andrew R.” and “Andy Hill” all refer to the same underlying entity, returning a similarity score along with each match.
While the default configuration works well for a wide variety of textual data, such as product names or customer addresses, the matching may also be customized for specific domains or languages.
Fonte Add-in de pesquisa difusa para o Excel
Alguma sugestão sobre a configuração do limiar de similaridade?
Realizar pesquisas difusas no Excel tem algumas dicas sobre o limiar de similaridade configuração.