Google Refine recipe for matching messy objects in two databases

I have two databases of dirty names such as:

  • Jindal, Bobby
  • Fla. Governor Bobby Jindal
  • Bobby jindal
  • 3M Corp.
  • 3M Menomonie

I need to find the matches between them. Can someone point me to, or suggest, a good recipe for doing this in Google Refine?

This link gives me a starting point, but I could use further tips: http://blog.ouseful.info/2011/05/06/merging-datesets-with-common-columns-in-google-refine/

+3
2 answers

You can try our Refine extension; see especially the reconciliation part of its documentation.

+2

The cell.cross function in Refine is similar to VLOOKUP in Excel.
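A lookup of this kind only pairs rows whose key values are exactly equal, so dirty names usually need to be normalized into a common key first. The following is a minimal sketch of that idea in Python rather than in Refine itself. It assumes a fingerprint-style key (lowercase, punctuation stripped, tokens de-duplicated and sorted), similar in spirit to Refine's key-collision clustering; the function names `fingerprint` and `cross_lookup` are hypothetical and not part of any Refine API.

```python
import re
import unicodedata


def fingerprint(name):
    """Build a normalized key: ASCII, lowercase, no punctuation, sorted unique tokens."""
    # Decompose accented characters and drop anything that is not ASCII.
    text = unicodedata.normalize("NFKD", name)
    text = text.encode("ascii", "ignore").decode("ascii").lower()
    # Replace punctuation with spaces so "Jindal, Bobby" splits cleanly.
    text = re.sub(r"[^\w\s]", " ", text)
    # Sorting unique tokens makes the key independent of word order.
    return " ".join(sorted(set(text.split())))


def cross_lookup(left_names, right_names):
    """For each left name, list the right names sharing the same fingerprint (exact key match)."""
    index = {}
    for name in right_names:
        index.setdefault(fingerprint(name), []).append(name)
    return {name: index.get(fingerprint(name), []) for name in left_names}


# Sample values taken from the question, split arbitrarily into two "databases".
left = ["Jindal, Bobby", "3M Corp."]
right = ["Bobby jindal", "Fla. Governor Bobby Jindal", "3M Menomonie"]
matches = cross_lookup(left, right)
```

In this sketch "Jindal, Bobby" and "Bobby jindal" share the key "bobby jindal" and are paired, while "Fla. Governor Bobby Jindal" is not, because the extra title tokens change its key. Entries like that still need further cleanup, or a fuzzier approach such as reconciliation, before an exact-match lookup will find them.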

. : rdf one .

+1
