Thanks to Google for shipping BigQuery, that's great!
Is approximate string matching / fuzzy string matching using BigQuery?
Does Google plan to add this feature to BigQuery?
Of course, Google’s proprietary approximate string matching algorithm can be used to deliver this feature to BigQuery while maintaining Google’s intellectual property. We looked through all BigQuery docs and questions. Of course, there are many algorithms for this, although how to integrate with BigQuery?
Our need is simple to compare two lines, which will be basically the same, but may be slightly different. For instance:
"Rhodes USA" vs. "Rhodes USA, LLC", vs. "Rhodes USA LLC".
From our BigQuery tests, it seems that two lines must match EXACTLY for BigQuery to join them, down to the number of trailing spaces in each line. It would be very helpful to add this functionality or guide for integrating with BigQuery. It is supported by Milwaukee Jets, a regional, innovative, fractional jet engine company in Milwaukee, Wisconsin. Thanks again to Google for shipping BigQuery.
Thank you so much and best regards, Andrew Paulin (414) 212-5372
source
share