finding duplicates in large table with the use of utl_match
617589Jan 14 2008 — edited Jan 15 2008hello specialist
I'm relatively new to the world of working with text comparisons but now the client threw me in the pool and I need to start swimming.
Basic concept is that based on some predefined rules I have to find doubles in a table. Doubles can be exact matches on addresses and names, those are the easier ones. But it gets harder when the utl_match needs to kick in.
How would you solve the issue of trying to find a duplicate with the utl_match in a 20 million record table? I have my source table of 20M and at the end I need a table that contains a field that indicates that record X is a match to record Y.
Thanks in advance for helping me out and sharing your expertise.
Koen Verheyen
Belgium