September 25, 2007

Listed below are links to weblogs that reference Entity Resolution Systems vs. Match Merge/Merge Purge/List De-duplication Systems:

Comments

Jeff Carr

Thanks for addressing this topic. I'm interested in your opinion on a recent US Air Force SBIR proposal entitled "Consolidating Entity Information from Heterogeneous Text Sources for Multi-INT Fusion". It concerns the difficulty of solving two cross-document coreference resolution problems: (1) cross-document name disambiguation, and (2) alias resolution. The authors of this topic seem to think that cross-document resolution involving structured and unstructured data across multi-INT domains is still a major problem.

Is that your view as well, Jeff?

Delpierre

Very informative.

Douglas Schwartz

We found the same thing to be true also.

Andrew

Thank you for this article. It has expanded my narrow thoughts on the uses of Merge and Purge. I had not considered a real-time application of the service.

Umang Juthani

Thank you for this article. My team is currently debating which kind of business framework to give our tool set, i.e., survivorship vs. order of precedence.
Our goal is to make available individual source system data as well as a golden record, computed from the various source systems, that holds the most accurate active information about a customer, and hence a true 360 view of that customer.

Please expect future questions on this topic once I demo this article to my team.

Leonardo

This article makes a strong case for probabilistic databases, or other kinds of uncertainty management, and for collective matching (a current trend in machine learning and data mining). I certainly agree with this view.
