My Photo

Your email address:


Powered by FeedBlitz

May 2008

Sun Mon Tue Wed Thu Fri Sat
        1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31
Blog powered by TypePad

« Data Mining, Predicate Triage and NSA Domestic Surveillance | Main | What’s In A Name? »

March 14, 2006

There Is No Such Thing As A Single Version of Truth

If you are not interested in a technical peculiarity that occurs in aggregated data sets, just ignore this post.

I am often asked what my thoughts are about selecting the single best attributes (e.g., best name and best address) when multiple attributes are known. I always respond with, “truth is in the eye of the beholder.”

This came as a hard lesson. In the mid-1990’s, I built a data warehouse that was being fed daily by over 4,000 disparate operational systems belonging to handful of widely recognized consumer brands. The goal was to better understand the customer by recognizing when the same person was transacting across different brands all held by the same holding company. The underlying motivation: the more fully the customer is understood the more you can sell to the customer.

There I sat with a number of marketing VP’s, each representing their brand’s interests. And while everyone worked for the same parent company, there was one question no one could agreed upon: When a consumer has transacted with all of the brands, each time using a slightly different name or new address, which name and address should be considered the enterprise-wide GOLD standard? As it turns out, there is no such thing as a single version of truth.

The name and address supplied to a human resources system by an employee is the best name and address for an IRS filing, even if a different name and address has become available from another system. And a hotel statement better be sent to the address supplied by the guest when he or she checks out of the hotel – not some other address deemed “best” because of its perceived currency and reliability from some other data source. For a direct marketing piece, a name and address from a loyalty club program is generally better than the hotel reservation data provided over the phone. Why? Because loyalty club data is more reliable as consumers want to receive their points statement in the mail.

Thus, the definition of best varies based on who is asking the question. So when I am asked how to determine the single best version of truth I recommend being prepared to deliver every version of truth -- for truth is in the eye of the beholder.

Truth on demand … so to speak.

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/t/trackback/692172/4453160

Listed below are links to weblogs that reference There Is No Such Thing As A Single Version of Truth:

Comments

I used to work for a data recognition company. I called this concept "Quantum Recognition". Basically, a person existed in a number of different states until you provided enough information to determine what "version" of the consumer you wanted.
Good to see someone else thinking along the same lines.

It would be more accurate to say that these particular entities have n attributes - like an n-dimensional vector - and that truth is not in the eye of the beholder (the values for each dimension are fixed at any one time), but rather each user defines what is important to him (his "truth") as one (or multiple), but not all, of the dimensions.

Why is the industry always pushing one version of the truth.

I agree "truth is in the eye of the beholder".

We could not agree an a recent MDM effort and scrapped it. But a seperate app to app integration project I was on adopted an open-spec approach we found on sourceforge called Jumper metamodel. This gave us the best of MDM with much needed flexibility.

Post a comment

If you have a TypeKey or TypePad account, please Sign In