Posted by Chris McCauley in:
Data Quality
>
Best Practices
;
Data Quality
;
Data Quality
>
Technology
;
Data Quality
>
Vertical Solutions
On reflection, dipping into details of matching technologies in my last blog entry wasn't that much of a detour from the subject of metadata. It broached the idea that one technology was better than another because of its ability to better handle the context in which it was used. There are a number of themes that run through my work on data quality at Informatica and one of them is "metadata as context".
By way of explanation, let's pretend that you are the newly hired Head of Engineering in a software company bringing a product to market (maybe a new operating system). The quality assurance team has completed its testing and you've just been told that it found no "show-stopper" defects, some usability "gotchas" and a smattering of documentation problems. The pressure is on; Sales has been promising these features to customers for weeks and Marketing announced this stuff months ago: to ship or not to ship, that is the question.
The QA stats are on the surface very reassuring, but we need to know some more about how the team arrived at those numbers before we start pressing CDs. If you found out that the team had been added late in a very long development process and was unfamiliar with the product would you still be sitting comfortably?
You can see how the words "context" and "metadata" could be used interchangeably when thinking about the scenario above. Harking back to the discussion about Probabilistic matching systems, there's value in understanding the context in which you are operating. Metadata can be used to capture some aspects of context hence "metadata as context".
Continue reading "Data Quality Metadata; a lot more than just "data about data"" »