2

Building A Business Case For Data Quality: Analyze Data And Identify Anomalies

Building A Business Case For Data Quality, 4 of a 7-part series

Now comes the fun part, inspecting the data. For this step, automated data profiling will help you identify actual problems with the data as they relate to business client expectations. Here are just a few possible issues:

  • Are the phone numbers empty?
  • Are the admission dates missing in inpatient hospital claims?
  • Are there car loans with durations greater than 10 years?
  • Do shipping records lack corresponding billing records?
  • Do product descriptions differ only slightly?
  • Are you delivering products to many different customers with the same address?
  • What business rules are being violated?

 

Using a product such as Informatica Data Explorer, you can significantly speed up data inspection because much of the work is automated

 

You begin testing your data. Generally, in building a case for data quality, you would only look for the anomalies. There are three major steps in profiling data to find anomalies. First, look at the individual columns of each table. Second, look at the structure of the individual tables. And finally, look for relationships between tables. Here are some basic questions to ask:

  • Are the key fields unique?
  • Are all the important fields populated?
  • Are date fields dates or some other datatype?
  • Are natural keys unique?
  • Are relationships between tables intact?
  • Are there non-unique descriptions in a table with unique keys?
  • Can you validate entries against reference data?
  • Are there duplicate entries for the same subject?
  • Do all the values in a column exhibit the same pattern of the data?
  • Does the data conform to basic business rules?

For more on Building a Business Case for Data Quality read my first post.

 

Please share your feedback and of course any questions you might have.

FacebookTwitterLinkedInEmailPrintShare
This entry was posted in Customers, Data Quality, Pervasive Data Quality and tagged , , , . Bookmark the permalink.

2 Responses to Building A Business Case For Data Quality: Analyze Data And Identify Anomalies

  1. From March, 2011 we are distributors of INFORMATICA CORPORATION and I am constructing a case of DQ’s business, for which everything interests his reforehead to this topic.

    Please, It is possible that you send to me the bussines for DQ 1 to 10?

    Thank you

    Antonio

  2. We are in Colombia, and the Business case is for a Bank

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>