<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
   <channel>
      <title>Informatica Data Quality Blog</title>
      <link>http://blogs.informatica.com/dataquality/</link>
      <description></description>
      <language>en</language>
      <copyright>Copyright 2008</copyright>
      <lastBuildDate>Thu, 15 May 2008 16:20:43 -0800</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=3.2</generator>
      <docs>http://blogs.law.harvard.edu/tech/rss</docs> 

            <item>
         <title>Identity Systems Acquisition – the Next Evolution of Data Quality</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/chris_cingrani.jpg" height="63" width="50" alt="Chris Cingrani"></td></tr>
</table>
On May 15th, Informatica completed the acquisition of Identity Systems, a pioneer in identity resolution technology allowing customers to search and match identification data across multiple systems and more than 60 languages.  As someone who has been in the data quality space since 2001, I remember encountering Identity Systems in various sales cycles when they were known as Search Software America (SSA).  In each instance, I recall that the level of sophistication around matching they offered was something that was difficult to compete against.  If we could expand the data quality discussion to include other aspects, such as cleansing and validation, we had a much better chance in the proof of concept or sales cycle.]]></description>
         <link>http://blogs.informatica.com/dataquality/2008/05/identity_systems_acquisition_t.html</link>
         <guid>http://blogs.informatica.com/dataquality/2008/05/identity_systems_acquisition_t.html</guid>
         <category>Benefits</category>
         <pubDate>Thu, 15 May 2008 16:20:43 -0800</pubDate>
      </item>
            <item>
         <title>Data Quality Maturity Model – How Does Your Organization Rate?</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/chris_cingrani.jpg" height="63" width="50" alt="Chris Cingrani"></td></tr>
</table>

<p>Recently I spoke at a User Group Meeting on the topic “Align for Success: The critical part Data Quality plays in complex Business and IT Initiatives.”  I began the discussion by polling the group to find out how many of the organizations represented had a data quality solution in place.  The response to the question was mixed, with approximately half the audience indicating they either had a solution or were considering one, while the other half indicated they weren’t currently considering data quality (or the person was unaware of any data quality initiatives).  Although this was a very unscientific survey, it set the tone for my presentation, as I attempted to explain the concept of a data quality maturity model.  By understanding where an organization is today from the standpoint of the model, management can begin to develop plans as to where they want to end up both in the short and long term.</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2008/05/data_quality_maturity_model_ho_1.html</link>
         <guid>http://blogs.informatica.com/dataquality/2008/05/data_quality_maturity_model_ho_1.html</guid>
         <category>Best Practices</category>
         <pubDate>Fri, 09 May 2008 10:23:12 -0800</pubDate>
      </item>
            <item>
         <title>Profile Early, Profile Often</title>
         <description><![CDATA[<p>Dr. Claudia Imhoff, President & Founder, Intelligent Solutions and Ed Lindsey, National Product Specialist, Informatica answer some questions that were raised during our recent web seminar. If you missed the web seminar, you can listen to it by clicking the following link:</p>

<p><a href="http://www.informatica.com/info/imhoffreplayq108">An Eye-opener for Your Business: How Data Profiling Can Build Support for Data Quality within Data Management Projects</a></p>

<p><br />
Q: Who in the organization should be responsible for data quality? And who should sign off on the scope document?</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2008/04/profile_early_profile_often.html</link>
         <guid>http://blogs.informatica.com/dataquality/2008/04/profile_early_profile_often.html</guid>
         <category>Data Quality</category>
         <pubDate>Tue, 08 Apr 2008 08:12:54 -0800</pubDate>
      </item>
            <item>
         <title>Seeing is Believing</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/dq_cimhoff.jpg" height="63" width="50" alt="Claudia Imhoff, PhDi"></td></tr>
</table>You know the expression "Seeing is believing"? Well, it is even more true for data management projects. Getting sponsorship much less funding for data management projects like data quality, CRM, MDM, data warehousing, or other such enterprise-encompassing initiatives is very difficult at best. What should you do?

<p>Today I am doing a webcast with Informatica on just this topic. The answer lies in your ability to demonstrate the true state of the data -- "seeing" the problems by performing data profiling on your data. Nothing says "Houston, we have a problem" more than viewing the profiling results. It will make believers out of the most difficult people!</p>

<p>I hope you enjoy the presentation.</p>

<p>Update: Questions and answers from this event can be in the following post - <a href="http://blogs.informatica.com/dataquality/2008/04/profile_early_profile_often.html">Profile Early, Profile Often</a><br />
</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2008/03/seeing_is_believing.html</link>
         <guid>http://blogs.informatica.com/dataquality/2008/03/seeing_is_believing.html</guid>
         <category>Data Quality</category>
         <pubDate>Wed, 12 Mar 2008 08:40:05 -0800</pubDate>
      </item>
            <item>
         <title>Rome Wasn’t Built in a Day and Neither is a Data Governance Initiative</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/chris_cingrani.jpg" height="63" width="50" alt="Chris Cingrani"></td></tr>
</table>
In my previous posts, I have discussed building the business case for data quality as well as the role that a data quality dashboard plays in supporting this case.  As previously noted, these efforts will directly impact your ability to articulate the need to pursue a data quality initiative.  The reason for returning to this topic is that I have recently participated in multiple discussions with a variety of companies that were either in the process of forming a data governance council or in the process of building the internal business case to support exploring a data governance initiative.  In these discussions two common threads were present – the role of data quality in the data governance initiative and the need to change the culture within the organization if data governance is going to succeed.  Although these are only two aspects to consider when pursuing a data governance initiative, they are directly tied to the underlying success or failure of the program.]]></description>
         <link>http://blogs.informatica.com/dataquality/2008/02/rome_wasnt_built_in_a_day_and.html</link>
         <guid>http://blogs.informatica.com/dataquality/2008/02/rome_wasnt_built_in_a_day_and.html</guid>
         <category>Governance / Stewardship</category>
         <pubDate>Fri, 15 Feb 2008 09:44:19 -0800</pubDate>
      </item>
            <item>
         <title>You can’t have CDI without Data Quality</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">

<p><tr><td><img src="http://www.informatica.com/blogs/tom_golden.gif" height="63" width="50" alt="Tom Golden"></td></tr></p>

</table>
Looking in Webopedia.com recently I came across a definition for CDI. Yes webopedia.com - it bills itself as the #1 online encyclopaedia dedicated to computer technology. You might wonder what I was doing surfing this font of knowledge – well I had time on my hands between delayed flights coming back to Europe from the US. You know what they say “time to spare, travel by air.”  

<p>The Webopedia.com CDI definition went: “Short for <em>Customer Data Integration</em>, it is the combination of the technology, processes, and services needed to create and maintain an accurate, timely and complete view of the customer across multiple channels, business lines, and, potentially, enterprises, where there are multiple sources of customer data in multiple application systems and databases.”</p>

<p>A bit long winded perhaps, but the three words that shone out at me through the glare of the florescent lights in San Francisco airport were “accurate, timely and complete”; all data quality issues. Despite this, few if any of the Customer Data Integration (CDI) vendors in the market today have truly addressed the data quality issues in their CDI solutions. And anyone who has gone down the route of developing their own custom-built CDI application will be all too familiar with the data quality demands involved.<br />
</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2008/02/you_cant_have_cdi_without_data.html</link>
         <guid>http://blogs.informatica.com/dataquality/2008/02/you_cant_have_cdi_without_data.html</guid>
         <category></category>
         <pubDate>Fri, 01 Feb 2008 02:13:00 -0800</pubDate>
      </item>
            <item>
         <title>IQ and Information Product Specifications Quality</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/t_bio_lenglish.jpg" height="63" width="50" alt="Larry English"></td></tr>
</table>
One of the root causes of poor quality information is defects in the data definition, specifically the “information product specifications.”  Because information is a product of our business, manufacturing and service processes, the analogy of an “information product” is real, and the requirement for quality in “information product specifications” is a critical requirement for Information Quality.

<p>This blog is the first of a series of three blogs on the critical quality characteristics (or measures) of information quality required to achieve Total Information Quality Management.</p>

<p>1.	Information Product Specification data quality<br />
2.	Data content quality<br />
3.	Information presentation quality</p>

<p>What constitutes the “Information Product Specifications” data?</p>

<p>•	Information standards<br />
•	Data names<br />
•	Data definitions<br />
•	Attribute valid value set or range of values<br />
•	Value format for structured attributes (VIN, SSN, Product Codes)<br />
•	Business rule specifications of constraints on data<br />
•	Information Steward accountable for data definition quality<br />
</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2008/01/iq_and_information_product_spe_1.html</link>
         <guid>http://blogs.informatica.com/dataquality/2008/01/iq_and_information_product_spe_1.html</guid>
         <category></category>
         <pubDate>Sun, 20 Jan 2008 04:15:06 -0800</pubDate>
      </item>
            <item>
         <title>Data Quality Dashboard – Capture Your Audience’s Attention</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/chris_cingrani.jpg" height="63" width="50" alt="Chris Cingrani"></td></tr>
</table>
In my last blog post, I discussed the topic of building the business case for data quality.  As such, one of the points I mentioned was the need to highlight resulting issues.  Since my last post, I have had a number of discussions with clients and prospects on this topic.  At the core of these discussions is the same fundamental question – what is the best way to package up the issues we uncover?  In answering this, I often discuss the six dimensions of data quality <a href="http://www.informatica.com/solutions/data_quality/management/monitoring/default.htm">(Completeness, Conformity, Consistency, Accuracy, Integrity, and Duplicates)</a> and how to use a data quality scorecard to present the information in a meaningful way that it can be shared with key stakeholders within the organization.  Although my response to this question remains the same, a conversation I overheard at the airport a couple of weeks ago made me look at the need for a DQ scorecard a little differently.

<p>While grabbing a bite to eat prior to a flight, I overheard two gentlemen who were both retired from the newspaper business discussing how people don’t really take the time to read a newspaper like they used to.  They were lamenting that people today preferred quick sound bites of information – whether it be from television or from reading one of the various news sites on the Internet.  <br />
</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2007/12/data_quality_dashboard_capture.html</link>
         <guid>http://blogs.informatica.com/dataquality/2007/12/data_quality_dashboard_capture.html</guid>
         <category>Monitoring</category>
         <pubDate>Fri, 21 Dec 2007 17:11:47 -0800</pubDate>
      </item>
            <item>
         <title>Better management through measuring data quality</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/ivan_sm_hd.gif" height="63" width="50" alt="Ivan Chong"></td></tr>
</table>
I recently asked a customer of ours why they invested so much in monitoring and publishing key performance indicators for their data quality. “Believe it or not, the biggest reason we measure data quality is not to correct bad data” came the reply. “The reason we monitor data quality is to detect problems with our business processes.” 

<p>Indeed, as I mentioned in my last blog post, business users look to investments in people and processes in addition to technology in order to address poor data quality. For example, if a bank branch manager received a report showing that customer data originating from his branch office had much higher incidents of duplicate entries and was putting the entire bank at risk of massive regulatory fines, he is not going to throw technology at the problem. His response might be mandatory training for tellers or better hiring practices to screen for adequate computer skills. </p>

<p>Experts in quality control methodology refer to this as addressing “root cause.” Common starting points of measurement involve completeness, accuracy, consistency, conformity, duplication, and integrity. Eventually, as the business culture matures its data quality practices, timeliness and data lineage (origination) are used to evaluate quality of data. Of course, software technology that automates the process of parsing, standardizing, matching and consolidating data is of immense value and is an absolute requirement in any data integration project. However, the issue of data quality goes beyond these IT projects. Ongoing measurement and monitoring of data quality provides value directly to the business because it helps them to better manage their people and processes. </p>

<p><br />
</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2007/12/better_management_through_meas.html</link>
         <guid>http://blogs.informatica.com/dataquality/2007/12/better_management_through_meas.html</guid>
         <category></category>
         <pubDate>Thu, 20 Dec 2007 03:57:29 -0800</pubDate>
      </item>
            <item>
         <title>The self-service data quality minefield</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">

<p><tr><td><img src="http://www.informatica.com/blogs/tom_golden.gif" height="63" width="50" alt="Tom Golden"></td></tr></p>

</table>Marshall Field, the American department-store owner and retail merchandizing pioneer, is usually credited with coming up with the now much abused adage “Right or wrong, the customer is always right”. These days the saying is usually shortened to the more functional “the customer is always right”. But with self-service data entry, where more and more customers are responsible for supplying their own contact and order details via web applications, it is worth revisiting the entire quote. 

<p>The fact is even when the customer is responsible for “bad data” the vendor still has to shoulder the blame and do something about it.</p>

<p>Take the example of a friend of mine who recently signed up with a well known social networking website. Let’s ignore the reasons why he signed up – he’s not entirely sure – or what social networking sites are for – none of us is entirely sure yet, but I guess we’ll start to get the point as more and more of them are sold to industry giants for <em>n</em> billion dollars.</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2007/12/the_selfservice_data_quality_m_1.html</link>
         <guid>http://blogs.informatica.com/dataquality/2007/12/the_selfservice_data_quality_m_1.html</guid>
         <category>Data Quality</category>
         <pubDate>Fri, 14 Dec 2007 04:08:33 -0800</pubDate>
      </item>
            <item>
         <title>Start small with monitoring, but always think big to achieve data quality goals</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">

<p><tr><td><img src="http://www.informatica.com/blogs/tom_golden.gif" height="63" width="50" alt="Tom Golden"></td></tr></p>

</table>
I attended my first parent-teacher meeting the other day for my five-year old daughter. Another one of those “life stage” events done and dusted – I remember dreading the annual meeting when I was a kid. The notion of my parents and my teacher comparing notes on my behaviour was too much to bear – somebody was eventually going to put two and two together and find out I was up to no good. 

<p>It all got me thinking about a recent blog post by my esteemed colleague Garry Moroney. His post <a href="http://blogs.informatica.com/dataquality/2007/06/mobilizing_the_data_quality_ar.html">Mobilizing the Data Quality Army</a> outlined the level of effort, thought and planning that the US Department of Education is putting into data quality.</p>

<p>As Garry points out dealing with data quality in a large, disconnected organization such as the US schools system is not a trivial exercise. But if you were to only read that one post you might be overwhelmed by the potential size of the data quality task in front of you. <br />
</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2007/12/start_small_with_monitoring_bu_1.html</link>
         <guid>http://blogs.informatica.com/dataquality/2007/12/start_small_with_monitoring_bu_1.html</guid>
         <category>Data Quality</category>
         <pubDate>Thu, 06 Dec 2007 07:50:31 -0800</pubDate>
      </item>
            <item>
         <title>Business and IT Collaboration is Essential for Data Quality</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/ivan_sm_hd.gif" height="63" width="50" alt="Ivan Chong"></td></tr>
</table>
A recent InformationWeek article* described the growth in IT employment across the US as a result of a shift in skills. Rather than focusing on pure IT proficiency, organizations are looking for talent with “a more hybrid mix of technology skills, along with an understanding of the business and its customers.” 

<p>IT departments are highly motivated to increase the level of collaboration with their counterparts in the business. Nowhere is this more critical than in the area of data quality and the trend is causing a shift in the way companies are looking to solve their data quality issues. First generation data quality tools had a natural focus on technology, instead of business. Here are some of the differences between technology focused data quality solutions and business-focused data quality solutions.</p>

<p><strong>Tools vs. Process</strong><br />
Technology focused data quality solutions provide tools that automate data processing. Evidence of this type of focus can be seen in the way that vendors will tout the sophistication and type of their algorithms over and above their ability to support ongoing data quality management processes. While technology is extremely important, its relevance cannot eclipse the overall data quality management process. Even if your data quality tool can automate the correction of 95 percent of the data, if the remaining five percent cannot be managed properly, you will continue to suffer from poor data quality.<br />
</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2007/12/business_and_it_collaboration_1.html</link>
         <guid>http://blogs.informatica.com/dataquality/2007/12/business_and_it_collaboration_1.html</guid>
         <category>Data Quality</category>
         <pubDate>Wed, 05 Dec 2007 00:05:42 -0800</pubDate>
      </item>
            <item>
         <title>Building the Business Case for Data Quality</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/chris_cingrani.jpg" height="63" width="50" alt="Chris Cingrani"></td></tr>
</table>

<p>As a new contributor to the data quality blog site, I wanted to start by introducing myself and highlighting the types of topics I plan to discuss on a semi-frequent basis. I am a Principal Consultant with Informatica Professional Services and have spent the past 6 years in the data quality space in a variety of sales and post sales roles.  During this time I have seen the data quality market continue to evolve and mature.  Thus, I would like to use this column to reflect on the types of use cases I have seen and continue to see when meeting with organization’s faced with data quality problems.  I hope these posts can start an active dialogue, regardless if your company is trying to tackle their first data quality initiative or looking to build out a formal center of excellence around data quality.</p>

<p>To start, I wanted to pose a common question I am often asked by clients and prospects – how do I build a business case for data quality?  Although an organization may think (or even know) there is a problem, the need to justify the cost around procuring a data quality solution often exists.  This justification requirements often comes from the idea that data quality issues aren’t necessarily a core business issue (how wrong this is!) or something that can be handled through manual intervention (this is true – if you have unlimited time and money, but even then your results will be limited). Thus, the following points are meant to help start an organization down the path to building the internal business case through a Data Quality Audit.  Note - if you have access to Informatica’s Velocity Methodology, I go into these steps in further detail in the best practice document, “Developing the Data Quality Business Case.” <br />
</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2007/11/building_the_business_case_for.html</link>
         <guid>http://blogs.informatica.com/dataquality/2007/11/building_the_business_case_for.html</guid>
         <category>Best Practices</category>
         <pubDate>Wed, 28 Nov 2007 14:14:40 -0800</pubDate>
      </item>
            <item>
         <title>Hospital Billing Errors &apos;Kill&apos; Patients</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/t_bio_lenglish.jpg" height="63" width="50" alt="Larry English"></td></tr>
</table>

<p>    Hospital billing mistakes have become so prevalent that a niche industry has evolved to help patients decipher their bills and help correct the errors.  ”Pat Palmer, founder of Medical Billing Advocates of America, estimates that she finds multiple errors in 8 out of every 10 hospital bills she reviews” (Dina ElBoghdady, “Killer Billing Errors,” Washington Post, June 27, 2004, p. F01.  Accessed Aug 9, 2007 at: http://www.washingtonpost.com/wp-dyn/articles/A7351-2004Jun26.html ).  </p>

<p>    One patient saw the bill of $25,652.14 for her 2-hour routine operation, reduced to a cost of $17,000 billed to the insurance company and her cost reduced to $2,148, after correcting errors and overcharges.<br />
While this error-rate was not statistically measured, nor would it apply to all hospital bills, it points out a huge problem in Information Quality in the health care system, not counting actual medical errors caused by information defects.</p>

<p>    The consequence of health care over billing is significant.  It contributes to the high costs of health care, which is one major cause of personal bankruptcy. <br />
</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2007/08/hospital_billing_errors_kill_p.html</link>
         <guid>http://blogs.informatica.com/dataquality/2007/08/hospital_billing_errors_kill_p.html</guid>
         <category>Data Quality</category>
         <pubDate>Wed, 29 Aug 2007 13:45:21 -0800</pubDate>
      </item>
            <item>
         <title>Information Quality &amp; Management Transformation</title>
         <description><![CDATA[<table cellpadding="4" cellspacing="0" cellborder="0" align="left" valign="top">
<tr><td><img src="http://www.informatica.com/blogs/t_bio_lenglish.jpg" height="63" width="50" alt="Larry English"></td></tr>
</table>I recently received an email from one of my early clients.  After having worked in four different companies in four different industries, she came to a sad conclusion, writing:

<p><em>“The thing that they all have in common is a desire to cut corners and deal with quality later.  It takes a lot of energy to be the information quality cheerleader, and I find it discouraging and overwhelming at times.  Keep writing your articles and books to encourage all the people like me who are dealing with these issues every day.” P. G</em>.</p>

<p>The discovery that P. G. has experienced is, unfortunately, the norm—not the exception.  There are two critical elements in this experience.</p>]]></description>
         <link>http://blogs.informatica.com/dataquality/2007/06/information_quality_management_2.html</link>
         <guid>http://blogs.informatica.com/dataquality/2007/06/information_quality_management_2.html</guid>
         <category>Data Quality</category>
         <pubDate>Thu, 28 Jun 2007 07:59:33 -0800</pubDate>
      </item>
      
   </channel>
</rss>
