Data Quality in Voter Rolls: A Big Problem with a Familiar Ring
Posted in Business Impact / Benefits, Data Integration, Data Quality, Data Warehousing, Enterprise Data Management, Governance, Risk and Compliance by Joe McKendrick |![]() |
In Chris Cingrani's recent post the question: "Data quality, does anyone care?" was posed. The answer is yes, of course people care about data quality – in fact, there are a lot of good reasons why a lot of people should care very deeply about data quality. Let’s look at the most recent example of where data quality makes a big difference, and that is in the federal election process.
A recent Washington Post article "Thousands Face Mix-Ups In Voter Registrations", published just prior to the recent presidential election, points to all the problems and confusion that have been arising around managing data about hundreds of millions of voters. A major problem has been that state voter registration systems have been denying eligibility to thousands of voters, resulting in a lot of acrimony, as well as lawsuits. One of those cases, involving data on 200,000 voters in Ohio whose voter records did not match those in other state databases such as drivers’ licenses and Social Security records, went all the way to the US Supreme Court. In other places, eligible voters’ names have erroneously been flagged as those of convicted felons.
All this confusion started with the best intentions. Following the hotly contested and deadlocked election of 2000, Congress passed the Help America Vote Act (HAVA), which funded the establishment of, among other things, centralized databases within states to maintain digitized voter records. The states were required to authenticate these records against driver’s licenses, Social Security numbers, and other records.
As Mary Pat Flaherty wrote in the Washington Post:
“As the databases are implemented, voters' names and other information are verified against state driver's license records or Social Security records to determine their eligibility. Federal law allows each state to decide what constitutes a match — whether it will accept nicknames, for example. But states are not using "the best scientific knowledge known today" when they verify the information, said Herbert Lin, who is studying the issue for the federal Election Assistance Commission, which oversees election reforms.”
The mismatches are occurring because voter records may vary from database to database. “The problem is that databases are prone to errors such as misspellings and transposed numbers and, as a result, thousands of new voters across the country are coming up as non-matches and having to take additional steps to verify their identity,” observes Kim Zetter, writing in the Wired article "Voter Registration Databases and Purges — Part I". “Voters who have hyphenated names, names with non-traditional spellings or names transliterated from other alphabets (such as those from Middle East and Asian languages) are most likely to show up with discrepancies among databases and wind up on mismatch lists.”
Sound familiar? What these articles and posts are touching upon are the same issues that enterprises have been wrestling with for a number of years now – rationalizing and integrating information from silos of data that have arisen across their organizations. Many organizations have set out to integrate information from various parts of their organizations, only to find tangles of data.
As Lin said in the Washington Post article, state-of-the-art technology is not being applied against this problem when it comes to election management. The federal and state governments could probably learn a thing or two from the experiences other organizations have been having in dealing with data quality issues. Perhaps a good place to start for these government entities is data governance. Voter registration systems need common information models, combined with common integration platforms, robust identity resolution, and ultimately, perhaps even enterprise data warehouses and master data management to ensure that every eligible voter can exercise the right to vote.










4 Comments, Comment or Ping
Daragh O Brien (Director Publicity IAIDQ)
This is a nice post. The IAIDQ wrote about this at length on the iqtrainwrecks.com blog site (http://www.iqtrainwrecks.com) and we also issued a press release on this to the media in the run up to the election (see the IAIDQ Website for that press release).
The US is not alone in having 'issues' with its electoral register. I blogged at length about the problems in the Irish electoral register on my own blog and on the IQTrainwrecks site. Fundamentally the challenge is for government organisations to step above the level of thinking that sees 'scrap and rework' of information as being the solution (we tried that in Ireland and now we don't know how bad the register is any more) but to start to see their electoral register as a key information asset and treat it as such. nfortunately the problem is quite often ignored until the next time the electoral rolls are needed (in Ireland next year) and it only becomes an issue when it is already a crisis.
Now that is a set of circumstances we are all too familiar with in a business context.
Nov 12th, 2008
Joe McKendrick
Daragh:
Thank you for highlighting what is truly a global problem — especially as we rely more and more on digitized records. In fact, many corporations may have been spurred to act on data quality issues by legislation such as Sarbanes Oxley. Let's hope that by the next major election in 2012, we will have many more examples of best practices and new technology applications that will help clean up the mess.
Joe
Nov 13th, 2008
Daragh O Brien
Joe,
SoX and statutory regulations are only one piece of the puzzle unfortunately. In fact, in the case of the US Voter Register, it seems that legislation actually served to compound the problem by tying people to 'master data' sources which were of questionable quality.
Technology is also only just one piece of the puzzle. No amount of tools or software to inspect defects out will ameliorate the problem if the underlying processes and ways of thinking about information are not addressed as well.
The case in point is the futile attempt in Ireland to clean up the electoral register in 2006. Because the information wasn't being thought of as an asset with a life cycle and because there was insufficient thought leadership around the importance of the process, Ireland is now in a situation were we have no idea just how bad the register actually is.
Information/Data Quality requires people to rethink the importance of information to their mission. Tom Redman's latest book tackles that issue very well, particularly the issues of information as an asset.
Nov 28th, 2008
Joe McKendrick
Daraugh:
Precisely. Ash Parikh, who contributes frequently here at the Perspectives blogsite, frequently raises the point that businesses do not do enough on the back end to address data quality or integration issues.
http://blogs.informatica.com/perspectives/index.php/2008/09/06/it-is-call-about-the-data/
Another area businesses are just starting to tackle is the idea of data governance. Who owns the data? Who is responsible for it?
Dec 4th, 2008
Reply to “Data Quality in Voter Rolls: A Big Problem with a Familiar Ring”