The Practitioner's Guide to Data Quality Improvement and over 1.5 million other books are available for Amazon Kindle . Learn more

Buy New

or
Sign in to turn on 1-Click ordering.
Buy Used
Used - Very Good See details
Price: £29.89

or
Sign in to turn on 1-Click ordering.
 
   
Trade in Yours
For a £3.40 Gift Card
Trade in
More Buying Choices
Have one to sell? Sell yours here
Sorry, this item is not available in
Image not available for
Colour:
Image not available

 
Start reading The Practitioner's Guide to Data Quality Improvement on your Kindle in under a minute.

Don't have a Kindle? Get your Kindle here, or download a FREE Kindle Reading App.

The Practitioner's Guide to Data Quality Improvement (The MK/OMG Press) [Paperback]

David Loshin

RRP: £38.99
Price: £34.31 & this item Delivered FREE in the UK with Super Saver Delivery. See details and conditions
You Save: £4.68 (12%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
Only 5 left in stock (more on the way).
Dispatched from and sold by Amazon. Gift-wrap available.

Formats

Amazon Price New from Used from
Kindle Edition £25.73  
Paperback £34.31  
Trade In this Item for up to £3.40
Trade in The Practitioner's Guide to Data Quality Improvement (The MK/OMG Press) for an Amazon.co.uk gift card of up to £3.40, which you can then spend on millions of items across the site. Trade-in values may vary (terms apply). Learn more

Book Description

13 Dec 2010 0123737176 978-0123737175
Business problems are directly related to missed data quality expectations. Flawed information production processes introduce risks preventing the successful achievement of critical business objectives. However, these flaws are mitigated through data quality management and control: controlling the quality of the information production process from beginning to end to ensure that any imperfections are identified early, prioritized, and remediated before material impacts can be incurred. "The Practitioner's Guide to Data Quality Improvement" shares the fundamentals for understanding the impacts of poor data quality, and guides practitioners and managers alike in socializing, gaining sponsorship for, planning, and establishing a data quality program. This book shares templates and processes for business impact analysis, defining data quality metrics, inspection and monitoring, remediation, and using data quality tools. Never shying away from the difficult topics or subjects, this is the seminal book that offers advice on how to actually get the job done. It offers a comprehensive look at data quality for business and IT, encompassing people, process, and technology. It shows how to institute and run a data quality program, from first thoughts and justifications to maintenance and ongoing metrics. It includes an in-depth look at the use of data quality tools, including business case templates, and tools for analysis, reporting, and strategic planning.

Frequently Bought Together

The Practitioner's Guide to Data Quality Improvement (The MK/OMG Press) + Executing Data Quality Projects: Ten Steps to Quality Data and Trusted Information + Data Quality Assessment
Price For All Three: £103.57

Buy the selected items together


Product details


More About the Author

Discover books, learn about writers, and more.

Product Description

Review

"There is NOTHING like this out there that I am aware of, and certainly nothing from anyone with same stature as David Loshin."--David Plotkin, Wells Fargo Bank "The book provides a comprehensive look at data quality from both a business and IT perspective. It does not just cover technology issues, but discusses people, process, and technology. And that is important, because this is the mix that is needed in order to initiate any type of quality improvement regimen."--Data Technology Today Blog

About the Author

David Loshin is President of Knowledge Integrity, Inc., a company specializing in data management consulting. The author of numerous books on performance computing and data management, including "Master Data Management" (2008) and "Business Intelligence - The Savvy Manager's Guide" (2003), and creator of courses and tutorials on all facets of data management best practices, David is often looked to for thought leadership in the information management industry.

Inside This Book (Learn More)
Browse Sample Pages
Front Cover | Copyright | Table of Contents | Excerpt | Index
Search inside this book:

What Other Items Do Customers Buy After Viewing This Item?


Customer Reviews

There are no customer reviews yet on Amazon.co.uk.
5 star
4 star
3 star
2 star
1 star
Most Helpful Customer Reviews on Amazon.com (beta)
Amazon.com: 4.8 out of 5 stars  12 reviews
3 of 3 people found the following review helpful
5.0 out of 5 stars Improve Your Data Quality By Reading This Book 16 Nov 2010
By Data Guy - Published on Amazon.com
Format:Paperback
David Loshin's new book, The Practitioner's Guide to Data Quality Improvement, is well-organized, helpful, and on topic. One of my pet peeves is the poor state of data quality rampant just about everywhere these days... and Loshin's text offers expert guidance on how organizations can remedy that situation.

The book provides a comprehensive look at data quality from both a business and IT perspective. It does not just cover technology issues, but discusses people, process, and technology. And that is important, because this is the mix that is needed in order to initiate any type of quality improvement regimen.

In the book, Loshin shows how to institute and run a data quality program, from start to finish. And this is all helpful information. But I think my favorite chapter of the book is the one on Data Quality Service Level Agreements. This is so because data quality is not a project that can be started and completed. It needs to become an on-going component of our everyday procedures. And only through adopting a service level agreement mentality when it comes to data quality can we ever hope to make data quality monitoring and improvement an accepted, regular component of what we do.
2 of 2 people found the following review helpful
4.0 out of 5 stars Good introductory tour on planning the data quality effort 5 Mar 2011
By Erik Gfesser - Published on Amazon.com
Format:Paperback|Amazon Vine™ Review (What's this?)
Some of the other reviews that have been posted here provide some interesting observations from perspectives that are not always centered on data architecture or general enterprise architecture, and the hope of this reviewer is that he will be able to offer feedback to others on this text based on his consulting experience in these areas. In his preface, David Loshin comments that "this book is intended to provide the fundamentals for developing the enterprise data quality program, and is intended to guide both the manager and the practitioner in establishing operational data quality control throughout an organization, with particular focus on the ability to build a business case for instituting a data quality program", "the assessment of levels of data quality maturity", "the guidelines and techniques for evaluating data quality and identifying metrics related to the achievement of business objectives", "the techniques for measuring, reporting, and taking action based on these metrics", and "the policies and processes used in exploiting data quality tools and technologies for data quality improvement".

With these goals in mind, this reviewer thinks Loshin succeeded in this effort. Taking into account the fact that data quality is an enormous practice area, and success requires understanding of both data and the business to succeed, this introductory text walks the reader step-by-step through a considerable number of topics over which many authors would likely stumble. Some of the explanations that Loshin provides, such as the one in the chapter entitled "Developing a Business Case and a Data Quality Road Map" on how data flaws can incur business impacts, are extremely well done, especially when married with effective diagrams. And in his chapter entitled "Metrics and Performance Improvement", the author provides an explanation on drilling through key performance indicators that this reviewer has not seen elsewhere until this effort, and the presentation is exceedingly well done. Other areas of this text that this reviewer especially appreciates are the chapters entitled "Data Requirements Analysis", "Metadata and Data Standards", and "Inspection, Monitoring, Auditing, and Tracking".

This reviewer however would like to make potential readers of this book aware that most of what Loshin provides here is high level walkthroughs and examples of pertinent elements within data quality, rather than practical advice on how to approach much of the lower level work that should be expected to take place on a day-to-day basis. For example, in the chapter entitled "Entity Identity Resolution", the author provides a section on matching algorithms that briefly discusses parsing and standardization, abbreviation expansion, edit distance, phonetic comparison, and n-gramming, which consumes just a few short paragraphs. The author does not explain that there are many more matching algorithms currently in use in industry, that in most cases matching exercises need to take into account multiple rather than single algorithms in isolation, that in the world today internationalization takes an ever more important role when performing matching, and that there is a wide variety of commercial tooling available that needs to be assessed based on the needs of the organization.

However, armed with this knowledge the reader is sure to make use of this work by utilizing it while planning and strategizing data quality, as well as making use of it as introductory material to understanding what it might take to pursue efforts that require a higher level of data quality maturity such as master data management (MDM), in which case this reviewer recommends "Enterprise Master Data Management: An SOA Approach to Managing Core Information" by Allen Dreibelbis, Eberhard Hechler, Ivan Milman, Martin Oberhofer, Paul van Run, and Dan Wolfson (see my review). In the opinion of this reviewer, what Loshin provides here is best suited for managers looking to piece together all of the steps associated with data quality pursuits as well as get a better handle on how each of the steps are interrelated and whether each is a requirement or just an option, possibly looking to solve some aspects of data quality in an evolutionary, piecemeal fashion rather than revolutionary endeavor.
2 of 2 people found the following review helpful
5.0 out of 5 stars Judge this book by its cover 17 Jan 2011
By Aceto - Published on Amazon.com
Format:Paperback|Amazon Vine™ Review (What's this?)
Mr. Loshin has produced a hands-on, practical work on data quality improvement and management. If you need some theory but a nuts and bolts focus with a framework laid out for you, this is the book for you. You can do a thorough assessment of the quality of your data across several dimensions, and develop a roadmap for making a program of specific improvements.

Transactional data is distinguished here from informational data, especially from what quality means. They differ in nature and in function. Informational data is often mishandled by trying to apply the same standards and principles as you would for transactional data. Because transactional data is primary and in full view of your business operations, it can overshadow informational data. While quality is necessary and vital for transactional data, it is not sufficient for an optimally profitable process. You may be in a line of business where the race is won on the competitive advantage gotten from complex, accurate and flexible informational data. Quality flaws here are not always obvious up front.

I use this book to:

- develop action plans at all levels from assessments to strategy to hard dollar reporting.
- teach myself, my staff and my colleagues (each needing different education)

On the technical level, this book is certainly not the last word; but it is detailed enough and thorough enough to get real work done while organizing your research and development efforts to plan an enterprise level program of data quality improvement both transactional and informational. I have been able to define a stream of benefits at each stage of the program.

The vulnerabilities I can address using this book are related to regulatory non-compliance, audit failures for data integrity in reporting and contractual liabilities. On the other hand, I use the data profiling techniques to improve revenue streams as allowed by contracts and related instruments. In particular, contracts that provide for incentives (and penalties), premiums, bonuses and performance schedules. Data profiling, parsing and standardization are the big efforts to achieving profit results especially when you are dealing with heterogeneous data sets, for example transactional portfolios from different customers or partners where you need to establish audit worthy claims to monies due you. We leave a lot of money on the table that we either overlook or miscalculate or do not claim in accordance with agreed terms and conditions. All interested parties need to have confidence in the reliability and accuracy of data.

Informationally, profit and performance align by themselves to create optimal processes and systems. Data quality tools are used to engineer that alignment. Of course, the really nasty side of improving data quality is the organizational. The graves are full of visionaries, pioneers, prophets or whatever buzz word you like or despise. Here I mean the doers, and not the happy babble types that float in and out of our rather more difficult working lives. So Mr. Loshin here has a good list of topics in the organizational area, so that you can see and understand what are the obstacles and any possible paths for progress. Maturity, readiness, will, governance, policy and leadership are all pre-requisites for making quality improvements that can be measured in hard dollar pay-offs. He has working templates and descriptions for charting all the components of organizational maturity.. He has all the worksheets and exhibits you will need to determine and to document your assessments and recommendations. I followed his roadmap without making a big political (confrontational) deal about getting agreement, one player at a time until the executive leadership decided what geniuses they were for coming up with it.

I would have been satisfied enough with just the first five chapters, just to get me launched. But the real meat is in the metrics. You will see the standard old warhorse of statistical process control. I have really come to like the granularity of daily performance at the shop floor level because you can find the weak and the broken events as they happen, especially as you start to deploy process changes. People really respond when you are able to show them within a day of launch if they are still doing x or not yet doing y. By the way, all the mathematics here are early high school level, so you can expect to roll them out to all your analysts, leads and coordinators with just a little presentation, review and coaching.

In highly federated organizations, your first steps will present themselves out of common ground, dependencies and other basic interfaces. Communities of interest and best practices follow especially if you can establish a single showcase.

Mr. Loshin lays out the dimensions of data quality, each requiring a method of measurement. He says that each line of business can pick these up on a dashboard that can eventually roll up to an enterprise view. Furthermore, he wants these dimensions to create a hierarchy of categories that lay out according to a pyramidal framework where the broad base is constructed from simple measurements and many rules, upward toward the enterprise view of few rules and complex metrics. This undertaking is ambitious, far beyond the statistical process control and continuous process improvement of the industrial beginnings.

His dimensions are categorized between the intrinsic and the contextual. Both are problematic. Contextual dimensions are by definition formed in relation to one another, tested for consistency and coherence. There is no way of knowing if you have done a complete job at any point in time. Anyway all these systems and process exist in a dynamic world anyway, so you are never truly done, but you get the point. But the idea is that if they make relational sense, you are likely on the right track.

We are still far from some grand unification theory of business process and data quality. Indeed, just when you get to the critical detail of this schema, you hit a patch like this:

Yet although information policies (such as those governing security or privacy) are a major source of data quality assertions, they imply the need for data governance, which is covered in chapter 7. - [i.e. the previous chapter]

Huh? If you read it really fast you can pretend that it means something. But just try to parse it. In a tough and complicated exposition where you most need clarity, he goes all muddy. This is no time for guessing, which is precisely what you are left to do. In a moment of a Godel-like nightmare, Mr. Loshin makes completeness and coherence two of the contextual dimensions. He does say consistency rather than coherence, but to little comfort. Talk about asking for trouble, especially under the scrutiny of those who have not yet bought into this program...

Yet his failure to attain the elegance of mathematical theory or the rigor of applied science does not invalidate his approach. Seeing is believing, which is his ultimate aim. The intrinsic dimensions are more straightforward and serve a bit as guideposts in otherwise uncharted territory. Again it is his tables that provide you with handholds to his framework. Chances are you will not be able to farm out a lot of this work to your people, especially at the beginning. You must make the high level customization to your organization, your business context. Then others can carry it forward into their own work and departments. A core team of architects, analysts, process mappers, project coordinators, data stewards and report producers will require a material operating expense just to get started, about a six month period, and about eighteen to be really firing on all cylinders. You will be integrated into the familiar world of budgeting and forecasting, capacity planning, performance measurement, and project/data governance.

His treatment of a Data Requirements Analysis Process shows you how to transform the way you manage data and processes. The BIG however, is that it presumes a level of project and development life cycle management far advanced from what is usually existing in operational business units.

A further complication is that informational data by nature is a highly derived and constructed. Because strategic foresight and organizational cooperation (let alone external cooperation) are rare in our customary approach to work, whether business or government, these derivations and constructions are so much more divergent at their origins. You can build cooperation into your methodologies and governance in order to make data standards a priority. The best thing you can start with is a proper assessment of the state of standardization (and data quality, for that matter) and decide on ways to move toward your desired point of arrival. A common fallacy, stemming from shortsightedness, is to choose highly proprietary solutions, confusing obscurity with innovation and competitive advantage. When leadership imagines that it is way out ahead of the pack, it is actually only temporarily so, because you often motivate the seemingly disadvantaged to cooperate against the outlier. Think Beta Max. By contrast, when contemplating the critical employment of meta data, Mr. Loshin does so via ISO 11179.

I cannot resist pointing out that, as he is illustrating the problem of data quality, he gives as an example all the different ways of representing California. He includes "06" as the number in alphabetical order. The table he references meanwhile clearly shows California in position five. I almost think he is doing this on purpose to prove his point in a self-validating example. That or he just wants to se if I am paying attention.

By the way, for those who are old cranks when it comes to grammar, this book may be the ultimate in showing why perfect grammar can be worth millions, just by making data representation and communication clear. Grammar is especially effective in defeating that old bugaboo - ambiguity.

Mr. Loshin covers both remediation (correction and correction, to name two types) as well as future planning and design. He gives a substantial discussion of Service Level Agreements. Finally, I know of no better place to go for help in beginning data profiling and parsing; he gives each a chapter. He shows how it all comes together in a nice master data management schema. Thusly, he returns to the top of the pyramid, the Enterprise view, well connected, powerful and clearly represented. This is the only business book I am likely to wear out before I am done with it.

Please vote so if you found this review helpful. Thank you.
Were these reviews helpful?   Let us know

Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 

Search Customer Discussions
Search all Amazon discussions
   
Related forums


Listmania!


Look for similar items by category


Feedback


Amazon.co.uk Privacy Statement Amazon.co.uk Delivery Information Amazon.co.uk Returns & Exchanges