Kindle Price: £17.76

Save £13.23 (43%)

includes VAT*
* Unlike print books, digital books are subject to VAT.

The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data, Kindle Edition
by Ralph Kimball and Joe Caserta

4.2 out of 5 stars 5 customer reviews

Product Description

From the Back Cover

The single most authoritative guide on the most difficult phase of building a data warehouse

The extract, transform, and load (ETL) phase of the data warehouse development life cycle is far and away the most difficult, time-consuming, and labor-intensive phase of building a data warehouse. When it is done right, companies can maximize their use of data storage; when it is not, they can end up wasting millions of dollars storing obsolete and rarely used data. Bestselling author Ralph Kimball, along with Joe Caserta, shows you how a properly designed ETL system extracts the data from the source systems, enforces data quality and consistency standards, conforms the data so that separate sources can be used together, and finally delivers the data in a presentation-ready format.

Serving as a road map for planning, designing, building, and running the back room of a data warehouse, this book provides complete coverage of proven, timesaving ETL techniques. Beginning with a quick overview of ETL fundamentals, it then looks at ETL data structures, both relational and dimensional. The authors show how to build useful dimensional structures, providing practical examples of techniques. Along the way you'll learn how to:

  • Plan and design your ETL system
  • Choose the appropriate architecture from the many possible options
  • Build the development/test/production suite of ETL processes
  • Build a comprehensive data cleaning subsystem
  • Tune the overall ETL process for optimum performance

About the Author

RALPH KIMBALL, PhD, founder of the Kimball Group, has been a leading visionary in the data warehousing industry since 1982 and is one of today's best-known speakers and educators. He is the author of several bestselling titles on data warehousing, including The Data Warehouse Toolkit (Wiley).

JOE CASERTA is the founder of Caserta Concepts, LLC, a data warehousing consulting firm. He writes frequently for print and online magazines, and is an active contributor to DWList, the major online community for data warehousing professionals.


Product details

  • Format: Kindle Edition
  • File Size: 5228 KB
  • Print Length: 528 pages
  • Publisher: Wiley; 1 edition (10 Dec. 2007)
  • Sold by: Amazon Media EU S.à r.l.
  • Language: English
  • ASIN: B006JAWFE4
  • Text-to-Speech: Enabled
  • Word Wise: Not Enabled
  • Enhanced Typesetting: Enabled
  • Average Customer Review: 4.2 out of 5 stars 5 customer reviews
  • Amazon Bestsellers Rank: #232,711 Paid in Kindle Store (See Top 100 Paid in Kindle Store)


Customer Reviews

4.2 out of 5 stars

Top Customer Reviews

Format: Paperback Verified Purchase
Computing is an exact and unambiguous discipline; consequently I want my computer books to be written in an exact and unambiguous manner. "The Data Warehouse ETL Toolkit" falls far short of this requirement, being wordy, vague, overblown and crammed with jargon. Worst of all, I found there were very few "Practical Techniques" I could take away with me that would help me in my work.
Here's a sample sentence: "This section discusses what needs to go into the data-cleansing baseline for the data warehouse, including simple methods for detecting, capturing and addressing common data-quality issues and procedures for providing the organisation with improved visibility into data-lineage and data-quality improvements over time". Now imagine a whole book written like this. OK, I've taken this sentence out of context, but if I tell you that this was used to introduce a section - there are no preceding or trailing sentences - then I think I am starting to paint a picture.
The authors and publishers seem to have taken the attitude, "Why use a bullet point when a paragraph will do?". Text and examples have been embellished as if in an effort to prove how clever the authors are. A lot of jargon is employed (no glossary), but the reader is always left in doubt as to whether this is industry standard or idiom employed only by the authors.
I think this book could have been so much more useful if they had taken a worked example right through from start to finish. They could have explained where the real world may be different to this perfect model and drawn on their experiences to add colour. Also, if this truly was supposed to be a book of practical techniques, they should have highlighted them, say 1 to 100, through the text, as applicable.
So why two stars rather than none?
Read more ›
32 people found this helpful.
By John Ryan (Hall of Fame, Top 10 Reviewer) on 16 Jun. 2006
Format: Paperback Verified Purchase
The problem with this book is that it is a bit woolly and wordy (just as the previous reviewer described). However, the main difficulty is that there simply is no other book on the market. As a description of the entire end-to-end ETL process, including many subject areas I'd not even considered (e.g. COBOL copy books), it's very good.

However, I'd say the REAL reason for buying this book is that it works well with Ralph Kimball's other work, "The Data Warehouse Toolkit", and gives an excellent summary of Dimensional Design. I guess the authors felt they must put this in to explain the background. Personally I found it invaluable.

Also the description of "real time ETL" was invaluable. Everyone's talking about it, but the book gives a credible outline solution.

Yes, it's woolly, and yes, it uses 10 words when two would do, but overall I got a lot out of it.

Recommended.
12 people found this helpful.
Format: Paperback Verified Purchase
Fantastic, down-to-earth explanations with real business situations.
Worth buying if you are new to DW technology.
One person found this helpful.
Format: Paperback Verified Purchase
I love all of Kimball's books, this one included. He is the truth in DW/BI development :)
Format: Paperback Verified Purchase
!!perfect!!