or
Sign in to turn on 1-Click ordering.
 
 
More Buying Choices
40 used & new from £15.00

Have one to sell? Sell yours here
 
   
The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data
 
 

The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data (Paperback)

by Ralph Kimball (Author), Joe Caserta (Author) "Ideally, you must start the design of your ETL system with one of the toughest challenges: surrounding the requirements ..." (more)
3.0 out of 5 stars  See all reviews (2 customer reviews)
RRP: £30.99
Price: £18.14 & this item Delivered FREE in the UK with Super Saver Delivery. See details and conditions
You Save: £12.85 (41%)
o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o o
In stock.
Dispatched from and sold by Amazon.co.uk. Gift-wrap available.

Want guaranteed delivery by Tuesday, November 17? Choose Express delivery at checkout. See Details
26 new from £16.62 14 used from £15.00

Frequently Bought Together

The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data + The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling + The Data Warehouse Lifecycle Toolkit
Price For All Three: £64.20

Show availability and delivery details


Customers Who Bought This Item Also Bought

The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling

The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling

by Ralph Kimball
4.8 out of 5 stars (9)  £25.12
The Data Warehouse Lifecycle Toolkit

The Data Warehouse Lifecycle Toolkit

by Ralph Kimball
4.2 out of 5 stars (5)  £20.94
Microsoft Data Warehouse Toolkit: With SQL Server 2005 and the Microsoft Business Intelligence Toolset

Microsoft Data Warehouse Toolkit: With SQL Server 2005 and the Microsoft Business Intelligence Toolset

by Joy Mundy
4.7 out of 5 stars (3)  £21.09
Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance

Mastering Data Warehouse Aggregates: Solutions for Star Schema Performance

by Christopher Adamson
5.0 out of 5 stars (1)  £19.01
Oracle Data Warehousing and Business Intelligence Solutions: With Business Intelligence Solutions

Oracle Data Warehousing and Business Intelligence Solutions: With Business Intelligence Solutions

by Robert Stackowiak
£22.04
Explore similar items

Product details

  • Paperback: 528 pages
  • Publisher: John Wiley & Sons (24 Sep 2004)
  • Language English
  • ISBN-10: 0764567578
  • ISBN-13: 978-0764567575
  • Product Dimensions: 23.4 x 18.4 x 2.6 cm
  • Average Customer Review: 3.0 out of 5 stars  See all reviews (2 customer reviews)
  • Amazon.co.uk Sales Rank: 94,170 in Books (See Bestsellers in Books)

    Popular in this category:

    #14 in  Books > Computing & Internet > Databases > Data Storage & Management > Data Warehousing
  • See Complete Table of Contents

Customers Viewing This Page May Be Interested in These Sponsored Links

  (What is this?)
   Open Source ETL Leader opens new browser window
www.Talend.com/Open_Source_ETL_Tool  -  Leading Open Source Enterprise ETL Integration Tools. Download Now! 
   Comparison of ETL Tools opens new browser window
www.ETLtool.com  -  Buy report comparing 18 ETL tools Which tool suits your company best? 
   Open Source ETL opens new browser window
www.jitterbit.com  -  Easy, Affordable, Open Source Data Integration. Download Today! 
  
 

Product Description

Product Description

  • Cowritten by Ralph Kimball, the world′s leading data warehousing authority, whose previous books have sold more than 150,000 copies
  • Delivers real–world solutions for the most time– and labor–intensive portion of data warehousing–data staging, or the extract, transform, load (ETL) process
  • Delineates best practices for extracting data from scattered sources, removing redundant and inaccurate data, transforming the remaining data into correctly formatted data structures, and then loading the end product into the data warehouse
  • Offers proven time–saving ETL techniques, comprehensive guidance on building dimensional structures, and crucial advice on ensuring data quality


From the Back Cover

The single most authoritative guide on the most difficult phase of building a data warehouse

The extract, transform, and load (ETL) phase of the data warehouse development life cycle is far and away the most difficult, time–consuming, and labor–intensive phase of building a data warehouse. Done right, companies can maximize their use of data storage; if not, they can end up wasting millions of dollars storing obsolete and rarely used data. Bestselling author Ralph Kimball, along with Joe Caserta, shows you how a properly designed ETL system extracts the data from the source systems, enforces data quality and consistency standards, conforms the data so that separate sources can be used together, and finally delivers the data in a presentation–ready format.

Serving as a road map for planning, designing, building, and running the back–room of a data warehouse, this book provides complete coverage of proven, timesaving ETL techniques. Beginning with a quick overview of ETL fundamentals, it then looks at ETL data structures, both relational and dimensional. The authors show how to build useful dimensional structures, providing practical examples of techniques.

Along the way you’ll learn how to:

  • Plan and design your ETL system
  • Choose the appropriate architecture from the many possible options
  • Build the development/test/production suite of ETL processes
  • Build a comprehensive data cleaning subsystem
  • Tune the overall ETL process for optimum performance

Inside This Book (Learn More)
First Sentence
Ideally, you must start the design of your ETL system with one of the toughest challenges: surrounding the requirements. Read the first page
Explore More
Concordance
Browse Sample Pages
Front Cover | Copyright | Table of Contents | Excerpt | Index | Back Cover
Search inside this book:

Suggested Tags from Similar Products

 (What's this?)
Be the first one to add a relevant tag (keyword that's strongly related to this product)
 
etl
prakash
practicle techniques
livelogic
kimball
datawarehousing
data warehousing
data warehouse
business intelligence

Your tags: Add your first tag
 

What Do Customers Ultimately Buy After Viewing This Item?

The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data
56% buy the item featured on this page:
The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data 3.0 out of 5 stars (2)
£18.14
The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling
16% buy
The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling 4.8 out of 5 stars (9)
£25.12
The Data Warehouse Lifecycle Toolkit
16% buy
The Data Warehouse Lifecycle Toolkit 4.2 out of 5 stars (5)
£20.94
Microsoft Data Warehouse Toolkit: With SQL Server 2005 and the Microsoft Business Intelligence Toolset
9% buy
Microsoft Data Warehouse Toolkit: With SQL Server 2005 and the Microsoft Business Intelligence Toolset 4.7 out of 5 stars (3)
£21.09

 

Customer Reviews

2 Reviews
5 star:    (0)
4 star:
 (1)
3 star:    (0)
2 star:
 (1)
1 star:    (0)
 
 
 
 
 
Average Customer Review
3.0 out of 5 stars (2 customer reviews)
 
 
 
 
Share your thoughts with other customers:
Most Helpful Customer Reviews

 
28 of 29 people found the following review helpful:
2.0 out of 5 stars Wordy, vague and few "Practical Techniques", 27 Jan 2005
By N. Chivers - See all my reviews
(REAL NAME)      
Computing is an exact and unambiguous discipline; consequently I want my computer books to be written in an exact and unambiguous manner. "The Data Warehouse ETL Toolkit" falls far short of this requirement, being wordy, vague, overblown and crammed with jargon. Worst of all, I found there were very few "Practical Techniques" I could take away with me that would help me in my work.

Here's a sample sentence: "This section discusses what needs to go into the data-cleansing baseline for the data warehouse, including simple methods for detecting, capturing and addressing common data-quality issues and procedures for providing the organisation with improved visibility into data-lineage and data-quality improvements over time". Now imagine a whole book written like this. OK, I've taken this sentence out of context, but if I tell you that this was used to introduce a section - there are no preceding or trailing sentences - then I think I am starting to paint a picture.

The authors and publishers seem to have taken the attitude, "Why use a bullet point when a paragraph will do?". Text and examples have been embellished as if in an effort to prove how clever the authors are. A lot of jargon is employed (no glossary), but the reader is always left in doubt as to whether this is industry standard or idiom employed only by the authors.

I think this book could have been so much more useful if they had taken a worked example right through from start to finish. They could have explained where the real world may be different to this perfect model and drawn on their experiences to add colour. Also, if this truly was supposed to be a book of practical techniques, they should have highlighted them, say 1 to 100, through the text, as applicable.

So why two stars rather than none? Firstly, because there are some good nuggets of information in there, if you work hard to find them, and secondly, because the authors' interest for the subject does show through. They do have a knack of answering the important questions, but only after a long journey round the houses first.

Kimball and Caserta would probably be fantastic consultants to have on a big data warehousing project, unfortunately they are awful technical writers - only buy this book if nothing else covers the subject you are interested in.

Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)



 
5 of 5 people found the following review helpful:
4.0 out of 5 stars Woolly at times - but good overall, 16 Jun 2006
By John Ryan (Cambridge, UK) - See all my reviews
(REAL NAME)   
Problem with this book is it is a bit woolly and wordy (just like the previous reviewer described). However, the main difficulty is there simply is no other book around on the market. As a description of the entire end-to-end ETL process, including many subject areas I'd not even considered (eg. COBOL copy books), it's very good.

However, I'd say the REAL reason for buying this book is it works well with Ralph Kimballs other work "The Data Warehouse Toolkit", and gives an excellent summary of Dimensional Design. I guess the authors felt they must put this in to explain the background. Personally I found it invaluable.

Also the description of "real time ETL" was invaluable. Everyone's talking about it, but the book gives a credible outline solution.

Yes woolly, yes it uses 10 words when two would do, but overall I got a lot out of it.

Recommended.
Comment Comment | Permalink | Was this review helpful to you? Yes No (Report this)


Share your thoughts with other customers: Create your own review
 
 
 
Only search this product's reviews



Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 

   


Listmania!


Look for similar items by category


Look for similar items by subject


Feedback

Ad

Your Recent History

 (What's this?)

After viewing product detail pages or search results, look here to find an easy way to navigate back to pages you are interested in.