Pentaho Data Integration 4 Cookbook [Paperback]

Adrian Pulvirenti, Maria Roldan
5.0 out of 5 stars (1 customer review)
RRP: £27.99
Price: £26.59
You Save: £1.40 (5%)

Formats

  • Kindle Edition: £14.00 (Amazon price)
  • Paperback: £26.59 (Amazon price)

Frequently Bought Together

Pentaho Data Integration 4 Cookbook + Pentaho Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration + Pentaho Solutions: Business Intelligence and Data Warehousing with Pentaho and MySQL
Price For All Three: £70.77


Product details

  • Paperback: 352 pages
  • Publisher: PACKT PUBLISHING (23 Jun 2011)
  • Language English
  • ISBN-10: 1849515247
  • ISBN-13: 978-1849515245
  • Product Dimensions: 23.5 x 19 x 1.9 cm
  • Average Customer Review: 5.0 out of 5 stars (1 customer review)
  • Amazon Bestsellers Rank: 78,130 in Books

Product Description

Pentaho Data Integration (PDI, also called Kettle), one of the leading data integration tools, is widely used for all kinds of data manipulation, such as migrating data between applications or databases, exporting data from databases to flat files, data cleansing, and much more. Do you need quick solutions to the problems you face while using Kettle?

Pentaho Data Integration 4 Cookbook explains Kettle features in detail through clear and practical recipes that you can quickly apply to your solutions. The recipes cover a broad range of topics including processing files, working with databases, understanding XML structures, integrating with Pentaho BI Suite, and more.

Pentaho Data Integration 4 Cookbook shows you how to take advantage of all aspects of Kettle through a set of practical recipes, organized so that you can quickly find solutions to your needs. The initial chapters explain the details of working with databases, files, and XML structures. Then you will see different ways of searching data, executing and reusing jobs and transformations, and manipulating streams. Further, you will learn all the available options for integrating Kettle with other Pentaho tools.

Pentaho Data Integration 4 Cookbook has plenty of recipes with easy step-by-step instructions to accomplish specific tasks. There are examples and code that are ready for adaptation to individual needs.

Learn to solve data manipulation problems using the Pentaho Data Integration tool Kettle.

About the Author

Adrián Sergio Pulvirenti


Adrián was born in Buenos Aires, Argentina, in 1972. He earned his Bachelor's degree in Computer Sciences at UBA, one of the most prestigious universities in South America.


He has dedicated more than fifteen years to developing desktop and web-based software solutions. Over the last few years he has been leading integration projects and the development of BI solutions.


María Carina Roldán


María Carina was born in Esquel, Argentina, in 1970. She earned her Bachelor's degree in Computer Science at UNLP in La Plata and then moved to Buenos Aires, where she has lived since 1994.


She has worked as a BI consultant for more than ten years. Over the last four, she has been dedicated full time to developing BI solutions using Pentaho Suite. Currently she works for Webdetails, one of the main Pentaho contributors.


She is the author of Pentaho 3.2 Data Integration: Beginner's Guide published by Packt Publishing in April 2010.


Customer Reviews

Most Helpful Customer Reviews
Essential book 15 July 2011
Format:Paperback
Pentaho Data Integration (PDI) has reached its 4th version with a lot of interesting new features and capabilities.
This versatile tool is a must for everyone working with data integration.
Transformations and jobs are the means by which PDI carries out a task: reading, writing, manipulating and integrating data, or performing mathematical or logical operations. All of this is typical of an ETL tool (where ETL stands for Extract, Transform and Load).

Do you need to move data from an Excel file to a database, or from a database to a text file? Do you need to extract data from an LDAP server, FTP, mail, a log file, a compressed file, a web service or a web site?
Does all this need to be done regularly and automatically? Would it be cool to be notified by email if the process fails?

Sure, you can do it in a lot of ways, but an ETL tool gives you the necessary help.
In addition, an open source ETL tool like Pentaho Data Integration has a strong and skilled community behind it to help you.

This book provides a lot of step-by-step examples (called "recipes") with many practical, useful and very smart hints and strategies for developing transformations and jobs.
New steps (a step is a basic task, for example reading from a file, sorting, grouping, calculating, ...) are very well described and explained.

The chapters of this book cover in depth everything you need to know to understand the software, be ready to write your own transformations, and become productive quickly.
I found the space dedicated to the following topics very useful:
- reading and writing files: unstructured and structured text files, Excel and OpenOffice spreadsheets
- XML files and validation with DTD and XSD schemas
- using the fuzzy match step
- reuse and flexibility of transformations (named parameters, variables, mappings)
- sending email with a log about the status of the execution
- file management: retrieving files from servers such as FTP, copying, moving, deleting, comparing
- integration of Kettle with the Pentaho Suite (Pentaho Reporting Engine)

The way all these subjects are explained is progressive and gradual. The use of targeted examples makes the reading very pleasant and easy.
I suggest this book to you.
Most Helpful Customer Reviews on Amazon.com (beta)
Amazon.com:  7 reviews
2 of 2 people found the following review helpful
Covers use cases an advanced user will enjoy 17 July 2011
By Richard J. Wagner - Published on Amazon.com
Format:Paperback
This book does not teach the basics of using Kettle. It's a collection of best practices for accomplishing things with Kettle (or Pentaho Data Integration, its commercial cousin).

Kettle itself is intuitive enough to learn, so this book could serve as a good resource even for Kettle novices. (They'll have to self-study other materials, perhaps the product documentation, to get off the ground.) Once a basic level of expertise is obtained, the patterns and practices given in this book will be of use.

Use cases for common scenarios are well represented. (Examples: how to read data from a database, dealing with fixed-format and comma-delimited files, working with XML, consuming a web service, generating reports.) These were all expected, so no extra credit for these topics, though it's nice to have them all documented in one place for future reference. There are also quite a few recipes for things I'd never before encountered, like parsing unstructured files (e.g. a Log4j log file), writing out JSON, producing Cartesian products given two lists, and matching values using fuzzy comparison logic. These topics were pleasant surprises; I can imagine practical uses for many of them. As an experienced ETL user, I can assure you that anyone doing real production work with an ETL tool will find a few things of value here.

If you have a need for integration work and don't enjoy a lot of low-level coding, you probably owe it to yourself to try Kettle or another ETL product. If you're using ETL for anything beyond dirt-simple scenarios, you'll probably save yourself some time and effort by reviewing the best practices contained here.
1 of 1 people found the following review helpful
Great Follow Up - Will save many g**gle-hours! 7 July 2011
By Data Aggregator - Published on Amazon.com
Format:Paperback
PDI4_Cookbook is worth owning even if you have the 'other' two Kettle books on hand given the depth of that open source product. Easy to follow, it could be used as a first book on PDI once you get through the basic install/sample documents from the Pentaho.com site. It is well organized, up to date with PDI4 features and the recipes are for the most part truly useful in the ETL domain. The authors spend a sizable amount of space on the more obscure but useful Kettle facilities (e.g., sub-transformations or generating sample data, etc.) so this book will pay off in web searches for solutions. Recommended for all PDI developers!
Great Intermediate PDI Reference 26 Aug 2011
By C. A. Meadows - Published on Amazon.com
Format:Paperback
As with her earlier book, PDI 3.2 - Beginner's Guide, Maria has successfully written a great guide for those who are just starting out with Pentaho Data Integration (aka Kettle). I recommend PDI Cookbook for intermediate users: those who have graduated from the Beginner's Guide but still require some guidance on using the beginner to intermediate features. If the reader is past this stage, I suggest picking up a copy of Pentaho Kettle Solutions and using the PDI Cookbook as a great desktop reference. The Cookbook is written with easy, step-by-step recipes for various standard 'gotchas' like reading structured or unstructured files.

Of particular interest for advanced users will be the last three chapters, which discuss how to integrate PDI with the rest of the Pentaho Business Intelligence suite of tools, how to reuse transformations and jobs, and how to collect metadata on the processes being created in those transformations and jobs. If the reader finds that building the recipes takes up too much time, the full set of code is available on the publisher's website, as well as the sample databases on which the recipes are built.

In short, PDI Cookbook is another great reference book for Pentaho Data Integration, filling a gap that was not covered by the Beginner's Guide or Pentaho Kettle Solutions.

Disclaimer: In the interest of full disclosure, Packt Publishing asked me to write this review and offered a copy of one of their other published works for my trouble. This has in no way changed my opinion of PDI Cookbook.