- Paperback: 304 pages
- Publisher: Manning Publications; 01 edition (14 Feb. 2001)
- Language: English
- ISBN-10: 1930110006
- ISBN-13: 978-1930110007
- Product Dimensions: 18.7 x 1.7 x 23.4 cm
- Average Customer Review: 4.3 out of 5 stars See all reviews (7 customer reviews)
- Amazon Bestsellers Rank: 880,362 in Books (See Top 100 in Books)
Data Munging with Perl: Techniques for Data Recognition, Parsing, Transformation and Filtering Paperback – 14 Feb 2001
Enter your mobile number or email address below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.
To get the free app, enter your mobile phone number.
If you are a seller for this product, would you like to suggest updates through seller support?
" . . . well written, informative, thought provoking . . . will be as relevant five years from now as it is today. . . . buy [one]." -- Dr. Dobbs Journal
From the Publisher
Recent reviews for Data Munging with Perl
"...Manning Publications continues its fine line of Perl books with the consistent and powerful Data Munging with Perl. Coders looking to transform data somehow and hackers who want to take advantage of Perl's unique features will improve their knowledge and understanding. If you find yourself working with files or records in Perl, this book will save you time and trouble." --Slashdot.org
"The book's chapters are concise, the coverage is comprehensive, and the examples are plentiful and relevant. I've been using Perl's data munging capabilities heavily for many years, and I still picked up some useful new insights from Cross' book." --Web Techniques magazineSee all Product description
Top Customer Reviews
If you do _any_ data manipulation with Perl then GET this book, it explains what Perl data structures(or modules) to use with what data, and then how to use that data structure...
The author writes and thinks like a programmer, so programmers can ultimately understand the topics/concepts and not just cut-paste someone elses code.
I also recommend 'OO Perl' by Damian Conway (same publisher).
These two Manning books are the only other Non-O'Reilly books I own.
I too found the chapter on Parse::RecDecent easy to understand having struggled with Damian's TPJ article. But this book to get on your way to being an expert munger.
The book is written for programmers or analysts who transform data as a regular part of their jobs. It assumes a beginning knowledge of Perl programming, as one might gain from reading Learning Perl. Part I introduces data munging as a recurring necessary evil and points out aspects of Perl that recommend it for this task. Part II surveys different types of unstructured and semi-structured data formats and suggests Perl-based strategies for working with them. PART III examines the limitations of simple data formats and discusses parsing strategies and specific techniques for working with HTML, XML and other hierarchical data structures. PART IV extracts some useful lessons from the previous chapters and suggests sources for additional study. The organization is logical and easy to follow.
Cross has written a well-designed book with helpful examples and insights. The accompanying book web site and author web site provide downloadable code and other resources. This book is of course most useful to those working in Perl. But many general concepts and strategies have transferred well to data munging tasks I have done in TextPipe.
One of Perl's mottos is: "There's more than one way to do it." A variety of ways are illustrated and explained in this book. Note that it is over ten years old and does not include the latest evolutions of the Perl language.
Most Recent Customer Reviews
The quintessential Perl activity is data processing, particularly in a Unix environment, where output is piped into a script from some other program, transformed, and spat out... Read morePublished on 21 July 2007 by Thing with a hook
This book is well written, and quite informative. But it should not be read in isolation, as it does gloss over quite a lot so that it remains focused on the key topic "data... Read morePublished on 9 Mar. 2001