Enter your mobile number or email address below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.

  • Apple
  • Android
  • Windows Phone
  • Android

To get the free app, enter your mobile phone number.

Kindle Price: £44.64
includes VAT*
* Unlike print books, digital books are subject to VAT.

These promotions will be applied to this item:

Some promotions may be combined; others are not eligible to be combined with other offers. For details, please see the Terms & Conditions associated with these promotions.

Deliver to your Kindle or other device

Deliver to your Kindle or other device

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (The Morgan Kaufmann Series in Data Management Systems) by [Witten, Ian H., Frank, Eibe]
Kindle App Ad

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (The Morgan Kaufmann Series in Data Management Systems) 2nd , Kindle Edition

3.7 out of 5 stars 3 customer reviews

See all 4 formats and editions Hide other formats and editions
Amazon Price
New from Used from
Kindle Edition
"Please retry"

Summer Sale
Choose from over 450 books on sale from 99p. Shop now
Get a £1 reward for movies or TV
Enjoy a £1.00 reward to spend on movies or TV on Amazon Video when you purchase any Amazon Kindle Book from the Kindle Store (excluding Kindle Unlimited, Periodicals and free Kindle Books) offered by Amazon.co.uk. A maximum of 1 reward per customer applies. UK customers only. Offer ends at 23:59 on Wednesday, September 27, 2017. Terms and conditions apply

Product description


“This book presents this new discipline in a very accessible form: both as a text to train the next generation of practitioners and researchers, and to inform lifelong learners like myself. Witten and Frank have a passion for simple and elegant solutions. They approach each topic with this mindset, grounding all concepts in concrete examples, and urging the reader to consider the simple techniques first, and then progress to the more sophisticated ones if the simple ones prove inadequate. If you have data that you want to analyze and understand, this book and the associated Weka toolkit are an excellent way to start.”
― From the foreword by Jim Gray, Microsoft Research

“It covers cutting-edge, data mining technology that forward-looking organizations use to successfully tackle problems that are complex, highly dimensional, chaotic, non-stationary (changing over time), or plagued by. The writing style is well-rounded and engaging without subjectivity, hyperbole, or ambiguity. I consider this book a classic already!”
― Dr. Tilmann Bruckhaus, StickyMinds.com


As with any burgeoning technology that enjoys commercial attention, the use of data mining is surrounded by a great deal of hype. Exaggerated reports tell of secrets that can be uncovered by setting algorithms loose on oceans of data. But there is no magic in machine learning, no hidden power, no alchemy. Instead there is an identifiable body of practical techniques that can extract useful information from raw data. This book describes these techniques and shows how they work. The book is a major revision of the first edition that appeared in 1999. While the basic core remains the same, it has been updated to reflect the changes that have taken place over five years, and now has nearly double the references. The highlights for the new edition include thirty new technique sections; an enhanced Weka machine learning workbench, which now features an interactive interface; comprehensive information on neural networks; a new section on Bayesian networks; plus much more. It includes algorithmic methods at the heart of successful data mining including tried and true techniques as well as leading edge methods.

There is performance improvement techniques that work by transforming the input or output. It includes a downloadable Weka, a collection of machine learning algorithms for data mining tasks, including tools for data pre-processing, classification, regression, clustering, association rules, and visualizationin in a new, interactive interface.

Product details

  • Format: Kindle Edition
  • File Size: 8652 KB
  • Print Length: 560 pages
  • Publisher: Morgan Kaufmann; 2 edition (13 July 2005)
  • Sold by: Amazon Media EU S.à r.l.
  • Language: English
  • Text-to-Speech: Enabled
  • X-Ray:
  • Word Wise: Not Enabled
  • Enhanced Typesetting: Enabled
  • Average Customer Review: 3.7 out of 5 stars 3 customer reviews
  • Amazon Bestsellers Rank: #1,588,514 Paid in Kindle Store (See Top 100 Paid in Kindle Store)
  • Would you like to tell us about a lower price?

Customer reviews

3.7 out of 5 stars
Share your thoughts with other customers

Top customer reviews

Format: Paperback
Books explains basics of machine learnig in a way that quite easy to understand. does not go deep into maths involved, but sufficiently to allow understandinf of algorithms explained.

Very usefull especially if you plan to use weka datamining tool as pretty much everythin available in weka is explained in this book to degree that you choose suitable algorithms and tune them correctly
Comment One person found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse
Format: Paperback
I've tried three times to read this book.

But it is so badly written I can't get through it! It doesn't flow well and it seems to jump into things with little or no context in many places.

At times I even had to refer to other books such as 'Speech and Language Processing: an Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition' by Jurafsky to get a proper explanation and work through of the theories.

I'll be looking for another book in this area - don't buy this one.
Comment One person found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse
Format: Paperback
Having read the first edition, the authors earn the extra rating because they've managed to improve on their work and practical WEKA resource offering. Without a doubt, an essential read for people who are both new and experienced in the fields of data mining, descriptive & predictive analytics or state & behavioural modelling.

The volume of material on the market today is still quite limited and in the gap between the first and second edition of this book, quite a lot has actually changed in the field. In my view, book content has only marginally progressed with the times, perhaps in favour of attempting to attract and activate new members, practictioners and commercially oriented researchers to the fore of data mining. It's a bold step to evolve material as the field evolves; those breaking new ground in this area should be more visible and offered greater support.

I believe that there is room in the market now for some revised materials covering anomalised commercial implementations of Advanced Data Mining & AI Concepts. A small community of authors could plug this gap really well.
1 Comment 3 people found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse

Most Helpful Customer Reviews on Amazon.com (beta) (May include reviews from Early Reviewer Rewards Program)

Amazon.com: 3.8 out of 5 stars 38 reviews
2 of 2 people found the following review helpful
5.0 out of 5 stars Excellent Beginning Text for Software Engineers 25 Mar. 2010
By E. Kohlwey - Published on Amazon.com
Format: Paperback Verified Purchase
I chose this book after looking at a number of options. I was not disappointed. The text is clearly written for individuals with an bachelor-level education in computer science. The author prefers pseudocode and text explanations of algorithms to equations, and when he does use equations they use clear, commonly understandable notation rather than the terse greek alphabet soup preferred by many of the more mathematically oriented authors.

It should be pointed out that about 10% of the text of this book is devoted simply as a user manual for an open source MLA package called Weka. When I first realized this I almost flipped; I really didn't want a book that was devoted to gaining a surface understanding of a particular implementation of a set of algorithms. After reading through, I can tell you it is not. All the algorithms are explained well enough that you could implement them and work out simple examples on paper.

I should note also that Weka, as well as a lot of the algorithms in this book, don't parallelize well (or obviously). This is an excellent point to get your feet wet and do some exploratory analysis, but if you're past that point and want to learn about crunching big (TB+) data you should look elsewhere.

One area that the text does not cover (and, for many software engineers this is not a fault) is some of the mathematics behind some of the algorithms the author proposes. For instance, in the author's description of linear regression using SGD he glosses over the math of actually calculating the gradient by saying "there's a matrix inversion involved and its available in prepackaged software." I'm not saying this is bad, because if you're a good software engineer the first thing you'll do it look for an existing implementation that you can alter to fit your needs, so he's right. It just may not be what mathematicians or more theory-oriented computer scientists expect.
36 of 37 people found the following review helpful
4.0 out of 5 stars Very readable book on Data Mining and ML 9 Oct. 2005
By K. Greene - Published on Amazon.com
Format: Paperback Verified Purchase
This book is very easy to read and understand. Unlike Hastie's Statistical Learning book, it is not geared towards those with an expert level knowledge of statistics, and instead takes time to explain functions and formulas for the person with a decent but not extrordinary understanding of statistical/math concepts. For example, their description of a Gaussian was the clearest I've seen. On the other hand, if you're math/statistics background is considerable, you may find this book somewhat simplistic or tedious.

The book has a good coverage of techniques and algorithms, although I was somewhat disappointed that they do not mention Influence Diagrams, considering the amount of coverage of both decision trees and Bayesian techniques. Their discussion of Combining Multiple Models, however, is well done, and is not covered to this extent in most books I've seen. I also like how they broke out the discussion of input and output (knowledge representation) into their own chapters.

Addendum 10/30: After reading a good hunk of this book I still agree with most of what I said earlier, but I do think the authors could have gone into graphical models a lot more. At the end of the discussion on Bayesian networks, Markov networks and other graphical models are mentioned very briefly and the author says they are very big in ML right now, but he doesn't say why they didn't describe them further. It might have something to do with the organization of the book. Graphical models almost need a chapter of their own but the book's chapters discuss all techniques in one chapter but with varying levels of detail.
6 of 6 people found the following review helpful
4.0 out of 5 stars A great introduction to data mining and machine learning 23 July 2007
By James Pine - Published on Amazon.com
Format: Paperback Verified Purchase
I bought this book in the hopes that it would help me better explore the data from the Netflix Prize contest, which it did. I had been reading numerous Wikipedia articles, scientific papers, etc. on line and felt it would be useful to have a more general tome on the subject. This book covers many of the common, overarching themes i.e. clustering, neural networks, linear regression, etc. to varing degree. I only wish the examples involved slightly more complex data sets and more pseudo code was provided. I suppose since the book is very closely tied to WEKA, one could always dig through the source code of that application; but I feel that the authors could have provided a bit more of the strictly algorithm relevant code in the book.
4 of 4 people found the following review helpful
3.0 out of 5 stars A little too wordy for my tastes, but good 3 Jun. 2008
By Jason C. Maestri - Published on Amazon.com
Format: Paperback Verified Purchase
This book was pretty good. I have to admit that for the first hundred or so pages, I was feeling very impatient. All of that information could have been conveyed in about 25 pages, and been much easier to read. But there are some very good examples in here, and it is worth reading. If you are looking for something more technical, try "Pattern Recognition and Machine Learning", by Christopher M. Bishop or "The Elements of Statistical Learning" by Hastie, Tibshirani, and Friedman.
3 of 3 people found the following review helpful
5.0 out of 5 stars Thorough, well-written, and crystal-clear explanations. 9 Jun. 2008
By Jay L - Published on Amazon.com
Format: Paperback Verified Purchase
Highly recommend this book for a practical introduction to the theory and applications of Machine Learning. Great book if you are looking to ACTUALLY implement some machine learning systems, prefer to learn via diagrams, a "how-stuff-works"-style explanation, and skip much of the equations and heavy math that fills similar books.
Obviously, this book is a perfect companion to the Weka machine toolbox, which is quickly becoming a standard, invaluable research toolbox for many.
Were these reviews helpful? Let us know
click to open popover