- Paperback: 466 pages
- Publisher: O'Reilly Media; 1 edition (1 Nov. 2012)
- Language: English
- ISBN-10: 1449319793
- ISBN-13: 978-1449319793
- Product Dimensions: 17.8 x 2.3 x 23.3 cm
- Average Customer Review: 4.6 out of 5 stars See all reviews (30 customer reviews)
- Amazon Bestsellers Rank: 4,992 in Books (See Top 100 in Books)
- See Complete Table of Contents
Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython Paperback – 1 Nov 2012
|New from||Used from|
- Choose from over 13,000 locations across the UK
- Prime members get unlimited deliveries at no additional cost
- Find your preferred location and add it to your address book
- Dispatch to this address when you check out
Frequently Bought Together
Customers Who Bought This Item Also Bought
Enter your mobile number or email address below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.
To get the free app, enter your mobile phone number.
Data Wrangling with Pandas, NumPy, and IPython
About the Author
Wes McKinney is the main author of pandas, the popular open sourcePython library for data analysis. Wes is an active speaker andparticipant in the Python and open source communities. He worked as aquantitative analyst at AQR Capital Management and Python consultantbefore founding DataPad, a data analytics company, in 2013. Hegraduated from MIT with an S.B. in Mathematics.
What Other Items Do Customers Buy After Viewing This Item?
Top Customer Reviews
This book is really about pandas (the author is the main author of pandas, after all), and less about either numpy or ipython or other tools. I don't mean that as a criticism. It is precisely as it should be. If you are doing strictly data analysis in python, it is pandas primarily that is center stage, with tools like numpy/ipython etc playing supporting roles. What this book does convey, however, is just how well all these tools work together and how they form a big team for scientific/numerical computing in python.
This book is detailed and extensive. It is entirely focused on well thought out follow-along-yourself code examples, and this makes it a remarkable effective way to learn pandas especially, but also to learn more about numpy/ipython/matplotlib etc.
If you do data analysis in python, this book is a must have. It is highly recommended too for anyone doing scientific/numerical computing in python generally.
Well worth the money!
As well as Pandas you'll cover IPython, NumPy and Matplotlib in enough depth to get you started with data analysis and visualization.
You don't need to be a python expert but some python knowledge, and some experience of R, will definitely help.
The book is well structured, breaking down the different topics into well defined chapters which deal with topics such as data wrangling, data loading, time series analysis and more. It has a tutorial feel to it, where you're building your knowledge as you go and applying it all the time - I really enjoyed this approach.
Python For Data Analysis is primarily about doing stuff, about how to manipulate data, clean data and more. And that makes it special to me. If you have spent any amount of time working with data from different sources you'll know that the nitty gritty stuff is something you spend a lot of time on. That's why having such a book and such a framework is golden.
Some of the Pros:
I especially liked the fact that, throughout the whole book, small, self contained examples are given. This allows you to easily generate the data you need and to play with it. This makes the code more accessible and the book more fun.
A detailed appendix on "Python Language Essentials" for people, like me, who have used python in the past but need a bit of a refresher course before reading the book.
Mini-Cons/Things to note:
The book focuses on a slightly older Pandas version, however, the author highlights areas where you may need to do things differently in the future.Read more ›
Let me be more specific. In the interest of full disclosure, I should note that I got this book for free via O'Reilly's Blogger Review program. I have some experience in Python and, during the time of my exposure to it, I always read that Python was very powerful in the Data Analysis field, be it Scientific Computing, Financial Computing (up to a point, of course) and others, so naturally, I wanted to read a book to get to study Python's usage in this field. What got me more hooked into reading this book is that this particular one was written by an expert on the field. The author of the book is also the author of the Pandas library. When I finally got through it, here are my comments on it:
- First this book gives you some information on why the data analysis field matters. For instance it refers to an example, using data analysis to come up with data sets to feed a machine learning algorithm.
- The book has short and concise (and above all, easy to follow) code examples that demonstrate the point of the text very quickly.
- The book provides several realistic use cases of the demonstrated content, so that you can get a good idea of what data analysis is all about.
- Covers (in varying degrees) xml parsing, interaction with HTML and databases. It even makes a small reference to MongoDB!
- It also covers string manipulation (including regular expressions) which is very nice!
- Has a whole chapter dedicating to plotting and visualizing.
- Has several chapters on Numpy and Pandas!Read more ›
Most Recent Customer Reviews
This book is for anyone that wants to get up to speed fast in using Python for data analysis / science. Read morePublished 7 days ago by Amazon Customer
The book is a 2012 edition republished in 2013. Several command and function examples given in the book don't work because the software platform has changed. Read morePublished 4 months ago by Amazon Customer
Great intro to pandas. It would have been interesting to see some complex examples and some snippets.Published 8 months ago by matteo pallini
A solid introduction and reference point for Pandas, iPython, NumPy and matplotlib. If you're looking for somewhere to start there's no better place.Published 13 months ago by B. M. Lester
Be sure this matches your expectations, if yes then it is a good reference bookPublished 14 months ago by Mr W.