FREE Delivery in the UK.
Only 8 left in stock (more on the way).
Dispatched from and sold by Amazon. Gift-wrap available.
Quantity:1
Tika in Action has been added to your Basket
+ £2.80 UK delivery
Used: Very Good | Details
Sold by thriftbooks-USA
Condition: Used: Very Good
Comment: All items ship from the USA.  Arrival time is usually 2-3 weeks. Book has appearance of light use with no easily noticeable wear. Spend Less. Read More. Your satisfaction is guaranteed.
Have one to sell?
Flip to back Flip to front
Listen Playing... Paused   You're listening to a sample of the Audible audio edition.
Learn more
See this image

Tika in Action Paperback – 11 Dec 2011

5.0 out of 5 stars 1 customer review

See all formats and editions Hide other formats and editions
Amazon Price
New from Used from
Paperback
"Please retry"
£28.99
£20.43 £3.05
Note: This item is eligible for click and collect. Details
Pick up your parcel at a time and place that suits you.
  • Choose from over 13,000 locations across the UK
  • Prime members get unlimited deliveries at no additional cost
How to order to an Amazon Pickup Location?
  1. Find your preferred location and add it to your address book
  2. Dispatch to this address when you check out
Learn more
£28.99 FREE Delivery in the UK. Only 8 left in stock (more on the way). Dispatched from and sold by Amazon. Gift-wrap available.
click to open popover

Special Offers and Product Promotions

Enter your mobile number below and we'll send you a link to download the free Kindle App. Then you can start reading Kindle books on your smartphone, tablet, or computer - no Kindle device required.
Getting the download link through email is temporarily not available. Please check back later.

  • Apple
  • Android
  • Windows Phone

To get the free app, enter your mobile phone number.




Product details

  • Paperback: 256 pages
  • Publisher: Manning Publications; 1 edition (11 Dec. 2011)
  • Language: English
  • ISBN-10: 1935182854
  • ISBN-13: 978-1935182856
  • Product Dimensions: 18.7 x 1.6 x 23.5 cm
  • Average Customer Review: 5.0 out of 5 stars  See all reviews (1 customer review)
  • Amazon Bestsellers Rank: 683,805 in Books (See Top 100 in Books)

Product Description

About the Author

Chris Mattmann has a wealth of experience in software design, and in the construction of large-scale data-intensive systems. His work has infected a broad set of communities, ranging from helping NASA unlock data from its next generation of earth science system satellites, to assisting graduate students at the University of Southern California (his Alma mater) in the study of software architecture, all the way to helping industry and open source as a member of the Apache Software Foundation. When he's not busy being busy, he's spending time with his lovely wife and son braving the mean streets of Southern California.

Jukka Zitting is a core Tika developer with over a decade of experience of open source content management. Jukka works as a Senior Developer for the Swiss content management company Day Software, and is a member of the JCP expert group for the Content Repository for Java Technology API. He is a member of the Apache Software Foundation and the chairman of the Apache Jackrabbit project.

Customer Reviews

5.0 out of 5 stars
5 star
1
4 star
0
3 star
0
2 star
0
1 star
0
See the customer review
Share your thoughts with other customers

Top Customer Reviews

Format: Paperback Verified Purchase
I was already using Tika embedded in Jackrabbit (JCR) but this book got me into another level of Tika understanding.
Comment One person found this helpful. Was this review helpful to you? Yes No Sending feedback...
Thank you for your feedback.
Sorry, we failed to record your vote. Please try again
Report abuse

Most Helpful Customer Reviews on Amazon.com (beta)

Amazon.com: HASH(0x978833a8) out of 5 stars 8 reviews
5 of 6 people found the following review helpful
HASH(0x97978198) out of 5 stars A must-read for anyone working with content in Java 11 Jan. 2012
By Mr Nick Burch - Published on Amazon.com
Format: Paperback
For people working with content, a common problem is "what kind of thing is this binary file, and what does it contain?". Apache Tika provides a solution for these issues, and Tika In Action tells you how to make use of it! The book starts with a great introduction to content, content types and metadata, then quickly gets you started on using Tika. Next we're guided through how to identify the type of content and files, then how to get out the textual contents, formatting and metadata. Finally we're given guides on extending Tika, and integrating it into Search Systems, Content Management Systems and Scientific Processing. The book is well written, full of great examples, and is a brilliant way to get started (and work your way to expert!) on this common problem area, identifying and extracting information from content.
1 of 1 people found the following review helpful
HASH(0x979781ec) out of 5 stars The book starts with a great introduction to content 19 Dec. 2014
By Tasha M - Published on Amazon.com
Format: Paperback
vides a solution for these issues, and Tika In Action tells you how to make use of it! The book starts with a great introduction to content, content types and metadata, then quickly gets you started on using Tika. Next we're guided through how to identify the type of content and files, then how to get out the textual contents, formatting and metadata. Finally we're given guides on extending Tika, and integrating it into Search Systems! GREAT!!
0 of 1 people found the following review helpful
HASH(0x97978624) out of 5 stars I'm giving three stars rather than one because... 1 April 2014
By Adam Churvis - Published on Amazon.com
Format: Paperback Verified Purchase
...it's not the authors' faults that the publisher mis-positioned this book in the wrong series. This is definitely *NOT* an "In Action" book; it is a book with rambling yet accurate background discussions about general functionality and how one element might be integrated with another. Just hardly any code or concrete examples of how to actually create the Tika portion of a usable solution.

It's page after page of generalized talk and talk and talk and talk and -- LOOK! A diagram with a smiley face! -- and talk and talk and then one tiny snippet of code completely isolated from any other code that might be needed to make something actually happen. It's like ordering a book titled "Hot Models in Bikinis" and getting a book that talked endlessly about the history of the development of the bikini entirely in text, then talked about the history of textiles used in the manufacturing of bathing suits, and then the timeline in the day of the life of a model, etc, and that was it.

Thumb through your favorite "In Action" series book and you'll find something very different: brief targeted discussion, code that shows what was just discussed -- wait for it -- *IN ACTION!*, brief targeted discussion, code that shows it in action, lather-rinse-repeat, index, back cover. For a good example of how "Tika In Action" should have been structured, look at "Lucene In Action, Second Edition."

All this being said, even if I had been able to hold this book in my hands and leaf through it before making my purchase decision, I WOULD have bought it because of the extremely valuable background discussions, about 10% to 20% of which I would have read to get a better understanding of the subject that is Tika. But I would have then immediately gone looking for the book I really needed, which would have shown Tika actually "In Action."
2 of 3 people found the following review helpful
HASH(0x97978630) out of 5 stars Thoroughly explains what Tika is and how to use it 12 Jan. 2012
By Richard J. Wagner - Published on Amazon.com
Format: Paperback
This book covers Tika from high-level overview to low-level usage. There is plenty of developer-centric coverage here, including relevant API usage. Java classes and architecture are explained right beside the high-level explanations of the user-level overviews, so you are given a very good understanding of what makes Tika tick. This book is especially strong in emphasizing document type detection, content extraction, metadata extraction, and language detection. You'll also learn how to partner Tika with other tools like Lucene as you build your information library.

All things considered, a very readable book and a great resource for anyone using Tika.
2 of 3 people found the following review helpful
HASH(0x97978af8) out of 5 stars Useful book for "content developers" 25 Jan. 2012
By Alexey Ott - Published on Amazon.com
Format: Paperback
Tika in Action is very good book on media type detection & content extraction using the Apache Tika framework. By using Tika for text & metadata extraction you can index & search documents in many existing formats. You can also extend Tika with support new formats that are need in your work. And its open source nature, makes it very attractive for both open source & corporate developers, and allows flexible integration with many different systems, like, ManifoldCF, Lucene, UIMA, etc.

Book provides comprehensive description of framework itself, how to use it for different tasks (file format & language detection, text/metadata extraction, etc.), how to extend it to support new file formats (both detection & data extraction). Besides this, there are several chapters dedicated to real world use-cases - how Apache Tika is used in different projects.

I would recommend this book for everybody who need to perform media type detection and/or text/metadata extraction, especially who're working with indexing & searching of heterogeneous documents.

P.S. I gave 4 stars only because I would like to have more detailed description of how to create complex signatures for file formats (although, this information could be found on project's pages).
Were these reviews helpful? Let us know


Feedback