4 used & new from £31.92

Have one to sell? Sell yours here
 
 
Programming Spiders, Bots and Aggregators in Java (Transcend Technique)
 
 

Programming Spiders, Bots and Aggregators in Java (Transcend Technique) (Paperback)

by Jeff Heaton (Author) "The Internet is built of many related protocols, and more complex protocols are layered on top of system level protocols ..." (more)
No customer reviews yet. Be the first.

Available from these sellers.


2 new from £59.86 2 used from £31.92

Customers Viewing This Page May Be Interested in These Sponsored Links

  (What is this?)
   Aggregate Network Links opens new browser window
www.netoptics.com  -  Use monitoring tools more efficiently with link aggregators 
   Aggregators opens new browser window
Ask.com  -  Find the Best Results for Aggregators
  
 

Product details


Product Description

Amazon.co.uk Review

If you want to surf the Web programmatically, rather than by firing up your favourite browser, then Programming Spiders, Bots and Aggregators in Java is the book for you. Spiders are programs that crawl the Web following hyperlinks, and are used to populate databases for search engines. Aggregators pull together data from several different sites, for example to present and compare prices from a number of online retailers, while "Bot" is a generic term covering any program that pulls data from the Internet.

Readers are expected to have some knowledge of Java, but the author's patient step-by-step approach makes this title accessible to developers of every skill level. He begins by explaining the basics of Internet programming, showing how to send and receive data using sockets, and how to parse HTML to get useful results. Bots need to know how to fill in forms, accept cookies and connect over HTTPS, to mention three of the topics covered here. The book continues with specifics such as how to construct a spider, how to build a bot for multiple sites within a certain category and how to get an aggregated view of many Web sites.

Towards the end, the book touches on SOAP, a protocol for Web services that in theory could make bots redundant. It will be a long time, if ever, before enough sites offer SOAP services to make that happen. In the meantime, bots are essential tools for exploiting Web resources, and this thorough and well-written title is an excellent place to learn how to use them. The bundled CD includes a bot package, which you can incorporate into your own applications, along with Java tools and resources. --Tim Anderson



Product Description

The content and services available on the web continue to be accessed mostly through direct human control. But this is changing. Increasingly, users rely on automated agents that save them time and effort by programmatically retrieving content, performing complex interactions, and aggregating data from diverse sources. Programming Spiders, Bots, and Aggregators in Java teaches you how to build and deploy a wide variety of these agents–from single–purpose bots to exploratory spiders to aggregators that present a unified view of information from multiple user accounts.

You will quickly build on your basic knowledge of Java to quickly master the techniques that are essential to this specialized world of programming, including parsing HTML, interpreting data, working with cookies, reading and writing XML, and managing high–volume workloads. You′ll also learn about the ethical issues associated with bot use––and the limitations imposed by some websites.

This book offers two levels of instruction, both of which are focused on the library of routines provided on the companion CD. If your main concern is adding ready–made functionality to an application, you′ll achieve your goals quickly thanks to step–by–step instructions and sample programs that illustrate effective implementations. If you′re interested in the technologies underlying these routines, you′ll find in–depth explanations of how they work and the techniques required for customization.

Inside This Book (Learn More)
First Sentence
The Internet is built of many related protocols, and more complex protocols are layered on top of system level protocols. Read the first page
Explore More
Concordance
Browse Sample Pages
Front Cover | Copyright | Table of Contents | Excerpt | Index | Back Cover
Search inside this book:

Suggested Tags from Similar Products

 (What's this?)
Be the first one to add a relevant tag (keyword that's strongly related to this product)
 
java
webbots
spidering
spider
jeff heaton
http
bot

Your tags: Add your first tag
 

What Do Customers Ultimately Buy After Viewing This Item?

Webbots, Spiders, & Screen Scrapers
30% buy
Webbots, Spiders, & Screen Scrapers
£25.60
Programming Spiders, Bots and Aggregators in Java (Transcend Technique)
24% buy the item featured on this page:
Programming Spiders, Bots and Aggregators in Java (Transcend Technique)
Spidering Hacks: 100 Industrial-Strength Tips & Tools
21% buy
Spidering Hacks: 100 Industrial-Strength Tips & Tools 2.0 out of 5 stars (1)
£12.18
HTTP Programming Recipes for C# Bots
18% buy
HTTP Programming Recipes for C# Bots
£24.64

Customer Reviews


There are no customer reviews yet.
Video reviews
Video reviews
Amazon now allows customers to upload product video reviews. Use a webcam or video camera to record and upload reviews to Amazon.



Customer Discussions

This product's forum
Discussion Replies Latest Post
No discussions yet

Ask questions, Share opinions, Gain insight
Start a new discussion
Topic:
First post:
Prompts for sign-in
 

   


Look for similar items by category


Look for similar items by subject


Feedback

Ad

Your Recent History

 (What's this?)

After viewing product detail pages or search results, look here to find an easy way to navigate back to pages you are interested in.