Getting data (Data Source)


Web pages by next button


This method extracts numbered pages such as those you find in a Google search or on eBay. It lets you easily harvest pages that are typically generated by a search query, such as searching for a Porsche on eBay. The results start on a first page, and you reach the following pages through a construct like: [ 1 | 2 | 3 | Next ]. We call the last part the "Next button", and HH uses it to access all pages. This works as follows (you will find the example in "next button example.hhp").

In the Data Source window, select Web pages by next button. Under Base url, enter the URL of the first page of the search results, in this case "http://www.happyharvester.com/test/top1000/page0.html". Click the Preview button to display this page.

Switch from Source preview to Browser preview. At the bottom of the page, you will find a next button. Its text reads "NEXT »", and this is what we enter in the Next button definition field.

That is all we need to access web pages via a next button construct.
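
For readers who want to picture what following a next button involves, here is a minimal Python sketch of the general idea. It is not HH's actual implementation, and HH may match the next button differently. The names NextLinkFinder, find_next_url and harvest_pages, the fetch parameter and the max_pages limit are all hypothetical. The sketch assumes the next button is an ordinary link whose visible text equals the Next button definition.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class NextLinkFinder(HTMLParser):
    """Remembers the href of the first <a> whose visible text equals `label`."""

    def __init__(self, label):
        super().__init__(convert_charrefs=True)  # "&raquo;" arrives as "»"
        self.label = label
        self.next_href = None
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text collected inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            link_text = "".join(self._text).strip()
            if self.next_href is None and link_text == self.label:
                self.next_href = self._href
            self._href = None


def find_next_url(html, page_url, label):
    """Return the absolute URL behind the next button, or None if there is none."""
    finder = NextLinkFinder(label)
    finder.feed(html)
    finder.close()
    if finder.next_href is None:
        return None
    return urljoin(page_url, finder.next_href)


def harvest_pages(base_url, label, fetch, max_pages=100):
    """Start at base_url and follow the next button until it disappears.

    `fetch` is any callable that takes a URL and returns the page's HTML.
    Already visited URLs and max_pages (an assumed safety limit) stop loops.
    """
    pages = []
    seen = set()
    url = base_url
    while url is not None and url not in seen and len(pages) < max_pages:
        seen.add(url)
        html = fetch(url)
        pages.append((url, html))
        url = find_next_url(html, url, label)
    return pages
```

Passing the download step in as fetch keeps the sketch independent of any particular HTTP library. Called with the Base url from the example and the label "NEXT »", harvest_pages would return the pages in the order they were visited.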

To read about more ways to get data, go here.
To read about how to extract data, go here.