|
Getting
data (Data Source)
Web pages by next button
This method can be used to extract numerated pages like you find in a google search or at Ebay.
It lets you easily harvest pages that
are typically generated by a search
query, like seraching for a Porsche on
Ebay. You get results on a first page,
and you can visit consecutive pages via
a construct like: [ 1 | 2
| 3 | Next ]. The last
part we call the "Next button", and this
is used in HH to access all pages. This
works as follows (You will find the
example in "next button example.hhp").
In the Data Source window, select
Web pages by next button. Under
Base url, put in the url that
presents the first page of the search
results, in this case "http://www.happyharvester.com/test/top1000/page0.html".
Click on the Preview button and this
page is displayed.
Switch from Source preview to
Browser preview. At the bottom of
the page, you will find a next button.
This we will put in the Next button
definition field. The text of the
next button reads "NEXT »", and this is
what we will use as next button
defnition.
That is all we need to access web pages
via a next button construct.
To read of more ways to get data, go
here.
To read about how to extract data, go
here.
|