|
Getting
data (Data Sources)
Introduction
There are a few different ways to obtain
the raw data from which HH can process.
You can extract from any text based file
by using the Files mode. You can
also extract from web pages. Three
different ways are provided to achieve
this. The first method is the easiest
one, called Web pages by next button.
This can be used to extract numerated
pages like you find in a google search
or at Ebay. You can also use the Web
pages by url generator that also
collects numerated pages, but lets you
build the url yourself. Use Web pages by url list from file
when you already have a list of url's
you want to collect. This is
particularly useful when you first
collect interesting url's and put them
in a file HH can use.
All modes will be explained in separate
sections.
Next >
|