Getting data (Data Sources)


Introduction


There are several ways to obtain the raw data that HH processes. The Files mode extracts from any text-based file. HH can also extract from web pages, and provides three modes for this. The easiest is Web pages by next button, which collects numbered pages such as those in Google search results or on eBay. Web pages by url generator also collects numbered pages, but lets you build the URL yourself. Web pages by url list from file is for when you already have a list of URLs you want to collect; it is particularly useful when you first gather interesting URLs and save them in a file HH can read.
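To illustrate the idea behind numbered pages and URL lists, here is a minimal Python sketch. It is not part of HH and does not reflect how HH builds URLs internally. The function names, the `{page}` placeholder, the example.com address, and the one-URL-per-line file layout are all assumptions made for illustration; check the section on each mode for the format HH actually expects.

```python
from pathlib import Path


def generate_page_urls(template: str, first: int, last: int, step: int = 1) -> list[str]:
    """Return one URL per page number by filling the {page} placeholder.

    The {page} placeholder is a convention of this sketch, not HH syntax.
    """
    return [template.format(page=n) for n in range(first, last + 1, step)]


def write_url_list(urls: list[str], path: Path) -> None:
    """Write the URLs to a text file, one per line (assumed layout)."""
    path.write_text("\n".join(urls) + "\n", encoding="utf-8")
```

For example, `generate_page_urls("https://example.com/search?page={page}", 1, 3)` yields the URLs for pages 1, 2, and 3, and `write_url_list(urls, Path("urls.txt"))` saves them to a file for later collection.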

All modes will be explained in separate sections.
