Tuesday, September 1, 2009

Scheduled web data extraction

Extracting Data on a Schedule

Extracting data from a website is the process by which a program automatically downloads the content of a page and parses it for specific information. In some cases you may want to run that extraction on a schedule. This is useful when you know the data will change at a regular interval, for example the current labor rate, the interest rate, or the federal housing rate. You can schedule a web extractor tool to go out once a day, collect this information, and store it in a local database for later use. One of the things I have found most valuable is to consistently scrape public records, like government auctions, and mash that data together with other statistics that I have mined from the internet.
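
To make the once-a-day collection described above a little more concrete, here is a minimal sketch in Python using only the standard library. Everything specific in it is an assumption made for illustration: the URL, the "Interest rate: N%" page layout, the `rates.db` file name, and the function names are all hypothetical, and a real page would need its own parsing rule.

```python
import re
import sqlite3
import urllib.request
from datetime import date

# Hypothetical source and page layout, for illustration only.
SOURCE_URL = "https://example.com/rates"
RATE_PATTERN = re.compile(r"Interest rate:\s*([0-9]+(?:\.[0-9]+)?)\s*%")


def fetch_page(url):
    """Download the page and return its text."""
    with urllib.request.urlopen(url, timeout=30) as response:
        return response.read().decode("utf-8", errors="replace")


def extract_rate(html):
    """Return the rate as a float, or None if the pattern is not found."""
    match = RATE_PATTERN.search(html)
    return float(match.group(1)) if match else None


def store_rate(conn, day, rate):
    """Save one reading per day; a rerun on the same day overwrites it."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS rates (day TEXT PRIMARY KEY, rate REAL)"
    )
    conn.execute(
        "INSERT OR REPLACE INTO rates (day, rate) VALUES (?, ?)",
        (day.isoformat(), rate),
    )
    conn.commit()


def run_once(conn, fetch=fetch_page):
    """One scheduled run: download, parse, store. Returns the parsed rate."""
    rate = extract_rate(fetch(SOURCE_URL))
    if rate is not None:
        store_rate(conn, date.today(), rate)
    return rate


if __name__ == "__main__":
    conn = sqlite3.connect("rates.db")  # hypothetical local database file
    try:
        run_once(conn)
    finally:
        conn.close()
```

The script does a single run and exits, so the schedule itself can live outside the program. On a Unix-like system, for instance, a cron entry such as `0 6 * * * python3 /path/to/extract_rates.py` (path hypothetical) would run it every morning at 6:00. Keying the table on the day means an accidental second run replaces that day's reading rather than duplicating it.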
