Chapter |
Name |
URL |
Notes |
Introduction |
n.a. |
n.a. |
|
Chapter 1 |
n.a. |
n.a. |
|
Chapter 2 |
n.a. |
n.a. |
|
Chapter 3 |
Hello World! |
http://www.WebbotsSpidersScreenScrapers.com/hello_world.html | Target for your first webbot |
Chapter 4 |
n.a. |
n.a. |
n.a. |
Chapter 5 |
n.a. |
n.a. |
n.a. |
Chapter 6 |
Form example |
search/search.php |
Used as an example form emulation |
Form analyzer |
http://www.WebbotsSpidersScreenScrapers.com/form_analyzer.php |
Analyzes submitted forms |
Chapter 7 |
Sample webpage |
http://www.schrenk.com | Sample webpage |
Sample image |
http://www.schrenk.com/north_beach.jpg |
Sample image |
Chapter 8 |
Sample store |
http://www.webbotsSpidersScreenscrapers.com/buyair |
This simple sample store is used to monitor prices. |
Chapter 9 |
Nasa Viking |
http://www.nasa.gov/mission_pages/viking/index.html | This is a NASA page with a lot of images for testing the Image Download Webbot |
Chapter 10 |
Hyper-reference test page |
http://www.WebbotsSpidersScreenScrapers.com/page_with_broken_links.php | This page links to pages with links in various conditions (poorly defined, broken, internal errors, etc) |
501 Error page |
http://www.WebbotsSpidersScreenScrapers.com/501_error_page.php | Creates an HTTP 501 error |
Chapter 11 |
Generic Search Page |
http://www.WebbotsSpidersScreenScrapers.com/search | Exmpample (static) seartch engine used by search ranking webbot |
Chapter 12 |
RSS Page 1 |
http://www.lasvegassun.com/feeds/headlines/all |
Your should be able to substitue any of these pages with any other valid RSS feed. Google "RSS" for more examples. |
RSS Page 3 |
http://www.startribune.com/rss/1557.xml |
RSS Page 3 |
http://www.lasvegassun.com/feeds/headlines/all |
Chapter 13 | n.a. | n.a. | |
Chapter 14 | n.a. | n.a. | |
Chapter 15 | n.a. | n.a. | |
Chapter 16 | n.a. | n.a. | |
Chapter 17 |
Zip code form |
http://www.WebbotsSpidersScreenScrapers.com/zip_code_form.php | Exmpample zip code finding application |
Chapter 18 | n.a. | n.a. | |
Chapter 19 | n.a. | n.a. | |
Chapter 20 |
Basic Authentication Example |
http://www.WebbotsSpidersScreenScrapers.com/basic_authentication/ |
These three URL provide practice areas to write autoauthenticating webbots The user names and passwords are published in the book. |
Cookie Authentication Example |
http://www.WebbotsSpidersScreenScrapers.com/cookie_authentication/ |
Query Authentication Exmaple |
http://www.WebbotsSpidersScreenScrapers.com/query_authentication/ |
Chapter 21 |
Cookie writing example |
http://www.WebbotsSpidersScreenScrapers.com/EXAMPLE_writing_cookies.php | This web page writes a temporary cookie and a permanent cookie to your browser or webbot |
Chapter 22 | n.a. | n.a. | |
Chapter 23 | n.a. | n.a. | |
Chapter 24 | n.a. | n.a. | |
Chapter 28 |
Page redirection |
http://www.WebbotsSpidersScreenScrapers.com/head_redirection_test.php | This web page performs an HTTP header redirection after a five second delay. |
Simple form |
http://www.WebbotsSpidersScreenScrapers.com/easy_form.php | This form contains hidden values, parsed by LISTING 25-9. |
Chapter 29 |
Sample XML |
www.WebbotsSpidersScreenScrapers.com/29_7.php | While a webbot doesn't care, to view corrently in a browser, this file would need an .XML extension. |
Chapter 29 |
Example of a light-weight interface |
www.WebbotsSpidersScreenScrapers.com/29_9.php | This file is a webbot interface and not intended to be read in a browser. If you are reading this file in a browser, look at the page source to see the correct formatting. |
Chapter 30 | n.a. | n.a. | |
Chapter 31 | n.a. | n.a. | |