2 - Design Features in WebLinker
WebLinker consists of two distinct subsystems:
MOMSpider is used as the body of the traversal robot. This has hooks into the WebLinker routines, so that we can parse the contents of every HTML node that MOMSpider visits.
The robot is used to do two things:
When a HTTP request referring to an infostructure is received, the script is called. It then works out which document on the server has been requested, and returns the contents of that file.
One extra feature provided by the use of a script is that it allows us to return extra information to the client. We define a special LRN as follows:
For an infostructure FARMING on the server www.agri.org, we define the
LRN http://www.agri.org/lrn/FARMING?fragsto return the complete list of valid fragment identifiers for that infostructure in text/plain format. This allows the WebLinker robot to use such a LRN to retrieve the fragment identifier mapping table for a remote infostructure easily. Also, the
LRN http://www.agri.org/lrn/FARMING?Listreturns a formatted HTML document giving the list of fragment identifiers and a brief description of what they are. This can automatically be extracted from the HTML nodes in the infostructure: Using the HTML fragment from Section 2.3.3 as an example, the following descriptions are generated automatically, one for the fragment Pastimes and one for the file farming_33.html itself:
Pastimes Pastimes of sheep farmers farming_33.html Bolivian Sheep Farmers
Generated with WebMaker