Administrator Guide: Adding a New Schedule

There are two ways to provide the collection with URLs to begin crawling from. One of the ways is to add them to the Starting URLs screen. These URLs are automatically added to the Todo list when the Todo list is empty and crawling has been initiated. This feature is useful if you always need to start crawling from the same location.

Schedules can be created by clicking on the “Schedule Updates” hyperlink on the Status screen. Once on the new screen, click “New Schedule” at the very top.

Start & Stop Times:
The Start time is when the collection will be crawled. The Stop time is when the collection will be stopped, if it’s still being crawled when that time is reached.

Task:
This tells the spider what you would like it to do. It can either crawl from the Todo list or crawl the previously collection documents updating them.

Frequency:
This is how often the spider will crawl the collection. Collections can be crawled Hourly, Daily, Weekly, or Monthly.

Empty Tables:
This instructs the spider to empty the specified table(s) before starting the collection.

For example, if Errors is checked, the spider will empty the errors table before crawling. This would be particularly useful when crawling your own Web site or Intranet and want to make sure that it visits every page regardless of whether or not a page was unavailable during the last crawl.

Full-Text Index:
This instructs the spider to update the Full-Text Index at the scheduled stop time.

 

 

 

 

<< Back To Table Of Contents

 

©2009 Innerprise . All rights reserved. Privacy Statement - Link To Us