Base URL(s)

Syntax: one or more URLs, one per line

This is the address where the web walker will start walking your site. If the whole site is to be searched, simply enter your web address, for example "http://www.example.com". If the search is to be limited, specify the address to start the search or create a page listing the URLs to search. The search will only return information from your web site - no off-site searching will be done. Directory URLs should include a final forward slash "/". Example - "http://www.example.com/mysite/". If you have a virtual domain that just redirects to another URL, enter the destination URL as your Base URL instead of your virtual domain name.

You may specify multiple base URLs to index multiple sites; the Search Appliance's idea of a "site" is a single host as identified by the hostname portion of a URL. Therefore http://www.example.com, http://www2.example.com, and http://example.com would all be considered different sites.

In version 4.02.1046373961 Feb 27 2003 and later, the special "protocol" http-post or https-post may be used for a Base URL. This uses the POST method instead of the GET method to fetch the URL, using the query string as POST data (it must be URL-encoded). This can be used to start walking at a login page form that requires POST instead of GET. Note that the URL stored in the html table will have the -post and query string removed for security. During a Refresh walk, when a URL is about to be refreshed, the probable Base URL that led to it (i.e. the one with the longest prefix) will also be fetched. This helps ensure that login cookies are properly restored to allow the Search Appliance access during the refresh. Example:

"http-post://www.somehost.com/login.asp?user=bigbird&pass=open-sesame"

See also URL file 3.5.7, URL URL 3.5.8, Single page 3.5.9, Page file 3.5.10, and Page URL 3.5.11 for more ways to specify URLs.


Copyright © Thunderstone Software     Last updated: Nov 8 2024

 

Copyright © 2024 Thunderstone Software LLC. All rights reserved.