Sitemap Generator - HELP: Settings

for more info on how to use Sitemap Generator -- read below...

Sitemap Generator Commands Overview

Sitemap Generator buttons, commands and tabs...

 

  • Settings

 

sitemap generator settings

 

The "Settings" window allows you to configure program behavior.

 

"Max.file size" is the maximal filesize accepted. Every target file (page) bigger than this will be skipped and will be labeled as "failed"

 

"Max.URL length" -- this is the maximal size of the URL in characters. URLs longer than this will be skipped.

 

"Server Response Timeout" -- this is the max.time that Sitemap Generator will wait for the target server to respond

 

The "www" checkbox at the bottom, if checked, will instruct the program to crawl both "www.domain.com" and "domain.com" as the same site. Usially the "www." prefix points to the same site, but because "www." is in fact a subdomain, it may be separate site. If so - check this box to tell the program that these are different sites.

 

 

  • Simultaneous Connections

 

simultaneous connections

 

Use this combo-box to set the maximal number of simultaneous connections. In most cases this will speed-up the process of crawling, however - the target server will receive more concurrent conections which may be potential problem (may slow-down the server speed). In general - use this option only AT YOUR OWN RISK.

 

 

  • "Clear Cache" button

 

The "cache" is temp folder where the program stores all crawled (downloaded) files since the program start. On exit Sitemap Generator will delete these files. They exist only to make the crawling faster -- for example, if you click the "Stop" button and then restart scanning the site, the second time crawler will get the files crawled on the first start from cache, without downloading them again. "In case you want to re-download all pages from the server, just click the "Clear Cache" button.