Hi,
I have been using Zoom to index my company's intranet for almost a month. Since the site is still in development, the indexing is being done on a local machine.
The website being indexed is a DNN website, which is accessed at the following URL:
http://gerald/technical/
The site contains over 20,000 pages (mostly HTML pages from a documentation system), and we have purchased the Professional edition.
Before adding the documentation system to the website, we dropped in just a few HTML pages and indexed them with the free version, which picked them up promptly.
Now, however, it doesn't seem to index the HTML files.
I am enclosing my log file below. I also tried to attach my config file (zoom2.cfg from my desktop), but I could not work out how to attach it. I'm pretty sure I must have got the config or the spider URL wrong.
(How do I add attachments in this forum?)
10/31/08 10:15:30 - Start indexing (spider mode)
10/31/08 10:15:30 - Maximum number of words: 300000
10/31/08 10:15:30 - Maximum number of files: 65500
10/31/08 10:15:30 - Will scan files with extensions
10/31/08 10:15:30 - .php
10/31/08 10:15:30 - .asp
10/31/08 10:15:30 - .cfm
10/31/08 10:15:30 - .aspx
10/31/08 10:15:30 - .php3
10/31/08 10:15:30 - .php4
10/31/08 10:15:30 - .txt
10/31/08 10:15:30 - .doc
10/31/08 10:15:30 - .ae
10/31/08 10:15:30 - .aef
10/31/08 10:15:30 - .ocx
10/31/08 10:15:30 - .msi
10/31/08 10:15:30 - .dll
10/31/08 10:15:30 - .exe
10/31/08 10:15:30 - .zip
10/31/08 10:15:30 - .rar
10/31/08 10:15:30 - .gif
10/31/08 10:15:30 - .jpeg
10/31/08 10:15:30 - .jpg
10/31/08 10:15:30 - .bmp
10/31/08 10:15:30 - .png
10/31/08 10:15:30 - .htm
10/31/08 10:15:30 - .html
10/31/08 10:15:30 - .xml
10/31/08 10:15:30 - Spider from: http://gerald/technical/Home/tabid/36/Default.aspx
10/31/08 10:15:30 - Web site URL: http://gerald/technical/Home/tabid/36/
10/31/08 10:15:30 - Estimated RAM required during index process: 472287 KB
10/31/08 10:15:31 - Initiating HTTP session (thread #1) ...
10/31/08 10:15:31 - DL Thread #1, got URL (http://gerald/technical/Home/tabid/36/Default.aspx) off queue
10/31/08 10:15:31 - Downloading file http://gerald/technical/Home/tabid/36/Default.aspx
10/31/08 10:15:31 - Index Thread got ready buffer for http://gerald/technical/Home/tabid/36/Default.aspx (Content-type: HTML text)
10/31/08 10:15:31 - Initiating HTTP session (thread #2) ...
10/31/08 10:15:31 - Spidering for links on http://gerald/technical/Home/tabid/36/Default.aspx
10/31/08 10:15:31 - Queued URL: http://gerald/technical/Default.aspx
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/0/Newest%20Logo.gif
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/media/menuitemsel_l.gif
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/media/menuitemsel_r.gif
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/media/menuitem_l.gif
10/31/08 10:15:31 - Queued URL: http://gerald/technical/Downloads/tabid/55/Default.aspx
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/media/menuitem_r.gif
10/31/08 10:15:31 - Queued URL: http://gerald/technical/IssueManagement/tabid/56/Default.aspx
10/31/08 10:15:31 - Queued URL: http://gerald/technical/ContactUs/tabid/62/Default.aspx
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Skins/skins/spacer.gif
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Containers/containers/media/blue3_tl.gif
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Containers/containers/media/blue3_tr.gif
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/Portals/_default/Containers/containers/media/blue3_ml.gif
10/31/08 10:15:31 - Queued URL: http://gerald/Technical/images/min.gif
10/31/08 10:15:31 - Skipping https://support.stayinfront.co.nz/technical/documentation/Version%2011/138145/138186/141718/StayinFront%20CRM%2011%20Product%20Sheet.pdf (External site - does not match base URL)
10/31/08 10:15:31 - Skipping https://support.stayinfront.co.nz/technical/documentation/Version%2011/103514/104772/141442/StayinFront%20Analytics%2011%20User%20Guide.pdf (External site - does not match base URL)
10/31/08 10:15:31 - Writing index data for ASP search... (Please wait)
10/31/08 10:15:31 - Created pagedata data file (zoom_pagedata.zdat)
10/31/08 10:15:31 - Created pagetext data file (zoom_pagetext.zdat)
10/31/08 10:15:31 - Created pageinfo data file (zoom_pageinfo.zdat)
10/31/08 10:15:31 - Created categories data file (zoom_cats.zdat)
10/31/08 10:15:31 - Created spelling data file (zoom_spelling.zdat)
10/31/08 10:15:31 - Created dictionary data file (zoom_dictionary.zdat)
10/31/08 10:15:31 - Created wordmap data file (zoom_wordmap.zdat)
10/31/08 10:15:31 - Created script settings file (settings.asp)
10/31/08 10:15:31 - Indexing completed
10/31/08 10:15:31 - INDEX SUMMARY
10/31/08 10:15:31 - Files indexed: 8
10/31/08 10:15:31 - Files skipped: 19
10/31/08 10:15:31 - Files filtered: 0
10/31/08 10:15:31 - Files downloaded: 8
10/31/08 10:15:31 - Unique words found: 1233
10/31/08 10:15:31 - Total words found: 4382
10/31/08 10:15:31 - Avg. unique words per page: 154.13
10/31/08 10:15:31 - Avg. words per page: 547
10/31/08 10:15:31 - Start index time: 10:15:30 (2008/10/31)
10/31/08 10:15:31 - Elapsed index time: 00:00:01
10/31/08 10:15:31 - Errors: 1
10/31/08 10:15:31 - URLs visited by spider: 32
10/31/08 10:15:31 - URLs in spider queue: 0
10/31/08 10:15:31 - Start points indexed: 2 / 2
10/31/08 10:15:31 - Total bytes scanned/downloaded: 187833
10/31/08 10:15:31 - File extensions:
10/31/08 10:15:31 - .php indexed: 0
10/31/08 10:15:31 - .asp indexed: 0
10/31/08 10:15:31 - .cfm indexed: 0
10/31/08 10:15:31 - .aspx indexed: 8
10/31/08 10:15:31 - .php3 indexed: 0
10/31/08 10:15:31 - .php4 indexed: 0
10/31/08 10:15:31 - .txt indexed: 0
10/31/08 10:15:31 - .doc indexed: 0
10/31/08 10:15:31 - .ae indexed: 0
10/31/08 10:15:31 - .aef indexed: 0
10/31/08 10:15:31 - .ocx indexed: 0
10/31/08 10:15:31 - .msi indexed: 0
10/31/08 10:15:31 - .dll indexed: 0
10/31/08 10:15:31 - .exe indexed: 0
10/31/08 10:15:31 - .zip indexed: 0
10/31/08 10:15:31 - .rar indexed: 0
10/31/08 10:15:31 - .gif indexed: 0
10/31/08 10:15:31 - .jpeg indexed: 0
10/31/08 10:15:31 - .jpg indexed: 0
10/31/08 10:15:31 - .bmp indexed: 0
10/31/08 10:15:31 - .png indexed: 0
10/31/08 10:15:31 - .htm indexed: 0
10/31/08 10:15:31 - .html indexed: 0
10/31/08 10:15:31 - .xml indexed: 0
10/31/08 10:15:31 - No extensions indexed: 0
10/31/08 10:15:31 - Cleaning up memory used for index data... please wait.
10/31/08 10:15:31 - Finished cleaning up memory.
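In case it helps anyone reading the log, here is a rough Python sketch for summarising it. It counts the "Queued URL" and "Skipping" lines, reads the per-extension "indexed" counts, and lists queued URLs that do not start with the "Web site URL" value. The function name `summarise_log` is made up for illustration, and the case-insensitive prefix comparison is only my assumption about how the base URL is applied, not something confirmed by the Zoom documentation.

```python
import re
from collections import Counter

# Matches the timestamp prefix used in the log above, e.g. "10/31/08 10:15:31 - ..."
LINE_RE = re.compile(r"^\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2} - (.*)$")
# Matches summary lines such as ".aspx indexed: 8"
EXT_RE = re.compile(r"^(\.\w+) indexed: (\d+)$")


def summarise_log(text):
    """Summarise a pasted indexer log (hypothetical helper, not part of Zoom)."""
    counts = Counter()
    base_url = None
    queued = []
    ext_indexed = {}

    for raw in text.splitlines():
        match = LINE_RE.match(raw.strip())
        if not match:
            continue  # ignore anything that is not a log line
        msg = match.group(1)

        if msg.startswith("Web site URL: "):
            base_url = msg[len("Web site URL: "):].strip()
        elif msg.startswith("Queued URL: "):
            counts["queued"] += 1
            queued.append(msg[len("Queued URL: "):].strip())
        elif msg.startswith("Skipping "):
            counts["skipped"] += 1
        else:
            ext = EXT_RE.match(msg)
            if ext:
                ext_indexed[ext.group(1)] = int(ext.group(2))

    # ASSUMPTION: treat the base URL as a case-insensitive prefix filter.
    outside_base = [
        url for url in queued
        if base_url and not url.lower().startswith(base_url.lower())
    ]

    return {
        "counts": dict(counts),
        "base_url": base_url,
        "outside_base": outside_base,
        "ext_indexed": ext_indexed,
    }
```

To use it, read the saved log into a string and pass it to `summarise_log`. Any URLs listed under `outside_base` would be worth comparing against the "Web site URL" setting, since in my log that value is http://gerald/technical/Home/tabid/36/ rather than http://gerald/technical/.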