Hi again,
allow me one (hoepfully last) stupid question.
i came across a "funny" HTML error message with some of our pages.
interesting: The HTML is not perfect but according to various HTML checker
ok. 1127 error reported in 52644 ....(see log)
The are several things on the html pages, which i initially thought might be the reason for the problem, but they are ok on other pages
...any idea what this might be?...
Thanks already...
Greetings ...
---------------------
14:17:40 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00758/IPI00758369.htm, page aborted
14:17:42 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00759/IPI00759894.htm, page aborted
14:17:43 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00759/IPI00759928.htm, page aborted
14:17:47 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00760/IPI00760082.htm, page aborted
14:17:50 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118096.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118238.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118271.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118296.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118304.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118309.htm, page aborted
14:19:22 - Indexing completed at Tue Mar 13 14:19:22 2007
14:19:22 - INDEX SUMMARY
14:19:22 - Files indexed: 52644
14:19:22 - Files skipped: 4681074
14:19:22 - Files filtered: 0
14:19:22 - Files downloaded: 52644
14:19:22 - Unique words found: 1778195
14:19:22 - Total words found: 46694865
14:19:22 - Avg. unique words per page: 33
14:19:22 - Avg. words per page: 886
14:19:22 - Start index time: 13:54:59 (2007/03/13)
14:19:22 - Elapsed index time: 00:24:23
14:19:22 - Errors: 1127
14:19:22 - URLs visited by spider: 52644
14:19:22 - URLs in spider queue: 0
14:19:22 - Total bytes scanned/downloaded: 1793589489
14:19:22 - File extensions:
14:19:22 - .htm indexed: 52394
14:19:22 - .html indexed: 250
14:19:22 - Cleaning up memory used for index data... please wait.
14:19:22 - Finished cleaning up memory.
allow me one (hoepfully last) stupid question.
i came across a "funny" HTML error message with some of our pages.
interesting: The HTML is not perfect but according to various HTML checker
ok. 1127 error reported in 52644 ....(see log)
The are several things on the html pages, which i initially thought might be the reason for the problem, but they are ok on other pages
...any idea what this might be?...
Thanks already...
Greetings ...
---------------------
14:17:40 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00758/IPI00758369.htm, page aborted
14:17:42 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00759/IPI00759894.htm, page aborted
14:17:43 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00759/IPI00759928.htm, page aborted
14:17:47 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00760/IPI00760082.htm, page aborted
14:17:50 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118096.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118238.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118271.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118296.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118304.htm, page aborted
14:17:53 - [ERROR] Invalid HTML found while spidering http://harvester.fzk.de/harvester/mouse/IPI00118/IPI00118309.htm, page aborted
14:19:22 - Indexing completed at Tue Mar 13 14:19:22 2007
14:19:22 - INDEX SUMMARY
14:19:22 - Files indexed: 52644
14:19:22 - Files skipped: 4681074
14:19:22 - Files filtered: 0
14:19:22 - Files downloaded: 52644
14:19:22 - Unique words found: 1778195
14:19:22 - Total words found: 46694865
14:19:22 - Avg. unique words per page: 33
14:19:22 - Avg. words per page: 886
14:19:22 - Start index time: 13:54:59 (2007/03/13)
14:19:22 - Elapsed index time: 00:24:23
14:19:22 - Errors: 1127
14:19:22 - URLs visited by spider: 52644
14:19:22 - URLs in spider queue: 0
14:19:22 - Total bytes scanned/downloaded: 1793589489
14:19:22 - File extensions:
14:19:22 - .htm indexed: 52394
14:19:22 - .html indexed: 250
14:19:22 - Cleaning up memory used for index data... please wait.
14:19:22 - Finished cleaning up memory.
Comment