Hi,
On our company Intranet, for each directory on it there are 3 copys of each HTML page, the page was originally done in word (dont ask we've told them not to do webpages in word) then the word document is converted to HTML and another copy is made in PDF format. How can i stop the ZoomSearch software from indexing the PDF documents, ive got CRC turned on, and its skipping the word documents as they are identical. Im pretty sure the PDF files arent listed in the webpages anywhere so theres no reason to index it. Cant really do anything to the pdf files as the site is huge.
Im currently using spider mode.
cheers
On our company Intranet, for each directory on it there are 3 copys of each HTML page, the page was originally done in word (dont ask we've told them not to do webpages in word) then the word document is converted to HTML and another copy is made in PDF format. How can i stop the ZoomSearch software from indexing the PDF documents, ive got CRC turned on, and its skipping the word documents as they are identical. Im pretty sure the PDF files arent listed in the webpages anywhere so theres no reason to index it. Cant really do anything to the pdf files as the site is huge.
Im currently using spider mode.
cheers
Comment