I have html images of many of my pdf files, with links to the .pdf's if the user wishes a "printable" page. The information on these pages is completely redundant, and I tried bookending the .pdf URLs between "<!--ZOOMSTOP--> and <!--ZOOMRESTART-->, but I'm getting both versions indexed. Is there another way to hide the .pdf's in this directory, or do I need to put all of my .pdf's in another "off-zoom-limits" directory?
Announcement
Collapse
No announcement yet.
Selectively hide pages
Collapse
X
-
I assume that you have some PDF files that you want indexed and some PDF files that you don't want indexed?
Seperating them into different sub-directories is an easy solution. (then get Zoom to skip one directory).
Giving them different files names is another solution. e.g giving all the printable PDF's a name like,
/printversion_document1.pdf
/printversion_document2.pdf
The get Zoom to skip any URL with /printversion_ in the name.
You can also use <!--ZOOMSTOPFOLLOW--> and <!--ZOOMRESTARTFOLLOW--> to specify sections of a page containing links that the spider should not follow. Note that the text within this section will still be indexed when this is used alone.
Comment