PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

Indexing Options Question

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Indexing Options Question

    I am currently eval'ing the Zoom Search Engine indexer for intranet site and had a question in regard to the indexing options. One of the options that can be checked is "Link text". Being that the interpretation of this option is a little ambiguous (for me at least), is this a reference to href's which have a name-property set for them?

    If not, is there a way in which I can have links within html-documents indexed via this software? As it stands, I've got a page setup on a fileserver which links to files (as in non-html-formatted documents) at several different locations on a local fileserver.

  • #2
    "Link text" on the "Indexing options" window refers to text that is part of a HTML link.

    So in this example,
    <a href="redshoes.html">Big Red Shoes</a>

    "Big Red Shoes" is the link text. And the check box you are refering to controls if this text is indexed or not.

    Note that "Link text" only affect the indexing of these elements for the target or destination file. That is, if a text link appears on "pageA.html" to "imageB.jpg", with the link text"picture of my pets", then these words will be indexed for the file "imageB.jpg", and NOT for "pageA.html".

    Comment


    • #3
      is there a way in which I can have links within html-documents indexed via this software?
      Do you mean that you want the text in the link itself indexed? Or do you mean you want the link followed and the document that is the target of the link indexed?

      The later case should happen automatically, assuming the document isn't excluded from the indexing process for some reason.

      Comment


      • #4
        How would documents possibly be excluded from the indexing process? I think that there's a possibility that this may be occurring, but I'm unsure of whether or not this is due to it still being a trial version.

        Comment


        • #5
          There are many ways documents might be excluded.

          A few examples,
          1) You are using the free edition of the software, which will stop index pages after 50 pages.

          2) You have a robots.txt file that excludes pages

          3) The page in question has a URL that is in different domain (or sub-domain) from the start point.

          4) The URL is in the Zoom skip list.

          See also these 3 FAQs
          Q. Why are some of my pages being skipped by the indexer?
          http://www.wrensoft.com/zoom/support...s.html#skipped

          Q. Why are links in my Javascript menus being skipped?
          http://www.wrensoft.com/zoom/support...avascriptmenus

          Q. I am indexing with spider mode but it is not finding all the pages on my web site
          http://www.wrensoft.com/zoom/support...spider_finding

          Comment

          Working...
          X