PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

Spider only one folder

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Spider only one folder

    Hi,
    I only want to spider one folder which contains product files.
    Someone else uploads new products to this folder which are not on my hard drive.

    The files are located at www.bridgetools.co.uk/htdocs/WebGen
    Is this possible?

  • #2
    Hello,

    This may be accomplished with combination of the "Page and folder skip list" on the "Skip options" configuration page and depending if you have directory listing turned off on your webserver, you may need to use a directory listing script.

    Please see the following past posts about directory listing scripts:

    http://www.wrensoft.com/forum/showthread.php?t=919
    http://wrensoft.com/forum/showthread.php?t=907

    Regards,

    Richard

    Comment


    • #3
      Also you can just start the indexer in the sub folder and default behaviour will be not to leave the sub-folder.

      As an example if I started indexing at this URL,
      http://www.wrensoft.com/forum/index.php
      and the base URL was thus,
      http://www.wrensoft.com/forum/
      only the files in the forum folder would be indexed.

      Comment


      • #4
        Thank you for your reply but I am obviously missing something as I still cannot get it to work.

        The base url is http://www.bridgetools.co.uk/htdocs/

        and I want to spider from http://www.bridgetools.co.uk/htdocs/WebGen/


        I am getting the message Check that the url exists and satisfies the settings in the configuration window.
        Regards
        Irene

        Comment


        • #5
          Hi,

          Loading http://www.bridgetools.co.uk/htdocs/WebGen/ into a browser, returns the "404 The page cannot be found"

          If you have directory listing disabled on your webserver, then Zoom will not be able to index that folder. When spidering, Zoom downloads a copy of the "webpage" and if none is found, then obviously it cannot index said page.

          Please see my post above about turning on directory listing, or using a script to list directory contents for that directory that you are trying to index.

          Regards,

          Richard

          Comment


          • #6
            Thank you so much I have followed your advice and it now works fine.

            Excellent service you provide.

            Comment

            Working...
            X