PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

Index web server from LAN side

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Index web server from LAN side

    Is it possible to index a webserver from the LAN side (i.e. spider LAN side) yet the base URL is WAN side? In other words, I would like all spidering activity to be on the LAN side yet I want to resultant indexed files to permit search via WAN side. It seems rather silly to have to spider out to the Internet then back into the web server when it would be much more efficient to do so LAN side then just change the config for the base URL.

    Dave

  • #2
    There is of course, Offline Mode indexing, which allows you to index files locally off the hard disk (on your computer or a local network drive).

    However, this is no good for dynamically generated sites which depend on server-side scripting - this would require Spider Mode.

    At this point, we do not have a built-in feature to support this.

    What is possible however, is that you can manually modify the "zoom_pages.zdat" file after indexing. eg. you can load up the file and do a search and replace with the URLs (eg. search for "http://mywebserver/" and replace with "http://www.mysite.com/"). Note that this process would need to be performed after every re-index.

    We are currently considering adding a feature (in a future version) to allow for automated post-processing of the URLs in the above fashion. If other people are interested in such a feature, let us know.
    --Ray
    Wrensoft Web Software
    Sydney, Australia
    Zoom Search Engine

    Comment


    • #3
      This feature has been added - right?

      Hello

      I am a new user and just wanted to confirm that the offline spidering mode was available and the base URL will automatically updated according to setting in config.

      Thanks.
      Atif

      Comment


      • #4
        The re-write links feature was added in Version 5.0.1005 (9 Mar 2007)

        ---EXTRACT FROM THE V5 USERS GUIDE------

        This option allows you to rewrite the indexed URLs of the pages indexed. This can be useful if you are spidering a development version of your site on a test server (eg. http://test.mycompany.com/) and creating index files to go on the live server (eg. http://www.mycompany.com/). You would do this by specifying rewrite options to replace all instances of "http://test.mycompany.com/" in the indexed URLs with "http://www.mycompany.com/".

        You could also use this option to change all the search result links to be relative rather than absolute by replacing the domain (eg. "http://www.mysite.com/") with a relative path (eg. "./" or "../"). We only recommend this for users who are very familiar with relative linking and understand that the linking would only work if the generated search files are placed in an appropriate folder on the server.

        Note: Using the Rewrite Links option disables the ability to use incremental indexing on the produced set of index files. This means you will not be able to perform an incremental update, or add/remove pages from the index without re-indexing your site entirely. For more information on these features, see "Incremental indexing".

        Comment

        Working...
        X