PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

Zoom Search Engine v. 7 and Atlassian Confluence server files

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Zoom Search Engine v. 7 and Atlassian Confluence server files

    Anyone have any success indexing files in spider mode on an Atlassian Confluence web server?

    Confluence documentation states this is possible via third-party tools:
    https://confluence.atlassian.com/doc...-51871758.html

    But so far I have not had any success with the Confluence server running on our corporate Intranet, despite setting my Zoom configuration to be as unconstrained as possible (just to test connection), and having an authenticated session open on the server before launching the search.

    The Zoom log indicates a redirect of my start URL, which points to the server home page I have access to, to the corporate login page, then a skipped login page because "External site - does not match base URL" error, followed by a "No files found to spider from https:// ... etc.

    My Zoom configuration scan options are set to search all files, including ones with unknown extensions, or no extensions.

    I can however index pages on the Atlassian confluence server on the Internet, and have noticed that the HTTP address for a page when copied from that server includes an actual HTML page reference, whereas a page address copied from the server on our corporate Intranet does not include a such reference. I would expect this not to matter, having configured Zoom to search all files.

    My current workaround is to save the web page locally, index that and remap target links to the server. While this is workable, I would prefer to use the spidering option instead.
    Richard

  • #2
    I don't believe we have had the experience of indexing a Confluence server so I can't say for certain whether there are any obstacles that can't be overcome. However,

    Originally posted by rkg82 View Post
    The Zoom log indicates a redirect of my start URL, which points to the server home page I have access to, to the corporate login page, then a skipped login page because "External site - does not match base URL" error, followed by a "No files found to spider from https:// ... etc.
    ... these sound like pretty typical issues that can be worked around with configuration.

    "External site... " issues mean you should change your Base URL to include all the necessary URLs you require. You can either simplify the URL or specify multiple base URLs delimited by semi-colon, e.g.
    http://mysite.com/;https://mysite.co...cs.mysite.com/

    Login page: use the Configure->Authentication options. See the Help file or this FAQ for more details.

    You are likely to come across more problems to deal with one by one, as the spider encounters them. The FAQ is a good resource for tackling this.
    --Ray
    Wrensoft Web Software
    Sydney, Australia
    Zoom Search Engine

    Comment


    • #3
      Zoom 8 can index files with no extension. Is there a method to add a "null" extension to the scan options, where you could tell it to default "no extension" to being html?

      Comment


      • #4
        All files with "no extension" would be processed by the internal file format detection mechanism. HTML should be detected pretty easily. Let us know if that's not being detected and send us some files to take a look.
        --Ray
        Wrensoft Web Software
        Sydney, Australia
        Zoom Search Engine

        Comment

        Working...
        X