PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

IPB (Discussion forum) Skip List

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • IPB (Discussion forum) Skip List

    I'm having trouble with IPB Invision v2.1.4 (discussion forum).

    The search index correctly displays the first topic in a thread, but none of the subsequent replies. Here is my skip list:

    /forums/cache
    /forums/install
    /forums/ips_kernel
    /forums/lofiversion
    /forums/jscripts
    /forums/lang
    /forums/modules
    /forums/retail
    /forums/skin_acp
    /forums/skin_cache
    /forums/sources
    /forums/style_avatars
    /forums/style_images
    /forums/upgrade
    /forums/uploads
    ?s=
    ?act=calendar
    ?act=rssout&id


    Thanks a million

    David

  • #2
    There are some comments about how to index forum and message boards here,
    http://www.wrensoft.com/zoom/support...html#msgboards

    But we don't have specific instructions for the IPB Invision forum.

    My understanding was thas the initially post and subsequent replies are all on a single page with IPB. So it is hard to see how you could see the post and not the replies.

    Can you gives URLs to your search function and an example where it doesn't work.

    ------
    David

    Comment


    • #3
      Hi David,

      Yes, I read the FAQ about forums but that was more designed for phpBB, not IPB.

      With my current skip list, the index finds the following file:

      http://www.newcomers-network.de/forums/index.php?showtopic=17

      but not

      http://www.newcomers-network.de/forums/index.php?showtopic=17&pid=99&mode=threaded&show=& st=&#entry99

      Any ideas would be most appreciated.

      David

      PS: By the way, I think you software is great. It works very well and is well documented.

      Comment


      • #4
        I took a quick look at trying to index your IPB board.

        It seems that your IPB is configured to return links with a session ID parameter when accessed by a client with cookie support disabled. This means that when this happens, all links on the page will look like the following:

        http://www.newcomers-network.de/forums/index.php?showtopic=17&pid=34&mode=threaded&show=& st=&

        Note the extra "?s=e62119..." parameter in the URL compared to the links you mentioned above.

        Now from your original post, I can see that you have a skip page entry for "?s=", which would cause the indexer to skip this URL accordingly. You can see this by turning on "Verbose" mode.

        If you remove this skip page entry, it should scan and index the pages you want, but this would probably not be ideal because you will end up with a search engine that return links containing the same session ID.

        An alternative is that you can enable cookie support in Zoom. This can be found in the Configuration window, under the "Authentication" tab. You might also want to make sure you have "Reload all pages (do not use cache)" enabled so that it will take effect immediately.

        Doing the above allows Zoom to use cookies shared with IE and Windows. By doing so, you should find that the "?s=e62119..." parameter will disappear from the links. However you should note that because this is using the same cookie as IE, this means that Zoom will see pages as that user would.

        Hope that helps.
        --Ray
        Wrensoft Web Software
        Sydney, Australia
        Zoom Search Engine

        Comment


        • #5
          Thanks for the great support.



          I did what you said and now have 21 search entries, where I only need 3.



          Below are the list of the URL I want to SKIP and NOT SKIP. Perhaps you can let me know what I need to enter in my config:





          NOT SKIP



          http://www.newcomers-network.de/forums/index.php?showtopic=17&pid=98&mode=threaded&show=& st=&



          SKIP



          http://www.newcomers-network.de/forums/index.php?act=Post&CODE=02&f=6&t=17&qpid=100

          http://www.newcomers-network.de/forums/index.php?showtopic=17&mode=linear

          http://www.newcomers-network.de/forums/index.php?showtopic=17&mode=linearplus

          http://www.newcomers-network.de/forums/index.php?act=post&do=reply_post&f=6&t=17

          http://www.newcomers-network.de/forums/index.php?act=Print&client=printer&f=6&t=17

          http://www.newcomers-network.de/forums/index.php?act=Search&nav=au&CODE=show&searchid=bcb 4eab9a88851e1f24dbac256e69a55&search_in=posts&resu lt_type=posts





          Thanks a million

          Comment


          • #6
            Your guesses would be as good as mine, since I have not looked any further into indexing IPB at this point. But from your examples, I would try the following:

            ?act=Post
            ?act=post
            &mode=linear
            ?act=Print
            ?act=Search

            Basically any bit of text in the URL that distinguishes one of the links to skip from the ones you do not want to skip.
            --Ray
            Wrensoft Web Software
            Sydney, Australia
            Zoom Search Engine

            Comment

            Working...
            X