PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

stops wen indexing

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • stops wen indexing

    hi all

    when indexing this site Downloading file http://bible.com/abtrailer.php
    the index stops

    anyone know why this is???

  • #2
    You need to provide more information. Does it stop on that very page, or does it index a number of pages from that website before it stops?

    I did a quick test indexing that site, and managed to index over 700 pages without problem. It may also be dependent on your Indexer configuration.

    There is one possible cause of concern on that page though. It uses some Javascript on it which contains code to write out HTML link tags. These link tags are HTML, and Zoom parses them and extracts them, when they were actually intended to be Javascript code, see below:

    Code:
    function printBanner2() {
        var ranNum= Math.round(Math.random()*3);
        if (ranNum == 0) {
            document.writeln('<a href=\"kids/index.php\"><img src=\"banners/Kids-Section_728x90.gif\" alt=\"\" border=\"0\"></a>');
        }
        if (ranNum == 1) {
            document.writeln('<a href=\"<A href="http://www.bible.com/marketplace/browse.php?browseby=Search%20Criteria&search_title=bible\"><img">http://www.bible.com/marketplace/browse.php?browseby=Search%20Criteria&search_title=bible\"><img src=\"banners/Market-Place_728x90_b.gif\" border=\"0\" alt=\"\"></a>');
        }
        if (ranNum == 2) {
            document.writeln('<a href=\"<A href="http://store.bible.com/bythecase/\"><img">http://store.bible.com/bythecase/\"><img src=\"banners/Bookstore4_728x90.jpg\" border=\"0\" alt=\"\"></a>');
        }
        if (ranNum == 3) {
            document.writeln('<a href=\"<A href="http://store.bible.com/watchword/main.html\"><img">http://store.bible.com/watchword/main.html\"><img src=\"banners/Bookstore5_728x90.jpg\" border=\"0\" alt=\"\"></a>');
        }
    }
    In some cases, this could be good, because we could be picking up some links that would otherwise not be made visible. In this case however, it is not, because the links are escaped for the purpose of the Javascript, and Zoom ends up looking for pages like "kids/index.php\" which is invalid.

    This should not however cause a indexing to "stop". It would simply look for these pages on the server, and the server should report that they are not found, and Zoom should continue indexing.

    Scripts such as these, embedded on a webpage are generally recommended to be wrapped in HTML comment tags ("<!--" and "-->") if they should not be interpreted. That is, like the following:

    Code:
    <script language="JavaScript">
    [B]<!--[/B]
    function myFunction()
    {
    ... etc.
    }
    [B]-->[/B]
    </script>
    This is explained in the W3 recommendations here:
    http://www.w3.org/TR/html401/interac....html#h-18.3.2

    This method is actually already employed for the other Javascripts on the page, it is just this block of script that is missing the comment tags. If they can be inserted back in place where they belong, then the above issue would be eliminated.

    But as I mentioned, this issue should not cause the index to "stop" as you reported, so there might be something else at hand. Can you provide more information as requested above. And describe what happens when the indexing "stops". Is there an error message? Is it during the "Downloading" message, and in which case did you wait for at least 60 seconds (the timeout limit for a HTTP request, if the web server stops responding)?
    --Ray
    Wrensoft Web Software
    Sydney, Australia
    Zoom Search Engine

    Comment


    • #3
      ray

      hi ray when i've been indexing that site and the index is about 40pages and then it just stops on that 1 link.
      it says downloading the link and i've waited over 2hours and still indexing the same link

      i can't see it being the setting as its indexed that link in the past and when i click on the end index button it won't stop the index

      so i have to tell it to end the task the same thing happened with the plugin for the images also the AVG virus scan slows down the indexing speed so i have to shut down the AVG software to speed up the indexing

      would you know away arould this problem
      also im using CGI

      thank you

      Comment


      • #4
        Are you sure your internet connection is stable? Knowing what version of Zoom you are using and confirming that same URL is always the problem would be useful (as opposed to being random). E-Mailing us your Zoom configuration file would also be good.

        If you have a problem with the image plug-in, then you need to describe it in detail. Saying that you have the "same problem" with the image plug-in doesn't make sense when the problem seem to be related to indexing a PHP page (and not an image file).

        Comment

        Working...
        X