PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

unable to download/spider certain files

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • unable to download/spider certain files

    I have 9,000 + pages on one site, i am spidering it, the following links all say no such URL, yet they work fine. any clue as to why they got spit out unindexed?

    [size="2"]09:39:06 - [WARNING] Could not download file: http://ambergriscaye.com/fieldguide/geo-plate13.html (Invalid URL or domain name)
    09:39:06 - [WARNING] Could not download file: http://ambergriscaye.com/fieldguide/geo-plate14.html (Invalid URL or domain name)
    09:39:06 - [WARNING] Could not download file: http://ambergriscaye.com/fieldguide/geo-plate15.html (Invalid URL or domain name)
    09:39:07 - [WARNING] Could not download file: http://ambergriscaye.com/fieldguide/geo-plate16.html (Invalid URL or domain name)

    every time i index, a few are like this. they change every time. never the same ones....
    Last edited by toadstooldan; Apr-07-2009, 09:34 PM.

  • #2
    That would look like you have an unstable Internet connection. It is most likely dropping out during those pages (which is why it happens randomly). Another possibility is if your server is actually failing every once in awhile. Try running the index on another machine, on another ISP/internet connection, and see if you experience the same problem. If you do, then it's likely something to do with your server.

    If the files you are indexing are all static (that is, they are not PHP or ASP pages which are dynamically generated), then you could consider using Offline Mode and indexing the files directly on your hard disk. This would avoid any need for a stable Internet connection to the server whilst indexing.
    --Ray
    Wrensoft Web Software
    Sydney, Australia
    Zoom Search Engine

    Comment


    • #3
      thank you sir!

      Comment


      • #4
        i indexed my site a dozen times with version 5 and 6 yesterday. I consistently got 9066 files indexed with version 6, on two different machines.

        with version 5, exact same config file as with version 6, i get 9400+ files each time. any clue as to why i get different results?

        Comment


        • #5
          There is probably a good explaination. To start with you would need to look at the Zoom log to determine the difference. Work out what files were missing, or didn't index. There might be some errors in the log file?

          Comment

          Working...
          X