PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

Need a Different (Worse) Spider

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Need a Different (Worse) Spider

    The Zoom spider is an excellent beast (arachnid), coping with javascript and frameset page links no problem.

    Now corporate IT was to include results from the site I maintain in their search run by a different spider (Knova) (Spanish for K-no go) which seems incapable of anything.

    I've put in hidden links to let it bypass the javascript and framesets but I can't test it with Zoom.

    Anyone know of a Windows spider that can't cope with framesets/javascript or can be told to not use them so I can run my own tests. I'm not fussed by the output so long as I can see a log of which pages the spider got to.
    Mark Gallagher

  • #2
    Some versions of wget can do recursive downloads & parsing.
    http://www.gnu.org/software/wget/man...rsive-Download

    I am fairly sure it doesn't handle anything complex.

    Or you could manually use the Lynx text based browser to check what links can be seen.

    Comment


    • #3
      Lynx !!! is it still going!!! next someone will mention Compuserve and I'll be back to my very first website.

      Thanks guys, I'll have a look at those.
      Mark Gallagher

      Comment


      • #4
        Just installed and used Lynx. Strangely usable within seconds and proved that even the Knova spider should be able to get around the site.

        Many thanks.
        Mark Gallagher

        Comment

        Working...
        X