PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

Verbose Queued URL messages?

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Verbose Queued URL messages?

    Hi,
    What exactly does 'queued' mean in the log file?

  • #2
    It means a URL was found on a page and that URL has been placed in a queue to be downloaded and indexed.

    This is a normal part of spidering a web site.

    ----
    David

    Comment


    • #3
      Meaning of verbose mode messages

      As other people have asked about this I will expand a little on the answer.

      The normal process for each download thread running in Zoom is to,

      1) Get a URL from a queue of URLs to be indexed.
      DL Thread #1, got URL (http://www.example.com/index.html) off queue

      2) Download the file from the server (or get it from cache)
      Downloading file http://www.example.com/index.html

      3) Put the file into a memory buffer for processing and identify the MIME content type.
      Index Thread got ready buffer for http://www.example.com/index.html (Content-type: HTML text)

      4) Scan the page for links to follow
      Spidering for links on http://www.example.com/index.html

      5) Add any links found to the queue of URLs indexed
      Queued URL: http://www.example.com/page2.html
      Queued URL: http://www.example.com/page3.html
      (These links will be indexed later, maybe by another thread)

      6) Index the text on the page,
      Indexing http://www.example.com/index.html

      At each step of the process there are mesages written to the log in verbose mode. In non-verbose mode only steps 2 and 6 are logged.

      ------
      David

      Comment

      Working...
      X