PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

Not searching file

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Not searching file

    I have about 10 pdfs on my site. One of them is particularly important to my users but Zoom is not indexing it. The size is slightly over 1M, and I was a getting a "file to large" message, so I reduced it to 904K. I no longer get any message but Zoom isn't indexing it either.

    Any suggestions?

    Thanks.

    Don

  • #2
    If you are using the pro or enterprise edition, you can adjust this file size limit from the limits tab in the Zoom configuration window.

    If you look through your log of indexing activity, does this PDF get mentioned at all in the log? E-Mail us the log and the document if you want.

    See also these related FAQs
    Q. Why are some of my pages being skipped by the indexer?

    Q. Why are links in my Javascript menus being skipped?

    Q. I am indexing with spider mode but it is not finding all the pages on my web site

    Comment


    • #3
      You sent the log file and from the log it can be seen that the file in question is in fact being indexed.
      Code:
       
      Queued URL: http://www.lakesestates.org/documents/ACC_Guidelines.pdf
      DL Thread #1, got URL (http://www.lakesestates.org/documents/ACC_Guidelines.pdf) off queue
      Downloading file http://www.lakesestates.org/documents/ACC_Guidelines.pdf
      Index Thread got ready buffer for http://www.lakesestates.org/documents/ACC_Guidelines.pdf (Content-type: Acrobat document)
      Downloading file http://www.lakesestates.org/documents/ACC_Guidelines.pdf
      Processing PDF file http://www.lakesestates.org/documents/ACC_Guidelines.pdf
      Indexing http://www.lakesestates.org/documents/ACC_Guidelines.pdf
      Why do you think the file is not indexed?

      Mind you however, the text in the document is largely garbage. It looks like a bad OCR job has been done on the document. So now a typical text extract from the document in question looks like this.
      fm ces and walls
      awnings and shM ers
      declcs and balconies
      patio. terracas and grolmd level
      scr- O ciosures
      recreadon and play equipment
      qwimmm g px ls
      nmilboxes and house numbers
      sir s
      Just nonsense words for the most part. But you could still use Zoom to search for the words that are there. nonsense or not.

      Comment


      • #4
        Please also see this FAQ:
        Q. Why can't I find words from my scanned PDF files? (PDFs created from scanning in physical documents)
        --Ray
        Wrensoft Web Software
        Sydney, Australia
        Zoom Search Engine

        Comment

        Working...
        X