PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

v5.1 indexes pdf, does not search?

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • v5.1 indexes pdf, does not search?

    I have 150 - 200 files I'm indexing in offline mode w/V5.1 Pro. Most are .pdf, some .xls & .doc files. I use the .asp config. The logs say everything indexed fine. When I search though ... it only finds things in the .xls or .doc files. None of the pdfs have any security. This all worked a month ago, not sure why this isn't working now... Any hints? Thank you.

  • #2
    If you can show us the search page (either by a URL to your website, or e-mail us the files), we can give you a much more accurate answer.

    There shouldn't be any reason why the same PDF files that were indexed previously do not work now. Are you sure the same files were previously indexed and searchable?

    Are you using incremental indexing?

    Some PDF files do not contain searchable content and some users are unaware of this. For example, PDF files created from scanning in paper documents. See this FAQ:
    Q. Why can't I find words from my scanned PDF files? (PDFs created from scanning in physical documents)
    --Ray
    Wrensoft Web Software
    Sydney, Australia
    Zoom Search Engine

    Comment


    • #3
      Sorry it's an intranet site with some copyright stuff. But I'll send a sample of files that I think you'd need to see what I'm talking about.


      There shouldn't be any reason why the same PDF files that were indexed previously do not work now. Are you sure the same files were previously indexed and searchable?
      Yes, same files

      Are you using incremental indexing?
      No.

      Some PDF files do not contain searchable content and some users are unaware of this. For example, PDF files created from scanning in paper documents. See this FAQ:

      Right... These are pretty plain searchable pdf files though.

      So I have a zip that has this:

      05/05/2008 01:00 PM 1,508 index.html
      05/08/2008 10:33 AM <DIR> RADPlus
      05/08/2008 10:33 AM <DIR> zsearch

      \RADPlus (This is one example of content that indexes, but is not searched)

      05/08/2008 10:33 AM <DIR> .
      05/08/2008 10:33 AM <DIR> ..
      04/30/2008 09:41 AM 91,350 RADplus 2006 Maintenance Release 2006.06.01.pdf

      \zsearch

      05/08/2008 10:33 AM <DIR> .
      05/08/2008 10:33 AM <DIR> ..
      05/08/2008 10:32 AM 42,221 indexlog.txt
      08/09/2007 02:35 PM 90,232 search.asp
      11/01/2007 05:46 PM 3,279 search_template.html
      05/01/2008 08:22 AM 3,698 settings.asp
      05/06/2008 07:03 AM 4,990 zoom.zcfg
      5 File(s) 144,420 bytes

      Total Files Listed:
      8 File(s) 237,278 bytes

      But I don't think I can upload it? Can I email it?

      Comment


      • #4
        Yes. Our e-mail address can be found on this page.
        --Ray
        Wrensoft Web Software
        Sydney, Australia
        Zoom Search Engine

        Comment


        • #5
          We have received esova's files and further corresponded this issue in e-mail. The problem was a case of either mixing index files from different sessions, and/or the use of an older build. This problem was resolved by re-indexing with the latest build.
          --Ray
          Wrensoft Web Software
          Sydney, Australia
          Zoom Search Engine

          Comment

          Working...
          X