Not indexing Arabic and Chinese pages

  • Not indexing Arabic and Chinese pages

    Hello,

    I have problems indexing the Chinese and Arabic versions of our website. All other languages are OK. The first page the indexer reads stays in the status "Spidering / Waiting". All other pages are downloaded but not indexed. The same happens when I use a different page as the starting point.

    Regards,
    Frank

  • #2
    Can you post the URL to the Chinese page so that we can take a look?
    Are you running the software on Mac, Windows or Linux?
    What exact version of the Zoom software are you running?

    • #3
      The URL to the Chinese page is http://www.dentaurum.de/chs/default.aspx
      I'm running the software on Windows Server 2003. Version 7 Build 1006.

      • #4
        I indexed your site from here.
        Some pages are being skipped due to the CRC page duplication check.

        Can you try turning CRC off in the "Scan options" configuration window?

        I'll also have a deeper look into it to see why Zoom thinks so many of the pages are duplicates. It might be a bug, or it might be something special about these Chinese pages.
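
For readers unfamiliar with checksum-based duplicate detection, here is a minimal Python sketch of the general idea. It is not Zoom's actual code, and the cause of the bug is not stated in this thread. The function names, the sample pages, and the lossy `ascii` encoding step are all assumptions chosen to show one hypothetical way that non-Latin pages could be misjudged as duplicates.

```python
import zlib


def page_crc(html: str, encoding: str = "utf-8") -> int:
    """Return a CRC-32 checksum of the page text after encoding it to bytes."""
    # errors="replace" turns every character the encoding cannot represent into "?"
    return zlib.crc32(html.encode(encoding, errors="replace"))


def find_duplicates(pages: dict[str, str], encoding: str = "utf-8") -> list[tuple[str, str]]:
    """Return (url, first_seen_url) pairs for pages whose checksum was already seen."""
    seen: dict[int, str] = {}
    duplicates: list[tuple[str, str]] = []
    for url, html in pages.items():
        crc = page_crc(html, encoding)
        if crc in seen:
            duplicates.append((url, seen[crc]))  # would be skipped by the indexer
        else:
            seen[crc] = url
    return duplicates


# Hypothetical sample pages: same markup, different Chinese text of equal length.
pages = {
    "/chs/a.aspx": "<p>产品</p>",
    "/chs/b.aspx": "<p>联系</p>",
}

print(find_duplicates(pages))           # checksum over the full UTF-8 bytes
print(find_duplicates(pages, "ascii"))  # checksum after a lossy conversion
```

With the full UTF-8 bytes the two pages get different checksums. After the lossy conversion both collapse to the same bytes, so the second page is reported as a duplicate of the first. Whether anything like this happens inside Zoom is only a guess; disabling the CRC check, as suggested above, avoids the question entirely.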

        • #5
          We've confirmed that there's a bug with the duplicate page check in the current build. We're working on fixing the issue for the next release.

          In the meantime, as suggested above, you can disable CRC / duplicate page detection and it should allow your pages to index.
          --Ray
          Wrensoft Web Software
          Sydney, Australia
          Zoom Search Engine

          • #6
            Thank you, with duplicate page detection disabled I could index the site.
