PassMark Logo
Home » Forum

Announcement

Collapse
No announcement yet.

zoom stop to index the website

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • zoom stop to index the website

    Since 2-3 y I work with zoomsearch and had never a big problem and was very happy.
    Since begining december 2007 zoomsearch hang or stop to index the web site http://www.info-radiologie.ch
    I have to try 4-5X to get something.
    This evening I am unable to get anything.
    I have enterprise edition 5.1 (build: 1012). My computer runs with vista professional, RAM 4G and a good graphic card.
    The vast majority of the website is with the extension .php
    start url http://www.info-radiologie.ch/index.htm
    base url http://www.info-radiologie.ch/
    plaftform is php.
    there are at least 234 URL that should be indexed. Gallery 2 is in the skip folder.
    no error message. it stops suddenly
    for instance the last four lines are:
    indexing http..../irm_hypophyse/fullsize/irm_hypophyse_27fsa.jpg
    DL Thread #2 got URL.........../irm_hypophyse/thumbs/Sans_titre_._tmb.gif) off queue
    Download ................................(1024 bytes)
    skipping ................................. (image filesize small than min specified)
    and after that :nothing!
    Any idea?
    what to do?
    thanks and best regards

  • #2
    Did you leave it for at least 3 minutes to see if there was a timeout message. Maybe you just lost your internet connection.

    Did you check if you were using the latest versions of the plug-ins.

    I indexed your site from here using a Vista machine and V5.1 build 1012.

    It indexed 259 files in 3min 10sec. There were no errors and no lock ups.

    So it is probably something to do with your machine, your configuration or your internet connection.

    Do you have a 2nd machine to try running it from as an experiment. Can you also E-mail us your configuration file.

    Comment


    • #3
      >Did you leave it for at least 3 minutes to see if there was a timeout >message. Maybe you just lost your internet connection.
      No error message, No timeout message. It just stops. the DSL line (download:5000kbis/s, upload:500Kbi/s) seems ok. In the same time I can connect on internet witjh this machine or a mac. It's a network and lights on the router is ok.

      >Did you check if you were using the latest versions of the plug-ins.
      yes


      >It indexed 259 files in 3min 10sec. There were no errors and no lock ups.
      you were lucky. this morning AM7:43 same problem

      >So it is probably something to do with your machine, your configuration or >your internet connection.
      no comment


      >Do you have a 2nd machine to try running it from as an experiment.
      I will try with a mac trough parallel windows in this office. The other thing is to go to an other branch: however the computers are running under xp and it is dsl line.
      Stupid question: How to move zoomsearch to the other computers?

      >Can you also E-mail us your configuration file.
      yes it's done.

      best regards

      Comment


      • #4
        Got your file.

        I believe it was a problem in your configuration file.

        I believe that in fact everything was working correctly. But you didn't see anything happening because you had turned off the display of almost all of the log messages.

        If you reset the log display options back to their default values you can see the indexing progress.

        After making this change, I was able to index your site with your configuration file, without error.

        Comment


        • #5
          The answer is: NO. It's something else. No file is created. Nothing happens. It stops. Period. End of sentence. In addition no logindex.txt is created.
          I talk to the internet provider and everything is ok for them.
          If there is no solution. maybe should I downgrade to the version before december?
          best regards

          Comment


          • #6
            No logindex.txt file is created becuase you don't have logging turned on in your configuration file. If you have logging off, it is normal that there is no log file.

            When I first ran your config file it looked like it had stopped. Like you describe. But in fact it hadn't.

            It was still running, but becuase you have turned off almost all the log messages, there appeared to be no progress. It you left it for 30min I suspect it would have finished indexing. It took me 32min to index your 1147 image and HTML files.

            You can also see progress from the status tab if for some reason you want the log disabled.

            Comment


            • #7
              The beginning:
              ----------------------------------------------------------------
              11:12:35 - Zoom Search Engine Indexer (Enterprise Edition)
              11:12:35 - Version 5.1 (Build: 1012) on Windows Vista
              11:12:35 - Copyright Wrensoft 2000-2008 (http://www.wrensoft.com/)
              11:12:35 - Plugin for DOC files found. DOC file support enabled.
              11:12:35 - Plugin for PDF files found. PDF file support enabled.
              11:12:35 - Plugin for PPT files found. PPT file support enabled.
              11:12:35 - Plugin for XLS files found. XLS file support enabled.
              11:12:35 - Plugin for WPD files found. WPD file support enabled.
              11:12:35 - Plugin for SWF files found. SWF file support enabled.
              11:12:35 - Plugin for RTF files found. RTF file support enabled.
              11:12:35 - Plugin for DjVu files found. DjVu file support enabled.
              11:12:35 - Plugin for image files found. Image file support enabled.
              11:12:35 - Plugin for MP3 files found. MP3 file support enabled.
              11:12:35 - Plugin for DWF files found. DWF file support enabled.
              11:12:35 - Config file loaded: C:\Program Files\Zoom Search Engine 5.1\zoom.zcfg
              11:13:41 - Start indexing (spider mode) at Sun Mar 02 11:13:41 2008
              11:13:41 - Maximum number of words: 50000
              11:13:41 - Maximum number of files: 5000
              11:13:41 - Will scan files with extensions
              11:13:41 - .htm
              11:13:41 - .html
              11:13:41 - .php
              11:13:41 - .jpg
              11:13:41 - .gif
              11:13:41 - .png
              11:13:41 - .tiff
              11:13:41 - Spider from: http://www.info-radiologie.ch/index.htm
              11:13:41 - Web site URL: http://www.info-radiologie.ch/
              11:13:41 - Estimated RAM required during index process: 63789 KB
              11:13:41 - [DOWNLOAD] Downloading robots.txt file found at http://www.info-radiologie.ch/robots.txt
              11:13:41 - Initiating HTTP session (thread #1) ...
              11:13:41 - DL Thread #1, got URL (http://www.info-radiologie.ch/index.htm) off queue
              11:13:41 - [DOWNLOAD] Downloading file http://www.info-radiologie.ch/index.htm (22992 bytes)
              11:13:41 - Index Thread got ready buffer for http://www.info-radiologie.ch/index.htm (Content-type: HTML text)
              11:13:41 - Initiating HTTP session (thread #2) ...
              11:13:41 - [SPIDER] Spidering for links on http://www.info-radiologie.ch/index.htm
              11:13:41 - [SPIDER] Queued URL: http://www.info-radiologie.ch/images/info-radiologie-c.jpg
              11:13:41 - [SKIPPED] Skipping http://info-radiologie.ch/ (External site - does not match base URL)
              11:13:41 - [SKIPPED] Skipping http://www.info-radiologie.ch/technique_radiologie.php (Blocked by page skip list)
              ------------------------------------------------------


              Before it stopps
              ---------------------------------------------------------
              11:16:21 - Index Thread got ready buffer for http://www.info-radiologie.ch/abdominal_ct.php (Content-type: HTML text)
              11:16:21 - [SPIDER] Spidering for links on http://www.info-radiologie.ch/abdominal_ct.php
              11:16:22 - [SKIPPED] Skipping http://info-radiologie.ch/ (External site - does not match base URL)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/technique_radiologie.php (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/risques_radiologie.php (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/performance_radiologie.php (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/pathologie_info-radiologie.php (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/atlas_info-radiologie.php (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/divers_radiologie.php (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/nouveautes.php (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/gallery2/main.php?g2_page=1 (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/pathologie_info-radiologie.php (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/sitemap.php (Blocked by page skip list)
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/glossaire_radiologie.php (Blocked by page skip list)
              11:16:22 - [INDEXED] Indexing http://www.info-radiologie.ch/abdominal_ct.php
              11:16:22 - DL Thread #1, got URL (http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_1_jpg_fs.jpg) off queue
              11:16:22 - [DOWNLOAD] Downloading file http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_1_jpg_fs.jpg (13552 bytes)
              11:16:22 - Index Thread got ready buffer for http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_1_jpg_fs.jpg (Content-type: Image file)
              11:16:22 - [PLUGIN] Processing image file http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_1_jpg_fs.jpg
              11:16:22 - [SKIPPED] [Image plugin warning] Could not find meta information.
              11:16:22 - [INDEXED] Indexing http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_1_jpg_fs.jpg
              11:16:22 - DL Thread #2, got URL (http://www.info-radiologie.ch/ct_abdomen/thumbs/ct_abdominal_1_jpg_tmb.jpg) off queue
              11:16:22 - [DOWNLOAD] Downloading file http://www.info-radiologie.ch/ct_abdomen/thumbs/ct_abdominal_1_jpg_tmb.jpg
              11:16:22 - [SKIPPED] Skipping http://www.info-radiologie.ch/ct_abdomen/thumbs/ct_abdominal_1_jpg_tmb.jpg (Image filesize smaller than minimum specified)
              11:16:23 - DL Thread #2, got URL (http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_2_jpg_fs.jpg) off queue
              11:16:23 - [DOWNLOAD] Downloading file http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_2_jpg_fs.jpg (16660 bytes)
              11:16:23 - Index Thread got ready buffer for http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_2_jpg_fs.jpg (Content-type: Image file)
              11:16:23 - [PLUGIN] Processing image file http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_2_jpg_fs.jpg
              11:16:23 - [SKIPPED] [Image plugin warning] Could not find meta information.
              11:16:23 - [INDEXED] Indexing http://www.info-radiologie.ch/ct_abdomen/fullsize/ct_abdominal_2_jpg_fs.jpg
              11:16:23 - DL Thread #1, got URL (http://www.info-radiologie.ch/ct_abdomen/thumbs/ct_abdominal_2_jpg_tmb.jpg) off queue
              11:16:23 - [DOWNLOAD] Downloading file http://www.info-radiologie.ch/ct_abdomen/thumbs/ct_abdominal_2_jpg_tmb.jpg (1024 bytes)
              11:16:23 - [SKIPPED] Skipping http://www.info-radiologie.ch/ct_abdomen/thumbs/ct_abdominal_2_jpg_tmb.jpg (Image filesize smaller than minimum specified)
              -----------------------------------------------------------------


              and then no more indexing process: it's not a dream but a nightmare.
              I m'going in otheroffice and will post some pitcures
              best regards.

              Comment


              • #8
                It is impossible the the above trace came from the configuration file that you sent us. Can you send us the configuration file that you are actually using.

                It also contradicts your initial statement, "This evening I am unable to get anything". Which I assumed meant that you thought no files were being indexed.

                If we can't reproduce it with your config file, we might need to create a special debug build to narrow things down further.

                I also realised I didn't answer an earlier question.
                Stupid question: How to move zoomsearch to the other computers?
                You can just uninstall and re-install. You'll need to enter in your license key again on the new machine however.

                Comment


                • #9
                  Hi,

                  Does this mean something special for you?

                  03/03/08 21:01:25 - DL Thread #1, got URL (http://www.info-radiologie.ch/plonger/bibliotheque3.jpg) off queue
                  03/03/08 21:01:25 - Downloading file http://www.info-radiologie.ch/plonger/bibliotheque3.jpg
                  03/03/08 21:01:25 - Index Thread got ready buffer for http://www.info-radiologie.ch/plonger/bibliotheque3.jpg (Content-type: Image file)
                  03/03/08 21:01:25 - Processing image file http://www.info-radiologie.ch/plonger/bibliotheque3.jpg
                  03/03/08 21:01:25 - [Image plugin error] Failed to open file for reading (Error reading from: C:\Users\HP_Administrateur\AppData\Local\Wrensoft\ Zoom Search Engine Indexer\zoom_plugin.in)
                  03/03/08 21:01:25 - Image plugin failed. Only filename is indexed.

                  thanks for your help
                  best regards

                  Comment


                  • #10
                    It sounds like the issue with Vista's built-in Indexing Service as described here and here.

                    Actually, I just noticed that first thread I linked to was actually posted by you when you first came across this problem last year. Do you have Indexing Service disabled for the correct folders? Note that the GUI Microsoft made for this is a bit ambiguous: a root folder may not be marked as selected, but subfolders within that root folder could be selected. Make sure you expand the folder tree and see if any of the folders in that path are checked.
                    --Ray
                    Wrensoft Web Software
                    Sydney, Australia
                    Zoom Search Engine

                    Comment

                    Working...
                    X