
certain pages not appearing in search results


  • certain pages not appearing in search results

    I have a few pages that are not appearing in the search results. I've checked all the points you list here, but still no joy. An example page is this one and also this page. I've indexed using verbose mode, but can't really tell if the pages are being added. Any clues would be gratefully received.
    Pete Vickerstaff
    MG F/TF Central

  • #2
    This is a link to the index log:
    http://www.mgf-tf-central.co.uk/mg/indexlog.txt
    Pete Vickerstaff
    MG F/TF Central



    • #3
      The /links-shopping.htm and /links-service.htm pages were both indexed. You can see this in the log.


      [INDEXED] Indexing http://www.mgf-tf-central.co.uk/links-service.htm
      [INDEXED] Indexing http://www.mgf-tf-central.co.uk/links-shopping.htm


      If I go to your site and search for a word like "shopping", the links-shopping.htm page is displayed as the third result.

      So I don't see the problem?



      • #4
        Thanks for the reply. What you say is true, but search for, say, "Leicestershire", which is in the body of the /links-service.htm page, and that page does not appear in the search results. I've also checked that I have a <!--ZOOMRESTART--> after a <!--ZOOMSTOP-->, which I do, and it's before the main body of the page.
        Pete Vickerstaff
        MG F/TF Central



        • #5
          The problem is that you have broken HTML code on your webpages. Looking at this page:
          http://www.mgf-tf-central.co.uk/links-service.htm

          There is the following on line 91:
          Code:
          <td valign="top" height="100%" ; padding-top: 5">
          Note the extra quotation mark within the height= attribute (which is still wrong even with the extra quote removed, as "padding-top" is CSS and belongs in a style= attribute).

          This extra quote mark caused the rest of your content to be filtered out, because Zoom believed it was a new opening quote character. There may be other similar issues with your other pages.
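To illustrate the failure mode described above, here is a minimal Python sketch of a quote-aware tag scanner. It is not Zoom's actual parser; the function name `visible_text` and the sample cell content are made up for the example, and the `style="padding-top: 5px"` fix assumes the intended unit was pixels. It only shows how one stray quote can make a scanner treat the rest of the page as an attribute value.

```python
def visible_text(html: str) -> str:
    """Return the text outside tags, using a deliberately naive scanner."""
    out = []
    in_tag = False
    quote = None  # the quote character currently open inside a tag, if any
    for ch in html:
        if in_tag:
            if quote:
                # Inside a quoted attribute value: only the matching quote ends it.
                if ch == quote:
                    quote = None
            elif ch in "\"'":
                quote = ch
            elif ch == ">":
                in_tag = False
        elif ch == "<":
            in_tag = True
        else:
            out.append(ch)
    return "".join(out)


# The tag from line 91 of the page, followed by hypothetical cell content.
broken = '<td valign="top" height="100%" ; padding-top: 5">Leicestershire</td>'
# One possible correction (assumption: padding was meant to be 5 pixels).
fixed = '<td valign="top" height="100%" style="padding-top: 5px">Leicestershire</td>'

print(repr(visible_text(broken)))
print(repr(visible_text(fixed)))
```

With the broken tag, the fifth quote opens a value that is never closed, so the `>` and everything after it are swallowed and no text is extracted. With the corrected tag the word is recovered.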

          It's worth putting problematic pages through an HTML validator to easily locate these sorts of problems. The official W3 one is here:
          http://validator.w3.org/

          However, that can be fairly pedantic and will point out many more minor issues (standards conformance being their priority). There are other, less pedantic but equally useful validation tools such as this one:
          http://onlinewebcheck.com/check.php
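As a rough local complement to the validators above, a short Python script can flag tags that contain an odd number of double quotes, which is the specific mistake in this thread. This is only a heuristic sketch: the name `suspicious_tags` and the sample page are hypothetical, and it will miss problems involving single quotes or a `>` inside an attribute value.

```python
import re

# Matches anything that looks like a tag and contains no nested angle brackets.
TAG_RE = re.compile(r"<[^<>]*>")


def suspicious_tags(html: str):
    """Return (line_number, tag) pairs for tags with an odd count of double quotes."""
    hits = []
    for match in TAG_RE.finditer(html):
        tag = match.group(0)
        if tag.count('"') % 2 == 1:
            # Line numbers are 1-based, counted from newlines before the tag.
            line_no = html.count("\n", 0, match.start()) + 1
            hits.append((line_no, tag))
    return hits


# Hypothetical page fragment containing the broken tag from this thread.
page = (
    "<table>\n"
    "<tr>\n"
    '<td valign="top" height="100%" ; padding-top: 5">\n'
    "Text\n"
    "</td>\n"
    "</tr>\n"
    "</table>"
)

for line_no, tag in suspicious_tags(page):
    print(f"line {line_no}: {tag}")
```

Anything it reports is worth checking by eye, and a full validator is still the better tool for everything else.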

          Having said all that, I added some code to the next build of Zoom which will prevent the above broken HTML from causing the rest of the page to be filtered out from indexing, and will be a little more forgiving of bad HTML.
          --Ray
          Wrensoft Web Software
          Sydney, Australia
          Zoom Search Engine



          • #6
            Thanks Ray, you're a diamond. I put the pages through the validator to try to get W3-standard HTML, but it came back with so many problems that I gave up. Your second link looks more hopeful. Thanks again.
            Pete Vickerstaff
            MG F/TF Central

