Wordpress WPML localized URLs with parameters are always "[SKIPPED]"


  • Wordpress WPML localized URLs with parameters are always "[SKIPPED]"

    Hello,
    I am a long-time Zoom Professional user, and for the first time I have run into an issue I couldn't figure out myself, even after consulting the FAQ and the forum:

    We run a WordPress / WPML site that has some content in a second language, German. The WPML plugin's language URL setting is:
    "Language name added as a parameter (https://docklight.de?lang=de - German)"

    What we want is one common search index, so that a search always covers both English and German pages, like a standard Google search. But the German pages never show up in the index.

    I even tried adding all the German URLs as extra spider "start points", first in the WPML canonical format with the added parameter:
    https://docklight.de/kundenmeinungen/?lang=de

    But I also tried variations:
    https://docklight.de/kundenmeinungen
    or
    https://docklight.de/kundenmeinungen/

    Here is what the last variant produces in the log file:

    -> The spider already finds the German pages without the extra start point, but reports "Blocked by extensions list":

    11:59:59 - Zoom Search Engine Indexer (Professional Edition)
    11:59:59 - Version 7.1 (Build: 1022) on Windows 10

    12:00:08 - [SPIDER] Spidering for links on https://docklight.de/
    ...
    12:00:08 - [SKIPPED] Skipping https://docklight.de/?lang=de (Blocked by extensions list)
    ...
    12:00:08 - [SKIPPED] Skipping https://docklight.de/kundenmeinungen/?lang=de (Blocked by extensions list)
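    When the log is long, it can help to pull out every skipped URL together with the stated reason. The following is a minimal sketch, assuming the log lines keep the "time - [SKIPPED] Skipping URL (reason)" shape shown above; the function name `skipped_urls` and the `sample_log` list are made up for illustration and are not part of Zoom.

```python
import re

# Matches lines of the shape seen in the log excerpt above, e.g.
# 12:00:08 - [SKIPPED] Skipping https://docklight.de/?lang=de (Blocked by extensions list)
# Assumption: the URL contains no spaces and the reason sits in the final parentheses.
SKIPPED_LINE = re.compile(
    r"^\s*\d{2}:\d{2}:\d{2} - \[SKIPPED\] Skipping (?P<url>\S+) \((?P<reason>[^)]*)\)\s*$"
)


def skipped_urls(log_lines):
    """Return a list of (url, reason) pairs, one per [SKIPPED] line."""
    results = []
    for line in log_lines:
        match = SKIPPED_LINE.match(line)
        if match:
            results.append((match.group("url"), match.group("reason")))
    return results


# Hypothetical sample built from the lines quoted in this post.
sample_log = [
    "12:00:08 - [SPIDER] Spidering for links on https://docklight.de/",
    "12:00:08 - [SKIPPED] Skipping https://docklight.de/?lang=de (Blocked by extensions list)",
    "12:00:08 - [SKIPPED] Skipping https://docklight.de/kundenmeinungen/?lang=de (Blocked by extensions list)",
]

for url, reason in skipped_urls(sample_log):
    print(f"{url} -> {reason}")
```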

    -> Here Zoom finds my extra start point. It gets redirected properly to the above format, with no duplicates, great.
    But again, no indexing:

    12:00:51 - [SPIDER] Moving on to next start point: https://docklight.de/kundenmeinungen/
    12:00:51 - [SPIDER] Queued URL: https://docklight.de/kundenmeinungen/
    12:00:51 - DL Thread #8, got URL (https://docklight.de/kundenmeinungen/) off queue
    12:00:51 - [DOWNLOAD] Downloading file https://docklight.de/kundenmeinungen/
    12:00:52 - [DOWNLOAD] URL redirected to: https://docklight.de/kundenmeinungen/?lang=de [thread #8]


    I even activated "Scan files with no extensions", but that made no difference.

    I am out of ideas on how to get these "added parameter" pages into the index.

    Is there a solution or setting I did not consider?

    Thanks in advance for any assistance!

    Best regards,
    Oliver
    Last edited by FuH-Oliver; Sep-13-2018, 10:37 AM. Reason: deleted duplicate log snippet, minor formatting

  • #2
    Scan files with no extensions
    Yes, this needs to be turned on, because it isn't clear just from looking at the URL whether it points to a JPG file, an HTML file, or something else.

    With that option enabled, I didn't have any problems, as long as the correct Base URL was set.
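    To illustrate why such URLs carry no usable extension, here is a minimal sketch of an extension-based filter. It is not Zoom's actual implementation; `ALLOWED_EXTENSIONS`, `url_extension`, and `is_scanned` are hypothetical names, and the allow-list contents are an assumption. The point is only that the `?lang=de` query string is not part of the path, so a URL like https://docklight.de/kundenmeinungen/?lang=de ends up with an empty extension and depends entirely on the "no extension" switch.

```python
import posixpath
from urllib.parse import urlsplit

# Hypothetical allow-list; a real indexer's list is configured by the user.
ALLOWED_EXTENSIONS = {".html", ".htm", ".php"}


def url_extension(url):
    """Return the lower-cased file extension of the URL path, or '' if none.

    The query string (e.g. ?lang=de) is split off first, so it never
    contributes to the extension.
    """
    path = urlsplit(url).path
    return posixpath.splitext(path)[1].lower()


def is_scanned(url, scan_no_extension):
    """Decide whether a URL passes this illustrative extension filter."""
    ext = url_extension(url)
    if ext == "":
        # Nothing in the URL says whether this is HTML, an image, or
        # something else, so the decision falls to the explicit switch.
        return scan_no_extension
    return ext in ALLOWED_EXTENSIONS


url = "https://docklight.de/kundenmeinungen/?lang=de"
print(repr(url_extension(url)))
print(is_scanned(url, scan_no_extension=False))
print(is_scanned(url, scan_no_extension=True))
```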

    [Attached screenshot: baseURL.png]
