As a follow up to this question that I previously posted in the wrong area (I hope I am doing it right this time by posting a new thread) and your reply:
Quote:
Originally Posted by WilliamJ
Lastly, and this too is off the subject, did you know that zoom can index 2 separate password protected directories (with the same log-in info) but they must be in a particular order in the SPIDER listing of directories? And I think this order may be alphabetical. In other words, if you want to index /file/A_protected and /file/B_protected, I think they need to be listed in in this order in the SPIDER list. If I put /file/B_protected ahead of /file/A_protected in the spider list it will index /file/B_protected but /file/A_protected will error out with a 401 error. Reversing the order so that they are alphabetical results in a correct search of both directories with no errors. Now, someone may ask the obvious next question of indexing two directories that have 2 separate login passwords?
No, the alphabetical order of the listing of URLs has no significance. I think what you observed would most likely be coincidence and is actually a behaviour caused by an unrelated issue.
First question would be what type of authentication is your site using. Please see this support page for a detailed explanation:
http://www.wrensoft.com/zoom/support/auth.html
My guess is, if you're using session/cookie-based authentication - that your first URL/directory might contain a link to logout, and you have not added this to your skip list. Because of this, when you index the first URL, the spider is logged out, and unable to log back in to access the second URL. That's just purely guessing from what little information I have though. But you should look into it, and I can tell you that it's definitely not due to the alphabetical order of the directories.
Initially I was able to change the order the protected directories were listed in the spider index URL list and it indexed OK. I thought it simply needed to be alphabetical. Today, we tried to index again and it would not index both directories - It 401 errored out on the 2nd directory in list. We made no changes to this list or to any files in those directories since the last index. What worked before was to delete the directory and change the order so I tried that today - deleted that directory listing from the spider listbox and added it back in at a different spot (changing the spider directory order), performed the index and it worked fine. We use HTTP authentication and we do not have a LOG OFF page or field on any of the pages being indexed in those directories. I thought it was alphabetical but as you noted above it is not, I think it must be somehow not performing the login on the 2nd directory correctly and may be out of sync or ??. It is no big deal as I worked around it and now that I know what I can do to get it to index, I can certainly live with it but it may be something in the program that may need to be addressed. I am using V5 build 1004 PROFESSIONAL.
Quote:
Originally Posted by WilliamJ
Lastly, and this too is off the subject, did you know that zoom can index 2 separate password protected directories (with the same log-in info) but they must be in a particular order in the SPIDER listing of directories? And I think this order may be alphabetical. In other words, if you want to index /file/A_protected and /file/B_protected, I think they need to be listed in in this order in the SPIDER list. If I put /file/B_protected ahead of /file/A_protected in the spider list it will index /file/B_protected but /file/A_protected will error out with a 401 error. Reversing the order so that they are alphabetical results in a correct search of both directories with no errors. Now, someone may ask the obvious next question of indexing two directories that have 2 separate login passwords?
No, the alphabetical order of the listing of URLs has no significance. I think what you observed would most likely be coincidence and is actually a behaviour caused by an unrelated issue.
First question would be what type of authentication is your site using. Please see this support page for a detailed explanation:
http://www.wrensoft.com/zoom/support/auth.html
My guess is, if you're using session/cookie-based authentication - that your first URL/directory might contain a link to logout, and you have not added this to your skip list. Because of this, when you index the first URL, the spider is logged out, and unable to log back in to access the second URL. That's just purely guessing from what little information I have though. But you should look into it, and I can tell you that it's definitely not due to the alphabetical order of the directories.
Initially I was able to change the order the protected directories were listed in the spider index URL list and it indexed OK. I thought it simply needed to be alphabetical. Today, we tried to index again and it would not index both directories - It 401 errored out on the 2nd directory in list. We made no changes to this list or to any files in those directories since the last index. What worked before was to delete the directory and change the order so I tried that today - deleted that directory listing from the spider listbox and added it back in at a different spot (changing the spider directory order), performed the index and it worked fine. We use HTTP authentication and we do not have a LOG OFF page or field on any of the pages being indexed in those directories. I thought it was alphabetical but as you noted above it is not, I think it must be somehow not performing the login on the 2nd directory correctly and may be out of sync or ??. It is no big deal as I worked around it and now that I know what I can do to get it to index, I can certainly live with it but it may be something in the program that may need to be addressed. I am using V5 build 1004 PROFESSIONAL.
Comment