First up - new user alert.
Currently I am taking on a task for a large organisation that will not permit Zoom on the servers or network. For a number of years the tool has been used offline on a laptop then the index loaded onto the server which works very well.
Given the success of this approach, another business entity has decided to follow suit, but they have numerous linked PDF files (thousands).
The Problem
To download the 15-16000 html files takes a few hours, indexing a few minutes then the upload of the index files about 20 minutes. Currently we do not download the PDF files as the time is prohibitive but I would like to work out a way to get them done too. Ideally they would be downloaded closer to the server location and scanned there, or perhaps a mirror site would be kept for just this purpose on a laptop (I may have just answered my own question here).
Is it possible to merge two index results? Are there any other options experienced users have come up with?
The search engine is distributed on CDROM as well as the website.
Currently I am taking on a task for a large organisation that will not permit Zoom on the servers or network. For a number of years the tool has been used offline on a laptop then the index loaded onto the server which works very well.
Given the success of this approach, another business entity has decided to follow suit, but they have numerous linked PDF files (thousands).
The Problem
To download the 15-16000 html files takes a few hours, indexing a few minutes then the upload of the index files about 20 minutes. Currently we do not download the PDF files as the time is prohibitive but I would like to work out a way to get them done too. Ideally they would be downloaded closer to the server location and scanned there, or perhaps a mirror site would be kept for just this purpose on a laptop (I may have just answered my own question here).
Is it possible to merge two index results? Are there any other options experienced users have come up with?
The search engine is distributed on CDROM as well as the website.
Comment