Hi,
I'm trying to build a vertical search engine but keep running into indexing problems. What I'd like to do is index about 100 unique domains but restrict the indexing to certain folders that are unique to each site.
For example:
domain1.com/news-events/
domain2.com/news/
domain3.com/events/
domain4.com/calendar/
etc.
Every time I try to do this by adding multiple starting points and restricting the spider URL to the unique folder for each site, all I'm able to index is the first domain. All other domains are skipped because they're considered external sites that don't match the base URL.
How would I configure the additional start points to effectively index these domains but only for the folders as listed above?
Thanks,
Cory
I'm trying to build a vertical search engine but keep running into indexing problems. What I'd like to do is index about 100 unique domains but restrict the indexing to certain folders that are unique to each site.
For example:
domain1.com/news-events/
domain2.com/news/
domain3.com/events/
domain4.com/calendar/
etc.
Every time I try to do this by adding multiple starting points and restricting the spider URL to the unique folder for each site, all I'm able to index is the first domain. All other domains are skipped because they're considered external sites that don't match the base URL.
How would I configure the additional start points to effectively index these domains but only for the folders as listed above?
Thanks,
Cory
Comment