I tried to dig thru the documentation, but I couldn't find any information about the Crawling engine in EPiServer find. What we need to know is if we for example have a site that have both normal Episerver content but also want to crawl lets say 2 more sites for content. Can we then include all the content into one index? Or is one crawled websites = one index? And then we need to search in several indexes? I sort of assumed that we could crawl how many websites we want and then just bundle everything into one index.
Please enlighten me! :)
A single Episerver instance points to a single Find index (1 index per CMS instance)
That Find index can have crawlers associated with it. (multiple crawlers per Index)
Multiple websites can be crawled, and the results will be indexed in the same index that this has been set up on.
NB - the items in the Find index that result from a crawl will contain the full HTML of the crawled pages. To my knowledge there is now HTML parsing.
Thanks for your answer Marcus, just was I was looking for!