Don't miss out Virtual Happy Hour today (April 26).

Try our conversational search powered by Generative AI!

Indexing page with external content

Vote:
 

Hi,

I have ha page that is getting its content from a extern sorce"available jobs".

How should i do to index this page in find when the content does not come from EPiServer CMS?

I have looked at Connectors but it seems unnecessary to have to pay for when it's just one page.

#171979
Edited, Nov 22, 2016 17:06
Vote:
 

Hi Pär,

You can create an extension method for that page type, and then index it as a field by setting custom conventions in an initialization module:

 SearchClient.Instance.Conventions
    .ForInstancesOf<MyPageType>()
    .IncludeField(x => x.MyExtension());
#172002
Edited, Nov 23, 2016 10:41
Vote:
 

Thanks Dejan for your response, I'll check into it.


I got one more question from the customer. 

Customer website has pages which redirects to external Web sites. Is it possible to get these external pages indexed too?

#172045
Nov 24, 2016 12:53
Vote:
 

Hi Pär,

Episerver Find is not limited to Episerver content (pages, blocks, etc.).

You can index custom classes as well.

For external pages, you could create a custom crawler. HtmlAgilityPack is your best friend: https://www.nuget.org/packages/HtmlAgilityPack

And then index the content using standard API calls: http://world.episerver.com/documentation/Items/Developers-Guide/EPiServer-Find/11/DotNET-Client-API/Indexing/

But the question is what they want to achieve, and do you really have to index external websites.

#172047
Nov 24, 2016 13:07
Vote:
 

Once again thank you so much for your help.

You ask a good question that I will ask the customer.

The customer has previously redirectat to a hacked site that showed pornographic material. It's probably good to think this through once or twice.

#172048
Nov 24, 2016 13:19
This topic was created over six months ago and has been resolved. If you have a similar question, please create a new topic and refer to this one.
* You are NOT allowed to include any hyperlinks in the post because your account hasn't associated to your company. User profile should be updated.