November Happy Hour will be moved to Thursday December 5th.

Indexing documents with Find

Vote:
 

Hi there!

Can anybody explain in a simple way how Find indexes documents (pdf, doc)? Are search queries matched against document content or just metadata? We have a site that uses Find and for certain queries documents that are not a match neither on the title nor in the content are returned. I'm new to Find so I'm a bit clueless on where the problem is. There's no customization done in code regarding the indexing and searching of just documents.

Thanks in advance

/Kenia

#155221
Sep 12, 2016 16:38
Vote:
 

Find will index the content of the document and search against it for types like doc, docx, ppt, pptx and so on.

To make Find index document I thought that it was done by default, but otherwize it is simple to do, read more about it here:

http://world.episerver.com/documentation/Items/Developers-Guide/EPiServer-Find/12/Integration/episerver-7-5/Indexing/

If you like to index only documents that are visible on the site, follow my blog post here:

http://world.episerver.com/blogs/Henrik-Fransas/Dates/2014/3/Indexing-only-referenced-files-with-EPiServer-Find-75/

#155224
Sep 12, 2016 16:54
Vote:
 

Thanks Henrik, I'll check the links out!

#155287
Sep 13, 2016 14:37
Vote:
 

Hi in previous version like 8 - 9 of Find I used an indexing convention init module adding this:

SearchClient.Instance.Conventions.UnifiedSearchRegistry.Add(typeof(PdfFile));

I remember adding a feature pointing out wich page the document was held by.

If you want searchhighlight in files that was the only way to go.

But this required Unified search.

/D

#155368
Sep 15, 2016 9:26
This topic was created over six months ago and has been resolved. If you have a similar question, please create a new topic and refer to this one.
* You are NOT allowed to include any hyperlinks in the post because your account hasn't associated to your company. User profile should be updated.