Weird excerpt when indexing old Word .doc documents

I'm getting weird results when indexing old Word .doc documents. The .docx files work fine, but the .doc files all appear like this in the search results:

���� � ���NAMN PŠDOKUMENT�Datum�Beteckning�����2015-01-01� �Sida � PAGE \* MERGEFORMAT �1� (� NUMPAGES \* MERGEFORMAT �1�) ���� ���� ���� ��

Has anyone got this to work? We're on EPiServer 7.13 and Find 8.8, if it matters. The Excerpt field is set with:

conventions.ProjectExcerptUsing(spec =>
    doc => doc.SearchAttachment().AsCropped(spec.ExcerptLength)
);

Any help appreciated. Cheers.

#116737
Feb 04, 2015 13:21