Office documents with EMF images embedded fail metadata extraction

Description

Steps to Reproduce
1. Set org.alfresco.repo.content.metadata.AbstractMappingMetadataExtracter to debug
2. Upload attached file to Share

Expected Results
Metadata is successfully extracted.

Observed Results
Metadata extraction fails with the following exception:

Notes

  • Customer has discovered the issue is because our patched tika-parsers-1.21-20190624-alfresco-patched.jar has notkept up with Apache's new poi-scratchpad-4.1.1.jar

  • Also reproduced with ATS AIO 2.3.7

Environment

None

Testcase ID

None
Duplicate
Your pinned fields
Click on the next to a field label to start pinning.

Assignee

Alexandru Epure

Reporter

Scott Ashcraft

Hot Fix Version

ACT Numbers

00359716

Delivery Team

Customer Excellence

Bug Priority

Category 2