Apache-tika and MediaDataBox Files in System %TEMP% Folder After Classification

Article ID: 100051922

Description

Error Message

Caused by: org.apache.tika.exception.TikaException: Unexpected RuntimeException from org.apache.tika.parser.mp4.MP4Parser@712f21b5

Cause

This is a known issue with processing MP4 files in the version of Apache Tika shipped with Data Insight:

[TIKA-3128] MOV file produces RuntimeException with 1.24.1, used to work with earlier version 1.19.1 - ASF JIRA (apache.org)

Resolution

Workaround:

Until Apache releases a fix, exclude MP4 files with a .MOV extension from classification.

Issue/Introduction

During a classification job, the system TEMP folder can fill up with files whose names begin with apache-tika and MediaDataBox. These files may not be cleaned up after the classification request completes.
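
To gauge how much space these leftover files occupy, a small read-only script can list them. The following is a minimal sketch, not a Data Insight tool: the function name find_leftover_files is hypothetical, and it assumes the files sit directly in the TEMP folder of the account that runs the script, which may differ from the TEMP folder used by the classification service account. It only reports files and does not delete anything.

```python
import tempfile
from pathlib import Path

# File-name prefixes named in this article. Compared in lowercase because
# Windows file names are case-insensitive.
PREFIXES = ("apache-tika", "mediadatabox")


def find_leftover_files(temp_dir=None):
    """Return a list of (path, size_in_bytes) for files directly inside
    temp_dir whose names start with one of PREFIXES.

    If temp_dir is None, the current user's temp folder is used
    (an assumption; pass the service account's TEMP path explicitly if needed).
    """
    root = Path(temp_dir) if temp_dir else Path(tempfile.gettempdir())
    matches = []
    for entry in root.iterdir():
        if not entry.name.lower().startswith(PREFIXES):
            continue
        try:
            if entry.is_file():
                matches.append((entry, entry.stat().st_size))
        except OSError:
            # The file was removed or is inaccessible; skip it.
            continue
    return matches


if __name__ == "__main__":
    leftovers = find_leftover_files()
    total_bytes = sum(size for _, size in leftovers)
    for path, size in leftovers:
        print(f"{size:>12}  {path}")
    print(f"{len(leftovers)} file(s), {total_bytes} bytes in total")
```

Running the script before and after a classification job shows whether the number of matching files keeps growing.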