Here are the most frequent problems and their solutions, as documented by the Apache Tika project:
If the log shows a specific file ID or filename right before the crash, isolate that file. It is likely a corrupted document or an unsupported legacy format causing a loop. Step 2: Adjust JVM Memory Allocation filedotto tika fixed