  1. VuFind
  2. VUFIND-1330

Multiple Tika processes are spawned & hanging while indexing

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 3.1.4
    • Fix Version/s: 5.1.1
    • Component/s: Import Tools
    • Labels:
    • Environment:
      Ubuntu 16.04 LTS, tika-app-1.20.jar

      Description

      When importing the attached record, Tika seems to hang, and with it the entire import process (harvest/batch-import-marc.sh).

      Running the Tika command directly from a terminal finishes within a second. In my case the command is `java -jar /usr/local/vufind/tika/tika.jar -t -eUTF8 https://www.muenzfunde.ch/downloads/bulletins/ifs_bulletin_2008.pdf`. The exact same command shows up multiple times in the process list while the import hangs.

      Any ideas how to debug this further?

      Originally I tried to import a rather large XML file with ~20k records; I tracked down this record by searching for the muenzfunde.ch URL.
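
One way to narrow this down, as a minimal sketch only: run the same Tika command from a script under a timeout, with stdin closed, so that a hang shows up as a timeout instead of a stuck process. The helper names `tika_command` and `run_with_timeout` and the 30-second default are assumptions for illustration; they are not part of VuFind or Tika, and this does not reproduce how the import script actually invokes Tika.

```python
import subprocess
import sys


def tika_command(jar_path, url):
    """Build the same argument list as the command quoted in this report."""
    return ["java", "-jar", jar_path, "-t", "-eUTF8", url]


def run_with_timeout(cmd, timeout_s=30):
    """Run cmd and return (returncode, stdout_text).

    Returns None if the process did not finish within timeout_s seconds;
    subprocess.run kills the child in that case before raising.
    """
    try:
        result = subprocess.run(
            cmd,
            stdin=subprocess.DEVNULL,   # the child cannot block waiting on our stdin
            stdout=subprocess.PIPE,     # capture extracted text
            stderr=subprocess.PIPE,     # capture errors so the pipe never fills unread
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired:
        return None
    return result.returncode, result.stdout.decode("utf-8", errors="replace")
```

Calling `run_with_timeout(tika_command("/usr/local/vufind/tika/tika.jar", "https://www.muenzfunde.ch/downloads/bulletins/ifs_bulletin_2008.pdf"))` returns `None` if the process hangs past the limit. If it completes here but still hangs during the import, the difference is more likely in how the import process spawns Tika than in the PDF itself.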

            People

            Assignee:
            Unassigned
            Reporter:
            shohl Simon Hohl