Full Text Extraction Tools

VuFind's import tools can call third-party software to extract full text from external documents (PDF, Word, etc.). To take advantage of this, install an appropriate tool (see the options below) and point VuFind to it by editing the fulltext.ini file in your local settings directory.
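
As a rough sketch of what that edit might look like, the fragment below selects Tika as the parser and gives the path to its jar. The section and key names follow the sample fulltext.ini distributed with VuFind as best recalled here, and both paths are placeholders; treat all of them as assumptions and check the comments in your own copy of the file.

```ini
; fulltext.ini (illustrative sketch; verify names against your VuFind version)
[General]
; Which extraction tool to use: Aperture or Tika
parser = Tika

[Tika]
; Placeholder path: adjust to wherever the Tika jar is installed
path = "/usr/local/tika/tika.jar"

[Aperture]
; Only relevant if parser = Aperture; placeholder path
;webcrawler = "/usr/local/aperture/bin/webcrawler.sh"
```

Only the section matching the chosen parser needs a live setting; the other can stay commented out.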

For more details on using full text in VuFind, see the import instructions for MARC and XML documents.

Aperture

Note: Aperture was the first tool that VuFind supported for full-text extraction, and it is the only option for use with versions 1.3 and earlier. Unfortunately, Aperture is no longer in active development, so users with VuFind 1.4 or later are encouraged to use Tika instead (see below).

Downloading Aperture

Troubleshooting Aperture

Under Linux, version 1.5.0 has a known bug that prevents Aperture from running correctly on the command line. See this page for details on the fix (it just involves minor edits to the lcp.sh file).

Tika

Tika is supported by VuFind 1.4 and later, and is the recommended full-text extraction tool for those versions.

Downloading Tika

indexing/full_text_tools.txt · Last modified: 2018/12/19 13:31 by demiankatz