Warning: This page has not been updated in over a year and may be outdated or deprecated.

Full Text Extraction Tools

VuFind®'s import tools can use external software to extract full text from external documents (PDF, Word, etc.). To take advantage of this, install an appropriate tool (see options below) and point VuFind® to the software by editing the fulltext.ini file within your local settings directory.

For more details on using full text in VuFind®, see the import instructions for MARC and XML documents.

Tika

Tika is the recommended full-text extraction tool for use with VuFind®.
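
Before pointing VuFind® at Tika, it can help to confirm that the Tika jar extracts text on its own. The following is a minimal Python sketch, not part of VuFind® and not how VuFind® itself invokes Tika. It assumes Java is on the PATH and that the Tika application jar lives at the hypothetical path /usr/local/tika/tika.jar; the function names are illustrative only. It relies on the Tika application's --text option, which writes plain text to standard output.

```python
import subprocess
import sys
from pathlib import Path

# Hypothetical location of the Tika application jar; adjust to your installation.
TIKA_JAR = Path("/usr/local/tika/tika.jar")


def build_tika_command(document, tika_jar=TIKA_JAR):
    """Return the argument list for Tika's command-line plain-text extraction."""
    return ["java", "-jar", str(tika_jar), "--text", str(document)]


def extract_text(document, tika_jar=TIKA_JAR, timeout=120):
    """Run Tika on one document and return the extracted plain text.

    Raises subprocess.CalledProcessError if Tika exits with an error and
    subprocess.TimeoutExpired if extraction takes longer than `timeout` seconds.
    """
    result = subprocess.run(
        build_tika_command(document, tika_jar),
        capture_output=True,
        check=True,
        timeout=timeout,
    )
    # Decode defensively: extracted text may contain unexpected bytes.
    return result.stdout.decode("utf-8", errors="replace")


if __name__ == "__main__":
    # Usage (script name is arbitrary): python check_tika.py some_document.pdf
    print(extract_text(sys.argv[1]))
```

If the script prints the document's text, the jar is working and its location is the path that the fulltext.ini settings need to point to.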

Downloading Tika

Aperture

Note: Aperture is supported as an alternative to Tika. However, Aperture is no longer in active development, so users are strongly encouraged to use Tika instead (see above).

Downloading Aperture

The Sitemaps and Web Indexing video includes a demonstration of setting up a full text extraction tool.

indexing/full_text_tools.txt · Last modified: 2024/03/13 11:58 by demiankatz