Table of Contents

OAI-PMH Harvesting

About OAI-PMH

Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is a low-barrier mechanism for repository interoperability. Data Providers are repositories that expose structured metadata via OAI-PMH. Service Providers then make OAI-PMH service requests to harvest that metadata. OAI-PMH is a set of six verbs or services that are invoked via HTTP.

VuFind has been able to act as a Service Provider since release 1.0.1, since a simple tool is included for harvesting records using the protocol. This tool is also available as a standalone project called VuFindHarvest.

Setting up OAI-PMH

To set up OAI-PMH harvesting, simply edit the oai.ini file in the harvest subdirectory of your VuFind installation (or better still, edit a copy of it inside the harvest subdirectory of your local settings directory).

You can set up one or more OAI-PMH repositories in the configuration – details are included in comments within the file.

Harvest Workflow

Once OAI-PMH is configured, you can follow these steps to get documents from an OAI-PMH repository into your VuFind index:

It should be possible to automate this process using a top-level script and cron job in order to do a nightly harvest/index operation.

Important notes

You can learn more about VuFind's OAI-PMH functionality in the OAI-PMH Server and Harvest Functionality tutorial video.