Warning: This page has not been updated in over a year and may be outdated or deprecated.

How to index DSpace with VuFind

These are the instructions used by the Naval Postgraduate School in Monterey, California to index DSpace records in VuFind.

Note: These instructions were written for VuFind 2.x or newer; see the separate page for VuFind 1.x.

Steps

OAI must be enabled on the DSpace repository first:

  1. Modify the nginx configuration (nginx.conf) on the DSpace server:
    location /oai/ {
        # Pass the original host and client address through to DSpace
        proxy_set_header X-Forwarded-Host $host;
        proxy_set_header X-Forwarded-Server $host;
        proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;

        proxy_pass http://yourdspacehostname:8080/oai/;
        proxy_redirect http://yourdspacehostname:8080/oai/ http://yourdspacehostname/oai/;

        proxy_buffering off;
        proxy_store off;

        # Timeouts in seconds
        proxy_connect_timeout 120;
        proxy_send_timeout 120;
        proxy_read_timeout 120;
    }

    A comparable configuration in Apache uses mod_proxy. The proxy configuration is only necessary if you cannot open port 8080 to your VuFind instance. If you have no such restriction, use your full DSpace hostname with “:8080” appended and skip the proxy configuration above.

  2. Modify the server.xml for the appropriate DSpace Tomcat instance, inside the <Host> element:
    <Context path="/oai" docBase="/path_to_dspace/webapps/oai" debug="0"
        reloadable="true" cachingAllowed="false"
        allowLinking="true" />
  3. Modify the dspace.conf config file for the appropriate DSpace instance:
    ...
    harvest.includerestricted.oai = true
    harvester.autoStart = true
    ...
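
Before harvesting, it may be worth confirming that the OAI endpoint actually responds. The sketch below is not part of the original instructions: the helper names oai_url and check_oai are hypothetical, and it assumes curl is installed and that the endpoint lives at the same /oai/request path used in oai.ini. Identify is a standard OAI-PMH verb that any compliant repository should answer.

```shell
#!/bin/sh
# Hypothetical helper: build an OAI-PMH request URL from a base URL and a verb.
oai_url() {
    printf '%s?verb=%s\n' "$1" "$2"
}

# Hypothetical helper: request the Identify response from the endpoint.
# A non-zero exit status suggests the endpoint is unreachable or returned
# an HTTP error (curl --fail turns HTTP errors into a failing exit code).
check_oai() {
    curl --fail --silent --show-error "$(oai_url "$1" Identify)"
}

# Usage (not run automatically; adjust the hostname to your setup):
#   check_oai http://yourdspacehostname/oai/request
```

If the call prints an XML document containing an Identify element, the proxy and Tomcat context are probably wired up correctly; an error here is easier to diagnose than a failed harvest.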

Then you may proceed to import the OAI feed into VuFind:

  1. Modify $VUFIND_LOCAL_DIR/harvest/oai.ini as shown in the oai.ini section below
  2. Run these commands:
    cd $VUFIND_HOME/harvest
    php harvest_oai.php
    ./batch-import-xsl.sh DSpace dspace.properties

Required Files

oai.ini

[DSpace]
url=http://yourdspacehostname/oai/request
metadataPrefix=oai_dc
idSearch[]="/^oai:yourdspacehostname:/"
idReplace[]="ir-"
idSearch[]="/\//"
idReplace[]="-"
injectDate="datestamp"
injectId="identifier"
dateGranularity=auto
harvestedIdLog=harvest.log
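
To illustrate what the two idSearch/idReplace pairs above are meant to do, here is a small sketch that mimics them with sed. It assumes the pairs are applied in order and replace every match, which is how PHP's preg_replace treats pattern arrays; the function name rewrite_id and the sample identifier 10945/1234 are made up for illustration and are not taken from VuFind or from any real repository.

```shell
#!/bin/sh
# Hypothetical helper that mimics the oai.ini rewriting rules:
#   1. replace the leading "oai:yourdspacehostname:" prefix with "ir-"
#   2. replace every "/" with "-"
rewrite_id() {
    printf '%s\n' "$1" | sed -e 's/^oai:yourdspacehostname:/ir-/' -e 's#/#-#g'
}

# Sample identifier (made up for illustration)
rewrite_id 'oai:yourdspacehostname:10945/1234'
# prints: ir-10945-1234
```

The point of the rewrite is to give each harvested record a short ID with a recognizable prefix and no slashes, which can otherwise cause trouble in file names and URLs.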
indexing/dspace.1492775368.txt.gz · Last modified: 2017/04/21 11:49 by demiankatz