About Features Downloads Getting Started Documentation Events Support GitHub

Love VuFind®? Consider becoming a financial supporter. Your support helps build a better VuFind®!

Site Tools


Warning: This page has not been updated in over over a year and may be outdated or deprecated.
indexing:open_data_sources:doaj

DOAJ - Directory of Open Access Journals

Articles

DOAJ not only provides a Directory of open access journals, but also harvests their articles, which in turn are available to be harvested from DOAJ. The data includes not just Dublin Core, but also a richer oai_doaj metadata format (very similar to NLM's). This allows something like

Journal Title: BMGN Low Countries Historical Review    vol: 85    issue: 1

With the first field value (container_title) searchable – to obtain all the articles from this Journal, and the other two acting as filters over this Journal (filter to retrieve all the articles published in the 85th volume or from the first issue in this volume).

It is also possible to harvest only the journal-level data indexed in DOAJ if you do not want the article-level data.

Suggested entry in ./harvest/oai.ini:

[DOAJart]
url = https://doaj.org/oai.article
metadataPrefix = oai_doaj
idSearch[] = "/oai:doaj.org\/article:/"
idReplace[] = "doaj-art-"
injectId = "identifier"
injectDate = "datestamp" 

VuFind® provides default import files to load the harvested records (see files ./import/doaj.properties and ./import/xsl/doaj.xsl).

This service was first discussed in the VUFIND-543 JIRA ticket.

:!: The DOAJ format changed significantly between VuFind® versions 3 and 4. If your harvest is not working, you may need to update your XSLT. See pull request #944 for details.

:!: If you are using a VuFind® version less than 4.0 or a VuFindHarvest version less than 2.3.0, there is a bug that will affect the harvested XML files, and you probably will need to adjust them; this command could help:

sed -i "s/xmlns:oai_doaj/ xmlns:oai_doaj/" local/harvest/DOAJart/*.xml
indexing/open_data_sources/doaj.txt · Last modified: 2023/11/28 19:46 by demiankatz