Search Engine Optimization

In some cases, you may wish to make sure that the contents of your VuFind system are visible beyond your local site. This page provides tips and tools for increasing and controlling search engine visibility.

Sitemaps

This feature was added after the release of VuFind 1.0.1.

Several search engines look for sitemap files that list all of the pages of your site, which helps ensure that no content is missed during indexing. For example, sitemaps are useful in combination with Google Search Console (formerly Google Webmaster Tools).

VuFind comes with a simple tool for generating sitemaps. To use it, follow these steps:

1. Edit the sitemap.ini file under your VuFind installation to specify the location of the generated sitemaps and some other settings. All options are explained by comments within the file.

2. Switch to the util folder under your VuFind installation, and run:

php sitemap.php

This may take some time. When it is complete, your sitemap file(s) will be generated.

You may wish to automate this so that sitemaps are built on a regular basis; see the Automation page for tips on automating VuFind-related tasks.
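As a rough illustration of what a scheduled job might call, the sketch below wraps the two manual steps above (switch to the util folder, run php sitemap.php) in a small Python helper. The install path /usr/local/vufind and the function names are assumptions made for this example; they are not part of VuFind.

```python
import subprocess
from pathlib import Path


def sitemap_command(vufind_home):
    """Return the command and working directory for the sitemap generator.

    vufind_home is the root of the VuFind installation (hypothetical value
    in the usage note below); the generator lives in its util folder.
    """
    util_dir = Path(vufind_home) / "util"
    return ["php", "sitemap.php"], util_dir


def rebuild_sitemaps(vufind_home, run=subprocess.run):
    """Run the generator from the util folder and fail loudly on errors.

    The run argument exists only so the call can be replaced in tests.
    """
    command, util_dir = sitemap_command(vufind_home)
    # check=True raises CalledProcessError if php exits with a non-zero status.
    return run(command, cwd=util_dir, check=True)
```

A scheduler such as cron could then invoke a script that calls rebuild_sitemaps("/usr/local/vufind") at whatever interval suits your indexing schedule.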

Best Practices

Exposing Static / Multi-lingual Content

If you have important static content pages (landing pages, texts about services/institutions, etc.), you may wish to include these in a base sitemap as a complement to the record list built by VuFind's sitemap generator. If your VuFind instance serves content in multiple languages, you may also wish to take advantage of the ?lng= GET parameter to provide multiple language-specific versions of the link in the sitemap. See RelBib's baseSitemap.xml for an example.
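One possible way to produce such a base sitemap is sketched below: it lists each static page once per language by setting the lng GET parameter. The host name, page path, language codes and function names are all hypothetical; only the lng parameter and the sitemaps.org XML namespace come from existing conventions.

```python
import xml.etree.ElementTree as ET
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# XML namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def with_language(url, lang):
    """Return url with its lng GET parameter set to lang."""
    parts = urlsplit(url)
    # Drop any existing lng value, then append the requested one.
    query = [(k, v) for k, v in parse_qsl(parts.query) if k != "lng"]
    query.append(("lng", lang))
    return urlunsplit(parts._replace(query=urlencode(query)))


def build_base_sitemap(page_urls, languages):
    """Build sitemap XML listing every page once per language."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for page in page_urls:
        for lang in languages:
            url = ET.SubElement(urlset, "url")
            ET.SubElement(url, "loc").text = with_language(page, lang)
    return ET.tostring(urlset, encoding="unicode")
```

For instance, build_base_sitemap(["https://vufind.example.org/Content/about"], ["en", "de"]) yields two url entries, one ending in ?lng=en and one in ?lng=de.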

Understand Crawling Budgets

A VuFind site can easily have hundreds of thousands, or even millions, of pages in its sitemaps, depending on the number of records in your index. Search engines will take a significant amount of time to crawl all of these pages, and even more time to detect changes. Be aware that publishing a sitemap will not instantly lead to full visibility of all of your content.
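To get a feel for the scale involved, the following back-of-the-envelope sketch estimates how many sitemap files an index needs and how long a full crawl might take. The 50,000 URL per-file ceiling comes from the sitemaps.org protocol; the crawl rate is a purely hypothetical input, since real rates are chosen by each search engine and vary over time.

```python
import math

# Upper limit on URLs in one sitemap file, per the sitemaps.org protocol.
MAX_URLS_PER_SITEMAP = 50_000


def sitemap_file_count(record_count, urls_per_file=MAX_URLS_PER_SITEMAP):
    """Number of sitemap files needed to list record_count pages."""
    if not 1 <= urls_per_file <= MAX_URLS_PER_SITEMAP:
        raise ValueError("urls_per_file must be between 1 and 50,000")
    return math.ceil(record_count / urls_per_file)


def days_to_crawl(record_count, pages_per_day):
    """Days needed to fetch every page once at an assumed constant rate."""
    if pages_per_day <= 0:
        raise ValueError("pages_per_day must be positive")
    return math.ceil(record_count / pages_per_day)
```

With two million records split into files of 10,000 URLs, sitemap_file_count(2_000_000, 10_000) gives 200 files; at an assumed 20,000 pages per day, days_to_crawl(2_000_000, 20_000) gives 100 days for a single full pass.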

Using Search Engine Tools

Creating a sitemap is only half the battle; the rest is informing search engines about it. Tools like Google's Search Console are important for publishing your sitemaps and managing how your site is crawled.

Technical Details

Early versions of VuFind made use of the standard Solr TermsComponent for extracting identifiers. This is still available as a configurable option, since it is very fast, but it does not account for certain configuration options such as hidden filters and may not properly represent your site. Starting with VuFind 5.1, the default behavior was changed from TermsComponent to a slower but more universally compatible search-based approach. The configuration can be changed via the retrievalMode setting in sitemap.ini.
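The sketch below shows one way to check which retrieval mode a sitemap.ini file selects. It assumes the setting lives in a [Sitemap] section and takes the values "terms" (TermsComponent) or "search" (the search-based default); these names are assumptions, so confirm them against the comments in your own sitemap.ini before relying on them.

```python
import configparser

# Hypothetical excerpt; section name and values are assumptions.
SAMPLE_INI = """
[Sitemap]
; "terms" = fast TermsComponent lookup, ignores hidden filters
; "search" = slower search-based retrieval
retrievalMode = search
"""


def retrieval_mode(ini_text, default="search"):
    """Return the retrievalMode value from ini_text, or default if unset."""
    # strict=False tolerates repeated keys; interpolation=None keeps % literal.
    parser = configparser.ConfigParser(strict=False, interpolation=None)
    parser.optionxform = str  # preserve camelCase option names
    parser.read_string(ini_text)
    value = parser.get("Sitemap", "retrievalMode", fallback=default)
    # PHP-style ini files may wrap values in double quotes.
    return value.strip().strip('"')
```

Calling retrieval_mode(SAMPLE_INI) returns "search"; a file with no [Sitemap] section falls back to the default passed in.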

Staying Informed

Search engine optimization is challenging to maintain, because search engines are constantly changing their rules for crawling and ranking. Google, for example, can update its procedures hundreds of times every year (see History of Google Algorithm Updates for details). News sites like Search Engine Land can provide some help in learning about recent changes and trends.

robots.txt - Recommendations for restricting search engine access to avoid confusing results and/or unnecessary server load.

administration/search_engine_optimization.txt · Last modified: 2020/06/04 15:19 by demiankatz