Details
Description
Issue: VuFind does not currently include browsable pages that can be crawled by search engines.
This new feature would allow the generation of a sitemap index and individual sitemap files for submission to the major search engines (conforming to the sitemaps protocol: http://www.sitemaps.org/).
Options:
- Generate static XML files at index time (the contents will not change until the next re-index anyway)
- Dynamically generate by making a call to Solr (this could allow for sitemap generation based on a query/filter if an institution wished to expose a particular subset of their collection)
Protocol Restrictions:
- Each sitemap file can contain a max of 50,000 links
- Each sitemap file must be no larger than 10MB
- Each sitemap index file can group a max of 50,000 individual sitemaps
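As a rough illustration of the static-file option under these restrictions, the following Python sketch splits a list of record URLs into sitemap files and builds a matching index. It is only a sketch: the function names, the `sitemap-N.xml` / `sitemapIndex.xml` file names, and the example base URL are assumptions made for illustration, not part of VuFind.

```python
from xml.sax.saxutils import escape

# Limits taken from the protocol restrictions listed above.
MAX_URLS_PER_SITEMAP = 50_000
MAX_SITEMAPS_PER_INDEX = 50_000
MAX_SITEMAP_BYTES = 10 * 1024 * 1024
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def chunk(items, size):
    """Yield consecutive slices of `items` with at most `size` elements."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def build_sitemap(urls):
    """Return the XML text of one sitemap file listing `urls`."""
    entries = "".join(f"  <url><loc>{escape(u)}</loc></url>\n" for u in urls)
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<urlset xmlns="{SITEMAP_NS}">\n{entries}</urlset>\n'
    )


def build_sitemap_index(sitemap_urls):
    """Return the XML text of a sitemap index pointing at `sitemap_urls`."""
    entries = "".join(
        f"  <sitemap><loc>{escape(u)}</loc></sitemap>\n" for u in sitemap_urls
    )
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        f'<sitemapindex xmlns="{SITEMAP_NS}">\n{entries}</sitemapindex>\n'
    )


def generate(record_urls, base_url, per_file=MAX_URLS_PER_SITEMAP):
    """Return a dict mapping (hypothetical) file names to XML content."""
    if per_file > MAX_URLS_PER_SITEMAP:
        raise ValueError("per_file exceeds the protocol limit")
    files = {}
    for n, group in enumerate(chunk(record_urls, per_file), start=1):
        xml = build_sitemap(group)
        if len(xml.encode("utf-8")) > MAX_SITEMAP_BYTES:
            raise ValueError(f"sitemap {n} exceeds the size limit")
        files[f"sitemap-{n}.xml"] = xml
    if len(files) > MAX_SITEMAPS_PER_INDEX:
        raise ValueError("too many sitemaps for a single index file")
    # The index is built from the sitemap names before it is added itself.
    index = build_sitemap_index([f"{base_url}/{name}" for name in files])
    files["sitemapIndex.xml"] = index
    return files
```

A caller would write each entry of the returned dict to disk at index time. The dynamic option would use the same builders, with the URL list coming from a Solr query instead.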
These could be useful for sites using VuFind to expose digital content like images or videos. It could probably be based on MARC 856 links, but these would require some processing, e.g. checking whether the link points to a local domain and whether the file extension indicates an image (jpg, jp2, png, gif, tiff, etc.) or a video (avi, mov, mp4, flv, etc.).
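The link processing described here could look roughly like the Python sketch below. The function name `classify_link`, the example host names, and the idea of passing in a set of local domains are assumptions for illustration; the extension sets contain only the examples named above and would need extending in practice.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Extension lists copied from the examples in this description (not exhaustive).
IMAGE_EXTENSIONS = {"jpg", "jp2", "png", "gif", "tiff"}
VIDEO_EXTENSIONS = {"avi", "mov", "mp4", "flv"}


def classify_link(url, local_domains):
    """Return "image", "video", or None for a single 856 link.

    `local_domains` is a set of lowercase host names treated as local;
    links to any other host are ignored.
    """
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if host not in local_domains:
        return None
    # Take the extension from the path only, so query strings are ignored.
    extension = PurePosixPath(parsed.path).suffix.lower().lstrip(".")
    if extension in IMAGE_EXTENSIONS:
        return "image"
    if extension in VIDEO_EXTENSIONS:
        return "video"
    return None
```

Matching on file extension is a cheap heuristic. Links that serve media through a script URL without an extension would be missed and would need a different check.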