Warning: This page has not been updated in over over a year and may be outdated or deprecated.
indexing:websites
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
indexing:websites [2015/12/14 17:08] – ↷ Page moved and renamed from vufind2:indexing_websites to indexing:websites demiankatz | indexing:websites [2021/08/03 13:49] (current) – demiankatz | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Indexing a Website ====== | ====== Indexing a Website ====== | ||
- | Starting with release 2.1, VuFind can be used to create a website index separate from your main search index. | + | Starting with release 2.1, VuFind can be used to create a website index separate from your main search index. |
===== Getting Started ===== | ===== Getting Started ===== | ||
- | - Make sure that you have a [[..:aperture|full text extraction tool]] installed and configured. | + | - Make sure that you have a [[full_text_tools|full text extraction tool]] installed and configured. |
- | - Enable the website core by editing solr/ | + | - Copy config/ |
- | - [[..: | + | |
- | - Copy config/ | + | |
- Run the import/ | - Run the import/ | ||
- When crawling is done, go to < | - When crawling is done, go to < | ||
+ | |||
+ | (//In very old versions of VuFind -- earlier than release 3.0 -- you will need to enable the website core by editing solr/ | ||
+ | |||
===== Customizing the Web Search ===== | ===== Customizing the Web Search ===== | ||
- | Several things can be modified (with the help of your [[local settings directory]]) to adjust web search behavior and appearance. | + | Several things can be modified (with the help of your [[configuration: |
* You can customize the way web pages are indexed by creating a custom version of import/ | * You can customize the way web pages are indexed by creating a custom version of import/ | ||
Line 22: | Line 23: | ||
* The current webcrawl.php tool works very much by brute force; we may want to build a more intelligent, | * The current webcrawl.php tool works very much by brute force; we may want to build a more intelligent, | ||
+ | |||
+ | ===== Related Video ===== | ||
+ | |||
+ | You can learn more about web indexing through the [[videos: | ||
---- struct data ---- | ---- struct data ---- | ||
+ | properties.Page Owner : | ||
---- | ---- | ||
indexing/websites.txt · Last modified: 2021/08/03 13:49 by demiankatz