[VUFIND-169] Wikipedia: wrong author displayed Created: 11/Nov/09  Updated: 16/Nov/12  Resolved: 16/Nov/12

Status: Resolved
Project: VuFind®
Components: Author
Affects versions: None
Fix versions: 2.0RC1

Type: Improvement Priority: Minor
Reporter: Demian Katz Assignee: Unassigned
Resolution: Fixed Votes: 3
Labels: None
Remaining Estimate: Not Specified
Time Spent: Not Specified
Original estimate: Not Specified


 Description   
The Wikipedia module (see Author/Home page) doesn't currently account for birth/death dates attached to author names, so it sometimes brings up the biography of the wrong person. We should investigate whether it is feasible to parse dates and avoid false positive matches.

 Comments   
Comment by Demian Katz [ 22/Sep/10 ]
Mark Triggs from the National Library of Australia has shared their Wikipedia code here:

http://gist.github.com/590822

This adds some extra smarts, like dealing with birth/death dates and filtering out authors without words like "author" in their entries.

Unfortunately, it also adds a new PEAR dependency (http://pear.php.net/package/Text_Wiki_Mediawiki/).

We should consider merging this with the current trunk Wikipedia code; perhaps this would also justify a new [Wikipedia] section in config.ini to allow configuration of some of the advanced features. The possibility of whitelisting/blacklisting certain name has also been mentioned and might be another useful addition to configuration files.
Comment by Eoghan Ó Carragáin [ 16/Jul/12 ]
The VUFIND-629 patch (for the 1.x branch) and the VUFIND-622 patch (for the 2.x alpha branch) use VIAF and LCCNs from authority data to identify the correct wikipedia article.
Comment by Demian Katz [ 16/Nov/12 ]
VUFIND-629 does a good job of solving this problem, so I'm going to close this ticket. If somebody wants to try to port the old NLA code to VuFind 2.0, feel free to open a new ticket with a patch... but I don't think that's likely to be a high priority for anyone.
Generated at Sat Apr 20 14:24:36 UTC 2024 using Jira 1001.0.0-SNAPSHOT#100250-rev:2b88e55752dc82be8616a67bc2b73a87c8e22b48.