Detecting Wikipedia articles strongly based on single library collections

In this post I illustrate an approach to detect Wikipedia articles whose contents are fully or largely based on content from a single online source, such as a full-text digitized newspaper archive or a digital text library. Using Dutch Wikipedia I track down 247 articles that owe their existence to Delpher and DBNL, two full-text collections operated by the KB, the national library of the Netherlands. This approach might be relevant for GLAMs that have digital text collections used by the Wikipedia community for writing articles. ; This article is also available at * Github : https://github.co... Mehr ...

Verfasser: Olaf Janssen
Dokumenttyp: technicalDocumentation
Erscheinungsdatum: 2020
Schlagwörter: Wikipedia / Delpher / DBNL / uptake / GLAMwiki / KB / national library of the Netherlands / reuse / digital heritage / Koninklijke Bibliotheek / Olaf Janssen
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-26848031
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://zenodo.org/record/7433549

In this post I illustrate an approach to detect Wikipedia articles whose contents are fully or largely based on content from a single online source, such as a full-text digitized newspaper archive or a digital text library. Using Dutch Wikipedia I track down 247 articles that owe their existence to Delpher and DBNL, two full-text collections operated by the KB, the national library of the Netherlands. This approach might be relevant for GLAMs that have digital text collections used by the Wikipedia community for writing articles. ; This article is also available at * Github : https://github.com/KBNLwikimedia/KB-Wiki-Stats-Graphs/blob/master/stories/Detecting%20Wikipedia%20articles%20strongly%20based%20on%20single%20library%20collections.md * Wikimedia Commons : https://commons.wikimedia.org/wiki/File:Detecting_Wikipedia_articles_strongly_based_on_single_library_collections_-_Olaf_Janssen,_KB,_21_May_2020.pdf