The Kronieken Corpus:an Annotated Collection of Dutch/Flemish Chronicles from 1500–1850

In this paper we present the Kronieken Corpus, a new digital collection of 204 local chronicles, containing almost 24 million words, written in Dutch/Flemish between 1500 and 1850. About half of these texts had not been published before. The manuscripts were photographed in 39 archives and libraries in The Netherlands and Belgium and subsequently transcribed and manually annotated by volunteers. The annotations include named entities and dates, as well as source mentions and attributions. The result is a unique, enriched historical corpus of original hand-written, non-canonical and non-fiction... Mehr ...

Verfasser: Dekker, Theo
Kuijpers, Erika
Lassche, Alie
Lenarduzzi, Carolina
Morante, Roser
Pollmann, Judith
Dokumenttyp: contributionToPeriodical
Erscheinungsdatum: 2024
Verlag/Hrsg.: Association for Computational Linguistics (ACL)
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-27483033
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://research.vu.nl/en/publications/bed32361-ac8a-4391-bc2b-5ae7e5c7ae6b

In this paper we present the Kronieken Corpus, a new digital collection of 204 local chronicles, containing almost 24 million words, written in Dutch/Flemish between 1500 and 1850. About half of these texts had not been published before. The manuscripts were photographed in 39 archives and libraries in The Netherlands and Belgium and subsequently transcribed and manually annotated by volunteers. The annotations include named entities and dates, as well as source mentions and attributions. The result is a unique, enriched historical corpus of original hand-written, non-canonical and non-fictional text by lay people from the early modern period.