Textual repetition and variation in the Resolutions of the States General of the Dutch Republic ...

In the NWO REPUBLIC project, we are creating digital access to the corpus of the Resolutions of the States General of the Dutch Republic (1576-1796). This corpus contains the decisions made in the States General each day for a 220 year period. The resolutions were recorded using a standard structure and contain many standard formulations for aspects of the decision making process, including the source of the topic that was decided on (a formal request, a missive, etc.), whether a decision was reached and what that decision was. We discuss different techniques we use to identify formulaic expre... Mehr ...

Verfasser: Koolen, Marijn
Hoekstra, Rik
Koert, Rutger Van
Nijenhuis, Ida
Sluijter, Ronald
Dokumenttyp: Scholarlyarticle
Erscheinungsdatum: 2020
Verlag/Hrsg.: Zenodo
Schlagwörter: information extraction / digital history / text variation / digital humanities
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-28981166
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://dx.doi.org/10.5281/zenodo.3874152

In the NWO REPUBLIC project, we are creating digital access to the corpus of the Resolutions of the States General of the Dutch Republic (1576-1796). This corpus contains the decisions made in the States General each day for a 220 year period. The resolutions were recorded using a standard structure and contain many standard formulations for aspects of the decision making process, including the source of the topic that was decided on (a formal request, a missive, etc.), whether a decision was reached and what that decision was. We discuss different techniques we use to identify formulaic expressions and how we iteratively build a corpus-specific phrase model with which we can identify 1) the dates and attendants of each meeting, which are followed by all the resolutions of that day, 2) resolution boundaries, e.g. where they start and stop in the running text, so we know which text belongs to which resolution, 3) different types of opening phrases that correspond to different types of sources (e.g. requests, ...