Data and models for automatic scansion experiment Dutch Song Database

This release contains the data used in an experiment on automatic scansion for historical Dutch song texts. Aside form the data, two models are included in this release as well. One model is essential for running the code that is part of this experiment (model_s); while the other model is an example of an acquired automatic scansion model (best_model). Item descriptions: meertens-meter-songs.zip → collection of 23,197 historic Dutch songs (xml-format). These files (and the gathered meta-data) stems from a collaboration project between the Dutch Song Database and the Digital Library for Dutch L... Mehr ...

Verfasser: Haverals, Wouter
Karsdorp, Folgert
Kestemont, Mike
Dokumenttyp: other
Erscheinungsdatum: 2019
Schlagwörter: scansion / historical Dutch songs / Dutch Song Database / lyrics / meter / historical metrics
Sprache: Niederländisch
Permalink: https://search.fid-benelux.de/Record/base-27078157
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://zenodo.org/record/3243662

This release contains the data used in an experiment on automatic scansion for historical Dutch song texts. Aside form the data, two models are included in this release as well. One model is essential for running the code that is part of this experiment (model_s); while the other model is an example of an acquired automatic scansion model (best_model). Item descriptions: meertens-meter-songs.zip → collection of 23,197 historic Dutch songs (xml-format). These files (and the gathered meta-data) stems from a collaboration project between the Dutch Song Database and the Digital Library for Dutch Literature. All files contain meta-data on the number of beats that is present in individual verse lines. Snippet: <lg> <l id="s1:l1" met="4" type="-+"> Een Meysken op een Rivierken <rhyme label="a" type="m">sadt</rhyme>,</l> <l id="s1:l2" met="2" type="-+"> So schoon zy <rhyme label="b" type="m">was</rhyme>,</l> <l id="s1:l3" met="4" type="-+"> Sy sadt en verbeyde haer soete <rhyme label="c" type="m">Lief</rhyme>,</l> <l id="s1:l4" met="2" type="-+"> Int groene <rhyme label="b" type="m" corresp="#s1:l2">gras</rhyme>.</l> </lg> model_s → model used for syllabification and assignment of lexical stress of (historic) Dutch words. The development of this model was part of a previous project. stress_xml.zip → collection of 23,197 historic Dutch songs (xml-format). These are the same songs a the meertens-meter-songs, yet now their individual words are syllabified and annotated for lexical stress. The songs in this folder are used as input during the training process. Snippet: <l id="s1:l1" met="4" type="-+"> <w token="een"> <s word-stress="1" line-stress="0">een</s> </w> <w token="meysken"> <s word-stress="1" line-stress="0">meys</s> <s word-stress="0" line-stress="0">ken</s> </w> <w token="op"> <s word-stress="1" line-stress="0">op</s> ...