Challenges in tagging and parsing spoken dialects of Dutch

This paper reports on the construction of a tagged and parsed pilot corpus of the southern Dutch dialects. The corpus aims to facilitate diachronic research into the syntax of Dutch, as its dialects have retained many interesting (morpho)syntactic features which can often be traced back to changes starting in or characteristics retained from older stages of historical Dutch. The discussion mainly focuses on initial test results achieved by applying existing NLP tools which have been developed or optimised for POS tagging and parsing standard Dutch. We report on initial tests on our data with F... Mehr ...

Verfasser: Farasyn, Melissa
Ghyselen, Anne-Sophie
Van Keymeulen, Jacques
Breitbarth, Anne
Dokumenttyp: journalarticle
Erscheinungsdatum: 2022
Schlagwörter: Languages and Literatures / tagging / parsing / dialects / Dutch / corpus / spoken dialects
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-26675701
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://biblio.ugent.be/publication/8759210