Automatic compound processing : compound splitting and semantic analysis for Afrikaans and Dutch
Abstract: Compounding, the process of combining several simplex words into a complex whole, is a productive process in a wide range of languages. In particular, concatenative compounding, in which the components are glued together, leads to problems, for instance, in computational tools that rely on a predefined lexicon. Here we present the AuCoPro project, which focuses on compounding in the closely related languages Afrikaans and Dutch. The project consists of subprojects focusing on compound splitting (identifying the boundaries of the components) and compound semantics (identifying semanti... Mehr ...
Verfasser: | |
---|---|
Dokumenttyp: | conferenceObject |
Erscheinungsdatum: | 2014 |
Schlagwörter: | Linguistics |
Sprache: | Englisch |
Permalink: | https://search.fid-benelux.de/Record/base-27448936 |
Datenquelle: | BASE; Originalkatalog |
Powered By: | BASE |
Link(s) : | https://hdl.handle.net/10067/1182920151162165141 |
Abstract: Compounding, the process of combining several simplex words into a complex whole, is a productive process in a wide range of languages. In particular, concatenative compounding, in which the components are glued together, leads to problems, for instance, in computational tools that rely on a predefined lexicon. Here we present the AuCoPro project, which focuses on compounding in the closely related languages Afrikaans and Dutch. The project consists of subprojects focusing on compound splitting (identifying the boundaries of the components) and compound semantics (identifying semantic relations between the components). We describe the developed datasets as well as results showing the effectiveness of the developed datasets.