Automatic compound processing: compound splitting and semantic analysis for Afrikaans and Dutch

Authors: Verhoeven, Ben
van Zaanen, Menno
Daelemans, Walter
van Huyssteen, Gerhard
Document type: conferenceObject
Publication date: 2014
Keywords: Linguistics
Language: English
Permalink: https://search.fid-benelux.de/Record/base-27448936
Data source: BASE; original catalog
Link(s): https://hdl.handle.net/10067/1182920151162165141

Abstract: Compounding, the process of combining several simplex words into a complex whole, is a productive process in a wide range of languages. Concatenative compounding in particular, in which the components are glued together, causes problems for, among others, computational tools that rely on a predefined lexicon. Here we present the AuCoPro project, which focuses on compounding in the closely related languages Afrikaans and Dutch. The project consists of subprojects focusing on compound splitting (identifying the boundaries of the components) and compound semantics (identifying semantic relations between the components). We describe the datasets developed in the project and present results showing their effectiveness.