A Comparison of Different NMT Approaches to Low-Resource Dutch-Albanian Machine Translation

Low-resource languages can be understood as languages that are more scarce, less studied, less privileged, less commonly taught and for which there are less resources available (Singh, 2008; Cieri et al., 2016; Magueresse et al., 2020). Natural Language Processing (NLP) research and technology mainly focuses on those languages for which there are large data sets available. To illustrate differences in data availability: there are 6 million Wikipedia articles available for English, 2 million for Dutch, and merely 82 thousand for Albanian. The scarce data issue becomes increasingly apparent when... Mehr ...

Verfasser: Rama, Arbnor
Vanmassenhove, Eva
Dokumenttyp: contributionToPeriodical
Erscheinungsdatum: 2021
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-26673080
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://research.tilburguniversity.edu/en/publications/e9a550fc-0b2c-40a2-9678-379eeff4ace7