When Simple n-gram Models Outperform Syntactic Approaches: Discriminating between Dutch and Flemish

In this paper we present the results of our participation in the Discriminating between Dutch and Flemish in Subtitles VarDial 2018 shared task. We try techniques proven to work well for discriminating between language varieties as well as explore the potential of using syntactic features, i.e. hierarchical syntactic subtrees. We experiment with different combinations of features. Discriminating between these two languages turned out to be a very hard task, not only for a machine: human performance is only around 0.51 F1 score; our best system is still a simple Naive Bayes model with word unig... Mehr ...

Verfasser:	Kroon, Martin Medvedeva, Masha Plank, Barbara
Dokumenttyp:	contributionToPeriodical
Erscheinungsdatum:	2018
Verlag/Hrsg.:	Association for Computational Linguistics
Sprache:	Englisch
Permalink:	https://search.fid-benelux.de/Record/base-27025485
Datenquelle:	BASE; Originalkatalog
Powered By:	BASE
Link(s) :	https://pure.itu.dk/portal/en/publications/when-simple-ngram-models-outperform-syntactic-approaches-discriminating-between-dutch-and-flemish(a1881a58-c2f1-49ae-b21c-179f0854feab).html

Suche in Bibliothekskatalogen:

	Prüfen Sie die Verfügbarkeit in Ihrer Heimatbibliothek
	Suche deutschlandweit und international (KVK – Karlsruher Virtueller Katalog)
	Suche weltweit im Worldcatworldwide_worldcat

Suche via Google:

Suche via Google

Suche in Google Scholar

Suche in Google Books