Benchmarking Offensive and Abusive Language in Dutch Tweets

We present an extensive evaluation of different fine-tuned models to detect instances of offensive and abusive language in Dutch across three benchmarks: a standard held-out test, a task-agnostic functional benchmark, and a dynamic test set. We also investigate the use of data cartography to identify high quality training data. Our results show a relatively good quality of the manually annotated data used to train the models while highlighting some critical weakness. We have also found a good portability of trained models along the same language phenomena. As for the data cartography, we have... Mehr ...

Verfasser:	Caselli, Tommaso van der Veen, Hylke
Dokumenttyp:	contributionToPeriodical
Erscheinungsdatum:	2023
Verlag/Hrsg.:	Association for Computational Linguistics ACL Anthology
Sprache:	Englisch
Permalink:	https://search.fid-benelux.de/Record/base-26671543
Datenquelle:	BASE; Originalkatalog
Powered By:	BASE
Link(s) :	https://hdl.handle.net/11370/af254a9f-d179-486f-b2e8-a31012ed3f21

Suche in Bibliothekskatalogen:

	Prüfen Sie die Verfügbarkeit in Ihrer Heimatbibliothek
	Suche deutschlandweit und international (KVK – Karlsruher Virtueller Katalog)
	Suche weltweit im Worldcatworldwide_worldcat

Suche via Google:

Suche via Google

Suche in Google Scholar

Suche in Google Books