Readability Metrics for Machine Translation in Dutch: Google vs. Azure & IBM

This paper introduces a novel method to predict when a Google translation is better than other machine translations (MT) in Dutch. Instead of considering fidelity, this approach considers fluency and readability indicators for when Google ranked best. This research explores an alternative approach in the field of quality estimation. The paper contributes by publishing a dataset with sentences from English to Dutch, with human-made classifications on a best-worst scale. Logistic regression shows a correlation between T-Scan output, such as readability measurements like lemma frequencies, and wh... Mehr ...

Verfasser:	Chaïm van Toledo Marijn Schraagen Friso van Dijk Matthieu Brinkhuis Marco Spruit
Dokumenttyp:	Artikel
Erscheinungsdatum:	2023
Reihe/Periodikum:	Applied Sciences, Vol 13, Iss 7, p 4444 (2023)
Verlag/Hrsg.:	MDPI AG
Schlagwörter:	quality estimation / SQUAD 2.0 / machine translation / English to Dutch quality estimation / Technology / T / Engineering (General). Civil engineering (General) / TA1-2040 / Biology (General) / QH301-705.5 / Physics / QC1-999 / Chemistry / QD1-999
Sprache:	Englisch
Permalink:	https://search.fid-benelux.de/Record/base-28576361
Datenquelle:	BASE; Originalkatalog
Powered By:	BASE
Link(s) :	https://doi.org/10.3390/app13074444