All mixed up? Finding the optimal feature set for general readability prediction and its application to English and Dutch

Readability research has a long and rich tradition, but there has been too little focus on general readability prediction without targeting a specific audience or text genre. Moreover, though NLP-inspired research has focused on adding more complex readability features, there is still no consensus on which features contribute most to the prediction. In this article, we investigate in close detail the feasibility of constructing a readability prediction system for English and Dutch generic text using supervised machine learning. Based on readability assessments by both experts and a crowd, we im...

Authors: De Clercq, Orphée
Hoste, Veronique
Document type: journal article
Publication date: 2016
Publisher: MIT Press
Keywords: Languages and Literatures / LANGUAGE / COHESION / TEXTS / COH-METRIX / FORMULAS / DIFFICULTY
Language: English
Permalink: https://search.fid-benelux.de/Record/base-26675496
Data source: BASE; original catalog
Link(s) : https://biblio.ugent.be/publication/7175390