Machine learning canonicity in Dutch novels 1800-2000

In a previous post, we introduced a dataset of Dutch novels with textual features and metadata. In this post we explore the dataset with the aim of answering the question to what extent canonicity can be predicted using textual features. We first explore syntactic characteristics, and then train a classifier on lexical frequencies.

Verfasser: van Cranenburgh, Andreas
Dokumenttyp: other
Erscheinungsdatum: 2022
Verlag/Hrsg.: Koninklijke Bibliotheek
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-29443698
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://hdl.handle.net/11370/43df22ff-a8b3-473c-91b7-64ac697153d6