Machine learning canonicity in Dutch novels 1800-2000
In a previous post, we introduced a dataset of Dutch novels with textual features and metadata. In this post we explore the dataset with the aim of answering the question to what extent canonicity can be predicted using textual features. We first explore syntactic characteristics, and then train a classifier on lexical frequencies.
Verfasser: | |
---|---|
Dokumenttyp: | other |
Erscheinungsdatum: | 2022 |
Verlag/Hrsg.: |
Koninklijke Bibliotheek
|
Sprache: | Englisch |
Permalink: | https://search.fid-benelux.de/Record/base-29443698 |
Datenquelle: | BASE; Originalkatalog |
Powered By: | BASE |
Link(s) : | https://hdl.handle.net/11370/43df22ff-a8b3-473c-91b7-64ac697153d6 |