Psycholinguistic LIWC and n-gram counts in a corpus of 1145 English and Dutch novels ...

This dataset consists of CSV files with word counts in several corpora: - 694 English language novels from male and female authors classified by authors' sexual orientation (heterosexual, bisexual, homosexual) - 401 bestselling Dutch language novels - 50 novels nominated for Dutch literary prizes Each corpus comes with: - LIWC counts; this file also includes the available metadata for each novel. The English data was created with LIWC 2015. The Dutch data was created with the validated translation of LIWC 2001. - Word counts (unigrams) and bigram counts per novel. All text has been converted t... Mehr ...

Verfasser: Van Cranenburgh, Andreas
Dokumenttyp: dataset
Erscheinungsdatum: 2020
Verlag/Hrsg.: Mendeley
Schlagwörter: Literature / Computational Linguistics / Psycholinguistics
Sprache: unknown
Permalink: https://search.fid-benelux.de/Record/base-28979469
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://dx.doi.org/10.17632/tmp32v54ss