Dutch Historical Word2Vec models

Introduction
This repository contains Word2Vec models trained on Dutch historical newspaper data covering the period from 1840 to 1890. The models were created as part of a Research-in-Residence at the Dutch National Library. During my residency, I trained language models on specific subsections of the newspaper corpus to explore bias over time and by place or political leaning. For more about this project, see the introductory blog post.

Code
The code used for training the models is available on GitHub. See the README for further instructions.

Warning: the raw text da...

Authors: Kaspar Beelen; Mirjam Cuper
Document type: other
Publication date: 2021
Keywords: Digital Heritage / word2vec / digital newspapers
Language: Dutch
Permalink: https://search.fid-benelux.de/Record/base-27466331
Data source: BASE; original catalog
Link(s): https://zenodo.org/record/4892800