Word2Vec Models Dutch Newspapers ...

Word Embedding models trained on 6 national Dutch newspapers. We use the Gensim implementation of Word2Vec to train four embedding models per newspaper, each representing one decade between 1950 and 1990. The models were trained using C-BOW with hierarchical softmax, with a dimensionality of 300, a minimal word count and context of 5, and downsampling of 10 -5 These models belong to the article: Using Word Embeddings to Examine Gender Bias in Dutch Newspapers, 1950-1990 ...

Verfasser: Wevers, Melvin
Dokumenttyp: dataset
Erscheinungsdatum: 2019
Verlag/Hrsg.: Zenodo
Schlagwörter: word2vec / newspapers / word embedding
Sprache: Niederländisch
Permalink: https://search.fid-benelux.de/Record/base-28981049
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://dx.doi.org/10.5281/zenodo.3237379