A Neural Network Based Dutch Part of Speech Tagger

In this paper a Neural Network is designed for Part-of-Speech Tagging of Dutch text. Our approach uses the Corpus Gesproken Nederlands (CGN) consisting of almost 9 million transcribed words of spoken Dutch, divided into 15 different categories. The outcome of the design is a Neural Network with an input window of size 8 (4 words back and 3 words ahead) and a hidden layer of 370 neurons. The words ahead are coded based on the relative frequency of the tags in the training set for the word. Special attention is paid to unknown words (words not in the training set) for which such a relative frequ... Mehr ...

Verfasser: Poel, M.
Boschman, E.
Akker, H.J.A. op den
Dokumenttyp: article in monograph or in proceedings
Erscheinungsdatum: 2008
Verlag/Hrsg.: Twente University Press
Sprache: unknown
Permalink: https://search.fid-benelux.de/Record/base-28627110
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : http://purl.utwente.nl/publications/65237