Discontinuous Constituency and BERT: A Case Study of Dutch ...

In this paper, we set out to quantify the syntactic capacity of BERT in the evaluation regime of non-context-free patterns, as occurring in Dutch. We devise a test suite based on a mildly context-sensitive formalism, from which we derive grammars that capture the linguistic phenomena of control verb nesting and verb raising. The grammars, paired with a small lexicon, provide us with a large collection of naturalistic utterances, annotated with verb-subject pairings, that serve as the evaluation test bed for an attention-based span selection probe. Our results, backed by extensive analysis, sug...

Authors: Kogkalidis, Konstantinos
Wijnholds, Gijs
Document type: Article
Publication date: 2022
Publisher: arXiv
Keywords: Computation and Language cs.CL / Machine Learning cs.LG / FOS: Computer and information sciences
Language: unknown
Permalink: https://search.fid-benelux.de/Record/base-28980398
Data source: BASE; original catalog
Link(s) : https://dx.doi.org/10.48550/arxiv.2203.01063