A benchmark for dutch end-to-end cross-document event coreference resolution

In this paper, we present a benchmark result for end-to-end cross-document event coreference resolution in Dutch. First, the state of the art of this task in other languages is introduced, as well as currently existing resources and commonly used evaluation metrics. We then build on recently published work to fully explore end-to-end event coreference resolution for the first time in the Dutch language domain. For this purpose, two well-performing transformer-based algorithms for the respective detection and coreference resolution of Dutch textual events are combined in a pipeline architecture... Mehr ...

Verfasser: De Langhe, Loic
Desot, Thierry
De Clercq, Orphée
Hoste, Veronique
Dokumenttyp: journalarticle
Erscheinungsdatum: 2023
Schlagwörter: Technology and Engineering / Languages and Literatures / event coreference resolution / end-to-end / cross-document / Dutch language domain / NEWS / EXTRACTION
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-27450614
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://biblio.ugent.be/publication/01H4110F1NQ0P01WSZSB6ZX71M

In this paper, we present a benchmark result for end-to-end cross-document event coreference resolution in Dutch. First, the state of the art of this task in other languages is introduced, as well as currently existing resources and commonly used evaluation metrics. We then build on recently published work to fully explore end-to-end event coreference resolution for the first time in the Dutch language domain. For this purpose, two well-performing transformer-based algorithms for the respective detection and coreference resolution of Dutch textual events are combined in a pipeline architecture and compared to baseline scores relying on feature-based methods. The results are promising and comparable to similar studies in higher-resourced languages; however, they also reveal that in this specific NLP domain, much work remains to be done. In order to gain more insights, an in-depth analysis of the two pipeline components is carried out to highlight and overcome possible shortcoming of the current approach and provide suggestions for future work.