A Benchmark for Dutch End-to-End Cross-Document Event Coreference Resolution

In this paper, we present a benchmark result for end-to-end cross-document event coreference resolution in Dutch. First, the state of the art of this task in other languages is introduced, as well as currently existing resources and commonly used evaluation metrics. We then build on recently published work to fully explore end-to-end event coreference resolution for the first time in the Dutch language domain. For this purpose, two well-performing transformer-based algorithms for the respective detection and coreference resolution of Dutch textual events are combined in a pipeline architecture... Mehr ...

Verfasser: Loic De Langhe
Thierry Desot
Orphée De Clercq
Veronique Hoste
Dokumenttyp: Text
Erscheinungsdatum: 2023
Verlag/Hrsg.: Multidisciplinary Digital Publishing Institute
Schlagwörter: event coreference resolution / end-to-end / cross-document / Dutch language domain
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-27415779
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://doi.org/10.3390/electronics12040850

In this paper, we present a benchmark result for end-to-end cross-document event coreference resolution in Dutch. First, the state of the art of this task in other languages is introduced, as well as currently existing resources and commonly used evaluation metrics. We then build on recently published work to fully explore end-to-end event coreference resolution for the first time in the Dutch language domain. For this purpose, two well-performing transformer-based algorithms for the respective detection and coreference resolution of Dutch textual events are combined in a pipeline architecture and compared to baseline scores relying on feature-based methods. The results are promising and comparable to similar studies in higher-resourced languages; however, they also reveal that in this specific NLP domain, much work remains to be done. In order to gain more insights, an in-depth analysis of the two pipeline components is carried out to highlight and overcome possible shortcoming of the current approach and provide suggestions for future work.