Exploring the Utility of Dutch Question Answering Datasets for Human Resource Contact Centres

Authors: Chaïm van Toledo
Marijn Schraagen
Friso van Dijk
Matthieu Brinkhuis
Marco Spruit
Document type: Article
Publication date: 2022
Series/Journal: Information, Vol 13, Iss 11, p 513 (2022)
Publisher: MDPI AG
Keywords: question and answering / dataset / Squad 2.0 / question and answering dataset creation / question and answering Dutch / human resource dataset / Information technology / T58.5-58.64
Language: English
Permalink: https://search.fid-benelux.de/Record/base-26626582
Data source: BASE; original catalogue
Link(s): https://doi.org/10.3390/info13110513

We explore the use case of question answering (QA) by a contact centre for 130,000 Dutch government employees in the domain of questions about human resources (HR). HR questions can be answered using personnel files or general documentation; the current research focuses on the latter. We created a Dutch HR QA dataset with over 300 questions in the format of the SQuAD 2.0 dataset, which distinguishes between answerable and unanswerable questions. We applied various BERT-based models, either directly or after fine-tuning on the new dataset. Depending on the topic, the F1-scores reached 0.47 for unanswerable questions and 1.0 for answerable questions; however, large variations in scores were observed. We conclude that more data are needed to further improve performance on this task.
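
To illustrate the dataset format the abstract refers to, the sketch below builds one record in the public SQuAD 2.0 JSON layout, with one answerable and one unanswerable question, and computes a simplified token-level F1. The Dutch context and questions are invented for illustration and are not taken from the authors' dataset. The helper `token_f1` is a hypothetical name that omits the punctuation and article normalisation of the official SQuAD evaluation script, so it should not be read as the authors' evaluation code.

```python
from collections import Counter

# Invented example text (an assumption for illustration, not from the paper's dataset).
context = "Medewerkers hebben recht op 20 vakantiedagen per jaar."
answer = "20 vakantiedagen"

# One record in the SQuAD 2.0 layout: unanswerable questions carry
# is_impossible=True and an empty answers list.
record = {
    "version": "v2.0",
    "data": [{
        "title": "Verlof",
        "paragraphs": [{
            "context": context,
            "qas": [
                {
                    "id": "q1",
                    "question": "Hoeveel vakantiedagen krijg ik per jaar?",
                    "is_impossible": False,
                    # answer_start is the character offset of the answer in the context.
                    "answers": [{"text": answer, "answer_start": context.index(answer)}],
                },
                {
                    "id": "q2",
                    "question": "Wanneer wordt mijn salaris uitbetaald?",
                    "is_impossible": True,
                    "answers": [],
                },
            ],
        }],
    }],
}


def token_f1(prediction: str, truth: str) -> float:
    """Simplified token-overlap F1 (lowercasing and whitespace splitting only)."""
    pred_tokens = prediction.lower().split()
    true_tokens = truth.lower().split()
    # For unanswerable questions the gold answer is empty: the score is 1.0
    # only if the model also predicts an empty string.
    if not pred_tokens or not true_tokens:
        return float(pred_tokens == true_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(true_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(true_tokens)
    return 2 * precision * recall / (precision + recall)
```

Under this scoring, a model that abstains on an unanswerable question gets full credit, while any non-empty prediction for it scores zero, which is why answerable and unanswerable questions are usually reported separately.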