Development and structure of the VariaNTS corpus: A spoken Dutch corpus containing talker and linguistic variability

Authors: Arts, Floor
Baskent, Deniz
Tamati, Terrin N.
Document type: Article
Publication date: 2021
Series/Periodical: Arts, F, Baskent, D & Tamati, T N 2021, 'Development and structure of the VariaNTS corpus: A spoken Dutch corpus containing talker and linguistic variability', Speech Communication, vol. 127, pp. 64-72. https://doi.org/10.1016/j.specom.2020.12.006
Keywords: Spoken Dutch / Speech corpus / Talker variability / Linguistic variability / FAMILIAR VOICE RECOGNITION / NORMAL-HEARING / SPEECH RECEPTION / SENTENCE MATERIALS / CLEAR SPEECH / WORD / NOISE / PERCEPTION / INTELLIGIBILITY / LISTENERS
Language: English
Permalink: https://search.fid-benelux.de/Record/base-29027221
Data source: BASE; original catalog
Link(s): https://hdl.handle.net/11370/1442b954-3705-49fd-ada5-03d3faa466c0

Speech perception and spoken word recognition are not only affected by what is being said, but also by who is speaking. Currently, publicly available corpora of spoken Dutch do not offer a wide variety of linguistic materials produced by multiple talkers. The VariaNTS (Variatie in Nederlandse Taal en Sprekers) corpus is a Dutch spoken corpus that was developed to maximize both linguistic and talker variability. It contains 1000 items from 11 linguistic subcategories, recorded by 8 male and 8 female native speakers of standard Dutch. The corpus contains audio recordings, orthographic transcriptions, item-specific details such as word frequencies, neighborhood densities and phonotactic probabilities, and talker details. The VariaNTS corpus aims to provide new materials to be used for broad assessment of speech perception and word recognition in Dutch clinical and academic settings.