Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speech

Abstract With the rise of deep learning, spoken language understanding (SLU) for command-and-control applications such as a voice-controlled virtual assistant can offer reliable hands-free operation to physically disabled individuals. However, due to data scarcity, it is still a challenge to process dysarthric speech. Pre-training (part of) the SLU model with supervised automatic speech recognition (ASR) targets or with self-supervised learning (SSL) may help to overcome a lack of data, but no research has shown which pre-training strategy performs better for SLU on dysarthric speech and to wh... Mehr ...

Verfasser: Wang, Pu
Van hamme, Hugo
Dokumenttyp: Artikel
Erscheinungsdatum: 2023
Reihe/Periodikum: EURASIP Journal on Audio, Speech, and Music Processing ; volume 2023, issue 1 ; ISSN 1687-4722
Verlag/Hrsg.: Springer Science and Business Media LLC
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-27467203
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : http://dx.doi.org/10.1186/s13636-023-00280-z