Tutorial Proposal: End-to-End Speech Translation.

Speech translation is the translation of speech in one language typically to text in another, traditionally accomplished through a combination of automatic speech recognition and machine translation. Speech translation has attracted interest for many years, but the recent successful applications of deep learning to both individual tasks have enabled new opportunities through joint modeling, in what we today call 'end-to-end speech translation.' In this tutorial we will introduce the techniques used in cutting-edge research on speech translation. Starting from the traditional cascaded approach,... Mehr ...

Verfasser: Matteo Negri
Marco Turchi
Jan Niehues
Elizabeth Salesky
Dokumenttyp: conferencePaper
Erscheinungsdatum: 2021
Schlagwörter: Netherlands / Knowmad Institut
Sprache: unknown
Permalink: https://search.fid-benelux.de/Record/base-26811507
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://www.openaccessrepository.it/record/115337

Speech translation is the translation of speech in one language typically to text in another, traditionally accomplished through a combination of automatic speech recognition and machine translation. Speech translation has attracted interest for many years, but the recent successful applications of deep learning to both individual tasks have enabled new opportunities through joint modeling, in what we today call 'end-to-end speech translation.' In this tutorial we will introduce the techniques used in cutting-edge research on speech translation. Starting from the traditional cascaded approach, we will given an overview on data sources and model architectures to achieve state-of-the art performance with end-to-end speech translation for both high- and low-resource languages. In addition, we will discuss methods to evaluate analyze the proposed solutions, as well as the challenges faced when applying speech translation models for real-world applications.