Interspeech 2021

Speech Synthesis: Prosody Modeling II

Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input
(Oral presentation)

Brooke Stephenson (GIPSA-lab (UMR 5216), France), Thomas Hueber (GIPSA-lab (UMR 5216), France), Laurent Girin (GIPSA-lab (UMR 5216), France), Laurent Besacier (LIG (UMR 5217), France)

Exploring emotional prototypes in a high dimensional TTS latent space
(Oral presentation)

Pol van Rijn (MPI for Empirical Aesthetics, Germany), Silvan Mertes (Universität Augsburg, Germany), Dominik Schiller (Universität Augsburg, Germany), Peter M.C. Harrison (MPI for Empirical Aesthetics, Germany), Pauline Larrouy-Maestri (MPI for Empirical Aesthetics, Germany), Elisabeth André (Universität Augsburg, Germany), Nori Jacoby (MPI for Empirical Aesthetics, Germany)

ADEPT: A Dataset for Evaluating Prosody Transfer
(Oral presentation)

Alexandra Torresquintero (Papercup Technologies, UK), Tian Huey Teh (Papercup Technologies, UK), Christopher G.R. Wallis (Papercup Technologies, UK), Marlene Staib (Papercup Technologies, UK), Devang S. Ram Mohan (Papercup Technologies, UK), Vivian Hu (Papercup Technologies, UK), Lorenzo Foglianti (Papercup Technologies, UK), Jiameng Gao (Papercup Technologies, UK), Simon King (Papercup Technologies, UK)