InterSpeech 2021

Glottal Stops in Upper Sorbian: a Data-Driven Approach
(Oral presentation)

Ivan Kraljevski (Fraunhofer IKTS, Germany), Maria Paola Bissiri (Università dell’Insubria, Italy), Frank Duckhorn (Fraunhofer IKTS, Germany), Constanze Tschoepe (Fraunhofer IKTS, Germany), Matthias Wolff (Brandenburgische Technische Universität, Germany)
We present a data-driven approach for the quantitative analysis of glottal stops before word-initial vowels in Upper Sorbian, a West Slavic minority language spoken in Germany. Glottal stops are word-boundary markers and their detection can improve the performance of automatic speech recognition and speech synthesis systems. We employed cross-language transfer using an acoustic model in German to develop a forced-alignment method for the phonetic segmentation of a read-speech corpus in Upper Sorbian. The missing phonemic units were created by combining the existing phoneme models. In the forced-alignment procedure, the glottal stops were considered optional in front of word-initial vowels. To investigate the influence of speaker type (males, females, and children) and vowel on the occurrence of glottal stops, binomial regression analysis with a generalized linear mixed model was performed. Results show that children glottalize word-initial vowels more frequently than adults, and that glottal stop occurrences are influenced by vowel quality.