An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems (3 minutes introduction)

You Zhang (University of Rochester, USA), Ge Zhu (University of Rochester, USA), Fei Jiang (University of Rochester, USA), Zhiyao Duan (University of Rochester, USA)

Spoofing countermeasure (CM) systems are critical in speaker verification; they aim to distinguish spoofing attacks from bona fide speech trials. In practice, however, variability in the acoustic conditions of speech utterances may significantly degrade the performance of CM systems. In this paper, we conduct a cross-dataset study on several state-of-the-art CM systems and observe significant performance degradation compared with their single-dataset performance. Observing differences in the average magnitude spectra of bona fide utterances across the datasets, we hypothesize that channel mismatch among these datasets is one important reason. We then verify this hypothesis by demonstrating a similar degradation when CM systems are trained on the original data but evaluated on channel-shifted data. Finally, we propose several channel-robust strategies (data augmentation, multi-task learning, adversarial learning) for CM systems, and observe a significant performance improvement in cross-dataset experiments.
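
The comparison of average magnitude spectra mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names, the STFT settings, the white-noise stand-in for speech, and the moving-average filter used as a "channel" are all assumptions chosen only to keep the example self-contained.

```python
import numpy as np

def average_magnitude_spectrum(utterances, n_fft=512, hop=256):
    """Average STFT magnitude (in dB) over all frames of all utterances."""
    window = np.hanning(n_fft)
    total = np.zeros(n_fft // 2 + 1)
    n_frames = 0
    for x in utterances:
        for start in range(0, len(x) - n_fft + 1, hop):
            frame = x[start:start + n_fft] * window
            total += np.abs(np.fft.rfft(frame))
            n_frames += 1
    # Small constant avoids log(0); max() guards against empty input.
    return 20.0 * np.log10(total / max(n_frames, 1) + 1e-10)

def simulate_channel(x, kernel_size=8):
    """Hypothetical channel: a moving-average filter that attenuates high frequencies."""
    kernel = np.ones(kernel_size) / kernel_size
    return np.convolve(x, kernel, mode="same")

# Assumption: white noise stands in for bona fide speech from one dataset.
rng = np.random.default_rng(0)
clean = [rng.standard_normal(16000) for _ in range(4)]
shifted = [simulate_channel(x) for x in clean]

spec_clean = average_magnitude_spectrum(clean)
spec_shifted = average_magnitude_spectrum(shifted)

# Per-bin difference in dB; a large gap in some band suggests channel mismatch.
diff = spec_clean - spec_shifted
high_band_gap = diff[len(diff) // 2:].mean()
print(f"Mean gap in the upper half of the band: {high_band_gap:.1f} dB")
```

With real data, the two lists would hold bona fide utterances from two different datasets, and a consistent gap between their average spectra would be the kind of evidence the abstract describes as motivating the channel-mismatch hypothesis.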

InterSpeech 2021
