Voting for the right answer: Adversarial defense for speaker verification <BR>(3 minutes introduction)

Voting for the right answer: Adversarial defense for speaker verification
(3 minutes introduction)

Haibin Wu (Tsinghua University, China), Yang Zhang (Tsinghua University, China), Zhiyong Wu (Tsinghua University, China), Dong Wang (Tsinghua University, China), Hung-yi Lee (National Taiwan University, Taiwan)

Automatic speaker verification (ASV) is a well developed technology for biometric identification, and has been ubiquitous implemented in security-critic applications, such as banking and access control. However, previous works have shown that ASV is under the radar of adversarial attacks, which are very similar to their original counterparts from human’s perception, yet will manipulate the ASV render wrong prediction. Due to the very late emergence of adversarial attacks for ASV, effective countermeasures against them are limited. Given that the security of ASV is of high priority, in this work, we propose the idea of “voting for the right answer” to prevent risky decisions of ASV in blind spot areas, by employing random sampling and voting. Experimental results show that our proposed method improves the robustness against both the limited-knowledge attackers by pulling the adversarial samples out of the blind spots, and the sufficient-knowledge attackers by introducing randomness and increasing the attackers’ budgets.

Search in Audio

Related Recordings

Pairing Weak with Strong: Twin Models for Defending against Adversarial Attack on Speaker Verification
(longer introduction)

Zhiyuan Peng , Xu Li , Tan Lee

Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing
(3 minutes introduction)

Tomi Kinnunen , Andreas Nautsch , Md. Sahidullah , Nicholas Evans , Xin Wang , Massimiliano Todisco , Héctor Delgado , Junichi Yamagishi , Kong Aik Lee

InterSpeech 2021

Voting for the right answer: Adversarial defense for speaker verification (3 minutes introduction)

Search in Audio

Related Recordings

Pairing Weak with Strong: Twin Models for Defending against Adversarial Attack on Speaker Verification (longer introduction)

Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing (3 minutes introduction)

Voting for the right answer: Adversarial defense for speaker verification
(3 minutes introduction)

Pairing Weak with Strong: Twin Models for Defending against Adversarial Attack on Speaker Verification
(longer introduction)

Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing
(3 minutes introduction)