Asvspoof 5: Design, Collection And Validation Of Resources For Spoofing, Deepfake, And Adversarial Attack Detection Using Crowdsourced Speech

Abstract

ASVspoof 5 is the fifth edition in a series of challenges which promote the study of speech spoofing and deepfake attacks as well as the design of detection solutions. We introduce the ASVspoof 5 database which is generated in a crowdsourced fashion from data collected in diverse acoustic conditions (cf. studio-quality data for earlier ASVspoof databases) and from ~2,000 speakers (cf. ~100 earlier). The database contains attacks generated with 32 different algorithms, also crowdsourced, and optimised to varying degrees using new surrogate detection models. Among them are attacks generated with a mix of legacy and contemporary text-to-speech synthesis and voice conversion models, in addition to adversarial attacks which are incorporated for the first time. ASVspoof 5 protocols comprise seven speaker-disjoint partitions. They include two distinct partitions for the training of different sets of attack models, two more for the development and evaluation of surrogate detection models, and

Asvspoof 5: Design, Collection And Validation Of Resources For Spoofing, Deepfake, And Adversarial Attack Detection Using Crowdsourced Speech

Abstract

Authors

Tags

Stats

Related papers