Synthetic Speech Classification: IEEE Signal Processing Cup 2022 Challenge
2024 Β· Mahieyin Rahmun, Rafat Hasan Khan, Tanjim Taharat Aurpa, et al.
Abstract
The aim of this project is to implement and design arobust synthetic speech classifier for the IEEE Signal ProcessingCup 2022 challenge. Here, we learn a synthetic speech attributionmodel using the speech generated from various text-to-speech(TTS) algorithms as well as unknown TTS algorithms. Weexperiment with both the classical machine learning methodssuch as support vector machine, Gaussian mixture model, anddeep learning based methods such as ResNet, VGG16, and twoshallow end-to-end networks. We observe that deep learningbased methods with raw data demonstrate the best performance.
Authors
(none)
Tags
Stats
Related papers
- Syn-att: Synthetic Speech Attribution Via Semi-supervised Unknown Multi-class Ensemble Of Cnns (2023)0.00
- Using Deep Learning Techniques And Inferential Speech Statistics For AI Synthesised Speech Recognition (2021)0.00
- An End-to-end Multi-module Audio Deepfake Generation System For ADD Challenge 2023 (2023)0.00
- Tts-by-tts 2: Data-selective Augmentation For Neural Speech Synthesis Using Ranking Support Vector Machine With Variational Autoencoder (2022)4.52
- On The Problem Of Text-to-speech Model Selection For Synthetic Data Generation In Automatic Speech Recognition (2024)4.52
- Lightweight Model Attribution And Detection Of Synthetic Speech Via Audio Residual Fingerprints (2024)0.00
- Autospeech 2020: The Second Automated Machine Learning Challenge For Speech Classification (2020)3.58
- Generating Synthetic Audio Data For Attention-based Speech Recognition Systems (2019)12.68