Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
2023 · Krishna Subramani, Jean-Marc Valin, Jan Buethe, et al.
Abstract
Pitch estimation is an essential step of many speech processing algorithms, including speech coding, synthesis, and enhancement. Recently, pitch estimators based on deep neural networks (DNNs) have been outperforming well-established DSP-based techniques. Unfortunately, these new estimators can be impractical to deploy in real-time systems, both because of their relatively high complexity and because some require significant lookahead. We show that a hybrid estimator using a small DNN with traditional DSP-based features can match or exceed the performance of pure DNN-based models, with a complexity and algorithmic delay comparable to traditional DSP-based algorithms. We further demonstrate that this hybrid approach can provide benefits for a neural vocoding task.
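The abstract does not say which DSP-based features the hybrid estimator uses. As a rough illustration of what a traditional DSP-based pitch feature can look like, the sketch below computes a normalized autocorrelation over candidate lags and picks the strongest one. The function names, the lag range, and the use of this particular feature are assumptions for illustration, not the authors' method. In a hybrid design, a feature vector like `scores` would be passed to a small neural network instead of being peak-picked directly.

```python
import math

def normalized_autocorrelation(frame, min_lag, max_lag):
    """Return {lag: normalized correlation} for each lag in [min_lag, max_lag].

    Hypothetical helper: one classic DSP pitch feature, assumed here for
    illustration only.
    """
    scores = {}
    for lag in range(min_lag, max_lag + 1):
        x = frame[lag:]                 # signal delayed by `lag` samples
        y = frame[:len(frame) - lag]    # reference segment of equal length
        num = sum(a * b for a, b in zip(x, y))
        den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
        scores[lag] = num / den if den > 0 else 0.0
    return scores

def estimate_pitch_hz(frame, sample_rate, f_min=60.0, f_max=400.0):
    """Pick the lag with the highest normalized correlation.

    Returns (pitch in Hz, correlation at that lag). A wide lag range can
    produce octave errors on periodic signals, since multiples of the true
    period correlate equally well.
    """
    min_lag = int(sample_rate / f_max)
    max_lag = int(sample_rate / f_min)
    scores = normalized_autocorrelation(frame, min_lag, max_lag)
    best_lag = max(scores, key=scores.get)
    return sample_rate / best_lag, scores[best_lag]

if __name__ == "__main__":
    sr = 16000
    # 40 ms of a synthetic 200 Hz tone stands in for a voiced speech frame.
    frame = [math.sin(2 * math.pi * 200.0 * n / sr) for n in range(640)]
    f0, strength = estimate_pitch_hz(frame, sr, f_min=150.0, f_max=400.0)
    print(f0, strength)
```

The search range in the usage example is narrowed to 150-400 Hz so that only one multiple of the tone's period falls inside it.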
Related papers
- Cross-domain Neural Pitch And Periodicity Estimation (2023) 4.88
- Between Homomorphic Signal Processing And Deep Neural Networks: Constructing Deep Algorithms For Polyphonic Music Transcription (2017) 0.00
- Human Voice Pitch Estimation: A Convolutional Network With Auto-labeled And Synthetic Data (2023) 0.00
- Ultra-lightweight Neural Differential DSP Vocoder For High Quality Speech Synthesis (2024) 5.24
- DeepF0: End-to-end Fundamental Frequency Estimation For Music And Speech Signals (2021) 10.35
- Puffin: Pitch-synchronous Neural Waveform Generation For Fullband Speech On Modest Devices (2022) 3.58
- Deep-learning Architectures For Multi-pitch Estimation: Towards Reliable Evaluation (2022) 0.00
- Unsupervised Harmonic Parameter Estimation Using Differentiable DSP And Spectral Optimal Transport (2023) 5.84