Between Homomorphic Signal Processing And Deep Neural Networks: Constructing Deep Algorithms For Polyphonic Music Transcription
2017 Β· Li Su
Abstract
This paper presents a new approach in understanding how deep neural networks (DNNs) work by applying homomorphic signal processing techniques. Focusing on the task of multi-pitch estimation (MPE), this paper demonstrates the equivalence relation between a generalized cepstrum and a DNN in terms of their structures and functionality. Such an equivalence relation, together with pitch perception theories and the recently established rectified-correlations-on-a-sphere (RECOS) filter analysis, provide an alternative way in explaining the role of the nonlinear activation function and the multi-layer structure, both of which exist in a cepstrum and a DNN. To validate the efficacy of this new approach, a new feature designed in the same fashion is proposed for pitch salience function. The new feature outperforms the one-layer spectrum in the MPE task and, as predicted, it addresses the issue of the missing fundamental effect and also achieves better robustness to noise.
Authors
(none)
Tags
Stats
Related papers
- Deep-learning Architectures For Multi-pitch Estimation: Towards Reliable Evaluation (2022)0.00
- Noise-robust Dsp-assisted Neural Pitch Estimation With Very Low Complexity (2023)5.24
- A Holistic Approach To Polyphonic Music Transcription With Neural Networks (2019)0.00
- Comparing Conventional Pitch Detection Algorithms With A Neural Network Approach (2022)0.00
- Hppnet: Modeling The Harmonic Structure And Pitch Invariance In Piano Transcription (2022)0.00
- Cross-domain Neural Pitch And Periodicity Estimation (2023)4.88
- Modeling Music Modality With A Key-class Invariant Pitch Chroma CNN (2019)0.00
- DEEPF0: End-to-end Fundamental Frequency Estimation For Music And Speech Signals (2021)10.35