MF-PAM: Accurate Pitch Estimation Through Periodicity Analysis And Multi-level Feature Fusion
2023 Β· Woo-Jin Chung, Doyeon Kim, Soo-Whan Chung, et al.
Abstract
We introduce Multi-level feature Fusion-based Periodicity Analysis Model (MF-PAM), a novel deep learning-based pitch estimation model that accurately estimates pitch trajectory in noisy and reverberant acoustic environments. Our model leverages the periodic characteristics of audio signals and involves two key steps: extracting pitch periodicity using periodic non-periodic convolution (PNP-Conv) blocks and estimating pitch by aggregating multi-level features using a modified bi-directional feature pyramid network (BiFPN). We evaluate our model on speech and music datasets and achieve superior pitch estimation performance compared to state-of-the-art baselines while using fewer model parameters. Our model achieves 99.20 % accuracy in pitch estimation on a clean musical dataset. Overall, our proposed model provides a promising solution for accurate pitch estimation in challenging acoustic environments and has potential applications in audio signal processing.
Authors
(none)
Tags
Stats
Related papers
- DEEPF0: End-to-end Fundamental Frequency Estimation For Music And Speech Signals (2021)10.35
- Cross-domain Neural Pitch And Periodicity Estimation (2023)4.88
- Deep-learning Architectures For Multi-pitch Estimation: Towards Reliable Evaluation (2022)0.00
- Traditional Machine Learning For Pitch Detection (2019)10.85
- Hf0: A Hybrid Pitch Extraction Method For Multimodal Voice (2019)0.00
- MAJL: A Model-agnostic Joint Learning Framework For Music Source Separation And Pitch Estimation (2025)4.52
- Noisy Speech Based Temporal Decomposition To Improve Fundamental Frequency Estimation (2021)5.24
- Real-time Pitch/f0 Detection Using Spectrogram Images And Convolutional Neural Networks (2025)0.00