Time-frequency Audio Features For Speech-music Classification
2018 Β· Mrinmoy Bhattacharjee, S. R. M. Prasanna, Prithwijit Guha
Abstract
Distinct striation patterns are observed in the spectrograms of speech and music. This motivated us to propose three novel time-frequency features for speech-music classification. These features are extracted in two stages. First, a preset number of prominent spectral peak locations are identified from the spectra of each frame. These important peak locations obtained from each frame are used to form Spectral peak sequences (SPS) for an audio interval. In second stage, these SPS are treated as time series data of frequency locations. The proposed features are extracted as periodicity, average frequency and statistical attributes of these spectral peak sequences. Speech-music categorization is performed by learning binary classifiers on these features. We have experimented with Gaussian mixture models, support vector machine and random forest classifiers. Our proposal is validated on four datasets and benchmarked against three baseline approaches. Experimental results establish the vali
Authors
(none)
Tags
Stats
Related papers
- Music Genre Classification Using Spectral Analysis And Sparse Representation Of The Signals (2018)8.09
- Convolution Channel Separation And Frequency Sub-bands Aggregation For Music Genre Classification (2022)0.00
- Audio Classification Of Low Feature Spectrograms Utilizing Convolutional Neural Networks (2024)5.84
- Spectral And Rhythm Features For Audio Classification With Deep Convolutional Neural Networks (2024)0.00
- Music Genre Classification: A Comparative Analysis Of CNN And Xgboost Approaches With Mel-frequency Cepstral Coefficients And Mel Spectrograms (2024)0.00
- An Investigation Of The Effectiveness Of Phase For Audio Classification (2021)3.58
- Wavelet-filtering Of Symbolic Music Representations For Folk Tune Segmentation And Classification (2025)0.00
- Music Artist Classification With Convolutional Recurrent Neural Networks (2019)11.93