PERSA+: A Deep Learning Front-end For Context-agnostic Audio Classification
2021 Β· Lazaros Vrysis, Iordanis Thoidis, Charalampos Dimoulas, et al.
Abstract
Deep learning has been applied to diverse audio semantics tasks, enabling the construction of models that learn hierarchical levels of features from high-dimensional raw data, delivering state-of-the-art performance. But do these algorithms perform similarly in real-world conditions, or just at the benchmark, where their high learning capability assures the complete memorization of the employed datasets? This work presents a deep learning front-end, aiming at discarding detrimental information before entering the modeling stage, bringing the learning process closer to the point, anticipating the development of robust and context-agnostic classification algorithms.
Authors
(none)
Tags
Stats
Related papers
- Audio Concept Classification With Hierarchical Deep Neural Networks (2017)0.00
- Deep Audio Prior (2019)0.00
- Audioformer: Audio Transformer Learns Audio Feature Representations From Discrete Acoustic Codes (2023)0.00
- Audio Language Modeling Using Perceptually-guided Discrete Representations (2022)0.00
- Utilizing Domain Knowledge In End-to-end Audio Processing (2017)0.00
- Perceptual Loss Based Speech Denoising With An Ensemble Of Audio Pattern Recognition And Self-supervised Models (2020)10.21
- A Deep Neural Network For Audio Classification With A Classifier Attention Mechanism (2020)0.00
- Diverse Audio Embeddings -- Bringing Features Back Outperforms CLAP! (2023)0.00