Neuralogram: A Deep Neural Network Based Representation For Audio Signals
2019 · Prateek Verma, Chris Chafe, Jonathan Berger
Abstract
We propose the Neuralogram -- a deep neural network based representation for understanding audio signals which, as the name suggests, transforms an audio signal into a dense, compact representation based on embeddings learned via a neural architecture. Through a series of probing signals, we show how our representation can encapsulate pitch, timbre, and rhythm-based information, among other attributes. This representation suggests a method for revealing meaningful relationships in arbitrarily long audio signals that are not readily represented by existing algorithms. It has potential for numerous applications, including audio understanding, music recommendation, and metadata extraction.
Related papers
- Audio Spectrogram Representations For Processing With Convolutional Neural Networks (2017)
- Audio Time-scale Modification With Temporal Compressing Networks (2022)
- Melnet: A Generative Model For Audio In The Frequency Domain (2019)
- An Investigation Of The Reconstruction Capacity Of Stacked Convolutional Autoencoders For Log-mel-spectrograms (2023)
- Audioformer: Audio Transformer Learns Audio Feature Representations From Discrete Acoustic Codes (2023)
- Audio Concept Classification With Hierarchical Deep Neural Networks (2017)
- The Impact Of Audio Input Representations On Neural Network Based Music Transcription (2020)
- Exploring Single-song Autoencoding Schemes For Audio-based Music Structure Analysis (2021)