Semi-supervised Neural Chord Estimation Based On A Variational Autoencoder With Latent Chord Labels And Features
2020 · Yiming Wu, Tristan Carsault, Eita Nakamura, et al.
Abstract
This paper describes a statistically principled semi-supervised method of automatic chord estimation (ACE) that can make effective use of music signals regardless of the availability of chord annotations. The typical approach to ACE is to train a deep classification model (neural chord estimator) in a supervised manner by using only annotated music signals. In this discriminative approach, prior knowledge about chord label sequences (model output) has scarcely been taken into account. In contrast, we propose a unified generative and discriminative approach in the framework of amortized variational inference. More specifically, we formulate a deep generative model that represents the generative process of chroma vectors (observed variables) from discrete labels and continuous features (latent variables), which are assumed to follow a Markov model favoring self-transitions and a standard Gaussian distribution, respectively. Given chroma vectors as observed data, the posterior distribution [...]
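As a rough illustration of the priors named in the abstract, the sketch below samples a chord-label sequence from a Markov chain that favors self-transitions and draws continuous features from a standard Gaussian. It is not the authors' implementation: the function names, the number of labels (25), the feature dimension (16), the self-transition probability (0.9), and the uniform spread of the remaining probability mass are all assumptions chosen for the example.

```python
import numpy as np


def self_transition_matrix(num_labels, self_prob):
    """Row-stochastic matrix with `self_prob` on the diagonal.

    The remaining mass is spread evenly over the other labels
    (an assumption made for this sketch, not taken from the paper).
    """
    off_diag = (1.0 - self_prob) / (num_labels - 1)
    mat = np.full((num_labels, num_labels), off_diag)
    np.fill_diagonal(mat, self_prob)
    return mat


def sample_prior(num_frames, num_labels=25, feature_dim=16,
                 self_prob=0.9, seed=0):
    """Sample latent labels and features from the two assumed priors."""
    rng = np.random.default_rng(seed)
    trans = self_transition_matrix(num_labels, self_prob)

    # Discrete labels: first frame uniform, then first-order Markov steps.
    labels = np.empty(num_frames, dtype=int)
    labels[0] = rng.integers(num_labels)
    for t in range(1, num_frames):
        labels[t] = rng.choice(num_labels, p=trans[labels[t - 1]])

    # Continuous features: independent standard Gaussian per frame.
    features = rng.standard_normal((num_frames, feature_dim))
    return labels, features


labels, features = sample_prior(num_frames=200)
# Fraction of frames that keep the previous label.
self_rate = np.mean(labels[1:] == labels[:-1])
```

With a high self-transition probability, the sampled label sequence stays on one label for long runs, which mirrors the fact that chords usually last for many frames.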
Related papers
- Multi-step Chord Sequence Prediction Based On Aggregated Multi-scale Encoder-decoder Network (2019) · 0.00
- Learning Style-aware Symbolic Music Representations By Adversarial Autoencoders (2020) · 2.26
- Self-supervised Disentanglement Of Harmonic And Rhythmic Features In Music Audio Signals (2023) · 0.00
- Interpretable Timbre Synthesis Using Variational Autoencoders Regularized On Timbre Descriptors (2023) · 0.00
- Emotion-conditioned Melody Harmonization With Hierarchical Variational Autoencoder (2023) · 5.24
- Semi-supervised Multichannel Speech Enhancement With Variational Autoencoders And Non-negative Matrix Factorization (2018) · 12.25
- Music Fadernets: Controllable Music Generation Based On High-level Features Via Low-level Feature Modelling (2020) · 0.00
- Domain Adversarial Training On Conditional Variational Auto-encoder For Controllable Music Generation (2022) · 0.00