Midi-sandwich: Multi-model Multi-task Hierarchical Conditional VAE-GAN Networks For Symbolic Single-track Music Generation
2019 Β· Xia Liang, Junmin Wu, Yan Yin
Abstract
Most existing neural network models for music generation explore how to generate music bars, then directly splice the music bars into a song. However, these methods do not explore the relationship between the bars, and the connected song as a whole has no musical form structure and sense of musical direction. To address this issue, we propose a Multi-model Multi-task Hierarchical Conditional VAE-GAN (Variational Autoencoder-Generative adversarial networks) networks, named MIDI-Sandwich, which combines musical knowledge, such as musical form, tonic, and melodic motion. The MIDI-Sandwich has two submodels: Hierarchical Conditional Variational Autoencoder (HCVAE) and Hierarchical Conditional Generative Adversarial Network (HCGAN). The HCVAE uses hierarchical structure. The underlying layer of HCVAE uses Local Conditional Variational Autoencoder (L-CVAE) to generate a music bar which is pre-specified by the First and Last Notes (FLN). The upper layer of HCVAE uses Global Variational Autoen
Authors
(none)
Tags
Stats
Related papers
- Multi-view Midivae: Fusing Track- And Bar-view Representations For Long Multi-track Symbolic Music Generation (2024)0.00
- Polyphonic Music Generation With Sequence Generative Adversarial Networks (2017)2.26
- A Unified Neural Architecture For Instrumental Audio Tasks (2019)0.00
- Music Generation Based On Generative Adversarial Networks With Transformer (2023)0.00
- Rethinking Recurrent Latent Variable Model For Music Composition (2018)7.50
- Conditional Variational Autoencoder To Improve Neural Audio Synthesis For Polyphonic Music Sound (2022)0.00
- Learning Style-aware Symbolic Music Representations By Adversarial Autoencoders (2020)2.26
- The Effect Of Explicit Structure Encoding Of Deep Neural Networks For Symbolic Music Generation (2018)11.49