Bandcondinet: Parallel Transformers-based Conditional Popular Music Generation With Multi-view Features
2024 Β· Jing Luo, Xinyu Yang, Dorien Herremans
Abstract
Conditional music generation offers significant advantages in terms of user convenience and control, presenting great potential in AI-generated content research. However, building conditional generative systems for multitrack popular songs presents three primary challenges: insufficient fidelity of input conditions, poor structural modeling, and inadequate inter-track harmony learning in generative models. To address these issues, we propose BandCondiNet, a conditional model based on parallel Transformers, designed to process the multiple music sequences and generate high-quality multitrack samples. Specifically, we propose multi-view features across time and instruments as high-fidelity conditions. Moreover, we propose two specialized modules for BandCondiNet: Structure Enhanced Attention (SEA) to strengthen the musical structure, and Cross-Track Transformer (CTT) to enhance inter-track harmony. We conducted both objective and subjective evaluations on two popular music datasets with
Authors
(none)
Tags
Stats
Related papers
- Midi-sandwich: Multi-model Multi-task Hierarchical Conditional VAE-GAN Networks For Symbolic Single-track Music Generation (2019)0.00
- Music Generation Based On Generative Adversarial Networks With Transformer (2023)0.00
- Rethinking Recurrent Latent Variable Model For Music Composition (2018)7.50
- Editing Music With Melody And Text: Using Controlnet For Diffusion Transformer (2024)5.84
- Samuel: Efficient Vocal-conditioned Music Generation Via Soft Alignment Attention And Latent Diffusion (2025)0.00
- Who Will Top The Charts? Multimodal Music Popularity Prediction Via Adaptive Fusion Of Modality Experts And Temporal Engagement Modeling (2025)0.00
- Conditional Diffusion As Latent Constraints For Controllable Symbolic Music Generation (2025)0.00
- C3net: Compound Conditioned Controlnet For Multimodal Content Generation (2023)5.84