Latent Normalizing Flows For Discrete Sequences
2019 · Zachary M. Ziegler, Alexander M. Rush
Abstract
Normalizing flows are a powerful class of generative models for continuous random variables, showing both strong model flexibility and the potential for non-autoregressive generation. These benefits are also desired when modeling discrete random variables such as text, but directly applying normalizing flows to discrete sequences poses significant additional challenges. We propose a VAE-based generative model which jointly learns a normalizing flow-based distribution in the latent space and a stochastic mapping to an observed discrete space. In this setting, we find that it is crucial for the flow-based distribution to be highly multimodal. To capture this property, we propose several normalizing flow architectures to maximize model flexibility. Experiments consider common discrete sequence tasks of character-level language modeling and polyphonic music generation. Our results indicate that an autoregressive flow-based model can match the performance of a comparable autoregressive base
Related papers
- Generative Modeling For Low Dimensional Speech Attributes With Neural Spline Flows (2022)
- Flow-TSVAD: Target-speaker Voice Activity Detection Via Latent Flow Matching (2024)
- Using VAEs And Normalizing Flows For One-shot Text-to-speech Synthesis Of Expressive Speech (2019)
- FlowVocoder: A Small Footprint Neural Vocoder Based Normalizing Flow For Speech Synthesis (2021)
- Predicting Phoneme-level Prosody Latents Using AR And Flow-based Prior Networks For Expressive Speech Synthesis (2022)
- Text-free Non-parallel Many-to-many Voice Conversion Using Normalising Flows (2022)
- Generative Pre-training For Speech With Flow Matching (2023)
- Multimodal Latent Language Modeling With Next-token Diffusion (2024)