Awesome Speech Audio

📄Papers 🧭Topics 🔥Trending 🗺️Map 🏆Leaderboards 🎓Learn 🤖Ask AI

⋯More

👥Authors 📚Reading Packs 📊Datasets 🛠️Tools 📰News 📝Blogs ✉️Newsletter 🎯Research Radar 🔖Saved

← authors · overview

Loading author…

Stay Updated

E-Mail Digest 🎯 Research Radar

Submit a paper · Privacy · Terms

© 2026 Awesome Papers.

Yuxuan Wang — most-cited papers & profile · Speech Audio

← authors · overview

Yuxuan Wang

92 papers · 1899 citations · 0 h-index

Beijing Jiaotong University · Harvard University Press

Google Scholar ↗Semantic Scholar ↗OpenAlex ↗

Most-cited papers

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
2018 · 475 citations
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron
2018 · 205 citations
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
2017 · 184 citations
Tacotron: Towards End-to-End Speech Synthesis
2017 · 152 citations
Predicting Expressive Speaking Style From Text In End-To-End Speech Synthesis
2018 · 117 citations
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
2025 · 86 citations
Hierarchical Generative Modeling for Controllable Speech Synthesis
2018 · 45 citations
Uncovering Latent Style Factors for Expressive Speech Synthesis

2017 · 44 citations

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation

2021 · 38 citations

VoiceFixer: Toward General Speech Restoration with Neural Vocoder

2021 · 25 citations

USTC-NELSLIP System Description for DIHARD-III Challenge

2021 · 20 citations

ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders

2020 · 17 citations

SALMONN-omni: A Standalone Speech LLM without Codec Injection for Full-duplex Conversation

2025 · 16 citations

Trainable Frontend For Robust and Far-Field Keyword Spotting

2016 · 12 citations

Efficient Neural Music Generation

2023 · 12 citations

Top co-authors

Yuping Wang · 18 Lu Lu · 13 Jun Zhang · 9 Qiuqiang Kong · 8 Zhuo Chen · 8 Rui Xia · 7 Haohe Liu · 6 Rif A. Saurous · 6 Yang Zhang · 5 Yonghui Wu · 5 Fan Yu · 4 Jiawei Chen · 4

Topics

Text-to-Speech Audio Generation Speech Recognition Audio Understanding Speech Enhancement Multimodal Audio Speech Translation Music Generation Voice Cloning Speaker Analysis