Beam Search Decoding Using Manner Of Articulation Detection Knowledge Derived From Connectionist Temporal Classification
2018 Β· Pradeep Rangan, Sreenivasa Rao K
Abstract
Manner of articulation detection using deep neural networks require a priori knowledge of the attribute discriminative features or the decent phoneme alignments. However generating an appropriate phoneme alignment is complex and its performance depends on the choice of optimal number of senones, Gaussians, etc. In the first part of our work, we exploit the manner of articulation detection using connectionist temporal classification (CTC) which doesn't need any phoneme alignment. Later we modify the state-of-the-art character based posteriors generated by CTC using the manner of articulation CTC detector. Beam search decoding is performed on the modified posteriors and it's impact on open source datasets such as AN4 and LibriSpeech is observed.
Authors
(none)
Tags
Stats
Related papers
- Manner Of Articulation Detection Using Connectionist Temporal Classification To Improve Automatic Speech Recognition Performance (2018)0.00
- Joint Beam Search Integrating CTC, Attention, And Transducer Decoders (2024)5.24
- Back From The Future: Bidirectional CTC Decoding Using Future Information In Speech Recognition (2021)0.00
- Segment-level Vectorized Beam Search Based On Partially Autoregressive Inference (2023)0.00
- A Fully Differentiable Beam Search Decoder (2019)0.00
- Robust Beam Search For Encoder-decoder Attention Based Speech Recognition Without Length Bias (2020)4.52
- Comparison Of Decoding Strategies For CTC Acoustic Models (2017)10.48
- Self-attention Networks For Connectionist Temporal Classification In Speech Recognition (2019)14.55