Training LDCRF Model On Unsegmented Sequences Using Connectionist Temporal Classification
2016 · Amir Ahooye Atashin, Kamaledin Ghiasi-Shirazi, Ahad Harati
Abstract
Many machine learning problems, such as speech recognition, gesture recognition, and handwriting recognition, are concerned with simultaneous segmentation and labeling of sequence data. The latent-dynamic conditional random field (LDCRF) is a well-known discriminative method that has been successfully used for this task. However, an LDCRF can only be trained on pre-segmented data sequences in which the label of each frame is available a priori. In the realm of neural networks, the invention of connectionist temporal classification (CTC) made it possible to train recurrent neural networks on unsegmented sequences with great success. In this paper, we use CTC to train an LDCRF model on unsegmented sequences. Experimental results on two gesture recognition tasks show that the proposed method outperforms LDCRFs, hidden Markov models, and conditional random fields.
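For context, CTC avoids frame-level labels by summing the probability of every frame-level path that collapses to the target label sequence (merge repeated symbols, then drop blanks). The sketch below illustrates that standard CTC forward recursion in plain Python. It is not the paper's LDCRF training procedure; the function names `collapse`, `ctc_forward`, and `ctc_brute_force`, and the choice of `BLANK = 0`, are assumptions made for illustration only.

```python
from itertools import product

BLANK = 0  # assumed index of the CTC blank symbol (illustrative choice)


def collapse(path):
    """Map a frame-level path to labels: merge repeats, then drop blanks."""
    out, prev = [], None
    for s in path:
        if s != prev and s != BLANK:
            out.append(s)
        prev = s
    return out


def ctc_forward(probs, labels):
    """Probability of `labels`, summed over all frame-level alignments.

    probs[t][k] is the probability of symbol k at frame t.
    """
    # Extended target with blanks interleaved: [b, l1, b, l2, ..., b]
    ext = [BLANK]
    for label in labels:
        ext += [label, BLANK]
    S, T = len(ext), len(probs)

    # A path may start in the leading blank or in the first label.
    alpha = [0.0] * S
    alpha[0] = probs[0][ext[0]]
    if S > 1:
        alpha[1] = probs[0][ext[1]]

    for t in range(1, T):
        new = [0.0] * S
        for s in range(S):
            total = alpha[s]
            if s >= 1:
                total += alpha[s - 1]
            # Skipping the blank is allowed only between two different labels.
            if s >= 2 and ext[s] != BLANK and ext[s] != ext[s - 2]:
                total += alpha[s - 2]
            new[s] = total * probs[t][ext[s]]
        alpha = new

    # A path may end in the last label or in the trailing blank.
    return alpha[-1] + (alpha[-2] if S > 1 else 0.0)


def ctc_brute_force(probs, labels):
    """Reference value: enumerate every path and keep those that collapse to `labels`."""
    total = 0.0
    num_symbols = len(probs[0])
    for path in product(range(num_symbols), repeat=len(probs)):
        if collapse(path) == list(labels):
            p = 1.0
            for t, s in enumerate(path):
                p *= probs[t][s]
            total += p
    return total
```

The brute-force enumeration is exponential in the number of frames and is included only to check the recursion on tiny inputs; the forward recursion itself runs in time proportional to the number of frames times the length of the extended target.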
Related papers
- Multitask Learning With CTC And Segmental CRF For Speech Recognition (2017)
- A Study Of All-convolutional Encoders For Connectionist Temporal Classification (2017)
- Variational Connectionist Temporal Classification For Order-preserving Sequence Modeling (2023)
- Residual Convolutional CTC Networks For Automatic Speech Recognition (2017)
- CR-CTC: Consistency Regularization On CTC For Improved Speech Recognition (2024)
- Star Temporal Classification: Sequence Classification With Partially Labeled Data (2022)
- Self-attention Networks For Connectionist Temporal Classification In Speech Recognition (2019)
- Context-aware Selective Label Smoothing For Calibrating Sequence Recognition Model (2023)