Gram-CTC: Automatic Unit Selection And Target Decomposition For Sequence Labelling
2017 · Hairong Liu, Zhenyao Zhu, Xiangang Li, et al.
Abstract
Most existing sequence labelling models rely on a fixed decomposition of a target sequence into a sequence of basic units. These methods suffer from two major drawbacks: 1) the set of basic units is fixed, such as the set of words, characters or phonemes in speech recognition, and 2) the decomposition of target sequences is fixed. These drawbacks usually result in sub-optimal performance when modeling sequences. In this paper, we extend the popular CTC loss criterion to alleviate these limitations, and propose a new loss function called Gram-CTC. While preserving the advantages of CTC, Gram-CTC automatically learns the best set of basic units (grams), as well as the most suitable decomposition of target sequences. Unlike CTC, Gram-CTC allows the model to output a variable number of characters at each time step, which enables the model to capture longer-term dependencies and improves computational efficiency. We demonstrate that the proposed Gram-CTC improves CTC in terms of both perf
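To make the idea of "target decomposition" concrete, the following is a minimal sketch, not the authors' implementation. It assumes a small hand-picked gram set and a blank symbol `_`; the helper names `decompositions` and `collapse` are hypothetical. It enumerates every way a target string can be split into grams, and shows a CTC-style collapse in which a single time step may emit a multi-character gram.

```python
from typing import List, Optional, Set


def decompositions(target: str, grams: Set[str]) -> List[List[str]]:
    """Return every way to split `target` into consecutive grams from `grams`."""
    if not target:
        return [[]]  # exactly one way to decompose the empty string
    results: List[List[str]] = []
    for end in range(1, len(target) + 1):
        head = target[:end]
        if head in grams:
            for tail in decompositions(target[end:], grams):
                results.append([head] + tail)
    return results


def collapse(path: List[str], blank: str = "_") -> str:
    """CTC-style collapse: merge adjacent repeats, drop blanks, join the grams."""
    out: List[str] = []
    prev: Optional[str] = None
    for label in path:
        if label != prev and label != blank:
            out.append(label)
        prev = label
    return "".join(out)


# Illustrative gram set (an assumption): unigrams plus a few bigrams.
grams = {"h", "e", "l", "o", "he", "ll", "lo"}

for d in decompositions("hello", grams):
    print(d)

# A frame-level path may emit more than one character per step via a bigram.
print(collapse(["he", "he", "_", "ll", "o", "o"]))
```

With character-only units, "hello" has a single decomposition and needs a blank between the two `l` labels; with the bigram `ll` available, a shorter label sequence covers the same target. Per the abstract, Gram-CTC learns which grams and decompositions to prefer rather than fixing them in advance.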
Related papers
- Efficient CTC Regularization Via Coarse Labels For End-to-end Speech Translation (2023)
- Blank Collapse: Compressing CTC Emission For The Faster Decoding (2022)
- Star Temporal Classification: Sequence Classification With Partially Labeled Data (2022)
- Comparison Of Decoding Strategies For CTC Acoustic Models (2017)
- SoftCTC -- Semi-supervised Learning For Text Recognition Using Soft Pseudo-labels (2022)
- CTC-GMM: CTC Guided Modality Matching For Fast And Accurate Streaming Speech Translation (2024)
- Multitask Learning With CTC And Segmental CRF For Speech Recognition (2017)
- Align With Purpose: Optimize Desired Properties In CTC Models With A General Plug-and-play Framework (2023)