On A Novel Training Algorithm For Sequence-to-sequence Predictive Recurrent Networks
2021 · Boris Rubinstein
Abstract
Neural networks that map sequences to sequences (seq2seq) have led to significant progress in machine translation and speech recognition. Their traditional architecture includes two recurrent networks (RNs) followed by a linear predictor. In this manuscript we analyze the corresponding algorithm and show that the parameters of the RNs of a well-trained predictive network are not independent of each other. Their dependence can be used to significantly improve the network's effectiveness. The traditional seq2seq algorithms require short-term memory of a size proportional to the predicted sequence length. This requirement is quite difficult to implement in a neuroscience context. We present a novel memoryless algorithm for seq2seq predictive networks and compare it to the traditional one in the context of time series prediction. We show that the new algorithm is more robust and makes predictions with higher accuracy than the traditional one.
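To make the traditional architecture in the abstract concrete, here is a minimal sketch of two recurrent networks followed by a linear predictor, applied to time series prediction. It is an illustration under assumptions, not the manuscript's implementation: the plain tanh cell, the random untrained weights, and the names `rnn_step`, `seq2seq_predict`, and `init_params` are all hypothetical. The novel memoryless algorithm is not sketched here, since the abstract does not specify it.

```python
import numpy as np

def rnn_step(W, U, b, h, x):
    """One step of a plain tanh recurrent cell (an assumed cell type)."""
    return np.tanh(W @ h + U @ x + b)

def seq2seq_predict(params, inputs, n_out):
    """Encode `inputs` of shape (T, d), then decode `n_out` steps."""
    enc, dec, (A, c) = params
    h = np.zeros(enc[0].shape[0])
    for x in inputs:                  # first RN: consume the input sequence
        h = rnn_step(*enc, h, x)
    x = inputs[-1]                    # seed the decoder with the last input
    outputs = []
    for _ in range(n_out):
        h = rnn_step(*dec, h, x)      # second RN: advance the hidden state
        x = A @ h + c                 # linear predictor maps state to output
        outputs.append(x)             # buffer length grows with n_out
    return np.stack(outputs)

def init_params(d, n, seed=0):
    """Random, untrained weights for input size d and hidden size n."""
    rng = np.random.default_rng(seed)

    def cell():
        return (rng.normal(0, 0.1, (n, n)),   # recurrent weights W
                rng.normal(0, 0.1, (n, d)),   # input weights U
                np.zeros(n))                  # bias b

    return cell(), cell(), (rng.normal(0, 0.1, (d, n)), np.zeros(d))

# Usage: predict 5 future points from 20 samples of a sine wave.
series = np.sin(np.linspace(0, 6, 20))[:, None]
preds = seq2seq_predict(init_params(1, 8), series, 5)
```

Each prediction is fed back as the next decoder input, and the `outputs` buffer holds one entry per predicted step. That is one way to picture the memory requirement the abstract mentions; the manuscript's own analysis may frame it differently.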
Related papers
- Learning The Sequential Temporal Information With Recurrent Neural Networks (2018) · 0.00
- Sequence Segmentation Using Joint RNN And Structured Prediction Models (2016) · 7.81
- Residual Memory Networks: Feed-forward Approach To Learn Long Temporal Dependencies (2018) · 7.16
- An Empirical Evaluation Of Generic Convolutional And Recurrent Networks For Sequence Modeling (2018) · 7.31
- Quaternion Recurrent Neural Networks (2018) · 0.00
- From Nodes To Networks: Evolving Recurrent Neural Networks (2018) · 0.00
- Linear Memory Networks (2018) · 6.34
- Incremental Training Of A Recurrent Neural Network Exploiting A Multi-scale Dynamic Memory (2020) · 3.58