Learning Long-Term Dependencies is Not as Difficult With NARX Recurrent Neural Networks

dc.contributor.author: Lin, Tsungnan
dc.contributor.author: Horne, Bill G.
dc.contributor.author: Tino, Peter
dc.contributor.author: Giles, C. Lee
dc.date.accessioned: 2004-05-31T22:33:43Z
dc.date.available: 2004-05-31T22:33:43Z
dc.date.created: 1995-07
dc.date.issued: 1998-10-15
dc.description.abstract: It has recently been shown that gradient descent learning algorithms for recurrent neural networks can perform poorly on tasks that involve long-term dependencies, i.e. those problems for which the desired output depends on inputs presented at times far in the past. In this paper we explore the long-term dependencies problem for a class of architectures called NARX recurrent neural networks, which have powerful representational capabilities. We have previously reported that gradient descent learning is more effective in NARX networks than in recurrent neural network architectures that have "hidden states" on problems including grammatical inference and nonlinear system identification. Typically, the network converges much faster and generalizes better than other networks. The results in this paper are an attempt to explain this phenomenon. We present some experimental results which show that NARX networks can often retain information for two to three times as long as conventional recurrent neural networks. We show that although NARX networks do not circumvent the problem of long-term dependencies, they can greatly improve performance on long-term dependency problems. We also describe in detail some of the assumptions regarding what it means to latch information robustly and suggest possible ways to loosen these assumptions. (Also cross-referenced as UMIACS-TR-95-78)
dc.format.extent: 481301 bytes
dc.format.mimetype: application/postscript
dc.identifier.uri: http://hdl.handle.net/1903/745
dc.language.iso: en_US
dc.relation.isAvailableAt: Digital Repository at the University of Maryland
dc.relation.isAvailableAt: University of Maryland (College Park, Md.)
dc.relation.isAvailableAt: Tech Reports in Computer Science and Engineering
dc.relation.isAvailableAt: UMIACS Technical Reports
dc.relation.ispartofseries: UM Computer Science Department; CS-TR-3500
dc.relation.ispartofseries: UMIACS; UMIACS-TR-95-78
dc.title: Learning Long-Term Dependencies is Not as Difficult With NARX Recurrent Neural Networks
dc.type: Technical Report
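The abstract describes the NARX architecture, in which delayed copies of the network's own output are fed back as inputs alongside the exogenous signal. As a rough illustration of that recurrence only, here is a minimal sketch; the map f, its random weights, and the tap orders DY/DU are all hypothetical stand-ins and are not taken from the report:

```python
import numpy as np

rng = np.random.default_rng(0)

# Tap-delay orders: how many past outputs (DY) and past exogenous
# inputs (DU) feed the model at each step. Illustrative values only.
DY, DU = 3, 2

# A tiny fixed single-hidden-layer map standing in for the learned
# function f; random weights, just to make the recurrence runnable.
W1 = rng.normal(size=(8, DY + DU))
b1 = rng.normal(size=8)
W2 = rng.normal(size=8)

def f(x):
    # One hidden tanh layer, linear output.
    return float(W2 @ np.tanh(W1 @ x + b1))

def run_narx(u):
    """Run the NARX recurrence y(t) = f(y(t-1..t-DY), u(t-1..t-DU)).

    The delayed outputs act as the network's state, so information
    from the past can reach the present through these direct feedback
    taps rather than only through a chain of hidden-state updates.
    """
    y = np.zeros(len(u))
    for t in range(max(DY, DU), len(u)):
        # Most recent values first in each tap-delay window.
        x = np.concatenate([y[t - DY:t][::-1], u[t - DU:t][::-1]])
        y[t] = f(x)
    return y

u = rng.normal(size=20)   # exogenous input sequence
y = run_narx(u)
```

In a trained NARX network the weights in f would be fit by gradient descent; the sketch only shows the output-feedback structure the abstract refers to.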

Files

Original bundle
Name: CS-TR-3500.ps
Size: 470.02 KB
Format: PostScript

Name: CS-TR-3500.pdf
Size: 226.58 KB
Format: Adobe Portable Document Format
Description: Auto-generated copy of CS-TR-3500.ps