How Embedded Memory in Recurrent Neural Network Architectures Helps Learning Long-term Dependencies

dc.contributor.author: Lin, Tsungnan
dc.contributor.author: Horne, Bill G.
dc.contributor.author: Giles, C. Lee
dc.date.accessioned: 2004-05-31T22:38:52Z
dc.date.available: 2004-05-31T22:38:52Z
dc.date.created: 1996-08
dc.date.issued: 1998-10-15
dc.description.abstract: Learning long-term temporal dependencies with recurrent neural networks can be a difficult problem. It has recently been shown that a class of recurrent neural networks called NARX networks performs much better than conventional recurrent neural networks at learning certain simple long-term dependency problems. The intuitive explanation for this behavior is that the output memories of a NARX network are manifested as jump-ahead connections in the time-unfolded network. These jump-ahead connections propagate gradient information more efficiently, making the network less sensitive to the problem of long-term dependencies. This work gives empirical justification for our hypothesis that similar improvements in learning long-term dependencies can be achieved with other classes of recurrent neural network architectures simply by increasing the order of the embedded memory. In particular, we explore how three classes of recurrent neural network architectures learn simple long-term dependency problems: globally recurrent networks, locally recurrent networks, and NARX (output feedback) networks. Comparing the performance of these architectures with different orders of embedded memory on two simple long-term dependency problems shows that all three classes of architectures improve significantly at learning long-term dependencies when the order of the embedded memory is increased. These results can be important to a user comfortable with a specific recurrent neural network architecture, because simply increasing the order of the embedded memory makes the architecture more robust to the problem of long-term dependency learning. (Also cross-referenced as UMIACS-TR-96-28)
dc.format.extent: 275595 bytes
dc.format.mimetype: application/postscript
dc.identifier.uri: http://hdl.handle.net/1903/813
dc.language.iso: en_US
dc.relation.isAvailableAt: Digital Repository at the University of Maryland
dc.relation.isAvailableAt: University of Maryland (College Park, Md.)
dc.relation.isAvailableAt: Tech Reports in Computer Science and Engineering
dc.relation.isAvailableAt: UMIACS Technical Reports
dc.relation.ispartofseries: UM Computer Science Department; CS-TR-3626
dc.relation.ispartofseries: UMIACS; UMIACS-TR-96-28
dc.title: How Embedded Memory in Recurrent Neural Network Architectures Helps Learning Long-term Dependencies
dc.type: Technical Report
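
The abstract's notion of embedded memory can be made concrete with a small sketch. The NumPy code below is not from the report; narx_step, run_narx, W_in, W_fb, and D are illustrative names chosen here. It models a NARX-style step in which the output at time t is computed from the current input and the previous D outputs. Because each output is wired directly to outputs up to D steps in the past, unfolding the model in time produces the jump-ahead connections described in the abstract, shortening the paths along which gradients must propagate.

```python
import numpy as np

def narx_step(u_t, y_history, W_in, W_fb, b):
    # The new output depends on the current input and the last D outputs, so the
    # time-unfolded graph contains connections that jump up to D steps back.
    return np.tanh(W_in @ u_t + W_fb @ np.concatenate(y_history) + b)

def run_narx(inputs, D, W_in, W_fb, b, out_dim):
    y_history = [np.zeros(out_dim) for _ in range(D)]  # order-D embedded (output) memory
    outputs = []
    for u_t in inputs:
        y_t = narx_step(u_t, y_history, W_in, W_fb, b)
        y_history = y_history[1:] + [y_t]  # shift the output taps one step
        outputs.append(y_t)
    return outputs

# Usage: a toy model with input dimension 3, scalar output, and memory order D = 4.
rng = np.random.default_rng(0)
D, in_dim, out_dim = 4, 3, 1
W_in = 0.1 * rng.standard_normal((out_dim, in_dim))
W_fb = 0.1 * rng.standard_normal((out_dim, D * out_dim))
b = np.zeros(out_dim)
outputs = run_narx([rng.standard_normal(in_dim) for _ in range(20)], D, W_in, W_fb, b, out_dim)
```

Increasing D raises the order of the embedded memory; the report compares how such an increase affects globally recurrent, locally recurrent, and NARX architectures on simple long-term dependency problems.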

Files

Original bundle (2 files)
- CS-TR-3626.ps (269.14 KB, PostScript)
- CS-TR-3626.pdf (181.41 KB, Adobe Portable Document Format; auto-generated copy of CS-TR-3626.ps)