What Size Neural Network Gives Optimal Generalization? Convergence Properties of Backpropagation

dc.contributor.author: Lawrence, Steve
dc.contributor.author: Giles, C. Lee
dc.contributor.author: Tsoi, Ah Chung
dc.date.accessioned: 2004-05-31T22:38:33Z
dc.date.available: 2004-05-31T22:38:33Z
dc.date.created: 1996-04
dc.date.issued: 1998-10-15
dc.description.abstract: One of the most important aspects of any machine learning paradigm is how it scales with problem size and complexity. Using a task with known optimal training error and a pre-specified maximum number of training updates, we investigate the convergence of the backpropagation algorithm with respect to a) the complexity of the required function approximation, b) the size of the network in relation to the size required for an optimal solution, and c) the degree of noise in the training data. In general, for a) the solution found is worse when the function to be approximated is more complex, for b) oversized networks can result in lower training and generalization error, and for c) committee or ensemble techniques become more beneficial as the amount of noise in the training data increases. In none of our experiments did we obtain the optimal solution. We further support the observation that larger networks can produce better training and generalization error with a face recognition example, in which a network with many more parameters than training points generalizes better than smaller networks. (Also cross-referenced as UMIACS-TR-96-22)
dc.format.extent: 2226577 bytes
dc.format.mimetype: application/postscript
dc.identifier.uri: http://hdl.handle.net/1903/809
dc.language.iso: en_US
dc.relation.isAvailableAt: Digital Repository at the University of Maryland
dc.relation.isAvailableAt: University of Maryland (College Park, Md.)
dc.relation.isAvailableAt: Tech Reports in Computer Science and Engineering
dc.relation.isAvailableAt: UMIACS Technical Reports
dc.relation.ispartofseries: UM Computer Science Department; CS-TR-3617
dc.relation.ispartofseries: UMIACS; UMIACS-TR-96-22
dc.title: What Size Neural Network Gives Optimal Generalization? Convergence Properties of Backpropagation
dc.type: Technical Report
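The committee/ensemble technique mentioned in the abstract can be sketched as follows: several independently initialized networks are trained by plain backpropagation on the same noisy data, and their outputs are averaged at test time. This is an illustrative sketch only, not the paper's code; the regression task, network size, and all hyperparameters below are assumptions chosen to make the example runnable.

```python
# Sketch (not the paper's code) of a committee/ensemble of small MLPs trained
# with backpropagation on noisy data; predictions are averaged at test time.
# Task, sizes, and hyperparameters are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)

# Noisy 1-D regression task: y = sin(x) + Gaussian noise.
x = rng.uniform(-3, 3, size=(64, 1))
y = np.sin(x) + rng.normal(scale=0.2, size=x.shape)

def train_mlp(hidden, steps=2000, lr=0.05):
    """Train one small one-hidden-layer tanh MLP with plain backpropagation."""
    w1 = rng.normal(scale=0.5, size=(1, hidden))
    b1 = np.zeros(hidden)
    w2 = rng.normal(scale=0.5, size=(hidden, 1))
    b2 = np.zeros(1)
    for _ in range(steps):
        h = np.tanh(x @ w1 + b1)        # forward pass, hidden activations
        pred = h @ w2 + b2
        err = pred - y                  # gradient of 0.5 * MSE w.r.t. pred
        gw2 = h.T @ err / len(x)        # backprop through the output layer
        gb2 = err.mean(axis=0)
        dh = (err @ w2.T) * (1 - h**2)  # backprop through tanh
        gw1 = x.T @ dh / len(x)
        gb1 = dh.mean(axis=0)
        w1 -= lr * gw1; b1 -= lr * gb1  # gradient-descent updates
        w2 -= lr * gw2; b2 -= lr * gb2
    return lambda xs: np.tanh(xs @ w1 + b1) @ w2 + b2

# A committee of five networks, each with a different random initialization.
committee = [train_mlp(hidden=8) for _ in range(5)]

x_test = np.linspace(-3, 3, 200).reshape(-1, 1)
ensemble_pred = np.mean([net(x_test) for net in committee], axis=0)

# Compare against the noise-free target sin(x).
mse_single = float(np.mean((committee[0](x_test) - np.sin(x_test)) ** 2))
mse_ensemble = float(np.mean((ensemble_pred - np.sin(x_test)) ** 2))
print(f"single-net test MSE: {mse_single:.4f}")
print(f"ensemble test MSE:   {mse_ensemble:.4f}")
```

Averaging tends to cancel the portion of each network's error that comes from its particular initialization and from fitting the noise, which is consistent with the abstract's observation that ensembles help more as the training noise increases.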

Files

Original bundle (showing 2 of 2):
- Name: CS-TR-3617.ps; Size: 2.12 MB; Format: PostScript
- Name: CS-TR-3617.pdf; Size: 575.02 KB; Format: Adobe Portable Document Format; Description: auto-generated copy of CS-TR-3617.ps