On the Applicability of Neural Network and Machine Learning Methodologies to Natural Language Processing

dc.contributor.authorLawrence, Steveen_US
dc.contributor.authorGiles, C. Leeen_US
dc.contributor.authorFong, Sandiwayen_US
dc.date.accessioned2004-05-31T22:32:51Z
dc.date.available2004-05-31T22:32:51Z
dc.date.created1995-06en_US
dc.date.issued1998-10-15en_US
dc.description.abstractWe examine the inductive inference of a complex grammar - specifically, we consider the task of training a model to classify natural language sentences as grammatical or ungrammatical, thereby exhibiting the same kind of discriminatory power provided by the Principles and Parameters linguistic framework, or Government- and-Binding theory. We investigate the following models: feed-forward neural networks, Fransconi-Gori-Soda and Back-Tsoi locally recurrent networks, Elman, Narendra \& Parthasarathy, and Williams \& Zipser recurrent networks, Euclidean and edit-distance nearest-neighbors, simulated annealing, and decision trees. The feed-forward neural networks and non-neural network machine learning models are included primarily for comparison. We address the question: How can a neural network, with its distributed nature and gradient descent based iterative calculations, possess linguistic capability which is traditionally handled with symbolic computation and recursive processes? Initial simulations with all models were only partially successful by using a large temporal window as input. Models trained in this fashion did not learn the grammar to a significant degree. Attempts at training recurrent networks with small temporal input windows failed until we implemented several techniques aimed at improving the convergence of the gradient descent training algorithms. We discuss the theory and present an empirical study of a variety of models and learning algorithms which highlights behaviour not present when attempting to learn a simpler grammar. (Also cross-referenced as UMIACS-TR-95-64)en_US
dc.format.extent756744 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/733
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-3479en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-95-64en_US
dc.titleOn the Applicability of Neural Network and Machine Learning Methodologies to Natural Language Processingen_US
dc.typeTechnical Reporten_US

Files

Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
CS-TR-3479.ps
Size:
739.01 KB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-3479.pdf
Size:
453.86 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-3479.ps