Browsing by Author "Maxwell, Michael"
Now showing 1 - 3 of 3
Results Per Page
Sort Options
Item Correcting Errors in Digital Lexicographic Resources Using a Dictionary Manipulation Language(Trojina Institute for Applied Slovene Studies, 2011-11) Zajic, David; Maxwell, Michael; Doermann, David; Rodrigues, Paul; Bloodgood, MichaelWe describe a paradigm for combining manual and automatic error correction of noisy structured lexicographic data. Modifications to the structure and underlying text of the lexicographic data are expressed in a simple, interpreted programming language. Dictionary Manipulation Language (DML) commands identify nodes by unique identifiers, and manipulations are performed using simple commands such as create, move, set text, etc. Corrected lexicons are produced by applying sequences of DML commands to the source version of the lexicon. DML commands can be written manually to repair one-off errors or generated automatically to correct recurring problems. We discuss advantages of the paradigm for the task of editing digital bilingual dictionaries.Item Interoperable Grammars(2008) Maxwell, Michael; David, AnneFor languages with significant inflectional morphology, development of a morphological parser is often a prerequisite to further computational linguistic capabilities. We focus on two difficulties for this development: the short lifetime of software such as parsing engines, and the difficulty of porting grammars to new parsing engines. We describe a methodology we have developed to promote portability, using a formal declarative grammar written in XML, which we supplement with a traditional descriptive grammar. The two grammars are combined into a single document using Literate Programming. The formal grammar is designed to be independent of a particular parsing engine’s programming language, thus helping solve the software lifetime and portability problems.Item Joint Grammar Development by Linguists and Computer Scientists(2008) Maxwell, Michael; David, AnneFor languages with inflectional morphology, development of a morphological parser is often a bottleneck for further development of computational linguistic capabilities. We focus on two difficulties: first, finding people with expertise in both computer programming and the linguistics of a particular language, and second, the short lifetime of software such as parsers. We then describe a methodology we have developed to split the task of building a parser for a language into two tasks, descriptive grammar development and formal grammar development. The two grammars are combined into a single document using Literate Programming. The formal grammar is designed not to be dependent on a particular parsing engine’s programming language, so that it can be readily ported to a new parsing engine, thus helping solve the software lifetime problem.