Browsing by Author "Weinberg, Amy"
Item Breaking the Resource Bottleneck for Multilingual Parsing (2002-05-22) Hwa, Rebecca; Resnik, Philip; Weinberg, Amy
We propose a framework that enables the acquisition of annotation-heavy resources such as syntactic dependency tree corpora for low-resource languages by importing linguistic annotations from high-quality English resources. We present a large-scale experiment showing that Chinese dependency trees can be induced by using an English parser, a word alignment package, and a large corpus of sentence-aligned bilingual text. As a part of the experiment, we evaluate the quality of a Chinese parser trained on the induced dependency treebank. We find that a parser trained in this manner outperforms some simple baselines in spite of the noise in the induced treebank. The results suggest that projecting syntactic structures from English is a viable option for acquiring annotated syntactic structures quickly and cheaply. We expect the quality of the induced treebank to improve when more sophisticated filtering and error-correction techniques are applied. (Also LAMP-TR-086) (Also UMIACS-TR-2002-35)

Item Evaluating Translational Correspondence using Annotation Projection (2003-04-04) Hwa, Rebecca; Resnik, Philip; Weinberg, Amy; Kolak, Okan
Recently, statistical machine translation models have begun to take advantage of higher-level linguistic structures such as syntactic dependencies. Underlying these models is an assumption about the directness of translational correspondence between sentences in the two languages; however, the extent to which this assumption is valid and useful is not well understood. In this paper, we present an empirical study that quantifies the degree to which syntactic dependencies are preserved when parses are projected directly from English to Chinese.
Our results show that although the direct correspondence assumption is often too restrictive, a small set of principled, elementary linguistic transformations can boost the quality of the projected Chinese parses by 76% relative to the unimproved baseline. UMIACS-TR-2003-25 LAMP-TR-100

Item The Linguist and the Laundromat (1998-11-03) Weinberg, Amy
This paper resulted from a roundtable discussion at the 1998 CUNY Sentence Processing Conference held at Rutgers University. Jerry Fodor (Philosophy, Rutgers University) argued there that an adequate lexical semantics had to invoke a criterion of Reverse Compositionality. Fodor gives the following definition of 'Reverse Compositionality' (RC): "Nothing belongs to the lexical entry for a lexical item except what that item contributes to the grammatical representation of its hosts" where 'host' is defined as "any expression E ... of which E is a constituent." Moreover, Fodor claims that invoking this criterion has broad consequences for theories of language processing and acquisition, particularly with respect to theories that attribute processing behavior to "lexical effects." Fodor claims that "...most of what cognitive science blithely refers to as lexical effects in parsing and language learning aren't in fact mediated by information of the kind that lexical entries contain...." and "... that language acquisition delivers shallow lexical entries consonant with reverse compositionality, and that parsing delivers correspondingly shallow lexical entries consonant with assigning tokens to their types, and that everything else will turn out to be 'performance theory' ..." In this paper, I argue that frequency and other standard lexical processing effects can form a legitimate part of a theory of sentence processing even if it adopts the criterion of "reverse compositionality".
Cases drawn from the literature are used to sketch what a theory adopting Fodor's criterion and using frequency and/or probabilistic information would look like. This commentary will appear in Proceedings of CUNY Conference on Sentence Processing, 1998, S. Stevenson and P. Merlo, eds, J. Benjamins. Also cross-referenced as UMIACS-TR-98-52

Item Minimalist Theory of Human Sentence Processing (1998-11-03) Weinberg, Amy
Research in the theory of human sentence processing can be characterized by three styles of explanation. Researchers taking the first track have tried to motivate principles of structural preference from extralinguistic considerations like storage capacity in working memory, or bounds on complexity of incremental analysis. Frazier and Rayner's (1982) Minimal Attachment and Right Association principles, and Gorrell's simplicity metric, are examples of this type of theory. The second track eschews "parsing strategies", replacing them with a fairly complex tuning by speaker/hearers to frequency in the hearer's linguistic environment. The difficulty of recovering an analysis of a construction in a particular case is a function of how often similar structures or thematic role arrays appear in the language as a whole. The work of Trueswell et al. (1994), Jurafsky (1996) and MacDonald et al. (1994) are examples of frequency- or probability-based constraint satisfaction theories. The third track takes a more representational view and ties processing principles to independently needed restrictions derived from competence and language learning. This approach claims that the natural language faculty is extremely well designed in the sense that the same set of principles that govern language learning also contribute to a theory of sentence processing.
This track is represented by the work of Gibson (1981), Gorrell (1995), Pritchett (1992), Philips (1995, 1996) and Weinberg (1992), who argue that processing can be seen as the rapid incremental satisfaction of grammatical constraints such as the Theta Criterion, which are needed independently to explain language learning or language variation. A variant of this approach, represented by Crain and Steedman (1985) among others, constrains the grammatical source for parsing principles but locates these principles within a discourse or semantic, rather than a syntactic, component. This paper proposes a model of the last type. We argue that a particular version of the Minimalist Program (Chomsky (1993), Uriagereka (1998)) provides both the principles needed to explain initial human preferences for ambiguous structures and a theory of reanalysis, explaining when initial preferences can be revised given subsequent disconfirming data, and when they lead to unrevisable garden paths. We compare our model to other linguistically motivated theories such as Philips (1995, 1996), arguing that Minimalist principles subsume the generalizations captured by Philips's theory in a more empirically adequate way. Finally, we argue that the data presented argue for this theory over those motivated by extralinguistic principles. Also cross-referenced as UMIACS-TR-98-53