Linguistics
Permanent URI for this communityhttp://hdl.handle.net/1903/2255
Browse
87 results
Search Results
Item Future reference 'without' future morphology(2024) Mendes, Jéssica Viana; Hacquard, Valentine; Santorio, Paolo; Linguistics; Digital Repository at the University of Maryland; University of Maryland (College Park, Md.)In some languages, present morphology can be used to refer to non-scheduled future events. Since this form of future reference is constrained to certain subordinate environments, like conditional antecedents (‘If John gets a new job, he played his cards right’) and relative clauses (‘Everyone who gets invited to this party is very lucky’), I propose to call the phenomenon Subordinate Future (SF). Two factors have hindered our understanding of the SF: First, the SF often occurs in modalized sentences, which makes it difficult to tease apart its contribution from that of the environment. Second, present morphology in English can express several readings; therefore, the appearance of this future is not particularly informative. This dissertation brings new intra- and cross-linguistic evidence to bear on the nature and the meaning of the SF. I observe that, in addition to temporal displacement, the SF also introduces modal displacement. Then, I argue that the source of this modality is a subjunctive mood morpheme, which is silent in English, but pronounced in Portuguese. I proceed to decompose the semantics of the subjunctive. I propose that the subjunctive should be treated as a Heimian indefinite (Heim, 1982) ranging over situations. Simply put, the role of the subjunctive is to introduce a situation variable. The motivation for my proposal comes from the behavior of the subjunctive in relative clauses, and from the anaphoric pattern of sentences with the SF. In relative clauses, the SF blocks a specific reading of the DP. Besides that, the SF seems to be able to ‘bind’ the situation variable of predicates outside of its domain of c-command, giving rise to modal donkey anaphora. These two facts would be difficult to reconcile with a quantificational treatment of the subjunctive. I then turn my attention to the temporal interpretation of the phenomenon. As Crouch (1993, 1994) observed, this future is also able to anchor the temporal interpretation of clauses outside of its domain of c-command. I propose that this effect is a byproduct of modal donkey anaphora, and demonstrate how casting my proposal in terms of situations provides a natural account of the phenomenon. I conclude with a comparison between my proposal and existing accounts.Item A world without words: A non-lexicalist framework for psycho- and neuro-linguistics(2024) Krauska, Alexandra; Lau, Ellen; Linguistics; Digital Repository at the University of Maryland; University of Maryland (College Park, Md.)In standard models of language production or comprehension, the elements which are retrieved from memory and combined into a syntactic structure are “lemmas” or “lexical items”. Such models implicitly take a “lexicalist” approach, which assumes that lexical items store meaning, syntax, and form together, that syntactic and lexical processes are distinct, and that syntactic structure does not extend below the word level. Across the last several decades, linguistic research examining a typologically diverse set of languages has provided strong evidence against this approach. These findings suggest that syntactic processes apply both above and below the “word” level, and that both meaning and form are partially determined by the syntactic context. This has significant implications for psychological and neurological models of language processing as well as for the way that we understand different types of aphasia and other language disorders. As a consequence of the lexicalist assumptions of these models, many kinds of sentences that speakers produce and comprehend - in a variety of languages, including English - are challenging for them to account for. In order to move away from lexicalism in psycho- and neuro-linguistics, it is not enough to simply update the syntactic representations of words or phrases; the processing algorithms involved in language production are constrained by the lexicalist representations that they operate on, and thus also need to be reimagined. This dissertation discusses the issues with lexicalism in linguistic theory as well as its implications in psycho- and neuro-linguistics. In addition, I propose a non-lexicalist model of language production, the “WithOut Words” (WOW) model, which does not rely on lemma representations, but instead represents that knowledge as independent mappings between meaning and syntax, and syntax and form, with a single integrated stage for the retrieval and assembly of syntactic structure. Based on this, the model suggests that neural responses during language production should be modulated not just by the pieces of meaning, syntax, and form, but also by the complexity of the mapping processes which link those separate representations. This prediction is supported by the results of a novel experimental paradigm using electroencephalography (EEG) during language production, which observes greater neural responses for meaning-syntax and syntax-form mapping complexity in two separate time windows. Finally, I re-evaluate the dissociation between regular and irregular verbs in aphasia, which has been used as supporting evidence for a distinction between the grammar and the lexicon. By training recurrent neural networks and measuring their performance after lesioning, I show that the observed clinical data can be accounted for within a single mechanism. By moving away from lexicalist assumptions, the non-lexicalist framework described in this dissertation provides better cross-linguistic coverage and aligns better with contemporary syntactic theory.Item MODELING ADAPTABILITY MECHANISMS OF SPEECH PERCEPTION Nika Jurov(2024) Jurov, Nika; Feldman, Naomi H.; Idsardi, William; Linguistics; Digital Repository at the University of Maryland; University of Maryland (College Park, Md.)Speech is a complex, redundant and variable signal happening in a noisy and ever changing world. How do listeners navigate these complex auditory scenes and continuously and effortlessly understand most of the speakers around them? Studies show that listeners can quickly adapt to new situations, accents and even to distorted speech. Although prior research has established that listeners rely more on some speech cues (or also called features or dimensions) than others, it is yet not understood how listeners weight them flexibly on a moment-to-moment basis when the input might deviate from the standard speech. This thesis computationally explores flexible cue re-weighting as an adaptation mechanism using real speech corpora. The computational framework it relies on is rate distortion theory. This framework models a channel that is optimized on a trade off between distortion and rate: on the one hand, the input signal should be reconstructed with minimal error after it goes through the channel. On the other hand, the channel needs to extract parsimonious information from the incoming data. This channel can be implemented as a neural network with a beta variational auto-encoder. We use this model to show that two mechanistic components are needed for adaptation: focus and switch. We firstly show that focus on a cue mimics humans better than cue weights that simply depend on long term statistics as has been largely assumed in the prior research. And second, we show a new model that can quickly adapt and switch weighting the features depending on the input of a particular moment. This model's flexibility comes from implementing a cognitive mechanism that has been called ``selective attention" with multiple encoders. Each encoder serves as a focus on a different part of the signal. We can then choose how much to rely on each focus depending on the moment. Finally, we ask whether cue weighting is informed by being able to separate the noise from speech. To this end we adapt a feature disentanglement adversarial training from vision to disentangle speech (noise) features from noise (speech) labels. We show that although this does not give us human-like cue weighting behavior, there is an effect of disentanglement of weighting spectral information slightly more than temporal information compared to the baselines. Overall, this thesis explores adaptation computationally and offers a possible mechanistic explanation for ``selective attention'' with focus and switch mechanisms, based on rate distortion theory. It also argues that cue weighting cannot be determined solely on speech carefully articulated in laboratories or in quiet. Lastly, it explores a way to inform speech models from a cognitive angle to make the models more flexible and robust, like human speech perception is.Item Headedness and the Lexicon: The Case of Verb-to-Noun Ratios(MDPI, 2020-02-13) Polinsky, Maria; Magyar, LillaThis paper takes a well-known observation as its starting point, that is, languages vary with respect to headedness, with the standard head-initial and head-final types well attested. Is there a connection between headedness and the size of a lexical class? Although this question seems quite straightforward, there are formidable methodological and theoretical challenges in addressing it. Building on initial results by several researchers, we refine our methodology and consider the proportion of nouns to simplex verbs (as opposed to light verb constructions) in a varied sample of 33 languages to evaluate the connection between headedness and the size of a lexical class. We demonstrate a robust correlation between this proportion and headedness. While the proportion of nouns in a lexicon is relatively stable, head-final/object-verb (OV)-type languages (e.g., Japanese or Hungarian) have a relatively small number of simplex verbs, whereas head-initial/verb-initial languages (e.g., Irish or Zapotec) have a considerably larger percentage of such verbs. The difference between the head-final and head-initial type is statistically significant. We, then, consider a subset of languages characterized as subject-verb-object (SVO) and show that this group is not uniform. Those SVO languages that have strong head-initial characteristics (as shown by the order of constituents in a set of phrases and word order alternations) are characterized by a relatively large proportion of lexical verbs. SVO languages that have strong head-final traits (e.g., Mandarin Chinese) pattern with head-final languages, and a small subset of SVO languages are genuinely in the middle (e.g., English, Russian). We offer a tentative explanation for this headedness asymmetry, couched in terms of informativity and parsing principles, and discuss additional evidence in support of our account. All told, the fewer simplex verbs in head-final/OV-type languages is an adaptation in response to their particular pattern of headedness. The object-verb/verb-object (OV/VO) difference with respect to noun/verb ratios also reveals itself in SVO languages; some languages, Chinese and Latin among them, show a strongly OV ratio, whereas others, such as Romance or Bantu, are VO-like in their noun/verb ratios. The proportion of nouns to verbs thus emerges as a new linguistic characteristic that is correlated with headedness.Item The Extended Merge Hypothesis and the Fundamental Principle of Grammar(MDPI, 2021-10-20) Hornstein, NorbertThis paper discusses the main minimalist theory within the Minimalist Program, something I dub the (Weak) Merge Hypothesis (MH). (1) The (Weak) Merge Hypothesis (MH): Merge is a central G operation. I suggest that we extend (1) by adding to it a general principle that I dub the Fundamental Principle of Grammar (FPG). (2) The Fundamental Principle of Grammar (FPG): α and β can be grammatically related. (G-related) only if α and β have merged. Adding (1) and (2) gives us the Strong Merge Hypothesis. (3) The Strong Merge Hypothesis (SMH): All grammatical dependencies are mediated by Merge. SMH has some interesting consequences which the rest of the paper briefly reviews. Highlights include the Movement Theory of Construal, The Periscope Property on selection, as well as preserving the standard results from the Weak Merge Hypothesis.Item Categories with Complements(MDPI, 2022-09-15) Uriagereka, JuanVerbs and nouns gear θ-dependencies, Case, agreement, or construal relations. Building on Chomsky’s 1974 decomposition of such categories into ±N, ±V features, by translating said features into ±1, ±i scalars that allow for the construction of a vector space, this paper studies the possibility of organizing said features into 2 × 2 square matrices. In the system proposed to explore “head-complement” relations, operating on nouns yields a measurable/observable (Hermitian matrix), which in turn limits other potential combinations with abstract lexical categories. Functional/grammatical categories in the system deploy the same features, albeit organized differently in the matrix diagonal and off-diagonal. The algebraic result is a group with well-defined mathematical properties, which properly includes the Pauli group of standard use in quantum computation. In the system, the presumed difference between categories and interactions—here, in a context of the head-complement sort—reduces to whether the magnitude of the matrix eigenvalue is 1 or not, in the latter instance inducing asymmetric interactions.Item GENERATING AND MEASURING PREDICTIONS IN LANGUAGE PROCESSING(2023) Nakamura, Masato; Philips, Colin; Linguistics; Digital Repository at the University of Maryland; University of Maryland (College Park, Md.)Humans can comprehend utterances quickly, efficiently, and often robustly against noise in the inputs. Researchers have argued that such a remarkable ability is supported by prediction of upcoming inputs. If people use the context to infer what they would hear/see and prepare for likely inputs, they should be able to efficiently process the predicted inputs.This thesis investigates how contexts can predictively activate lexical representations (lexical pre-activation). I address two different aspects of prediction: (i) how pre-activation is generated using contextual information and stored knowledge, and (ii) how pre-activation is reflected in different measures. I first assess the linking hypothesis of the speeded cloze task, a measure of pre-activation, through computational simulations. I demonstrate that an earlier model accounts for qualitative patterns of human data but fails to predict quantitative patterns. I argue that a model with an additional but reasonable assumption of lateral inhibition successfully explains these patterns. Building on the first study, I demonstrate that pre-activation measures fail to align with each other in cases called argument role reversals, even if the time courses and stimuli are carefully matched. The speeded cloze task shows that “role-appropriate” serve in ... which customer the waitress had served is more strongly pre-activated compared to the “role- inappropriate” serve in ... which waitress the customer had served. On the other hand, the N400 amplitude, which is another pre-activation measure, does not show contrasts be- tween the role-appropriate and inappropriate serve. Accounting for such a mismatch between measures in argument role reversals provides insights into whether and how argument roles constrain pre-activation as well as how different measures reflect pre-activation. Subsequent studies addressed whether pre-activation is sensitive to argument roles or not. Analyses of context-wise variability of role-inappropriate candidates suggest that there are some role-inappropriate pre-activations even in the speeded cloze task. The next study at- tempts to directly contrast pre-activations of role-appropriate and inappropriate candidates, eliminating the effect of later confounding processes by distributional analyses of reaction times. While one task suggests that role-appropriate candidates are more strongly pre- activated compared to the role-inappropriate candidates, the other task suggests that they have matched pre-activation. Finally, I examine the influence of role-appropriate competitors on role-inappropriate competitors. The analyses of speeded cloze data suggest that N400 amplitudes can be sensitive to argument roles when there are strong role-appropriate competitors. This finding can be explained by general role-insensitivity and partial role-sensitivity in pre-activation processes. Combined together, these studies suggest that pre-activation processes are generally insensitive to argument roles, but some role-sensitive mechanisms can cause role-sensitivity in pre-activation measures under some circumstances.Item The Learning and Usage of Second Language Speech Sounds: A Computational and Neural Approach(2023) Thorburn, Craig Adam; Feldman, Naomi H; Linguistics; Digital Repository at the University of Maryland; University of Maryland (College Park, Md.)Language learners need to map a continuous, multidimensional acoustic signal to discrete abstract speech categories. The complexity of this mapping poses a difficult learning problem, particularly for second language learners who struggle to acquire the speech sounds of a non-native language, and almost never reach native-like ability. A common example used to illustrate this phenomenon is the distinction between /r/ and /l/ (Goto, 1971). While these sounds are distinct in English and native English speakers easily distinguish the two sounds, native Japanese speakers find this difficult, as the sounds are not contrastive in their language. Even with much explicit training, Japanese speakers do not seem to be able to reach native-like ability (Logan, Lively, & Pisoni, 1991; Lively, Logan & Pisoni, 1993) In this dissertation, I closely explore the mechanisms and computations that underlie effective second-language speech sound learning. I study a case of particularly effective learning--- a video game paradigm where non-native speech sounds have functional significance (Lim & Holt, 2011). I discuss the relationship with a Dual Systems Model of auditory category learning and extend this model, bringing it together with the idea of perceptual space learning from infant phonetic learning. In doing this, I describe why different category types are better learned in different experimental paradigms and when different neural circuits are engaged. I propose a novel split where different learning systems are able to update different stages of the acoustic-phonetic mapping from speech to abstract categories. To do this I formalize the video game paradigm computationally and implement a deep reinforcement learning network to map between environmental input and actions. In addition, I study how these categories could be used during online processing through an MEG study where second-language learners of English listen to continuous naturalistic speech. I show that despite the challenges of speech sound learning, second language listeners are able to predict upcoming material integrating different levels of contextual information and show similar responses to native English speakers. I discuss the implications of these findings and how the could be integrated with literature on the nature of speech representation in a second language.Item The competition–compensation account of developmental language disorder(Wiley, 2022-12-22) Harmon, Zara; Barak, Libby; Shafto, Patrick; Edwards, Jan; Feldman, Naomi H.Children with developmental language disorder (DLD) regularly use the bare form of verbs (e.g., dance) instead of inflected forms (e.g., danced). We propose an account of this behavior in which processing difficulties of children with DLD disproportionally affect processing novel inflected verbs in their input. Limited experience with inflection in novel contexts leads the inflection to face stronger competition from alternatives. Competition is resolved through a compensatory behavior that involves producing a more accessible alternative: in English, the bare form. We formalize this hypothesis within a probabilistic model that trades off context-dependent versus independent processing. Results show an over-reliance on preceding stem contexts when retrieving the inflection in a model that has difficulty with processing novel inflected forms. We further show that following the introduction of a bias to store and retrieve forms with preceding contexts, generalization in the typically developing (TD) models remains more or less stable, while the same bias in the DLD models exaggerates difficulties with generalization. Together, the results suggest that inconsistent use of inflectional morphemes by children with DLD could stem from inferences they make on the basis of data containing fewer novel inflected forms. Our account extends these findings to suggest that problems with detecting a form in novel contexts combined with a bias to rely on familiar contexts when retrieving a form could explain sequential planning difficulties in children with DLD.Item The role of internal constraints and stylistic congruence on a variant's social impact(Cambridge University Press, 2023-02-02) Vaughn, CharlotteIn natural conversation, multiple factors likely impact the social force of a sociolinguistic variant, yet researchers have tended to examine individual factors in isolation. This paper considers two underexamined factors together—the role of a variable's internal constraints and the role of stylistically congruent surrounding speech—to understand their combined influence on how a single variable's realization is socially interpreted. Focusing on English variable (ING), two accent rating experiments used stimuli varying the grammatical category of (ING) words and varying the stylistic congruence (natural sentences versus spliced stimuli) between (ING) realization and sentence frames. Results indicate that listeners showed sensitivity to (ING)'s internal constraints but only when the congruence between (ING)'s realization and other cues was not disrupted by using spliced stimuli. These findings suggest that internal constraints and stylistic congruence play a role in social signaling, and have methodological implications for the use of splicing.