Citation Handling: Processing Citation Texts in Scientific Documents
dc.contributor.advisor | Dorr, Bonnie | en_US |
dc.contributor.advisor | Zajic, David | en_US |
dc.contributor.author | Whidby, Michael Alan | en_US |
dc.contributor.department | Computer Science | en_US |
dc.contributor.publisher | Digital Repository at the University of Maryland | en_US |
dc.contributor.publisher | University of Maryland (College Park, Md.) | en_US |
dc.date.accessioned | 2012-10-11T05:51:34Z | |
dc.date.available | 2012-10-11T05:51:34Z | |
dc.date.issued | 2012 | en_US |
dc.description.abstract | Citation sentences (sentences that cite other papers) play a key role in the summarization of scientific articles. However, a citation-based summarization system that depends on generic natural language processing components, such as parsers or sentence compressors, will perform poorly if those components cannot handle citations correctly. In this thesis, I examine the effect of citation handling on parsing, sentence compression, and multi-document summarization. There are two types of citations that occur in citation sentences: constituent citations and parenthetical citations. I propose an automatic citation classifier based on training data created through Mechanical Turk tasks. I demonstrate that the use of type-specific citation handling as pre-processing improves the performance of a state-of-the-art generic parser, both for quality of the parse trees and running time. Extrinsic evaluations demonstrate that improving the performance of a parser on citation sentences in turn improves the performance of a sentence compressor, Trimmer (Zajic et al., 2007), and a multi-document summarization system, MASCS, according to several summarization measures. | en_US |
dc.identifier.uri | http://hdl.handle.net/1903/13176 | |
dc.subject.pqcontrolled | Computer science | en_US |
dc.subject.pquncontrolled | citation | en_US |
dc.subject.pquncontrolled | multi-document summarization | en_US |
dc.subject.pquncontrolled | parsing | en_US |
dc.subject.pquncontrolled | sentence compression | en_US |
dc.title | Citation Handling: Processing Citation Texts in Scientific Documents | en_US |
dc.type | Thesis | en_US |
Files
Original bundle
- Name: Whidby_umd_0117N_13447.pdf
- Size: 532.17 KB
- Format: Adobe Portable Document Format