Citation Handling for Improved Summarization of Scientific Documents

dc.contributor.author: Whidby, Michael
dc.contributor.author: Zajic, David
dc.contributor.author: Dorr, Bonnie
dc.date.accessioned: 2011-08-03T19:36:39Z
dc.date.available: 2011-08-03T19:36:39Z
dc.date.issued: 2011-07-25
dc.description.abstract: In this paper we present the first steps toward improving summarization of scientific documents through citation analysis and parsing. Prior work (Mohammad et al., 2009) argues that citation texts (sentences that cite other papers) play a crucial role in automatic summarization of a topical area, but did not take into account the noise introduced by the citations themselves. We demonstrate that it is possible to improve summarization output through careful handling of these citations. We base our experiments on the application of an improved trimming approach to summarization of citation texts extracted from Question-Answering and Dependency-Parsing documents. We demonstrate that confidence scores from the Stanford NLP Parser (Klein and Manning, 2003) are significantly improved, and that Trimmer (Zajic et al., 2007), a sentence-compression tool, is able to generate higher-quality candidates. Our summarization output is currently used as part of a larger system, Action Science Explorer (ASE) (Gove, 2011). [en_US]
dc.identifier.uri: http://hdl.handle.net/1903/11822
dc.language.iso: en_US [en_US]
dc.relation.ispartofseries: UM Computer Science Department;CS-TR-4988
dc.relation.ispartofseries: UMIACS;UMIACS-TR-2011-14
dc.relation.ispartofseries: ;LAMP-TR-157
dc.title: Citation Handling for Improved Summarization of Scientific Documents [en_US]
dc.type: Technical Report [en_US]

Files

Original bundle

Name: LAMP-TR-157.pdf
Size: 157.57 KB
Format: Adobe Portable Document Format