Text Summarization via Hidden Markov Models and Pivoted QR Matrix Decomposition

dc.contributor.authorConroy, Johnen_US
dc.contributor.authorO'Leary, Dianne P.en_US
dc.date.accessioned2004-05-31T23:09:06Z
dc.date.available2004-05-31T23:09:06Z
dc.date.created2001-02en_US
dc.date.issued2001-05-10en_US
dc.description.abstractA sentence extract summary of a document is a subset of the document's sentences that contains the main ideas in the document. We present two approaches to generating such summaries. The first uses a pivoted QR decomposition of the term-sentence matrix in order to identify sentences that have ideas that are distinct from those in other sentences. The second is based on a hidden Markov model that judges the likelihood that each sentence should be contained in the summary. We compare the results of these methods with summaries generated by humans, showing that we obtain higher agreement than do earlier methods. (Cross-referenced as UMIACS-TR-2001-11)en_US
dc.format.extent658811 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/1119
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-4221en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-2001-11en_US
dc.titleText Summarization via Hidden Markov Models and Pivoted QR Matrix Decompositionen_US
dc.typeTechnical Reporten_US

Files

Original bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
CS-TR-4221.ps
Size:
643.37 KB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-4221.pdf
Size:
240.64 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-4221.ps