Large Latent Semantic Indexing via a Semi-Discrete Matrix Decomposition

dc.contributor.authorKolda, Tamara G.en_US
dc.contributor.authorO'Leary, Dianne P.en_US
dc.date.accessioned2004-05-31T22:42:24Z
dc.date.available2004-05-31T22:42:24Z
dc.date.created1996-11en_US
dc.date.issued1998-10-15en_US
dc.description.abstractWith the electronic storage of documents comes the possibility of building search engines that can automatically choose documents relevant to a given set of topics. In information retrieval, we wish to match queries with relevant documents. Documents can be represented by the terms that appear within them, but literal matching of terms does not necessarily retrieve all relevant documents. There are a number of information retrieval systems based on inexact matches. Latent Semantic Indexing represents documents by approximations and tends to cluster documents on similar topics even if their term profiles are somewhat different. This approximate representation is usually accomplished using a low-rank singular value decomposition (SVD) approximation. In this paper, we use an alternate decomposition, the semi-discrete decomposition (SDD). For equal query times, the SDD does as well as the SVD and uses less than one-tenth the storage for the MEDLINE test set. (Also cross-referenced as UMIACS-TR-96-83)en_US
dc.format.extent173916 bytes
dc.format.mimetypeapplication/postscript
dc.identifier.urihttp://hdl.handle.net/1903/857
dc.language.isoen_US
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_US
dc.relation.isAvailableAtUniversity of Maryland (College Park, Md.)en_US
dc.relation.isAvailableAtTech Reports in Computer Science and Engineeringen_US
dc.relation.isAvailableAtUMIACS Technical Reportsen_US
dc.relation.ispartofseriesUM Computer Science Department; CS-TR-3713en_US
dc.relation.ispartofseriesUMIACS; UMIACS-TR-96-83en_US
dc.titleLarge Latent Semantic Indexing via a Semi-Discrete Matrix Decompositionen_US
dc.typeTechnical Reporten_US

Files

Original bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
CS-TR-3713.ps
Size:
169.84 KB
Format:
Postscript Files
Loading...
Thumbnail Image
Name:
CS-TR-3713.pdf
Size:
180.72 KB
Format:
Adobe Portable Document Format
Description:
Auto-generated copy of CS-TR-3713.ps