Efficient Retrieval of Similar Time Sequences Under Time Warping

dc.contributor.advisorFaloutsos, Christosen_US
dc.contributor.authorYi, B.en_US
dc.contributor.authorJagadish, H.V.en_US
dc.contributor.authorFaloutsos, Christosen_US
dc.contributor.departmentISRen_US
dc.date.accessioned2007-05-23T10:04:26Z
dc.date.available2007-05-23T10:04:26Z
dc.date.issued1997en_US
dc.description.abstractFast similarity searching in large time-sequence databases has attracted a lot of research interest. All of them use the Euclidean distance ($L_2$), or some variation of $L_p$ metric. $L_p$ metrics lead to efficient indexing, thanks to feature extraction (e.g., by keeping the first few DFT coefficients) and subsequent use of fast spatial access methods for the points in feature space. In this work we examine a popular, field-tested dissimilarity function, the "time warping" distance function which permits local accelerations and decelerations in the rate of the signals or sequences. This function is natural and suitable for several applications, like matching of voice, audio and medical signals (e.g., electrocardiograms). However, from the indexing viewpoint it presents two major challenges: (a) it does not lead to any natural "features", precluding the use of spatial access methods (b) it is quadratic ($O(len_1 * len_2)$) on the length of the sequences involved. Here we show how to overcome both problems: for the former, we propose using a modification of the so-called "FastMap", to map sequences into points, trading off a tiny amount of "recall" (typically zero) for large gains in speed. For the latter, we provide a fast, linear test, to help us discard quickly many of the false alarms that FastMap will typically introduce. Using both ideas in cascade, our proposed method achieved up to 7.8-time speed-up over the straightforward sequential scanning, on both read and synthetic datasets.en_US
dc.format.extent277009 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/1903/5886
dc.language.isoen_USen_US
dc.relation.ispartofseriesISR; TR 1997-77en_US
dc.subjectdatabaseen_US
dc.subjectsimilarity matchingen_US
dc.subjectindexingen_US
dc.subjectsequence dataen_US
dc.subjecttime- warpingen_US
dc.subjectIntelligent Signal Processing en_US
dc.subjectCommunications Systemsen_US
dc.titleEfficient Retrieval of Similar Time Sequences Under Time Warpingen_US
dc.typeTechnical Reporten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TR_97-77.pdf
Size:
270.52 KB
Format:
Adobe Portable Document Format