Semiparametric Cluster Detection

dc.contributor.advisorKedem, Benjaminen_US
dc.contributor.authorWen, Shihuaen_US
dc.contributor.departmentMathematical Statisticsen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2007-09-28T14:57:44Z
dc.date.available2007-09-28T14:57:44Z
dc.date.issued2007-06-08en_US
dc.description.abstractIn this dissertation, a Semiparametric density ratio testing method which borrows strength from two or more samples is applied to moving windows of variable size in cluster detection. This Semiparametric cluster detection method requires neither the prior knowledge of the underlying distribution nor the number of cases before scanning. To take into account the multiple testing problem induced by numerous overlapping windows, Storey's q-value method, a false discovery rate (FDR) methodology, is used in conjunction with the Semiparametric testing procedure. Monte Carlo power studies show that for binary data, the Semiparametric cluster detection method and its competitor, Kulldorff's scan statistics method, both achieve similar high power in detecting unknown hot-spot clusters. When the data are not binary, the Semiparametric methodology is still applicable, but Kulldorff's method may not be as it requires the choice of a correct probability model, namely the correct scan statistic, in order to achieve power comparable to that achieved by the Semiparametric method. Kulldorff's method with an inappropriate probability model may lose power. Moreover, when the data are binary, the Semiparametric density ratio model reduces to the same scan statistic as Kulldorff's Bernoulli model. If a cluster candidate is known, under certain conditions the Semiparametric method achieves a higher power than the power achieved by a certain focused test in testing the hy- pothesis of no cluster. The Semiparametric method potential in cluster detection is illustrated using a North Humberside childhood leukemia data set and a Maryland-DC-Virginia crime data set.en_US
dc.format.extent1662413 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/1903/7204
dc.language.isoen_US
dc.subject.pqcontrolledStatisticsen_US
dc.subject.pquncontrolledsemiparametric density ratio modelen_US
dc.subject.pquncontrolledcluster detectionen_US
dc.subject.pquncontrolledscan statisticsen_US
dc.subject.pquncontrolledfalse discovery rateen_US
dc.subject.pquncontrolledq-valueen_US
dc.subject.pquncontrolledpoweren_US
dc.titleSemiparametric Cluster Detectionen_US
dc.typeDissertationen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
umi-umd-4598.pdf
Size:
1.59 MB
Format:
Adobe Portable Document Format