Anti-Profiles for Anomaly Classification and Regression

Dinalankara, Wikum

dc.contributor.advisor	Bravo, Héctor C	en_US
dc.contributor.author	Dinalankara, Wikum	en_US
dc.contributor.department	Computer Science	en_US
dc.contributor.publisher	Digital Repository at the University of Maryland	en_US
dc.contributor.publisher	University of Maryland (College Park, Md.)	en_US
dc.date.accessioned	2015-09-18T05:46:08Z
dc.date.available	2015-09-18T05:46:08Z
dc.date.issued	2015	en_US
dc.description.abstract	Anomaly detection is a classical problem in statistical learning with wide-reaching applications in security, networks, genomics, and other fields. In this work, we formulate the anomaly classification problem as an extension of the detection problem: how to distinguish between samples from multiple heterogeneous classes that are anomalies relative to a well-defined, homogeneous, normal class. Our formulation of this learning setting arises from studies in cancer genomics, where this problem follows from prognosis and diagnosis applications. Standard binary and multi-class classification schemes are not well suited to the anomaly classification task since they attempt to directly model these highly unstable, heterogeneous classes. In this work, we show that robust classifiers can be obtained by modeling the degree of deviation from the normal class as a stable characteristic of each anomaly class. To do so, we formalize the anomaly classification problem, characterize it statistically and computationally via kernel methods, and propose a class of robust learning methods, anti-profiles, specifically designed for this task. We focus on an open area of research in cancer genomics which motivates this project: the classification of tumors for prognosis and diagnosis. We provide experimental results obtained by applying the anti-profile method to gene expression data. In addition, we extend the anti-profile approach to use kernel functions, and develop a support-vector machine (SVM) based method for classification of anomalies based on their deviation from a stable normal class. We provide experimental results obtained by applying this method to genetic data to classify different stages of tumor progression, and show that this method yields much more stable classifiers than standard classification methods. In addition, we show that this approach can be applied to anomaly classification problems in other application domains.
We conclude by developing an SVM for censored survival information and demonstrate that the anti-profile method can produce stable classifiers for modeling outcomes in clinical studies of cancer.	en_US
dc.identifier	https://doi.org/10.13016/M2KH1G
dc.identifier.uri	http://hdl.handle.net/1903/16997
dc.language.iso	en	en_US
dc.subject.pqcontrolled	Computer science	en_US
dc.subject.pqcontrolled	Biostatistics	en_US
dc.subject.pquncontrolled	Anomaly Classification	en_US
dc.subject.pquncontrolled	Anomaly Detection	en_US
dc.subject.pquncontrolled	Cancer Genomics	en_US
dc.subject.pquncontrolled	Computational Biology	en_US
dc.subject.pquncontrolled	Machine Learning	en_US
dc.title	Anti-Profiles for Anomaly Classification and Regression	en_US
dc.type	Dissertation	en_US

Files

Original bundle

Name: Dinalankara_umd_0117E_16439.pdf
Size: 2.14 MB
Format: Adobe Portable Document Format

Collections

UMD Theses and Dissertations
Computer Science Theses and Dissertations