Activity Detection in Untrimmed Videos

dc.contributor.advisor: Chellappa, Rama [en_US]
dc.contributor.author: Gleason, Joshua D [en_US]
dc.contributor.department: Electrical Engineering [en_US]
dc.contributor.publisher: Digital Repository at the University of Maryland [en_US]
dc.contributor.publisher: University of Maryland (College Park, Md.) [en_US]
dc.date.accessioned: 2023-10-12T05:36:14Z
dc.date.available: 2023-10-12T05:36:14Z
dc.date.issued: 2023 [en_US]
dc.description.abstract: In this dissertation, we present solutions to the problem of activity detection in untrimmed videos, where we are interested in identifying both when and where various activity instances occur within an unconstrained video. Advances in machine learning, particularly the widespread adoption of deep learning-based methods, have yielded robust solutions in a number of historically difficult computer vision application domains. For example, recent systems for object recognition and detection, facial identification, and a number of language processing applications have found widespread commercial success. In some cases, such systems have been able to outperform humans. The same cannot be said for the problem of activity detection in untrimmed videos. This dissertation describes our investigation and innovative solutions for the challenging problem of real-time activity detection in untrimmed videos. The main contributions of our work are the introduction of multiple novel activity detection systems that make strides toward the goal of commercially viable activity detection. The first work introduces a proposal mechanism based on divisive hierarchical clustering of objects to produce cuboid activity proposals, followed by a classification and temporal refinement step. The second work proposes a chunk-based processing mechanism and explores the tradeoff between tube and cuboid proposals. The third work explores the topic of real-time activity detection and introduces strategies for achieving real-time performance. The final work provides a detailed look into multiple novel extensions that improve upon the state of the art in the field. [en_US]
dc.identifier: https://doi.org/10.13016/dspace/6ku4-w6np
dc.identifier.uri: http://hdl.handle.net/1903/30959
dc.language.iso: en [en_US]
dc.subject.pqcontrolled: Computer science [en_US]
dc.subject.pquncontrolled: Action Detection [en_US]
dc.subject.pquncontrolled: Action Recognition [en_US]
dc.subject.pquncontrolled: Activity Detection [en_US]
dc.subject.pquncontrolled: Activity Recognition [en_US]
dc.subject.pquncontrolled: Computer Vision [en_US]
dc.subject.pquncontrolled: Video Understanding [en_US]
dc.title: Activity Detection in Untrimmed Videos [en_US]
dc.type: Dissertation [en_US]

Files

Original bundle
Name: Gleason_umd_0117E_23671.pdf
Size: 4.47 MB
Format: Adobe Portable Document Format