Human-Centric Deep Generative Models: The Blessing and The Curse

dc.contributor.advisorDavis, Larryen_US
dc.contributor.authorYu, Ningen_US
dc.contributor.departmentComputer Scienceen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2021-09-17T05:40:52Z
dc.date.available2021-09-17T05:40:52Z
dc.date.issued2021en_US
dc.description.abstractOver the past years, deep neural networks have achieved significant progress in a wide range of real-world applications. In particular, my research puts a focused lens in deep generative models, a neural network solution that proves effective in visual (re)creation. But is generative modeling a niche topic that should be researched on its own? My answer is critically no. In the thesis, I present the two sides of deep generative models, their blessing and their curse to human beings. Regarding what can deep generative models do for us, I demonstrate the improvement in performance and steerability of visual (re)creation. Regarding what can we do for deep generative models, my answer is to mitigate the security concerns of DeepFakes and improve minority inclusion of deep generative models. For the performance of deep generative models, I probe on applying attention modules and dual contrastive loss to generative adversarial networks (GANs), which pushes photorealistic image generation to a new state of the art. For the steerability, I introduce Texture Mixer, a simple yet effective approach to achieve steerable texture synthesis and blending. For the security, my research spans over a series of GAN fingerprinting solutions that enable the detection and attribution of GAN-generated image misuse. For the inclusion, I investigate the biased misbehavior of generative models and present my solution in enhancing the minority inclusion of GAN models over underrepresented image attributes. All in all, I propose to project actionable insights to the applications of deep generative models, and finally contribute to human-generator interaction.en_US
dc.identifierhttps://doi.org/10.13016/piwu-o2rj
dc.identifier.urihttp://hdl.handle.net/1903/27856
dc.language.isoenen_US
dc.subject.pqcontrolledArtificial intelligenceen_US
dc.subject.pquncontrolledComputer visionen_US
dc.subject.pquncontrolledDeep learningen_US
dc.subject.pquncontrolledGenerative adversarial networksen_US
dc.subject.pquncontrolledGenerative modelingen_US
dc.subject.pquncontrolledVisual securityen_US
dc.titleHuman-Centric Deep Generative Models: The Blessing and The Curseen_US
dc.typeDissertationen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Yu_umd_0117E_21879.pdf
Size:
50.52 MB
Format:
Adobe Portable Document Format