An efficient neural representation for videos

Chen, Hao

An efficient neural representation for videos

dc.contributor.advisor	Shrivastava, Abhinav	en_US
dc.contributor.author	Chen, Hao	en_US
dc.contributor.department	Computer Science	en_US
dc.contributor.publisher	Digital Repository at the University of Maryland	en_US
dc.contributor.publisher	University of Maryland (College Park, Md.)	en_US
dc.date.accessioned	2023-10-06T05:37:18Z
dc.date.available	2023-10-06T05:37:18Z
dc.date.issued	2023	en_US
dc.description.abstract	With the increasing popularity of videos, it has become crucial to find efficient and compact ways to represent them for easier storage, transmission, and downstream video tasks. Our dissertation proposes an innovative neural representation for videos called NeRV, which stores each video implicitly as a neural network. Building on NeRV, we introduce a hybrid representation for videos called HNeRV, which improves internal generalization and representation capacity. HNeRV allows for highly efficient video representation and compression, with a model size that can be up to 1000 times smaller than the original raw video. Apart from efficiency, HNeRV's simple decoding process, which involves a feedforward operation, enables fast video loading and easy deployment. To enhance efficiency, we develope an efficient neural video dataloader called NVLoader, which is 3-6 times faster than conventional video dataloaders. We also introduce the HyperNeRV framework to address encoding speed, which utilizes a hypernetwork to directly map input videos to NeRV model weights, resulting in a 10^4 faster encoding process. Aside from developing compact and implicit video neural representations, we explore several compelling applications, including frame interpolation, video restoration, and video editing. Furthermore, the compactness of these representations makes them an ideal output video format for video generation models, reducing the search space significantly. Additionally, they can serve as an efficient input for video understanding models.	en_US
dc.identifier	https://doi.org/10.13016/dspace/rpio-zrgb
dc.identifier.uri	http://hdl.handle.net/1903/30742
dc.language.iso	en	en_US
dc.subject.pqcontrolled	Computer science	en_US
dc.subject.pquncontrolled	efficient video loading	en_US
dc.subject.pquncontrolled	Implicit neural representation	en_US
dc.subject.pquncontrolled	Video compression	en_US
dc.subject.pquncontrolled	video editing	en_US
dc.subject.pquncontrolled	video restoration	en_US
dc.title	An efficient neural representation for videos	en_US
dc.type	Dissertation	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Chen_umd_0117E_23302.pdf
Size:: 27.75 MB
Format:: Adobe Portable Document Format

Download

Collections

UMD Theses and Dissertations
Computer Science Theses and Dissertations