Faculty Publications

Generative adversarial networks for video prediction with action control

Zhihang Hu, New Jersey Institute of Technology
Jason T.L. Wang, New Jersey Institute of Technology

Document Type

Conference Proceeding

Publication Date

1-1-2020

Abstract

Predicting future frames in video sequences, known as video prediction, is an appealing yet challenging task in computer vision. This task requires an in-depth representation of video sequences and a deep understanding of real-world causal rules. Existing approaches to the video prediction problem can be classified into two categories: deterministic and stochastic methods. Deterministic methods lack the ability to generate possible future frames and often yield blurry predictions. On the other hand, although current stochastic approaches can predict possible future frames, their models lack action control in the sense that they cannot generate the desired future frames conditioned on a specific action. In this paper, we propose new generative adversarial networks (GANs) for stochastic video prediction. Our framework, called VPGAN, employs an adversarial inference model and a cycle-consistency loss function to obtain more accurate predictions. In addition, we incorporate a conformal mapping network structure into VPGAN to enable action control for generating desirable future frames. In this way, VPGAN is able to produce fake videos of an object moving along a specific direction. Experimental results show that a combination of VPGAN and pre-trained image segmentation models outperforms existing stochastic video prediction methods.

Identifier

85090095722 (Scopus)

ISBN

9783030561499

Publication Title

Lecture Notes in Computer Science Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics

External Full Text Location

https://doi.org/10.1007/978-3-030-56150-5_5

e-ISSN

16113349

ISSN

03029743

First Page

Last Page

105

Volume

12158 LNAI

Grant

1927578

Fund Ref

National Science Foundation

Recommended Citation

Hu, Zhihang and Wang, Jason T.L., "Generative adversarial networks for video prediction with action control" (2020). Faculty Publications. 5549.
https://digitalcommons.njit.edu/fac_pubs/5549

DOI

10.1007/978-3-030-56150-5_5
