Multi-level Motion-Informed Approach for Video Generation with Key Frames

Ping Tat Sin, Hiu Fung Ng, Chi Keung Simon Shiu, Fu Lai Korris Chung, Hong Va Leong

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review


Observing that a motion signal is decomposable into multiple levels, a video generation model which realizes this hypothesis is proposed. The model decomposes motion into a two-level signal involving a global path and local pattern. They are modeled via a latent path in the form of a composite Bezier spline along with a latent sine function respectively. In the application context, the model fills the research gap in its ability to connect an arbitrary number of input key frames smoothly. Experimental results indicate that the model improves in terms of the smoothness of the generated video. In addition, the ability of the model in separating global and local signal has been validated.

Original languageEnglish
Title of host publicationAdvances in Computer Graphics - 36th Computer Graphics International Conference, CGI 2019, Proceedings
EditorsMarina Gavrilova, Jian Chang, Nadia Magnenat Thalmann, Eckhard Hitzer, Hiroshi Ishikawa
Number of pages13
Volume11542 LNCS
ISBN (Print)9783030225131
Publication statusPublished - 1 Jan 2019
Event36th Computer Graphics International Conference, CGI 2019 - Calgary, Canada
Duration: 17 Jun 201920 Jun 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11542 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


Conference36th Computer Graphics International Conference, CGI 2019


  • Global motion path
  • Latent path
  • Local motion pattern
  • Periodic latent function
  • Video generation with key frames

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science


Dive into the research topics of 'Multi-level Motion-Informed Approach for Video Generation with Key Frames'. Together they form a unique fingerprint.

Cite this