Visible to the public Characterizing and Improving Stability in Neural Style Transfer

TitleCharacterizing and Improving Stability in Neural Style Transfer
Publication TypeConference Paper
Year of Publication2017
AuthorsGupta, A., Johnson, J., Alahi, A., Fei-Fei, L.
Conference Name2017 IEEE International Conference on Computer Vision (ICCV)
Date PublishedOct. 2017
PublisherIEEE
ISBN Number978-1-5386-1032-9
KeywordsGram matrix representing style, Integrated optics, matrix algebra, Metrics, neural style transfer, Optical computing, Optical imaging, Optimization, pubcrawl, real-time methods, Real-time Systems, real-time video style transfer, recurrent convolutional network, recurrent neural nets, resilience, Resiliency, Scalability, solution set, Stability analysis, stability improvement, stylized images, temporal consistency loss, temporally consistent stylized videos, video signal processing, Videos, visible flickering
Abstract

Recent progress in style transfer on images has focused on improving the quality of stylized images and speed of methods. However, real-time methods are highly unstable resulting in visible flickering when applied to videos. In this work we characterize the instability of these methods by examining the solution set of the style transfer objective. We show that the trace of the Gram matrix representing style is inversely related to the stability of the method. Then, we present a recurrent convolutional network for real-time video style transfer which incorporates a temporal consistency loss and overcomes the instability of prior methods. Our networks can be applied at any resolution, do not require optical flow at test time, and produce high quality, temporally consistent stylized videos in real-time.

URLhttps://ieeexplore.ieee.org/document/8237700
DOI10.1109/ICCV.2017.438
Citation Keygupta_characterizing_2017