Standardization of Gram Matrix for Improved 3D Neural Style Transfer
Title | Standardization of Gram Matrix for Improved 3D Neural Style Transfer |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Friedrich, T., Menzel, S. |
Conference Name | 2019 IEEE Symposium Series on Computational Intelligence (SSCI) |
Date Published | Dec. 2019 |
Publisher | IEEE |
ISBN Number | 978-1-7281-2485-8 |
Keywords | 2D binary monochromatic images, 3D geometries, 3D neural style transfer, 3D voxelized objects, binary 3D voxel representation, binary models, classifier network training, color information, computational geometry, convolutional neural nets, convolutional neural network architecture, image classification, Image color analysis, image colour analysis, image representation, learning (artificial intelligence), Mathematical model, Metrics, neural net architecture, neural style transfer, pubcrawl, resilience, Resiliency, Scalability, Shape, Solid modeling, standardization, standardized Gram matrix based loss function, stereo image processing, style similarity, Three-dimensional displays, Two dimensional displays, voxel |
Abstract | Neural Style Transfer based on convolutional neural networks has produced visually appealing results for image and video data in the recent years where e.g. the content of a photo and the style of a painting are merged to a novel piece of digital art. In practical engineering development, we utilize 3D objects as standard for optimizing digital shapes. Since these objects can be represented as binary 3D voxel representation, we propose to extend the Neural Style Transfer method to 3D geometries in analogy to 2D pixel representations. In a series of experiments, we first evaluate traditional Neural Style Transfer on 2D binary monochromatic images. We show that this method produces reasonable results on binary images lacking color information and even improve them by introducing a standardized Gram matrix based loss function for style. For an application of Neural Style Transfer on 3D voxel primitives, we trained several classifier networks demonstrating the importance of a meaningful convolutional network architecture. The standardization of the Gram matrix again strongly contributes to visually improved, less noisy results. We conclude that Neural Style Transfer extended by a standardization of the Gram matrix is a promising approach for generating novel 3D voxelized objects and expect future improvements with increasing graphics memory availability for finer object resolutions. |
URL | https://ieeexplore.ieee.org/document/9002780 |
DOI | 10.1109/SSCI44817.2019.9002780 |
Citation Key | friedrich_standardization_2019 |
- Solid modeling
- Metrics
- neural net architecture
- neural style transfer
- pubcrawl
- resilience
- Resiliency
- Scalability
- Shape
- Mathematical model
- standardization
- standardized Gram matrix based loss function
- stereo image processing
- style similarity
- Three-dimensional displays
- Two dimensional displays
- voxel
- 2D binary monochromatic images
- learning (artificial intelligence)
- image representation
- image colour analysis
- Image color analysis
- image classification
- convolutional neural network architecture
- convolutional neural nets
- computational geometry
- color information
- classifier network training
- binary models
- binary 3D voxel representation
- 3D voxelized objects
- 3D neural style transfer
- 3D geometries