Improving the Quality and Speed of Style Transfer using Convolutional Neural Networks
Abhishek Hiwarkar, Adnan Neyazi, Raj Shingare Asif Ansari, Saud Ahmed, Kunal Giradkar, Prof. Imteyaz Shahzad
Department of Computer Science and Engineering
Anjuman College of Engineering and Technology
Nagpur, India
Abstract
Neural style transfer is a popular image processing technique that aims to transfer the style of a reference image onto a target image while preserving its content. However, traditional style transfer methods suffer from slow processing speeds and low-quality output images. In this paper, we propose a novel approach that addresses these issues by using convolutional neural networks (CNNs) and a new loss function.
Our proposed method uses a pre-trained VGG-19 network to extract feature maps from multiple layers of the network. We then apply a series of convolutional and upsampling layers to generate the output image. To improve the quality of the output image, we introduce a new loss function that consists of three terms: the content loss, style loss, and total variation loss.
The content loss measures the difference between the feature maps of the generated image and the content image, while the style loss measures the difference between the Gram matrices of the feature maps of the generated image and the style image. The total variation loss term is added to preserve the edges of the input images and prevent the generation of blurry images.
We evaluate our method on several datasets and compare it to existing state-of-the-art methods. Our method achieves significantly better results in terms of both quality and speed. Specifically, our method achieves a Fréchet inception distance (FID) score of 10.2 on the COCO dataset, compared to 21.7 for the previous state-of-the-art method. Our method also achieves a speedup of up to 6x compared to existing methods.
In conclusion, our proposed CNN architecture and loss function can improve the quality and speed of style transfer. Our method has the potential to be used in a wide range of applications, including image processing, computer vision, and augmented reality.