Improving the Quality and Speed of Style Transfer using Convolutional Neural Networks

Version
Download 36
File Size 469.69 KB
File Count 1
Create Date 26/04/2023
Last Updated 05/05/2023

Download

Description

Improving the Quality and Speed of Style Transfer using Convolutional Neural Networks

Abhishek Hiwarkar, Adnan Neyazi, Raj Shingare Asif Ansari, Saud Ahmed, Kunal Giradkar, Prof. Imteyaz Shahzad

Department of Computer Science and Engineering

Anjuman College of Engineering and Technology

Nagpur, India

Abstract

Neural style transfer is a popular image processing technique that aims to transfer the style of a reference image onto a target image while preserving its content. However, traditional style transfer methods suffer from slow processing speeds and low-quality output images. In this paper, we propose a novel approach that addresses these issues by using convolutional neural networks (CNNs) and a new loss function.

Our proposed method uses a pre-trained VGG-19 network to extract feature maps from multiple layers of the network. We then apply a series of convolutional and upsampling layers to generate the output image. To improve the quality of the output image, we introduce a new loss function that consists of three terms: the content loss, style loss, and total variation loss.

The content loss measures the difference between the feature maps of the generated image and the content image, while the style loss measures the difference between the Gram matrices of the feature maps of the generated image and the style image. The total variation loss term is added to preserve the edges of the input images and prevent the generation of blurry images.

We evaluate our method on several datasets and compare it to existing state-of-the-art methods. Our method achieves significantly better results in terms of both quality and speed. Specifically, our method achieves a Fréchet inception distance (FID) score of 10.2 on the COCO dataset, compared to 21.7 for the previous state-of-the-art method. Our method also achieves a speedup of up to 6x compared to existing methods.

In conclusion, our proposed CNN architecture and loss function can improve the quality and speed of style transfer. Our method has the potential to be used in a wide range of applications, including image processing, computer vision, and augmented reality.

Improving the Quality and Speed of Style Transfer using Convolutional Neural Networks