SphereGAN: Novel and Improved Neural Network Developed by Researchers from Chung-Ang University
Deep neural networks are popularly used for object recognition, detection, and segmentation across different avenues. Of these, generative adversarial networks (GANs) are a superior class of neural networks whose performance exceeds that of conventional neural networks. They are meant to minimize the inconsistencies between real and fake data, and have proven successful for image detection, medical imaging, video prediction, 3D image reconstruction, and more.
Despite their growth over the last few years, they are not devoid of limitations. Training conventional GANs is difficult and involves very high computational costs, making them unreliable for complex computer vision problems. In addition, the data generated by them lacks diversity and appears unrealistic. Thus, it is not surprising that many GANs are unstable and can only handle simple data sets.
Recommended AI News: Matillion Launches Expanded Partner Network to Accelerate Modern Data Analytics for Enterprises
In this study, the team proposed a simple and effective integral probability metric (IPM) based GAN known as ‘SphereGAN’. SphereGAN uses multiple geometric moments to examine the inconsistencies between real and fake data distributions on the surface of a ‘hypersphere’. A hypersphere is a sphere with multiple dimensions which belongs to a branch of mathematics known as Riemannian geometry. “Compared to the trends of conventional generative models, we tried to bridge machine learning with the well-known sub-area of mathematics known as ‘Riemannian geometry’, since it has many positive implications for mathematical data,” said Prof. Kwon, explaining their motivation behind this unique approach.
They found that by incorporating Riemannian geometry in this model, SphereGAN demonstrated a considerably improved performance as compared to conventional GANs.
For starters, it could be mathematically verified, was found to undergo stable training, and was successful in generating realistic images. Moreover, it displayed superior mathematical properties which eliminated the need for additional processes such as virtual data sampling—a necessary requirement for conventional GANs. SphereGAN also used multiple geometric moments, which improved its accuracy in measuring distances—an important aspect of image generation.
Recommended AI News: Netskope Acquires WootCloud, Extending Zero Trust Capabilities to Enterprise IoT
On testing this model’s efficacy for unsupervised 2D image generation and 3D point cloud generation, the team observed that the resultant 2D images and 3D point clouds were more realistic than those produced by conventional GANs, regardless of the domain of the dataset.
What are the long-term implications of using SphereGAN? “GANs have been employed for numerous applications the industrial fields, such as the generation of ‘deep fake’ images,” explains Prof. Kwon. “The difference between these fake images and real ones cannot be distinguished by human perception. In the coming years, these and more advanced image generation applications will be possible with a robust model like SphereGAN”.
Recommended AI News: Capital One Enters Enterprise B2B Software Market with Launch of Capital One Software Business
[To share your insights with us, please write to sghosh@martechseries.com]
Comments are closed.