
A New Trend Of Training GANs With Less Data: NVIDIA Joins The Gang


Following MIT, researchers at NVIDIA have recently developed a new augmentation method for training Generative Adversarial Networks (GANs) with a limited amount of data. The approach is an adaptive discriminator augmentation mechanism that significantly stabilises training in limited data regimes.

Machine learning models are data-hungry. In the past few years, we have seen that models fed with large volumes of data produce outstanding predictive outcomes.

Alongside this growth, Generative Adversarial Networks have been successfully used for various applications, including high-fidelity natural image synthesis, data augmentation and improving image compression. From emoting realistic expressions to traversing deep space, and from bridging the gap between humans and machines to introducing new and unique art forms, GANs have it all covered.

The Need For Less Data

Although deep neural network models, including GANs, have shown impressive results, collecting a large, task-specific dataset remains a challenge. In the case of GANs, the challenge is to collect a large enough set of images for a specific application that places constraints on subject type, image quality, geographical location, time period, privacy or copyright status, among others.

Further, one of the key issues with small datasets is that the discriminator overfits the training examples; its feedback to the generator then becomes meaningless, and training starts to diverge.

According to the researchers, dataset augmentation is the standard defence against overfitting in almost all areas of deep learning. A GAN trained under similar dataset augmentations, however, learns to generate the augmented distribution itself; for instance, a generator trained only on noise-augmented images tends to produce noisy images, which is usually considered highly undesirable.

This is why the researchers developed the new approach, which requires no changes to loss functions or network architectures and is applicable both when training from scratch and when fine-tuning an existing GAN on another dataset. The researchers also demonstrated how to use a wide range of augmentations to prevent the discriminator from overfitting while ensuring that none of the augmentations leak into the generated images.
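To make the idea concrete, here is a minimal PyTorch-style sketch of stochastic discriminator augmentation, with toy networks and a placeholder augment() function standing in for the paper's much richer augmentation pipeline; it is an illustration of the idea, not the authors' implementation. Every image the discriminator sees, real or generated, is augmented first, and because the operations are differentiable, gradients still flow back to the generator.

```python
# Minimal sketch of stochastic discriminator augmentation (toy networks,
# not the StyleGAN2-ADA code). Every image the discriminator scores, real
# or generated, passes through augment() first; the operations are
# differentiable, so the generator still receives useful gradients.
import torch
import torch.nn as nn

def augment(images, p):
    """Placeholder augmentation: with probability p, flip horizontally and
    add a little pixel noise. The real method uses a far larger pipeline."""
    if torch.rand(()) < p:
        images = torch.flip(images, dims=[3])
        images = images + 0.05 * torch.randn_like(images)
    return images

G = nn.Sequential(nn.Linear(64, 3 * 32 * 32), nn.Tanh())    # toy generator
D = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 1))  # toy discriminator
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()
p = 0.3  # augmentation probability; ADA adjusts this value adaptively

real = torch.rand(8, 3, 32, 32)   # stand-in for a batch of real images
z = torch.randn(8, 64)

# Discriminator step: both real and generated images are augmented before
# scoring, so the discriminator never memorises the small set of clean reals.
fake = G(z).view(8, 3, 32, 32)
d_real = D(augment(real, p))
d_fake = D(augment(fake.detach(), p))
loss_d = bce(d_real, torch.ones_like(d_real)) + bce(d_fake, torch.zeros_like(d_fake))
opt_d.zero_grad(); loss_d.backward(); opt_d.step()

# Generator step: its outputs are augmented the same way before being scored.
d_fake = D(augment(G(z).view(8, 3, 32, 32), p))
loss_g = bce(d_fake, torch.ones_like(d_fake))
opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```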

Behind the Model

The researchers call this approach Adaptive Discriminator Augmentation (ADA). They tested the method against a number of alternatives by artificially subsetting larger datasets such as FFHQ and LSUN CAT to study how the quantity of available training data affects GAN training.

Popular models StyleGAN2 and BigGAN were considered as baselines; the researchers chose StyleGAN2 because it provided more predictable results with significantly lower variance between training runs. Adaptive augmentation was also compared against a wider set of alternatives, such as PA-GAN, WGAN-GP, zCR, auxiliary rotations and spectral normalisation.

During the process, the researchers studied the effectiveness of stochastic discriminator augmentation by performing exhaustive sweeps over augmentation categories and dataset sizes. They observed that the optimal augmentation strength depends heavily on the amount of training data and that not all augmentation categories are equally useful in practice.
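As an illustration, the adaptive part can be written as a simple feedback controller. The sketch below assumes the paper's r_t overfitting heuristic, the mean sign of the discriminator's outputs on real images, with a target of around 0.6; the step size and update frequency are chosen for illustration rather than taken from the paper.

```python
# Rough sketch of the adaptive controller in ADA, assuming the r_t
# overfitting heuristic: the mean sign of the discriminator's outputs on
# real images drifts towards 1 as overfitting sets in, so the augmentation
# probability p is nudged up or down to hold r_t near a target value.
import torch

def update_p(p, d_real_logits, target=0.6, step=0.01):
    """Raise p when the discriminator looks too confident on real images,
    lower it otherwise, and keep it within [0, 1]."""
    r_t = torch.sign(d_real_logits).mean().item()
    p = p + step if r_t > target else p - step
    return min(max(p, 0.0), 1.0)

# Example: start with no augmentation and update every few minibatches.
p = 0.0
d_real_logits = torch.randn(64)   # stand-in for discriminator outputs on reals
for _ in range(5):
    p = update_p(p, d_real_logits)
print(p)
```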

Wrapping Up

The results on the FFHQ and LSUN CAT datasets across training set sizes demonstrated that adaptive discriminator augmentation (ADA) improves FIDs substantially in limited-data scenarios. With ADA, the augmentations do not leak, so the same diverse set of augmentations can safely be used on all datasets.

Thus, the researchers showed that adaptive discriminator augmentation reliably stabilises training and vastly improves the resulting quality when training data is in short supply. The contributions of this work make it easier to train high-quality generative models on custom sets of images.

They stated, “Of course, augmentation is not a substitute for real data— one should always try to collect a large, high-quality set of training data first, and only then fill the gaps using augmentation.” They added, “As future work, it would be worthwhile to search for the most effective set of augmentations and to see if recently published techniques, such as the U-net discriminator or multi-modal generator could also help with limited data.”

For more details, read the paper here.


Ambika Choudhury

A Technical Journalist who loves writing about Machine Learning and Artificial Intelligence. A lover of music, writing and learning something out of the box.