Active Hackathon

NVIDIA Releases Version 2 Of GauGAN, Its Scaringly Accurate AI Art Solution

GauGAN2 can generate images from phrases like 'sunset at a beach,' which can then be further modified with adjectives like 'rocky beach,' or by changing 'sunset' to a different time of day or even modifying weather conditions. 

NVIDIA recently announced the latest version of NVIDIA Research’s AI painting demo, GauGAN2. The new model is powered by deep learning and consists of a text-to-image feature. The original version could only turn a rough sketch into a detailed image. GauGAN2 can generate images from phrases like ‘sunset at a beach,’ which can then be further modified with adjectives like ‘rocky beach,’ or by changing ‘sunset’ to a different time of day or even modifying weather conditions. 

Powered by generative adversarial networks (GAN), NVIDIA spokesperson said “With the press of a button, users can generate a segmentation map, a high-level outline that shows the location of objects in the scene. From there, they can switch to drawing, tweaking the scene with rough sketches using labels like sky, tree, rock and river, allowing the smart paintbrush to incorporate these doodles into stunning images.”


Sign up for your weekly dose of what's up in emerging technology.
Image Source: NVIDIA Research

By adding text-to-image capabilities, the new version of GauGAN is more customisable and can be tuned much quicker. Even a quick sketch is not nearly as fast and simple as typing a phrase. The latest version is also one of the first AI models to incorporate multiple modalities, text, semantic segmentation, sketch, and style, within a single GAN network.

The Text-based starting point, such as a ‘snow-capped mountain range,’ can be further customised with sketching. Users can add trees, change the height and size of objects, add clouds to the sky and much more. And then GauGAN2 generates a new, modified image.

GauGAN2 may prove useful for concept artists, as they can now create worlds with two suns, like Tatooine in Star Wars. ‘It’s an iterative process, where every word the user types into the text box adds more to the AI-created image,’ NVIDIA spokesperson added further. 

Earlier this year, NVIDIA released a tool built upon GauGAN, NVIDIA Canvas, which can be used on any NVIDIA RTX GPU. In addition, GauGAN2 has been trained on 10 million landscape images using the NVIDIA Selene supercomputer, which is among the world’s ten most powerful supercomputers. If you’d like to try your hand at creating simulated photos of places that never existed, head over to NVIDIA’s AI Playground and click on the “Launch Interactive Demo” button for GauGAN2.

More Great AIM Stories

Victor Dey
Victor is an aspiring Data Scientist & is a Master of Science in Data Science & Big Data Analytics. He is a Researcher, a Data Science Influencer and also an Ex-University Football Player. A keen learner of new developments in Data Science and Artificial Intelligence, he is committed to growing the Data Science community.

Our Upcoming Events

Conference, in-person (Bangalore)
Cypher 2022
21-23rd Sep

Conference, in-person (Bangalore)
Machine Learning Developers Summit (MLDS) 2023
19-20th Jan

Conference, in-person (Bangalore)
Data Engineering Summit (DES) 2023
21st Apr, 2023

3 Ways to Join our Community

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Telegram Channel

Discover special offers, top stories, upcoming events, and more.

Subscribe to our newsletter

Get the latest updates from AIM

Ouch, Cognizant

The company has reduced its full-year 2022 revenue growth guidance to 8.5% – 9.5% in constant currency from the 9-11% in the previous quarter

The curious case of Google Cloud revenue

Porat had earlier said that Google Cloud was putting in money to make more money, but even with the bucket-loads of money that it was making, profitability was still elusive.