Written by

Bernard Marr

Bernard Marr is a world-renowned futurist, influencer and thought leader in the fields of business and technology, with a passion for using technology for the good of humanity. He is a best-selling author of 20 books, writes a regular column for Forbes and advises and coaches many of the world’s best-known organisations. He has over 2 million social media followers, 1 million newsletter subscribers and was ranked by LinkedIn as one of the top 5 business influencers in the world and the No 1 influencer in the UK.

Bernard’s latest book is ‘Business Trends in Practice: The 25+ Trends That Are Redefining Organisations’

View Latest Book

Follow Me

Bernard Marr ist ein weltbekannter Futurist, Influencer und Vordenker in den Bereichen Wirtschaft und Technologie mit einer Leidenschaft für den Einsatz von Technologie zum Wohle der Menschheit. Er ist Bestsellerautor von 20 Büchern, schreibt eine regelmäßige Kolumne für Forbes und berät und coacht viele der weltweit bekanntesten Organisationen. Er hat über 2 Millionen Social-Media-Follower, 1 Million Newsletter-Abonnenten und wurde von LinkedIn als einer der Top-5-Business-Influencer der Welt und von Xing als Top Mind 2021 ausgezeichnet.

Bernards neueste Bücher sind ‘Künstliche Intelligenz im Unternehmen: Innovative Anwendungen in 50 Erfolgreichen Unternehmen’

View Latest Book

Follow Me

Artificial Intelligence Explained: What Are Generative Adversarial Networks (GANs)?

2 July 2021

The field of artificial intelligence (AI) is fast-moving, and new breakthroughs are regularly made. One of the more recent terms rising to prominence is Generative Adversarial Network (GAN) – but what does it mean?

The principle behind the GAN was first proposed in 2014, and at its most basic level, it describes a system that pits two AI systems (neural networks) against each other to improve the quality of their results.

To understand how they work, imagine a blind forger trying to create copies of paintings by great masters. To start with, he has no idea what a painting should look like – but he happens to have a friend who has a photographic memory of every masterpiece that’s ever been painted.

This friend – a detective – has to determine whether the paintings his friend is showing match the features of those created by the real great masters, or are obvious forgeries.

This is the basic idea of how a GAN operates – only as they are AIs, both the forger and his friend are able to act at super-speed, making and detecting thousands of forgeries per second. Both of them then “learn” from the outcome to improve their future performance. As the detective becomes better at detecting forgeries, the forger must become better at creating them.

GANs have been the cause of a lot of excitement within the field of AI development in recent years, due to their ability to create “new” information following rules established by existing information. An example might be writing instruction manuals. By training a GAN on thousands of instruction manuals, it could one day be possible to create a system that could look at any tool, device or software and then create instructions on how to use it.

So, let’s look into how this works in a bit more depth. The “forger” network that creates fake data is termed the generative network, and its job is to read and understand the properties of the training data. It then attempts to replicate it by producing “candidate” datasets that follow the same rules.

The “detective” network tasked with determining whether the generative network is outputting false (artificially generated) data or real (training) data is known as the discriminative network. Because it competes against the generative network, the system as a whole is described as “adversarial.”

For a great working example of a GAN in action, look no further than the popular demonstration This Person Does Not Exist. The network powering the website has learned to produce ultra-realistic images of human faces that, while they follow all of the rules regarding the way a human face should look, do not exist outside of the computer programme.

While you might at first assume that the programme builds images of faces by putting together pieces from a database of eyes, ears, mouths, and hair, this isn’t the case. The “input” data for the generative network is simply a string of numbers – only the discriminative network sees the training data. The generative network improves its output based entirely on the output of the discriminative network.

As the only feedback the discriminative network gives is yes/no “guesses” at whether the generative output matches the training data, it takes many, many attempts before it starts to produce output that is acceptably close to the desired outputs – in this case, a realistic-looking image of a non-existent person.

(This example actually uses an updated model of the GAN known as proGAN which was developed by Nvidia last year, and works by gradually increasing the resolution of the image that the network generates, starting with a very low-resolution 4 pixel by 4 pixel image.)

The data used for training an adversarial network does not have to be labelled, as the discriminative network can make judgments on the output of the generative network based entirely on features of the training data itself. This means GANs have applications in unsupervised learning as well as supervised (where the data is labelled) and reinforcement learning.

Another useful feature of GANs is that they can be used to efficiently create training datasets for other AI applications. Most current AI techniques, in particular, Deep Learning, rely on access to large amounts of data for training purposes.

GANs can generate datasets that follow all of the rules of “natural” datasets and so, in theory, can be used for training of deep learning models. A great example of where this would be useful might be medical images, that can be expensive and time-consuming to collect for real – requiring both patient consent and medical expertise to label them.

GANs can be used for creating images, moving video images, text, and even music. While it’s clear that there is a lot of hype surrounding the concept at the moment, it is clearly one of the most interesting new concepts to appear from the AI field in recent years, and we can expect to see many exciting new applications based on it in the near future. 

Business Trends In Practice | Bernard Marr
Business Trends In Practice | Bernard Marr

Related Articles

What Are The Most In-Demand AI Skills | Bernard Marr

What Are The Most In-Demand AI Skills?

It’s predicted that 97 million jobs involving artificial intelligence (AI) will be created between 2022 and 2025. AI has the potential to transform every industry[...]

Human Vs. Artificial Intelligence: Why Finding The Right Balance Is Key To Success

Welcome to the age of blended workforces, where intelligent machines and humans combine to accelerate business success.[...]

10 Skills Robots Can’t Replace In The Workplace (Yet)

Machines can now do many things we would have deemed impossible a few decades ago – but there are still human skills robots can’t yet replace.[...]

The Biggest Technology Trends In Wine And Winemaking

When we think about the wine business, the images that come to mind might be more of vineyards stretching across the French countryside than of robots and digital transformation.[...]

The Benefits And Dangers Of Using AI In Recruitment

Many of us don’t like the idea of putting decisions that can seriously impact people’s lives in the hands of machines.[...]

What You Need To Know Before You Start Working With Artificial Intelligence

It seems like everyone is talking about artificial intelligence at the moment, and there’s good reason for that. We are seeing its revolutionary impact across just about every industry.[...]

Stay up-to-date

  • Get updates straight to your inbox
  • Join my 1 million newsletter subscribers
  • Never miss any new content

Social Media

Yearly Views


View Podcasts