How to navigate uncertainty in deep learning

Machine learning rests heavily on probability theory. Hence, managing uncertainty (read: imperfect or incomplete information) is central to machine learning (ML) projects.

Ideally, a deep learning model produces dependable predictions on data drawn from the same distribution it was trained on. In practice, however, there are often disparities between the distribution the model was trained on and the one it is applied to. For example, a 2018 study found that deep learning models trained to detect pneumonia in chest x-rays did not achieve the same degree of accuracy when evaluated on previously unseen data from other hospitals.

Methods such as Gaussian processes quantify how uncertain a prediction is, which is very helpful in data analysis and decision making. For instance, an autonomous car may use this uncertainty estimate to decide whether it should brake or not.
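As a minimal sketch of this idea (assuming scikit-learn is available; the toy dataset below is invented purely for illustration), a Gaussian process regressor can return a standard deviation alongside each prediction, and that uncertainty grows as queries move away from the training data:

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

# Hypothetical 1-D regression task: noisy samples of sin(x).
rng = np.random.default_rng(0)
X_train = rng.uniform(-3, 3, size=(20, 1))
y_train = np.sin(X_train).ravel() + rng.normal(0, 0.1, size=20)

# alpha models the observation noise variance on the diagonal.
gpr = GaussianProcessRegressor(kernel=RBF(), alpha=0.1**2, random_state=0)
gpr.fit(X_train, y_train)

# Query one point inside the training range and one far outside it;
# return_std=True gives the predictive standard deviation.
_, std_in = gpr.predict(np.array([[0.0]]), return_std=True)
_, std_out = gpr.predict(np.array([[10.0]]), return_std=True)
print(std_in[0], std_out[0])  # the GP is far less certain away from the data
```

A downstream system (such as the braking decision above) could threshold this standard deviation rather than trusting the point prediction alone.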

That said, in assessing data and making decisions, it is important to be able to question whether a model is certain about its output. While this is a central concern of Bayesian machine learning, deep learning models often ignore it, leading to situations in which it is difficult to tell whether a model is making a reasonable prediction or guessing at random.
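One simple way to "question" a classifier, sketched below with made-up logits rather than real model output, is to compute the entropy of its softmax distribution: a peaked output has low entropy, while a near-uniform output (effectively a random guess) has high entropy.

```python
import numpy as np

def softmax_entropy(logits):
    """Entropy (in nats) of the softmax distribution over the logits."""
    p = np.exp(logits - logits.max())  # shift for numerical stability
    p /= p.sum()
    return -(p * np.log(p)).sum()

# Hypothetical logits for a 3-class problem.
confident = np.array([8.0, 0.5, 0.2])  # strongly favours class 0
guessing = np.array([1.0, 1.1, 0.9])   # nearly uniform: close to a coin flip

print(softmax_entropy(confident), softmax_entropy(guessing))
```

A system could flag predictions whose entropy exceeds some threshold for human review instead of acting on them.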

Epistemic uncertainty 

There are two major types of uncertainty in deep learning: epistemic uncertainty and aleatoric uncertainty. 

Epistemic uncertainty refers to what a model doesn’t know because of limitations in its training data. It arises when a model lacks sufficient data and knowledge, which usually happens when there aren’t enough samples available for training.

The observations collected from a domain can rarely be chosen without some systematic bias. While some level of bias is unavoidable, uncertainty increases when the bias and variance in the sample make it a poor representation of the task for which the model will be used.

Unfortunately, in most cases, developers have little control over the sampling process and simply obtain their data from whatever database or CSV file they have access to. Complete coverage of a domain is impossible: there will always be some unobserved cases.
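A common rough proxy for epistemic uncertainty is ensemble disagreement: train several models on resamples of the data and measure how much their predictions diverge. The sketch below (an assumption of this article's editor, not a method from the text, with invented data and small polynomial models standing in for networks) shows that disagreement stays low where training data exists and blows up in the unobserved region:

```python
import numpy as np

rng = np.random.default_rng(1)
# Hypothetical data: noisy samples of x^2, observed only on [0, 2].
X = rng.uniform(0, 2, size=40)
y = X**2 + rng.normal(0, 0.05, size=40)

# Fit an ensemble of cubic models on bootstrap resamples of the data.
models = []
for _ in range(10):
    idx = rng.integers(0, len(X), size=len(X))
    models.append(np.polyfit(X[idx], y[idx], deg=3))

def ensemble_std(x):
    """Spread of ensemble predictions at x: a proxy for epistemic uncertainty."""
    preds = np.array([np.polyval(coeffs, x) for coeffs in models])
    return preds.std()

# Inside the training range the members agree; far outside they diverge.
print(ensemble_std(1.0), ensemble_std(6.0))
```

Because epistemic uncertainty reflects missing knowledge, collecting data near x = 6 and retraining would shrink the disagreement there, unlike aleatoric noise, which no amount of data removes.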

Aleatoric uncertainty 

Aleatoric uncertainty describes the uncertainty that arises from the natural stochasticity of the observations themselves. Even observations from the domain used to train a model are imperfect: they carry inherent randomness.

High aleatoric uncertainty occurs when the observations themselves are noisy or ambiguous. Unlike epistemic uncertainty, this type of uncertainty cannot be remedied by providing additional data.

Noise in observations occurs when the observations from the domain aren’t precise: in other words, they contain noise. “Observations,” in this instance, refers to what was measured or collected: the input as well as the expected output of a model. “Noise,” on the other hand, refers to the variability in those observations. This variability may be natural variation or measurement error, and it affects both the input and the output of the model.
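The irreducibility of this noise can be shown with a short numerical sketch (hypothetical measurement noise, chosen for illustration): collecting more observations pins down the *mean* ever more precisely, but the spread of the individual observations never shrinks below the noise level.

```python
import numpy as np

rng = np.random.default_rng(2)
true_value, noise_std = 5.0, 0.5  # assumed quantity and measurement noise

for n in (10, 10_000):
    obs = true_value + rng.normal(0, noise_std, size=n)
    # Standard error of the mean shrinks as n grows (epistemic part)...
    sem = obs.std(ddof=1) / np.sqrt(n)
    # ...but the observation-level spread stays near noise_std (aleatoric part).
    print(n, sem, obs.std(ddof=1))
```

This is why the only fix for aleatoric uncertainty is better sensors or cleaner measurement, not a bigger dataset.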

Since data in the real world is messy and imperfect, we should be skeptical of data and develop systems that can navigate uncertainties. 

Error prone

ML models are susceptible to errors, yet some models remain useful despite being wrong. How much error a model accumulates depends on the procedure used to develop it, including the selection of samples, the decisions made when tuning hyperparameters, and the way model predictions are constructed.

Hence, given the uncertainty inherent in deep learning, the goal should be to build models with good relative performance and to extend established learning methods to account for their margin of error.

Srishti Mukherjee