MITB Banner

This Latest Research Uses Convolutional Neural Networks To Identify Real Eyes In Photos

Share

eyedetector-bn

eyedetector-bn

As technology gets smarter and powerful, computer vision applications are getting omnipresent and are pushing limits as to what can be achieved at a broader level. Computer vision (CV) is no longer restricted to a narrow range of inspection and automation evident in shop floors, or the manufacturing sector.

Be it autonomous driving or medical diagnosis, CV is exploring every critical and non-critical practical elements to resolve complexities associated with them. With areas such as artificial neural networks growing significantly, the field of CV can be coupled with them to augment applications such as facial recognition and video processing.

In this article, we will discuss a recent research study that has come up with detecting eye information in facial images through Convolutional Neural Networks (CNN). This will lay down a path in aiding CV applications that rely on facial features like eyes, in the future.

Current Eye Detection Methods

Eye detection in CV has been a subject of interest in the research community off late. Even though there are plenty of eye detection methods available, many of them lag at obtaining high-efficiency and accuracy in eye detection. Be it the standard oculography techniques, or detection through grey-level images of human faces, the results are not precisely accurate or robust.

In fact, algorithms such as ensemble regression trees, Viola-Jones algorithm etc., have also looked into capturing eye positions and made noteworthy progress. But, then again, they face trouble due to certain factors like illumination and image noises (colour, contrast etc.) innate in the pictures.

Now, a recent study by researchers Bin Li and Hong Fu, has worked on eye detection by adopting CNNs for the technique. Their method will help determine eye locations from images of human faces. Focussing on regions surrounding the eye for analysis, the CNNs would identify the exact eye details (right/left eye and centre of the eye)

How It Works

For the study, the researchers consider databases from GI4E and BioID apart from their own created datasets, for facial image data. Using these data, the method is divided into three steps. The first step entails the calculation of extreme points and gradient values in the images, to generate candidate eye regions. The second step involves deploying a set of CNNs which identifies the eye class (left/right eye) while the third step has another set of CNNs to locate the centre position of the eyes.

Image courtesy: Bin Li, Hong Fu

At the candidates region generation stage, the researchers choose image regions surrounding eyes to mitigate innate problems such as light variations, occlusion etc. in face images captured through face-detector software and algorithms. This is the reason they narrow it down to eye parts like pupil and iris. Li and Fu say:

“We need to quickly propose the valid eye candidate regions that can significantly reduce the search space of the accurate eye location. In our observation, we found that the pupil and iris were darker than other parts of the eye. The locations of the local extreme points in the image are more likely to be the rough centre positions of the eyes.”

In order to do this eye generation process, they use three Gaussian kernels to obtain Gaussian images so that each pixel in the image serves as the reference point amongst themselves for concatenating eye features.

Creating CNNs, Training and Evaluation

Once eye regions are generations, two sets of CNN are built to analyse these Gaussian images for eye classification (right/left eye) and eye centre detection respectively. The CNN architectures are given below.

Information Courtesy:Bin Li, Hong Fu

The first set of CNN has three convolutional layers with each layer accommodating a specified kernel size. This is to input the generated eye regions for classification. On the other hand, the second set of CNN consists of four layers i.e. a convolution layer, an average pooling layer, a fully connected layer, and a logistic perceptron. The outputs from the first set serves as the input for these CNNs.

Once the CNNs are created, they are ready for training. The researchers consider three ‘candidate boxes’ for ascertaining candidate eye regions. Furthermore, Li and Fu manually label images required for creating training samples. Now, training is carried out for both sets of CNNs.

To evaluate the performance, they test the method/algorithm on an Intel(R) Core(TM) i5-6600 desktop computer with 16GB RAM and NVIDIA GeForce GTX 745 GPU. All of the algorithm structure was made available through MATLAB.

The accuracy in detecting eye regions was accounted to be 99 percent and 90 percent respectively for eye classification and eye centre location.

Conclusion

Compared to earlier research in eye detection methods, Li and Fu’s study fare really good in accuracy as well as in terms of efficiency for eye detection. In addition, their study was quick and reduced training time significantly (just four hours). Although, their implementation needs to be tested on a large scale nonetheless the study offers a benchmark status for niche facial feature recognition such as eyes.

 

Share
Picture of Abhishek Sharma

Abhishek Sharma

I research and cover latest happenings in data science. My fervent interests are in latest technology and humor/comedy (an odd combination!). When I'm not busy reading on these subjects, you'll find me watching movies or playing badminton.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.