MITB Banner

Can TensorFlow’s New Face Landmarks Model Improve Iris Tracking In Mobile Devices?

Share
Nvidia Shows How To Build AI Models At Scale With PyTorch Lightning

Open-source machine learning platform TensorFTlow has announced that it would be adding iris tracking to its face mesh package. The iris tracking has been added to this package through the TensorFlow.js face landmark detection model.

It must be noted that the face mesh package was introduced in TensorFlow.js earlier this year in March. This package uses a single camera to derive approximate 3D facial surface geometry from an image or a video stream, without even a depth sensor. It was introduced with an ability to locate eyes, nose, lips (along with lip contours), and facial silhouette. With the addition of iris tracking now, it would be possible to detect eye movements, including blinking. It is implemented through the MediaPipe iris model, which is again an open-source ML model, with no requirement for additional hardware.

Iris Landmark Tracking

Iris tracking, especially on mobile devices, is a challenging task to perform. There are several hurdles to overcome, such as constrained computing resources, presence or occlusion such as hair strands or squinting of eyes, and even variable light conditions available. Most often, separate hardware is deployed for this function. This hardware includes expensive headsets and remote eye-tracking systems. Its high cost is not the only problem of deploying hardware; given how bulky they are, it becomes for usage with mobile devices unfeasible.

However, with the new announcement from TensorFlow, the users will be able to upgrade to the new face landmark detection model by just making a few code changes and no additional hardware installations. This package can be installed in two ways: by using script tags or by using NPM.

Further, this new model offers three significant improvements: iris keypoints detection, better eyelid contour detection, and improved detection for rotated faces.

As discussed previously, this package cancels the need for having separate hardware, thereby establishing compatibility with mobile devices. The fact that this is a lightweight package that contains only 3MB of weights makes it even more suitable for real-time interference on mobile devices. 

Further, with TensorFlow.js, a user could choose between a variety of different backends such as WebGL and WebAssembly (WASM) with XNNPACK for devices with lower-end GPUs.

As part of future enhancements, the TensorFlow.js and the MediaPipe teams would now be adding depth estimation capabilities to the face landmark detection using improved iris coordinates. The teams will also make the code available for facilitating reproducible research and developer community’s further usage.

The full blog from TensorFlow can be found here.

About MediaPipe Iris

MediaPipe Iris was announced in August this year, as a machine learning model for accurate iris estimation built for use on modern mobile phones, desktops, laptops, and over the web. Along with tracking iris landmarks involving iris, pupil, and eye contour, this model also showcased that ability to determine the metric distance between the subject and the camera. It demonstrated an error rate of just 10% without the use of a depth sensor. This was ensured by relying on the fact that the horizontal iris diameter of the human eye remains constant across different populations along with some simple geometric arguments.

The model was built upon the previous work on 3D Face Meshes from which the eye region of the original image was isolated for use in the iris tracking system. The problem was broadly divided into two parts — eye contour estimation and iris location.

A multi-task model that consists of a unified encoder with different components for each task was designed for using task-specific training data. This model was trained upon manually annotated 50,000 images which described a variety of illumination conditions, and head rotation poses from diverse regions. 

A detailed blog on this can be read here.

PS: The story was written using a keyboard.
Share
Picture of Shraddha Goled

Shraddha Goled

I am a technology journalist with AIM. I write stories focused on the AI landscape in India and around the world with a special interest in analysing its long term impact on individuals and societies. Reach out to me at shraddha.goled@analyticsindiamag.com.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India