MITB Banner

Andrew Ng To Kickstart A New Generation Of AI

Share

“Andrew Ng will soon be launching a campaign; a competition to push for data-centric models.”

A lot of people joke about how 80 percent of machine learning is simply data cleaning. Additionally, many people look at machine learning as a glorified, technical version of statistics—a field that places a great deal of importance on data. If anything, this tells us one thing for sure: Data is critical. Even a well known face in the ML community, Andrew Ng has stressed how ML needs to take a more data-centric stance rather than a model-centric one.

Nearly 90 percent of ML models built globally are never brought to light, primarily because they cannot adjust to the variety of information available in real-world applications. In a 2020 survey, only 22 percent of companies had made use of their models, many of which took as long as 12 months to bring to users. Traditional software is backed by code while both code and data enable AI systems. However, many software developers still work on codes and model architectures rather than data when they find their ML models in a bit of a fix. 

Earlier this year, Andrew Ng brought attention to MLOps, which deals with utilising machine learning models in production systems. Andrew Ng believes that focusing on data here, instead of only working on improving one’s code, could unlock multitudes of new multimillion-dollar applications of artificial intelligence. He claims that current architectures are highly evolved for identifying photographs, recognising speech or generating text. Tinkering with their architecture is perhaps not the best method to enable them to perform better anymore. 

Ushering next gen AI

The solution Andrew Ng has proposed is to put aside the architecture of an AI model and focus on what it is working with, i.e. the data. By paying close attention to what a model learns and improving the quality of data, and subsequently retraining the ML model, engineers can build higher quality systems in a much shorter time.

Andrew Ng will be launching a campaign to explain this viewpoint on June 17th 2021. The campaign will jump-start with Landing AI’s (a company founded by Ng to increase the use of AI in traditional industries) competition—which will comprise contestants competing to attain the best performance by amending data in an otherwise fixed model. The competition will end on September 4th—which just so happens to coincide with John McCarthy’s birthday (he came up with the term artificial intelligence)—where the top three winners will be invited to a private roundtable event with Andrew Ng, himself, and have opportunities to discuss their ideas and thoughts with everyone present. 

Andrew Ng says that he hopes the competition will change the decades of model-centric tradition held by developers. Despite this model-centric approach, a lot of research backs Ng’s data-centric viewpoint. A Cambridge study reported that the most critical but often overlooked aspect in ML models is data dispersion. Smaller datasets have to deal with noisier data, while larger ones make it more difficult to label them. This makes for significant bottlenecks when deploying ML solutions into the real world. 

Keeping this in mind, Ng says that the shift to data-driven practices will help solve various challenges that AI currently faces, including learning how to perform a task from tens of thousands of data points (instead of the current millions!), learning to understand when humans do not agree (e.g. when different medical experts don’t agree to a diagnosis), picking up inconsistency among data sources, changes in data over time due to something like changes in behaviour, and creating useful synthetic data when actual data is not abundantly available.

Bringing this massive paradigm shift in how AI is built will not be easy. Andrew Ng feels that it will require as much research and development as the shift from ‘old fashioned AI to deep learning’ has in the recent decades. Andrew Ng’s DeepLearning.AI, is initiating a course to teach this data-centric approach on easy-to-reach platforms like Coursera (interestingly, also founded by Andrew Ng). He has also given various presentations on DeepLearning.AI’s YouTube channel and Amazon Web Service’s Machine Learning Summit. Andrew Ng believes that the right people can put this idea to use constructively to counter many issues, such as manufacturing, treating diseases, energy consumption and food production, all with the help of AI-backed with the appropriate data. 

Share
Picture of Mita Chaturvedi

Mita Chaturvedi

I am an economics undergrad who loves drinking coffee and writing about technology and finance. I like to play the ukulele and watch old movies when I'm free.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.