How The Indian Government Is Using Machine Learning And Big Data For Better Metrics

Reuters/Anindito Mukherjee

For a country as big as India, being able to get the data right, is a herculean task. At least before data analytics can come into picture and made things accurate not just by intuition but with numbers to back it up.

India has already set foot on the path to building smart cities. According to the mid-year economic survey released last week, so far as the priority sector interventions are concerned, 22 of the 60 cities have already initiated the smart roads and 18 cities have initiated integrated command and control projects. Additionally, 20 cities have initiated smart water projects and 26 cities have started implementing the solar rooftop projects. Architectural, place-making and city beautification projects have been initiated in 18 cities.

Government turns to Geospatial Analytics

And with that, the government also seems to be committed to use a more complex but accurate form of decision making using machine learning.

“With recent advances in remote sensing technology and machine learning for processing satellite images, we can get much more granular data on how urbanisation is happening across India,” said the mid-year review survey.

The second volume of the Economic Survey released by the Union finance ministry shows that the use of geospatial analytics has begun to make its overdue presence felt in India as well.

“Based on publically available data from the Global Human Settlement Layer (GHSL), we look at how built-up areas show the evolution of human settlements across India since 1975. It is also possible to disaggregate official census population numbers according to the density and form of these settlements to get granular population figures across the country.”

The two metrics based on which India defines its urbanisation are: i) Administrative metric, ii) Census metric. But economists are now beginning to use machine learning—or the use of computer algorithms that learn from data—to extract information from satellite images.

Big Data, Big Information

This may be the first time that the Indian government is using geospatial analytics for deeper understanding of urbanization, but the use of big data is not new. Earlier this year, economists used monthly data of unreserved railway passengers over a period of five years to access the internal migration in the country. Surprisingly, the extent of migration in the analysis was far higher than what the census suggested. The big data metrics also allowed the economists to get a sense of labour flows throughout the country.

In another example, the government also used big data analytics to get an estimate of trade in the country. The economists used Central sales tax invoices for trade between two states to estimate the extent to which states trade with each other.

Moreover, the Indian government is also planning to use big data and analytics information available on corporate houses and individuals for income tax assessments. Several reports reveal that the government will start collecting statistics from usual sources like banks as well as online transactions and social media sites to match the spending and lifestyle patterns of a citizen with income declarations.

The Department of Science and Technology (DST), under the Ministry of Science and Technology and Earth Sciences has been tasked to develop Big Data Analytics (BDA) ecosystem.

National Data Sharing and Accessibility Policy (NDSAP) 2012 of DST is designed to promote data sharing and enable access to government owned data.

Big Data Analytics infrastructure development in India is being steered by the C-DAC (Centre for Development of Advanced Computing), Ministry of Electronics and Information Technology. C-DAC regularly conducts training on “Hadoop for Big Data Analytics” and “Analytics using Apache Spark” for various agencies including Defence.

With all the above stated examples and initiatives of the government, it is evident that big data, machine learning is the key to accuracy and efficient data collection. However, one will need to wait and watch as to how long it takes for the government to fully digitise India and take data collection to the next level.

More Great AIM Stories

Priya Singh
Priya Singh leads the editorial team at AIM and comes with over six years of working experience as a journalist across broadcast and digital platforms. She loves technology and an avid follower of business and startup news. She is also a self-proclaimed baker and a crazy animal lover.

More Stories


8th April | In-person Conference | Hotel Radisson Blue, Bangalore

Organized by Analytics India Magazine

View Event >>

30th Apr | Virtual conference

Organized by Analytics India Magazine

View Event >>

A beginner’s guide to Spatio-Temporal graph neural networks

Spatio-temporal graphs are made of static structures and time-varying features, and such information in a graph requires a neural network that can deal with time-varying features of the graph. Neural networks which are developed to deal with time-varying features of the graph can be considered as Spatio-temporal graph neural networks. 

Yugesh Verma
A guide to explainable named entity recognition

Named entity recognition (NER) is difficult to understand how the process of NER worked in the background or how the process is behaving with the data, it needs more explainability. we can make it more explainable.

Yugesh Verma
10 real-life applications of Genetic Optimization

Genetic algorithms have a variety of applications, and one of the basic applications of genetic algorithms can be the optimization of problems and solutions. We use optimization for finding the best solution to any problem. Optimization using genetic algorithms can be considered genetic optimization

3 Ways to Join our Community

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Telegram Channel

Discover special offers, top stories, upcoming events, and more.

Subscribe to our newsletter

Get the latest updates from AIM