MITB Banner

Data Literacy For A Digital India: Why Should You Be Serious About Your Data

Share

“I believe people are smart and some people want to share more data than other people do. Let them know precisely what you’re going to do with their data.” 

Steve Jobs

By 2023, India is estimated to have 650-700 million smartphones as the number of internet users in the country crosses 800 million, according to a McKinsey report. India’s digital economy has the potential to create 60-65 million jobs by 2025. The report finds that India’s digital surge is well under way on the consumer side, even as its businesses show uneven adoption.

Indian government’s push for digitisation was further fueled by last year’s pandemic, which disrupted conventional ways of doing businesses. Conferences became virtual events, schools were converted into online classrooms, and the corporate team outings were replaced by online pizza deliveries.

“If India’s digital leap doesn’t match its growth in data literacy, millions of users will be left vulnerable.”

Huge swathes of the population have embraced internet-based services. However, this tectonic shift begs the question: What will happen to all the generated data? Is the online platform your kid uses to learn algebra susceptible to malware? What happens with the location data of millions of people who have migrated to online payment services? 

How many of the users are asking these questions? Not many. They trust the big tech to take care of it; all one has to do is press the OK on the prompts. In an ideal case, a user should be immune to exploitations. But, it is slowly becoming obvious that customers are the product for many of these service providers. Indian regulators are yet to catch up with their Western counterparts in antitrust hearings and similar lawsuits directed at the big tech. Or maybe there isn’t as much action happening in India to be worried about. In that case, the regulators need to prep its population. With the kind of surge in internet usage in India, it’s imperative to educate the masses on how their data is being used.

Also Read: Big Tech Breakup: Google Blamed For Running A Monopoly

For example, Facebook can track your apps to bombard you with customised ads. Ad revenue accounts for most of Facebook’s profits. The social media giant is incentivised to scrape as much as data it can to convince you to buy things you never knew you needed. 

For better or for worse, privacy policy changes triggered a mass exodus of WhatsApp users to safer messaging apps such as Signal and Telegram. Facebook-owned Whatsapp immediately issued a clarification to stop the departures, and many stayed back; who wants to miss their group chats. Until recently, a large section of Indian population was oblivious to data privacy issues. 

(Photo: Brian X. Chen for NYT)

Blaming Facebook for it’s not-so-transparent data mining strategies is fair. But, what would be the excuse for the exploitation that occurs on accounts of user’s ignorance. “Ask them. Ask them every time. Make them tell you to stop asking them if they get tired of your asking them,” urged Apple’s Steve Jobs back in 2010. Things have changed since then. The internet speeds have upped, and phones are now machine learning applications that process millions of data points. The scale at which the data is generated is unprecedented. So, Jobs’ advice to be relentless in making people know what they are getting into is what all service providers must consider. After a decade, Apple is finally paying heed to Jobs’ remarks. The above picture compares the privacy labels of Whatsapp(left) and Signal(right). 

Apple’s Privacy Nutrition Labels

(Image credits: Apple)

“Data brokers collect data on 700 million consumers worldwide, creating consumer profiles with up to 5,000 characteristics.”

In a recent report, Apple illustrated various ways in which data is exploited. The underlying mechanism can be summarised as follows:

  •  Apps are embedded with trackers. An average app has six trackers.
  • Trackers are used by developers to allow third parties to collect and link data.
  • Data is regularly collected and sold, licensed, or otherwise disclosed to third parties by the Data Brokers.
  • Apps request access to more data than is required.

Apple announced that their devices would inform its users on who is using their data. The Cupertino-based company has declared war on internet giant Facebook, whose whole business runs on ad revenue. Apple has divided the privacy label into three categories: data that is used to track you; linked to you; and data that is not linked to you.

Apple provides users with the “Ask App not to Track,” option to keep the developer from accessing the system’s IDFA or advertising identifier that is often used  for tracking. Apple also makes sure apps comply with their policies.

“Apple’s App Tracking Transparency will require apps to get the user’s permission before tracking their data.”

Apple attributes its success to four principles. The company is adamant on collecting only the minimum amount of data and processing data on the device instead of sending them to Apple servers. While informing people is key in maintaining transparency, the phone maker underlines the importance of hardware and software integration to keep the data secure.

However, Apple’s privacy labels have a lot of room for improvement. For example, Spotify lists “coarse location” as one of the data points. It is difficult for many to understand the mechanism behind the highly discouraging jargon. Though Apple has a long way to go, it at least has begun a dialogue, a movement, which is expected to force other service providers to follow suit. 

Quantifying Data Dignity

We have discussed data brokers, ad tech exploitations and many things the complex web of apps on your phone are subjected to or subject you to.  According to Apple, this data — when pieced together, shared, aggregated, and monetised –makes up a $227 billion-a-year industry. And, all this is happening as we speak without our notice. 

But, how do we put a price tag on data in an unbiased way?

A recent research lists the important valuation schemes to put a price tag on the data such as task-specificness, fairness and efficiency. To formulate a metric that would help price the data, the researchers designed a framework that is equivalent to Shapley value, which is popular with profit allocation schemes. Shapley value attaches a real-value number to each player in the game to indicate the relative importance of their contributions.

Also Read: Get Paid For Your Data, But How?

Jaron Lanier, a tech pioneer, estimates that the total value of data from a four-person household could fetch around $20,000. Lanier urges to look beyond monetary benefit. He wants people to retain their “data dignity.” Data exists because an individual exists and they should have the final say on what happens to their data while getting paid if they accept to share. Today, markets are ruled by data-driven solutions and there is an immediate need for a robust framework that will restore the data dignity of an individual. 

PS: The story was written using a keyboard.
Picture of Ram Sagar

Ram Sagar

I have a master's degree in Robotics and I write about machine learning advancements.
Related Posts

Download our Mobile App

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

3 Ways to Join our Community

Telegram group

Discover special offers, top stories, upcoming events, and more.

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Subscribe to our Daily newsletter

Get our daily awesome stories & videos in your inbox
Recent Stories

Featured

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

AIM Conference Calendar

Immerse yourself in AI and business conferences tailored to your role, designed to elevate your performance and empower you to accomplish your organization’s vital objectives. Revel in intimate events that encapsulate the heart and soul of the AI Industry.

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed