MITB Banner

Data Visualization – A POV from Gramener

Share

Consider this price and sales table for four cities:

 

Can you figure out how each city is performing? Notice that the average for each city is the same.

Now take a look at the same data, plotted.

The patterns are a lot clearer now, and you can quickly see that:

  • The four cities are completely different in behavior and need different strategies for growth.
  • That Delhi is price sensitive, while Bangalore and Hyderabad are not
  • There is at least 1 data point each at Hyderabad and Mumbai that look like aberrations

This is an example of the issue industry faces today: significantly larger quantities of data, but still visualized as plain tables.

[divider top=”1″]

This is an anonymised version of our very first visualization.

 We were working with a leading mobile operator who provided us with minutes-of-usage data. We plotted this time-series on a calendar, creating the Calendar Map you see above. Red cells show days with lower usage, and green cells show days with higher usage.

This made it possible to spot a number of patterns that were relatively hidden until then. For example, on this calendar map, it’s obvious that call volumes are lower on Sundays. But 31st July was a relatively good Sunday, with high call volumes. That’s tough to spot on a line graph because it’s not high in absolute terms — just high for a Sunday.

With this visualization, our client discovered a number of insights in calling pattern behavior of their customer segments. For example, the share of rural traffic rises on Sundays, mainly because urban traffic falls while rural traffic is unaffected. It also made it possible for them to identify specific days on which their competitors’ call volumes shot up, and helped them identify which competitor’s campaigns were proving effective against them.

[divider top=”1″]

Here’s a visualization of the social network of geeks across different cities in India. An interactive version of this is available at http://gramener.com/codersearch. We built this to identify who would be a good candidate to hire, as well as decide which city is the best hunting ground for geeks.

Each circle represents a developer. The size indicates the number of followers they have on Github. The color indicates the language they code in. Networks of followers are connected by lines and clustered together.

This is an instance of transforming relatively unstructured data into quantitative metrics (distance between a pair of people; density of a network; etc) and displaying them purely visually, without any numbers. As a result, it conveys far more richness and meaning intuitively to the viewers.

[divider top=”1″]

Another instance is this visualization of the entire history of batting in Indian one-day cricket. The size of the box represents the number of runs scored by the player. The color indicates the speed at which they scored those runs (red is slow, green is fast.)

It’s evident that among the big scorers, Sehwag is India’s fastest run-getter. Clicking on the players shows an second drill-down featuring every match they’ve played. An interactive version of this is available at http://gramener.com/cricket/batting-India-plain

This compresses over 150 pages of information into a single sheet without any loss. Part of the power of data visualization comes in this ability to compress information and compactly convey insights.

[divider top=”1″]

We’re moving into visualizations of non-quantitative data. There’s a lot more text out there than numbers, and it’s possible to mine information from that. For example, even a pure-text corpus like the Mahabharata lends itself to social network analysis.

Share
Picture of Anand

Anand

Anand has advised and designed IT systems for organisations such as the Aditya Birla Group, Citigroup, Honda, ICICI, IBM, Oracle, RBS & SAP, Steelcase, Tesco, etc. He has an MBA from IIM Bangalore with two gold medals and a B.Tech from IIT Madras. He has worked at IBM, Lehman Brothers, The Boston Consulting Group , Infosys Consulting and co-founded Gramener where he is Chief Data Scientist. He blogs at s-anand.net.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.