MITB Banner

Autoplotter Tutorial – Open Source Python Library For GUI Based EDA

In this article, we will explore Autoplotter through the following points and see how it saves time in Exploratory Data Analysis.
Share
Autoplotter Tutorial

Autoplotter is an open-source python library built on top of Dash which enables the user to do Exploratory Data Analysis using Graphical User Interface. Dash is a python framework built mainly on top of Flask and Plotly.js and used to create web apps. It is open-source and runs the app on the browser window. 

Autoplotter is just like a dashboard in which you can load any dataset and start Exploratory Data Analysis. It supports different types of visualizations, statistical analysis of data, plotting according to your selected features, and creates all the major plots. The best thing about Autoplotter is that you don’t have to waste your time writing code for data analysis and creating different kinds of visual analysis, autoplotter can do all this in just a few lines of code and that too really fast and with a visually appealing Graphical Interface.

In this article, we will explore Autoplotter through the following points and see how it saves time in Exploratory Data Analysis.

  • Using Autoplotter
  • Data Exploration
  • Statistical Analysis
  • Visualizations

Implementation

Like any other python library, we will install Autoplotter using pip install autoplotter.

a. Importing Autoplotter 

from autoplotter import run_app 
Import pandas as pd

b. Loading the Dataset

We will be using a Car Design Dataset which contains different attributes of car design companies. We will load this dataset using pandas and then use the autoplotter for further Exploratory Data Analysis.

df = pd.read_csv(‘car_design.csv’)

c. Loading the Autoplotter

Autolpotter creates the app for Exploratory Data Analysis in just one line of code and that too in just seconds.

run_app(df)
autoplotter

This is the basic layout of the application created, here you can clearly see that the app contains different sections like Data Exploration and Plots. Also, we can see that every attribute has a filter applied by which we can sort it in ascending and descending order and also see the missing or NaN values. 

Exploring different sections of the application:

1. Data Exploration

Data Exploration tab allows us to explore the data and mainly the univariate data analysis. It contains three sections.

a. Data Distribution

In data distribution, we can do univariate or bivariate data analysis to know the distribution of data. It supports different kinds of plots and graphs which are already available and can be selected through different widgets in the application. 

It provides us with a clear view of all the attributes which are present in the dataset.

autoplotter

b. Statistical Analysis

Statistical Analysis is just like the describe function in python, it gives us a clear picture of the statistical properties of the dataset including mean, median, quartiles, outliers, etc.

autoplotter

c. Association

This third portion allows us to find the association of the different attributes of the dataset. The best thing about autoplotter is the widgets which allow you to select different data points that you want to analyze.

In association, we can find the correlation between any two data points as you can see in the below image.



These three are the main parts of the Data Exploration tab.

2. Plots

Plots, as the name suggests, are used to create different plots here you can select any two data points and plot a variety of graphs in between them. 

All the graphs plotted can be manipulated using different parameters that are already loaded in the app. You don’t need to worry about the coding as Autolplotter is fully loaded with all the types of charts and plots and that too highly informative and visually appealing.

Other than the different charts and plots, auto plotter also gives you the choice of selecting the theme of your choice as you can see below in the image.



These plots also help you in finding if there are any irrelevant or missing data points in the dataset, as shown in the image below.



In the histogram above, we can clearly see that the third bar is labelled as ‘?’ which is not a valid data point so we need to remove it in the data cleaning part.

All these are the parts of Autoplotter and it makes your EDA process a lot faster and hassle-free

Conclusion

In this article, we have gone through Autoplotter, a GUI based application that is used for exploratory data analysis. We demonstrated the different parts of the autoplotter in the task of Data Exploration and Visualization. We can conclude that Autoplotter is a really useful tool that is available as an open-source tool.

PS: The story was written using a keyboard.
Picture of Himanshu Sharma

Himanshu Sharma

An aspiring Data Scientist currently Pursuing MBA in Applied Data Science, with an Interest in the financial markets. I have experience in Data Analytics, Data Visualization, Machine Learning, Creating Dashboards and Writing articles related to Data Science.
Related Posts

Download our Mobile App

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

3 Ways to Join our Community

Telegram group

Discover special offers, top stories, upcoming events, and more.

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Subscribe to our Daily newsletter

Get our daily awesome stories & videos in your inbox
Recent Stories

Featured

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

AIM Conference Calendar

Immerse yourself in AI and business conferences tailored to your role, designed to elevate your performance and empower you to accomplish your organization’s vital objectives. Revel in intimate events that encapsulate the heart and soul of the AI Industry.

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed