MITB Banner

MasterClass: Performance Boosting ETL Workloads Using RAPIDS On Spark 3.0

As Data Scientists and Engineers, two of the biggest challenges you face are the exponential growth of data and the slow processing speeds.

Share

MasterClass: Performance Boosting ETL Workloads Using RAPIDS On Spark 3.0

Illustration by MasterClass: Performance Boosting ETL Workloads Using RAPIDS On Spark 3.0

With growing data, the time taken to run ETL (extract, transform, load) processes to support the myriad downstream workloads has also grown. At the current time, Apache Spark has emerged as the de-facto standard for streamlining at-scale ETL workloads and analytics processing. With Apache Spark, organisations are able to process large amounts of data in record time. Spark offers a set of easy-to-use APIs for ETL (extract, transform, load), machine learning, and graph processing for a variety of data sets from different sources. Currently, Spark is being run on millions of on-premise and cloud servers. 

NVIDIA introduced its end-to-end GPU acceleration to Apache Spark 3.0 in 2020. This allows data scientists and machine learning engineers, for the first time, to be able to apply GPU acceleration to ETL workloads. This capability also delivers the performance and the scale needed to bring together the power of AI and the potential of big data.

To help understand and appreciate the true potential of this technology, NVIDIA and Micropoint, with Analytics India Magazine, are organising a webinar on ‘Performance boosting ETL workloads using RAPIDS  on Spark 3.0’ on October 20th 2021. 

The session will be conducted by  Saurav Agarwal, Sr. Enterprise Architect – Big Data, Advanced Analytics & ML, at NVIDIA. He will be speaking about the most commonly used data architectures and ETL workloads, and how they can be accelerated using GPUs and RAPIDS on Adobe Spark 3.0.

Register Now

The webinar will cover —

  • ETL/data architecture and workflows in the industry
  • Hands-on examples of speeding up the workflows using open source plugins on Spark
  • Introduction to best practices around performance optimisation and speed ups.
  • Introduction to NVIDIA RAPIDS and how it can help boost performance of ETL workloads

Who should attend?

  • Data science, data engineering, analytics & Big Data enthusiasts
  • Data engineering professionals & aspirants
  • Aspiring data engineers
  • Working professionals interested in the analytics domain
  • Data science & analytics professionals looking to pivot
  • Students from engineering/technical background

Register Now

Speaker Details:

Saurav AgarwalSr. Enterprise Architect – Big Data, Advanced Analytics & ML

Saurav has around ten years of data industry experience implementing AI/data science/analytics solutions on big data platforms, including large-scale data lake systems. He is an experienced senior architect and seasoned data engineer with experience building distributed real-time data science pipelines. Along with having hands-on architecture and implementation experience in enterprise data landscapes, including Hadoop and Spark ecosystems, Saurav has been part of multiple large-scale projects covering end-to-end data landscape solutions for automotive, supply chain, healthcare, banks, fintech, and more. His top projects include streaming predictive alerts of heart ailments for a primary healthcare provider and building a petabyte-scale data lake for a large fintech firm and its various partner consumers.

Date: 20th October 2021

Time: 6:00 – 7:00 PM (IST)

Register Now

Share
Picture of Analytics India Magazine

Analytics India Magazine

Analytics India Magazine chronicles technological progress in the space of analytics, artificial intelligence, data science & big data by highlighting the innovations, players, and challenges shaping the future of India through promotion and discussion of ideas and thoughts by smart, ardent, action-oriented individuals who want to change the world.
Related Posts

CORPORATE TRAINING PROGRAMS ON GENERATIVE AI

Generative AI Skilling for Enterprises

Our customized corporate training program on Generative AI provides a unique opportunity to empower, retain, and advance your talent.

Upcoming Large format Conference

May 30 and 31, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

AI Courses & Careers

Become a Certified Generative AI Engineer

AI Forum for India

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

Flagship Events

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

MachineCon USA 2024

26 July 2024 | 583 Park Avenue, New York

Cypher India 2024

September 25-27, 2024 | 📍Bangalore, India

Cypher USA 2024

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

Subscribe to Our Newsletter

The Belamy, our weekly Newsletter is a rage. Just enter your email below.