53.3% Data Scientists Prefer Python, According To PlaTo Survey Report By AIM

According to a recent report by Analytics India Magazine, the most preferred Data Science programming language used across organisations is Python, with 53.3% of the respondents utilising the language. Other languages that follow are R, Matlab, SAS, Scala, Java and more.

The report titled Analytics Platforms and Tools (PlaTo) Survey was conducted to understand the stack of platforms and tools adopted by leading Analytics, AI, & Data Science organisations. It included surveys across a wide range of platforms and tools including open source and commercial analytics platforms.

The survey was sent across the data science community to understand the adoption and usage of various Cloud Service providers, BI tools, Data Science platforms, AI frameworks, DevOp tools, distributed ML platforms, AutoML tools, Data Lake tools, and more. Respondents included a large spectrum of occupations and vocations including students, research scholars, entrepreneurs and senior professionals from various industries such as Domestic IT, BFSI, FMCG, Fintech, Fashion & Apparel and more.

The report suggests that the most preferred Cloud Service Provider is AWS with close to 50% of respondents using the service. Some of the other popular CSP are MS Azure, Google Cloud Platform which are used by 21.9% and 11.2% of the respondents respectively.

In terms of Database tools, MySQL was found to be the most preferred tool with 26.1% of the respondents using it. This is followed by Hadoop, BigQuery, Amazon Redshift and NoSQL. Whereas the most preferred Data Lake tool is AWS Lake Formation with 14.1% of the respondents utilising this tool. This is followed by Cloudera, Teradata, ADLS and others.

According to the report, Auto-KerasML is the most preferred AutoML tool followed by Auto-PyTorch and Auto-SKLEARN, whereas Git is the most preferred DevOps tool adopted by 28.4% of the respondents.

In terms of AI Frameworks, Scikit Learn is most preferred and is adopted by 19.9% of the respondents, followed by TensorFlow, Keras, PyTorch and Google ML.

Apache Spark was the most preferred Data Science Platform to simplify ML Workflows, with 31.6% of the respondents preferring it. This was followed equally by Databricks and IBM Cloud Pak.

The report aims to enable professionals to identify the most widely used tools in the market and help them build specific analytics capabilities and data science skills. The insights also aim to help organisations developing or creating an analytics function to select the appropriate stack of tools along their data science journeys.


More Great AIM Stories

Siddhartha Thomas
"Siddhartha is an industry research professional with areas of interest across the Digital Media,Traditional Media, and Technology sectors. Siddhartha studies and researches organizations and industries from the perspective of innovation, finance, and strategic management. He has extensive research and knowledge management experience across numerous large and small organizations."

More Stories

OUR UPCOMING EVENTS

8th April | In-person Conference | Hotel Radisson Blue, Bangalore

Organized by Analytics India Magazine

View Event >>

30th Apr | Virtual conference

Organized by Analytics India Magazine

View Event >>

MORE FROM AIM

3 Ways to Join our Community

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Telegram Channel

Discover special offers, top stories, upcoming events, and more.

Subscribe to our newsletter

Get the latest updates from AIM