8 Ways To Spot A Fake Data Scientist

Data science is one of the fanciest jobs of the decade and there are a lot of people who are looking to call themselves data scientists even if that means they do not have the actual skills. However, it makes hiring data scientists a tedious job as there is no shortage of fake resumes floating around who are looking to get the role. The fraud also stems out from the fact that the job descriptions are not properly understood. This makes many people think that they are data scientists — just because they deal with data. 

To keep away from fake data scientists and hire only real data scientists, it is important for recruiters to be educated about the difference between roles like data scientist, data analyst, data engineers and others. It is also important for them to ask the right questions and keep an eye on some of the points discussed below to spot a fake data scientist. Here are some pointers: 

1| If the candidate does not have knowledge of basic statistical concepts: Most people, while they tend to learn advanced concepts of statistics and machine learning, would flounder on basic statistic techniques. A real data scientist would know basic concepts like clockwork. Start with asking concepts like hypothesis testing and regression. Grill on concepts like heteroscedasticity and probability distribution and you will know the difference.

AIM Daily XO

Join our editors every weekday evening as they steer you through the most significant news of the day, introduce you to fresh perspectives, and provide unexpected moments of joy
Your newsletter subscriptions are subject to AIM Privacy Policy and Terms and Conditions.

2| If the candidate doesn’t understand databases: While statistics is one part of it, the application would happen on a real database and that’s where real data scientists shine. Test on concepts like table joining or how to fetch database queries and you will know the strength of your candidate instantaneously. 

3| If the candidate doesn’t know to code: Real Data scientist is an amalgamation of skills, most vital of them are statistics, programming and business application. A statistician can be data scientists but not until he/she learn how to apply statistics in a business setting. So coding is an essential element for a data scientist. How would anyone apply statistics concepts to a problem unless its through coding on R or python on some dataset?

Download our Mobile App

4| If the candidate does not understand business application: This one is very essential. Real data scientists would know how to apply a statistics technique to a business problem. Just statisticians or “non-data scientists” would not be able to understand it. Test on concepts like Market basket analysis, cohort analysis, churn analysis, marketing mix modelling. Or just throw a business problem to them and ask them to solve it using data science. Asking specific questions about use cases will help to identify true data scientist rather than asking if they know Python or Hadoop.

5| If problem-solving is not one of the key skills: Problem-solving and analytical abilities are the must-have skills for data scientists. If a candidate fails to shows these skills during the interview process, they are not true data scientists. Data scientists go about problem-solving in a specific way which can be used as a way to judge how a person thinks and acts. 

6| If the candidate does not have any projects to showcase: The type and quality of projects that a candidate showcase is a telltale sign of his/her background. Look for signs of genuine impact on business rather than complexity. Most often “non-data scientists” would tend to showcase complexity through the work they have done in the past, whereas most problems have seemingly simple solutions. Rather ask how this project impacted the business, how was it deployed and how it changed existing processes.

7| If the candidate is not asking the right questions: The type of questions and interactions that happen during an interview can indicate if the candidate is genuine or not. A good data scientist might want to ask you questions about the company, how data is collected, team structure, budget of the company to tools and software, and more. Fake data scientists may not be well equipped to come out with such specific questions. 

8| If they lack showcase and networking: While this is not a crucial point, it may be one of the key indicators of whether a candidate is a genuine data scientist or not. It is only natural for a data scientist to be connected to fellow data scientists on social networking sites such as LinkedIn, but they have alarmingly low connects in the field, they may be posers. Also, data science is a hard skill and most would like to showcase it through hackathons etc. See if the candidate has appeared on some hackathons, or attended workshops, conferences etc. Again, not very crucial though.

Sign up for The Deep Learning Podcast

by Vijayalakshmi Anandan

The Deep Learning Curve is a technology-based podcast hosted by Vijayalakshmi Anandan - Video Presenter and Podcaster at Analytics India Magazine. This podcast is the narrator's journey of curiosity and discovery in the world of technology.

Srishti Deoras
Srishti currently works as Associate Editor at Analytics India Magazine. When not covering the analytics news, editing and writing articles, she could be found reading or capturing thoughts into pictures.

Our Upcoming Events

24th Mar, 2023 | Webinar
Women-in-Tech: Are you ready for the Techade

27-28th Apr, 2023 I Bangalore
Data Engineering Summit (DES) 2023

23 Jun, 2023 | Bangalore
MachineCon India 2023 [AI100 Awards]

21 Jul, 2023 | New York
MachineCon USA 2023 [AI100 Awards]

3 Ways to Join our Community

Telegram group

Discover special offers, top stories, upcoming events, and more.

Discord Server

Stay Connected with a larger ecosystem of data science and ML Professionals

Subscribe to our Daily newsletter

Get our daily awesome stories & videos in your inbox

Council Post: From Promise to Peril: The Pros and Cons of Generative AI

Most people associate ‘Generative AI’ with some type of end-of-the-world scenario. In actuality, generative AI exists to facilitate your work rather than to replace it. Its applications are showing up more frequently in daily life. There is probably a method to incorporate generative AI into your work, regardless of whether you operate as a marketer, programmer, designer, or business owner.

Meet the Tech Fanatic, Deedy

Debarghya Das or Deedy is the founding engineer of internal enterprise search space Glean, a company that strives to solve workplace search queries