Crunching numbers and spotting patterns have become the gold standard in the IT industry. And if you need any more affirmation why data analyst jobs are in demand, check out LinkedIn Most Promising Jobs 2017 with Data Engineer at number #9 and Analyst Manager making it to number #18. Another Glassdoor study of 50 Best Jobs in America puts Data Scientist at the top spot with Data Engineer coming at close #3 and Analytics Manager at an enviable #5. However, taking the top slot isn\u2019t easy. You need an armoury of data analyst skills if you want to clock year-on-year growth and fat pay package with this major career advancement.\n\nWhile the internet is abuzz with free resources on how to master the fundamentals of data science, sentiment analysis and fast track machine learning among others, Analytics India Magazine and UpGrad help you cut through the claptrap by listing down #5 basic skills needed to become a data analyst.\n\nPut yourself in the lead for some of the most highly sought after jobs with these skills:\n\nLet\u2019s start with a few basics:\n\nEducational Background: Not everybody can become a data analyst. You need to have a natural leaning towards math and statistics. All those years of learning calculus and probability will come handy. A degree in Computer Science is an added advantage.\n\nStatistics \n\nTo become a full-fledged data analyst, a thorough grounding in statistics is essential, being good at statistics will help you understand algorithms deeply and understand when they should be used. Brush up on applied statistics, linear algebra, real analysis, graph theory and numerical analysis.\u00a0 Linear algebra comes into play with regression, understanding data structures and prepare data for prescriptive and predictive data modelling.\n\nProgramming Skills\n\n1) Statistical Language\n\nSAS vs R vs Python: It\u2019s a question that needles most data nerds when it comes to picking up the analytical tool of choice.\u00a0 While SAS (has an expensive price tag) and Python (billed for low-scale data processing) are easy to learn, R (low level programming language) wins hands down thanks to its advanced computing capability, better graphical capabilities and advanced tools. Since R is open sourced, features and packages get added quickly as opposed to SAS. Another reason why R is thriving is it has a huge ecosystem backing it up that keep it up-to-speed with rich features.\n\nPro Tip: R\u2019s commercial appeal has made it a household name and while SAS is still widely used by enterprises, this statistical language is catching on. But R has a steep learning curve, so here\u2019s a guide that may help.\n\n2) Querying Language \n\nSQL: One of the oldest querying language, SQL is a general-purpose database language which is used for analytical as well as transactional queries. SQL is mainly used in day-to-day operations and cannot support petabytes of data.\n\nHive: This Hadoop query language was invented by Facebook\u2019s Data Infrastructure team. Right from the day that Hive was open sourced in 2008, it has become the popular choice for business analysts. The open source data warehousing solution that uses an SQL type language called HQL can support terabytes and petabytes of data as opposed to SQL. The downside is it only supports structured data.\n\nPIG: One of the biggest advantages for Pig is that it can process both structured and unstructured data and works over MapReduce. It is the go-to language for most programmers who tend to write scripts. What you need to do is learn Pig Latin that helps tackle structured\/unstructured and semi-structured with more ease as compared to Hive. Here\u2019s a bit of history trivia \u2013 Pig was created in Yahoo in 2006 to perform MapReduce jobs.\n\nPro Tip: Knowledge of SQL will help in picking up Pig and Hive.\n\n3) Scripting Language \n\nMATLAB: It\u2019s a language used for data mining and while some would argue that its popularity has declined, it wouldn\u2019t hurt to put it in your arsenal. Remember, MATLAB has been around for a long, long time, invented in late \u201870s as a tool for data analysis.\n\nPython: This is hands down one of the most popular scripting languages and its popularity stems from current stack \u2013 the core libraries NumPy, SciPy, Pandas, matplotlib, IPython. Perfect for modelling and analysis, it has one drawback though \u2013 scalability for large datasets.\n\nPro Tip: Python has a strong community and is best used for scraping website and data engineering. Guess what, it\u2019s so easy that people with a non-programming background can also master it.\n\nMachine Learning\n\nMachine Learning is not only a buzz word. It is finding a lot of utility across domains, hence it is turning out to be an essential skill that data professionals need to have. In ML, regression, classification and segmentation are the broad learning areas where analysts should focus.\n\nData Visualization\n\nYou have all this data; now how do you bring it to life.\u00a0 Your job as a data analyst would be to make evocative reports, find trends and communicate these findings to the top brass. Data visualization tools to master are Tableau, Microsoft Power BI, Oracle Visual Analyser, SAS Visual Analytics.\u00a0 If you like R, you can use the ggplot package to create highly interactive charts and graphs.\n\nPro Tip: Don\u2019t just learn the tools, try understanding the motive of visually encoding data as well.\n\nUnderstanding Databases\n\nEssentially used to better understand the customer, database analysis extends from basic analysis to complex data mining through various tools \u2013 Geographic Information System (GIS) or text analysis. The basic steps for analyzing database is extract, clean, merge, analyses and implement.\n\nData Munging or Data wrangling\n\nBefore you start extracting insights from reams of data, data must be cleaned. In plain speak, somebody needs to do the job of a janitor, which means, manually cleaning data and processing it in a unified format before it is analyzed.\u00a0 So far, excel has been used for cleaning and enriching data, but Stanford debuted an interactive tool, a work-in-progress called Wrangler.\n\nPro Tip: Give Wrangler a try and see how you can manipulate real world data and export it for used in Tableau or R.\n\nData analysts do not require advanced skills like data analysts, however, since these roles are multi-faceted and learning is a continuous process, with additional resources you can become a junior data scientist as well. Essentially, mathematics and statistics (32%), computer science (19%), and engineering (16%) are predominantly the most important fields of study for a data scientist. Data analysts are generally expected to be proficient with languages such as SAS and\/or R. It\u2019s advisable for people with computer science as background to know Python, Hadoop, and SQL coding. Additionally, working with unstructured data is an integral part of the job, so it\u2019s a good idea to be accustomed to unstructured databases. Moreover, a data scientist must imbibe qualities such as developing a business acumen or good communication\/presentation skills, as these skills will help stay ahead of the game.