What Programming Languages Do You Need to Work in Data Science?

Data science employers want their software engineers to know Python, along with Hadoop, R, and Spark, but the picture is changing quickly

1 min read

Illustration of people working with data in different ways
Illustration: Shutterstock

Data scientists and software engineers who work with big data are in high demand. Thinknum Media called this field the hottest profession in 2019. Job search site Indeed earlier this year reported that job listings for data scientists jumped 31 percent between 2017 and 2018, while searches only increased 14 percent.

But what skills do you need to fill this lucrative niche?

Indeed set out to answer that question by looking at 500 tech skill terms related to data science that appeared in tech jobs posted on the site during the past five years. The analysis determined that, while Python dominates, Spark is on the fastest growth path and demand for engineers familiar with the statistical programming language R is also growing fast. Also on the radar: Hadoop, Tableau, SAS, Matlab, Redshift, and TensorFlow. [See graph, below, which omits Python because demand is literally off the charts, and because it is not strictly a data science skill.]

In terms of exactly how these skills are being applied, Indeed looked four fields that require data scientists. Machine learning came out on top—and is growing the fastest—followed by artificial intelligence, deep learning, and natural language processing. [See graph, below.]

The Conversation (0)