About The Rankings

The Data Science Ranking is a rank of the popularity of data science based tools. The rank is recalculated monthly using metrics pulled from a variety of sources. The ranking is created and managed by the team behind Dremio, an open-source data tool. If you have feedback or suggestions for tools to add to the ranking, email us at contact@dremio.com

Rank Name Type Score
1  0 Python Language 1059
2  0 SQL Language 529
3  0 R Language 259
4  0 MATLAB Language 127
5  1 Pandas Library 92
6  1 TensorFlow Library 91
7  0 Scala Language 87
8  1 NumPy Library 63
9  2 SciPy Library 45
10  3 Spark Software 42
11  1 SAS Software 41
12  0 Anaconda Software 38
13  5 scikit-learn Library 38
14  0 Keras Library 37
15  0 Scrapy Library 20
16  1 RapidMiner Software 14
17  3 Dremio Software 13
18  2 KNIME Software 11
19  1 Theano Library 9
20  1 Statsmodels Library 8
21  0 Arrow Library 7

Methodology

Our popularity score is calculated by averaging and weighting a handful of metrics. The calculated "score" is a relative value, meaning the score of one tool is only meaningful when compared to the score of another. The metrics we use in rough order of importance are:

Technical Interest - Calculated with each tool's total number of tagged questions and question views within the last year on StackOverflow.

Search Presence - Calculated using the total number of results returned by search engines as well as the monthly search volume for keywords. For the count of search results we use each term in conjunction with its category to increase the relevance of the search - e.g. "Tableau Software". For monthly search volume we use each term in conjunction with an action-oriented qualifier - e.g. "Install NumPy".

Job Interest - Calculated using the number of people on LinkedIn who have shown interest in each topic and the number of LinkedIn job postings that include the term.

Search Trend - Calculated with Google Trends data. We compare the tool's current search volume relative to its search volume within the past year.

Domain Strength - Calculated from the total number of external domains linking to the tool's website.