Which programming language to learn: R or Python?
Maggie Petrova, Data Scientist @ObjectiveIT, Mar-2018
Outline
• What are the R and Python programming languages used for?
• Which programming language is more popular?
• Pros of R and Python
• And the winner is
• Learning curve of the language
• Where do you start
• My top 3 tips
What are the R and Python programming languages used for?
• Artificial intelligence (produces actions)
• Machine learning (produces predictions)
• Data science (produces insights)
Languages: R, Python
IEEE (Institute of Electrical and Electronics Engineers) Top Programming Languages
Source: https://spectrum.ieee.org/static/interactive-the-top-programming-languages-2017
Popularity on Stack Overflow
Source: https://stackoverflow.com/tags, as of 15/03/2018
See also:
https://insights.stackoverflow.com/survey/2018#most-loved-dreaded-and-wanted
https://madnight.github.io/githut/#/pull_requests/2017/4
Who is using Python and/or R?
(Figure panels: Python, R)
Pros of R and Python

What is R good at?
R is a free, open-source programming language and software environment for statistical computing, visualisation and general computing. R is mainly used when the data analysis task requires standalone computing or analysis on individual servers. It’s great for exploratory work, and it's handy for almost any type of data analysis because of the huge number of packages.

What is Python good at?
Python is a dynamic and strongly typed programming language that is used for a wide range of applications, from testing microchips at Intel, to powering Instagram, to building video games with the PyGame library. It is a general-purpose, high-level programming language that is designed to emphasize usability, and it allows programmers to express concepts in fewer lines of clear code. You can use Python when your data analysis tasks need to be integrated with web apps or when statistics code needs to be incorporated into a production database. Being a fully fledged programming language, it’s a great tool for implementing algorithms for production use.
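A minimal sketch of what "dynamic and strongly typed" means in practice. The variable names are illustrative only and are not from the talk: a name can be rebound to a value of a different type at runtime, but Python will not silently combine unrelated types.

```python
# Dynamic typing: the same name can refer to values of different types.
value = 42
first_type = type(value).__name__   # type name while value holds an integer
value = "forty-two"
second_type = type(value).__name__  # type name after rebinding to a string

# Strong typing: mixing a string and an integer raises TypeError
# instead of guessing a conversion.
try:
    result = "1" + 1
except TypeError:
    result = None  # the addition was rejected

# An explicit conversion states the intent and works.
total = int("1") + 1
```

Here the explicit `int("1")` conversion is the design point: Python asks you to say which interpretation you want rather than choosing one for you.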
And the winner is?
Transition to R or Python (setting the right expectations)
Columns: Your background | R | Python
Backgrounds:
• Statistician
• Developer
• Non-statistician
• Analyst (I love Excel)
R learning curve
Python difficulty
Malbolge is a programming language named after the eighth circle of hell in Dante's Inferno. It was specifically designed to be almost impossible to use, via a counter-intuitive 'crazy operation', and self-altering code.
Where do you start?
R: RStudio; libraries ggplot2, zoo, stringr, data.table, dplyr, caret
Python: Jupyter Notebook, Spyder IDE; packages scikit-learn, matplotlib, pandas, NumPy
Where do you start? Key libraries
Category | R | Python
Integrated development environment (IDE) | RStudio | Jupyter Notebook, Spyder, PyCharm, etc.
Popular packages/libraries:
Data manipulation & computing | dplyr, plyr, data.table | pandas, NumPy, SciPy
Text mining | stringr | string
Time series | zoo, xts | Prophet
Visualisation and reporting results | ggvis, lattice, ggplot2, shiny, RMarkdown | Matplotlib, Seaborn, Plotly
Machine learning | caret, randomForest, nnet | scikit-learn, Keras, TensorFlow, NLTK
Data scraping | rvest | Scrapy
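As a small, hedged illustration of the text mining row, here is a sketch that uses only Python's standard `string` module plus `collections.Counter`. The sample sentence and variable names are made up for this example; real text mining work would typically need more than this.

```python
import string
from collections import Counter

text = "R or Python? Python, or R... or both!"  # hypothetical sample text

# Build a translation table that deletes every ASCII punctuation character
# listed in string.punctuation.
table = str.maketrans("", "", string.punctuation)

# Strip punctuation, lower-case the text, and split it on whitespace.
words = text.translate(table).lower().split()

# Count how often each word occurs.
counts = Counter(words)
print(counts.most_common(3))
```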
Where to learn
• Google’s Machine Learning Crash Course
My top 3 tips
1. Forget about Excel.
2. Learn by doing. Break down your task into small chunks and try to solve them one by one.
3. Use the community knowledge.
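Tip 2 could look something like this in Python. The task (average sales per region from a small CSV), the data, and the function names are all hypothetical, chosen only to show one small function per chunk, each of which can be written and checked on its own.

```python
import csv
import io
from statistics import mean

# Hypothetical input; in practice this would come from a file.
RAW = """region,sales
north,100
south,80
north,120
south,
"""

def read_rows(text):
    """Chunk 1: parse the CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))

def drop_missing(rows):
    """Chunk 2: keep only the rows that have a sales value."""
    return [row for row in rows if row["sales"]]

def average_by_region(rows):
    """Chunk 3: group sales by region and average each group."""
    groups = {}
    for row in rows:
        groups.setdefault(row["region"], []).append(float(row["sales"]))
    return {region: mean(values) for region, values in groups.items()}

# Solve the chunks one by one, inspecting each intermediate result.
rows = read_rows(RAW)
clean = drop_missing(rows)
averages = average_by_region(clean)
print(averages)
```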

Which programming language to learn R or Python - MeasureCamp XII


Editor's Notes

  • #4 Both languages are used for data science, ML and AI to produce insights, predictions and actions. But still, which one is better?
  • #6 Stack Overflow is the largest, most trusted online community for developers to learn, share their programming knowledge, and build their careers.
  • #12 Malbolge is a public domain esoteric programming language invented by Ben Olmstead in 1998, named after the eighth circle of hell in Dante's Inferno, the Malebolge. Malbolge was specifically designed to be almost impossible to use, via a counter-intuitive 'crazy operation', base-three arithmetic, and self-altering code.
  • #14 IDE - integrated development environment