15 steps to become a Data Scientist
- Pablo Marin
- Apr 13, 2016
- 4 min read
Data Scientist, Machine Learning, Deep Learning, A.I? ....
Have you heard these words lately? I hope you do because within the next year or two it will become the HOTTEST job/career on the computer science/developer world.

I am introducing in this blog: 15 steps to begin a career as a "Data Scientist". I said "begin" because, just as in any other field or industry, it is the experience and 10,000 hours of practice (read Malcom Gladwel's "Outliers") what will make you GOOD at something.
The world wide web is infinite, and sources of information about "data science" are uncountable. Some sources better than others obviously, but regardless the amount of information is A LOT. Is for this reason that I am writing this blog post. After many months of reading and researching, I put together below what I consider a decent study plan that will give you an excellent foundation to begin the journey in the fabulous world of Data Science and Big Data.
Before starting on this plan, let me make something very clear and be very blunt about it: This is not for anyone. The first thing you need to have before even thinking about being a data scientist is a college degree in Computer Science, Engineering, Math or Statistics.
If you don't have or plan to have any of these degrees, I sincerely, from the bottom of my heart, recommend that you do something else :)
Ok, so, without more introduction, here you go: 15 Steps to become a Data Scientist:
1) Read this book: Data Science from Scratch
2) Complete, with the certificate, the following online courses:
3) Readings on RNN (Recurrent Neural Networks):
4) Study and Practice on these Libraries and Machine learning tools:
- Azure ML
5) Complete the following training on Spark:
- Read this book: Learning Spark
- Complete this online course on EDX: Distributed Machine Learning with Spark
- Open a "Comunity Edition" acccount on Databricks and complete all the examples and training.
7) Training on H2O.ai:
8) Databricks + Spark + H2O.ai:
9) Training on Keras and Lasagne over Theano:
10) Training on TensorFlow:
11) Training on Data munging, formats and Visualization:
12) Tips and Tricks:
13) Print and hang these images on your wall:
- Image – Machine Learning Algorithms
- Image – SciKit-Learn Cheat Sheet
- Image - Azure ML Algorithm Cheat Sheet
- Image – Road to Data Scientist
- Image – Machine Learning Process
- Image – Azure ML Studio diagram
- Image – Learning Paradigms and Algorithms
- Image – Data Visualization in Python Cheat Sheet
14) Training on Kaggle:
15) Compete on Kaggle and Cortana Intelligence competitions until you reach top 10