M42 — Orion Nebula

Data Science Learning Resources

Andreas Maier

--

Data Science is a very interesting area to learn about. To help you with that I collected here some resources which are mostly free, but of high quality.

Absolute Beginner

If your are an absolute beginner and want to know more about data science I recommend the following courses:

Data Science and Machine Learning

For data science and machine learning in general I like the following books and lectures the most:

Beginners

Scipy Lecture Notes

Brownlee: Machine Learning Mastery

Segaran 2007: Programming Collective Intelligence

I want to note here that the term “collective intelligence” is much better than “artificial intelligence”. It makes clear that data scientists are not building some homunculus. Instead data scientists and machine learning engineers are extracting patterns from huge amounts of data and use these patterns to classify, predict or generate new data. Unfortunately the term “collective intelligence” — as suggested by the Segaran— did not become very popular, but I think it should be.

Grus 2019: Data Science from scratch

VanderPlas 2023: Python Data Science Handbook

Advanced

Bishop 2006: Pattern Recognition and Machine Learning

Hastie 2017: The Elements of Statistical Learning

Geron 2022: Hands-On Machine Learning with Scikit-Learn & TensorFlow

James 2023: An Introduction to Statistical Learning (with Applications in Python)

Deep Learning

If you want to know more about deep learning specifically you will find the following books and courses helpful:

GoodFellow et.al. 2016: Deep Learning

Howard, Gugger: Deep Learning for Coders with Fastai and PyTorch: AI Applications Without a PhD

Howard et.al: Practical Deep Learning for Coders

Zhang et.al. : Dive into Deep Learning

UPDATE:

This book was published just recently and is endorsed by the the “Godfathers of Deep Learning” Hinton, LeCun and Bengio:

Bishop et. al. 2023: Deep Learning: Foundations and Concepts

Recommendation

If I would have to choose just one resource from above it would be

Geron 2022: Hands-On Machine Learning with Scikit-Learn & TensorFlow

It has a very good mixture of theory and practice. It also includes Deep Learning, Transformers, GANS and Reinforcement Learning. It is up-to-date. It also has historical information and points to very interesting resources itself. It has everything, except for a low price.

--

--

Andreas Maier

PhD in Astrophysics, currently working as Senior Data Scientist & Machine Learning Engineer