There is a Japanese word, *tsundoku* (積ん読), which means buying and keeping a growing collection of books, even though you don’t really read them all. Consider getting these Machine Learning books if you don’t want that to happen to you.

I think we Developers and Data Scientists are particularly prone to falling into this trap. Personally, I even hoard bookmarks: my phone’s Chrome browser has so many open tabs, the counter was replaced with a “:D” emoji.

In that zeal for **reading** and **learning** most of us experience, we usually end up **lost**, not sure of **which book to pick up next**. That’s why today I’ll give you a very **short list**: just 3 Machine Learning books, so that you won’t just bookmark it and forget it.

## Machine Learning Books

Each of these books has **helped me immensely** in different stages of my **career as a Data Scientist**, particularly in my role as a Machine Learning Engineer.

The first one is more of an introductory piece, perfect if you don’t know **how to become a data scientist**.

The second and third ones dive much deeper into Machine Learning and Deep Learning respectively, and will help you become a more grounded professional.

Without further introduction, here come the books!

## O’Reilly: Data Science from Scratch with Python

I’ll review Data Science from Scratch with Python first, since it’s the most **introductory** or broad one in this list.

I have a very personal attachment to this book, since it’s the one that **got me my job**. That’s right! I knew next to nothing about Data Science, even what Data Science was, before picking up this book.

I did have a pretty strong Probability and Statistics background, and knew enough Python to defend myself. However, I was missing the **practical side** of it.

Data Science from Scratch did many things for me. It:

- Showed me how to **process data in Python** efficiently and elegantly (following Python’s **good practices**).
- Taught me how to **implement** most simple **Machine Learning algorithms** from scratch.
- Showed me what the **day-to-day job** of a Data Scientist may look like.
- Taught me how to **communicate my results** to others clearly.
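
To give a flavor of what implementing an algorithm "from scratch" can look like, here is a minimal sketch of simple linear regression (a least-squares line fit) in plain Python. It is not taken from the book: the function names `fit_line` and `predict` are made up for illustration, and it assumes a single input feature.

```python
from typing import List, Tuple


def mean(xs: List[float]) -> float:
    """Arithmetic mean of a non-empty list."""
    return sum(xs) / len(xs)


def fit_line(xs: List[float], ys: List[float]) -> Tuple[float, float]:
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    if len(xs) != len(ys) or not xs:
        raise ValueError("xs and ys must be non-empty and of equal length")
    x_bar, y_bar = mean(xs), mean(ys)
    # Unnormalized covariance of x and y, and unnormalized variance of x.
    cov_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    var_x = sum((x - x_bar) ** 2 for x in xs)
    if var_x == 0:
        raise ValueError("all x values are identical, so the slope is undefined")
    slope = cov_xy / var_x
    intercept = y_bar - slope * x_bar
    return slope, intercept


def predict(x: float, slope: float, intercept: float) -> float:
    """Evaluate the fitted line at x."""
    return slope * x + intercept


# Hypothetical toy data lying exactly on y = 2x + 1.
slope, intercept = fit_line([1.0, 2.0, 3.0, 4.0], [3.0, 5.0, 7.0, 9.0])
print(slope, intercept)
```

No libraries are needed here, which is the point: writing the formula out by hand makes it clear what a library call would be doing for you.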

I wholeheartedly recommend it if you’re **new to the Data Science community**. It will give you a clear overview of most topics you’ll need in order to start being a productive Data Scientist.

It will also showcase Python’s most commonly used libraries and expose you to a lot of **idiomatic code**, which is always a plus.

Here’s a link to *Data Science from Scratch* on Amazon.

## Springer: Introduction to Statistical Learning

Introduction to Statistical Learning is the most comprehensive **Machine Learning book** I’ve found so far. I learned a lot from it, from **Unsupervised Learning** algorithms like K-Means Clustering, to **Supervised Learning** ones like Boosted Trees (the technique behind libraries such as XGBoost).

The first chapters may feel a bit too introductory if you’re already working in this field (at least that was my experience). However, they also sum up many things you may not have learned in such an organized way before.

The later chapters are, however, where I think this book really shines. Its explanations of random forests, boosted trees and support vector machines are spot on.

Here are some of the topics you can learn from *Introduction to Statistical Learning*:

- **Regression** and **Supervised Learning** Algorithms: from **Linear Regression** and **SVM**s to **tree-based** methods.
- **Unsupervised Learning** techniques: especially Clustering, including the K-Means algorithm.
- Sampling methods, and other general Machine Learning **core concepts**.
- The meaning, advantages and disadvantages of **metrics** such as **accuracy, recall, precision**, etc.
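
As a small illustration of why the choice of metric matters, here is a rough sketch that computes accuracy, precision and recall by hand for binary labels. It is not from the book: the helper name `binary_metrics` and the toy data are assumptions made up for this example, and returning 0.0 when a denominator is zero is just one possible convention.

```python
from typing import Dict, Sequence


def binary_metrics(y_true: Sequence[int], y_pred: Sequence[int]) -> Dict[str, float]:
    """Accuracy, precision and recall for 0/1 labels, where 1 is the positive class."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    return {
        # Fraction of all predictions that were correct.
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
        # Of everything predicted positive, how much really was positive.
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        # Of everything really positive, how much was found.
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# Hypothetical imbalanced data: one positive out of ten samples,
# and a "model" that always predicts the negative class.
y_true = [1] + [0] * 9
y_pred = [0] * 10
print(binary_metrics(y_true, y_pred))
```

In this toy case the always-negative model scores 90% accuracy while its recall is zero, which is the kind of trade-off between metrics the book discusses.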

I think this book has been my best read so far this year, and it’s made me into a more well-rounded Data Scientist. I recommend it if you have a bit more experience, but want to **polish your edges**. It is also a very good **reference** book to keep on your shelf.

It also shows how to implement everything in R, which I didn’t find particularly useful, but it didn’t hurt. You’ll probably import most of this code from scikit-learn anyway.

As before, here’s a link to Springer’s Introduction to Statistical Learning on Amazon.

## Deep Learning by Goodfellow, Bengio et al.

Bengio’s *Deep Learning* blows my mind every time I open it. I’ll be the first to admit I haven’t really read it from start to finish. Yet.

The only reason it’s the last one in the list is its very **specific scope**: Artificial **Neural Networks** or **Deep Learning**.

However, its first chapters, with an overview of Deep Learning’s **precursors** and what makes it different, and then the explanation of **how Deep Learning works**, are marvelous.

It even starts off by explaining everything you need to know **before studying deep learning**, with whole chapters dedicated to **linear algebra**, **probability** and **information theory**, and **numerical computation** methods.

The next chapters, which I’ve only partially read, serve as an awesome **reference** whenever you need to dive deeper into a particular Neural Network **architecture**.

They include in-depth explanations of **Convolutional** Neural Networks and **Recurrent** Neural Networks, along with many **regularization** or optimization methods.

The third and last section, which revolves around **cutting-edge technology**, explains **Generative models**, Autoencoders and many other interesting algorithms. Adding them to your own toolkit will probably give you a great boost!

The authors of this book are the **rock stars** of Machine Learning right now. One of them, Yoshua Bengio, even won the 2018 Turing Award, so I can’t think of better people to teach this subject. I really think this is the best Deep Learning book out there.

Here’s an Amazon link if you’re interested in the *Deep Learning* book.

## Conclusion

I went from a broad, introductory book to an advanced, specific one.

Each of these Machine Learning books has had a profound impact on my career and, to some degree, the way I see the world.

I really hope at least some of them will have the same positive impact on your life!

And if you’ve already read, or are reading, any of them, tell me what you think of them in the comments! I’d love to discuss any of them further, especially the Deep Learning book.

We can also discuss them on Twitter, Medium or dev.to if you’re interested.

I want to hear your opinions!

*(small disclaimer: all of these links are Amazon affiliate links, which means I get a small commission if you buy the books. However, I’ll only review books I’ve actually read, and have genuinely recommended to people in real life)*

**Michael Johnson Jr:** Linear Regression is a prediction algorithm, not a classification algorithm. You may want to change this.

**Kabelo Kelepile:** This is amazing!

**strikingloo:** That’s right, it’s a regression algorithm, not necessarily a classification one. I’ll fix it right away!

**Bartosz:** Could you please give me a hint on which level of Statistics and Probability is sufficient to dive into O’Reilly’s Data Science from Scratch with Python? Can you recommend any book about Statistics and Probability?

**strikingloo:** The book is pretty introductory: as long as you know calculus and linear algebra you shouldn’t have any problem. It doesn’t use any Statistics or Probability without first introducing them properly.

If you can code in Python, you’ll probably be better off learning Probability and Statistics from this book.

**Martin:** “Introduction to Statistical Learning” is also available as a (legal) free PDF. Very nice!
