Preface
1 Introduction
1.1 What Is Machine Learning
1.2 Types of Learning
1.2.1 Supervised Learning
1.2.2 Unsupervised Learning
1.2.3 Semi-Supervised Learning
1.2.4 Reinforcement Learning
1.3 How Supervised Learning Works
1.4 Why the Model Works on New Data
2 Notation and Definitions
2.1 Notation
2.1.1 Data Structures
2.1.2 Capital Sigma Notation
2.1.3 Capital Pi Notation
2.1.4 Operations on Sets
2.1.5 Operations on Vectors
2.1.6 Functions
2.1.7 Max and Arg Max
2.1.8 Assignment Operator
2.1.9 Derivative and Gradient
2.2 Random Variable
2.3 Unbiased Estimators
2.4 Bayes’ Rule
2.5 Parameter Estimation
2.6 Parameters vs. Hyperparameters
2.7 Classification vs. Regression
2.8 Model-Based vs. Instance-Based Learning
2.9 Shallow vs. Deep Learning
3 Fundamental Algorithms
3.1 Linear Regression
3.1.1 Problem Statement
3.1.2 Solution
3.2 Logistic Regression
3.2.1 Problem Statement
3.2.2 Solution
3.3 Decision Tree Learning
3.3.1 Problem Statement
3.3.2 Solution
3.4 Support Vector Machine
3.4.1 Dealing with Noise
3.4.2 Dealing with Inherent Non-Linearity
3.5 k-Nearest Neighbors
4 Anatomy of a Learning Algorithm
4.1 Building Blocks of a Learning Algorithm
4.2 Gradient Descent
4.3 How Machine Learning Engineers Work
4.4 Learning Algorithms’ Particularities
5 Basic Practice
5.1 Feature Engineering
5.1.1 One-Hot Encoding
5.1.2 Binning
5.1.3 Normalization
5.1.4 Standardization
5.1.5 Dealing with Missing Features
5.1.6 Data Imputation Techniques
5.2 Learning Algorithm Selection
5.3 Three Sets
5.4 Underfitting and Overfitting
5.5 Regularization
5.6 Model Performance Assessment
5.6.1 Confusion Matrix
5.6.2 Precision/Recall
5.6.3 Accuracy
5.6.4 Cost-Sensitive Accuracy
5.6.5 Area under the ROC Curve (AUC)
5.7 Hyperparameter Tuning
5.7.1 Cross-Validation
6 Neural Networks and Deep Learning
6.1 Neural Networks
6.1.1 Multilayer Perceptron Example
6.1.2 Feed-Forward Neural Network Architecture
6.2 Deep Learning
6.2.1 Convolutional Neural Network
6.2.2 Recurrent Neural Network
7 Problems and Solutions
7.1 Kernel Regression
7.2 Multiclass Classification
7.3 One-Class Classification
7.4 Multi-Label Classification
7.5 Ensemble Learning
7.5.1 Boosting and Bagging
7.5.2 Random Forest
7.5.3 Gradient Boosting
7.6 Learning to Label Sequences
7.7 Sequence-to-Sequence Learning
7.8 Active Learning
7.9 Semi-Supervised Learning
7.10 One-Shot Learning
7.11 Zero-Shot Learning
8 Advanced Practice
8.1 Handling Imbalanced Datasets
8.2 Combining Models
8.3 Training Neural Networks
8.4 Advanced Regularization
8.5 Handling Multiple Inputs
8.6 Handling Multiple Outputs
8.7 Transfer Learning
8.8 Algorithmic Efficiency
9 Unsupervised Learning
9.1 Density Estimation
9.2 Clustering
9.2.1 K-Means
9.2.2 DBSCAN and HDBSCAN
9.2.3 Determining the Number of Clusters
9.2.4 Other Clustering Algorithms
9.3 Dimensionality Reduction
9.3.1 Principal Component Analysis
9.3.2 UMAP
9.4 Outlier Detection
10 Other Forms of Learning
10.1 Metric Learning
10.2 Learning to Rank
10.3 Learning to Recommend
10.3.1 Factorization Machines
10.3.2 Denoising Autoencoders
10.4 Self-Supervised Learning: Word Embeddings
11 Conclusion
11.1 Topic Modeling
11.2 Gaussian Processes
11.3 Generalized Linear Models
11.4 Probabilistic Graphical Models
11.5 Markov Chain Monte Carlo
11.6 Genetic Algorithms
11.7 Reinforcement Learning
Index
