Machine Learning Interview Course
An in-depth guide to the machine learning interview. This course includes 30 interview questions asked by top companies, along with their answers.
The Machine Learning Interview course has all the questions and answers you need to ace your machine learning interviews.
30 interview questions and answers
Key machine learning terms defined
Complex concepts visualized
Written by a data science hiring manager with 8 years of experience
Each lesson explains how to answer a common interview question.
Overcome common data problems
Choose the right algorithm
Properly validate your model
Tune your model without overfitting
Explain the exact math behind model training
You will use the Machine Learning Interview course throughout the data science interview process.
Phone screen
Take-home challenge
Onsite interview
Question #1: What's the most likely cause of a logistic regression model with low accuracy? (Free preview)
Answer #1: The bias-variance tradeoff is ... (Free preview)
Question #2: How would you troubleshoot a logistic regression model with low accuracy? (Free preview)
Answer #2: Symptoms of ... (Free preview)
Question #3: How would you troubleshoot a decision tree with low accuracy? (Free preview)
Answer #3: It’s likely the decision tree ... (Free preview)
Question #1: Why would gradient descent fail to converge?
Answer #1: Gradient descent is ...
Question #2: What would happen if you choose a learning rate that is too small?
Answer #2: If you pick a learning rate that is ...
Question #3: How would you overcome the problems associated with choosing learning rates?
Answer #3: You can use an ...
Question #4: What’s the difference between gradient descent and stochastic gradient descent?
Answer #4: Gradient descent uses …
Question #1: How does linear regression learn?
Answer #1: To understand how a model learns, we must ...
Question #2: How does logistic regression learn?
Answer #2: Logistic regression models use …
Question #3: What’s the difference between evaluation and loss metrics?
Answer #3: Sometimes the model is validated...
Question #1: What would you do to determine the cause of coefficients that don’t make sense?
Answer #1: You should …
Question #2: How would you improve a linear regression model that has coefficients that don’t make sense?
Answer #2: Given that a ...
Question #3: What method would you choose to validate your model?
Answer #3: The simplest form of validation is ...
Question #1: What problem could class imbalance cause and what techniques would you consider to counteract this?
Answer #1: Class imbalance occurs when ...
Question #2: Why is 99% accuracy probably not a good thing for this model?
Answer #2: For classification problems, it’s useful to ...
Question #3: What’s a better way to measure model performance for imbalanced classes?
Answer #3: First, you can look at ...
Question #4: How would you evaluate a model for which both precision and recall are important?
Answer #4: When it's desirable to account for both ...
Question #1: How would you choose a threshold for classification?
Answer #1: The default threshold for classification...
Question #2: How would you change the threshold to account for mispredictions of different dollar values?
Answer #2: The best threshold...
Question #3: How would you change the model to account for outcomes of different dollar values?
Answer #3: If they both have the same predicted probability ...
Question #1: How would you choose what type of model to build?
Answer #1: The most straightforward algorithm selection ...
Question #2: What algorithms would you choose to build this model?
Answer #2: There are dozens of popular classifiers to try ...
Question #3: Next time you build a classifier on a different data set will the same algorithm produce the best results?
Answer #3: The only way to find out ...
Question #1: How would you handle missing data?
Answer #1: Missing data occurs when...
Question #2: How would you handle outliers?
Answer #2: Outliers are ...
Question #3: How would you handle features on different scales?
Answer #3: Some machine learning models ...
Question #4: How would you handle multicollinearity?
Answer #4: Multicollinearity is when ...
Question #1: What’s the curse of dimensionality and why is it a problem?
Answer #1: The curse of dimensionality refers to ...
Question #2: How would you handle high dimensional data?
Answer #2: Before moving forward with modeling ...
Question #3: What are some common dimensionality reduction techniques?
Answer #3: Below are some common dimensionality reduction techniques …
You get access to the content for a full year from the day you sign up. The best time to start preparing for a new job is when you don’t think you need one. Things change fast and nothing’s worse than suddenly finding yourself unprepared to be on the job market. Think of interview preparation as job insurance: you hope you don’t need to use it, but if you do, you’ll sure be glad you had it.
I founded Decode Data Science after nearly a decade of experience as a data scientist at top tech companies. The content is based on patterns mined from hundreds of data science interviews conducted over the past 5 years.
Interviewers may overlook mistakes in other areas, but they expect candidates to know machine learning thoroughly. Too many candidates treat machine learning models as a black box: data goes in and predictions come out! This is the most dangerous type of data scientist. They can make wrong predictions with high confidence, and managers avoid hiring them at all costs!