Questions: Maximum Likelihood Estimation

5 questions to test your understanding

Score: 0 / 5

Question 1 Multiple Choice

A statistician writes L(θ) = ∏ p(xᵢ|θ) after observing data x₁, ..., xₙ. Which statement correctly describes what L(θ) is?

AA probability distribution over possible parameter values — the probability that θ takes each value given the data

BA measure of how probable the observed data would be for each candidate value of θ, with the data held fixed

CThe marginal probability of the data summed over all possible parameter values

DA probability distribution over possible datasets for a fixed value of θ

Question 2 Multiple Choice

You flip a coin 10 times and observe 7 heads. What does MLE give as the estimate of the probability of heads?

A0.5 — a fair coin is the most principled default assumption

B0.7 — this is the parameter value that makes observing exactly 7 heads in 10 flips most probable

CIt cannot be determined without specifying a prior distribution over the probability of heads

D0.7 if the coin is known to be biased; 0.5 if the coin is assumed fair

Question 3 True / False

The likelihood function L(θ) is a probability distribution over the parameter θ and therefore integrates (or sums) to 1 over most possible values of θ.

TTrue

FFalse

Question 4 True / False

Maximizing the log-likelihood ℓ(θ) = Σ log p(xᵢ|θ) gives the same θ̂ as maximizing the likelihood L(θ) = ∏ p(xᵢ|θ).

TTrue

FFalse

Question 5 Short Answer

What is the central question MLE asks, and how does it differ from the question that a probability mass or density function answers?

Think about your answer, then reveal below.