Questions: Maximum Likelihood Estimation

5 questions to test your understanding

Score: 0 / 5
Question 1 Multiple Choice

A statistician writes L(θ) = ∏ p(xᵢ|θ) after observing data x₁, ..., xₙ. Which statement correctly describes what L(θ) is?

AA probability distribution over possible parameter values — the probability that θ takes each value given the data
BA measure of how probable the observed data would be for each candidate value of θ, with the data held fixed
CThe marginal probability of the data summed over all possible parameter values
DA probability distribution over possible datasets for a fixed value of θ
Question 2 Multiple Choice

You flip a coin 10 times and observe 7 heads. What does MLE give as the estimate of the probability of heads?

A0.5 — a fair coin is the most principled default assumption
B0.7 — this is the parameter value that makes observing exactly 7 heads in 10 flips most probable
CIt cannot be determined without specifying a prior distribution over the probability of heads
D0.7 if the coin is known to be biased; 0.5 if the coin is assumed fair
Question 3 True / False

The likelihood function L(θ) is a probability distribution over the parameter θ and therefore integrates (or sums) to 1 over most possible values of θ.

TTrue
FFalse
Question 4 True / False

Maximizing the log-likelihood ℓ(θ) = Σ log p(xᵢ|θ) gives the same θ̂ as maximizing the likelihood L(θ) = ∏ p(xᵢ|θ).

TTrue
FFalse
Question 5 Short Answer

What is the central question MLE asks, and how does it differ from the question that a probability mass or density function answers?

Think about your answer, then reveal below.