5 questions to test your understanding
The LMS algorithm uses a stochastic gradient: w(n+1) = w(n) − μ·e(n)·x(n), where e(n) is instantaneous error and μ is step-size. Why does LMS converge slowly compared to Recursive Least Squares (RLS), which computes the least-squares solution at each step?
In Normalized LMS (NLMS), the step-size is normalized by the input power: μ_n = μ / (α + ||x(n)||²). How does this normalization improve robustness, and what is the trade-off?
Recursive Least Squares (RLS) maintains an M×M matrix P (inverse of input autocorrelation) and updates it via: P(n) = [P(n−1) − P(n−1)x(n)x(n)ᵀP(n−1) / (1 + x(n)ᵀP(n−1)x(n))] / λ, where λ is a forgetting factor < 1. What does the forgetting factor accomplish in a time-varying environment?
Echo cancellation in teleconferencing uses an adaptive filter to model and subtract the acoustic feedback path (speaker-to-microphone through the room). Why is this problem hard, and why does RLS outperform LMS here?
Explain the relationship between Kalman filtering and optimal adaptive filtering: when is a Kalman filter the optimal adaptive filter, and what assumptions does it require?