Why is the Lebesgue integral for non-negative functions defined using a supremum over simple functions below f, and what goes wrong if you try to apply the same approach to functions that can take negative values?
Think about your answer, then reveal below.
Model answer: For non-negative functions, approximating from below is unambiguous: simple functions below f contribute less area than f, and the supremum captures the exact total area. The definition is always well-defined because the supremum of non-negative numbers is either finite or +∞, never undefined. For functions with negative values, the 'approximate from below' approach creates the problem ∞ − ∞: if f has a large positive part and a large negative part, the supremum of simple functions below f could be +∞ (capturing the positive part) and the 'correction' for the negative part would also be −∞, leaving the integral as ∞ − ∞, which is undefined. The solution is to decompose f into its positive part f⁺ = max(f, 0) and negative part f⁻ = max(−f, 0), integrate each using the non-negative definition, and compute ∫f = ∫f⁺ − ∫f⁻, which is well-defined as long as at least one of ∫f⁺, ∫f⁻ is finite.
This two-step structure — define for non-negative functions first, then extend by decomposition — is how measure theory builds integration rigorously. The non-negative case establishes the floor; the general case inherits its properties. The Monotone Convergence Theorem, Fatou's Lemma, and the Dominated Convergence Theorem are all proved first for non-negative functions and then extended.