The diagonal argument proves there are uncomputable languages: list all TMs and strings; construct a language differing from the i-th TM's language on the i-th string. This language cannot be recognized by any TM. Diagonalization, adapted from Cantor's set theory, establishes fundamental limits: computation is countable but languages are uncountable.
You already know that a universal Turing machine can simulate any other Turing machine, and from your study of cardinality you know that some infinite sets are larger than others — the reals are uncountable while the naturals are countable. Diagonalization brings these ideas together to prove something profound: there exist languages that no Turing machine can ever recognize, no matter how clever the algorithm.
The argument starts with a counting observation. Every Turing machine can be encoded as a finite string over some fixed alphabet — its states, transitions, and tape symbols can all be written down. Since finite strings over a finite alphabet are countable (you can list them in order of length), there are only countably many Turing machines: M₁, M₂, M₃, and so on. Similarly, strings over the input alphabet are countable: s₁, s₂, s₃, and so on. Now consider the set of all *languages* over this alphabet. A language is any subset of all strings, and the set of all subsets of a countable set is uncountable (by Cantor's theorem). So there are uncountably many languages but only countably many Turing machines. By pigeonhole, most languages have no Turing machine that recognizes them. This is already a stunning conclusion, but it doesn't give us a specific uncomputable language — diagonalization does.
Construct a table where row *i* corresponds to Turing machine Mᵢ and column *j* corresponds to string sⱼ. Entry (i, j) is 1 if Mᵢ accepts sⱼ, and 0 if it doesn't. Each row encodes the language of one Turing machine. Now define a new language D by flipping the diagonal: string sᵢ is in D if and only if Mᵢ does *not* accept sᵢ. This language D differs from every row in the table. It differs from M₁'s language on s₁, from M₂'s language on s₂, and in general from Mᵢ's language on sᵢ. Therefore D is not the language of any Turing machine — it is unrecognizable. The technique works because you've systematically ensured disagreement with every possible machine at a specific, targeted point.
This is exactly Cantor's diagonal argument applied to computation rather than real numbers. Cantor proved the reals are uncountable by assuming a complete list and constructing a real differing from every listed number in one decimal digit. Here, you assume a complete list of Turing machines and construct a language differing from every machine's language on one specific string. The method is the same; only the domain changes. The philosophical consequence is that uncomputability is not a deficiency of current technology — it is a mathematical certainty. There are well-defined problems with well-defined answers that no algorithm, on any computer, running for any amount of time, can solve. The halting problem, which you'll encounter next, is the most famous specific example of this phenomenon.