What is the crucial distinction between arithmetic's ability to represent its own syntax and its inability to represent its own semantics, and why does this distinction matter?
Think about your answer, then reveal below.
Model answer: Syntax concerns the formal structure of expressions — which strings are formulas, what their Gödel codes are, whether one formula is a substitution instance of another. These are all recursive (computable) functions on natural numbers, and arithmetic can define them. Semantics concerns what formulas mean — whether they are true in the standard model. Truth is not a recursive predicate; it cannot be captured by any arithmetical formula. The distinction matters because self-reference through syntax (via Gödel coding) is available in arithmetic and enables Gödel's incompleteness theorems, while self-reference through semantics is blocked by Tarski's theorem. Arithmetic can 'talk about' its own proofs (a syntactic notion) but cannot 'talk about' its own truth (a semantic notion). This explains why Gödel produced incompleteness results (using provability, not truth) while Tarski showed truth is irreducibly metalinguistic.
The syntax/semantics divide is one of the deepest distinctions in logic. Syntax is algorithmic — it can be mechanically checked and encoded. Semantics is interpretive — it requires a model, a structure, an interpretation that stands outside the formal system. Tarski's theorem makes this informal distinction into a mathematical theorem: the gap between syntax and semantics is not closable from within a sufficiently rich language.