Chain Rule for Multivariable Functions

College Depth 62 in the knowledge graph I know this Set as goal
Unlocks 785 downstream topics
chain-rule composition derivatives

Core Idea

For z = f(x, y) where x = x(t) and y = y(t), the chain rule gives dz/dt = (∂f/∂x)(dx/dt) + (∂f/∂y)(dy/dt). For z = f(x, y) where x = x(s, t), the general case involves partial derivatives: ∂z/∂s and ∂z/∂t.

Explainer

In single-variable calculus, the chain rule says: if z = f(x) and x = g(t), then dz/dt = (dz/dx)(dx/dt). You can think of this as "the rate at which z changes with t equals the rate z changes with x, times the rate x changes with t" — a chain of rates multiplied together. The multivariable chain rule generalizes this, but with one crucial twist: when z depends on *multiple* intermediate variables, each one contributes its own chain, and you sum all the contributions.

Consider z = f(x, y) where both x and y depend on a parameter t — perhaps t is time and (x(t), y(t)) is the position of a moving particle. As t changes, both x and y change simultaneously, and both changes feed into z. The total rate of change is dz/dt = (∂f/∂x)(dx/dt) + (∂f/∂y)(dy/dt). The two terms are independent contributions: the first captures how much z changes due to x's movement, the second captures how much z changes due to y's movement. Because x and y change simultaneously, you *add* the contributions rather than multiply. This additive structure is the hallmark of the multivariable chain rule.

A useful visual aid is the dependency diagram: draw z at the top, with branches down to x and y (intermediate variables), and further branches from x and y down to t (the ultimate variable). Each path from z to t contributes one term: multiply the derivatives along that path, then sum across all paths. For ∂z/∂s when x = x(s,t) and y = y(s,t), there are two paths — through x and through y — giving ∂z/∂s = (∂f/∂x)(∂x/∂s) + (∂f/∂y)(∂y/∂s). Adding more intermediate or final variables just adds more branches to the diagram.

This formula is not just a computational trick — it captures how disturbances propagate through functional dependencies. In physics, if the temperature T(x, y, z) of a fluid depends on position, and a particle moves along a path (x(t), y(t), z(t)), then dT/dt = ∂T/∂x · ẋ + ∂T/∂y · ẏ + ∂T/∂z · ż — the material derivative. In optimization and machine learning, the chain rule (extended to vector form as the Jacobian product rule) is the foundation of backpropagation. Mastering the dependency-diagram approach now lets you handle compositions of any complexity by mechanically reading off the paths.

Practice Questions 5 questions

Prerequisite Chain

Counting to 10Counting to 20Understanding ZeroThe Number ZeroCounting to FiveOne-to-One CorrespondenceCombining Small Groups Within 5Addition Within 10Addition Within 20Two-Digit Addition Without RegroupingTwo-Digit Addition with RegroupingAddition Within 100Repeated Addition as MultiplicationMultiplication Facts Within 100Division as Equal SharingDivision as Grouping (Measurement Division)Division: Grouping (Repeated Subtraction) ModelDivision: Fair Sharing ModelDivision as Equal SharingDivision as GroupingBasic Division FactsDivision Facts Within 100Two-Digit by One-Digit DivisionDivision with RemaindersRemainders and Quotients in DivisionDivision Word ProblemsIntroduction to Long DivisionFactors and MultiplesPrime and Composite NumbersEquivalent FractionsRelating Fractions and DecimalsDecimal Place ValueIntegers and the Number LineOpposites and Additive InversesAbsolute ValueAdding IntegersSubtracting IntegersMultiplying IntegersDividing IntegersUnit RatesProportionsPercent ConceptConverting Between Fractions, Decimals, and PercentsOperations with Rational NumbersTwo-Step EquationsSolving Multi-Step EquationsEquations with Variables on Both SidesLiteral EquationsSlope-Intercept FormPoint-Slope FormWriting Linear EquationsParallel and Perpendicular Line SlopesGraphing Linear EquationsPiecewise FunctionsOne-Sided LimitsContinuity DefinitionLimit Definition of the DerivativePower RuleConstant Multiple and Sum/Difference RulesProduct RuleChain RuleChain Rule for Multivariable FunctionsChain Rule for Multivariable Functions

Longest path: 63 steps · 256 total prerequisite topics

Prerequisites (1)

Leads To (1)