A topic in the Open Knowledge Graph — a free, open map of 15,290 topics and the order to learn them in.

Chain Rule for Multivariable Functions

College Depth 77 in the knowledge graph ☐ I know this ☆ Set as goal

5,780topics build on this

335prerequisites beneath it

Chain Rule Partial Derivatives: Definition and Computation +3 more→→Backpropagation Algorithm Chain Rule for Multivariable Functions +4 more

Core Idea

If f(x, y) has continuous partials and x = x(t), y = y(t), then df/dt = (∂f/∂x)(dx/dt) + (∂f/∂y)(dy/dt). For compositions like f(g(x, y), h(x, y)), the chain rule tracks how changes propagate through each layer.

Explainer

From single-variable calculus you know the chain rule: if y = f(g(t)), then dy/dt = f'(g(t)) · g'(t). The idea is that a small change in t propagates through g first, producing a change in g(t), which then propagates through f. In multivariable calculus the same logic applies, but now the "middle variable" x = x(t) is not a single number — it may be a point (x(t), y(t)) in the plane, and f depends on *both* components. Each component of the path contributes its own chain of partial derivatives, and all contributions are added.

The formula df/dt = (∂f/∂x)(dx/dt) + (∂f/∂y)(dy/dt) has a natural reading: the rate at which f changes as t changes is the sum of (how sensitive f is to x) × (how fast x is moving) plus (how sensitive f is to y) × (how fast y is moving). Each partial derivative plays the role that f'(g(t)) played in the single-variable rule — it measures sensitivity along one direction — and each dx/dt or dy/dt measures how fast the path is moving in that direction. If x and y are independent (x(t) = t, y(t) = 0), the formula reduces to the single-variable derivative in x, as expected.

The general multivariable chain rule is most cleanly written using Jacobians. If x: ℝᵏ → ℝⁿ is a differentiable function and f: ℝⁿ → ℝᵐ is differentiable, then the derivative of the composition f(x(t)) is the matrix product Df · Dx — the Jacobian of f multiplied by the Jacobian of x. For scalar-valued f this becomes a row vector (the gradient ∇f) dotted with the matrix of partial derivatives of x. The summation form you saw above is just this matrix product written out explicitly for the case n = 2, m = 1, k = 1.

A powerful consequence is implicit differentiation in several variables, which you will meet next. If F(x, y) = 0 defines y implicitly as a function of x, then differentiating both sides with respect to x and applying the chain rule gives (∂F/∂x) + (∂F/∂y)(dy/dx) = 0, so dy/dx = −(∂F/∂x)/(∂F/∂y) wherever ∂F/∂y ≠ 0. The chain rule is also the engine behind the gradient and directional derivatives: the rate of change of f along a path with velocity vector v is exactly ∇f · v, which is the chain rule applied to the path x(t) with x'(t) = v.

Practice Questions 5 questions

Prerequisite Chain

Understanding Zero → The Number Zero → Counting to Five → Counting to 10 → Counting to 20 → Counting a Set of Objects Up to 20 → Cardinality: The Last Number Counted → Matching Numerals to Quantities → Subitizing Small Quantities → Addition Within 10 → Number Bonds to 10 → Addition Within 20 → Doubles and Near Doubles → Doubles Facts Within 10 → Near Doubles Facts Within 20 → Mental Math Strategies for Addition → Mental Math: Adding and Subtracting Tens → Addition Within 100 → Repeated Addition as Multiplication → Multiplication as Equal Groups → Multiplication: Arrays → Basic Multiplication Facts (0s, 1s, 2s, 5s, 10s) → Multiplication Facts Within 100 → Division as Equal Sharing → Division as Grouping (Measurement Division) → Division: Grouping (Repeated Subtraction) Model → Division: Fair Sharing Model → Division as Equal Sharing → Division as Grouping → Basic Division Facts → Division Facts Within 100 → Multiplication and Division Fact Families → Relationship Between Multiplication and Division → Division Facts as Inverse of Multiplication → Remainders and Quotients in Division → Division Word Problems → Multi-Step Word Problems → Solving Multi-Step Word Problems → Multiplication Word Problems → Division Word Problems → Introduction to Long Division → Factors and Multiples → Prime and Composite Numbers → Equivalent Fractions → Relating Fractions and Decimals → Decimal Place Value → Integers and the Number Line → Comparing and Ordering Integers → Absolute Value → Adding Integers → Subtracting Integers → Multiplying Integers → Dividing Integers → Unit Rates → Proportions → Percent Concept → Converting Between Fractions, Decimals, and Percents → Operations with Rational Numbers → Two-Step Equations → Solving Multi-Step Equations → Equations with Variables on Both Sides → Literal Equations → Slope-Intercept Form → Point-Slope Form → Writing Linear Equations → Parallel and Perpendicular Line Slopes → Graphing Linear Equations → Piecewise Functions → One-Sided Limits → Continuity Definition → Limits and Continuity in Multiple Variables → Functions of Several Variables → Continuity in Multiple Variables → Partial Derivatives: Definition and Computation → Differentiability in Multiple Variables → Differentiability in Multivariable Functions → Total Differential and Linear Approximation → Chain Rule for Multivariable Functions

Longest path: 78 steps · 335 total prerequisite topics

Prerequisites (5)

Partial Derivatives: Definition and Computationhard Chain Rulehard Differentiability in Multiple Variablessoft Differentiability in Multivariable Functionssoft Total Differential and Linear Approximationsoft

Leads To (6)

Backpropagation Algorithmhard Chain Rule for Multivariable Functionshard Directional Derivatives and the Gradientsoft Implicit Differentiationsoft Policy Gradient Methodshard The One-Dimensional Wave Equationhard