Chain rule
Encyclopedia : C : CH : CHA : Chain rule
In calculus, the chain rule is a formula for the derivative of the composite of two functions.
Explanation
In intuitive terms, if a variable, y, depends on a second variable, u, which in turn depends on a third variable, x, then the rate of change of y with respect to x can be computed as the product of the rate of change of y with respect to u multiplied by the rate of change of u with respect to x.
The chain rule may be stated in any of several equivalent forms:
- [ (f \circ g)'(x) = (f(g(x)))' = f'(g(x)) g'(x),\,]
- [\frac = \frac \cdot \frac ,]
Example
Suppose, for example, that one is climbing a mountain at a rate of 0.5 kilometres per hour. The temperature is lower at higher elevations; suppose the rate by which it decreases is 6 °C per kilometre. If one multiplies 6 °C per kilometre by 0.5 kilometre per hour, one obtains 3 °C per hour. This calculation is a typical chain rule application.The general power rule
The general power rule (GPR) is derivable via the chain rule.Example I
Consider [f(x) = (x^2 + 1)^3]. [f(x)] is comparable to [h(g(x))] where [g(x) = x^2 + 1] and [h(x) = x^3]; thus,
[f '(x) \,] [= 3(x^2 + 1)^2(2x) \,] [= 6x(x^2 + 1)^2 \,] Example II
In order to differentiate the trigonometric function- [f(x) = \sin(x^2),\,]
- [f'(x) = 2x \cos(x^2) \,]
Chain rule for several variables
The chain rule works for functions of several variables as well. For example, if we have a function [f(u(x, y), v(x, y))] where- [u(x, y) = 3x + y^2 \mbox v(x, y) = \sin(xy) \mbox f=u+v,\,]
- [=+=3 + \cos(xy)y.]
Proof of the chain rule
Let f and g be functions and let x be a number such that f is differentiable at g(x) and g is differentiable at x. Then by the definition of differentiability,- [ g(x+\delta)-g(x)= \delta g'(x) + \epsilon(\delta) \,] where [ \frac \to 0 \,] as [\delta\to 0.]
- [ f(g(x)+\alpha) - f(g(x)) = \alpha f'(g(x)) + \eta(\alpha) \,] where [\frac \to 0 \,] as [\alpha\to 0. \,]
[ f(g(x+\delta))-f(g(x))\, ] [= f(g(x) + \delta g'(x)+\epsilon(\delta)) - f(g(x)) \,] [ = \alpha_\delta f'(g(x)) + \eta(\alpha_\delta) \,] where [\alpha_\delta = \delta g'(x) + \epsilon(\delta) \,]. Observe that as [\delta\to 0,] [\frac\to g'(x)] and [\frac\to 0]. Hence
- [ \frac \to g'(x)f'(g(x))\mbox \delta \to 0.]
The fundamental chain rule
The chain rule is a fundamental property of all definitions of derivative and is therefore valid in much more general contexts. For instance, if E, F and G are Banach spaces (which includes Euclidean space) and f : E → F and g : F → G are functions, and if x is an element of E such that f is differentiable at x and g is differentiable at f(x), then the derivative (the Fréchet derivative) of the composition g o f at the point x is given by- [\mbox_x\left(g \circ f\right) = \mbox_\left(g\right) \circ \mbox_x\left(f\right).]
A particularly clear formulation of the chain rule can be achieved in the most general setting: let M, N and P be Ck manifolds (or even Banach-manifolds) and let
- f : M → N and g : N → P
- [\mbox\left(g \circ f\right) = \mboxg \circ \mboxf.]
Tensors and the chain rule
See tensor field for an advanced explanation of the fundamental role the chain rule plays in the geometric nature of tensors.Higher derivatives
Faà di Bruno's formula generalizes the chain rule to higher derivatives. The first few derivatives are- [\frac = \frac \frac]
- [ \frac = \left(\frac\right)^2 \frac + \frac\frac]
- [ \frac = \left(\frac\right)^3 \frac + 3\frac\frac \frac + \frac \frac]
- [ \frac = \left(\frac\right)^4 \frac + 6 \left(\frac\right)^2 \frac \frac + \left\ \frac + 3\left(\frac\right)^2\right\} \frac + \frac \frac]
See also
From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.
