{{Short description|Notion in calculus}} {{other uses of|differential|topic=mathematics|Differential (mathematics)}} {{Calculus |Differential}}

In calculus, the '''differential''' represents the principal part of the change in a function <math>y = f(x)</math> with respect to changes in the independent variable. The differential <math>dy</math> is defined by <math display="block">dy = f'(x)\,dx,</math> where <math>f'(x)</math> is the derivative of {{math|''f''}} with respect to <math>x</math>, and <math>dx</math> is an additional real variable (so that <math>dy</math> is a function of <math>x</math> and <math>dx</math>). The notation is such that the equation

<math display="block">dy = \frac{dy}{dx}\, dx</math>

holds, where the derivative is represented in the Leibniz notation <math>dy/dx</math>, and this is consistent with regarding the derivative as the quotient of the differentials. One also writes

<math display="block">df(x) = f'(x)\,dx.</math>

The precise meaning of the variables <math>dy</math> and <math>dx</math> depends on the context of the application and the required level of mathematical rigor. The domain of these variables may take on a particular geometrical significance if the differential is regarded as a particular differential form, or analytical significance if the differential is regarded as a linear approximation to the increment of a function. Traditionally, the variables <math>dx</math> and <math>dy</math> are considered to be very small (infinitesimal), and this interpretation is made rigorous in non-standard analysis.

==History and usage== The differential was first introduced via an intuitive or heuristic definition by Isaac Newton and furthered by Gottfried Leibniz, who thought of the differential&nbsp;{{math|''dy''}} as an infinitely small (or infinitesimal) change in the value&nbsp;{{mvar|y}} of the function, corresponding to an infinitely small change&nbsp;{{math|''dx''}} in the function's argument&nbsp;{{mvar|x}}. For that reason, the instantaneous rate of change of {{mvar|y}} with respect to {{mvar|x}}, which is the value of the derivative of the function, is denoted by the fraction

<math display="block"> \frac{dy}{dx} </math> in what is called the Leibniz notation for derivatives. The quotient <math>dy/dx</math> is not infinitely small; rather it is a real number.

The use of infinitesimals in this form was widely criticized, for instance by the famous pamphlet The Analyst by Bishop Berkeley. Augustin-Louis Cauchy (1823) defined the differential without appeal to the atomism of Leibniz's infinitesimals.<ref>For a detailed historical account of the differential, see {{harvnb|Boyer|1959}}, especially page 275 for Cauchy's contribution on the subject. An abbreviated account appears in {{harvnb|Kline|1972|loc=Chapter 40}}.</ref><ref>Cauchy explicitly denied the possibility of actual infinitesimal and infinite quantities {{harv|Boyer|1959|pp=273–275}}, and took the radically different point of view that "a variable quantity becomes infinitely small when its numerical value decreases indefinitely in such a way as to converge to zero" ({{harvnb|Cauchy|1823|p=12}}; translation from {{harvnb|Boyer|1959|p=273}}).</ref> Instead, Cauchy, following d'Alembert, inverted the logical order of Leibniz and his successors: the derivative itself became the fundamental object, defined as a limit of difference quotients, and the differentials were then defined in terms of it. That is, one was free to ''define'' the differential <math>dy</math> by an expression <math display="block">dy = f'(x)\,dx</math> in which <math>dy</math> and <math>dx</math> are simply new variables taking finite real values,<ref>{{harvnb|Boyer|1959|p=275}}</ref> not fixed infinitesimals as they had been for Leibniz.<ref>{{harvnb|Boyer|1959|p=12}}: "The differentials as thus defined are only new ''variables'', and not fixed infinitesimals..."</ref>

According to {{harvtxt|Boyer|1959|p=12}}, Cauchy's approach was a significant logical improvement over the infinitesimal approach of Leibniz because, instead of invoking the metaphysical notion of infinitesimals, the quantities <math>dy</math> and <math>dx</math> could now be manipulated in exactly the same manner as any other real quantities in a meaningful way. Cauchy's overall conceptual approach to differentials remains the standard one in modern analytical treatments,<ref>{{harvnb|Courant|1937a|loc=II, §9}}: "Here we remark merely in passing that it is possible to use this approximate representation of the increment <math>\Delta y</math> by the linear expression <math>hf(x)</math> to construct a logically satisfactory definition of a "differential", as was done by Cauchy in particular."</ref> although the final word on rigor, a fully modern notion of the limit, was ultimately due to Karl Weierstrass.<ref>{{harvnb|Boyer|1959|p=284}}</ref>

In physical treatments, such as those applied to the theory of thermodynamics, the infinitesimal view still prevails. {{harvtxt|Courant|John|1999|p=184}} reconcile the physical use of infinitesimal differentials with the mathematical impossibility of them as follows. The differentials represent finite non-zero values that are smaller than the degree of accuracy required for the particular purpose for which they are intended. Thus "physical infinitesimals" need not appeal to a corresponding mathematical infinitesimal in order to have a precise sense.

Following twentieth-century developments in mathematical analysis and differential geometry, it became clear that the notion of the differential of a function could be extended in a variety of ways. In real analysis, it is more desirable to deal directly with the differential as the principal part of the increment of a function. This leads directly to the notion that the differential of a function at a point is a linear functional of an increment <math>\Delta x</math>. This approach allows the differential (as a linear map) to be developed for a variety of more sophisticated spaces, ultimately giving rise to such notions as the Fréchet or Gateaux derivative. Likewise, in differential geometry, the differential of a function at a point is a linear function of a tangent vector (an "infinitely small displacement"), which exhibits it as a kind of one-form: the exterior derivative of the function. In non-standard calculus, differentials are regarded as infinitesimals, which can themselves be put on a rigorous footing (see differential (infinitesimal)).

==Definition==

thumb|upright=1.25|The differential of a function <math>f(x)</math> at a point <math>x_0</math>. The differential is defined in modern treatments of differential calculus as follows.<ref>See, for instance, the influential treatises of {{harvnb|Courant|1937a}}, {{harvnb|Kline|1977}}, {{harvnb|Goursat|1904}}, and {{harvnb|Hardy|1908}}. Tertiary sources for this definition include also {{harvnb|Tolstov|2001}} and {{harvnb|Itô|1993|loc=§106}}.</ref> The differential of a function <math>f(x)</math> of a single real variable <math>x</math> is the function <math>df</math> of two independent real variables <math>x</math> and <math>\Delta x</math> given by

<math display="block">df(x, \Delta x) \ \stackrel{\mathrm{def}}{=} \ f'(x)\,\Delta x.</math>

One or both of the arguments may be suppressed, i.e., one may see <math>df(x)</math> or simply <math>df</math>. If <math>y = f(x)</math>, the differential may also be written as <math>dy</math>. Since <math>dx(x,\Delta x)=\Delta x</math>, it is conventional to write <math>dx=\Delta x</math> so that the following equality holds:

<math display="block">df(x) = f'(x) \, dx</math>

This notion of differential is broadly applicable when a linear approximation to a function is sought, in which the value of the increment <math>\Delta x</math> is small enough. More precisely, if <math>f</math> is a differentiable function at <math>x</math>, then the difference in <math>y</math>-values

<math display="block">\Delta y \ \stackrel{\rm{def}}{=}\ f(x+\Delta x) - f(x)</math>

satisfies

<math display="block">\Delta y = f'(x)\,\Delta x + \varepsilon = df(x) + \varepsilon\,</math>

where the error <math>\varepsilon</math> in the approximation satisfies <math>\varepsilon /\Delta x\rightarrow 0</math> as <math>\Delta x\rightarrow 0</math>. In other words, one has the approximate identity

<math display="block">\Delta y \approx dy</math>

in which the error can be made as small as desired relative to <math>\Delta x</math> by constraining <math>\Delta x</math> to be sufficiently small; that is to say, <math display="block">\frac{\Delta y - dy}{\Delta x}\to 0</math> as <math>\Delta x\rightarrow 0</math>. For this reason, the differential of a function is known as the principal (linear) part in the increment of a function: the differential is a linear function of the increment <math>\Delta x</math>, and although the error <math>\varepsilon</math> may be nonlinear, it tends to zero rapidly as <math>\Delta x</math> tends to zero.

==Differentials in several variables== {| class="wikitable" |+ !Operator / Function !<math>f(x)</math> !<math>f(x, y, u(x, y), v(x, y))</math> |- |Differential |1: <math>df \, \overset{\underset{\mathrm{def}}{}}{=} \, f'_x\,dx</math> |2: <math>d_x f \, \overset{\underset{\mathrm{def}}{}}{=} \, f'_x\,dx</math>

3: <math>df \, \overset{\underset{\mathrm{def}}{}}{=} \, f'_x dx + f'_y dy + f'_u du + f'_v dv</math> |- |Partial derivative |<math>f'_x \, \overset{\underset{\mathrm{(1)}}{}}{=} \, \frac{df}{dx}</math> |<math>f'_x \, \overset{\underset{\mathrm{(2)}}{}}{=} \, \frac{d_x f}{dx} = \frac{\partial f}{\partial x}</math> |- |Total derivative |<math>\frac{df}{dx} \, \overset{\underset{\mathrm{(1)}}{}}{=} \, f'_x</math> |<math>\frac{df}{dx} \, \overset{\underset{\mathrm{(3)}}{}}{=} \, f'_x + f'_u \frac{du}{dx} + f'_v \frac{dv}{dx}; (f'_y \frac{dy}{dx} = 0) </math> |} Following {{harvtxt|Goursat|1904|loc=I, §15}}, for functions of more than one independent variable,

<math display="block"> y = f(x_1,\dots,x_n), </math>

the '''partial differential''' of {{mvar|y}} with respect to any one of the variables&nbsp;{{math|''x''<sub>''i''</sub>}} is the principal part of the change in {{mvar|y}} resulting from a change&nbsp;{{math|''dx''<sub>''i''</sub>}} in that one variable. The partial differential is therefore

<math display="block"> \frac{\partial y}{\partial x_i} dx_i </math>

involving the partial derivative of {{mvar|y}} with respect to&nbsp;{{math|''x''<sub>''i''</sub>}}. The sum of the partial differentials with respect to all of the independent variables is the '''total differential'''

<math display="block"> dy = \frac{\partial y}{\partial x_1} dx_1 + \cdots + \frac{\partial y}{\partial x_n} dx_n, </math>

which is the principal part of the change in {{mvar|y}} resulting from changes in the independent variables&nbsp;{{math|''x''<sub>''i''</sub>}}.

More precisely, in the context of multivariable calculus, following {{harvtxt|Courant|1937b}}, if {{math|''f''}} is a differentiable function, then by the definition of differentiability, the increment

<math display="block">\begin{align} \Delta y &{}~\stackrel{\mathrm{def}}{=}~ f(x_1+\Delta x_1, \dots, x_n+\Delta x_n) - f(x_1,\dots,x_n)\\ &{}= \frac{\partial y}{\partial x_1} \Delta x_1 + \cdots + \frac{\partial y}{\partial x_n} \Delta x_n + \varepsilon_1\Delta x_1 +\cdots+\varepsilon_n\Delta x_n \end{align}</math>

where the error terms {{math|''ε''<sub>&nbsp;''i''</sub>}} tend to zero as the increments {{math|Δ''x''<sub>''i''</sub>}} jointly tend to zero. The total differential is then rigorously defined as

<math display="block">dy = \frac{\partial y}{\partial x_1} \Delta x_1 + \cdots + \frac{\partial y}{\partial x_n} \Delta x_n.</math>

Since, with this definition, <math display="block">dx_i(\Delta x_1,\dots,\Delta x_n) = \Delta x_i,</math> one has <math display="block">dy = \frac{\partial y}{\partial x_1}\,d x_1 + \cdots + \frac{\partial y}{\partial x_n}\,d x_n.</math>

As in the case of one variable, the approximate identity holds

<math display="block">dy \approx \Delta y</math>

in which the total error can be made as small as desired relative to <math display="inline">\sqrt{\Delta x_1^2+\cdots +\Delta x_n^2}</math> by confining attention to sufficiently small increments.

=== Application of the total differential to error estimation === In measurement, the total differential is used in estimating the error <math>\Delta f</math> of a function <math>f</math> based on the errors <math>\Delta x,\Delta y,\ldots </math> of the parameters <math>x, y, \ldots</math>. Assuming that the interval is short enough for the change to be approximately linear:

<math display="block">\Delta f(x)=f'(x)\Delta x</math>

and that the parameters <math>x, y, \ldots</math> are functionally independent, so that the total differential applies, the change in <math>f</math> induced by simultaneous changes in all parameters is

<math display="block">\Delta f \approx f_x \Delta x + f_y \Delta y + \cdots .</math>

The derivative <math>f_x</math> with respect to the parameter <math>x</math> gives the sensitivity of the function <math>f</math> to a change in <math>x</math>, in particular to the error <math>\Delta x</math>. How the individual errors <math>\Delta x, \Delta y, \ldots</math> combine into a bound on <math>\Delta f</math> depends on what is known about them.

If the errors are known only as bounds on magnitude (i.e., <math>|\Delta x| \le \delta x</math>, etc.) and may have arbitrary signs, the corresponding worst-case bound on <math>\Delta f</math> is obtained by taking absolute values, since signed contributions could otherwise cancel:

<math display="block">|\Delta f| \le |f_x|\, \delta x + |f_y|\, \delta y + \cdots .</math>

If instead the errors are modelled as statistically independent random variables with standard deviations <math>\sigma_x, \sigma_y, \ldots</math>, then by propagation of variance the standard deviation of <math>\Delta f</math> adds in quadrature:

<math display="block">\sigma_f^2 = f_x^2\, \sigma_x^2 + f_y^2\, \sigma_y^2 + \cdots .</math>

The first formula is a deterministic upper bound, the second a typical (one-standard-deviation) estimate; the two coincide only in the trivial single-variable case.

From the worst-case principle the familiar error rules of summation, multiplication, etc. are derived, e.g.: {{block indent | em = 1.6 | text = Let <math>f(a,b) = ab</math>. Then the finite error can be approximated as <math display="block">\Delta f = f_a \Delta a + f_b \Delta b.</math> Evaluating the derivatives: <math display="block">\Delta f = b \Delta a + a \Delta b.</math> Dividing by {{math|''f''}}, which is {{math|''a'' × ''b''}} <math display="block">\frac{\Delta f}{f} = \frac{\Delta a}{a} + \frac{\Delta b}{b}.</math> }} That is to say, in multiplication, the total relative error is the sum of the relative errors of the parameters.

To illustrate how this depends on the function considered, consider the case where the function is <math>f(a,b)=a\ln b</math> instead. Then, it can be computed that the error estimate is <math display="block">\frac{\Delta f}{f} = \frac{\Delta a}{a} + \frac{\Delta b}{b \ln b}</math> with an extra {{math|ln ''b''}} factor not found in the case of a simple product. This additional factor tends to make the error smaller, as the denominator {{math|b ln ''b''}} is larger than a bare&nbsp;{{mvar|b}}.

==Higher-order differentials== Higher-order differentials of a function {{math|1=''y'' = ''f''(''x'')}} of a single variable {{mvar|x}} can be defined via:<ref>{{harvnb|Cauchy|1823}}. See also, for instance, {{harvnb|Goursat|1904|loc=I, §14}}.</ref> <math display="block">d^2y = d(dy) = d(f'(x)dx) = (df'(x))dx = f''(x)\,(dx)^2,</math> and, in general, <math display="block">d^ny = f^{(n)}(x)\,(dx)^n.</math> Informally, this motivates Leibniz's notation for higher-order derivatives <math display="block">f^{(n)}(x) = \frac{d^n f}{dx^n}.</math> When the independent variable {{mvar|x}} itself is permitted to depend on other variables, then the expression becomes more complicated, as it must include also higher order differentials in {{mvar|x}} itself. Thus, for instance, <math display="block"> \begin{align} d^2 y &= f''(x)\,(dx)^2 + f'(x)d^2x \\[1ex] d^3 y &= f'''(x)\, (dx)^3 + 3f''(x)dx\,d^2x + f'(x)d^3x \end{align}</math> and so forth.

Similar considerations apply to defining higher order differentials of functions of several variables. For example, if {{math|''f''}} is a function of two variables {{mvar|x}} and {{mvar|y}}, then <math display="block">d^nf = \sum_{k=0}^n \binom{n}{k}\frac{\partial^n f}{\partial x^k \partial y^{n-k}}(dx)^k(dy)^{n-k},</math> where <math display="inline">\binom{n}{k}</math> is a binomial coefficient. In more variables, an analogous expression holds, but with an appropriate multinomial expansion rather than binomial expansion.<ref>{{harvnb|Goursat|1904|loc=I, §14}}</ref>

Higher order differentials in several variables also become more complicated when the independent variables are themselves allowed to depend on other variables. For instance, for a function {{math|''f''}} of {{mvar|x}} and {{mvar|y}} which are allowed to depend on auxiliary variables, one has <math display="block">d^2f = \left(\frac{\partial^2f}{\partial x^2} (dx)^2 + 2\frac{\partial^2f}{\partial x\partial y} dx\,dy + \frac{\partial^2f}{\partial y^2} (dy)^2\right) + \frac{\partial f}{\partial x}d^2x + \frac{\partial f}{\partial y} d^2y.</math>

Because of this notational awkwardness, the use of higher order differentials was roundly criticized by {{harvtxt|Hadamard|1935}}, who concluded: {{blockquote|Enfin, que signifie ou que représente l'égalité <math display="block">d^2z = r\,dx^2 + 2s\,dx\,dy + t\,dy^2\,?</math> A mon avis, rien du tout.}}

That is: ''Finally, what is meant, or represented, by the equality [...]? In my opinion, nothing at all.'' In spite of this skepticism, higher order differentials did emerge as an important tool in analysis.<ref>In particular to infinite dimensional holomorphy {{harv|Hille|Phillips|1974}} and numerical analysis via the calculus of finite differences.</ref>

In these contexts, the {{mvar|n}}-th order differential of the function {{math|''f''}} applied to an increment {{math|Δ''x''}} is defined by <math display="block">d^nf(x,\Delta x) = \left.\frac{d^n}{dt^n} f(x+t\Delta x)\right|_{t=0}</math> or an equivalent expression, such as <math display="block">\lim_{t\to 0}\frac{\Delta^n_{t\Delta x} f}{t^n}</math> where <math>\Delta^n_{t\Delta x} f</math> is an ''n''th forward difference with increment {{math|''t''Δ''x''}}.

This definition makes sense as well if {{math|''f''}} is a function of several variables (for simplicity taken here as a vector argument). Then the {{mvar|n}}-th differential defined in this way is a homogeneous function of degree {{mvar|n}} in the vector increment {{math|Δ''x''}}. Furthermore, the Taylor series of {{math|''f''}} at the point {{mvar|x}} is given by <math display="block">f(x+\Delta x) \sim f(x) + df(x,\Delta x) + \frac{1}{2}d^2f(x,\Delta x) + \cdots + \frac{1}{n!} d^n f(x,\Delta x) + \cdots</math> The higher order Gateaux derivative generalizes these considerations to infinite dimensional spaces.

==Properties== A number of properties of the differential follow in a straightforward manner from the corresponding properties of the derivative, partial derivative, and total derivative. These include:<ref>{{harvnb|Goursat|1904|loc=I, §17}}</ref>

* Linearity: For constants {{math|''a''}} and {{math|''b''}} and differentiable functions {{math|''f''}} and {{math|''g''}}, <math display="block">d(af+bg) = a\,df + b\,dg.</math> * Product rule: For two differentiable functions {{math|''f''}} and {{math|''g''}}, <math display="block">d(fg) = f\,dg+g\,df.</math>

An operation {{math|''d''}} with these two properties is known in abstract algebra as a derivation. They imply the power rule <math display="block"> d( f^n ) = n f^{n-1} df </math> In addition, various forms of the chain rule hold, in increasing level of generality:<ref>{{harvnb|Goursat|1904|loc=I, §§14,16}}</ref>

* If {{math|1=''y'' = ''f''(''u'')}} is a differentiable function of the variable {{mvar|u}} and {{math|1=''u'' = ''g''(''x'')}} is a differentiable function of {{mvar|x}}, then <math display="block">dy = f'(u)\,du = f'(g(x))g'(x)\,dx.</math> * If {{math|1=''y'' = ''f''(''x''<sub>1</sub>, ..., ''x''<sub>''n''</sub>)}} and all of the variables&nbsp;{{math|''x''<sub>1</sub>, ..., ''x''<sub>''n''</sub>}} depend on another variable&nbsp;{{mvar|t}}, then by the chain rule for partial derivatives, one has <math display="block">\begin{align} dy = \frac{dy}{dt} dt &= \frac{\partial y}{\partial x_1} dx_1 + \cdots + \frac{\partial y}{\partial x_n} dx_n\\[1ex] &= \frac{\partial y}{\partial x_1} \frac{dx_1}{dt} \, dt + \cdots + \frac{\partial y}{\partial x_n} \frac{dx_n}{dt} \, dt. \end{align}</math> Heuristically, the chain rule for several variables can itself be understood by dividing through both sides of this equation by the infinitely small quantity {{math|''dt''}}. * More general analogous expressions hold, in which the intermediate variables {{math|''x''<sub>''i''</sub>}} depend on more than one variable.

==General formulation== {{See also|Fréchet derivative|Gateaux derivative}} A consistent notion of differential can be developed for a function {{math|''f'' : '''R'''<sup>''n''</sup> → '''R'''<sup>''m''</sup>}} between two Euclidean spaces. Let {{math|'''x''',Δ'''x''' ∈ '''R'''<sup>''n''</sup>}} be a pair of Euclidean vectors. The increment in the function {{math|''f''}} is <math display="block">\Delta f = f(\mathbf{x}+\Delta\mathbf{x}) - f(\mathbf{x}).</math> If there exists an {{math|''m'' × ''n''}} matrix {{mvar|A}} such that <math display="block">\Delta f = A\Delta\mathbf{x} + \|\Delta\mathbf{x}\|\boldsymbol{\varepsilon}</math> in which the vector {{math|'''''ε''''' → 0}} as {{math|Δ'''x''' → 0}}, then {{math|''f''}} is by definition differentiable at the point {{math|'''x'''}}. The matrix {{mvar|A}} is sometimes known as the Jacobian matrix, and the linear transformation that associates to the increment {{math|Δ'''x''' ∈ '''R'''<sup>''n''</sup>}} the vector {{math|''A''Δ'''x''' ∈ '''R'''<sup>''m''</sup>}} is, in this general setting, known as the differential {{math|''df''(''x'')}} of {{math|''f''}} at the point {{mvar|x}}. This is precisely the Fréchet derivative, and the same construction can be made to work for a function between any Banach spaces.

Another fruitful point of view is to define the differential directly as a kind of directional derivative: <math display="block">df(\mathbf{x},\mathbf{h}) = \lim_{t\to 0}\frac{f(\mathbf{x}+t\mathbf{h})-f(\mathbf{x})}{t} = \left.\frac{d}{dt} f(\mathbf{x}+t\mathbf{h})\right|_{t=0},</math> which is the approach already taken for defining higher order differentials (and is most nearly the definition set forth by Cauchy). If {{mvar|t}} represents time and '''x''' position, then '''h''' represents a velocity instead of a displacement as we have heretofore regarded it. This yields yet another refinement of the notion of differential: that it should be a linear function of a kinematic velocity. The set of all velocities through a given point of space is known as the tangent space, and so {{math|''df''}} gives a linear function on the tangent space: a differential form. With this interpretation, the differential of {{math|''f''}} is known as the exterior derivative, and has broad application in differential geometry because the notion of velocities and the tangent space makes sense on any differentiable manifold. If, in addition, the output value of {{math|''f''}} also represents a position (in a Euclidean space), then a dimensional analysis confirms that the output value of ''df'' must be a velocity. If one treats the differential in this manner, then it is known as the pushforward since it "pushes" velocities from a source space into velocities in a target space.

==Other approaches== {{Main|Differential (infinitesimal)}} Although the notion of having an infinitesimal increment {{math|''dx''}} is not well-defined in modern mathematical analysis, a variety of techniques exist for defining the infinitesimal differential so that the differential of a function can be handled in a manner that does not clash with the Leibniz notation. These include:

* Defining the differential as a kind of differential form, specifically the exterior derivative of a function. The infinitesimal increments are then identified with vectors in the tangent space at a point. This approach is popular in differential geometry and related fields, because it readily generalizes to mappings between differentiable manifolds. * Differentials as nilpotent elements of commutative rings. This approach is popular in algebraic geometry.<ref>{{Harvnb|Eisenbud|Harris|1998}}.</ref> * Differentials in smooth models of set theory. This approach is known as synthetic differential geometry or smooth infinitesimal analysis and is closely related to the algebraic geometric approach, except that ideas from topos theory are used to ''hide'' the mechanisms by which nilpotent infinitesimals are introduced.<ref>See {{Harvnb|Kock|2006}} and {{Harvnb|Moerdijk|Reyes|1991}}.</ref> * Differentials as infinitesimals in hyperreal number systems, which are extensions of the real numbers which contain invertible infinitesimals and infinitely large numbers. This is the approach of nonstandard analysis pioneered by Abraham Robinson.<ref name="nonstd">See {{Harvnb|Robinson|1996}} and {{Harvnb|Keisler|1986}}.</ref>

== Examples and applications == Differentials may be effectively used in numerical analysis to study the propagation of experimental errors in a calculation, and thus the overall numerical stability of a problem {{harv|Courant|1937a}}. Suppose that the variable {{mvar|x}} represents the outcome of an experiment and {{mvar|y}} is the result of a numerical computation applied to ''x''. The question is to what extent errors in the measurement of {{mvar|x}} influence the outcome of the computation of ''y''. If the {{mvar|x}} is known to within Δ''x'' of its true value, then Taylor's theorem gives the following estimate on the error Δ''y'' in the computation of ''y'': <math display="block">\Delta y = f'(x)\Delta x + \frac{(\Delta x)^2}{2}f''(\xi)</math> where {{math|1=''ξ'' = ''x'' + ''θ''Δ''x''}} for some {{math|0 < ''θ'' < 1}}. If {{math|Δ''x''}} is small, then the second order term is negligible, so that Δ''y'' is, for practical purposes, well-approximated by {{math|1=''dy'' = ''f'''(''x'') Δ''x''}}.

The differential is often useful to rewrite a differential equation <math display="block"> \frac{dy}{dx} = g(x) </math> in the form <math display="block"> dy = g(x)\,dx, </math> in particular when one wants to separate the variables.

==Notes== <references/>

== See also == * Notation for differentiation

== References == {{sfn whitelist|CITEREFTolstov2001}} *{{Citation | last1=Boyer | first1=Carl B. | author1-link=Carl Benjamin Boyer | title=The history of the calculus and its conceptual development | publisher=Dover Publications | location=New York | mr=0124178 | year=1959}}. *{{citation|first=Augustin-Louis|last=Cauchy|author-link=Augustin-Louis Cauchy|chapter=<!--Quatrième leçon: Différentialles des fonctions d'une seule variable-->|title=Résumé des Leçons données à l'Ecole royale polytechnique sur les applications du calcul infinitésimal|year=1823|url=http://math-doc.ujf-grenoble.fr/cgi-bin/oeitem?id=OE_CAUCHY_2_4_9_0|access-date=2009-08-19|archive-url=https://web.archive.org/web/20070708104336/http://math-doc.ujf-grenoble.fr/cgi-bin/oeitem?id=OE_CAUCHY_2_4_9_0|archive-date=2007-07-08|url-status=dead}}. *{{Citation | last1=Courant | first1=Richard |author-link=Richard Courant | title=Differential and integral calculus. Vol. I | publisher=John Wiley & Sons | location=New York | series=Wiley Classics Library | isbn=978-0-471-60842-4 | mr=1009558 | year=1937a|publication-date=1988}}. *{{Citation | last1=Courant | first1=Richard | author-link=Richard Courant |title=Differential and integral calculus. Vol. II | publisher=John Wiley & Sons | location=New York | series=Wiley Classics Library | isbn=978-0-471-60840-0 | mr=1009559 | year=1937b|publication-date=1988}}. *{{Citation | last1=Courant | first1=Richard | author-link1=Richard Courant| last2=John | first2=Fritz |author-link2=Fritz John| title=Introduction to Calculus and Analysis Volume 1|series=Classics in Mathematics| publisher=Springer-Verlag | location=Berlin, New York | isbn=3-540-65058-X | year=1999 | mr=1746554 }} * {{Citation| author1-link=David Eisenbud|first1=David|last1=Eisenbud|author2-link=Joe Harris (mathematician)|first2=Joe|last2=Harris| year = 1998 |title = The Geometry of Schemes| publisher = Springer-Verlag| isbn = 0-387-98637-5}}. *{{Citation | last1=Fréchet | first1=Maurice | author1-link= Maurice Fréchet | title=La notion de différentielle dans l'analyse générale | mr=1509268 | year=1925 | journal=Annales Scientifiques de l'École Normale Supérieure |series=Série 3 | issn=0012-9593 | volume=42 | pages=293–323| doi=10.24033/asens.766 | doi-access=free }}. *{{Citation | last1=Goursat | first1=Édouard | author-link=Édouard Goursat|title=A course in mathematical analysis: Vol 1: Derivatives and differentials, definite integrals, expansion in series, applications to geometry| publisher=Dover Publications | location=New York | others= E. R. Hedrick | mr=0106155 | year=1904 | publication-date=1959|url=https://archive.org/details/coursemathanalys01gourrich}}. *{{citation|last=Hadamard|first=Jacques|author-link=Jacques Hadamard|title=La notion de différentiel dans l'enseignement|journal=Mathematical Gazette|volume=XIX|year=1935|issue=236|pages=341–342|doi=10.2307/3606323 |jstor=3606323}}. *{{Citation | last1=Hardy | first1=Godfrey Harold | author1-link=G. H. Hardy | title=A Course of Pure Mathematics | publisher=Cambridge University Press | isbn=978-0-521-09227-2 | year=1908}}. *{{Citation | last1=Hille | first1=Einar | author-link1=Einar Hille | last2=Phillips | first2=Ralph S. | author-link2=Ralph Phillips (mathematician) | title=Functional analysis and semi-groups | publisher=American Mathematical Society | location=Providence, R.I. | mr=0423094 | year=1974}}. *{{Citation | last1=Itô | first1=Kiyosi |author-link=Kiyosi Itô | title=Encyclopedic Dictionary of Mathematics | publisher=MIT Press | edition=2nd | isbn=978-0-262-59020-4 | year=1993}}. *{{citation|chapter=Chapter 13: Differentials and the law of the mean|title=Calculus: An intuitive and physical approach|first=Morris|last=Kline|author-link=Morris Kline|publisher=John Wiley and Sons|year=1977}}. *{{Citation | last1=Kline | first1=Morris | author1-link=Morris Kline | title=Mathematical thought from ancient to modern times | year=1972 | publisher=Oxford University Press | edition=3rd | isbn=978-0-19-506136-9 | publication-date=1990 | url-access=registration | url=https://archive.org/details/mathematicalthou00klin }} * {{Citation |author-link=Howard Jerome Keisler|first=H. Jerome|last=Keisler|title=Elementary Calculus: An Infinitesimal Approach|edition=2nd|year=1986|url=http://www.math.wisc.edu/~keisler/calc.html}}. * {{Citation | first=Anders|last= Kock|url=https://users-math.au.dk/kock/sdg99.pdf|title= Synthetic Differential Geometry|publisher= Cambridge University Press|edition= 2nd|year=2006}}. * {{Citation | last1=Moerdijk|first1= I.|author-link1=Ieke Moerdijk|last2=Reyes|first2=G.E.|title=Models for Smooth Infinitesimal Analysis|publisher= Springer-Verlag|year= 1991}}. * {{Citation | last1=Robinson | first1=Abraham | author1-link=Abraham Robinson | title=Non-standard analysis | publisher=Princeton University Press | isbn=978-0-691-04490-3 | year=1996}}. *{{springer|id=D/d031810|title=Differential|first=G.P.|last=Tolstov}}.

==External links== *[http://demonstrations.wolfram.com/DifferentialOfAFunction/ Differential Of A Function] at Wolfram Demonstrations Project

{{DEFAULTSORT:Differential Of A Function}} Category:Differential calculus Category:Generalizations of the derivative Category:Linear operators in calculus