Jump to content

Derivative - inverse function – "Math for Non-Geeks"

From Wikibooks, open books for an open world


In the following article we will investigate the conditions under which the inverse function of a bijective function is differentiable at one point. We will also derive a formula with which we can explicitly determine the derivative of the inverse function. The practical thing about this formula is that it allows us to determine the derivative at certain points, even if we do not know the inverse function explicitly or it is insanely difficult.

Motivation

[edit | edit source]

Let us first consider a linear function as an example. For this it is very easy to determine the derivative of the inverse function. Non-constant linear functions are bijective and therefore invertible on . In this case we can calculate the inverse function explicitly and differentiate it. Concretely we choose with . The inverse function is

is differentiable on and for all .

Let us next consider the function . Here we have to be careful, because it is not injective on all of and therefore not invertible. But if we restrict the domain of definition to , then is bijective. The inverse function is the square root function

For differentiability we have to consider another thing: is not differentiable at . We can show this by examining the differential quotient. Or we consider the following:

Since the root function is the inverse function of the square function , there is . At zero there is thus in particular

If now was differentiable at 0, then the chain rule would yield

So cannot be differentiable at 0. However, on , the function is differentiable, and there is

This example shows that in case , it may happen that is not differentiable although is differentiable everywhere.

In the two examples it was relatively easy to determine the derivative of the inverse function directly (it was a polynomial). But what about more complicated functions, for example as an inverse function of ? Here we cannot simply calculate the derivative of the inverse function, if only derivatives of exponentials and polynomials are known. It may even occur that a bijective function cannot be inverted explicitly. In these cases it would be good to have a general formula with which we can determine the derivative of from the derivative of . If we look again at the derivative from the second example, we may see the following:

Since there is for all and for all . In the first example (straight lines), there is also

Can this be chance? Actually, it's not: the formula is valid for a general. Consider being differentiable at and being differentiable at . By definition of the inverse function,

for all . Now we take the derivative and obtain by the chain rule:

Here we have used that in and in are differentiable. Now we divide on both sides by (note: this only possible if the expression is not equal to zero), and get

or equivalently

So the formula also holds in general under certain conditions. Now the question is, under which conditions at the derivative of exists.

  • On the one hand the must exist. This is exactly the case if is bijective, which is exactly the case if is surjective and strictly monotone.
  • As we have seen above, must be differentiable in the point with .
  • We will see that we need one more condition, namely that is continuous in . If the domain of definition of is an interval, then this is always fulfilled according to the theorem about continuity of the inverse function.

These are the conditions necessary for our formula to hold. Let's put it into a theorem:

Theorem: derivative of the inverse function

[edit | edit source]

Theorem and proof

[edit | edit source]

Theorem (Derivative of the inverse function)

Let and be an interval. further, let be a surjective and strictly monotone function, which is differentiable in where . Then, has an inverse function , which is differentiable at and there is:

Remarks:

  • The surjectivity of is equivalent to .
  • If is differentiable on all of , then according to the monotonicity criterion the strict monotonicity can be seen most easily by or .
  • As we have seen above with the derivative of the square root function in , the condition must not be omitted under any circumstances. Otherwise, it produces "infinite derivatives", which are not well-defined!
  • The theorem also holds if is not an interval. But then it must be demanded additionally that in is continuous. Furthermore, and must be accumulation points of and respectively.
  • If is additionally continuous, then by continuity of the inverse function it follows that is an interval.

Summary of proof (Derivative of the inverse function)

First of all we justify that exists. Then we conclude by the theorem about the continuity of the inverse function that is continuous. We show that the differential quotient exists and has the value . That is, that for every sequence with there is .

Proof (Derivative of the inverse function)

is surjective and strictly monotone, i.e. bijective. So the inverse function exists. Since we have assumed that is an interval, the theorem about the continuity of the inverse function implies that is continuous on . There is thus with . Let now be a sequence in with , then there is

Hence, is differentiable in and there is .

Alternative proof (Derivative of the inverse function)

Another way of proof is given by an equivalent characterization of the derivative: is differentiable in if and only if there is a function continuous at with

If this is the case, then . Since by assumption, and is strictly monotone, follows for all . If we now set and , the above equation is

This is now equivalent to

Since and are continuous at we also get continuity of at . If we now use again the equivalent characterization of continuity, it follows from the last equation that is differentiable in with

Memory rule and visualization

[edit | edit source]
Graph of with derivative
Graph of with derivative

Using Leibniz's notation for the derivative, the formula of the derivative of the inverse function can be illustrated by a simple fraction-swap trick: For and there is

We can also visualize the formula graphically: If the function is differentiable at , then corresponds to the slope of the tangent to the graph in . Hence,

We now obtain the graph of the inverse function in two steps:

  1. First we have to rotate the graph of by (clockwise or counter-clockwise). The resulting graph has the slope at the point , because the tangent at this point is perpendicular to the original tangent.
  2. Then we have to mirror the graph (horizontally or vertically). The sign of the tangent gradient is reversed.

Altogether we get

Extension to the whole domain

[edit | edit source]

The converse of the theorem also holds:

Theorem (Converse of the theorem about inverse function derivative)

Let and be an interval. Further, let be a surjective, strictly monotone function, which is differentiable at . If further, the inverse function is differentiable at , then there is: and

Proof (Converse of the theorem about inverse function derivative)

The proof works with the trick from the introduction. For all we have

Under the above conditions, the left-hand side is differentiable at (chain rule) with

Because 0 has no divisor (other than 0) in , there must be and we get

Let us now additionally demand in the original theorem that is differentiable on all of with . Then we can determine the derivative function of on all of :

Theorem (Derivative of the inverse function)

Let and be an interval. Further, let be a surjective, differentiable, strictly monotone function with for all . Then has a differentiable inverse function, whose derivative is given by:

Examples

[edit | edit source]

Example (linear functions)

Let , and

a linear function. Then is surjective and strictly monotonically increasing, if , and strictly monotonically decreasing, if . Furthermore, is differentiable on all of with derivative . According to the theorem about the derivative of the inverse function there is thus for all

We could also have calculated this directly, as above.

Example (Root functions)

Let for

Then is differentiable and has the derivative . So it is monotonically increasing. Furthermore, is surjective. The inverse function is the -th root function

For every our theorem now yields

.

If is odd, then the formula holds even for all .

Example (Logarithmic functions)

Let us look at the exponential function

We have learned that . So the function is differentiable, and because of strictly monotonically increasing. Furthermore, is surjective. The inverse function is the (natural) logarithm function

Our theorem now implies for :

Exercises

[edit | edit source]

Math for Non-Geeks: Template:Aufgabe

Math for Non-Geeks: Template:Aufgabe