21  Definition & Properties

Highlights of this Chapter: we prove many foundational theorems about the derivative that one sees in an early calculus course. We see how to take the derivative of scalar multiples, sums, products, quotients and compositions. We also compute - directly from the definition - the derivative of exponential functions. This leads to an important discovery: there is a unique simplest, or natural exponential, whose derivative is itself. This is the origin of e in Analysis.

Finally - on to some calculus! Here we will define the derivative, and study its properties. This may sound daunting at first, remembering back to the days of calculus when it all seemed so new and advanced. But hopefully, after so much exposure to sequences and series during this course, the rigorous notion of a derivative will feel more just like a nice application of what we’ve learned, than a whole new theory.

21.1 Difference Quotients

The derivative is defined to capture the slope of a graph at a point. Elementary algebra tells us we can compute the slope of a line given two points as rise over run, and so we can compute the slope of a secant line of a function between the points a,t as

f(t)f(a)ta

The derivative is the limit of this, as ta:

Definition 21.1 (The Derivative) Let f be a function defined on an open interval containing a. Then f is differentiable at a if the following limit of difference quotients exists. In this case, we define the limiting value to be the derivative of f at a. f(a)=Df(a)=limtaf(t)f(a)ta

Exercise 21.1 (Equivalent Formulation) Prove that we may alternatively use the following limit definition to calculate the derivative: f(a)=limh0f(a+h)f(a)h

Example 21.1 The function f(x)=x2 is differentiable at x=2.

This is a classic problem from calculus 1, whose argument is already pretty much rigorous! We wish to compute the limit

limx2x24x2

So, we choose an arbitrary sequence xn with xn2 but xn2 and compute

limxn24xn2=lim(xn+2)(xn2)xn2=limxn+2

Where the arithmetic is justified since xn2 for all n by definition, so everything is defined. But now, as xn2 we can just use the limit laws to see

limxn+2=2+2=4

Since xn was arbitrary, this holds for all such sequences, so the limit exists and equals 4. Because this limit defines the derivative, we have that f is differentiable at 2 and

f(2)=4

Exercise 21.2 Compute the derivative of f(x)=x3 at an arbitrary point aR, directly from the definition and show f(a)=3a2.

As defined above, the derivative is a limit ta, which depends on values of t both greater than and less than a. But sometimes its useful to have a notion of the derivative that only cares about one sided limits (for instance, when computing the slope at the end of an interval). We give the analogous definition below

Definition 21.2 (One Sided Derivatives) Let f be a function defined at a; then its 1-sided derivatives are defined by the following limits, when they exist

D+f(a)=limta+f(t)f(a)ta Df(a)=limtaf(t)f(a)ta

This definition, together with our previous work on limits, implies that a function f is differentiable if and only if its two one sided derivatives exist and are equal. This is useful in practice, for instance in showing the non-differentiability of the absolute value:

Exercise 21.3 Show that f(x)=|x| is not differentiable at x=0.

21.2 Derivative as a Function

So far we have been discussing the derivative at a point as a number; the result of a limiting process. But we can let this point vary, and produce a function taking in x and outputting the derivative at x:

Definition 21.3 (The Function f) Let f be a function, and suppose that the derivative of f exists at each point of a set DR. Then we may define a function f:DR by

f:xf(x)=limtxf(t)f(x)tx

If f is continuous, f is called continuously differentiable on D.

For example, f(x)=x3 is continuously differentiable on R since by we see its derivative is the function x3x2, and this is a polynomial: we proved all polynomials are continuous.

Since the derivative of a function yields another function, we can look at iterating this process to produce higher derivatives

Definition 21.4 (nth Derivatives) Given a differentiable function f, the second derivative f is defined as the derivative of f. A function is twice differentiable at x if limxaf(x)f(a)xa exists. Continuing inductively, we define the nth derivative of a function at a as the derivative of the n1st derivative of f at a.

We will use the prime notation for small numbers of derivatives, like f(x), f(x) and f(x). For higher derivatives it is traditional to denote via the number of derivatives in parentheses: f(2)=f, f(3)=f and so on; so f(47) for the 47th derivative of f.

Exercise 21.4 (A Difference Quotient for 2nd Derivative) If f is twice differentiable at a, show that f(a)=limh0f(a+2h)2f(a+h)+f(a)h2

Find a limit depending only on f (not f or f) which computes the third derivative

Its useful to have a notation for functions which admit k derivatives, we say a function is Ck if you can differentiate it k times (but not necessairly k+1 times). And, we call a function smooth if you can differentiate it n times for any nN. The set of smooth functions is denoted C.

21.2.1 Continuity

Before jumping in we prove one small oft-useful result often not mentioned in a calculus class, relating differentiability to continuity.

Theorem 21.1 (Differentiable implies Continuous) Let f be differentiable at aR. Then f is continuous at a.

Proof. Since f is differentiable at a, we know the limit of the difference quotient is finite limxaf(x)f(a)xa=f(a) We also know that limxa(xa)=0$ So, using the limit theorems we may multiply these together and get what we want. Precisely, let xna be any sequence with xna for all n. Then we have

0=(0)(f(a))=(limxna)(limf(xn)f(a)xna)=lim((xna)f(xn)f(a)xna)=lim(f(xn)f(a))

Thus lim(f(xn)a)=0 so by the limit theorems we see limf(xn)=a. Since xn was arbitrary with xna this holds for any such sequence, we see that f is continuous at a using the sequence definition.

Remark 21.1. There is a little gap not explicitly spelled out at the end of the proof above, that we should fill in now (to assure ourselves this style of reasoning always works). We just proved that for sequences xna the property we want holds, but continuity requires this fact for all arbitrary sequences. How do we bridge this gap? Let yna be an arbitrary sequence: then we split into the subsequences xna and the subsequence of all terms =a. If either of these is finite, we can just truncate the original sequence at a point past which all terms are of one or the other: each of these has limf(xn)=f(a) so we are done. In the case that both are infinite, we just use that we have separated our sequence into a union of two subsequences, each with the same limit! Thus the overall limit exists.

Thus continuous functions must be differentiable, but what can we say about the derivative itself? If a function is everywhere differentiable must the derivative itself be continuous? In fact not, as the following example shows

Example 21.2 While its hard to imagine a function that is differentiable at every point but not continuously differentiable such things exist. For example f(x)={x2sin(1x2)x00x=0

Its possible to find a formula for f(x) when x0, and show that limx0f(x) does not exist (we will do this later). However one can also calculate directly the derivative at zero: and find f(0)=0. This means limx0f(x)f(limx0x) as one side does not exist and the other is zero: thus f is not continuous at 0.

Exercise 21.5 For f(x) as above in , calculate f(0) directly using the limit definition. (Perhaps surprisingly, all you need to know about the sine function here is that it is bounded between 1 and 1!)

21.3 Field Operations

Here we prove the ‘derivative laws’ of Calculus I:

21.3.1 Sums and Multiples

Theorem 21.2 (Differentiating Constant Multiples) Let f be a function and cR. Then if f is differentiable at a point aR so is cf, and (cf)(a)=c(f(a))

Proof. Let’s use the difference quotient with a+hn to change things up: Let hn0 be arbitrary, and we wish to compute the limit limcf(a+hn)cf(a)hn By the limit laws we can pull out the constant c, and the remainder converges to f(a), as f is assumed to be differentiable at a.

=climf(a+hn)f(a)hn=cf(a)

Because this is true for all sequences hn0 with hn0, the limit exists, and equals cf(a).

Theorem 21.3 (Differentiating Sums) Let f,g be functions which are both differentiable at a point aR. Then f+g is also differentiable at a, and (f+g)(a)=f(a)+g(a)

Exercise 21.6 Prove the differentiability rule for sums.

21.3.2 Products and Quotients

Theorem 21.4 (Differentiating Products) Let f,g be functions which are both differentiable at a point aR. Then fg is differentiable at a and

(fg)(a)=f(a)g(a)+f(a)g(a)

Proof. Let f,g be differentiable at aR, and choose an arbitrary sequence ana. Then we wish to compute

limf(an)g(an)f(a)g(a)ana

To the numerator we add 0=f(an)g(a)f(an)g(a) and regroup with algebra:

=limf(an)g(an)f(an)g(a)+f(an)g(a)f(a)g(a)ana =limf(an)g(an)f(an)g(a)ana+f(an)g(a)f(a)g(a)ana

Using the limit laws, we can take each of these limits individually so long as they exist (which we will show they do). But even more, note that the first term has a common factor of f(an) in the numerator that can be factored out, and the second a common factor of g(a). Thus, by the limit laws, we see

=(limf(an))(limg(an)g(a)ana)+g(a)(f(an)f(a)ana)

Because f is differentiable at a, its continuous at a, and so we know limf(an)=f(a). The other two limits above converge to the derivatives f(a) and g(a) respectively. Thus, alltogether we find the resulting limit to be

f(a)g(a)+f(a)g(a)

As this was the result for an arbitrary sequence ana with ana, it must be the same for all sequences, meaning the limit exists, and

(fg)(a)=f(a)g(a)+f(a)g(a)

Exercise 21.7 Let f be a function and aR be a point such that f(a)0 and f is differentiable at a. Prove that 1/f is also differentiable at a and (1f)(a)=f(a)f(a)2

Theorem 21.5 (Differentiating Quotients) Let f,g be a functions which are differentiable at a point aR and assume g(a)0. Then the function f/g is also differentiable at a and (fg)(a)=f(a)g(a)f(a)g(a)g(a)2

Exercise 21.8 Use the Reciprocal Rule and Product Rule to prove the quotient rule.

21.4 Compositions and Inverses

21.4.1 The Chain Rule

Theorem 21.6 (The Chain Rule) If g(x) is differentiable at aR and f(x) is differentiable at g(a) then the composition fg is differentiable at a, with (fg)(a)=f(g(a))g(a)

Proof (Wish this Worked!). We are taking the derivative at a, so let xna wtih xna be arbitrary. Then the limit defining [f(g(a))] is

limf(g(xn))f(g(a))xna

We multiply the numerator and denominator of this fraction by $g(xn)g(a) and regroup:

f(g(xn))f(g(a))xna=f(g(xn))f(g(a))xnag(xn)g(a)g(xn)g(a)=limf(g(xn))f(g(a))g(xn)g(a)g(xn)g(a)xna

Because g is continuous at a, we know g(xn)a, and because f is differentiable at g(a) we recognize the first term here as the limit defining f at g(a)! Since the second term is the limit defining the derivative of g, both of these exist by our assumptions, and so by the limit theorems we can compute

=(limf(g(xn))f(g(a))g(xn)g(a))(limg(xn)g(a)xna) =f(g(a))g(a)

Unfortunately, this proof fails at one crucial step! Wile we do know that xna0 (in the definition of limxa, we only choose sequences xna with xna) we do not know that the other denominator g(xn)g(a) is nonzero.

If this problem could only happen finitely many times it would be no trouble - we could just truncate the beginning of our sequence and rest assured we had not affected the value of the limit. But functions - even differentiable functions - can be pretty wild. The function x2sin(1/x) (from ) ends up equaling zero infinitely often in any neighborhood of zero! So such things are a real concern.

Happily the fix - while tedious - is straightforward. It’s given below.

Exercise 21.9 We define the auxiliary function d(y) as follows:

d(y)={f(y)f(g(a))yg(a)yg(a)f(g(a))y=g(a)

This function equals our problematic difference quotient most of the time, but equals the quantity we want it to be when the denominator is zero.

Prove that d is continuous at g(c) and we may use d in place of the difference quotient in our computation: that for all xa, the following equality holds:

f(g(x))f(g(a))xa=d(g(x))g(x)g(a)xa

Given this, the original proof is rescued:

Proof. We are taking the derivative at a, so let xna with xna be arbitrary. Then the limit defining [f(g(a))] is (by the exercise)

limf(g(xn))f(g(a))xna=limd(g(xn))g(xn)g(a)xna

Because d is continuous at g(a) and g(xn)g(a) we know d(g(xn))d(g(a))=f(g(a)). And, as g is differentiable at a we know the limit of the difference quotient exists. Thus, by the limit laws we can separate them and

=(limd(g(xn)))(g(xn)g(a)xna)=f(g(a))g(a)

21.4.2 Differentiating Inverses

Theorem 21.7 (Differentiating Inverses) Let f be an invertible function and aR a point where f(a)=b. Assume f is differentiable at a with f(a)0. Then its inverse function f1 is differentiable at b, and (f1)(b)=1f(a)

One may be tempted to prove this using the chain rule, by the following argument: since ff1(x)=x we differentiate to yield (ff1(x))=1 and apply the chain rule to the left hand side, resulting in f(f1(x))(f1)(x)=1

Solving for (f1) and plugging in x=b yields the result. However a more careful review shows doesn’t actually do what we think: in applying the chain rule, we’ve implicitly assumed that f1 is invertible; which is part of what we want to prove! (This proof does go through when we already know f1 to be differentiable, but we are unfortunately not often already in possession of that knowledge). Below we give a direct proof of the theorem from the limit definition, fixing this oversight:

Proof. We attempt to compute the limit defining the derivative for f1: limybf1(y)f1(b)yb. To compute such a limit we choose an arbitrary sequence yny with yny and evaluate limf1(yn)f1(b)ynb By definition b=f(a), and for each n there is a unique xn such that yn=f(xn): making these substitutions yields limf1(f(xn))f1(f(a))f(xn)f(a) The composition f1f is the identity since they are inverse functions so f1(f(xn))=xn and f1(f(a))=a. Making these additional substitutions our limit statement becomes limxnaf(xn)f(a)

By assumption f is differentiable at a and f(a)0, so we know that f(a)=limf(xn)f(a)xna

The limit we are interested in is the reciprocal of this, and as the limit value is nonzero by assumption, the limit laws imply

limxnaf(xn)f(a)=lim1f(xn)f(a)xna=1limf(xn)f(a)xna=1f(a)

Since the sequence yn was arbitrary, this argument holds for any such sequence. Thus the limit defining (f1)(b) exists, and (f1)(b)=1f(a).

Exercise 21.10 Compute the derivative of y=x using this idea.

21.5 The Power Rule

Perhaps the most memorable fact from Calculus I is the power rule, that (xn)=nxn1. In this short section, we prove the power level at various levels of generality, starting with natural number exponents and proceeding to arbitrary real exponents.

Exercise 21.11 (Power Rule: Integer Exponents)  

We can use the chain rule, and the the functional equation for roots to differentiate n^{th}$ roots as well:

Proposition 21.1 If R(x)=x1/n is the nth root function, then R(x)=1nx1n1

Proof. The definition of the nth root function is that R(x)n=x. We differentiate this equation with the chain rule, using that n is a natural number exponent:

(R(x)n)=nR(x)n1R(x) The other side was x, whose derivative is 1. Thus, nR(x)n1R(x)=1 and, solving for R yields

R(x)=1nR(x)n1=1n(x1/n)n1=1nxn1n =1nx1n1

Exercise 21.12 (Power Rule: Rational Exponents) Run a similar argument to the nth root case to prove that if r>0 is rational, then xr is differentiable and (xr)=rxr1.

When it comes to arbitrary real exponents one can use their definition as limits of rational powers, and work to differentiate such a limit. This is possible but requires an exchange of limits, so needs care. Another method is to use the work we’ve already put into understanding exponentials and logarithms to help us out!

21.6 Problems

The pasting lemma has a differentiable analog, which shows exactly when gluing two pieces (like the absolute value) is differentiable, and when its not.

Exercise 21.13 Let f,g be two continuous and differentiable functions with aR a point such that f(a)=g(a). Prove that the piecewise function h(x)={f(x)xag(x)x>a is differentiable at a if and only if f(a)=g(a). (recall we saw such a function is always continuous at a in ?exr-pasting-lemma).

Exercise 21.14 (Differentiable, but The Derivative is Not Continuous) While its hard to imagine a function that is differentiable at every point but not continuously differentiable such things exist. For example f(x)={x2sin(1x2)x00x=0

Assume for the sake of this problem that sin(x) is a differentiable function on the entire real line, and prove that f(x) is differentiable at every nonzero point, using the product/chain rules.

At x=0 this method fails, but we can compute f(0) directly using the limit definition. Do this, and show you get zero. (Perhaps surprisingly, all you need to know about the sine function here is that it is bounded between 1 and 1!)