25  Axioms

Highlights of this Chapter: we give an axiomatic definition of integration, and use these axioms to propose a formula for computing integrals using sums.

The integral is meant to measure the (net) area. When f is positive, for instance, we learn in calculus that abfdx should be the area under f between a and b. That is, it should be the area of the region R={(x,y)R2axb0yf(x)}. But how does one measure area? Perhaps surprisingly, this turns out to be much more difficult than it sounds, and a full resolution took until the beginnings of the 20th century, with the advent of measure theory.

n b Happily, we do not need the full generalities of this theory to introduce the single variable integration theories one first meets in analysis, and we can do something conceptually simpler. Following the example we set with our introduction to the elementary functions, we seek an axiomatic description of what integration is ‘about’, before we demand a procedure to calculate it. In this chapter we carry this out: proposing a simple set of axioms that anything worthy of being called an integral must satisfy, and then use these to produce a formula which can calculate integrals in every case we will need.

25.1 Characterizing Integration

If we are to propose axioms for integration, we must first think carefully about what we expect an integral to be. Looking back to calculus, we recall integration to be a procedure taking a function f and a closed interval [a,b] and producing a number, which we denote abf(x)dx. That is, at its most basic level, an integral is a function taking an interval and a function defined on that interval to a number. To remind ourselves of the calculus notation (while still maintaining something distinct, to help us stay formal), we write this function as

{f,I}IfR

Fixing the interval I, we can think of I{} as a map assigning real valued functions on I to numbers, and fixing f:RR, we may think of {}f as a map assigning intervals to numbers (the integral of f over that interval). From each of these perspectives there are some obvious properties that anything worthy of being called an integral must satisfy:

  • It gets rectangles right: If f(x)=kR is a constant function, its graph is a horizontal line, which encloses a rectangle over an interval I=[a,b]. But we know the area of a rectangle should be its base times its height! So [a,b]k should be k(ba).

  • It gets inequalities right If f(x)g(x) on an interval I, the graph of f lies fully underneath the graph of g, so the net area under f cannot be greater than the area under g. That is, fg on I should imply IfIg.

  • Area is additive Given an interval I=[a,b] one should be able to find the net area under f by breaking the interval into pieces, finding the area under f on each piece, and adding up the results. This is simplest when we consider just two pieces: for c[a,b] we should have [a,b]f=[a,c]f+[c,b]f.

Since these are properties we clearly want an integral to have, we might wish to take them directly as axioms: and basically that is what we will do! But there is a slight subtlety we need to contend with: the collection of all real valued functions contains some wild beasts, and we shouldn’t be so hasty as to assume that it makes sense to measure the area under the curve of every function (indeed, for our theory, it will turn out that the *area under the function which is 1 on the rationals and zero on the irrationals, is undefined). So, instead of insisting these hold for all functions, we will formalize an integral as declaring a subset of functions to be integrable, and only imposing these axioms on that subset.

Definition 25.1 (Axiomatic Integration) For any closed interval J=[a,b] we denote by I(J) the set of integrable functions on J. Then a collection of functions I(J)R is an integral, and denoted fJf if it satisfies the following axioms:

  • If kR then f(x)=k is an element of I([a,b]) for any interval [a,b] and [a,b]k=k(ba).

  • If f,gI([a,b]) and f(x)g(x) for all x[a,b] then [a,b]f[a,b]g

  • If [a,b] is an interval and c(a,b), then fI([a,b]) if and only if fI([a,c]) and fI([c,b]). Furthermore, in this case their values are related by [a,b]f=[a,c]f+[c,b]f

Note these axioms do not aim to uniquely specify an integral, but rather to delineate properties that anything worthy of being called an integral must have. Over the past two centuries, there has been quite a lot of work done studying the possible different integrals - all the different functions that satisfy this definition. However through all this work a beautiful story has emerged: for all their differences, all the various constructions give exactly the same answers for the continuous functions, and those answers can be calculated directly from the axioms themselves! Its this streamlined, abstract thread that we will pursue in this course.

25.1.1 Improper Integrals

We have axiomatized the integral for bounded functions on closed intervals, but the definition can be naturally extended to unbounded intervals and (certain) unbounded functions via limits.

Definition 25.2 (Improper Integrals: Unbounded Intervals) The integral of a bounded function f on a ray [a,) is defined as a limit of its integrals over growing closed intervals [a,)f:=limb[a,b]f with the analogous definition for rays (,b]. The integral over the entire real line is defined by taking each endpoint to ± separately

Rf:=limab[a,b]f

That is, both orders limalimb and limblima exist and are equal.

Definition 25.3 (Improper Integrals: Unbounded Functions) If f is defined on (a,b] and integrable on each subinterval [t,b] for t>a, we define the improper integral on (a,b] as a limit (a,b]f=limta+[t,a]f Similarly for functions unbounded on [a,b) but bounded on each [a,t][a,b), we define [a,b)f=limtb[a,t]f.

If a function defined on [a,b] is unbounded in a neighborhood of some c(a,b) but is integrable on every subinterval missing c, we say the integral on [a,b] exists if and only if the integral on [a,c) and (c,b] both exist, in which case we define it as equal to their sum.

25.2 The Integral as a Function

WHile our definition strictly only gives the value of integrals on one interval at a time, the axioms let us do a bit of work and define the Integral Function F:[a,b] of an integrable function f:[a,b] and study its properties.

Proposition 25.1 (The Integral as a Function) If fI([a,b]) is an integrable function, then there exists a function F:[a,b]R defined by F(x)=[a,x]f

Proof. This is just subdivision at work: for any x[a,b] we may write [a,b]=[a,x][x,b]. Then Axiom III implies that f is integrable on [a,x], and so the number [a,x]f is defined. This assignment describes a real valued function

x[a,x]f

We can learn much from the axioms about this integral function: they imply that when a function is integrable, its integral is continuous!

Theorem 25.1 If fI([a,b]) is a bounded integrable function, then its integral F(x)=[a,x]f is continuous.

Proof. Let f be integrable and bounded by M on [a,b], and set F(x)=[a.x]f. Begin by choosing an ε>0. We will prove something even strong than asked - that f is uniformly continuous by finding a δ>0 where if |yx|<δ we have |F(y)F(x)|<ε. Let’s unpack this a bit: if x<y are two points of [a,b],

F(y)F(x)=[a,y]f[a,x]f

But subdivision (Axiom III) implies F(y)=[a,y]f=[a,x]f+[x,y]f=F(x)+[x,y]f

Thus F(y)F(x) is just the integral of f on the subinterval [x,y][a,b]. Because f is bounded by M we know Mf(x)M. By subdivsion, f is then integrable on every sub-interval I[a,b], and by comparison (Axiom II) this implies M|I|IfM|I|

So, we choose δ=ε/M. This immediately yields what we want, as if |yx|<δ,

ε=Mδ<M|yx|[x,y]fM|yx|<Mδ=ε

Thus |F(y)F(x)|=|[x,y]f,dx|<ε.

Remark 25.1. Of course, the proven result is not really stronger than what was asked, since we began on a closed interval, and we know that continuous on a closed interval implies uniformly continuous.

However, if you look carefully at the proof you see we nowhere used that the original domain was a closed interval! So what we have really proven is that the area function F(x)=[a,b]f is uniformly continuous anytime f is bounded!

As defined, the integral is only a function of x for x greater than the chosen starting endpoint. While this is what is desired in many applications, its also useful to be able to extend the definition to make sense for x below the starting point as well. This is the integral often met in calculus, which we call the oriented integral as it changes sign when the interval from a to x is traced backwards.

Definition 25.4 (The Oriented Integral) Given a function f which is integrable on the interval between a and b, the oriented integral of f is denoted abf and equals abf={[a,b]fab[b,a]fb<a

Corollary 25.1 Given any function f which is integrable on an interval [a,b], for any c[a,b] the oriented integral defines a function [a,b]R by xcxf

25.3 How to Compute?

Just as we can use these axioms to prove theorems about any possible integral, we can also use it to compute values. We content ourselves with a simple example here, to illustrate, and follow with a more general discussion.

25.3.1 Integrating f(x)=x

Given the function f(x)=x, lets temporarily assume that we have some integral satisfying the axioms above, and also that this integral considers x to be integrable on the interval [0,1]. From here, it turns out the axioms unambiguously determine its value, allowing us to prove

Theorem 25.2 If is any integral for which f(x)=x is integrable, then [0,1]x=12.

We proceed in steps. First, note that on the interval [0,1] we know (by definition) that f(x)=x is between 0 and 1. So, by axiom 2 0x1[0,1]0[0,1]x[0,1]1

The upper and lower bounds here are constants, and so we can evaluate their integrals by axiom 1, giving

0[0,1]x1

This of course is a pretty terrible estimate; but we can easily use the same ideas to do better! Indeed, since we’ve assumed that x is integrable on [0,1] axiom 3 ensures us that its also integrable on I1=[0,1/2] and I2=[1/2,1] (or any other subintervals, for that matter). On these smaller intervals, we have better understanding of the behavior of f(x), and thus better estimates:

xI10f(x)=x1/2xI21/2f(x)=x1

To each (constant!) bound we can apply axiom 1 to integrate, it and then apply axiom 2 to ensure the inequality is preserved. Thus

0=I10I1xI112=14 14=I212I2xI21=12

Adding these two intequalities gives 14I1x+I2x34, but then using axiom 2 we recognize that the sum in the middle is just the subdivision of [0,1]x at 1/2. Thus we have

14[0,1]x34

Of course we can do better. If we divide [0,1] into N intervals of equal length 1/N (say Ii=[(i1)/N,i/N]) then by using the subdivision axiom inductively, we find f(x)=x is integrable on each, and

[0,1]x=I1x+[1/n,1]x=I1x+I2x+[2/n,1]x==i=1NIix

On each of these intervals Ii we can easily bound f(x)=x:since its monotone increasing, its smallest value is its left endpoint and its largest value is its right endpoint: (i1)/Nxi/N. Thus, by axioms 1 and 2

i1N2=Iii1NIixIiiN=iN2

Summing up these inequalities, and recalling [0,1]x=i=0NIix, we find i=1Ni1N2[0,1]xi=1NiN2

This inequality must hold for all values of N, which turns out to be enough to completely fix the value of [0,1]x

Exercise 25.1 (Integrating x)  

  • Call the lower estimate LN and the upper estimate UN. Prove that as N, so long as one of these sums converges so does the other, and their values are equal. Thus, the constant sequence [0,1]x is squeezed between LN and UN, so in the limit must also take their common value!

  • Next, prove that UN12 as N *Hint: use previous homework, where we did summation by parts to find a formula for i=1Ni=1+2+3+N.

Throughout this entire calculation we’ve only used the axioms, and the assumption that x is integrable. Thus, we’ve proven a pretty strong result: no matter how you try to precisely define integrals, there is an unambiguous choice for the value of [0,1]x. If its defined at all, it must equal exactly 1/2.

Exercise 25.2 (Integrating x, Part II) Extend the above result to show that if x is integrable on [a,b], then [a,b]x=b22a22 (Its OK if you assume in your proof that 0<a<b to cut down on worrying about negative numbers, the proof of the general case is not much more work)

Hint: first, generalize the work we did together in the book above, from the interval [0,1] to a general interval [0,c] and prove that [0,c]x=c22. Then use the fact that you know this for all c>0 and the subdivision axiom to get what you want.*

Exercise 25.3 (Integrating x2) Prove that if x2 is integrable on [0,c] that its value must be [0,c]x2=c33

Use this to deduce that for any interval [a,b], (feel free to just do the case a,b0) [a,b]x2=b33a33

Hint: follow the similar process to what we did above: using the axioms to bound by sums, and then using the summation by parts formula from earlier in the course to calculate the limit

25.3.2 The Darboux Integral

We can take this kind of reasoning even farther, and propose a means of calculating integrals of arbitrary functions, whenever they are forced to exist by the axioms. The main idea is the same: to estimate an integral we use subdivision to break the domain into smaller and smaller pieces, and then use inequalities to get better and better estimates of f on each piece. It will be useful to give a name to such a subdivision of the interval: a partition.

Definition 25.5 (Partitions) A partition of the interval I=[a,b] is a finite ordered set P={t0,t1,,tn} with a=t0<t1<<tn1<tN=b.

  • N is called the length of the partition
  • We write Pi=[ti,ti+1] for the ith interval of P, and |Pi|=(ti+1ti) for its width.
  • The maxwidth of P is the maximal width of the P’s intervals, maxwidth(P)=max0i<N{|Pi|}.
  • The set of all partitions on a fixed interval I is denoted PI.PI={P:P is a partition of I}

On each partition, we can try to find bounds on the value of our function f. We no longer know the lower bound will occur at the left side and the upper bound at the right side, or even that the lower and upper bounds are even achieved by some points in the domain (we do know this when f is continuous, by the extreme value theorem, but we can’t say much for a general f). But, even when the max and min do not exist the infimum and supremum always do (for any boounded nonempty set,by completeness).

Definition 25.6 (Upper and Lower Sums) Let f be a function, and P a partition of the closed interval I. For each segment Pi=[ti,ti+1], we define mi=infxPi{f(x)}Mi=supxPi{f(x)}

We then define the upper sum UI(f,P) and the lower sum LI(f,P) as

LI(f,P)=0i<Nmi|Pi| UI(f,P)=0i<NMi|Pi|s

Using the subdivision axiom on our partition, we see that if f is integrable, we must have

0i<Nmi|Pi|0i<NIif0i<NMi|Pi|

This is great in that given any partition we can get some nice bounds on the possible values of our integral If, but we can’t focus on a single partition and need to think more generally. Let’s look at each inequality separately. What does it mean that LI(f,P) is less than or equal to the integral, for any partition? Turning this around, we are saying that If is an upper bound for the lower sums. So this upper bound can’t be less than the least upper bound, meaning

supPPILI(f,P)If

Similarly, the second inequality tells us that If is a lower bound for the set of all possible upper sums. It must then be less than or equal to the greatest lower bound, so

IfinfPPIUI(f,P)

These quantities prove extremely useful estimates, so we will give them a name:

Definition 25.7 (Upper and Lower Integrals) Let f be a function on the closed interval I. Then we define the upper integral UI(f) and the lower integral LI(f) as U(f)=infPPI{UI(f,P)} L(f)=supPPI{LI(f,P)}

It may happen that for a given function f, the upper integral and lower integral are not equal to one another. In this case, the best bounds we could think to construct from the axioms (looking over all possible partitions) aren’t enough to nail down a value for the integral of the function.

Exercise 25.4 Prove the characteristic function of the rationals has U(χ)=1 and L(χ)=0 on the interval [0,1].

But sometimes we will find that U(f)=L(f). Since we know L(f)If and IfU(f) whenver the integral exists, this then uniquely specifies the value: if f is integrable under any possible definition of the integral, its value must be this common quantity U(f)=L(f) we computed here. Taking this one step further actually provides a really reasonable potential definition of an integral: given a function f we compute the upper and lower sums: if they are not equal, we declare the function is not integrable. But if they are equal, we define the integral to be their common value (as we must, as this is the only option!). This definition is due to Darboux, and is called the Darboux integral.

Definition 25.8 (Darboux Integral) Let f be a function on the closed interval I. Then f is Darboux-Integrable on I if U(f)=L(f), and we define the integral to be this common value: [a,b]f=U(f)=L(f)

Because we used the axioms (and only the axioms) to come up with this construction, its perhaps not surprising that the resulting thing actually does satisfy the axioms, so is an example of an integral. But since we are working with limit-like quantities (infima and suprema) we should to be careful and actually check nothing goes wrong. This is the content of the next (optional) chapter.