6  Convergence

Having formalized the number line, we can now get to work. If we want to rigorously understand any of the approximation efforts of the ancients, we must think about sequences.

Definition 6.1 (Sequence) A sequence is an infinite ordered list of numbers (s1,s2,s3,,sn,) Each individual element is a term of the sequence, with an subscript (the index) denoting its position in the list.

Most often, we take the set of indices to be 1,2,3,, but any infinite subset of the integers will do. For example, the sequence pn of perimeters of inscribed n-gons starts with index 3 (the triangle), as this is the smallest polygon. And, the subsequence Archimedes used to calculate π started with the hexagon and then iterated doubling: P6,P12,P24, so has index set {6,12,24,48,96,192,384,}

Formally, we note all of this is captured using functions, though we will not need this perspective during our day-to-day usage of sequences.

Remark 6.1. Let IZ be any infinite set of indices. Then a sequence is a function s:IR.

While sequence itself is just an infinite ordered list of numbers, to work with such an object we often require a way to compute its terms. Sometimes this is hard! For example, the sequence

πn= the number of prime numbers n 

Is called the prime counting function, and being able to compute its exact values efficiently would be monumental progress in number theory. In practice, sequences that we can compute with efficiently are often presented to us in one of two ways:

an=n2+13n2,bn=sin(1n)cn=1+nn+1

Here’s some example sequences that are important both to us, and the history of analysis:

Example 6.1 (Babylonians and 2) Starting from rectangle of width and height w,h, the Babylonians created a new rectangle whose width was the average of these, and whose height was whatever is required to keep the area 2: wnew=w+h2hnew=2wnew

This because we can solve for h in terms of w, this induces a recursive sequence for the widths. Starting from some (wn,hn) we have

wn+1=wn+hn2=wn+2wn2=wn2+1wn

Thus, in modern terminology the babylonian procedure defines a recursive sequence, given any starting rectangle. If we begin with the rectangle of wdith 2 and height 1, we get

w0=2,wn+1=wn2+1wn

Exercise 6.1 (Babylonians and 2) Following the same type of reasoning as for width, use the babylonian procedure to produce a recursive formula for the sequence of heights hn, for a rectangle starting with h=1.

Example 6.2 An infinite sum is a type of recursively defined sequence, built from another sequence called its terms. Assume that an is any sequence. Then we build a sequence sn by

s0=a0sn+1=sn1+an

Unpacking this, we see that s1=s0+a1=a0+a1, and thus s2=s1+a2=a0+a1+a2 etc.

6.1 Convergence

The reason to define a sequence precisely is that we are interested in making rigorous the idea of infinitely many steps, the way the Babylonians may have pictured running their procedure an infinite number of times to produce a perfect square, or Archimedes who ran his side-doubling procedure infinitely many times to produce a circle.

In both cases, there was some number L out there at infinity that they were probing with a sequence. We call such a number L the limit of the sequence.

Definition 6.2 (Convergent Sequence) A sequence sn converges to a limit L if for all ε>0 there is some threshold N past which every further term of the sequence is within ε of L. Formally, this is the logic expression ε>0Nn>N|snL|<ε When a sequene converges to L we write limsn=LorsnL

A sequence is divergent if its not convergent. The definition of convergence formalizes the idea the ancients sought if you keep calculating terms, you’ll get as close as you like to the number you seek

That is, the definition sets up a challenge between you (the computer of the sequence) and the error tolerance. Once you set a certain amount of acceptable error ε, the definition furnishes an N and guarantees that if you compute the sequence out until N you’ll be within the tolerated error - and if you keep computing more terms, the approximation will never get worse. Its good to look at some specific examples, while getting comfortable with this:

Exercise 6.2 (Understanding Convergence) Consider the sequence an=1n2+13. Feel free to use a calculator (even just the google search bar) to experiment and answer these questions.

  • What value L do you think this sequence converges to?
  • If ε=1/10, what value of N ensures that an is always within ε of L for, n>N?
  • If ε=1/100, what value of N ensures that an is always within ε of L for, n>N?

Exercise 6.3 (Convergence and 2) This problem concerns the babylonian sequence for 2 in . Again, use a calculator to play around and answer the following

  • For which value of N are we guaranteed that wn calculates the first two decimal places 2 correctly, when n>N?
  • For which value of N are we guaranteed that wn calculates the first eight decimal places 2 correctly, when n>N?

6.1.1 The εN Game

To prove a sequence converges, we need to work through the string of quantifiers εNn… This sets up a sort of imagined battle between an imagined foe setting a value of ε, and you needing to come up with an N such that you can get the sequence within ε of the limit.

Here’s one incredibly useful example, that will serve as the basis of many future calculations.

Proposition 6.1 (1/n converges to 0.) Prove that the sequence sn=1/n of reciprocals of the natural numbers converges to 0.

Proof. Let ε>0. Then set N=1/ε, and choose arbitrary n>N. Since n>1/ε it follows that 1/n<ε, and hence that |1n0|<ε

Since n>N was arbitrary, this holds for all such n, and we have proved for this ε, theres an N with n>N implying the sequence 1/n is within ε of the proposed limit 0. Since ε was also arbitrary, we have in fact proved this for all positive epsilon, and thus we conclude 1n0

Often when working out such a computation, the scratch work is backwards of the final proof. In a proof, you need to fix an arbitrary epsilon, then

Exercise 6.4 (nn+1 converges to 1)  

Sometimes the scratch work takes a bit more thinking or algebraic manipulation. Its OK if the scratch work isn’t fully rigorous or perfectly written, as long as the eventual proof is! Here’s an example of some scratch work taking a naive approach (just “solve for n”) that arrives at an easy bound, and a nice formal proof verifying it.

Example 6.3 (nn2+1 converges to 0.)  

Proof (Scratch). We want nn2+1<ε, and we attempt to solve this inequality for n by multiplying through and using the quadratic formula: n<εn2+ε0<εn2n+ε The zeroes satisfy n=1±14ε22ε The larger of these is the one with the + sign, so as long ad n is bigger than this the proof will work. This term as written is rather annoying to deal with as we have 14ε2 under the square root, and to do a formal proof using it as is, we’d need to ensure this wasn’t negative. But since we are only looking for a bound, we can use that 1+14ε22ε<1+12ε=1ε and just require n>1ε.

Proof (Formal). Let ε>0 and set N=1ε. For any n>N we see that 1n2<ε2, and 1n2+1<1n2 so 1n2+1<ε2. Multiplying by n gives nn2+1<nε2<1εε2=ε Thus for any n>N we have |nn2+10|<ε so the sequence converges to 0 by definition.

Example 6.4 (12n0) Here’s a sketch of an argument: you should fill in the details. Let ε>0. Then we want to find an N where n>N implies 1/2n<ε. First, we prove by induction that 2nn for all n. Thus, 1/2n<1/n, and so it suffices to find N where 1/n<ε. But this is exactly what we did above in the proof that 1/n0. So this is possible, and hence 1/2n0.

Exercise 6.5 Give an example of the following, or explain why no such example can exist.

  • A sequence with infinitely many terms equal to zero, but does not converge to zero.
  • A sequence with infinitely many terms equal to zero, which converges to a nonzero number.
  • A sequence of irrational numbers that converges to a rational number.
  • A convergent sequence where every term is an integer.

Exercise 6.6 Prove, directly from the definition of convergence, that 2n25n+125

6.1.2 Divergence

The definition of convergence picks out a very nice class of sequences: those that get arbitrarily close to a fixed value, as their index grows. The rest of sequences - anything that does not have this nice property, are all lumped into the category of divergent.

Definition 6.3 (Divergence) A sequence diverges if its not true that for any ε you can find an N where beyond that, all terms of the sequence differ from some constant (the limit) less than ε.

Phrasing this positively: a sequence an diverges if for every value of a, there exists some ε>0 where no matter which N you pick, there’s always some n>N where |ana|>ε. There’s a lot of quantifiers here! Written out in first order logic:

aRε>0Nn>N|ana|>ε

Again, its easiest to illustrate with an example:

Example 6.5 ((1)n Diverges) Here’s the idea: The sequence sn=(1)n alternates back and forth from 1 to 1 forever. Assume for the sake of contradiction that it in fact converges to some real number L. Then (by definition) eventually all terms must get within ε of L, but this is impossible if ε is small as every term differs from its successor by 2.

Proof. Note that for all n, |snsn+1|=2 as when n is even this is |1(1)|=|2| and when n is odd its |1(1)|=|2|. Assume for the sake of contradiction that snL for some LR, and set ε=12. This implies there exists an N such that for all n>N we have |snL|<12. Choosing some n>N we use the triangle inequality to see 2=|snsn+1|=|snL+Lsn+1||snL|+|Ls+n+1|ε+ε=1

Thus we’ve proven 2<1 which is a contradiction, so it must not be true that snL for any L: the sequence diverges.

Definition 6.4 (Diverging to ±) A sequence sn diverges to if for all M>0 there exists an threshold past which the sequence is always larger than M. As a logic statement, M>0Nn>Nsn>M

Exercise 6.7 (n2 diverges to .)  

Exercise 6.8  

  • Give an example of two divergent sequences an,bn where an+bn is convergent.
  • Give an example of two divergent sequences an,bn where anbn is convergent.

6.2 Uniqueness

Theorem 6.1 (Limits are unique) Let an be a convergent sequence. Then there exists a unique aR with ana.

Here’s a sketch of the idea, which uses several big ideas that can be recycled in similar arguments:

  • We prove uniqueness by showing that if x and y were both limits, then x=y.
  • We prove x=y by showing that for every ε>0 the difference |xy|<ε.
  • We prove |xy|<ε by an ε/2 argument:
    • We add zero in a clever way: |xy|=|xan+any|
    • We use the triangle inequality |xan+any||xan|+|any|
    • We use the fact that anx and any to make each of |anx| and |any| less than ε/2.

Proof. Assume that a sequence an converges to two limits anx and any. Then for any ε we can find an N1 where n>N1 implies |anx|<ε/2 and an N2 where n>N2 implies |any|<ε/2. Setting N=max{N1,N2} we see for any n>N that |xy|=|xan+any||xan|+|any|<ε2+ε2=ε

Thus for any positive ε we know |xy|<ε, so in particular |xy|ε, and |xy| can’t be positive. Since absolute values are always nonnegative, the only remaining option is that |xy|=0. But this means xy=0 and hence x=y: the two limits are equal.

There’s one more uniqueness-type theorem about limits that’s useful to get a handle on. We just saw that the limit is uniquely determined by the sequence, but we can say something slightly stronger. Its uniquely determined by the end of the sequence: if you throw away the first finitely many terms, it won’t change the limit.

Definition 6.5 A shifted sequence the result of shifting the indices by a constant k, deleting the first k terms. Precisely, given a sequence an and some kN, the sequence sn=an+k is the result of shifting a by k. s0=ak,s1=ak+1,s2=ak+2,

Proposition 6.2 Shifting a convergent sequence does not change its limit.

Proof (Scratch Work). Assume that an converges to a, and define the sequence sn by deleting the first k terms of an, that is, sn=an+k. We claim that sna.

Let ε>0 and choose an N such that if n>N we know that |ana|<ε (we know such an N exists by the assumption ana). Now consider |sna|. Since sn=an+k, we know |sna|<ε because we already knew |an+ka|<ε: we knew this for every single index bigger than N.

Thus, for all n>N we have |sna|<ε, which is the definition of sna.

This can be generalized, to show that any two sequences which are eventually the same have the same limit. Since the first finite part of any sequence is irrelevant to its limiting behavior, its nice to have a word for “the rest of the sequence, after throwing away an unspecified amount at the beginning”. This is called the tail.

Definition 6.6 (Tail of a Sequence) The tail of a sequence is what remains after chopping off an arbitrary (finite) number of terms from the beginning of the sequence. Two sequences have the same tail if they agree after some point: more precisely, an and bn have the same tail if there is an Na and Nb such that for all kN aNa+k=bNb+k

Example 6.6 (Tail of a Sequence) The following two sequences have the same tail: an=1,1,4,3,1,5,1,3,1,4,7,8,9,10,11,12,13,14, bn=4,3,9,10,11,12,13,14,15,16,17,18

We can see this because a13=b3=9, and a14=b4=10, and a15=b5=11…for every k we have that a13+k=b3+k so they agree after chopping the first 12 terms off of an and the first two terms off of bn.

Exercise 6.9 (Convergence only depends on the tail) If two sequences have the same tail, then they either both converge or both diverge, and if they converge, they have the same limit.

6.3 Important Sequences

We will soon develop several theorems that let us calculate many limits without tediously chasing down an N for every ε. But there are still several ‘basic limits’ that we will need to know, that will prove useful as building blocks of more complicated limits, as well as foundations to further theory in analysis. We compute several of them here: you should not worry too hard about committing these to memory; but rather read the proofs as examples of how to play the εN game in tricky situations.

The first and most important is familiar from above:

Example 6.7 As n the sequence 1/n converges to 0.

The next is useful in developing the theory of power series, and other things. We’ll prove it using Bernoulli’s inequality:

Example 6.8 Let |a|<1, then the sequence an of repeated powers of a converges to 0.

Proof (Proof). First note that if a=0 then an=0n=0, which clearly converges to zero so we can assume a0 in what follows.

Let ε>0. We wish to find an N such that n>N implies |an0|=|an|=|a|n<ε. Since |a|<1 by assumption, we know 1/|a|>1 and so we may write 1/|a|=1+x for some x>0. This is helpful because Bernouli’s inequality tell us that (1+x)n>1+nx>nx and so, we know |a|n=1(1+x)n11+nx1nx For this to be less than ε, we need 1/nx<ε, or n>1xε. Thus setting N=1xε will do it.

This proof is OK as written because it produces a value of N in terms of epsilon, and also shows (in the previous line) for larger n that the quantity we are trying to bound is even smaller. But it might be more readable to rewrite it in “reverse”: first proposing the value N=1/xε we discovered, and then showing it works (do this as an exercise). As another exercise, its useful to look at what happens for a>1.

Exercise 6.10 If a>1 then an diverges to infinity.

This next is an essential building block of the theory of exponential functions: we again make use of Bernoulli’s inequality in the proof below, but suggest an alternative (second) proof as an exercise

Example 6.9 Let a>0. Then the sequence a1/n converges to 1.

Proof. We proceed in two cases, starting first with a>1. Fix an ε>0; we want to find an N where n>N implies |a1/n1|<ε. Since a<1 is positive, it follows that a1/n>11/n=1. Call bn=a1/n1 the quantity we are trying to show is small. Then we can apply Bernoulli’s inequality to see

a=(a1/n)n=(1+bn)n1+nbnnbn$

Thus, for all n, we see 0bnan, and for this to be less than ε, we need only assure a/n<ε so n>a/ε: choosing N=a/ε will do.

n>N=aεbn=|a1/n1|<ε

Now we turn to the second case, which is 0<a<1. Note that for any a in this case we know 1/a>1, so the work we just did applies directly to 1/a, and (1a)1/n1. Unpacking this, that means for any ε>0 there is some N where n>N implies |1an1|<ε

We can simplify the fraction inside the aboslute value and then multiply the entire inequality through by a1/n to see

|1a1/na1/n|<ε|1a1/n|<εa1/n

But since 0<a<1 we know a1/n<11/n=1 so the right hand side is already less than ε, and we are done.

Exercise 6.11 In this problem you give an altenative proof that a1/n1 for all a>0, using the geometric sum. Recall, this stated that for any |r|<1 and nN, 1+r+r2+rn1=1rn1r

  • First consider only the case that a>1. Show that the geometric sum can be rewritten rn1=(r1)(1+r+r2++rn1), and use this to prove that a1n(a1/n1). Hint: apply it to r=a1/n and do some estimating.
  • Use this to show that 0a1/n1a1n for all n, and then prove (either directly, or using the squeeze theorem) that this implies a1/n1.
  • Now consider the case 0<a<1 and show the same. Hint: if 0<a<1 then 1/a>1, and perhaps you can do a similar trick to the textbook?

As a (challenging!) exercise, one might consider what happens if instead of taking the nth root of a fixed constant, we take the nth root of n itself. This

Exercise 6.12 The sequence n1/n converges to 1.

6.4 Topology

With an eye to topology, everything about sequences and convergence can be rephrased in terms of open sets, instead of with talk about ε and inequalities.

Definition 6.7 (Neighborhoods) A neighborhood of a point x is any open set U containing x. The ε-neighborhood of x is the neighborhood Uε=(xε,x+ε)

Definition 6.8 (Convergence and ε-Neighborhoods) A sequence an converges to a if every ε neighborhood contains all but finitely many terms of the sequence.

That this is equivalent to , because the definition of epsilon neighborhood exactly captures the interval discussed in the original definition of convergence.

Exercise 6.13 (Convergence and ε-Neighborhoods) The definition of convergence in terms of epsilon neighborhoods is equivalent to the usual definition in terms of absolute values and inequalities.

The definition of an epsilon neighborhood makes sense only somewhere like the real line, where we can talk about intervals. So, the general topological definition must dispense with this notion and talk just about open sets:

Definition 6.9 A sequence an converges to a if every neighborhood contains all but finitely many terms of the sequence.

Exercise 6.14 (Convergence and Neighborhoods) Prove this is equivalent to convergence using ε neighborhoods. Hint: show that every neighborhood contains some epsilon neighborhood. Can you show that is enough?

6.5 Problems

Exercise 6.15 Come up with a recursive sequence that could be used to formally understand the infinite expression below: 1+1+1+1+1+1+

Exercise 6.16 Given two sequences xn,yn that converge to a, show the interleaved sequence x1,y1,x2,y2, converges to a.

Exercise 6.17 Let an0 and let bn be a sequence such that for all n you know |bnL|<an. Prove that limbn=L.