I wanted to put up a little summary of some of the most important things to remember from the end of last semester. There was a sudden input of new concepts, so let’s put some of them down here to get a clear reminder of what we need to know. A few things in this post:

  • The antiderivative
  • Sigma notation
  • Areas under curves

Antiderivatives

An antiderivative of a function f on an open interval I is a function F such that:

 

F'(x)=f(x) for every x\in I

 

Note that we say an antiderivative, not the antiderivative. There can be many functions whose derivatives give the same thing. While we know that:

 

\frac{d}{dx}\sin x=\cos x

 

and therefore  \sin x is an antiderivative of \cos x, we can also say that:

 

\frac{d}{dx}(\sin x+3)=\cos x

 

So \sin x+3 is also an antiderivative of \cos x. In fact for any constant c it is true that \sin x+c is an antiderivative of \cos x. We will come up with some clever notation for the antiderivative soon.

 

Sigma notation

Sigma notation is a shorthand way of writing “a load of things added together”. I could write 1+4+9+16+25+36+49, or I could write:

 

\sum_{i=1}^7 i^2

 

Which is shorthand for “the sum of all terms from i=1 to i=7 of the form i^2 for integer i”. It is going to be useful to remember that some sums can be simplified a lot. For instance, the following identities are useful:

 

  • \sum_{i=1}^n i=\frac{n(n+1)}{2}
  • \sum_{i=1}^n i^2=\frac{n(n+1)(2n+1)}{6}
  • \sum_{i=1}^n i^3=\left(\frac{n(n+1)}{2}\right)^2
  • \sum_{i=0}^{n-1} a r^i=a\frac{r^n-1}{r-1}

It’s also important to remember what happens when we multiply through a sum by a constant, add two sums together and take the difference of two sums. These are all in Stewart page 309. Proofs of these are also here.

 

Areas under curves

This is in Stewart chapter 4.1. Remember that if we want to calculate the area under a curve (and within an interval), the first thing that we can do is to try and find an approximate value for it. To do this, we approximate the curve by a series of rectangles, which roughly match the shape of the curve. There are a number of different ways of doing this, but one thing which we will do for now is to keep the widths of the rectangles all the same (we will almost always do this, though you don’t actually have to).

 

We’ll start with an arbitrary function f and move on to concrete examples later. Let’s try and work out the area under the curve y=f(x) between x=a and x=b where a and b are two values in the domain of the function, and we are assuming that the function is reasonably well behaved between a and b (“reasonably well behaved” can be defined later, but basically we don’t want it to be jumping around too much, although it can jump around a bit). In fact here we choose to look at the area under the curve between x=0.5 and x=1.5. Let’s look at the area that we’re trying to calculate. It’s the area shaded under the curve here:

pl1

We can try approximating it by a single rectangle whose height is the left-most value of the function on the interval (ie. the value of the function at x=0.5) or we can try and approximate it by the right-most value (ie. the value of the function at x=1.5). We see below, the red rectangle on the left underestimates the area, whereas the red rectangle on the right overestimates the area. What are these areas? pl2

Well, the area in the left plot is:

A_L=(1.5-0.5) f(0.5)

 

and the area on the right is:

 

A_R=(1.5-0.5) f(1.5)

 

The subscripts L and R correspond to taking the left most point of the rectangle to be the value of the function at that position, and the right most, respectively. You will often see these written as L_n and $R_n$ where the n is the number of rectangles used in the approximation.

It looks like our approximating rectangles aren’t very good approximations of the true area, but at least we’ve got what seems to be a lower bound and an upper bound.

 

 

Can we do any better?

Well, how about we try and approximate it with two rectangles? We will split the interval into two and put in two rectangles. The plot on the left takes the left-most point in each rectangle to be the value of the function, whereas in the right plot, it is the right-most point.pl3

What are the areas of these rectangles? Well the left plot is:

 

A_L=(1-0.5) f(0.5)+(1.5-1) f(1)

 

and the plot on the right has area

 

A_R=(1-0.5) f(1)+(1.5-1) f(1.5)

 

In fact we can simplify this a bit, because the widths of each rectangle is the same: They are the total length of the interval (1.5-0.5) divided by 2. Let’s remember that we first called the left and right points a and b, so we can write the widths of the rectangles as (b-a)/2.

 

Again, it looks like with two rectangles, we haven’t done a great job at approximating the area, so let’s try and use a few more rectangles.

pl4

The left plot is made up of five rectangles, each with width (b-a)/5. Let’s call this width \Delta x. The area of the rectangles is:

 

A_L=\Delta x f(a)+\Delta x f(a+\Delta x)+\Delta x f(a+2\Delta x)+\Delta x f(a+3\Delta x)+\Delta x f(a+4\Delta x)

 

Let’s give the positions of the left most points of each rectangle a name. The first one is going to be called x_0 and is actually the point x=a. The second is called x_1, the next is x_2, etc. The point x=b is called x_5. This is shown in the plot below:

pl6

 

So we can write the above as:

 

A_L=\Delta x f(x_0)+\Delta x f(x_1)+\Delta x f(x_2)+\Delta x f(x_3)+\Delta x f(x_4)

 

We can write this in a simpler way using the Sigma notation:

 

A_L=\sum_{i=0}^4 \Delta x f(x_i)

 

Note that this is exactly the same as writing:

 

A_L=\sum_{i=1}^5 \Delta x f(x_{i-1})

 

Check this!

 

OK, we could have done exactly the same thing with the right plot, and we would have gotten:

 

A_R=\sum_{i=1}^5 \Delta x f(x_i)

 

Can we do any better? Well, let’s try with 15 rectangles instead of 5.

pl5

We can immediately write the area as:

 

A_L=\sum_{i=1}^{15} \Delta x f(x_{i-1})

 

for the left plot and

 

A_R=\sum_{i=1}^{15} \Delta x f(x_{i})

 

for the right plot. OK, so it seems that the area of the rectangles should be getting closer and closer to the true area under the curve, and in fact the left and right plots have rectangles which are looking more and more like each other. The left plot is underestimating by only a bit, and the right plot is overestimating by only a bit. Maybe, if we take a lot of rectangles, we will get a better and better approximation. In fact, let’s even take the limit that there are an infinite number!

Let’s write our \Delta x in terms of a and b and the number of rectangles (which we will call n), so that we keep track of what we are taking to infinity. In fact \Delta x=(b-a)/n.

The following actually occurs in Stewart 4.2, and is covered in a lot more detail in this post. Skip to that post if you want to see things done a bit more gradually.

 

So, the true area under the curve in the first plot seems to be:

 

A=\lim_{n\rightarrow\infty} \sum_{i=1}^n\frac{b-a}{n} f(x_{i-1})

 

So now we are adding up an infinite number of rectangles but the area of each rectangle is going to zero. So long as we do things in the right way, the answer should be finite. Aren’t limits wonderful! Well, so long as they converge, we’re happy.

 

One thing that we can note is that it doesn’t actually matter whether we take the left plots or the right plots. In fact we could just make sure that the height of each rectangle is the value of the function somewhere in the interval of the width of the rectangle. We will call this f(x_i^*) to mean “the value of the function somewhere in the i^{th} interval.

 

Let’s look at an example. What is the area under the curve y=e^x between x=0 and x=1? Well, according to the above it should be:

 

A=\lim_{n\rightarrow\infty} \sum_{i=1}^{n}\frac{1-0}{n} e^{x_{i-1}}

 

Where x_{i-1}=\frac{i-1}{n}. Make sure that you understand this. So:

 

A=\lim_{n\rightarrow\infty} \sum_{i=1}^{n}\frac{1}{n} e^{\frac{i-1}{n}}

 

We can rewrite this as:

 

A=\lim_{n\rightarrow\infty} \frac{1}{n} \sum_{i=1}^{n}\left(e^{\frac{1}{n}}\right)^{i-1}

 

Let’s call r=e^{\frac{1}{n}} for now, so the sum that we have is:

 

\sum_{i=1}^{n} r^{i-1}=\frac{r^n-1}{r-1}

 

So we have (just substituting back in for r):

 

A=\lim_{n\rightarrow\infty} \frac{1}{n} \frac{r^n-1}{r-1}=\lim_{n\rightarrow\infty} \frac{1}{n} \frac{e-1}{e^{\frac{1}{n}}-1}

 

Check this! Now, let’s let h=\frac{1}{n}, so now we have:

 

A=\lim_{h\rightarrow 0} h \frac{e-1}{e^h-1}=(e-1)\lim_{h\rightarrow 0} \frac{h}{e^h-1}

 

Now we can use L’hospital’s rule. Take the derivative of the numerator and the denominator in the fraction and we get:

 

A=(e-1)\lim_{h\rightarrow 0} \frac{1}{e^h}=(e-1).

 

So this is it! That’s the area. The area under the curve of y=e^x from x=0 to x=1 is e-1, which, enticingly, can also be written as e^1-e^0.

 

OK, that’s enough for now, but we will see that in fact in general we won’t need to perform all these complicated sums when we want to calculate areas under curves – we will be able to use antiderivatives, but we will see why soon enough.

How clear is this post?