Table Of ContentArchimedes’ Quadrature of the Parabola
Notes for Math 1103
Mark Reeder
January 20, 2015.
1 Introduction
Archimedes to Dositheus greeting. 1
When I heard that Conon, who was my
friend in his lifetime, was dead, but that
you were aquainted with Conon and withal
versed in geometry, while I grieved for the
loss not only of a friend but of an ad-
mirable mathematician, I set myself the
task of communicating to you, as I had in-
tended to send to Conon, a certain geomet-
rical theorem which had not been investi-
gated before but has now been investigated
by me, and which I first discovered by means of mechanics and then exhibited by means of ge-
ometry.
Now some of the earlier geometers tried to prove it possible to find a rectilineal area equal to a
given circle and a given segment of a circle ... assuming lemmas not easily conceded 2, so that
it was recognized by most people that the problem was not solved.
But I am not aware that any one of my predecessors has attempted to square the segment bounded
by a straight line and a section of a right angled cone [a parabola], of which problem I have now
discovered the solution.
For it is here shown that every segment bounded by a straight line and a section of a right angled
cone [a parabola] is four-thirds of the triangle which has the same base and equal height of the
segment, and for the demonstration of this property the following lemma is assumed: that the
excess by which the greater of (two) unequal areas exceeds the less can, by being added to itself,
be made to exceed any given finite area.
1From the standard translation by T.L. Heath. Paragraph breaks and footnote added by me.
2A lemma is a preliminary result which in those days was often taken for granted. Archimedes is politely
reminding Dositheus that these lemmas are false.
1
The earlier geometers have also used this lemma; for it is by use of this same lemma that they
have shown that circles are to one another as the duplicate ratio of their diameters, and that
spheres are to one another in the triplicate ratio of their diameters ...
Archimedes says that if you take any segment of a parabola cut off by a line AB, and if C is
the point on the segment furthest from AB, then the area of the parabolic segment ABC is
four-thirds the area of the triangle ABC.
B
A
C
Why did Archimedes want to prove this? He observes that his predecessors were unable to find
the area of a circle (or equivalently, an ellipse). This was the most important unsolved problem
at that time (indeed, for the next 1500 years) so it may have seemed natural to Archimedes to
try another conic section. And perhaps he chose the parabola because it is a very special conic
section, lying on the border between ellipses and hyperbolas.
How did Archimedes figure out the ratio four-thirds, or even that the ratio is the same for
all parabolic segments? According to his preface, Archimedes used “mechanics”. This was a
method which he devised for comparing the area of two geometric figures by regarding area as
weight, and placing the figures on opposite sides of an imaginary see-saw. He frequently used
this method to make guesses which he then rigorously proved using geometry.
2 The idea of Archimedes’ proof
The point C furthest from AB is called the vertex of the parabolic segment. We will call the
triangle ABC the vertex triangle. Archimedes proves that the area of any parabolic segment
is four-thirds the area of its vertex triangle.
2
His main insight is that when we remove the vertex triangle ABC from the parabolic segment
we are left with two smaller parabolic segments which themselves have vertex triangles ACE
and BCD. Removing this layer of two triangles we get four even smaller parabolic segments,
whose four vertex triangles AFE,EGC,CHD,DIB form the next layer, and so on.
B
I
D
A
H
F
C
E
G
The first three layers of vertex triangles.
The idea of filling out a curved figure such as a parabolic segment with rectilineal figures such
as triangles is called the “Method of Exhaustion”. It was discovered prior to Archimedes, and
is the basis of what we now call Integral Calculus, but was in the old days called Quadrature.
3 Eudoxus and the Method of Exhaustion
If we had to name a single inventor of Integral Calculus, it would have to be Eudoxus of Cnidus
(approx. 390-340 BC). All manuscripts directly attributed to Eudoxus seem to have been lost,
3
but we know from commentaries by others that his work is preserved in Euclid’s Elements,
especially in books V and X. One of his results is the lemma quoted above in Archimedes’
preface, which we can rephrase more succinctly as follows:
Given a magnitude M and a smaller magnitude E, then some multiple of E exceeds M.
This is used in the Elements prove Proposition 1 of Book X:
Proposition X.1
Given a magnitude M and a smaller magnitude E, if we remove at least half of M, and then
remove at least half of the remainder, and from this remainder remove at least half, leaving a
smaller remainder, and so on, then eventually E will exceed the remainder.
That is, from M we remove M ≥ 1M, leaving remainder R = M−M , and from R we remove
1 2 1 1 1
M ≥ 1R , leaving remainder R = R − M , and from R we remove M ≥ 1R , and so on.
2 2 1 2 1 1 2 3 2 2
Prop. X.1 says that eventually R < E.
n
This is more or less how Prop. X.1 is stated in the Elements. Today we would rephrase Prop.
X.1 as follows.
Proposition X.1, version 2
If M ,M ,M ,... are obtained as above then for any number E > 0, we shall have
1 2 3
M −(M +M +···+M ) < E for large enough n.
1 2 n
In modern notation we write this as
lim(M +M +···+M ) = M. (1)
1 2 n
n→∞
Despite the presence of “infinity” 3 there is nothing mystical here. Equation (1) means precisely
what is stated in Prop. X.1 version 2.
The letter E stands for “error”. Think of the error as an requirement for accuracy that you
don’t know in advance, so you must be able to respond to any requirement for accuracy by
3Theinfinitysymbol∞wasfirstusedin1655byJohnWallis, apredecessorofNewton, fromwhomwe’llhear
much more later.
4
taking enough terms. A smaller E will require more terms. Thus (1) means that the finite sums
M +M +···+M give an arbitrarily good approximation to M.
1 2 n
This is one of the key ideas in our course. Typically M ,M ,... will be simple quantities such as
1 2
integers, orareasoftrianglesandM willbeadifficultquantitysuchasπ ortheareaofaparabolic
segment. If we can find simple quantities M , M ,..., each more than half of the remainder,
1 2
then by Prop. X.1 we will know M in the sense that we have arbitrarily good approximations
to M.
Sometimes we know M in advance, and it is the approximation itself that is useful.
Example 1: Suppose M is a line segment of length one. Remove exactly half, leaving a
remainder of one-half. Remove half of this remainder, leaving one-fourth. Remove half of this
new remainder, leaving one eighth, and so on. So M = 1/2, M = 1/4, M = 1/8 and
1 2 3
M = 1/2n, so we have
n
1 1 1
M +M +···+M = + +···+ ,
1 2 n 2 4 2n
and the remainder at the nth stage is
1
R = M −(M +M +···+M ) = .
n 1 2 n 2n
Prop. X.1 guarantees that for any E > 0 we will have R < E if n is large enough. Indeed this
n
will be the case if 2n > 1/E. For example if E = 0.1 then n ≥ 4 works. If E = .01 then n ≥ 7
works, and so on. Thus Prop. X.1 is confirmed in this example. In our modern notation we
express this as
(cid:18) (cid:19)
1 1 1 1
lim + + +···+ = 1.
n→∞ 2 4 8 2n
Example 2: Suppose M is a square of side-length one. Remove all but the lower right quarter-
square. From this remainder remove all but its lower-right quarter-square, and so on. At each
stage we’re removing three-fourths of a square, leaving a quarter-square, so M = 3, M =
1 4 2
3 · 1 = 3 , M = 3 · 1 = 3 and M = 3 , so we have
4 4 42 3 4 42 43 n 4n
3 3 3
M +M +···+M = + +···+ ,
1 2 n 4 42 4n
and the remainder at the nth stage is
1
R = M −(M +M +···+M ) = .
n 1 2 n 4n
Prop. X.1 guarantees that for any E > 0 we’ll have R < E if n is large enough. Indeed this
n
will be the case if 4n > 1/E. For example if E = 0.1 then n ≥ 2 works. If E = .01 then n ≥ 4
5
works, and so on. Thus Prop. X.1 is confirmed again. In our modern notation we express this
as
(cid:18) (cid:19)
3 3 3 3
lim + + +··· = 1.
n→∞ 4 42 43 4n
If we divide everything by 3, we get
(cid:18) (cid:19)
1 1 1 1 1
lim + + +··· = .
n→∞ 4 42 43 4n 3
This example 2 is used by Archimedes, as one of two applications that he makes of Prop. X.1.
4 Properties of the Parabola
We now begin Archimedes’ proof of his area formula for the parabola. He starts with some
properties of the parabola that were well-known in ancient times, but may be unfamilar to us.
Tangent Property: The tangent line to the segment at the vertex is parallel to the secant.
That is, PQ is parallel to AB.
B
Q
A
C
P
Bisecting Property: The line through C parallel to the axis bisects the secant.
That is, M is the midpoint of AB.
6
axis B
M
A
C
Equation of the Parabola: If Y on CM and X on the parabola are such that XY is parallel
to the secant AB then CY is to CM as the square on XY is to the square on AM.
That is,
CY XY2
= .
CM AM2
B
M
A
Y
C
X
These properties were known to Archimedes from Euclid’s Conics, which is work of Euclid that
has been completely lost. They are also in Apollonius’ Conics, which has survived and is the
main existing reference for the ancient Greek approach to conic sections. The ancient proofs of
the above properties are purely geometrical and quite involved. We can prove them more easily
using coordinates and algebra, as follows.
7
Let us put cartesian coordinates s,t on the plane so that the axis of the parabola is the s-axis
which intersects the parabola at (0,0), and the t-axis is perpendicular to the s-axis. Then the
equation of the parabola is s = kt2 for some constant k.
s B
M
A
Y
X
C
t
Let A = (a,ka2), B = (b,kb2), so that AB has slope
kb2 −ka2
m = = k(a+b).
b−a
Then the equation of AB is s = m(t−a)+ka2 = mt−kab.
Fix any point C = (c,kc2) on the parabola with a ≤ c ≤ b, and let f(t) be the distance from C
to a point (t,mt−kab) on AB varying with t. Minimizing f(t) for a ≤ t ≤ b, we find that the
distance from C to AB is given by the function
(mc−kc2 −kab)2
g(c) = .
1+m2
The function g(c) (of the variable c) attains its maximum when mc − kc2 − kab does, which
means
m a+b
c = = .
2k 2
Thus the vertex of the parabolic segment is C = (c,kc2), where c = m/2k. Now we can prove
the three properties of the parabola that we need.
The tangent line to the parabola s = kt2 at C has slope 2kc = m, which is also the slope of AB,
hence the tangent property holds.
The line through C parallel to the s-axis is the line t = c, which meets AB at the point
(cid:0) (cid:1)
M = (c,mc−kab) = 1(a+b), 1(ka2 +kb2)
2 2
8
whose coordinates are the averages of those of A = (a,ka2) and B = (b,kb2). Hence M is the
midpoint of AB, and the bisecting property holds.
For the equation, we first check that
k(b−a)2 (b−a)2
CM = and AM2 = ·(1+m2).
4 4
Next, Y = (c,y) for some number y, so that
CY = y −kc2.
Finally, X = (x,kx2) for some number x. Since XY has slope m and passes through (x,kx2), it
follows that y and x satisfy the equation
y −kx2 = m(c−x),
which can also be written as
y −kc2 = k(c−x)2.
Now we have
XY2 = (c−x)2 +(y −kx2) = (c−x)2 +m2(c−x)2 = (c−x)2(1+m2).
It now follows that
XY2 4(c−x)2 4(y −kc2) CY
= = = ,
AM2 (b−a)2 k(b−a)2 CM
as claimed.
5 Exhausting the Parabola
Remove the vertex triangle ABC, so M = area(ABC). From the remaining two segments
1
remove their vertex triangles AEC and BCD, so that M = area(AEC) + area(BCD). From
2
the remaining four segments remove their vertex triangles AFE, EGC, CHD, DIB, whose
sum is M , and so on. In order to invoke Prop. X.1 we must show that the area of a vertex
3
triangle is at least one-half of its parabolic segment. Since our original parabolic segment is
completely general, it suffices to consider the vertex triangle ABC.
9
B
M
Q
A
C
P
Draw parallels to the axis through A,B and C, and let PQ be the tangent line to the parabola
through C. By the tangent property, ABQP is a parallelogram. Since a diagonal divides the
area of a parallelogram in half, it follows that the vertex triangle ABC is exactly half of ABQP.
But ABQP contains the parabolic segment, so ABC is at least half of the parabolic segment,
as claimed.
It now follows from Prop. X.1 that the area S of the segment is given (in our modern notation)
by the limit
S = lim(M +M +···+M ),
1 2 n
n→∞
where M is the area of the nth layer of vertex triangles.
n
In the next section we will show that each layer is one-fourth of the layer immediately above it.
That is, M = 1M , M = 1M = 1 M , and so on. Then by the result in example 2 of section
2 4 1 3 4 2 42 1
3, we have
S = lim(M +M +M +···+M )
1 2 3 n
n→∞
(cid:18) (cid:19)
1 1 1
= lim M + M + M +···+ M
n→∞ 1 4 1 42 1 4n 1
(cid:18) (cid:19)
1 1 1
= M +M · lim + +···+
1 1 n→∞ 4 42 4n
1
= M +M ·
1 1
3
4
= ·M .
1
3
10
Description:Archimedes’ Quadrature of the Parabola Notes for Math 1103 Mark Reeder January 20, 2015. 1 Introduction Archimedes to Dositheus greeting. 1 When I heard that Conon