Basel problem

The Basel problem is a famous problem in number theory, first posed by Pietro Mengoli in 1644, and solved by Leonhard Euler in 1735. The problem had withstood the attacks of the leading mathematicians of the day, so Euler's solution gained him immediate notoriety at the age of 28. Euler generalised the problem considerably, and his ideas were taken up years later by Bernhard Riemann in his seminal 1859 paper On the Number of Primes Less Than a Given Magnitude, in which he defined his zeta function and proved its basic properties. The problem is named after Basel, hometown of Euler as well as of the Bernoulli family, who unsuccessfully attacked the problem.

The Basel problem asks for the precise sum of the reciprocals of the squares of the positive integers, i.e. the precise sum of the infinite series

\sum_{n=1}^\infin \frac{1}{n^2} = \frac{1}{1^2} + \frac{1}{2^2} + \frac{1}{3^2} + \frac{1}{4^2} + \cdots

The series is approximately equal to 1.644934. The Basel problem asks for the exact sum of this series, (in closed form), as well as a proof that this sum is correct. Euler found the exact sum π2/6 and announced this discovery in 1735. His arguments were based on manipulations that were not justified at the time, and it was not until about 10 years later that he was able to produce a truly rigorous proof.

Contents

Euler attacks the problem

Euler's original "derivation" of the value π2/6 is clever and original. He essentially extended observations about finite polynomials and assumed that these same properties hold true for infinite series. Of course, Euler's original reasoning requires justification, but even without justification, by simply obtaining the correct value, he was able to verify it numerically against partial sums of the series. The agreement he observed gave him sufficient confidence to announce his result to the mathematical community.

To follow Euler's argument, recall the Taylor series expansion of the sine function

\sin(x) = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!} + \cdots

Dividing through by x, we have

\frac{\sin(x)}{x} = 1 - \frac{x^2}{3!} + \frac{x^4}{5!} - \frac{x^6}{7!} + \cdots

Now, the roots (zeros) of sin(x)/x occur precisely at x = ±nπ, where n = 1, 2, 3, ... Let us assume we can express this infinite series as a product of linear factors given by its roots, just as we do for finite polynomials:

\frac{\sin(x)}{x} = \left(1 - \frac{x}{\pi}\right)\left(1 + \frac{x}{\pi}\right)\left(1 - \frac{x}{2\pi}\right)\left(1 + \frac{x}{2\pi}\right)\left(1 - \frac{x}{3\pi}\right)\left(1 + \frac{x}{3\pi}\right) \cdots
= \left(1 - \frac{x^2}{\pi^2}\right)\left(1 - \frac{x^2}{4\pi^2}\right)\left(1 - \frac{x^2}{9\pi^2}\right) \cdots

If we formally multiply out this product and collect all the x2 terms, we see that the x2 coefficient of sin(x)/x is

-\left(\frac{1}{\pi^2} + \frac{1}{4\pi^2} + \frac{1}{9\pi^2} + \cdots \right) = -\frac{1}{\pi^2}\sum_{n=1}^{\infty}\frac{1}{n^2}.

But from the original infinite series expansion of sin(x)/x, the coefficient of x2 is −1/(3!) = −1/6. These two coefficients must be equal; thus,

-\frac{1}{6} = -\frac{1}{\pi^2}\sum_{n=1}^{\infty}\frac{1}{n^2}.

Multiplying through both sides of this equation by −π2 gives the sum of the reciprocals of the positive square integers.

The Riemann zeta function

The Riemann zeta function ζ(s) is one of the most important functions in mathematics, because of its relationship to the distribution of the prime numbers. The function is defined for any complex number s with real part > 1 by the following formula:

\zeta(s) = \sum_{n=1}^\infin \frac{1}{n^s}.

Taking s = 2, we see that ζ(2) is equal to the sum of the reciprocals of the squares of the positive integers:

\zeta(2) = \sum_{n=1}^\infin \frac{1}{n^2} = \frac{1}{1^2} + \frac{1}{2^2} + \frac{1}{3^2} + \frac{1}{4^2} + \cdots \approx 1.644934.

How do we know it converges at all? We can demonstrate this with the following inequality:

\zeta(2) = \sum_{n=1}^\infin \frac{1}{n^2} < 1 + \sum_{n=2}^\infin \frac{1}{(n-1)n} = 1 + \sum_{n=2}^\infin \left( \frac{1}{n-1} - \frac{1}{n} \right) = 2.

This gives us the upper bound ζ(2) < 2, but the exact value ζ(2) = π2/6 was unknown for some time, until Leonhard Euler computed it in 1735. It can be shown that ζ(s) has a nice expression in terms of the Bernoulli numbers whenever s is a positive even integer.

A rigorous proof

The following argument proves the identity ζ(2) = π2/6, where ζ(s) is the Riemann zeta function. It is by far the simplest proof yet available; while most proofs utilise results from advanced mathematics, such as Fourier analysis, complex analysis, and multivariable calculus, the following does not even require single-variable calculus (although a single limit is taken at the end).

History of this proof

The origin of the proof is unclear. It appeared in the journal Eureka in 1982, attributed to John Scholes, but Scholes claims he learned the proof from Peter Swinnerton-Dyer, and in any case he maintains the proof was "common knowledge at Cambridge in the late 1960s".

What you need to know

To understand the proof, you will need to understand the following facts:

(cosx + isinx)n = cos(nx) + isin(nx).
Proof: This can be proved from Euler's formula; see the article for more details.
(x+y)^n=\sum_{k=0}^n{n \choose k}x^ky^{n-k}
where we have the binomial coefficients
{n \choose k}=\frac{n!}{k!(n-k)!} (See factorial)
Proof: This requires mathematical induction and some properties of the binomial coefficients.

Missing image
Circle-trig6.png
Image:Circle-trig6.png

Now, take the reciprocal of everything and square. Remember that the inequality switches direction.

The proof

The main idea behind the proof is to bound the partial sums

\sum_{k=1}^m \frac{1}{k^2} = \frac{1}{1^2} + \frac{1}{2^2} + \cdots + \frac{1}{m^2}

between two expressions, each of which will tend to π2/6 as m approaches infinity. The two expressions are derived from identities involving the cotangent and cosecant functions. These identities are in turn derived from De Moivre's formula, and we now turn to establishing these identities.

Let x be a real number with 0 < x < π/2, and let n be a positive integer. Then from De Moivre's formula and the definition of the cotangent function, we have

\frac{\cos (nx) + i \sin (nx)}{(\sin x)^n} = \frac{(\cos x + i\sin x)^n}{(\sin x)^n} = \left(\frac{\cos x + i \sin x}{\sin x}\right)^n = (\cot x + i)^n.

From the binomial theorem, we have

(\cot x + i)^n = {n \choose 0} \cot^n x + {n \choose 1} (\cot^{n-1} x)i + \cdots + {n \choose {n-1}} (\cot x)i^{n-1} + {n \choose n} i^n
= \left[ {n \choose 0} \cot^n x - {n \choose 2} \cot^{n-2} x \pm \cdots \right] \; + \; i\left[ {n \choose 1} \cot^{n-1} x - {n \choose 3} \cot^{n-3} x \mp \cdots \right].

Combining the two equations and equating imaginary parts gives the identity

\frac{\sin (nx)}{(\sin x)^n} = \left[ {n \choose 1} \cot^{n-1} x - {n \choose 3} \cot^{n-3} x \mp \cdots \right].

We take this identity and set n = 2m + 1, where m is a positive integer, and x = rπ/(2m + 1), where r = 1, 2, ..., m. Then nx = rπ, so that sin(nx) = 0, and so

0 = {{2m+1} \choose 1} \cot^{2m} x - {{2m+1} \choose 3} \cot^{2m-2} x \mp \cdots + (-1)^m{{2m+1} \choose {2m+1}}.

This equation holds for each of the values x = rπ/(2m + 1), where r = 1, 2, ..., m. These values of x are distinct numbers strictly between 0 and π/2. Since the function cot2(x) is 1-1 on the interval (0, π/2), the numbers cot2(x) = cot2(rπ/(2m + 1)) are therefore distinct for r = 1, 2, ..., m. But by the above equation, each of these m distinct numbers is a root of the mth degree polynomial

p(t) := {{2m+1} \choose 1}t^m - {{2m+1} \choose 3}t^{m-1} \mp + (-1)^m{{2m+1} \choose {2m+1}}.

This means that the numbers x = cot2(rπ/(2m + 1)), for r = 1, 2, ..., m are precisely the roots of the polynomial p(t). But we can calculate the sum of the roots directly by examining the coefficients, and the comparison shows that

\cot ^2 \left(\frac{\pi}{2m+1}\right) + \cot ^2 \left(\frac{2 \pi}{2m+1}\right) + \cdots + \cot ^2 \left(\frac{m \pi}{2m+1}\right)
= {{2m+1} \choose 3} / {{2m+1} \choose 1} = \frac{(2m)(2m-1)}{6}.

Substituting the identity csc2 x = cot2 x + 1, we have

\csc ^2 \left(\frac{\pi}{2m+1}\right) + \csc ^2 \left(\frac{2 \pi}{2m+1}\right) + \cdots + \csc ^2 \left(\frac{m \pi}{2m+1}\right)
= {{2m+1} \choose 3} / {{2m+1} \choose 1} + m = \frac{(2m)(2m+2)}{6}.

Now consider the inequality cot2 x < 1/x2 < csc2 x. If we add up all these inequalities for each of the numbers x = rπ/(2m + 1), and if we use the two identities above, we get

\frac{(2m)(2m-1)}{6} < \left( \frac{2m+1}{\pi} \right) ^2 + \left( \frac{2m+1}{2 \pi} \right) ^2 + \cdots + \left( \frac{2m+1}{m \pi} \right) ^2 < \frac{(2m)(2m+2)}{6}.

Multiplying through by (π/(2m + 1))2, this becomes

\frac{\pi ^2}{6}\left(\frac{2m}{2m+1}\right)\left(\frac{2m-1}{2m+1}\right) < \frac{1}{1^2} + \frac{1}{2^2} + \cdots + \frac{1}{m^2} < \frac{\pi ^2}{6}\left(\frac{2m}{2m+1}\right)\left(\frac{2m+2}{2m+1}\right).

As m approaches infinity, the left and right hand expressions each approach π2/6, so by the squeeze theorem,

\zeta(2) = \sum_{k=1}^\infin \frac{1}{k^2} = \lim_{m \to \infty}\left(\frac{1}{1^2} + \frac{1}{2^2} + \cdots + \frac{1}{m^2}\right) = \frac{\pi ^2}{6}

and this completes the proof. Q.E.D.

References

External links

See also: Basel problem, 1644, 1735, 1859, 1982, Basel, Bernhard Riemann, Bernoulli, Bernoulli number, Binomial coefficient