Pafnuty Chebyshev proved Bertrand's Postulate that there is at least one prime between a number and its double, , and came close to proving the Prime Number Theorem.
In this post we'll recreate his estimates for the prime counting function which came close to the actual PNT. The approach is inspired by a note by Franz Lemmermeyer [pdf].
A Bit Of History
We've previously discussed how Gauss and Legendre used experimental evidence to suggest that the prime counting function is of the form , and that this approximation gets better for larger - in the sense that the ratio between the true count and this approximation gets ever closer to 1.
This is the Prime Number Theorem (PNT), which took another 100 years to be proved by Hadamard and de la Vallee-Poussin in 1896.
Before that final proof, around 1850, Chebyshev got close to proving the PNT by showing that can be bounded as follows, where are constants .
This is a remarkable result, and in fact was strong enough to prove Bertrand's Postulate that there is a prime between a number and its double, .
Starting With Combinations
We start with an observation about binomial coefficients:
Because is only one of those terms we can say:
Looking at that fraction, we can say that the numerator contains primes , and these primes don't appear in the denominator. This means each of those primes divides the whole fraction without remainder, and so does the product of those primes.
How many primes are there ? The answer is of course .
If we raised to this number, it would be smaller than the actual product of those primes . This is because each of those primes is greater than .
We can take logs and simplify:
We now have a relation relating to . Intuitively it hints at being of the form .
To find a form for not involving we use induction.
Upper Bound
Let's try the following form for :
We can examine our inequality with :
Our proposed form for gives us
That last inequality appears to be only true for because is only true for . However, numerically checking for confirms it is true for all .
We've shown that if is true for it is also true for . We've also shown it is true for some , for example, because . This completes the induction proof for this form of .
Now, if we set we can say
That least step requires , but we can numerically verify that is true for .
And so finally we have an upper bound for the prime counting function ,
On its own, the upper bound isn't enough to tell us about the true form of . We need the lower bound to be of a similar form, and fairly tight too.
Lower Bound
Going back to the binomial coefficients
we can see that is the largest element, which means it must be than the average of these terms (we have terms but we can combine the first and last which each have a value of 1), so
Taking logs gives us
We now need to think about the prime factorisation of . To do this we need to take a short diversion.
We know that is , so we need to think about the prime factorisation of . Luckily this is not too difficult. If is the highest power of a prime dividing , then This is because in the set of numbers there are multiples of , multiples of , multiples of , and so on, each contributing 1 to . Now, because is , and because , we can say Let's focus on that form . It only takes values 0 or 1. This is easy to see if we write where is the integer part of , and is the fraction. The largest is because a larger would give which isn't possible. So reduces as follows: Knowing how factorises into primes will be helpful. |
We can now proceed:
Re-arranging gives us a lower bound for :
This is almost of the form we want .
We can use induction again to prove that
Let's start with , so
We're going to replace that with an expression for , and by assuming :
We can manually show that the desired expression is valid for
This gives us:
So we finally have
The following illustrates these lower and upper bounds for .
Bertrand's Postulate
Can Chebyshev's estimates be used to prove Bertrand's Postulate that there is a prime between and ? Let's find out.
Bertrand's postulate is the same as saying .
The smallest difference is when we take the lower bound for , and the upper bound for :
We can see that must be positive, and so
In our case, the bounds and aren't tight enough to prove Bertrand's Postulate.
Chebyshev and others actually developed tighter bounds for which do meet this requirement, and , but those tighter bounds require much more work.
Thoughts
It is interesting that
- Chebyshev's estimates for the prime counting function are derived from the binomial coefficients of the expansion , a rather surprising source of useful inequalities.
- the estimates are intimately linked to the prime factorisation of factorials.
- the trail, albeit with some algebra work, can lead to a form for that is , a form which hints at the Prime Number Theorem.
Additional work by Chebyshev showed that and can be as close to 1 as desired for large enough .