Thursday, 24 June 2021

Discrete Sums to Continuous Integrals with Abel's Partial Summation Formula

It is often easier to understand how a discrete function behaves if it can be expressed as a continuous function. Abel's partial summation formula allows us to write a discrete sum as continuous integral.



In this blog post we'll derive the partial summation formula, and show a simple example of its power.

The video for this post is at [youtube], and the slides are here [pdf].


A Useful Object

The following is a sum over an arithmetic function a(n) weighted by a smooth function f(x)

x1<nx2a(n)f(n)

This is a useful general object to find an integral form for, because it gives us flexibility in choosing the arithmetic and smooth functions. 

To be more precise about the functions, a(n) takes only positive integers n1, and f(x) has a continuous derivative over the domain [x1,x2]. Both a(n) and f(x) can be complex.


Deriving An Integral Form

If you look ahead, the derivation of an integral form for the sum might look overwhelming, but it is just lots of simple algebra.

Because a(n) is only defined over integers n1, we can clarify the sum by setting m1=x1  and m2=x2. Remember that x  is the largest integer up to, and including, x.

x1<nx2a(n)f(n)=n=m1+1m2a(n)f(n)

Let's define A(x)=nxa(n). By definition a(n)=A(n)A(n1) so we can replace a(n).

m1+1m2a(n)f(n)=m1+1m2[A(n)A(n1)]f(n)=m1+1m2A(n)f(n)m1m21A(n)f(n+1)

The two sums have different limits for n, but both cover [m1+1,m21].

m1+1m2a(n)f(n)=m1+1m21A(n)[f(n)f(n+1)]+A(m2)f(m2)A(m1)f(m1+1)

Noticing that nn+1f(t)dt=f(n+1)f(n) allows us to introduce the integral.

m1+1m2a(n)f(n)=m1+1m21A(n)nn+1f(t)dt+A(m2)f(m2)A(m1)f(m1+1)

Now, because A(t)=A(n) over the interval [n,n+1), we can move A(n) it inside the integral as A(t).

m1+1m2a(n)f(n)=m1+1m21nn+1A(t)f(t)dt+A(m2)f(m2)A(m1)f(m1+1)

That sum of integrals over consecutive intervals can be simplified to a single integral. 

x1<nx2a(n)f(n)=m1+1m2A(t)f(t)dt+A(m2)f(m2)A(m1)f(m1+1)

We now need to adjust the integration limits back to x1 and x2, not forgetting the intervals to m1 and m2. Writing out the integrals that split the interval [x1,x2] is helpful.

x1x2Xdt=m1+1m2Xdt+x1m1+1Xdt+m2x2Xdt

Rearranging, and noticing that A(t)=A(x1) over the interval [x1,m1+1), and A(t)=A(x2) over the interval [m2,x2], gives us the following.

m1+1m2A(t)f(t)dt=x1m1+1A(t)f(t)dt+m2x2A(t)f(t)dtx1x2A(t)f(t)dt=A(x1)[f(m1+1)f(x1)]+A(x2)[f(x2)f(m2)]x1x2A(t)f(t)dt

We plug this integral back into our object, then use A(m1)=A(x1) and A(m2)=A(x2).

x1<nx2a(n)f(n)=A(x2)f(m2)A(x1)f(m1+1)+A(x1)[f(m1+1)f(x1)]+A(x2)[f(x2)f(m2)]x1x2A(t)f(t)dt

Many of these terms cancel out, leaving us with the much neater Abel Identity, as it is also called.

x1<nx2a(n)f(n)=A(x2)f(x2)A(x1)f(x1)x1x2A(t)f(t)dt

In many cases n starts at 1, and so the formula reduces further. 

1nx2a(n)f(n)=A(x2)f(x2)1x2A(t)f(t)dt

The lower limit of the integral is 1 because A(t)=0 in the range [0,1).


Example: Growth of 1/n

Abel's identity is particularly useful for clarifying the asymptotic behaviour of discrete functions. Let's illustrate its use to show the how the harmonic series 1N1/n grows.

We can choose a(n)=1 and f(x)=1/x, which means A(x)=x  and f(x)=1/x2

nN1n=A(N)f(N)1NA(t)ft)=N{N}N+1Nt{t}t2dt

We've used x=x{x}, where {x} is the fractional part of x. This also lets us split the integral into manageable parts.

0<nN1n=1+O(1N)+1N1tdt1N{t}t2dt

Because {t} is only ever in the range [0,1), the last integral is always less than 1N1/t2dt, that is, O([1/t]1N).

0<nN1n=1+O(1N)+ln(N)+O(11N)=ln(N)+O(1)

This tells us the harmonic series grows like ln(N), a powerful result derived easily using the Abel identity. It also tells us the difference is bounded by 1. In fact, the difference tends to the Euler–Mascheroni constant γ0.5772 which pops up in several areas of number theory and analysis.

Convergence Of Dirichlet Series

The Riemann Zeta series is an example of a Dirichlet series.

ζ(s)=1ns=11s+12s+13s+14s+

Dirichlet series have the general form an/ns, in contrast to the more familiar power series anzn.



In this blog post we'll explore when these series converge, first looking at absolute convergence, and then more general convergence.


The video for this post is at [youtube], and the slides are here [pdf].


Abscissa Of Absolute Convergence

A series converges absolutely even when all its terms are replaced by their magnitudes, sometimes called absolute values. This is quite a strong condition, and not all series that converge do so absolutely.

Let's assume a Dirichlet series converges absolutely at s1=σ1+it1, and consider another point s2=σ2+it2 where σ2σ1. On the complex plane, s2 is to the right of s1.

Now let's compare the magnitudes of the terms in this series at s1 and s2. Remember nσ+it=nσeitlnn, and because the magnitude of any eiθ is 1, we can simplify |nσ+it|=nσ.

|anns1|=|an|nσ1|an|nσ2=|anns2|

This is simply telling us that the magnitude of each term in the series at s2 is less than or equal to the magnitude of the same term at s1. So if the series converges at s1, it must also converge at s2. More generally, the series converges at any s=σ+it where σσ1

If our series doesn't converge everywhere, the s for which it diverges must therefore have σ<σ1. We can see there must be a minimum σa, called the abscissa of absolute convergence, such that the series converges for all σ>σa

Notice how absolute convergence depends only on the real part of s. Working out the domain of convergence along the real line automatically gives us the domain of convergence in the complex plane.

For example, in we previously showed the series 1/nσ converges for real σ>1. We also know the series diverges at σ=1. These two facts allow us to say σa=1, and so the series converges for all complex s=σ+it where σ>1.

It's interesting that the region of convergence for a Dirichlet series is a half-plane, whereas the region for the more familiar power series anzn is a circle.


Abscissa Of Convergence

Absolute convergence is easier to explore as we don't need to consider the effect of complex terms which contribute a negative amount to the overall magnitude of the series. For example, a term eiπ=1 can partially cancel the effect of a term 2ei2π=+2. This cancelling effect can mean some series do converge, even if not absolutely.

Our strategy, inspired by Apostol, will be to show that if a Dirichlet series is bounded at s0=σ0+it0 then it is also bounded at s=σ+it, where σ>σ0, and then push a little further to show it actually converges at that s

Let's start with a Dirichlet series an/ns that we know has bounded partial sums at a point s0=σ0+it0 for all x1

|nxanns0|M

Being bounded is not as strong a requirement as convergence, the partial sums could oscillate for example.

We'll use Abel's partial summation formula, explained in a later blog, which relates a discrete sum to a continuous integral.

x1<nx2bnf(n)=B(x2)f(x2)B(x1)f(x1)x1x2B(t)f(t)dt

Because we're comparing to s0, we'll define f(x)=xs0s and bn=an/ns0. Here B(x) is defined as nxbn, and so |B(x)|M.

x1<nx2anns=x1<nx2bnf(n)=B(x2)x2ss0B(x1)x1ss0+(ss0)x1x2B(t)tss0+1dt

We now consider the magnitude of the series, which is never more than the sum of the magnitudes of its parts, and make use of |B(x)|M

|x1<nx2anns||B(x2)x2ss0|+|B(x1)x1ss0|+|(ss0)x1x2B(t)tss0+1dt|Mx2σ0σ+Mx1σ0σ+|ss0|Mx1x2tσ0σ1dt

Because x1<x2, we can say Mx2σ0σ+Mx1σ0σ<2Mx1σ0σforσ>σ0. Despite appearances, evaluating the integral is easy.

|x1<nx2anns|2Mx1σ0σ+|ss0|M(x2σ0σx1σ0σσ0σ)2Mx1σ0σ(1+|ss0|σσ0)

The last step uses |x2σ0σx1σ0σ|=x1σ0σx2σ0σ<x1σ0σ<2x1σ0σ

The key point is that x1<nx2an/ns is bounded if nxan/ns0 is bounded, where σ>σ0.

Let's see if we can push this result about boundedness to convergence. 

|x1<nx2anns|2Mx1σ0σ(1+|ss0|σσ0)=Kx1σ0σ

Here K doesn't depend on x1. If we let x1 then Kx1σ0σ0, which means the magnitude of the tail of the infinite sum an/ns diminishes to zero, and so the series is not just bounded, it also converges. 

Let's summarise our results so far:

  • If nxan/ns0 is bounded, the infinite sum an/ns converges for σ>σ0.
  • With the special case of s0=0, if nxan is bounded, the infinite sum an/ns converges for σ>0.

The special case is particularly useful as we can sometimes say whether a series converges for σ>0 just by looking at the coefficients an.

Following the same logic as for σa, it is clear there is an abscissa of convergence σc where a Dirichlet series converges for σ>σc, and diverges for σ<σc.


Maximum Difference Between σc And σa

We know that not all convergent series are absolutely convergent, so we can say σaσc. We shouldn't have to increase σ by too much before a conditionally convergent series converges absolutely.

If a series converges at s0, the magnitude of terms is bounded. We can call this bound C.

|anns|=|anns01nss0|C1nσσ0

We know series of the form nσ0σ only converge for σ0σ>1, so we can say if σ is larger than σc by at least 1, the series converges absolutely.

0σaσc1


Example: Alternating Zeta Function

Let's apply our results to the alternating zeta function, also called the eta function.

η(s)=(1)n+1ns=11s12s+13s14s+

At s0=0 the partial sum nx(1)n+1 oscillates but is always bounded 1, and so (1)n+1/ns converges for σ>0


Wednesday, 23 June 2021

Into The Complex Domain

The Riemann Zeta function encodes information about the primes. We've already seen how we can use it to easily prove there are infinite primes, and that the prime harmonic series diverges, telling us the primes aren't so sparse.



ζ(s)=n1ns=p(11ps)1

We've thought about the zeta function with s as an integer, for example the harmonic series with s=1, and the Basel problem with s=2. We've also considered s as a real number, and asked whether the zeta function converges for values of s between 1 and 2. We proved the zeta function converges for s>1.

Riemann was the first to consider s as a complex number. If we think the zeta function over the complex domain might reveal new insights into the primes, we need to understand how it behaves. Exploring where it converges is a good start.


The video for this blog is at [youtube]. Slides are online [pdf].


Convergence For σ>1

It has become tradition to write complex s as s=σ+it, where σ is the real part of s.

1ns=1nσ+it=1nσ1nit

Let's look at the series with each term replaced by its magnitude. 

|1ns|=|1nσ1nit|

Rewriting nit as eitln(n) makes clear it has a magnitude of 1.

|1ns|=1nσ

We've already shown 1/nσ converges for σ>1 where σ is real. This tells us 1/ns converges absolutely for for σ>1. Since absolute convergence implies convergence, we can say 1/ns converges whenever the real part of s is more than 1, that is σ>1.


Divergence For σ0

Let's look again at the terms in the sum.

|1ns|=|1nσ1nit|=1nσ

If σ<0, the magnitude of the terms grows larger than 1. If σ=0, the magnitude of each term is exactly 1. For any series to converge, a necessary requirement is that the terms get smaller towards zero. This means 1/ns diverges for σ0.


Divergence For σ<1

Having found the Riemann Zeta function converges for σ>1, and diverges for σ0, we are left with a gap 0<σ1. To fill this gap we need to understand more generally when series of the form an/ns, called Dirichlet series, converge or diverge. 

A later blog post will explain how Dirichlet series converge in half-planes to the right of an abscissa of convergence σc. That is, they converge at an s=σ+it where σ>σc.

Because ζ(s) converges for σ>1, and we know it diverges at s=1+0i, the abscissa of absolute convergence is at σc=1. We can now say ζ(s) diverges for σ<1.


Visualising The Zeta Function For s>1

Visualising a function helps us understand it. The plot below shows a surface plot of ln|ζ(s)| for σ>1. Because the values of ζ(s) are complex, it is easier to plot the magnitude of ζ(s). Taking the logarithm scales down very large values to better show the function's features. 

The spike around s=1+0i corresponds to the divergent harmonic series ζ(1).



The surface seems to smooth out to the right as σ grows larger. Again, it is natural to ask what value it approaches. Let's look again at the terms of the sum ζ(s).

ζ(s)=11s+12s+13s+14s+

As s all the terms 1/ns0, except the first term which remains 1. To be more precise, the magnitude of each term |ns|=nσ tends to zero as σ for all n>1. That means the magnitude of the function ζ(s)1 as σ.


Hints The Function Extends Into σ1

Looking again at the plot above, we can see that aside from s=1+0i, the function doesn't seem to diverge along the line s=1+it. It looks like the surface has been prematurely cut off, and would continue smoothly into σ1 if allowed.

The plot below shows a contour plot of ln|ζ(s)|. The contours do indeed appear artificially cut off, as if they should continue smoothly to the left of σ=1.



The intuition that a function should continue smoothly without abrupt changes corresponds to a powerful property of many functions we come across. If we can show ζ(s) is one of these well-behaved functions, we are then justified in trying to extend it to the left of σ=1.