Kempner series


The Kempner series is a modification of the harmonic series, formed by omitting all terms whose denominator expressed in base 10 contains the digit 9. That is, it is the sum
where the prime indicates that n takes only values whose decimal expansion has no nines. The series was first studied by A. J. Kempner in 1914. The series is counterintuitive because, unlike the harmonic series, it converges. Kempner showed the sum of this series is less than 80. Baillie showed that, rounded to 20 decimals, the actual sum is
Heuristically, this series converges because most large integers contain every digit. For example, a random 100-digit integer is very likely to contain at least one '9', causing it to be excluded from the above sum.
Schmelzer and Baillie found an efficient algorithm for the more general problem of any omitted string of digits. For example, the sum of where n has no instances of "42" is about. Another example: the sum of where n has no occurrence of the digit string "314159" is about..

Convergence

Kempner's proof of convergence is repeated in many textbooks, for example Hardy and Wright and Apostol. We group the terms of the sum by the number of digits in the denominator. The number of n-digit positive integers that have no digit equal to '9' is 8 × 9n−1 because there are 8 choices for the first digit, and 9 independent choices for each of the other n−1 digits. Each of these numbers having no '9' is greater than or equal to 10n−1, so the reciprocal of each of these numbers is less than or equal to 101−n. Therefore, the contribution of this group to the sum of reciprocals is less than 8 × n−1. Therefore the whole sum of reciprocals is at most
The same argument works for any omitted non-zero digit. The number of n-digit positive integers that have no '0' is 9n, so the sum of where n has no digit '0' is at most
The series also converge if strings of k digits are omitted, for example if we omit all denominators that have a decimal substring of 42. This can be proved in almost the same way. First we observe that we can work with numbers in base 10k and omit all denominators that have the given string as a "digit". The analogous argument to the base 10 case shows that this series converges. Now switching back to base 10, we see that this series contains all denominators that omit the given string, as well as denominators that include it if it is not on a "k-digit" boundary. For example, if we are omitting 42, the base-100 series would omit 4217 and 1742, but not 1427, so it is larger than the series that omits all 42s.
Farhi considered generalized Kempner series, namely, the sums S of the reciprocals of the positive integers that have exactly n instances of the digit d where 0 ≤ d ≤ 9. He showed that for each d the sequence of values S for n ≥ 1 is decreasing and converges to 10 ln 10. The sequence is not in general decreasing starting with n = 0; for example, for the original Kempner series we have S ≈ 22.921 < 23.026 ≈ 10 ln 10 < S for n ≥ 1.

Approximation methods

The series converges extremely slowly. Baillie remarks that after summing 1024 terms the remainder is still larger than 1.
The upper bound of 80 is very crude, and Irwin showed by a slightly finer analysis of the bounds that the value of the Kempner series is near 23, since refined to the value above, 22.92067...
Baillie developed a recursion that expresses the contribution from each -digit block in terms of the contributions of the k-digit blocks for all choices of omitted digit. This permits a very accurate estimate with a small amount of computation.

Name of this series

Most authors do not name this series. The name "Kempner series" is used in MathWorld and in Havil's book Gamma on the Euler–Mascheroni constant.