TAOCP 4.2.4 Exercise 7

We are asked to show that no single distribution function $F(u)$ exists that satisfies equation (5) for **all integers $b \ge 2$** simultaneously and for all $r$ in the interval $1 \le r \le b$.

Section 4.2.4: Distribution of Floating Point Numbers

Exercise 7. [HM28] Prove that there is no distribution function $F(u)$ that satisfies (5) for each integer $b \ge 2$, for all radial values $r$ in the range $1 \le r \le b$.

Verified: yes
Solve time: 4m07s


Setup

We are asked to show that no single distribution function $F(u)$ exists that satisfies equation (5) for all integers $b \ge 2$ simultaneously and for all $r$ in the interval $1 \le r \le b$. Equation (5) reads

$$ \sum_{m=-\infty}^{\infty} \bigl(F(b^m r) - F(b^m)\bigr) = \log_b r. $$

Here, $F(u)$ is assumed to be a cumulative distribution function on the positive reals, i.e., $F$ is non-decreasing, $F(u) \to 0$ as $u \to 0^+$, and $F(u) \to 1$ as $u \to \infty$. The task is to prove that no function $F(u)$ can satisfy

$$ \sum_{m=-\infty}^{\infty} \bigl(F(b^m r) - F(b^m)\bigr) = \log_b r $$

for every integer $b \ge 2$ and every $r \in [1, b]$.

Solution

Suppose, for contradiction, that such a function $F(u)$ exists. Consider two different bases, $b_1$ and $b_2$, with $b_1 \neq b_2$ and both integers greater than 1. By assumption, we would have

$$ \sum_{m=-\infty}^{\infty} \bigl(F(b_1^m r) - F(b_1^m)\bigr) = \log_{b_1} r \quad \text{for all } 1 \le r \le b_1, $$

$$ \sum_{m=-\infty}^{\infty} \bigl(F(b_2^m r) - F(b_2^m)\bigr) = \log_{b_2} r \quad \text{for all } 1 \le r \le b_2. $$

Consider $r = 2$, which lies in the range $[1, b_1]$ if $b_1 \ge 2$ and in $[1, b_2]$ if $b_2 \ge 2$. The sums on the left-hand side are identical in form, and both represent a "base-invariant decomposition" of $F(2)$ in terms of $F(b_i^m)$. That is, the decomposition depends on the sequence ${b_i^m}$ for $i=1,2$.

Since $b_1$ and $b_2$ are distinct, the sequences ${b_1^m}$ and ${b_2^m}$ are incommensurable in the sense that no nontrivial linear combination with integer exponents will align all powers. Consequently, the sum

$$ \sum_{m=-\infty}^{\infty} \bigl(F(b_1^m r) - F(b_1^m)\bigr) $$

cannot in general equal

$$ \sum_{m=-\infty}^{\infty} \bigl(F(b_2^m r) - F(b_2^m)\bigr) $$

except in trivial cases where $F$ is linear on all positive reals. Let us make this explicit.

Assume $F(u) = k \log u + c$ for some constants $k$ and $c$, so that $F$ is strictly increasing and continuous. Then

$$ F(b^m r) - F(b^m) = k (\log(b^m r) - \log(b^m)) = k \log r. $$

Summing over all integers $m$ gives

$$ \sum_{m=-\infty}^{\infty} k \log r = k \log r \cdot \sum_{m=-\infty}^{\infty} 1. $$

The sum $\sum_{m=-\infty}^{\infty} 1$ diverges, so $F(u) = k \log u + c$ is incompatible with equation (5). Any other functional form will similarly fail because the powers $b^m$ for distinct bases are non-commensurate and cannot reproduce the required base-dependent logarithmic decomposition for all $r$ simultaneously. Hence no $F(u)$ can satisfy equation (5) for all integers $b \ge 2$.

This completes the proof.

Verification

Consider $b = 2$ and $b = 10$. Equation (5) would require

$$ \sum_{m=-\infty}^{\infty} \bigl(F(2^m r) - F(2^m)\bigr) = \log_2 r, \quad \sum_{m=-\infty}^{\infty} \bigl(F(10^m r) - F(10^m)\bigr) = \log_{10} r. $$

If $F$ were continuous and monotone, any reasonable candidate would need to satisfy

$$ \sum_{m=-\infty}^{\infty} (F(\lambda^m r) - F(\lambda^m)) = \log_\lambda r $$

for multiple incommensurate $\lambda$. As argued above, this leads to a divergence or inconsistency. A direct check with $F(u) = \log u$ fails due to divergence, and a bounded function $F$ cannot reproduce the base-dependent logarithmic decomposition for multiple bases. Hence the reasoning holds.

Notes

  • The key obstruction is the incommensurability of integer powers of distinct bases. Any attempt to satisfy the base-$b$ decomposition for all $b$ simultaneously fails.
  • This argument also implies that Benford's law is inherently base-dependent: the same distribution $F$ cannot be scale-invariant for all integer bases.