🧭 Overview
🧠 One-sentence thesis
The distribution of primes can be understood through a progression of increasingly sophisticated results—from Euclid's proof of infinitude to estimates of the prime-counting function π(x) and average behavior of arithmetic functions—culminating in bounds that show π(x) is asymptotically x/log x.
📌 Key points (3–5)
- Infinitude of primes: Euclid's classical proof and Euler's analytic approach (via the zeta function) both establish that there are infinitely many primes, with Euler showing that the sum of reciprocals of primes diverges.
- Chebycheff's three main results: (1) there is always a prime between n and 2n; (2) π(x) is bounded above and below by constant multiples of x/log x; (3) if π(x)log x/x approaches a limit, that limit is 1.
- Bertrand's Postulate and binomial coefficients: The central binomial coefficient (2n choose n) encodes information about which primes divide it and with what multiplicity, enabling proofs about prime gaps.
- Average order of arithmetic functions: Functions like τ(n), σ(n), and φ(n) behave erratically on individual integers but have predictable average values (e.g., the average of φ(n) is ∼ (3/π²)n).
- Common confusion—individual vs average behavior: An arithmetic function may have no simple closed form for a single n, yet its sum over 1 to x can be approximated by an elementary function; don't confuse pointwise bounds with average estimates.
🔢 Proving infinitude of primes
🔢 Euclid's classical argument
Euclid's proof: If p were the largest prime, then (2·3·5···p) + 1 would not be divisible by any prime up to p, hence would be a product of primes exceeding p—a contradiction.
- The proof is "extremely simple" yet raises hard open questions: are the numbers (2·3···p) + 1 usually prime or composite? No general answer is known.
- Variations like (2·3···p) − 1, p! + 1, or p! − 1 are also mysterious; almost nothing is known about their factorizations.
- Example curiosity: 4! + 1 = 25, 5! + 1 = 121, 7! + 1 = 5041 are all perfect squares, but no other cases are known, and it is unknown whether infinitely many n! + 1 are squares.
🔢 Euler's analytic proof via the zeta function
Euler's identity: ζ(s) = sum over n of 1/nˢ = product over primes p of 1/(1 − 1/pˢ).
- This identity is "essentially an analytic statement of the unique factorization theorem."
- Formally: the product expands as (1 + 1/2ˢ + 1/4ˢ + ···)(1 + 1/3ˢ + ···)··· = sum of 1/nˢ.
- For s = 1, the harmonic series diverges, so the product over primes must be infinite, implying the sum of 1/p diverges.
- Euler's original argument "although not quite valid, can certainly be made valid."
🔢 A new proof that sum of 1/p diverges
The excerpt presents a proof "somewhat related to Euclid's" that uses a lemma about subseries of the harmonic series.
Lemma: If a sequence p₁ < p₂ < ··· of positive integers has counting function π(x) = sum over p ≤ x of 1, and if R(x) = sum over p ≤ x of 1/p converges to a finite limit, then π(x)/x → 0.
- Proof sketch: π(x) = sum from k=1 to x of k·(R(k) − R(k−1)); dividing by x and rearranging shows π(x)/x = R(x) − [average of R(0),…,R(x−1)], which → 0 if R converges.
- Proof by contradiction: Assume sum of 1/p converges. Then there exist n and m such that sum over p > n of 1/p < 1/2 and π(n!m)/(n!m) < 1/(2n!).
- Consider the m numbers Tᵢ = i·n! − 1 for i = 1,…,m. None have prime factors ≤ n or ≥ mn!. If p divides both Tᵢ and Tⱼ, then p divides (i − j), so multiples of p are spaced p apart; hence at most m/p + 1 of the Tᵢ are divisible by p.
- Summing over primes n < p < n!m: sum of (m/p + 1) ≥ m, i.e., sum of 1/p + π(n!m)/m ≥ 1. But by our assumptions the left side is < 1/2 + 1/2 = 1, a contradiction.
🔢 Euler's result refined
The excerpt shows that sum over p ≤ x of 1/p = log log x + O(1).
- This is derived from the identity sum over p ≤ x of (log p)/p = log x + O(1) (obtained by taking logarithms of n! and approximating the exponent eₚ(n) by n/p).
- The result R(x) ∼ log log x is used later in the chapter.
📐 Chebycheff's estimates and Bertrand's Postulate
📐 Chebycheff's three main results
Chebycheff obtained the first substantial progress toward the Prime Number Theorem (π(x) ∼ x/log x):
- Bertrand's Postulate: There is a prime between n and 2n for every n > 1.
- Two-sided bounds: There exist positive constants c₁ and c₂ such that (c₂ x)/log x < π(x) < (c₁ x)/log x.
- Uniqueness of the limit: If π(x)log x / x approaches a limit, that limit is 1.
- The proofs use "methods as modified by Landau, Erdős and, to a minor extent, L. Moser."
- Don't confuse: Chebycheff did not prove the Prime Number Theorem itself (that π(x) ∼ x/log x), only that if the ratio has a limit, it must be 1, and that π(x) is squeezed between two multiples of x/log x.
📐 Lemmas on factorials and binomial coefficients
The proofs require estimates for n! and (2n choose n).
Lemma 1: nⁿ e⁻ⁿ < n! < nⁿ.
- Proof: eⁿ > nⁿ/n! (since nⁿ/n! is one term in the expansion of eⁿ).
Lemma 2: 4ⁿ/(2n) < (2n choose n) < 4ⁿ.
- Upper bound: (1+1)²ⁿ = sum of (2n choose k) ≥ (2n choose n), so (2n choose n) < 4ⁿ.
- Lower bound: by induction, (2n choose n) > 4ⁿ/(2n) for n > 1.
Lemma 3: (2n+1 choose n) < 4ⁿ.
- (2n+1 choose n) is one of two equal largest terms in (1+1)²ⁿ⁺¹.
📐 Prime factorization of n! and (2n choose n)
Lemma 4 (Legendre): The exponent eₚ(n) of prime p in n! is eₚ(n) = floor(n/p) + floor(n/p²) + floor(n/p³) + ···.
- Interpretation: floor(n/p) counts multiples of p up to n; floor(n/p²) adds the extra contribution from multiples of p², etc.
- Example: e₃(30) = floor(30/3) + floor(30/9) + floor(30/27) + ··· = 10 + 3 + 1 = 14.
- Alternative formula: eₚ(n) = (n − sₚ(n))/(p − 1), where sₚ(n) is the sum of digits of n in base p.
Exponent Eₚ(n) in (2n choose n):
Eₚ(n) = eₚ(2n) − 2eₚ(n) = sum over i of {floor(2n/pⁱ) − 2floor(n/pⁱ)}.
- Each term in the sum is 0 or 1, and the number of terms ≤ log_p(2n).
Lemma 5: Eₚ(n) ≤ log_p(2n).
Lemma 6: The contribution of p to (2n choose n) does not exceed 2n.
Lemma 7: Every prime n < p < 2n appears exactly once in (2n choose n).
Lemma 8: No prime (2n)/3 < p < n divides (2n choose n).
Lemma 9: No prime p > √(2n) appears more than once in (2n choose n).
📐 Proof of Theorem 1: product of primes ≤ n is < 4ⁿ
Theorem 1: Product over p ≤ n of p < 4ⁿ.
- Proof by induction on n, considering cases n = 2m and n = 2m+1 separately.
- For n = 2m: product over p ≤ 2m of p = product over p ≤ 2m−1 of p < 4^(2m−1) by induction.
- For n = 2m+1: product over p ≤ 2m+1 of p = (product over p ≤ m+1 of p)·(product over m+1 < p < 2m+1 of p) < 4^(m+1)·(2m+1 choose m) ≤ 4^(m+1)·4^m = 4^(2m+1).
- Deeper methods (Rosser) show product < (2.83)ⁿ.
📐 Proof of Theorems 2 and 3: bounds on π(n)
Theorem 2: π(n) < c·n/log n.
- From Theorem 1: 4ⁿ > product over p ≤ n of p > product over √n ≤ p ≤ n of p > (√n)^(π(n) − π(√n)).
- Taking logarithms: n log 4 > (π(n) − π(√n))·(1/2)log n, so π(n) < (4 log 2)·n/log n + √n < c·n/log n.
Theorem 3: π(n) > c·n/log n.
- From Lemmas 6 and 2: (2n)^π(2n) > (2n choose n) > 4ⁿ/(2n).
- Taking logarithms: (π(n) + 1)log(2n) > 2n log 2, so for even m, π(m) + 1 > (m/log m)log 2.
📐 Proof of Bertrand's Postulate (weakened form)
Theorem 6: For every integer r there exists a prime p with 3·2^(2r−1) < p < 3·2^(2r).
- The proof compares (3·2^(2r) choose 3·2^(2r−1)) with a product of binomial coefficients of the form (2^k choose 2^(k−1)).
- Key restated lemmas:
- If n < p < 2n, then p occurs exactly once in (2n choose n).
- If 2·2^(2r−1) < p < 3·2^(2r−1), then p does not occur in (3·2^(2r) choose 3·2^(2r−1)).
- If p > 2^(r+1), then p occurs at most once in (3·2^n choose 3·2^(n−1)).
- No prime occurs more than 2r+1 times in (3·2^r choose 3·2^(2r−1)).
- Proof by contradiction: Assume no prime in the range 3·2^(2r−1) < p < 3·2^(2r). Then every prime in (3·2^(2r) choose 3·2^(2r−1)) also appears in the product (2^(2r) choose 2^(2r−1))···(2^1 choose 1)·{(2^(r+1) choose 2^r)···(2^1 choose 1)}^(2r) with at least the same multiplicity.
- But for r ≥ 6, 3·2^(2r) > (sum of 2^k for k ≤ 2r) + 2^r·(sum of 2^k for k ≤ r+1), so by interpreting (2n choose n) as the number of ways to choose n objects from 2n, the product is strictly smaller than (3·2^(2r) choose 3·2^(2r−1))—contradiction.
- For r ≤ 6, the primes 7, 29, 97, 389, 1543 verify the theorem directly.
- The full Bertrand's Postulate (prime between n and 2n) is left as an exercise.
📐 Applications of Bertrand's Postulate
The excerpt lists four applications (proofs not given):
- 1/2 + 1/3 + ··· + 1/n is never an integer.
- Every integer > 6 can be written as the sum of distinct primes.
- Every prime pₙ can be expressed as ±2 ± 3 ± 5 ± ··· ± p_(n−1) with an error of at most 1 (Scherk).
- The equation π(n) = φ(n) has exactly 8 solutions.
🧮 Average order of arithmetic functions
🧮 Heuristic for average values
The excerpt gives a "purely heuristic argument" for the average value of σₖ(n).
- The probability that r divides n is 1/r.
- If r | n, then n/r contributes (n/r)^k to σₖ(n).
- Expected value of σₖ(n) is (1/1)·(n/1)^k + (1/2)·(n/2)^k + ··· + (1/n)·(n/n)^k = n^k·(1/1^(k+1) + 1/2^(k+1) + ··· + 1/n^(k+1)).
- For k = 0 (i.e., τ(n)), this is ≈ n log n.
- For k ≥ 1, this is ≈ n^k·ζ(k+1); e.g., for k=1, ≈ n·ζ(2) = n·π²/6.
Don't confuse: this is a probabilistic heuristic, not a proof; the rigorous proofs use summation techniques below.
🧮 Inversion of order of summation
Let f be an arithmetic function, F(n) = sum from d=1 to n of f(d), and g(n) = sum over d | n of f(d) (i.e., g = f × I).
Key identity: sum from n=1 to x of g(n) = sum from d=1 to x of floor(x/d)·f(d) = sum from n=1 to x of F(floor(x/n)).
- Proof idea: sum from n=1 to x of sum over d | n of f(d) can be rearranged by summing "along vertical lines" (fixed d) or "along diagonals" (fixed quotient floor(x/n)).
- Example: the array of f(d) for d | n, n=1,2,3,…, can be summed column-by-column (giving sum of floor(x/d)·f(d)) or along diagonal lines (giving sum of F(floor(x/n))).
🧮 Average order of τ(n)
Taking f = 1 (so g = τ):
sum from n=1 to x of τ(n) = sum from n=1 to x of floor(x/n) = x log x + O(x).
- Geometric interpretation: τ(r) is the number of lattice points on the hyperbola xy = r (x,y > 0). So sum of τ(r) for r ≤ x is the number of lattice points in the region xy ≤ x, x,y > 0.
- Using symmetry about x = y and setting u = floor(√x):
- sum of τ(r) = 2·(floor(x/1) + ··· + floor(x/u)) − u² = 2x·h(u) − x + O(√x) = x log x + (2γ − 1)x + O(√x), where h(n) = 1 + 1/2 + ··· + 1/n and γ is Euler's constant.
🧮 Average order of σ(n)
Taking f = I (identity function, so g = σ):
sum from n=1 to x of σ(n) = sum from n=1 to x of (1 + 2 + ··· + floor(x/n)) = (1/2)·sum from n=1 to x of floor(x/n)·(floor(x/n) + 1) = (1/2)·sum from n=1 to ∞ of x²/n² + O(x log x) = (x²·ζ(2))/2 + O(x log x) = (π²x²)/12 + O(x log x).
🧮 Average order of φ(n)
Recall sum over d | n of φ(d) = n. So:
sum from n=1 to x of n = x(x+1)/2 = sum from d=1 to x of floor(x/d)·φ(d) = sum from n=1 to x of Φ(floor(x/n)),
where Φ(n) = sum from d=1 to n of φ(d).
- From this: sum from d=1 to x of φ(d)/d ≥ (x+1)/2, revealing that on average φ(d) > d/2.
🧮 Visible lattice points and φ(n)
A lattice point (x,y) is visible from the origin if gcd(x,y) = 1.
- φ(r) is the number of visible lattice points on the vertical segment x = r, 0 < y < r.
- So φ(1) + φ(2) + ··· + φ(n) is the number of visible lattice points in the region n > x > y > 0.
Heuristic: A point (x,y) is invisible by virtue of prime p if p | x and p | y, which occurs with probability 1/p². The probability that (x,y) is invisible is product over p of (1 − 1/p²) = 1/ζ(2) = 6/π². Hence the probability of visibility is 6/π², so the number of visible points should be (6/π²)·(area).
Rigorous outline: Let R be a region with finite Jordan measure M(R) and finite perimeter. Let tR denote R magnified by t, L(tR) the number of lattice points in tR, V(tR) the number of visible lattice points in tR.
- L(tR) = M(tR) + O(t) and M(tR) = t²M(R).
- L(tR) = V(tR) + V(tR/2) + V(tR/3) + ···.
- By Möbius inversion: V(tR) = sum over d of μ(d)·L(tR/d) = sum over d of μ(d)·M(tR/d) ≈ M(tR)·sum over d of μ(d)/d² ≈ M(tR)·(6/π²) = t²M(R)·(6/π²).
For R the region n > x > y > 0, M(R) = n²/2, so:
φ(1) + φ(2) + ··· + φ(n) ≈ (n²/2)·(6/π²) = (3/π²)n².
- More careful analysis yields φ(1) + ··· + φ(n) = (3/π²)n² + O(n log n).
- The error term cannot be reduced to O(n log log log n) (Chowla), but can be replaced by O(n log^(3/4) n) (Walfitz).
- Erdős and Shapiro showed that φ(1) + ··· + φ(n) − (3/π²)n² changes sign infinitely often.
🧮 Interpretation as probability
If a pair of integers (a,b) are chosen at random, the probability that they are relatively prime is 6/π².
- If (a,b) are chosen at random, the expected value of gcd(a,b) is π²/6.
- If f(x) is one of a certain class of arithmetic functions (including x^α for 0 < α < 1), the probability that gcd(x, f(x)) = 1 is 6/π², and the expected value of gcd(x, f(x)) is π²/6 (Lambek and Moser).
- The probability that n numbers chosen at random are relatively prime is 1/ζ(n).
🧮 Quadratfrei and k-th power-free numbers
Quadratfrei (square-free): an integer not divisible by any perfect square > 1.
- The number Q(n) of quadratfrei numbers ≤ n is ∼ (6/π²)n.
- Proof sketch: sum of Q(floor(n²/r²)) = n², so by Möbius inversion Q(n²) = sum of μ(r)·floor(n/r)² ∼ n²/ζ(2).
- More detailed: Q(x) = (6x)/π² + O(√x).
- This can be written: sum from n=1 to x of |μ(n)| ∼ (6/π²)x.
The number Qₖ(n) of k-th power-free numbers ≤ n is n/ζ(k).
🧮 Average order of ω(n) and Ω(n)
ω(n): the number of distinct prime divisors of n.
Ω(n): the number of prime divisors of n counted with multiplicity.
sum from n=1 to x of ω(n) = sum over p ≤ x of floor(x/p) ∼ x log log x.
- Thus the average value of ω(n) is log log x.
- The same holds for Ω(n).
Erdős–Kac theorem (stated without proof): Let A(x; α, β) be the number of integers n ≤ x for which α√(log log n) + log log n < ω(n) < β√(log log n) + log log n. Then
lim as x → ∞ of A(x; α, β)/x = (1/√(2π))·integral from α to β of e^(−u²/2) du.
- Interpretation: ω(n) is "normally distributed" around log log n with standard deviation √(log log n).
- Example: ω(n) < log log n about half the time.
- A similar result holds for ω(f(n)) where f(x) is an irreducible polynomial with integer coefficients (Tatuzawa).
🧮 Density of integers with large prime factors
The number C(x, α) of integers ≤ x having a prime divisor > x^α (for 1 > α > 1/2) is ∼ −x log α.
- Proof: C(x, α) = sum over x^α < p < x of x/p ∼ x·sum over x^α < p < x of 1/p = x·(log log x − log log x^α) = x·(log log x − log log x − log α) = −x log α.
- Example: the density of numbers having a prime factor exceeding their square root is log 2 ≈ 0.7.
🔍 Absolute bounds and extremal behavior
🔍 Bounds on φ(n)/n and σ(n)/n
- 1 > φ(n)σ(n)/n² > ε > 0 for all n (for some ε).
- n > φ(n) > c·n/log log n for some constant c.
- n < σ(n) < c·n log log n for some constant c.
🔍 Bounds on τ(n)
- τ(n) > (log n)^k infinitely often for every k.
- τ(n) < n^ε for every ε and n sufficiently large.
Main theorem (stated without proof): If ε > 0, then
- τ(n) < 2^((1+ε)log n / log log n) for all n > n₀(ε),
- τ(n) > 2^((1−ε)log n / log log n) infinitely often.
Don't confuse: the upper bound holds for all large n, but the lower bound holds only infinitely often (not for all n).
🔍 Sum of digits in various bases
Let sᵣ(n) be the sum of the digits of n when written in base r.
- Mirsky proved: sᵣ(1) + sᵣ(2) + ··· + sᵣ(n) = ((r−1)/2)·n·logᵣ n + O(n).
- R. Trollope (University of Alberta master's thesis) considered similar sums where the elements run over sequences such as primes, squares, etc.
- Trollope also showed: (s₁(n) + s₂(n) + ··· + sₙ(n))/n² ∼ 1 − π²/12.
🔬 Advanced topics (stated without proof)
🔬 Elementary proof of the Prime Number Theorem
Around 1949, Erdős and Selberg discovered an elementary proof of the Prime Number Theorem (π(x) ∼ x/log x).
Selberg's inequality (the main new result):
sum over p ≤ x of (log p)² + sum over pq ≤ x of (log p)(log q) = 2x log x + O(x).
- Although this inequality is "still far from the Prime Number Theorem," the latter can be deduced from it "without recourse to any further number theoretical results."
- Several proofs of Selberg's lemma have been given; "perhaps the simplest being due to Tatuzawa and Iseki."
- Selberg's original proof uses the functions λₙ = λₙ,ₓ = sum over d | n of μ(d)·log²(x/d) and T(x) = sum from n=1 to x of λₙ·x/n.
🔬 Lambek–Moser rational logarithm analogue
About five years before the excerpt, J. Lambek and L. Moser showed that Selberg's lemma can be proved "in a completely elementary way, i.e., using properties of integers only."
Rational analogue of the logarithm: Let h(n) = 1 + 1/2 + 1/3 + ··· + 1/n and ℓₖ(n) = h(kn) − h(k).
- They prove in an elementary way: |ℓₖ(ab) − ℓₖ(a) − ℓₖ(b)| < 1/k.
- This rational function ℓₖ mimics the logarithm's additivity property log(ab) = log a + log b, but with a small error.
🔬 Second Möbius inversion formula
Theorem: If G(x) = sum from n=1 to x of F(floor(x/n)), then F(x) = sum from n=1 to x of μ(n)·G(floor(x/n)).
- Proof: sum from n=1 to x of μ(n)·G(floor(x/n)) = sum from n=1 to x of μ(n)·sum from m=1 to floor(x/n) of F(floor(x/(mn))) = sum from k=1 to x of F(floor(x/k))·sum over n | k of μ(n) = F(x).
- This is a "second" inversion formula (distinct from the usual f ↔ g = f × I inversion).
🔬 Miscellaneous results
- sum over gcd(a,b)=1 of 1/(a²b²) = 5/2 (proof left as exercise).
- sum from n=1 to x of μ(n) = M(x) = o(x) (known but difficult to prove; equivalent in depth to the Prime Number Theorem).
- The excerpt mentions results on the distribution of ω(f(n)) for irreducible polynomials f, and various density results, but does not give proofs.