Electromagnetics

Units

1.1 Units

🧭 Overview

🧠 One-sentence thesis

Units must always be explicitly stated and consistently applied to avoid ambiguity, errors, and misunderstandings in expressing physical quantities.

📌 Key points (3–5)

What a unit is: the measure used to express a physical quantity (e.g., meters for distance).
Prefixes simplify notation: standard prefixes like "kilo-" or "milli-" scale units to keep values in a manageable range (0.001 to 10,000).
Always indicate units: omitting units is a common source of error; the same number can mean different things depending on the unit system.
Common confusion: the constant "3" in "l = 3t" is ambiguous—it could be 3 m/s or 3 km/h depending on the units of l and t; always attach units to constants.
Units enable error-checking: dimensional correctness (e.g., an electric field expression must reduce to V/m) helps catch mistakes in formulas.

📏 What units are and why they matter

📏 Definition and role

Unit: the measure used to express a physical quantity.

A physical quantity is incomplete without its unit.
Example: "6,371,000" alone is meaningless; "6,371,000 meters" specifies the mean radius of the Earth.
The excerpt emphasizes that failure to indicate units is a common source of error and misunderstandings.

🌍 Unit systems in use

The excerpt mentions three systems:

System	Description	Example base unit
SI (International System of Units)	The "metric system"; most popular for engineering	meter (m) for distance
English system	Still used in some regions and applications	mile for distance
CGS (centimeter-gram-second)	Common in physics and material science; some constants become unitless	erg for energy (not joule)

This work uses SI exclusively.
Don't confuse: the same physical constant can have different numerical values or even become unitless in different systems, so stating units is critical.

🔢 Prefixes and abbreviations

🔢 Standard prefixes

Prefixes modify units to keep numerical values in a convenient range (typically 0.001 to 10,000).

Example: Earth's radius is more commonly written as 6371 kilometers (km) rather than 6,371,000 meters.
The excerpt provides a table of standard prefixes:
- Large: exa (E, 10¹⁸), peta (P, 10¹⁵), tera (T, 10¹²), giga (G, 10⁹), mega (M, 10⁶), kilo (k, 10³)
- Small: milli (m, 10⁻³), micro (μ, 10⁻⁶), nano (n, 10⁻⁹), pico (p, 10⁻¹²), femto (f, 10⁻¹⁵), atto (a, 10⁻¹⁸)

🔤 Standard abbreviations

To avoid tedious writing, use standard abbreviations for both prefixes and base units.

Example: "6371 km" instead of "6371 kilometers."
The excerpt lists commonly-used units in electromagnetics (Table 1.2):
- Distance: meter (m)
- Time: second (s)
- Current: ampere (A)
- Charge: coulomb (C)
- Voltage: volt (V)
- Resistance: ohm (Ω)
- Frequency: hertz (Hz)
- Energy: joule (J)
- Power: watt (W)
- And others (farad, henry, tesla, weber, newton).

⚠️ Ambiguity and how to avoid it

⚠️ The problem with omitting units

Consider the expression: l = 3t (where l is length and t is time).

If l is in meters and t is in seconds, then "3" means "3 m/s."
If l is in kilometers and t is in hours, then "3" means "3 km/h"—literally a different equation.
Writing "l = 3t m/s" does not resolve the ambiguity; we still don't know the units of the constant "3."
Writing "l = 3t where l is in meters and t is in seconds" is unambiguous but awkward for complex expressions.

✅ Better notation practices

The excerpt recommends two approaches:

Attach units to the constant directly:
- Write: l = (3 m/s) t
- This makes it clear that the constant has units of m/s.
Separate the constant and its units:
- Write: l = a t where a = 3 m/s
- This separates the issue of units from the more important fact that l is proportional to t, and the constant of proportionality (a) is known.

Don't confuse: attaching units only to variables (e.g., "l = 3t m/s") leaves the constant's units ambiguous.

🔧 SI fundamentals and derived units

🔧 Seven fundamental SI units

SI defines seven base units from which all others are derived:

Quantity	Unit	Abbreviation
Distance	meter	m
Time	second	s
Current	ampere	A
Mass	kilogram	kg
Temperature	kelvin	K
Particle count	mole	mol
Luminosity	candela	cd

🔧 Derived electromagnetic units

Electromagnetic quantities are derived from the fundamental units.

Example: coulomb (C) for charge and volt (V) for electric potential are derived units.
The excerpt notes that in other systems (e.g., CGS), some physical constants become unitless, highlighting the importance of stating which system is in use.

🛠️ Units as an error-checking tool

🛠️ Dimensional correctness

A frequently-overlooked feature: units help verify mathematical expressions.

Example: electric field intensity is specified in volts per meter (V/m).
An expression for electric field that yields V/m is "dimensionally correct" (though not necessarily correct in all other respects).
An expression that cannot be reduced to V/m cannot be correct.
This provides a quick sanity check: if the units don't match the expected quantity, the formula is wrong.

Notation

1.2 Notation

🧭 Overview

🧠 One-sentence thesis

This section establishes a consistent symbolic notation system for vectors, coordinates, integrals, and approximations that will be used throughout the book to represent electromagnetic quantities and mathematical operations.

📌 Key points (3–5)

Vectors vs scalars: boldface indicates vectors (e.g., E); non-boldface indicates scalars; unit vectors use a circumflex (e.g., x-hat).
Three coordinate systems: position can be expressed in Cartesian (x, y, z), cylindrical (ρ, φ, z), or spherical (r, θ, φ) coordinates.
Phasors and geometry: tildes mark phasor quantities; script letters denote curves, surfaces, and volumes.
Common confusion—approximation symbols: "approximately equal" has two meanings: "∼=" for rounding precision vs "≈" for inherent inequality even with exact precision.
Integration notation: subscripts indicate what is being integrated over; a circle on the integral sign marks closed curves or surfaces.

📐 Vector and scalar notation

📐 How vectors are marked

Vector: Boldface indicates a vector quantity (e.g., E for electric field intensity).

Quantities not in boldface are scalars (single numbers without direction).
When handwriting, common alternatives are "E" with an arrow above or "E" with an underline.

🎯 Unit vectors

Unit vector: A vector with magnitude equal to one, indicated by a circumflex (ˆ).

Example: x-hat (ˆx) points in the +x direction with length 1.
Spoken aloud as "x hat."
Used to build position vectors in any coordinate system.

🗺️ Position and coordinate systems

🗺️ Three ways to specify position

Coordinate system	Symbols	Description
Cartesian	(x, y, z)	Rectangular grid
Cylindrical	(ρ, φ, z)	Radial distance ρ, angle φ, height z
Spherical	(r, θ, φ)	Radial distance r, polar angle θ, azimuthal angle φ

Coordinate-independent position: the symbol r can represent position without choosing a system.
Example: in Cartesian, r = x-hat·x + y-hat·y + z-hat·z.
Don't confuse: some sources use r for cylindrical radial coordinate (instead of ρ) or R for spherical radial (instead of r).

⏱️ Time

The symbol t indicates time throughout the book.

🌊 Phasors and geometric entities

🌊 Phasor notation

Phasor: A tilde indicates a phasor quantity (e.g., Ṽ for voltage phasor, Ẽ for electric field phasor).

Phasors are complex-number representations used in time-harmonic analysis.

🔲 Curves, surfaces, and volumes

Script letters denote geometric entities:
- C for a curve
- S for a surface
- V for a volume (often the volume enclosed by a closed surface S)
Example: an open surface might be S, and the curve bounding it is C.

∫ Integration notation

∫ Open integrals

A single integral sign with subscript indicates integration over that entity:
- ∫_C ... dl = integral over curve C
- ∫_S ... ds = integral over surface S
- ∫_V ... dv = integral over volume V

∮ Closed integrals

Closed curve: one that forms an unbroken loop (e.g., a circle).
Closed surface: one that encloses a volume with no openings (e.g., a sphere).

A circle superimposed on the integral sign marks closed paths:
- ∮_C ... dl = integral over closed curve C
- ∮_S ... ds = integral over closed surface S

≈ Approximation and equality symbols

≈ Three levels of "approximately equal"

Symbol	Meaning	When to use	Example from excerpt
∼=	Approximately equal (rounding)	Equality exists but not expressed with full precision	π ∼= 3.14
≈	Approximately equal (inherent)	Quantities are unequal even with exact precision	e^x ≈ 1 + x for x ≪ 1; e^0.1 ≈ 1.1 vs actual e^0.1 ∼= 1.1052
∼	On the order of	Within a factor of 10 or so	μ ∼ 10^5 for iron alloys (exact values vary by factor of ~5)

Common confusion: "∼=" is for rounding (the true value exists but is truncated), while "≈" is for approximations that are fundamentally unequal (e.g., truncating an infinite series).

≡ Definition symbol

The symbol "≡" means "is defined as" or "is equal as the result of a definition."

🔢 Other notation

🔢 Complex numbers

j = √(−1) (the imaginary unit).
Used in phasor representations and complex analysis.

🔢 Physical constants

The excerpt notes that Appendix C contains notation for commonly-used physical constants.
(No further detail is provided in this excerpt.)

Coordinate Systems

1.3 Coordinate Systems

🧭 Overview

🧠 One-sentence thesis

The three coordinate systems most commonly used in engineering analysis—Cartesian, cylindrical, and spherical—provide different frameworks for describing positions in three-dimensional space, with some variation in notation conventions.

📌 Key points (3–5)

Three main systems: Cartesian, cylindrical, and spherical coordinate systems are the most common in engineering.
Notation varies: variable names are not universal; for example, r may replace ρ in cylindrical systems, and R may replace r in spherical systems.
Each system uses different coordinates: Cartesian uses (x, y, z); cylindrical uses (ρ, φ, z); spherical uses (r, θ, φ).
Common confusion: the radial coordinate notation differs between cylindrical and spherical systems and is not standardized across sources.
Visual representation: figures illustrate how each coordinate system maps three-dimensional space.

📐 The three coordinate systems

📐 Cartesian coordinate system

The Cartesian coordinate system uses three perpendicular axes to describe position.

Uses three coordinates: x, y, and z.
Each coordinate represents distance along one of three mutually perpendicular axes.
This is the most familiar system, corresponding to standard rectangular coordinates.
Example: a point might be located at x = 2, y = 3, z = 1, meaning 2 units along the x-axis, 3 along y, and 1 along z.

🔄 Cylindrical coordinate system

The cylindrical coordinate system describes position using a radial distance from a central axis, an angle around that axis, and a height along the axis.

Uses three coordinates: ρ (radial distance), φ (angle), and z (height).
The radial coordinate ρ measures distance from the z-axis in the x-y plane.
The angular coordinate φ specifies rotation around the z-axis.
The z coordinate is the same as in Cartesian coordinates.
Notation warning: it is common to see r used instead of ρ for the radial coordinate.

🌐 Spherical coordinate system

The spherical coordinate system describes position using a radial distance from the origin and two angles.

Uses three coordinates: r (radial distance), θ (polar angle), and φ (azimuthal angle).
The radial coordinate r measures distance from the origin.
The angle θ typically measures the angle from the z-axis (polar angle).
The angle φ measures rotation around the z-axis (azimuthal angle).
Notation warning: it is common to see R used instead of r for the radial coordinate.

⚠️ Notation variations and conventions

⚠️ Non-universal variable usage

The excerpt explicitly states that "the use of variables is not universal," meaning different sources may use different symbols for the same coordinate.

System	Standard notation (excerpt)	Common alternative	What it represents
Cylindrical	ρ	r	Radial distance from z-axis
Spherical	r	R	Radial distance from origin

Don't confuse: the symbol r can mean different things depending on whether you're working in cylindrical or spherical coordinates.
When reading other sources, check which notation convention is being used.
The angular coordinates (φ and θ) are more standardized but still require attention to definition.

📚 Additional resources

The excerpt points to Wikipedia articles for more detailed information:

"Cylindrical coordinate system" on Wikipedia
"Spherical coordinate system" on Wikipedia

These resources can provide additional context and examples beyond what is covered in this section.

Electromagnetic Field Theory: A Review

1.4 Electromagnetic Field Theory: A Review

🧭 Overview

🧠 One-sentence thesis

Electromagnetic field theory unifies electric and magnetic phenomena through Maxwell's equations, showing that time-varying electric and magnetic fields are coupled and give rise to wave propagation.

📌 Key points (3–5)

Static vs. time-varying fields: In electrostatics and magnetostatics, electric and magnetic fields are independent; when fields vary with time, they become coupled through Faraday's law and displacement current.
Two interpretations of each field: Electric fields can be described as intensity (energy, E) or flux density (D); magnetic fields similarly have intensity (H) and flux density (B).
Maxwell's equations govern all cases: Four equations (Gauss's laws for E and B, Faraday's law, Ampere's law) describe field behavior in both static and dynamic scenarios, with additional terms appearing in the time-varying case.
Common confusion—field coupling: Static fields are independent, but time-varying magnetic flux induces electric fields (Faraday) and time-varying electric flux induces magnetic fields (displacement current).
Wave solutions emerge from coupling: In source-free, lossless media, the coupled time-varying equations reduce to wave equations, leading to uniform plane wave solutions with characteristic velocity, wavelength, and impedance.

⚡ Electric Fields and Electrostatics

⚡ Charge and current fundamentals

Charge is the ultimate source of electric fields, measured in coulombs (C).
The electron is defined to have negative charge.
Charge distributions can be described as:
- Line charge density (C/m)
- Surface charge density (C/m²)
- Volume charge density (C/m³)
Electric current describes net motion of charge, measured in amperes (A), and can be expressed as surface current density (A/m) or volume current density (A/m²).

🔋 Electric field intensity (E)

Electric field intensity E: the energy interpretation of the electric field, with units of N/C or V/m, related to energy associated with charge and forces between charges.

Relationship to potential: The electric potential V over a path C is given by the negative line integral of E along that path.
Conservative property: The electrostatic field is conservative, meaning the line integral of E around any closed loop equals zero (Kirchhoff's voltage law for electrostatics).
Gradient relationship: E points in the direction where potential V decreases most rapidly, with magnitude equal to the rate of change: E equals negative gradient of V.

🌊 Electric flux density (D)

Electric flux density D: the flux interpretation of the electric field, with units of C/m², quantifying the effect of charge as a flow emanating from the charge.

Gauss's law for electric fields: The electric flux through a closed surface equals the enclosed charge.
Material relationship: Within a material, D equals permittivity (ε) times E.
Permittivity values:
- Free space: ε₀ = 8.854 × 10⁻¹² F/m
- Materials often described by relative permittivity εᵣ = ε/ε₀ (unitless)

🔀 Boundary conditions for electric fields

At the boundary between two material regions (Region 1 and Region 2):

Tangential component of E: continuous across the boundary (no discontinuity).
Normal component of D: any discontinuity must be supported by surface charge density on the boundary.
Don't confuse: E and D have different boundary behaviors—E's tangential part is continuous, D's normal part may jump.

🧲 Magnetic Fields and Magnetostatics

🧲 Magnetic flux density (B)

Magnetic flux density B: the flux interpretation of the magnetic field, with units of Wb/m², quantifying the field as a flow associated with, but not emanating from, the source.

Gauss's law for magnetic fields: The magnetic flux through any closed surface is zero.
Key implication: There is no "magnetic charge" analogous to electric charge; magnetic field sources cannot be localized.
Field line behavior: Magnetic field lines form closed loops (they do not start or end at points).
Magnetic flux Φ: measured through a specified surface, with units of Wb.

🔧 Magnetic field intensity (H)

Magnetic field intensity H: the energy interpretation of the magnetic field, with units of A/m, related to energy associated with sources of the magnetic field.

Ampere's law for magnetostatics: The line integral of H around a closed path equals the enclosed current.
Material relationship: Within a homogeneous material, B equals permeability (μ) times H.
Permeability values:
- Free space: μ₀ = 4π × 10⁻⁷ H/m
- Materials often described by relative permeability μᵣ = μ/μ₀ (unitless)

🔀 Boundary conditions for magnetic fields

At the boundary between two material regions:

Normal component of B: continuous across the boundary (no discontinuity).
Tangential component of H: any discontinuity must be supported by surface current density on the boundary.
Don't confuse: B and H have opposite boundary behaviors compared to D and E—B's normal part is continuous, H's tangential part may jump.

🔄 Time-Varying Fields and Coupling

⚡ Faraday's law and the Maxwell-Faraday equation

Faraday's law: A time-varying magnetic flux induces an electric potential in a closed loop: V equals negative time derivative of magnetic flux Φ.
Maxwell-Faraday equation (integral form): The line integral of E around a closed path equals the negative time derivative of magnetic flux through the surface bounded by that path.
Key insight: Time-varying magnetic flux generates an electric field.
Coupling emerges: Electric and magnetic fields become coupled when magnetic flux varies with time.

🔁 Displacement current and generalized Ampere's law

Generalized Ampere's law: The line integral of H around a closed path equals the enclosed current plus the time derivative of electric flux through the surface.
Displacement current: The new term (time derivative of D) allows time-varying electric flux to be a source of magnetic field.
Reciprocal coupling: Just as time-varying B creates E, time-varying D creates H.

📊 Static vs. time-varying comparison

Aspect	Static Fields	Time-Varying Fields
Field independence	Electric and magnetic fields are independent	Fields are coupled
Faraday's law	Line integral of E around closed loop = 0	Line integral of E = negative time derivative of B flux
Ampere's law	Line integral of H = enclosed current only	Line integral of H = enclosed current + time derivative of D flux
Gauss's laws	Same (unchanged)	Same (unchanged)
Boundary conditions	Same (unchanged)	Same (unchanged)

🌊 Wave Phenomena

🎵 Time-harmonic fields and phasors

Time-harmonic fields: Fields that vary sinusoidally with time.
Phasor representation: Complex-valued quantities (indicated by tilde ~) representing magnitude and phase of sinusoidal waveforms.
Frequency relationship: ω = 2πf, where f is frequency in Hz.
Maxwell's equations in phasor form: Differential equations with time derivatives replaced by multiplication by jω.

🌐 Wave equations in source-free, lossless media

In regions free of sources (no charges or currents) and with no losses (conductivity σ = 0):

Simplified Maxwell equations: Reduce to four equations relating E and H through permittivity ε and permeability μ.
Vector wave equations: Both E and H satisfy the same form: second spatial derivative plus β² times the field equals zero.
Phase constant β: Equals ω times square root of (μ times ε).
Wave solutions: Solutions to these equations represent electromagnetic waves propagating through the medium.

✈️ Uniform plane waves

Uniform plane waves: Solutions to the wave equations constrained to exhibit constant magnitude and phase in a plane.

Example structure: For a plane perpendicular to z-direction, E has only x and y components, each consisting of forward-traveling (e to the negative jβz) and backward-traveling (e to the positive jβz) terms.
Phase velocity: vₚ = ω/β = 1/√(με), the speed at which wave crests propagate.
Wavelength: λ = 2π/β, the spatial period of the wave.
Mutual perpendicularity: E, H, and propagation direction are mutually perpendicular.

🔌 Plane wave relationships and impedance

Wave impedance η: Equals square root of (μ/ε), also called intrinsic impedance of the medium, with units of ohms.
Field relationships: E equals negative η times (propagation direction cross H); H equals (1/η) times (propagation direction cross E).
Propagation direction: Points in the same direction as E cross H.
Power density: S = |E|²/(2η) in W/m², assuming E is in peak units.
Example: In a uniform plane wave, if you know E and the propagation direction, you can immediately find H using the wave impedance.

🔧 Material Properties and Conductivity

⚡ Ohm's law for electromagnetics

Conductivity σ: A material property (units S/m) quantifying how easily electrons move in response to an electric field.

Ohm's law: Volume current density J equals conductivity σ times electric field intensity E.
Physical mechanism: In some materials, loosely-bound electrons can be moved by electric fields, creating current.
Range of values: From negligible (insulators) to very large (good conductors, including most metals).

🎯 Perfect conductors

Perfect conductor: A material within which E is essentially zero regardless of current density J; conductivity σ approaches infinity.

Equipotential property: Perfect conductors are equipotential regions—the potential difference between any two points within is zero.
Verification: This follows from the relationship between potential and E (V equals negative line integral of E).
Don't confuse: A perfect conductor doesn't mean "no current"; it means "no electric field needed to support any current."

🧱 Common material assumptions

Four properties often assumed for constitutive parameters (ε, μ, σ):

Property	Definition	Implication
Homogeneity	Uniform over the space occupied	Constitutive parameters constant at all locations
Isotropy	Behaves the same regardless of orientation	Response independent of direction relative to sources/fields
Linearity	Properties don't depend on applied sources/fields	Superposition applies: response to multiple sources = sum of individual responses
Time-invariance	Properties don't vary with time	Material behavior is constant over time

Example: A homogeneous, isotropic, linear, time-invariant material allows straightforward application of Maxwell's equations with constant ε, μ, and σ throughout the region at all times.

Lorentz Force

2.1 Lorentz Force

🧭 Overview

🧠 One-sentence thesis

The Lorentz force describes the combined force that electric and magnetic fields exert on a charged particle, causing it to exhibit both rotational and translational motions.

📌 Key points (3–5)

What the Lorentz force is: the total force on a charged particle from both electric and magnetic fields combined.
Two separate contributions: electric field produces force in the field direction; magnetic field produces force perpendicular to both the particle's velocity and the field.
The magnetic component depends on motion: only moving charges experience magnetic force; stationary charges feel only electric force.
Common confusion: "Lorentz force" is not a new type of force—it is simply a concise term for the sum of electric and magnetic contributions.
Observable effects: charged particles in electromagnetic fields exhibit cyclotron (rotational) and drift (translational) motions.

⚡ The two components of force

⚡ Electric field contribution

The force experienced by a particle with charge q in the presence of electric field intensity E is: F_e = q E

The electric force points in the same direction as the electric field (for positive charge).
This force does not depend on whether the particle is moving or stationary.
Example: a stationary charged particle in an electric field will accelerate along the field lines.

🧲 Magnetic field contribution

The force experienced by a particle with charge q in the presence of magnetic flux density B is: F_m = q v × B, where v is the velocity of the particle.

The magnetic force is a cross product: it is perpendicular to both the velocity and the magnetic field.
Key difference from electric force: the particle must be moving to experience magnetic force.
The magnitude depends on both the speed of the particle and the angle between velocity and field.
Example: a particle moving parallel to the magnetic field experiences zero magnetic force; maximum force occurs when moving perpendicular to the field.

🔗 Combined Lorentz force

🔗 The total force equation

The Lorentz force experienced by the particle is simply the sum of these forces: F = F_e + F_m = q(E + v × B)

The term "Lorentz force" refers to the combined contributions, not a separate phenomenon.
Both components act simultaneously when both fields are present.
Don't confuse: this is not a new force law—it is the principle of superposition applied to electric and magnetic forces.

🎯 Common application

The Lorentz force concept is commonly used to analyze the motions of charged particles in electromagnetic fields.
The combined force causes charged particles to exhibit two types of motion:
- Cyclotron motion: rotational motion (circular or helical paths)
- Drift motion: translational motion (net displacement)

🌀 Particle motion patterns

🌀 Cyclotron motion

When a magnetic field is present (directed toward the viewer in the excerpt's example), charged particles move in circular paths.
The direction of rotation depends on the sign of the charge:
- Positive charge rotates one way
- Negative charge rotates the opposite way
Example: electrons in a magnetic field move in circles (as shown in the figure with electrons from an electron gun).

➡️ Drift motion

When both electric and magnetic fields are present, particles exhibit translational (drift) motion in addition to rotation.
The electric field component causes net displacement while the magnetic field causes the circular component.
The combined effect produces helical or drifting circular trajectories.

🔄 Sign-dependent behavior

Charge type	Magnetic field only	Electric + Magnetic fields
Positive	Circular motion (one direction)	Circular + drift motion
Negative	Circular motion (opposite direction)	Circular + drift motion

Don't confuse: the direction of cyclotron rotation reverses with charge sign, but both charge types can drift in similar patterns depending on field configuration.

Magnetic Force on a Current-Carrying Wire

2.2 Magnetic Force on a Current-Carrying Wire

🧭 Overview

🧠 One-sentence thesis

A magnetic field exerts a force on current-carrying wires because current consists of moving charged particles, and this force can cause translation in straight wire segments, zero net translation but possible rotation in closed loops, and attraction or repulsion between parallel wires depending on current direction.

📌 Key points (3–5)

Force on a wire segment: A straight current-carrying wire in a uniform magnetic field experiences a force perpendicular to both the current direction and the field.
Closed loops behave differently: A rigid closed loop in a uniform magnetic field experiences zero net translational force but may rotate, forming the basis for electric motors.
Parallel wires attract or repel: Two parallel wires carrying currents in the same direction attract each other; currents in opposite directions cause repulsion.
Common confusion: Zero net force on a closed loop does not mean no motion—the loop can still rotate even though it won't translate through space.
Force magnitude depends on geometry: The force per unit length between parallel wires is proportional to both currents and inversely proportional to their separation distance.

⚡ Force on wire segments

⚡ From particle force to wire force

The starting point is the force on a single charged particle moving in a magnetic field: force equals charge times velocity cross product with magnetic field.
For a differential charge element dq moving with velocity v, the differential force is dq times v cross B.
The key insight: charge times velocity (dq times v) equals current times differential length (I times dl).
- Units check: charge·(distance/time) = (charge/time)·distance = current·distance
This transforms the particle-level force into a wire-level force.

📏 Straight wire in uniform field

The force experienced by a straight segment of current-carrying wire in a spatially-uniform magnetic field is given by: force = I times length-vector cross B₀

I = current in amperes
length-vector = the wire segment length pointing in the current direction
B₀ = the uniform magnetic flux density
The force is perpendicular to both the current direction and the magnetic field direction.
Example: A straight wire segment carrying current through a uniform magnetic field will be pushed sideways, neither along the current nor along the field lines.

🔄 Closed loops in uniform fields

🔄 Zero net translational force

When a rigid closed loop of current sits in a uniform magnetic field:

The force on the entire loop is found by integrating the differential force around the complete path.
Since current I and field B₀ are constants, they can be pulled out of the integral.
The remaining integral of dl around a closed loop equals zero (you return to your starting point).
Result: The net force on a current-carrying loop in a uniform field is zero.

🔁 Rotation without translation

Don't confuse zero net force with no motion at all.
Forces on opposite sides of the loop can be equal in magnitude but opposite in direction.
These opposing forces create rotation rather than translation.
Example: One side of a rectangular loop is pushed up while the opposite side is pushed down—the loop spins but doesn't move through space.
This is the fundamental principle behind electric motors.

🧲 Parallel current-carrying wires

🧲 Setup and field calculation

The excerpt analyzes two infinitely long, parallel wires:

Both wires lie in the same plane, separated by distance d
Wire 1 carries current I₁; wire 2 carries current I₂
Wire 1 creates a magnetic field B₁ that affects wire 2
At the location of wire 2, the field from wire 1 has magnitude (μ₀I₁)/(2πd)

⬆️⬇️ Attraction versus repulsion

Current directions	Force behavior	Physical reason
Same direction (I₁I₂ positive)	Wires attract	Magnetic field from wire 1 pulls wire 2 toward it
Opposite directions (I₁I₂ negative)	Wires repel	Magnetic field from wire 1 pushes wire 2 away

The force per unit length has magnitude: (μ₀I₁I₂)/(2πd)
Larger currents or smaller separation → stronger force
The excerpt notes this force is usually small but becomes significant with large currents or many conductors bundled together (as in motors).

🔋 Practical example: DC power cable

The excerpt provides a worked example:

A 12 V battery connected to a 10 Ω load through a cable with 3 mm conductor separation
Current = 12 V / 10 Ω = 1.2 A
One conductor carries +1.2 A away from battery; the other carries −1.2 A back (opposite directions)
Force per unit length ≈ 96.0 μN (micronewtons), repulsive
This force is quite small, explaining why it's not commonly noticed in everyday cables.

💾 Energy storage connection

If wires are fixed in position and cannot move, these magnetic forces represent stored potential energy.
This is precisely the energy stored by an inductor.
Example: The two wire segments could be adjacent windings in a coil-shaped inductor.

Torque Induced by a Magnetic Field

2.3 Torque Induced by a Magnetic Field

🧭 Overview

🧠 One-sentence thesis

A magnetic field exerts perpendicular forces on current-carrying structures that create torque, causing rotation, which is the fundamental mechanism behind electric motors.

📌 Key points (3–5)

Why rotation occurs: magnetic fields exert forces perpendicular to current flow, and these forces act on different parts of a current loop at different distances from a pivot, creating torque.
What torque measures: the rotational force depends on the lever arm (distance from pivot), the applied force, and the angle between them; magnitude has units of N·m.
Static vs. sustained rotation: a simple current loop rotates one-quarter turn and stops; motors sustain rotation by periodically reversing current direction or magnetic field direction.
Common confusion: torque direction vs. rotation direction—torque vector points perpendicular to the plane of rotation (use right-hand rule), not in the direction the object spins.
Key dependencies: torque on a current loop is proportional to loop area, current magnitude, and magnetic field strength.

🔄 What torque is and how it works

🔄 Definition of torque

Torque T: the cross product of the lever arm d (position vector from origin to force application point) and the force F applied at that point: T = d × F.

The lever arm d = r − r₀ gives the location of the force application point r relative to the local origin r₀.
Torque is a position-free vector perpendicular to both d and F.
The magnitude has SI base units of N·m and quantifies the energy associated with rotational force.

🧭 Direction and the right-hand rule

Torque does not point in the direction of rotation.
Instead, use the right-hand rule: point your right thumb in the direction of T, and your curled fingers point in the direction of rotation.
Example: if torque points upward (+z direction), rotation occurs counterclockwise when viewed from above.

📏 Lever arm length matters

Torque magnitude increases with increasing lever arm magnitude |d|.
In other words, the same applied force produces more torque when applied farther from the pivot.
This explains why longer wrenches make it easier to turn bolts.

➕ Superposition principle

Torque satisfies superposition: the total torque from forces applied to multiple rigidly-connected lever arms is the sum of individual torques.
This principle allows calculating total torque on complex structures by summing contributions from each segment.

🔁 Current loop in a magnetic field

🔁 The basic setup

Consider a rigid rectangular current loop attached to a non-conducting shaft that can rotate freely without friction.
The loop has four straight segments (A, B, C, D) carrying current I.
A spatially-uniform, static impressed magnetic flux density B₀ = x̂ B₀ exists throughout the domain.
"Impressed" means the field exists independently, in the absence of any other structure.

⚡ Forces on each segment

The force on each segment is F = I l × B₀, where l is a vector with magnitude equal to segment length, pointing in the current direction.

Segment	Current direction	Force result	Why
A	+z direction (length L)	+y direction: ILB₀	Cross product of z and x gives y
C	−z direction (length L)	−y direction: ILB₀	Opposite current gives opposite force
B	−x direction (length L)	Zero	Cross product of x and x is zero
D	+x direction (length L)	Zero	Cross product of x and x is zero

Segments B and D experience zero force because their current flows parallel (or anti-parallel) to the magnetic field.
Segments A and C experience equal and opposite forces in the y direction.

🔧 Calculating total torque

Total torque T = T_A + T_B + T_C + T_D.
For segment A: T_A = (W/2) x̂ × F_A = ẑ (LW/2) IB₀, where W/2 is the lever arm distance.
For segment C: T_C = ẑ (LW/2) IB₀ (same direction, adds to total).
For segments B and D: T_B = T_D = 0 (because forces are zero).
Total torque: T = ẑ LW IB₀.

📐 What the torque depends on

The final torque formula shows three proportional relationships:

Loop area (LW): larger loops experience more torque.
Current magnitude (I): more current produces more torque.
Magnetic field strength (B₀): stronger fields produce more torque.

Don't confuse: torque points in the +z direction, but this indicates rotation in the +φ direction (around the z-axis), not motion along the z-axis.

🛑 Why the loop stops and how motors sustain rotation

🛑 Static analysis limitation

The analysis applies only at the instant shown in the figure (static analysis).
Forces F_A and F_C remain in +y and −y directions regardless of rotation.
As the loop rotates, forces F_B and F_D become non-zero but are always equal and opposite, so they don't affect rotation.

🔄 The quarter-turn problem

The loop rotates one-quarter turn and then comes to rest (possibly with damped oscillation).
At the rest position, the lever arms for segments A and C point in the same directions as their respective forces.
When lever arm and force are parallel, the cross product is zero: T_A = T_C = 0.
Once stopped, both net translational force and net torque are zero—no further motion occurs.

⚙️ Sustaining rotation for motors

To create a working motor, rotation must be sustained. Methods include:

Periodically reverse current direction:
- Reverse I as the loop passes the quarter-turn position.
- This reverses F_A and F_C, propelling the loop toward the half-turn position.
- Continue reversing current at each quarter-turn to sustain rotation.
Periodically reverse magnetic field direction:
- Change the direction of the impressed magnetic field B₀ to achieve the same effect.
Combine multiple loops and fields:
- Use multiple current loops or multiple time-varying impressed magnetic fields.
- With appropriate combinations of loops, fields, and waveforms, sustained torque can be achieved throughout the entire rotation cycle.

Example: The DC motor shown in Figure 2.6 uses brushes (motionless leads labeled "+" and "−") combined with shaft motion to periodically alternate current direction between two coils, creating nearly constant torque.

🔀 Common confusion: static vs. dynamic

Static analysis: calculates forces and torque at one instant; predicts initial motion.
Dynamic behavior: describes what happens over time as the structure moves.
Don't assume static torque continues unchanged—as the loop rotates, lever arm orientations change, affecting torque magnitude and direction.

The Biot-Savart Law

2.4 The Biot-Savart Law

🧭 Overview

🧠 One-sentence thesis

The Biot-Savart Law provides a direct method to calculate the magnetic field produced by any steady current distribution, especially when symmetry is insufficient to apply Ampere's law in integral form.

📌 Key points (3–5)

What BSL solves: the magnetic field from any DC current distribution, particularly when problems lack the symmetry needed for simple Ampere's law solutions.
Core mechanism: BSL calculates the magnetic field contribution from each differential current element and integrates over the entire current path.
Inverse square law: magnetic field strength decreases with the inverse square of distance from the source.
Direction insight: the magnetic field from a current element is perpendicular to both the current direction and the line from source to field point.
Common confusion: BSL applies only under magnetostatic (steady, DC) conditions; time-varying currents require more complex approaches (Jefimenko's equations).

🔧 When and why to use BSL

🔧 The symmetry problem

Ampere's law in integral form works well when problems exhibit high symmetry (e.g., infinitely long straight wire in cylindrical coordinates).
Many practical problems lack this symmetry—example: a single current loop.
For non-symmetric problems, the differential form of Ampere's law is needed, and BSL is the solution to that differential form.

🎯 What BSL provides

BSL solves for a differential-length current element and allows building up the total field by integration.

It eliminates the need to solve differential equations directly.
It gives physical insight into magnetic field behavior.

🧮 The mathematical structure

🧮 The differential current element

Current element: I times dl, where I is current magnitude (amperes) and dl is a differential-length vector indicating current direction at the source point.

The source point is labeled r′ (where the current is).
The field point is labeled r (where you want to know the magnetic field).
R is the vector pointing from source to field point: R = r − r′.

🧮 The field contribution formula

The magnetic field contribution dH at the field point from one current element is:

dH(r) = (I dl) / (4π R²) × (unit vector in R direction)
In words: current element divided by 4π times distance squared, crossed with the direction from source to field.

🧮 Integration for total field

To find the total magnetic field from a wire of any shape:

Integrate dH over the entire length of the wire.
H(r) = (I / 4π) times the integral over the path C of (dl × unit R) / R².

🔍 Physical insights from BSL

🔍 Inverse square law

Magnetic field magnitude decreases in proportion to R⁻² (one over distance squared).
This is the same distance dependence as Coulomb's law for electric fields.

🔍 Perpendicularity

The magnetic field direction from a differential current element is perpendicular to:

The direction of current flow.
The vector pointing from source to field point.

This cross-product relationship helps anticipate field directions in complex problems.

🔍 Analogy to Coulomb's law

BSL is analogous to Coulomb's law for electric fields.
Coulomb's law solves the differential form of Gauss's law.
BSL solves the differential form of Ampere's law.
Don't confuse: BSL applies only to magnetostatic (steady) conditions; time-varying fields require different equations.

🔄 Extensions to other current distributions

🔄 The concept of current moment

All current distributions can be expressed in terms of current moment, which has units of ampere·meter (A·m):

Current type	Replacement for I dl	Units
Line current	I dl	A·m
Surface current	J_s ds (surface current density × area element)	A·m
Volume current	J dv (volume current density × volume element)	A·m
Moving particle	q v (charge × velocity)	A·m

All these quantities represent the same physical concept: current moment.
BSL applies to all with the appropriate substitution.
Example: for surface current, replace I dl with J_s ds in the BSL formula.

🔄 Dimensional consistency

The excerpt emphasizes checking dimensional consistency:

C·m/s = (C/s)·m = A·m, confirming that charge times velocity has the same units as current times length.

📐 Worked example: circular current loop

📐 Problem setup

A ring of radius a in the z = 0 plane, centered at the origin.
Current I flows in the φ (azimuthal) direction.
Goal: find magnetic field intensity along the z axis.

📐 Solution approach

Express source position (on the ring) and field position (on z axis) in cylindrical coordinates.
Calculate R (vector from source to field point) and its magnitude.
Apply BSL formula for dH.
Integrate over the entire loop (φ from 0 to 2π).

📐 Key result

The magnetic field along the z axis is:

H = (unit z direction) × (I a²) / (2 [a² + z²]^(3/2))
Direction: consistent with the right-hand rule (fingers curl with current, thumb points in field direction).

📐 Symmetry argument

During integration, the ρ (radial) component integrates to zero:

For any angle φ, the radial unit vector at φ + π is opposite to that at φ.
These contributions cancel when summed around the loop.
This is an example of using symmetry to simplify calculations.

⚠️ Limitations and scope

⚠️ Magnetostatic conditions only

BSL applies only when currents and magnetic fields are steady (DC, not varying with time).
If variation over time is significant, the problem becomes much more complicated.
For time-varying cases, see Jefimenko's equations (mentioned in additional reading).

⚠️ When to use integral vs differential forms

Integral form of Ampere's law: best when high symmetry exists.
BSL (differential solution): best when symmetry is lacking or for arbitrary current distributions.

Force, Energy, and Potential Difference in a Magnetic Field

2.5 Force, Energy, and Potential Difference in a Magnetic Field

🧭 Overview

🧠 One-sentence thesis

A magnetic field exerts force on moving charged particles but does no work itself; instead, potential differences arise when particles move through the field due to other forces, leading to induced voltages that can drive currents in closed loops.

📌 Key points (3–5)

Magnetic force is perpendicular to motion: the force on a charged particle is always perpendicular to its velocity, so the magnetic field itself does no work.
Potential difference requires external motion: changes in potential energy occur only when particles are moved by mechanical forces or electric fields through a magnetic field.
Induced voltage formula: the potential difference between two points is calculated by integrating the cross product of velocity and magnetic field along the path.
Common confusion: the magnetic field induces voltage, not current—current is simply the response to that voltage once a closed circuit exists.
Connection to Faraday's law: the motional emf derived here is a special case of Faraday's law for time-varying magnetic flux.

⚡ Magnetic force fundamentals

⚡ Force on a moving charge

The force experienced by a particle at location r bearing charge q due to a magnetic field is: force equals q times velocity cross product with magnetic flux density B at r.

The force depends on three factors: charge magnitude, velocity (speed and direction), and the magnetic field strength.
Key constraint: the cross product means force and velocity are always perpendicular.
Example: a particle moving north through an eastward magnetic field experiences a force pointing up or down, never north.

🔄 Motion does not result from magnetic force

The excerpt emphasizes a counterintuitive point: the motion described by velocity v is not caused by the magnetic force.
Instead, the reverse is true: the particle's motion (from other sources) gives rise to the magnetic force.
The motion may come from an electric field or from the particle being part of a moving structure.

🔋 Work and potential energy

🔋 Why magnetic fields do no work

Work is calculated as force dot product with displacement in the direction of motion.
Since any velocity component due to the magnetic field is perpendicular to the magnetic force, the dot product equals zero.
Core principle stated in the excerpt:

In the absence of a mechanical force or an electric field, the potential energy of a charged particle remains constant regardless of how it is moved by the magnetic force. The magnetic field does no work.

🛤️ Potential energy changes from other forces

Changes in potential energy must be completely due to position changes from other forces (mechanical or Coulomb/electric).
The magnetic field merely increases or decreases the potential difference once the particle has moved.
Don't confuse: the magnetic field affects the potential difference, but doesn't provide the energy for motion.

📐 Calculating potential difference

📐 Incremental work formula

For a short distance where the magnetic force change is negligible: incremental work equals force dot product with unit direction vector times distance.
The dot product ensures only the force component parallel to motion contributes to energy.

🧮 Path integration for total work

For longer distances, sum contributions along the path as the cross product of velocity and magnetic field may vary.
The general formula integrates along path C: work equals charge times the line integral of (velocity cross B) dot product with differential length element.

⚡ Voltage definition

Electric potential difference V₂₁ between start point (1) and end point (2) is defined as the work done by traversing the path, per unit of charge.

Units: joules per coulomb, which equals volts.
Final formula: potential difference equals the line integral along C of (velocity cross B) dot product with differential length.

🧲 Practical examples

🧲 Straight wire moving through a field

The excerpt presents a wire parallel to the y-axis moving in the +z direction through a magnetic field pointing in the x direction:

The cross product yields a result pointing in the y direction with magnitude B times v.
Integrating along the wire length l gives: voltage equals B times v times l.
Endpoint 2 is at higher potential than endpoint 1 by this amount.
Key insight: this voltage exists even though the wire is perfectly conducting (no electric field) and mechanical forces are equal on both ends.

🔁 Closed loop requirement

For practical applications, a closed loop is needed so current can flow.
For a closed loop, use a closed line integral (integral around the complete path).
Additional requirement: the cross product of velocity and B must vary over the path—otherwise the integral equals zero.

🔌 Time-varying loop example

The excerpt describes a rectangular loop with three fixed sides and one moving "shorting bar":

Only the moving bar contributes to the integral (other sides have zero velocity).
Result: voltage equals B times v times l, same as the straight wire case.
This voltage drives a current of Bvl/R through the resistor.
Important distinction: the magnetic field induces the voltage, not the current; current is simply the response to the potential.

🔗 Connection to Faraday's law

🔗 Motional emf as a special case

Faraday's law states: voltage equals negative time derivative of magnetic flux, where flux is the surface integral of B dot product with differential surface area.

The motional emf formula derived in this section is a special case of Faraday's law.
It applies specifically to "motional emf"—voltage induced by motion through a magnetic field.
The time-varying loop example can also be solved using Faraday's law by treating the surface bounded by the loop as time-varying.

🔄 Two perspectives on the same phenomenon

Approach	What varies	Formula emphasis
Motional emf (this section)	Position/velocity of conductor	Velocity cross B integrated along path
Faraday's law	Magnetic flux through surface	Time rate of change of flux

Both give the same result for moving conductors in magnetic fields.
The motional emf perspective emphasizes the mechanical motion; Faraday's law emphasizes flux change.

Poynting's Theorem

3.1 Poynting’s Theorem

🧭 Overview

🧠 One-sentence thesis

Poynting's theorem expresses conservation of energy by stating that electromagnetic power entering a region must be either dissipated as heat, stored in electric fields, or stored in magnetic fields—the only three possibilities allowed by classical physics.

📌 Key points (3–5)

Four ways to manipulate EM energy: transferred (waves/transmission lines), stored in electric fields (capacitance), stored in magnetic fields (inductance), or dissipated as heat (resistance).
The theorem's equation: net power flowing into a volume equals power dissipated plus power stored in electric fields plus power stored in magnetic fields.
The Poynting vector: E × H quantifies spatial power density (W/m²) and direction of electromagnetic wave propagation.
Common confusion: P_net,in = 0 does not mean no power flows; it means inward and outward power flows balance (net is zero).
Why it matters: the theorem is derived directly from Maxwell's equations, so it represents all possibilities for electromagnetic energy manipulation in classical physics.

⚡ The Four Fundamental Possibilities

⚡ How electromagnetic energy can be manipulated

Despite electromagnetic theory's apparent complexity, there are only four ways to manipulate electromagnetic energy:

Transferred: conveyed by transmission lines or in waves
Stored in electric fields: associated with capacitance
Stored in magnetic fields: associated with inductance
Dissipated: converted to heat (resistance)

🔄 Conservation principle

Poynting's theorem: an expression of conservation of energy that elegantly relates these various possibilities.

The theorem works in both directions
Power flowing out must have originated from active sources or released from stored field energy
Since derived directly from Maxwell's equations, these are all the possibilities allowed by classical physics

🧮 The Mathematical Statement

🧮 The core equation

The theorem states:

P_net,in = P_E + P_M + P_Ω

Where for a volume V:

P_net,in: net power flow into V
P_E: power associated with energy storage in electric fields within V
P_M: power associated with energy storage in magnetic fields within V
P_Ω: power dissipated (converted to heat) in V

📋 Requirements

The constituents of V must be:

Linear: D = ε E where permittivity ε is constant with respect to time
Time-invariant: material properties don't change over time
Note: ε may vary with position, just not with time

🔥 Power Dissipation (Ohmic Loss)

🔥 Joule's law

Joule's law: gives the power dissipated due to finite conductivity of material.

P_Ω = integral over V of (E · J) dv

Can be rewritten using Ohm's law (J = σ E):

P_Ω = integral over V of σ |E|² dv

Where σ is conductivity (S/m).

🔥 The mechanism

Electric field converts into conduction current
Conduction current converts into heat
Commonly known as ohmic loss or joule heating
Units check: E (V/m) times J (A/m²) yields W/m³ (power density); integration over volume yields W (power)

🔋 Energy Storage in Fields

🔋 Electric field storage

P_E = (1/2) × (∂/∂t) × integral over V of (E · D) dv

Can be rewritten as:

P_E = ∂/∂t of W_e

Where W_e = (1/2) × integral over V of ε |E|² dv is the energy stored in the electric field within V.

ε is permittivity (F/m)
This represents the time rate of change of stored electric field energy

🔋 Magnetic field storage

P_M = (1/2) × (∂/∂t) × integral over V of (H · B) dv

Can be rewritten as:

P_M = ∂/∂t of W_m

Where W_m = (1/2) × integral over V of μ |H|² dv is the energy stored in the magnetic field within V.

μ is permeability (H/m)
This represents the time rate of change of stored magnetic field energy

🌊 The Poynting Vector

🌊 Definition and meaning

Poynting vector: S = E × H, which quantifies the spatial power density (SI base units of W/m²) of an electromagnetic wave and the direction in which it propagates.

P_net,in = negative integral over surface S of (E × H) · ds

Where:

S is the closed surface bounding volume V
ds is the outward-facing normal to the surface
The negative sign indicates power flow into V through S

🌊 Net vs. total flow

Don't confuse: P_net,in represents net flux, not total flux.

P_net,in does not imply power is entirely inward or outward
It represents the sum of inward and outward flowing power
Example: If P_net,in = 0, power still flows—inward and outward flows simply balance

🌊 Verification with plane waves

The excerpt demonstrates the Poynting vector interpretation using a uniform plane wave incident on a lossless cylindrical region:

In a lossless region: P_Ω = 0 (no dissipation)
With no other fields: W_e = W_m = 0
Therefore: P_net,in = 0
This means: P_in - P_out = 0 (inward power equals outward power)
Power flows through the cylinder but is not absorbed

📐 Derivation Highlights

📐 Starting points

The derivation begins with:

Ampere's law (differential form): ∇ × H = J + (∂/∂t) D
Maxwell-Faraday equation: ∇ × E = -(∂/∂t) B
Vector identity: ∇ · (F × G) = G · (∇ × F) - F · (∇ × G)

📐 Key mathematical relationships

Two preliminary results for linear, time-invariant materials:

(∂/∂t)(E · D) = 2 E · (∂/∂t) D
(∂/∂t)(H · B) = 2 H · (∂/∂t) B

These expressions have units of power and are essential for connecting field time derivatives to energy storage rates.

📐 Final transformation

The divergence theorem converts the volume integral into a surface integral:

Integral over V of ∇ · (E × H) dv = surface integral over S of (E × H) · ds

This transformation connects power flow through the boundary surface to the volume's internal energy changes.

Poynting Vector

3.2 Poynting Vector

🧭 Overview

🧠 One-sentence thesis

The Poynting vector E × H represents both the magnitude of spatial power density and the direction of electromagnetic power flow, as confirmed by applying Poynting's theorem to a lossless region.

📌 Key points (3–5)

What the Poynting vector is: S ≡ E × H, with SI base units of W/m².
What it represents: spatial power density (magnitude) and direction of power flow (direction).
How it is confirmed: by applying Poynting's theorem to a uniform plane wave in a lossless cylindrical region, showing that power in equals power out.
Common confusion: "net power in = 0" does not mean no power flows; it means power flowing in equals power flowing out.
Naming coincidence: the Poynting vector is named after physicist J.H. Poynting; the fact that it also "points" in the direction of power flow is a remarkable coincidence.

🔬 Applying Poynting's theorem to a lossless region

🔬 The setup

A uniform plane wave propagates along direction k̂ into a homogeneous, lossless cylindrical region V bounded by closed surface S.
The cylinder axis is aligned with k̂; the two flat ends are perpendicular to k̂.
The surface S consists of three parts: left end S₁, curved side S₂, and right end S₃.

⚡ Poynting's theorem in this scenario

Poynting's theorem: P_net,in = P_Ω + ∂W_e/∂t + ∂W_m/∂t

Because the region is lossless, P_Ω = 0 (no power dissipation).
Presuming no other fields, stored electric and magnetic energies W_e and W_m are also zero.
Therefore, P_net,in = 0.

🔄 What "net power in = 0" actually means

It does not mean no power flows into V.
It means the net power is zero: power in equals power out.
Rewritten: P_net,in = P_in − P_out = 0.
Don't confuse: zero net flow with zero total flow; power is flowing through, but in balance.

🧮 Deriving the interpretation

🧮 Surface integral breakdown

P_net,in is expressed as: −∮_S (E × H) · ds = 0.
Breaking the surface into three parts:
- Left end S₁: ds = −k̂ ds (outward-facing).
- Curved side S₂: ds is perpendicular to E × H (perpendicular to k̂), so the integral is zero.
- Right end S₃: ds = +k̂ ds (outward-facing).

🎯 Matching power flow

The equation simplifies to: ∫{S₁} (E × H) · k̂ ds − ∫{S₃} (E × H) · k̂ ds = 0.
Comparing with P_in − P_out = 0:
- The first integral (over S₁) must be P_in.
- The second integral (over S₃) must be P_out.
Since k̂ is the direction in which E × H points, the magnitude and direction of E × H correspond to spatial power density and direction of power flow.

📐 Final result

P_in = ∫_{S₁} (E × H) · k̂ ds

This confirms that the Poynting vector S = E × H represents:
- Magnitude: spatial power density (W/m²).
- Direction: direction of power flow.

🏷️ Naming and historical note

🏷️ Origin of the name

The Poynting vector is named after English physicist J.H. Poynting, one of the co-discoverers of the concept.
The fact that this vector also "points" in the direction of power flow is simply a remarkable coincidence.
Don't confuse: the name comes from the person, not the verb "pointing."

Wave Equations for Lossy Regions

3.3 Wave Equations for Lossy Regions

🧭 Overview

🧠 One-sentence thesis

Wave equations for electromagnetic propagation in lossy media replace the real permittivity ε with a complex permittivity εc that accounts for energy conversion into current and heat, introducing both phase propagation (β) and attenuation (α) constants.

📌 Key points (3–5)

What "loss" means: conversion of wave energy into conduction current and subsequently into heat, described by Ohm's law with conductivity σ.
Complex permittivity εc: combines physical permittivity ε and conductivity σ into a single constant with real and imaginary parts, allowing lossy media to be treated with equations identical in form to lossless equations.
Propagation constant γ = α + jβ: replaces the purely real β² from lossless media; α (real part) represents attenuation, β (imaginary part) represents phase propagation.
Common confusion: in lossless media, σ = 0 so εc reduces to ε and α = 0, recovering the familiar β = ω√(με); in lossy media, both α and β depend on the ratio ε″/ε′.
Why it matters: these equations describe realistic wave behavior in materials with non-negligible conductivity and provide the foundation for understanding signal attenuation.

📐 Wave equations in lossless vs lossy media

📐 Lossless wave equations (starting point)

The lossless, source-free wave equations in differential phasor form are:

∇² Ẽ + ω²μεẼ = 0
∇² H̃ + ω²μεH̃ = 0

The constant ω²με is labeled β², where β is the phase propagation constant.

🔥 What "loss" means physically

Loss: the conversion of energy from the propagating wave into current, and subsequently to heat.

This mechanism is described by Ohm's law: J̃ = σẼ
σ is conductivity (SI base units of S/m or A/(V·m))
J̃ is conduction current density (SI base units of A/m²)
In lossless media, σ is zero (or negligible), so J is presumed zero
Don't confuse: loss is not about the wave "disappearing"; it is about energy conversion into heat via induced current

🔄 Upgrading to lossy equations

To account for loss, we cannot assume J = 0. The excerpt returns to Maxwell's equations in phasor form:

∇ · Ẽ = ρ̃v / ε
∇ × Ẽ = −jωμH̃
∇ · H̃ = 0
∇ × H̃ = J̃ + jωεẼ

In a source-free region, ρ̃v = 0, but J̃ may be non-zero if σ is non-zero.

🧩 Complex permittivity concept

🧩 Decomposing current contributions

The excerpt identifies two possible contributions to J̃:

J̃ = J̃imp + J̃ind
J̃imp: impressed sources of current (independent, like an independent current source in circuit theory)
J̃ind: current induced by loss (depends on the field)

In the absence of impressed sources:

J̃ = 0 + σẼ = σẼ

🔢 Defining complex permittivity

Substituting J̃ = σẼ into ∇ × H̃ = J̃ + jωεẼ:

∇ × H̃ = σẼ + jωεẼ
= (σ + jωε)Ẽ
= jωεcẼ

where the new constant εc is defined as:

Complex permittivity: εc ≡ ε − j(σ/ω)

In the lossless case, εc = ε (the imaginary part → 0)
The effect of material loss is represented as a non-zero imaginary component of permittivity

📝 Alternative notation

Complex permittivity is also commonly expressed as:

εc = ε′ − jε″

where ε′ and ε″ are real-valued constants. In this case:

ε′ = ε (physical permittivity)
ε″ = σ/ω

Why this notation is useful:

Some authors use ε to refer to both physical and complex permittivity; the "ε′ − jε″" notation mitigates confusion
Often more convenient to specify ε″ at a frequency than σ, which may also be frequency-dependent
The ratio ε″/ε′ is known as loss tangent (explained elsewhere)
Nonlinearity of permittivity can also be accommodated as an imaginary component; the "ε″" notation allows both effects (nonlinearity and conductivity) using common notation

In this section: focus remains exclusively on conductivity.

⚙️ Deriving the lossy wave equations

⚙️ Modified Maxwell equations

With complex permittivity, Maxwell's equations become:

∇ · Ẽ = 0
∇ × Ẽ = −jωμH̃
∇ · H̃ = 0
∇ × H̃ = jωεcẼ

These are identical to the lossless equations, except ε has been replaced by εc.

📊 Wave equations with γ²

Similarly, replacing ω²με in the lossless wave equations:

∇² Ẽ + ω²μεcẼ = 0
∇² H̃ + ω²μεcH̃ = 0

The excerpt defines a new constant:

γ² ≡ −ω²μεc

(Note the minus sign, which is customary and yields notational convenience later.)

The wave equations become:

∇² Ẽ − γ²Ẽ = 0
∇² H̃ − γ²H̃ = 0

Comparison: In the lossless case, ω²μεc → ω²με = β² as expected.

🔍 Understanding the propagation constant γ

🔍 Why γ is complex-valued

At first glance, γ = √(γ²). However:

Every number has two square roots
For β = √(β²) = ω√(με), there is no concern because β is positive by definition
γ² is complex-valued, so both possible values of √(γ²) are potentially complex
We lack clear guidance on which value is appropriate and physically relevant

🧮 Defining α and β

The excerpt proceeds by considering the special case γ = jγ″ (purely imaginary):

γ² = −(γ″)²
The wave equations become ∇² Ẽ + (γ″)²Ẽ = 0 and ∇² H̃ + (γ″)²H̃ = 0
Comparing to lossless equations, γ″ plays the exact same role as β

Formal definitions:

β ≡ Im{γ}: the phase propagation constant (imaginary part of γ)

α ≡ Re{γ}: the attenuation constant (real part of γ)

Such that:

γ = α + jβ

where α and β are real-valued constants.

Don't confuse: β in the lossy case is not equal to ω√(με); it simplifies to that value only in the lossless case.

🧪 Solving for α and β explicitly

Starting from γ² = (α + jβ)²:

γ² = α² − β² + j(2αβ)

Expanding γ² = −ω²μ(ε − jσ/ω):

γ² = −ω²με + jωμσ

Equating real and imaginary parts:

α² − β² = −ω²με
2αβ = ωμσ

Solving these simultaneous equations yields:

Constant	Formula
α	ω√[(με′/2)(√(1 + (ε″/ε′)²) − 1)]^(1/2)
β	ω√[(με′/2)(√(1 + (ε″/ε′)²) + 1)]^(1/2)

Verification in lossless case:

If σ = 0, then ε″ = 0
Equation for α yields α = 0
Equation for β yields β = ω√(με), as expected

📦 Summary statement

Electromagnetic wave equations for lossy media: Equations ∇² Ẽ − γ²Ẽ = 0 and ∇² H̃ − γ²H̃ = 0 with γ = α + jβ, where α and β are positive real-valued constants determined by the formulas above.

🔗 Connection to transmission line theory

🔗 Analogy with TEM transmission lines

The excerpt points out a useful analogy:

In the section "Wave Propagation on a TEM Transmission Line," potential and current along a transverse electromagnetic (TEM) transmission line satisfy the same wave equations developed here
They have the same complex-valued propagation constant γ = α + jβ
β has the same physical interpretation as the phase propagation constant
α has the same interpretation in both applications: the attenuation constant (explained in another section)

Example: A signal propagating along a lossy transmission line experiences both phase delay (governed by β) and amplitude decay (governed by α), just like a wave in a lossy medium.

Complex Permittivity

3.4 Complex Permittivity

🧭 Overview

🧠 One-sentence thesis

Complex permittivity generalizes the relationship between electric field and electric flux density to account for both delayed material response and conduction loss, unifying two physically distinct loss mechanisms into a single frequency-dependent parameter.

📌 Key points (3–5)

What complex permittivity models: both the delayed response of materials to changing electric fields and loss due to non-zero conductivity.
How it works: in the phasor domain, the relationship D = εE becomes D̃ = εc·Ẽ, where εc = ε′ − jε″ is complex-valued and frequency-dependent.
Two sources of the imaginary part: ε″ can arise from conduction loss (ε″ = σ/ω) or from delayed material response (frequency-dependent terms from higher-order time derivatives).
Common confusion: measured complex permittivity may include both delayed response and conduction loss, so ε″ from measurement cannot be assumed to represent only conduction loss.
Frequency dependence: the real and imaginary parts vary with frequency, often simply at low frequencies but becoming complex at higher frequencies (e.g., above 1 GHz for typical dielectrics).

🔄 From simple to complex permittivity

🔄 Simple permittivity in ideal media

Permittivity (ε): the constant relating electric field intensity E (V/m) to electric flux density D (C/m²) via D = εE (SI units: F/m).

In simple media, ε is a real positive constant that does not depend on how E changes with time.
The response D to a change in E is instantaneous and without delay.
This is an idealization; practical materials behave differently.

⏱️ Delayed response in practical materials

In real materials, the change in D in response to a change in E may depend on how E changes.
The response may not be instantaneous but may take time to fully manifest.
This is modeled by a generalized relationship:
- D = a₀E + a₁(∂E/∂t) + a₂(∂²E/∂t²) + a₃(∂³E/∂t³) + ...
- where a₀, a₁, a₂, ... are real-valued constants and the series is infinite.
In practice, higher-order terms diminish in importance; often only the first few terms are significant.
In many common materials, only the first term matters (a₀ ≈ ε, aₙ ≈ 0 for n ≥ 1), but this depends on frequency.

🧮 Phasor-domain formulation

🧮 Converting to phasors

In the phasor domain, time differentiation (∂/∂t) becomes multiplication by jω.
The generalized relationship becomes:
- D̃ = a₀Ẽ + a₁(jω)Ẽ + a₂(jω)²Ẽ + a₃(jω)³Ẽ + ...
- = a₀Ẽ + jωa₁Ẽ − ω²a₂Ẽ − jω³a₃Ẽ + ...
- = (a₀ + jωa₁ − ω²a₂ − jω³a₃ + ...)Ẽ
The factor in parentheses is a complex-valued number depending on frequency ω and material parameters a₀, a₁, ...

🔢 Complex permittivity definition

Complex permittivity (εc): a complex-valued, frequency-dependent constant such that D̃ = εcẼ.

Written as εc = ε′ − jε″, where both ε′ and ε″ are real-valued and frequency-dependent.
Real part (ε′): ε′ = a₀ − ω²a₂ + ... (even powers of ω, real coefficients).
Imaginary part (ε″): ε″ = −ωa₁ + ω³a₃ − ... (odd powers of ω, real coefficients).
This formulation captures the delayed response behavior of practical materials.

🔀 Two interpretations of complex permittivity

🔀 Delayed response vs conduction loss

The excerpt identifies two distinct physical origins for complex permittivity:

Origin	Real part (ε′)	Imaginary part (ε″)	Physical meaning
Delayed response	a₀ − ω²a₂ + ...	−ωa₁ + ω³a₃ − ...	Time lag in material polarization
Conduction loss	ε (simple permittivity)	σ/ω	Energy dissipation due to conductivity σ

Both use the same notation εc = ε′ − jε″.
In Section 3.3 (referenced), εc was defined to accommodate loss from non-zero conductivity σ, with ε′ = ε and ε″ = σ/ω.
In this section, εc accommodates delayed response, with ε′ and ε″ given by the frequency-dependent series above.

⚠️ Ambiguity in measured permittivity

Don't confuse: in practical work, it may not be clear which combination of effects a measured εc represents.
If εc is obtained by measurement, both delayed response and conduction loss may be present.
Therefore, a measured ε″ cannot be assumed to represent only conduction loss (i.e., equal to σ/ω).
The measurement may include a significant frequency-dependent contribution from delayed response behavior.
Example: a material with both non-zero conductivity and slow polarization will show ε″ contributions from both mechanisms.

📈 Frequency dependence

📈 Typical behavior

Complex permittivity is frequency-dependent for practical materials.
Figure 3.3 (referenced) shows an example for a typical dielectric material (a polymer):
- At frequencies below ~1 GHz: frequency dependence is simple and slowly-varying.
- At higher frequencies: the real and imaginary components become relatively complex.
This frequency dependence arises naturally from the series expansion (powers of ω in ε′ and ε″).

📈 Why frequency matters

At low frequencies, higher-order terms (ω²a₂, ω³a₃, ...) are negligible, so behavior is simpler.
At high frequencies, these terms become significant, leading to more complex frequency dependence.
The distinction between "simple" and "complex" permittivity often depends on the frequency range of interest.

Loss Tangent

3.5 Loss Tangent

🧭 Overview

🧠 One-sentence thesis

Loss tangent provides a single dimensionless ratio that quantifies how much energy is lost in a material, whether from conduction or delayed dielectric response, and it determines the relative magnitude of conduction current to displacement current.

📌 Key points (3–5)

What loss tangent measures: the ratio of conduction current to displacement current, expressed as tan δ = σ/(ωε) or equivalently tan δ = ε″/ε′.
How to interpret it: zero for lossless materials; increases with increasing loss; the angle δ represents the phase relationship between conduction and displacement currents.
Two physical origins: complex permittivity can arise from ohmic (conduction) loss or from delayed response of polarization to changing electric fields.
Common confusion: the mathematical expression tan δ = ε″/ε′ does not distinguish between ohmic loss and delayed response, even though their physical manifestations differ (e.g., heating vs. no heating).
Why it matters: loss tangent concisely quantifies the effect of loss on electromagnetic fields within materials.

🔌 The two currents in materials

⚡ Conduction vs. displacement current

The excerpt starts from Ampere's law in phasor form:

∇ × H̃ = J̃ + jω D̃

Conduction current J̃ = σ Ẽ: real-valued, represents charge flow.
Displacement current jω D̃ = jωε Ẽ: imaginary-valued, represents changing electric field.
The total current is the sum: ∇ × H̃ = σ Ẽ + jωε Ẽ.

📐 Geometric interpretation

In the phasor (complex) plane, conduction current lies on the real axis and displacement current lies on the imaginary axis.
The angle δ between the total current and the displacement current is defined by the ratio of these two components.
Example: if conduction current is small compared to displacement current, δ is small; if conduction dominates, δ approaches 90°.

🧮 Definition and formulas

📏 Loss tangent as a ratio

Loss tangent: tan δ = σ/(ωε), the ratio of conduction current magnitude to displacement current magnitude.

From the current perspective: tan δ = (magnitude of conduction current) / (magnitude of displacement current).
From the permittivity perspective: tan δ = ε″/ε′, where ε′ is the real part and ε″ is the imaginary part of complex permittivity εc = ε′ − jε″.
The two expressions are equivalent: σ/(ωε) = ε″/ε′.

🎯 What the value tells you

tan δ = 0: lossless material (σ = 0).
tan δ increases: more loss, meaning more energy dissipation.
The excerpt emphasizes that loss tangent "increases with increasing loss," making it a convenient single-number summary of material loss.

🔀 Two sources of complex permittivity

🔥 Ohmic (conduction) loss

When a material has non-zero conductivity σ, free charges move and dissipate energy as heat.
This makes the permittivity complex: ε″ = σ/ω.
Physical manifestation: a material with large loss tangent due to ohmic loss "might become hot when a large electric field is applied."

⏳ Delayed response

Permittivity can also be complex because the displacement field D responds with a delay to changes in the electric field E.
This delayed response is frequency-dependent and does not necessarily involve conduction.
Physical manifestation: a material with large loss tangent due to delayed response "might not" become hot.

⚠️ The key distinction and confusion

Aspect	Ohmic loss	Delayed response
Origin	Non-zero conductivity σ	Frequency-dependent polarization lag
Physical effect	Heating	May not heat
Mathematical form	Both contribute to ε″	Both contribute to ε″

Don't confuse: The formula tan δ = ε″/ε′ is the same regardless of the physical origin of ε″. The excerpt warns:

The expression for loss tangent given by Equation 3.81 and Figure 3.5 does not distinguish between ohmic loss and delayed response.

If you measure εc, both effects may be present.
You cannot assume ε″ represents only conduction loss (i.e., ε″ = σ/ω); it may include a significant frequency-dependent contribution from delayed response.
Example: a measured ε″ at high frequency might include both conduction and polarization delay, so using tan δ alone does not tell you which mechanism dominates.

📊 Practical implications

🔬 Measurement and interpretation

When complex permittivity εc is obtained by measurement, both delayed response and conduction loss may be represented.
The excerpt cautions: "it is not reasonable to assume a value of ε″ obtained by measurement represents only conduction loss."
Frequency dependence is important: below ~1 GHz, permittivity varies slowly; at higher frequencies, it becomes more complex.

🧪 Why loss tangent is useful

It provides "an alternative way to quantify the effect of loss on the electromagnetic field within a material."
It is dimensionless and concise, making it easier to compare materials.
It directly relates to the phase angle δ between conduction and displacement currents, which has geometric meaning in phasor analysis.

Plane Waves in Lossy Regions

3.6 Plane Waves in Lossy Regions

🧭 Overview

🧠 One-sentence thesis

Plane waves propagating through lossy materials exhibit exponential attenuation with distance and a complex-valued wave impedance that causes the electric and magnetic fields to be out of phase.

📌 Key points (3–5)

Wave equations in lossy media: the same form as lossless media, but the constant β² is replaced by the complex-valued constant −γ².
Attenuation mechanism: the real part (α) of the propagation constant γ = α + jβ causes exponential decay of wave magnitude with distance via the factor e^(−αz).
Phase propagation: the imaginary part (β) of γ determines phase variation with distance, just as in the lossless case.
Complex wave impedance: in lossy materials, wave impedance becomes complex-valued, meaning E and H are not in phase.
Common confusion: the wave impedance correction factor depends on loss tangent (ε″/ε′), which can arise from either ohmic loss or delayed material response—the math treats both identically, but physical manifestations differ (e.g., heating vs. no heating).

📐 Wave equations and solutions

📐 Starting equations for lossy regions

The electromagnetic wave equations for source-free regions with possibly-lossy material are:

∇²Ẽ − γ²Ẽ = 0
∇²H̃ − γ²H̃ = 0

where γ² = −ω²με_c (ε_c is complex permittivity accounting for loss).

These equations permit various wave geometries: plane waves, cylindrical waves, spherical waves.
The excerpt focuses on the special case of uniform plane waves (uniform magnitude and phase in planes of constant z).

🔄 Relationship to the lossless case

The derivation follows the same procedure as for lossless media.
The only difference: the real-valued constant +β² in the lossless case is replaced by the complex-valued constant −γ².
The solution is obtained by a simple modification of the lossless solution.

🧮 General plane wave solution

For a wave with uniform magnitude and phase in planes of constant z, the electric field is:

Ẽ = x̂Ẽ_x + ŷẼ_y

where:

Ẽ_x = E⁺_x0 e^(−γz) + E⁻_x0 e^(+γz)
Ẽ_y = E⁺_y0 e^(−γz) + E⁻_y0 e^(+γz)

The complex-valued coefficients (E⁺_x0, E⁻_x0, E⁺_y0, E⁻_y0) are determined by boundary conditions outside the region of interest.

The first term in each component represents a wave propagating in the +z direction.
The second term represents a wave propagating in the −z direction.
These expressions are identical to voltage and current in lossy transmission lines.

🌊 Attenuation and phase propagation

🌊 Decomposing the propagation constant

The propagation constant γ is written explicitly as:

γ = α + jβ

where:

α is the attenuation constant (positive real-valued, depends on ω and material properties).
β is the phase propagation constant (positive real-valued, depends on ω and material properties).

📉 Special case: x̂-polarized wave in +ẑ direction

Consider:

Ẽ = x̂ E⁺_x0 e^(−γz)

Substituting γ = α + jβ:

Ẽ = x̂ E⁺_x0 e^(−(α+jβ)z) = x̂ E⁺_x0 e^(−αz) e^(−jβz)

📉 Role of the attenuation constant α

The factor e^(−αz) is real-valued and describes how magnitude varies with distance.
Magnitude is reduced inverse-exponentially with increasing distance along the direction of propagation.
Example: as z increases, the wave amplitude decreases exponentially, so a wave traveling through lossy material becomes weaker.

The presence of loss in material gives rise to a real-valued factor e^(−αz) which describes the attenuation of the wave with propagation in the material.

🔁 Role of the phase propagation constant β

The factor e^(−jβz) determines the variation of phase with distance.
β plays precisely the same role as in the lossless case.
Phase propagation is independent of the attenuation mechanism.

🧲 Magnetic field and wave impedance

🧲 Plane wave relationships

The relationships between electric and magnetic fields apply exactly as in the lossless case:

H̃ = (1/η) k̂ × Ẽ
Ẽ = −η k̂ × H̃

where:

k̂ is the direction of propagation.
η is the wave impedance.

🔧 Wave impedance in lossy media

In the lossless case, η = √(μ/ε). In the possibly-lossy case, replace ε = ε′ with ε_c = ε′ − jε″:

η → η_c = √(μ/ε_c) = √(μ/(ε′ − jε″))

This can be rewritten as:

η_c = √(μ/ε′) · [1 − j(ε″/ε′)]^(−1/2)

The wave impedance for a lossy material equals √(μ/ε′) (the impedance if we neglected loss) times a correction factor that accounts for loss.
The correction factor is complex-valued.
Therefore, E and H are not in phase when propagating through lossy material.

🔧 Updated plane wave relationships

In the phasor domain:

H̃ = (1/η_c) k̂ × Ẽ
Ẽ = −η_c k̂ × H̃

with the complex-valued wave impedance η_c given above.

⚡ Wave power in lossy media

⚡ Poynting vector definitions

The instantaneous Poynting vector (applicable to any time-varying wave):

S = E × H (units: W/m², indicates power density and direction of power flow)

For sinusoidally-varying waves represented as phasors, the time-average Poynting vector is:

S_ave = (1/2) Re{Ẽ × H̃*}

where H̃* is the complex conjugate of H̃.

⚡ Lossless case (for comparison)

In a lossless medium, the time-average power density for a uniform plane wave is:

S_ave = |E₀|²/(2η) (lossless case)

where |E₀| is the peak magnitude of the electric field intensity phasor, and η is the wave impedance.

⚡ Lossy case setup

For a uniform plane wave in a possibly-lossy medium:

Ẽ = x̂ E₀ e^(−αz) e^(−jβz)
H̃ = ŷ (E₀/η_c) e^(−αz) e^(−jβz)

where:

α and β are the attenuation constant and phase propagation constant.
η_c is the complex-valued wave impedance.
These expressions describe a +x̂-polarized wave propagating in the +ẑ direction.

The excerpt notes these choices are made for convenience but does not complete the derivation of S_ave for the lossy case in the provided text.

🔍 Don't confuse: loss tangent origins

Recall from the preceding section that loss tangent (ε″/ε′) can arise from:

Ohmic (conduction) loss: material might become hot when a large electric field is applied.
Delayed response of polarization: material might not heat up.

The mathematical expression for wave impedance correction [1 − j(ε″/ε′)]^(−1/2) does not distinguish between these two physical mechanisms, even though their physical manifestations differ.

Wave Power in a Lossy Medium

3.7 Wave Power in a Lossy Medium

🧭 Overview

🧠 One-sentence thesis

In lossy media (materials with significant conductivity), wave power density decreases exponentially with distance and is scaled down by a factor that depends on the complex wave impedance, unlike lossless media where power density remains constant along the propagation path.

📌 Key points (3–5)

What this section addresses: power associated with waves in materials that have conductivity σ significantly greater than zero (lossy media).
Key difference from lossless case: power density now includes an exponential decay factor e^(−2αz) and a scaling factor involving the magnitude and phase of the complex wave impedance.
Common confusion: in lossless media the wave impedance η is real and power density is constant along z; in lossy media the wave impedance η_c is complex-valued, introducing both magnitude and phase corrections.
Why it matters: the time-average power density formula shows how conductivity reduces power as the wave propagates, proportional to the square of field magnitudes.

📐 Foundational concepts

📐 Poynting vector and time-average power

The Poynting vector S = E × H indicates the power density (W/m²) of a wave and the direction of power flow; this is "instantaneous" power applicable regardless of time variation.

For sinusoidally varying waves represented as phasors, the time-average Poynting vector is:
- S_ave = (1/2) Re{Ẽ × Ẽ*}
This expression applies to both lossless and lossy media; the difference lies in the field expressions and wave impedance.

🔁 Review: lossless case

In a lossless medium, the time-average power density is:
- S_ave = |E₀|² / (2η)
Here |E₀| is the peak magnitude of the electric field intensity phasor, and η is the (real-valued) wave impedance.
Power density is constant along the propagation direction (no decay).

🌊 Wave fields in lossy media

🌊 Electric and magnetic field phasors

The uniform plane wave in a possibly-lossy medium is expressed as:

Electric field: Ẽ = x̂ E₀ e^(−αz) e^(−jβz)
Magnetic field: H̃ = ŷ (E₀ / η_c) e^(−αz) e^(−jβz)

Where:

α: attenuation constant (accounts for loss)
β: phase propagation constant
η_c: complex-valued wave impedance

Example: a wave propagating in the +ẑ direction with +x̂ polarization; the exponential e^(−αz) causes the amplitude to decay with distance z.

🔍 Why complex wave impedance matters

In lossless media, η is real and has no phase.
In lossy media, η_c is complex: η_c = |η_c| e^(jψ_η), where ψ_η is the phase.
The complex conjugate and its reciprocal introduce both magnitude and phase corrections:
- (η_c*)^(−1) = |η_c|^(−1) e^(+jψ_η)
- Re{(η_c*)^(−1)} = |η_c|^(−1) cos ψ_η

🔋 Time-average power density in lossy media

🔋 Derivation from the Poynting vector

Starting from S_ave = (1/2) Re{Ẽ × H̃*}, substitute the field expressions:

S_ave = (1/2) ẑ Re{ |E₀|² / η_c* e^(−2αz) }
Separate the complex impedance: S_ave = ẑ (|E₀|² / 2) Re{1 / η_c*} e^(−2αz)
Use the real part: Re{1 / η_c*} = |η_c|^(−1) cos ψ_η

Final result:

S_ave = ẑ (|E₀|² / (2|η_c|)) e^(−2αz) cos ψ_η

📉 Physical interpretation

Factor	Meaning
\|E₀\|² / (2\|η_c\|)	Base power density (similar to lossless case but with \|η_c\| instead of η)
e^(−2αz)	Exponential decay with distance; power density proportional to (e^(−αz))², i.e., square of field amplitude decay
cos ψ_η	Phase correction due to complex impedance; this factor is < 1 when ψ_η ≠ 0

Don't confuse: the e^(−2αz) factor means power decays faster than field amplitude (which decays as e^(−αz)); power is proportional to the square of |E| or |H|.

✅ Consistency check with lossless case

When the medium is lossless:

α = 0 (no attenuation)
|η_c| = η (real-valued impedance)
ψ_η = 0 (no phase)

Then:

S_ave = ẑ (|E₀|² / (2η)) · 1 · 1 = |E₀|² / (2η)
This matches the lossless formula, confirming the derivation.

🔻 Effect of loss on power density

🔻 Two sources of reduction

Distance-dependent decay: the factor e^(−2αz) causes power density to drop exponentially as the wave propagates.
One-time scaling: the factor (|η| / |η_c|) cos ψ_η < 1 reduces power density relative to a lossless medium with the same field magnitude.

🔻 Combined effect

The reduction in power density due to non-zero conductivity is proportional to:
- A distance-dependent factor e^(−2αz)
- An additional factor that depends on the magnitude and phase of η_c
Example: at distance z, if α is large, e^(−2αz) becomes very small, meaning most of the wave's power has been absorbed by the lossy medium.

Decibel Scale for Power Ratio

3.8 Decibel Scale for Power Ratio

🧭 Overview

🧠 One-sentence thesis

The decibel (dB) scale transforms power ratios into a logarithmic form that simplifies calculations involving ratios spanning many orders of magnitude and converts multiplication/division into addition/subtraction.

📌 Key points (3–5)

What the dB scale does: converts power ratios from linear form (P₁/P₀) into logarithmic form (10 log₁₀ of the ratio), making large ratios easier to work with.
Sign convention: positive dB means output power exceeds input power (gain); negative dB means output power is less than input power (loss).
Common confusion: voltage ratios require 20 log₁₀ (not 10 log₁₀) only if the associated impedances are equal; otherwise the relationship breaks down.
Calculation advantage: the dB scale transforms division into subtraction and multiplication into addition, simplifying cascaded system analysis.
Absolute units: dBm and similar units express absolute power levels in dB relative to a reference (e.g., 1 mW for dBm).

📐 Core definition and formulas

📐 Linear vs logarithmic power ratio

Power gain G (linear units): G = P₁ / P₀

Power gain G (dB): G = 10 log₁₀(P₁ / P₀) dB

In linear units, a ratio greater than 1 means gain; less than 1 means loss.
In dB, a positive value means gain; a negative value means loss.
The "dB" notation denotes a unitless quantity expressed on the logarithmic decibel scale.

🔄 Gain vs loss

Power loss L: L = 1/G in linear units; L = −G when expressed in dB.

Engineers typically interpret a power ratio as "gain" when output power is expected to exceed input power (e.g., amplifiers).
Engineers typically interpret a power ratio as "loss" when output power is expected to be less than input power (e.g., lossy transmission lines).
Example: A cable with 2 W input and 10 μW output has G = 5 × 10⁻⁶ (linear) = −53.0 dB, so L = +53.0 dB.

🌊 Application to spatial power density

🌊 Power density ratios

The decibel scale applies identically to spatial power densities (units of W/m²):

Loss from S₀ to S₁: L = 10 log₁₀(S₀ / S₁) dB

The common units of m⁻² in numerator and denominator cancel, leaving a simple power ratio.
This works because the ratio of two power densities is dimensionless after cancellation.

⚡ Voltage and current ratios

⚡ The impedance-dependent trap

When impedances are equal (R₁ = R₀):

If P₁ = V₁² / R₁ and P₀ = V₀² / R₀, then:
G = 10 log₁₀(V₁² / V₀²) = 10 log₁₀[(V₁ / V₀)²] = 20 log₁₀(V₁ / V₀) dB
The factor of 20 (not 10) arises because power is proportional to voltage squared.

When impedances are not equal (R₁ ≠ R₀):

The relationship G = 20 log₁₀(V₁ / V₀) does not hold.
You must return to the power ratio definition: G = 10 log₁₀[(V₁² / R₁) / (V₀² / R₀)].

🔌 Voltage gain Gᵥ

Voltage gain Gᵥ: Gᵥ = 20 log₁₀(V₁ / V₀) dB (applies regardless of impedances)

Voltage gain Gᵥ is defined independently of impedance.
Gᵥ equals power gain G only if the associated impedances are equal.
Don't confuse: voltage gain and power gain are different quantities unless impedances match.

🧮 Calculation simplification

🧮 Division becomes subtraction

For a signal with power P₀ injected into a transmission line with loss L:

Linear units: P₁ = P₀ / L (division)
dB units: 10 log₁₀ P₁ = 10 log₁₀ P₀ − 10 log₁₀ L
Simplified form: P₁ = P₀ − L (dB) (subtraction)

This transformation makes cascaded calculations and visualization much easier.

Example: If P₀ = +20 dBm and L = 5 dB, then P₁ = 20 − 5 = +15 dBm.

📏 Absolute power units

📏 dBm and related units

When you take 10 log₁₀ of an absolute power (not a ratio), the result is in dB relative to the original power unit.

dBm: power expressed in dB relative to 1 mW

Power (linear)	Power (dBm)
1 mW	0 dBm
10 mW	+10 dBm
0.1 mW	−10 dBm

"0 dBm" means 0 dB relative to 1 mW, which is simply 1 mW.
Other reference units follow the same pattern (e.g., dBW for watts).

Attenuation Rate

3.9 Attenuation Rate

🧭 Overview

🧠 One-sentence thesis

Attenuation rate converts the attenuation constant into a decibel-per-unit-length measure that simplifies loss calculations for wave propagation over any distance.

📌 Key points (3–5)

What attenuation rate measures: loss in dB per unit length (e.g., dB/m) for waves traveling through a medium or transmission line.
How it relates to attenuation constant: attenuation rate ≈ 8.69 × α, where α is the attenuation constant (the real part of the propagation constant).
Why it's useful: it allows quick calculation of total loss by simple multiplication—attenuation rate (dB/m) × distance (m) = total loss (dB).
Common confusion: attenuation rate is not the same as the attenuation constant α; the conversion factor 8.69 comes from converting natural exponentials to the decibel scale.
Applies broadly: the concept works for both guided waves (transmission lines) and unguided waves (spatial power densities), because the power ratio calculation is the same.

📐 Deriving attenuation rate from power loss

📐 Power ratio along a transmission line

Consider a wave traveling in the +z direction on a transmission line.
Let P₀ be the power at z = 0, and P₁ be the power at z = l.
The power ratio (a loss, since it's greater than 1 for attenuation) is:
- P₀ / P₁ = e^(2αl) in linear units
- where α is the attenuation constant (the real part of the propagation constant γ = α + jβ).

🔢 Converting to decibels

Expressing the power ratio in dB:
- 10 log₁₀(P₀ / P₁) = 10 log₁₀(e^(2αl)) = 20αl log₁₀(e) ≈ 8.69αl dB
This is the total loss in dB over distance l.

📏 Defining attenuation rate

Attenuation rate: the loss in dB per unit length, obtained by dividing the total loss by the distance.

Dividing the total loss by l:
- attenuation rate ≈ 8.69α
Units: dB per unit length (e.g., dB/m if α is in m⁻¹).
Example: if α is expressed in m⁻¹, then attenuation rate has units of dB/m.

🧮 Using attenuation rate for practical calculations

🧮 Simple multiplication for total loss

The key utility: total loss = attenuation rate × distance.
This transforms a complex exponential calculation into straightforward multiplication.
Example: if attenuation rate is 0.0738 dB/m and the cable is 100 m long, the loss is (0.0738 dB/m) × (100 m) = 7.4 dB.

🔌 Example: long coaxial cable

A coaxial cable has attenuation constant α ≈ 8.5 × 10⁻³ m⁻¹.
Step 1: Calculate attenuation rate:
- attenuation rate ≈ 8.69 × (8.5 × 10⁻³) ≈ 0.0738 dB/m
Step 2: Calculate loss for 100 m:
- loss ≈ (0.0738 dB/m) × (100 m) ≈ 7.4 dB
Note: It is equally valid to express attenuation rate as 7.4 dB per 100 m; the concept is flexible.

🌐 Applies to unguided waves too

The same concept works for unguided waves (e.g., radio waves in free space or lossy media).
Reason: spatial power density has units of W/m², so the m⁻² terms cancel in the power density ratio, leaving a simple power ratio.
The attenuation rate calculation proceeds identically.

🔗 Connection to the decibel scale

🔗 Why 8.69?

The factor 8.69 arises from converting the natural exponential base e to the base-10 logarithm used in decibels.
Specifically: 20 log₁₀(e) ≈ 8.69.
Don't confuse: α itself is not in dB; it is a constant with units of inverse length (e.g., m⁻¹). The conversion to dB/length requires the 8.69 factor.

🔗 Decibel subtraction for loss

Recall from the decibel discussion: division in linear units becomes subtraction in dB.
If output power P₁ = P₀ / L (where L is loss in linear units), then in dB:
- P₁ (dB) = P₀ (dB) − L (dB)
Attenuation rate fits this framework: total loss in dB is simply subtracted from the input power in dB to find the output power in dB.

🔗 Units matter

The units of power in dB are relative to the original power units.
Example: if power is in mW, then 10 log₁₀(power in mW) has units of dBm (dB relative to 1 mW).
- 0 dBm = 1 mW
- +10 dBm = 10 mW
- −10 dBm = 0.1 mW
Attenuation rate itself is always in dB per unit length, independent of the absolute power units.

Poor Conductors

3.10 Poor Conductors

🧭 Overview

🧠 One-sentence thesis

Poor conductors are materials with low but non-negligible conductivity that behave mostly like ideal dielectrics except for significant ohmic loss, characterized by a loss tangent much less than 1.

📌 Key points (3–5)

What defines a poor conductor: loss tangent (ε″/ε′) ≪ 1, meaning loss is small relative to the lossless permittivity but may still be significant in practice.
Phase propagation: β for a poor conductor is approximately the same as for an ideal dielectric (β ≈ ω√(με′)).
Attenuation behavior: α is proportional to conductivity σ and wave impedance η, and varies slowly with frequency (approximately α ≈ ½ση).
Common confusion: "significant loss" depends on application—the same material (e.g., coaxial cable dielectric) may be treated as lossless for short lengths/low frequencies but lossy for long lengths/high frequencies.
Wave impedance approximation: ηc ≈ η (the lossless value), with the imaginary part being negligible.

🔍 Definition and criterion

🔍 What is a poor conductor

A poor conductor is a material for which conductivity is low, yet sufficient to exhibit significant loss.

The loss referred to here is conversion of electric field to current through Ohm's law (ohmic loss).
"Significant" depends on the application: short lengths at low frequencies may allow treating the material as lossless (σ = 0), but long lengths or higher frequencies require accounting for loss.
The material can still be treated in most other respects as an ideal dielectric.

📏 Quantitative criterion using loss tangent

The excerpt defines poor conductors using complex permittivity εc = ε′ − jε″:

ε″ quantifies loss; ε′ exists independently of loss (ε′ = ε for perfectly lossless materials).
Loss tangent = ε″/ε′ (see Section 3.5 referenced in the excerpt).

Criterion for poor conductor:

ε″/ε′ ≪ 1

This means ε″ is very small relative to ε′, so the material behaves mostly as an ideal dielectric except that ohmic loss may not be negligible.

🧪 Examples from the excerpt

Material	Application	Loss tangent
FR4 (fiberglass epoxy)	Printed circuit board substrate	ε″/ε′ ∼ 0.008 over typical frequency range
Polyethylene	Dielectric spacer in coaxial cables	Loss may or may not be significant depending on application

Don't confuse: the same material can be treated as lossless or lossy depending on cable length and frequency.

📐 Propagation constant characteristics

📐 General propagation constant

Starting from the general form:

γ² = −ω²μεc
γ = jω√(μ)√(ε′ − jε″)
γ = jω√(με′)√(1 − j(ε″/ε′))

The requirement ε″/ε′ ≪ 1 allows linearization using the binomial series.

🔢 Binomial series approximation

The excerpt invokes:

(1 + x)ⁿ = 1 + nx + n(n−1)/2! x² + ...

For x ≪ 1, terms with xⁿ (n ≥ 2) are very small, so:

(1 + x)ⁿ ≈ 1 + nx  for x ≪ 1

Applying with n = 1/2 and x = −j(ε″/ε′):

(1 − j(ε″/ε′))^(1/2) ≈ 1 − j(ε″/2ε′)

This yields:

γ ≈ jω√(με′)(1 − j(ε″/2ε′))
γ ≈ jω√(με′) + ω√(με′) · (ε″/2ε′)

🌊 Phase and attenuation constants

🌊 Phase propagation constant β

From the imaginary part of γ:

β = Im{γ} ≈ ω√(με′)  (poor conductor)

Remarkable result: β for a poor conductor is approximately equal to β for an ideal dielectric.

The phase propagation behaves as if the material were lossless.
Example: wave velocity and wavelength calculations can use the lossless formula.

📉 Attenuation constant α

From the real part of γ:

α = Re{γ} ≈ ω√(με′) · (ε″/2ε′)  (poor conductor)

Alternative forms given in the excerpt:

α ≈ ½β(ε″/ε′)  (poor conductor)

If εc is determined entirely by ohmic loss, then ε″/ε′ = σ/(ωε), so:

α ≈ ω√(με′) · σ/(2ωε)
α ≈ σ/2 · √(μ/ε)
α ≈ ½ση  (poor conductor)

where η = √(μ/ε′) is the wave impedance presuming lossless material.

🎯 Two remarkable properties of α

Weak frequency dependence: Factors of ω have been eliminated (except through the constitutive parameters σ, μ, ε, which vary slowly with frequency). Therefore, α varies slowly with frequency.
Proportionality: α is proportional to σ and η. This makes it easy to anticipate how attenuation changes with conductivity and wave impedance.

Example: doubling conductivity σ approximately doubles the attenuation constant α.

🔌 Wave impedance in poor conductors

🔌 General complex wave impedance

Recall the general form:

ηc = √(μ/ε′) · [1 − j(ε″/ε′)]^(−1/2)

🔌 Approximation for poor conductors

Applying the same binomial approximation (with n = −1/2):

ηc ≈ √(μ/ε′) · [1 − j(ε″/2ε′)]  (poor conductor)

Key observations:

Re{ηc} ≈ η (the lossless wave impedance)
Im{ηc} ≪ Re{ηc} (imaginary part is negligible)

Usual approximation:

ηc ≈ η  (poor conductor)

This means the wave impedance can be calculated as if the material were lossless, even though loss is present.

Don't confuse: the wave impedance is approximately real and lossless-like, but the attenuation constant α is non-zero and accounts for the loss.

Good Conductors

3.11 Good Conductors

🧭 Overview

🧠 One-sentence thesis

Good conductors are materials with very high conductivity (loss tangent ≫ 1) that behave almost like perfect conductors but exhibit significant resistive loss, with attenuation and phase constants approximately equal and both proportional to the square root of frequency.

📌 Key points (3–5)

Defining criterion: A good conductor has ε″/ε′ ≫ 1 (loss tangent much greater than 1), meaning conductivity dominates over permittivity effects.
Key approximation: For good conductors, α ≈ β, and both increase approximately as √f (square root of frequency).
Common confusion: "Loss" means different things—for non-conductors it means electric field energy converting to current; for conductors it means current energy dissipated in resistance.
Wave impedance behavior: The phase of ηc is always ≈ π/4, meaning the magnetic field is phase-shifted by π/4 relative to the electric field.
Surprising result: Phase velocity in good conductors is typically a tiny fraction of c (e.g., ~0.03% at typical conductivities), revealing that signals in transmission lines travel primarily in the space between conductors, not within them.

🔍 What makes a conductor "good"

🔍 Definition and criterion

A good conductor is a material having loss tangent much greater than 1.

The quantitative criterion is:

ε″/ε′ ≫ 1 (good conductor)
This means ε″ (related to conductivity σ) is very large relative to ε′ (permittivity).
Most metals qualify, especially high-conductivity metals like gold, copper, and aluminum.

⚠️ Disambiguating "loss"

The term "loss" has opposite meanings depending on material type:

Material type	What "loss" means	Goal
Non-conductors (dielectrics)	Electric field energy converting to current	Efficiently sustain electric field (requires low σ)
Conductors	Current energy dissipated in resistance	Efficiently sustain current (requires high σ)

Key insight: A good ("low-loss") conductor has high conductivity, so power dissipated in resistance is low relative to the current it carries.

Don't confuse: "Good conductor" does not mean "lossless"—it means the material behaves mostly like a perfect conductor but still exhibits significant resistive loss.

📐 Propagation constants in good conductors

📐 Attenuation constant α

Starting from the general propagation constant γ and using the approximation that ε″/ε′ ≫ 1:

α ≈ ω√(με″/2)

When loss is entirely due to conductivity (ε″ = σ/ω):

α ≈ √(ωμσ/2) = √(πfμσ)

Key characteristics:

α increases approximately as √f (square root of frequency).
α does not depend on ε′ (permittivity).
Since conductivity changes slowly with frequency, α varies slowly except for the √f dependence.

Example: The attenuation rate of transmission lines increases approximately as √f because the principal contribution comes from resistance in the conductors.

📐 Phase propagation constant β

For good conductors:

β ≈ α (good conductor)

This is a remarkable result:

β also increases as √f.
β also does not depend on ε′.
The attenuation and phase constants are approximately equal.

🌊 Wave impedance in good conductors

🌊 Complex impedance expression

Starting from the general wave impedance and applying the good conductor approximation:

ηc ≈ √(μ/(2ε″)) · (1 + j)

When loss is entirely due to conductivity:

ηc ≈ √(πfμ/σ) · (1 + j) = (α/σ) · (1 + j)

🌊 Key characteristics

No dependence on ε′: The physical permittivity does not affect wave impedance in good conductors (just as it doesn't affect α and β).
Phase is always ≈ π/4: The phase of ηc (denoted ψη) is approximately π/4 for any good conductor.
Magnitude is very different from poor conductors: Unlike poor conductors where ηc ≈ η = √(μ/ε), good conductors have much lower impedance magnitude.

🔄 Implications of π/4 phase

Two important consequences:

Field phase shift: Since ηc is the ratio of electric to magnetic field magnitude, the magnetic field phase is shifted by ≈ π/4 relative to the electric field.
Power density: Power density is proportional to cos(ψη). For good conductors, the ability to "extinguish" a wave is determined entirely by α (specifically proportional to e^(-αl) where l is distance traveled), not by the phase relationship.

Important note: Only a perfect conductor (σ → ∞) completely suppresses wave propagation; waves always penetrate some distance into any "good" conductor (measured by skin depth).

🚀 Phase velocity in good conductors

🚀 Surprising result

For good conductors:

vp = ω/β ≈ √(4πf/(μσ))

Key observations:

Phase velocity increases with frequency.
Phase velocity decreases with conductivity.
Phase velocity is usually a tiny fraction of c.

Example: For a non-magnetic (μ ≈ μ₀) good conductor with typical σ ~ 10⁶ S/m at 1 GHz:

vp ~ 100 km/s
This is only ~0.03% of the speed of light in free space

🚀 Profound implication for transmission lines

This result reveals something fundamental about signal propagation:

The information conveyed by signals propagating along transmission lines travels primarily within the space between the conductors, and not within the conductors.

Why: If information traveled primarily in the conductors, the apparent phase velocity would be orders of magnitude less than c (as shown above). But actual transmission line signals have phase velocity within an order of magnitude of c.

Remarkable fact: Classical transmission line theory using the R′, G′, C′, L′ equivalent circuit model correctly predicts this behavior, even though it doesn't explicitly consider guided waves traveling between the conductors.

📏 Comparison: good vs poor conductors

Property	Poor conductor (ε″/ε′ ≪ 1)	Good conductor (ε″/ε′ ≫ 1)
β	≈ ω√(με′) (like ideal dielectric)	≈ α ≈ √(πfμσ)
α	≈ (1/2)β(ε″/ε′) (small)	≈ β ≈ √(πfμσ)
ηc	≈ √(μ/ε′) (real, ≈ η)	≈ √(πfμ/σ)·(1+j) (complex)
Phase of ηc	≈ 0	≈ π/4
Depends on ε′?	Yes	No
Frequency dependence	Weak (through parameters)	Strong (∝ √f)

Skin Depth

3.12 Skin Depth

🧭 Overview

🧠 One-sentence thesis

Skin depth characterizes how far electromagnetic waves penetrate into lossy materials before their field magnitudes drop to about 37% of their original values, with good conductors exhibiting very shallow penetration that increases at lower frequencies.

📌 Key points (3–5)

What skin depth measures: the distance at which electric and magnetic field magnitudes are reduced by a factor of 1/e (approximately 0.368), or equivalently where about 86.5% of wave power is lost.
How to calculate it: skin depth equals 1/α (the reciprocal of the attenuation constant); for good conductors it depends on frequency, permeability, and conductivity.
Frequency dependence: in good conductors, skin depth decreases as frequency increases—higher frequencies penetrate less deeply.
Common confusion: thickness measured in multiples of skin depth matters more than absolute thickness; the same material thickness provides very different shielding at different frequencies.
Why it matters: skin depth determines electromagnetic shielding effectiveness and explains why AC current flows primarily near conductor surfaces rather than uniformly throughout.

📏 Definition and basic concept

📏 What skin depth represents

Skin depth δ_s: the distance over which the magnitude of the electric or magnetic field is reduced by a factor of 1/e ≈ 0.368.

This is an alternative way to characterize attenuation instead of using the attenuation constant α directly.
The excerpt notes that particular values of α "do not necessarily provide an intuitive sense" of attenuation rate, making skin depth a more practical measure.
The definition comes from the equation: e raised to the power (−α times δ_s) equals e raised to the power (−1), which equals approximately 0.368.

⚡ Power interpretation

Since power is proportional to the square of field magnitude, skin depth can also be interpreted as the distance where power is reduced by (1/e) squared, approximately 0.135.
In other words: δ_s is the distance at which approximately 86.5% of the power in the wave is lost.
This dual interpretation (field magnitude vs. power) provides flexibility depending on the application.

🧮 Calculation methods

🧮 General formula

The basic relationship is:

δ_s = 1/α (where α is the attenuation constant in units of inverse meters).
This makes skin depth "easy to compute" according to the excerpt.

🔌 Good conductor formula

For good conductors specifically:

δ_s ≈ 1 divided by the square root of (π times f times μ times σ)
Where f is frequency, μ is permeability, and σ is conductivity.
The excerpt notes this formula applies because for good conductors, α ≈ square root of (π times f times μ times σ).

Parameter	Effect on skin depth
Frequency (f)	Higher frequency → smaller skin depth (inverse square root relationship)
Conductivity (σ)	Higher conductivity → smaller skin depth (inverse square root relationship)
Permeability (μ)	Higher permeability → smaller skin depth (inverse square root relationship)

🔬 Practical example: aluminum shielding

🔬 Material properties

The excerpt provides aluminum as a worked example:

Conductivity: σ ≈ 3.7 × 10⁷ S/m
Permeability: μ ≈ μ₀ (non-magnetic)
These properties hold "over a broad range of radio frequencies."

📊 At 10 MHz (high frequency case)

Skin depth: δ_s ≈ 26 micrometers
A 1/16-inch (approximately 1.59 mm) aluminum sheet has thickness of approximately 61 skin depths.
Power reduction after passing through: e raised to the power (−122), which is "effectively zero from a practical engineering perspective."
Conclusion: 1/16-inch aluminum provides excellent shielding at 10 MHz.

📊 At 1 kHz (low frequency case)

Skin depth: δ_s ≈ 2.6 mm (100 times larger than at 10 MHz)
The same 1/16-inch aluminum sheet is now only approximately 0.6 skin depths thick.
Power reduction: only e raised to the power (−1.2) ≈ 0.3, meaning power density is reduced by only approximately 70%.
Conclusion: 1/16-inch aluminum "provides very little shielding at 1 kHz."

⚠️ Key insight: relative thickness matters

Don't confuse absolute thickness with effectiveness—the same physical thickness can be highly effective or nearly useless depending on frequency.
The excerpt demonstrates that the ratio of material thickness to skin depth determines shielding performance, not thickness alone.
Example: the aluminum sheet goes from 61 skin depths (excellent shielding) to 0.6 skin depths (poor shielding) simply by changing frequency from 10 MHz to 1 kHz.

🌊 Connection to wave propagation in conductors

🌊 Background context from preceding section

The excerpt includes context explaining why skin depth matters:

Waves can penetrate some distance into any conductor that is merely "good" (finite conductivity).
Only a perfect conductor (σ approaching infinity) can completely suppress wave propagation.
Field magnitudes are proportional to e raised to the power (−α times l), where l is distance traveled through the material.

🔄 Implications for current flow

The excerpt transitions to discussing AC current flow in good conductors:

In the DC case, current density J is uniform throughout a wire with uniform conductivity.
In the AC case, the electric field E exists as a wave that attenuates as it propagates into the conductor.
For a perfect conductor, E approaches zero everywhere inside, forcing current to exist entirely as surface current.
For good (finite-conductivity) conductors, current density diminishes with depth into the material, concentrated near the surface within a few skin depths.

📡 Transmission line insight

The excerpt notes a "profound" implication:

Information conveyed by signals on transmission lines travels "primarily within the space between the conductors, and not within the conductors."
This is because phase velocity in good conductors is typically a tiny fraction of the speed of light (example given: approximately 0.03% at 1 GHz for typical conductors).
Yet transmission line signals propagate at speeds within an order of magnitude of the speed of light, proving the energy travels in the space between conductors, not inside them.

4.1 AC Current Flow in a Good Conductor

🧭 Overview

🧠 One-sentence thesis

In AC conditions, current in a good conductor concentrates near the surface and decays exponentially inward—a phenomenon known as the skin effect—whereas in a perfect conductor all current exists entirely on the surface, and in DC conditions current distributes uniformly throughout.

📌 Key points (3–5)

DC vs AC distribution: At DC, current density is uniform throughout a wire; at AC, current density is maximum at the surface and decays exponentially inward.
Perfect conductor limiting case: In a perfect conductor (infinite conductivity), the electric field and current density both go to zero inside the material, forcing all current to exist as surface current.
Wave propagation direction: The relevant wave propagates perpendicular to the wire axis (from surface inward), not along the axis; this explains the radial decay of current density.
Common confusion: Don't apply Ohm's law naively when conductivity approaches infinity—both E and σ change simultaneously, so J = σE gives no useful information about current in a perfect conductor.
Frequency dependence: Higher frequency increases the attenuation constant α, concentrating current closer to the surface; lower frequency spreads current more uniformly.

🔌 DC baseline and the transition to AC

🔌 Uniform current in the DC case

In a simple DC circuit with a cylinder-shaped wire, a steady current flows through the wire.
As long as conductivity σ (SI base units S/m) is uniform throughout, the current density J (SI base units A/m²) is also uniform throughout.
The electric field intensity E is constant in the DC case.

🌊 Electric field as a wave in AC

In the AC case, E exists as a wave rather than a constant field.
In a good conductor, the magnitude of E decreases in proportion to e^(−αd), where α is the attenuation constant and d is distance traversed by the wave.
The attenuation constant α increases with increasing conductivity σ, so higher conductivity causes faster decay of E.

⚡ The perfect conductor limiting case

⚡ Zero field and current inside

In the limiting case of a perfect conductor, α approaches infinity, so E approaches zero everywhere inside the material.
Any current within the wire must result from either an impressed source or a response to E.
Without either, we conclude that in the AC case, J approaches zero everywhere inside a perfect conductor.

🔄 Surface current as the only solution

Surface current: current that exists entirely outside the wire material, yet bound to the surface of the wire.

If J = 0 inside the material, current must still pass through the wire somehow.
The only possibility: current exists entirely on the surface of the material.
Key conclusion: In the AC case, the current passed by a perfectly-conducting material lies entirely on the surface of the material.
This case is unobtainable in practice, but provides a starting point for understanding finite conductivity.

🚫 Why Ohm's law doesn't help here

You might be tempted to invoke Ohm's law (J = σE) to argue against the conclusion that J → 0.
However, Ohm's law provides no useful information when σ → ∞ and E → 0 simultaneously.
What Ohm's law really says in this case: E = J/σ → 0 because J must be finite and σ → ∞.
Don't confuse: This is not a violation of Ohm's law; it's a case where the law becomes indeterminate and doesn't constrain the solution.

🌀 Wave propagation in finite conductivity

🌀 Finite conductivity allows penetration

If σ is merely finite (not infinite), then α is also finite.
Wave magnitude may be non-zero over finite distances inside the material.
This opens the possibility of current existing below the surface, not just on it.

📐 Principal directions of propagation

The two principal directions are:
1. Parallel to the axis of the wire
2. Perpendicular to the axis of the wire (radially inward)
Waves in any other direction can be expressed as a linear combination of waves in these principal directions.
Therefore, we need only consider these two directions for a complete picture.

⬇️ Perpendicular (radial) propagation

Boundary condition: Non-zero surface current, inferred from the perfectly-conducting case.
Depth dependence: E deep inside the wire must be weaker than E closer to the surface, because a wave deep inside has traversed more material.
Applying Ohm's law (J = σE), current deep inside the wire similarly diminishes for a good conductor.
Conclusion: A wave travels in the perpendicular direction (from surface toward center), decreasing in magnitude with increasing distance from the surface.

↔️ No axial propagation

We cannot infer the presence of a wave traveling along the axis of the wire.
There is no apparent boundary condition to be satisfied on either end of the wire.
If such a wave existed, different cross-sections would exhibit different radial distributions of current, which is not consistent with physical observations.
Conclusion: The only relevant wave is one that travels from the surface inward (perpendicular direction).

🧴 The skin effect

🧴 Definition and behavior

Skin effect: the phenomenon in which AC current in a good conductor is distributed with maximum current density on the surface of the wire, and the current density decays exponentially with increasing distance from the surface.

The term refers to the notion of current forming a skin-like layer below the surface of the wire.
Since current density is proportional to electric field magnitude, and E decays exponentially inward, so does J.

📊 Frequency dependence of current distribution

Frequency	Attenuation constant α	Current distribution
DC (zero)	Zero	Uniform throughout wire
Low (intermediate)	Small (intermediate)	Intermediate state—some concentration near surface
High	Large	Concentrated close to surface

Since α increases with increasing frequency, the specific distribution of current within the wire depends on frequency.
At high frequencies, current is concentrated close to the surface.
At DC, current is uniformly distributed throughout the wire.
At intermediate frequencies, current is in an intermediate state.

🖼️ Visual representation

Figure 4.2 illustrates the distribution of AC current in a wire of circular cross-section.
Shading indicates current density: darker near the surface, lighter toward the center.
Example: In a thick wire at high frequency, most current flows in a thin layer near the surface; the core carries very little current.

Impedance of a Wire

4.2 Impedance of a Wire

🧭 Overview

🧠 One-sentence thesis

The impedance of a wire carrying AC current depends on frequency because current concentrates near the surface (skin effect), making resistance proportional to the square root of frequency and introducing a reactive component that can be modeled as an equivalent inductance.

📌 Key points (3–5)

DC vs AC impedance: DC impedance is simply resistance R = l/(σA); AC impedance is complex (R + jX) and depends on skin depth.
Skin effect drives frequency dependence: current concentrates near the surface at AC, so effective cross-sectional area shrinks to δₛW (skin depth × width), not the full physical area.
Resistance grows with frequency: AC resistance is proportional to √f, so quadrupling frequency doubles resistance.
Reactive component mimics inductance: the imaginary part of impedance equals the real part (X ≈ R), but it's not true energy storage—it arises from phase shift between potential and current.
Common confusion: the "equivalent inductance" decreases with √f (opposite to resistance), and it's not magnetic energy storage but rather a phase-shift effect from wave propagation in the conductor.

🔌 DC and perfect-conductor baselines

🔌 DC case: uniform current

DC impedance: R = l/(σA), where l is length, σ is conductivity, and A is cross-sectional area.

Current is uniformly distributed throughout the wire.
Impedance is purely resistive (no reactive component).
This formula does not apply when current is time-varying.

⚡ Perfect conductor: zero impedance

A perfect conductor (σ → ∞) has zero impedance at any frequency.
No mechanism exists to dissipate or store energy.
This is an idealization; all practical wires are "good" but not perfect conductors.

🌊 AC case: skin effect and non-uniform current

🌊 Why current concentrates at the surface

At AC, a wave propagates inward from the surface of the wire (perpendicular to the wire axis).
The wave magnitude decays exponentially with distance from the surface (attenuation constant α).
By Ohm's law (J = σE), current density J also decays exponentially inward.
Result: maximum current density is on the surface; it falls off exponentially toward the center.

📉 Skin depth as effective thickness

Skin depth δₛ: the depth to which uniform (DC) current would need to flow to produce the same resistance as the observed AC resistance.

The product δₛW (skin depth × width) replaces the physical cross-sectional area A in the resistance formula.
For a cylindrical wire of radius a, when δₛ ≪ a, replace W with the circumference 2πa.
Example: a wire with large radius compared to skin depth behaves as if current flows only in a thin shell near the surface.

🔍 When the model is valid

The derived AC formulas apply when δₛ ≪ a (skin depth much smaller than wire radius).
This requires f ≫ 1/(πμσa²).
For typical good-conductor wires, this condition holds at MHz frequencies and above.
Don't confuse: if a < δₛ or a ∼ δₛ, current density is significant throughout the wire (including the axis), and the surface-current model breaks down.

📐 Deriving AC impedance: the planar model

📐 Planar geometry simplification

Exact solution for a cylindrical wire is difficult.
Instead, model a semi-infinite planar conductor with a plane wave propagating inward from the surface (z = 0, increasing z = increasing depth).
Electric field: Ẽ = x̂ E₀ exp(−αz) exp(−jβz).
Current density: J̃ = σẼ = x̂ σE₀ exp(−z/δₛ) exp(−jz/δₛ) (using α ≈ β for good conductors).

🧮 Integrating to find net current

Net current Ĩ is the integral of J̃ over a cross-section S perpendicular to J̃.
Choose S as a rectangle of width W (in y) extending to infinity in z.
The integral over z from 0 to ∞ of exp(−Kz) with K = (1+j)/δₛ yields 1/K.
Result: Ĩ ≈ σE₀W δₛ/(1+j).

⚡ Voltage and impedance

Voltage Ṽ = E₀l (integrate electric field along length l on the surface where z = 0).
Impedance Z = Ṽ/Ĩ ≈ (1+j)/(σδₛ) · (l/W).
For a cylindrical wire with δₛ ≪ a, replace W with 2πa: Z ≈ (1+j)/(σδₛ) · l/(2πa).

🔢 AC resistance and its frequency dependence

🔢 Ohmic resistance formula

Ohmic resistance (AC): R ≈ l/(σ(δₛ · 2πa)) when δₛ ≪ a.

The real part of Z is the resistance.
Compare to DC: R_DC = l/(σA) with A = πa².
The effective area is now δₛ · 2πa (a thin shell), not the full cross-section.

📈 Proportional to √f

Since δₛ ≈ 1/√(πfμσ) for a good conductor, substitute into the resistance formula.
Result: R ≈ (1/2)√(μf/(πσ)) · (l/a).
Key finding: resistance is proportional to √f.
Example: increasing frequency by a factor of 4 increases resistance by a factor of 2.

🔄 Bridging DC and high-frequency regimes

At very low frequencies (f ≪ 1/(πμσa²)), use the DC formula R = l/(σπa²).
At high frequencies (f ≫ 1/(πμσa²)), use R ≈ (1/2)√(μf/(πσ)) · (l/a).
Between these regimes, resistance changes slowly (∝ √f), so you can estimate by comparing DC and high-frequency values.

Frequency regime	Condition	Resistance behavior
DC / very low f	δₛ ≫ a	R ≈ l/(σπa²), constant
Intermediate	δₛ ∼ a	R increases ∝ √f
High f (MHz+)	δₛ ≪ a	R ≈ (1/2)√(μf/(πσ)) · (l/a)

📊 Example: RG-59 coaxial cable inner conductor

Parameters: μ ≈ μ₀, σ ≈ 2.28×10⁷ S/m, a ≈ 0.292 mm.
High-frequency formula valid for f ≫ 130 kHz.
At 13 MHz: R′ (resistance per unit length) ≈ 0.82 Ω/m.
At DC (f ≪ 130 kHz): R′ ≈ 0.16 Ω/m, approximately constant.
As frequency increases from DC to 13 MHz, R′ rises monotonically from ≈0.16 to ≈0.82 Ω/m, then continues ∝ √f.

⚙️ Reactive component and equivalent inductance

⚙️ Imaginary part of impedance

From Z ≈ (1+j)/(σδₛ) · (l/W), the imaginary part X equals the real part R.
X ≈ R ≈ l/(σ(δₛ · 2πa)).
This reactance is positive, so it resembles inductive reactance.

🔋 Not true inductance

Important: this reactance is not due to magnetic energy storage (true inductance).
It arises from a phase shift between potential and current, the same phase shift found between electric and magnetic fields in a propagating wave in a good conductor.
The term "equivalent inductance" is used for circuit modeling convenience, not physical accuracy.

📉 Equivalent inductance formula

Equivalent inductance: L_eq ≈ (1/(4π^(3/2))) √(μ/(σf)) · (l/a) when δₛ ≪ a.

Unlike resistance, L_eq decreases with √f.
Example (RG-59 inner conductor): L′_eq ≈ 3.61×10⁻⁵ H·Hz^(1/2) / √f per unit length.
At f ≈ 9.52 kHz, this equals the magnetostatic inductance (≈370 nH/m).
At higher frequencies, the skin-effect inductance becomes less important relative to magnetostatic inductance.

🌈 Dispersion consequence

Phase velocity in a low-loss transmission line is approximately 1/√(L′C′).
Skin effect increases L′ at lower frequencies, so phase velocity decreases.
Result: higher frequencies travel faster than lower frequencies—this is chromatic dispersion.
Dispersion causes significant distortion for wideband signals.

🔌 Equivalent circuit model

A practical wire at AC can be modeled as an ideal resistor (Equation 4.17) in series with an ideal inductor (Equation 4.22).

Resistance increases with √f; equivalent inductance decreases with √f.
This circuit model is valid when δₛ ≪ a (typically MHz and above).

🧱 Surface impedance: a material property

🧱 Definition and units

Surface impedance Z_S = (1+j)/(σδₛ).

Z_S is a materials property, independent of geometry (length l and width W).
Terminal impedance Z ≈ Z_S · (l/W).
Units: Ω/□ ("ohms per square") to distinguish from terminal impedance (Ω).

🔬 Applications

Surface impedance characterizes AC impedance of a material without reference to dimensions.
Analogous to intrinsic impedance η (also a materials property).
The real part of Z_S is called surface resistance or sheet resistance.
Commonly used to specify sheet materials in electronics and semiconductor manufacturing.

🔍 Comparison to intrinsic impedance

Both Z_S and η are materials properties (not geometry-dependent).
η describes wave impedance in a medium; Z_S describes AC impedance of a conductor surface.
Don't confuse: Z_S has units Ω/□; terminal impedance Z has units Ω.

Surface Impedance

4.3 Surface Impedance

🧭 Overview

🧠 One-sentence thesis

Surface impedance is a materials property that characterizes AC impedance independently of geometry, enabling standardized specification of conductor behavior at high frequencies.

📌 Key points (3–5)

Definition: Surface impedance Z_S separates material properties from geometry in the impedance formula, expressed in units of "ohms per square" (Ω/□).
Relationship to terminal impedance: Terminal impedance Z equals surface impedance Z_S multiplied by the ratio of length to width (l/W).
Materials property: Unlike terminal impedance, surface impedance depends only on conductivity σ and skin depth δ_s, not on the physical dimensions of the conductor.
Common confusion: Surface impedance has units of ohms but is labeled "ohms per square" to distinguish it from terminal impedance; it's analogous to intrinsic impedance η.
Practical use: Widely used to specify sheet materials in electronics and semiconductor manufacturing, where the real part is called surface resistance or sheet resistance.

🔬 From terminal to surface impedance

🔬 Starting with terminal impedance

The excerpt begins with the impedance formula for a good conductor:

Z ≈ (1 + j)/(σδ_s) · (l/W)

Where:

σ = conductivity (S/m)
δ_s = skin depth
l = length
W = width

This formula mixes two types of information:

Material properties: σ and δ_s (intrinsic to the conductor material)
Geometry: l and W (how the conductor is shaped)

🎯 Motivation for separation

The excerpt emphasizes that σ and δ_s "are constitutive parameters of material, and do not depend on geometry; whereas l and W describe geometry."

This observation motivates defining a quantity that captures only the material behavior, independent of how the conductor is cut or shaped.

📐 Definition and formula

📐 Surface impedance defined

Surface impedance Z_S: Z_S = (1 + j)/(σδ_s)

This extracts the material-dependent part from the terminal impedance formula.

🔗 Relating surface to terminal impedance

Once surface impedance is defined, terminal impedance becomes:

Z ≈ Z_S · (l/W)

Why this matters:

Any conductor made from the same material has the same Z_S
Different shapes (different l/W ratios) scale the terminal impedance proportionally
Example: Two copper sheets with different dimensions share the same Z_S but have different terminal impedances

📏 Units: "ohms per square"

The excerpt notes that although Z_S has units of ohms, it is indicated as:

"Ω/□" ("ohms per square")

Purpose of special notation: Prevents confusion with terminal impedance Z, which also has units of ohms but represents a different physical quantity.

Don't confuse:

Terminal impedance Z (ohms) = depends on both material and geometry
Surface impedance Z_S (Ω/□) = depends only on material

🧲 Comparison with intrinsic impedance

🧲 Analogous materials property

The excerpt draws a parallel:

"In this way, it is like the intrinsic or 'wave' impedance η, which is also a materials property."

Both Z_S and η:

Characterize material behavior
Are independent of geometry
Have units of ohms (though Z_S uses Ω/□ notation)

Key difference (implied by context):

Intrinsic impedance η relates to wave propagation in the bulk material
Surface impedance Z_S relates to current flow at AC frequencies in conductors

🏭 Practical applications

🏭 Sheet materials in manufacturing

The excerpt highlights real-world use:

"Surface impedance is often used to specify sheet materials used in the manufacture of electronic and semiconductor devices"

Why sheet materials: The l/W ratio becomes particularly simple for thin sheets, making Z_S a natural specification parameter.

🔴 Surface resistance / sheet resistance

The excerpt notes:

"the real part of the surface impedance is more commonly known as the surface resistance or sheet resistance"

Industry terminology:

Surface impedance Z_S = complex quantity (has real and imaginary parts)
Surface resistance = real part of Z_S
Sheet resistance = same as surface resistance (common in semiconductor industry)

Example: A manufacturer might specify a conductive film by its sheet resistance in Ω/□, allowing designers to calculate terminal impedance for any geometry.

Plane Waves at Normal Incidence on a Planar Boundary

5.1 Plane Waves at Normal Incidence on a Planar Boundary

🧭 Overview

🧠 One-sentence thesis

When a plane wave strikes a planar boundary between two materials perpendicularly, the reflection and transmission behavior is fully determined by the wave impedances of the two regions, analogous to voltage reflection in transmission lines.

📌 Key points (3–5)

What happens at the boundary: an incident wave splits into a reflected wave (traveling backward) and a transmitted wave (traveling forward into the second medium).
The reflection coefficient: the ratio of reflected to incident field amplitude depends on the wave impedances of both regions, given by Γ₁₂ = (η₂ − η₁)/(η₂ + η₁).
Boundary conditions determine unknowns: continuity of tangential electric and magnetic fields at the boundary provides two equations to solve for the reflected and transmitted amplitudes.
Common confusion: wave impedance η (a material property) vs terminal impedance—surface impedance uses units of Ω/□ ("ohms per square") to distinguish it.
Power split: the fraction of power density reflected is |Γ₁₂|², and the fraction transmitted is 1 − |Γ₁₂|².

🌊 The incident, reflected, and transmitted waves

🌊 Incident wave as the stimulus

The incident electric field in Region 1 is given as E_i(z) = x-hat E_i0 exp(−jβ₁z) for z ≤ 0.
β₁ = ω√(μ₁ε₁) is the phase propagation constant in Region 1.
E_i0 is a complex constant representing the amplitude and phase.
The associated magnetic field is H_i(z) = y-hat (E_i0/η₁) exp(−jβ₁z), where η₁ = √(μ₁/ε₁) is the wave impedance.
Why it matters: all other field contributions (reflected and transmitted) can be expressed in terms of E_i0.

🔄 Reflected wave in Region 1

Symmetry and the wave equation imply a wave traveling in the −z direction.
Electric field: E_r(z) = x-hat B exp(+jβ₁z) for z ≤ 0, where B is unknown.
Magnetic field: H_r(z) = −y-hat (B/η₁) exp(+jβ₁z).
The negative sign in the magnetic field arises because the direction of propagation is reversed.
Don't confuse: the reflected wave uses the same β₁ and η₁ as the incident wave because it is still in Region 1.

➡️ Transmitted wave in Region 2

Symmetry implies a wave traveling in the +z direction in Region 2, with no backward-traveling component for z > 0.
Electric field: E_t(z) = x-hat C exp(−jβ₂z) for z ≥ 0, where C is unknown.
Magnetic field: H_t(z) = y-hat (C/η₂) exp(−jβ₂z).
β₂ = ω√(μ₂ε₂) and η₂ = √(μ₂/ε₂) are the phase constant and wave impedance in Region 2.

🔗 Applying boundary conditions

🔗 Continuity of tangential electric field

The tangential component of total electric field must be continuous at z = 0.
Total field in Region 1: E₁(z) = E_i(z) + E_r(z).
Total field in Region 2: E₂(z) = E_t(z).
At the boundary: E_i(0) + E_r(0) = E_t(0), which gives E_i0 + B = C.
This is the first equation relating the unknowns B and C.

🔗 Continuity of tangential magnetic field

No impressed surface current exists, so the tangential magnetic field must also be continuous.
Total magnetic field in Region 1: H₁(z) = H_i(z) + H_r(z).
Total magnetic field in Region 2: H₂(z) = H_t(z).
At the boundary: H_i(0) + H_r(0) = H_t(0), which gives (E_i0/η₁) − (B/η₁) = (C/η₂).
This is the second equation needed to solve for B and C.

🧮 Solving the system

Substitute C = E_i0 + B from the first equation into the second equation.
Solve for B: B = Γ₁₂ E_i0, where Γ₁₂ = (η₂ − η₁)/(η₂ + η₁).
Solve for C: C = (1 + Γ₁₂) E_i0.
Result: the reflected field is E_r(z) = x-hat Γ₁₂ E_i0 exp(+jβ₁z) for z ≤ 0, and the transmitted field is E_t(z) = x-hat (1 + Γ₁₂) E_i0 exp(−jβ₂z) for z ≥ 0.

📐 The reflection coefficient and special cases

📐 Definition and meaning

Reflection coefficient Γ₁₂: the ratio of reflected to incident electric field amplitude, given by Γ₁₂ = (η₂ − η₁)/(η₂ + η₁).

The subscript "12" indicates incidence from Region 1 toward Region 2.
Γ₁₂ is a complex number in general, encoding both amplitude and phase changes.

✅ Special case: matched media

If Region 2 is identical to Region 1, then η₂ = η₁.
Γ₁₂ = 0, so there is no reflection.
The transmitted field equals the incident field, as expected.

⚡ Special case: perfect conductor

If Region 2 is a perfect conductor, the electric field inside must be zero.
η₂ = 0 (ratio of electric to magnetic field is zero in a perfect conductor).
Γ₁₂ = −1, so the reflected field has the same magnitude but opposite sign.
1 + Γ₁₂ = 0, confirming that the transmitted field is zero.
The boundary condition E_i(0) + E_r(0) = 0 is satisfied by the sign change.

🔄 Alternative expressions for Γ₁₂

For lossless, non-magnetic media (μ = μ₀), the reflection coefficient can be written in terms of permittivity: Γ₁₂ = (√ε₁ − √ε₂)/(√ε₁ + √ε₂).
In terms of relative permittivity: Γ₁₂ = (√ε_r1 − √ε_r2)/(√ε_r1 + √ε_r2).
In optics, using the refractive index n = √ε_r: Γ₁₂ = (n₁ − n₂)/(n₁ + n₂).

🔌 Analogy to transmission lines

🔌 Voltage reflection coefficient

In transmission lines, the voltage reflection coefficient is Γ = (Z_L − Z₀)/(Z_L + Z₀), where Z_L is the load impedance and Z₀ is the characteristic impedance.
Comparing to Γ₁₂ = (η₂ − η₁)/(η₂ + η₁), we see η₁ is analogous to Z₀ and η₂ is analogous to Z_L.
Matched load: η₂ = η₁ is like Z_L = Z₀ (no reflection).
Short-circuit load: η₂ = 0 is like Z_L = 0 (total reflection with sign

Plane Waves at Normal Incidence on a Material Slab

5.2 Plane Waves at Normal Incidence on a Material Slab

🧭 Overview

🧠 One-sentence thesis

A plane wave incident on a material slab sandwiched between two semi-infinite media can be analyzed by treating the slab as an equivalent single boundary with a complex wave impedance that accounts for reflections at both interfaces and the phase shift within the slab.

📌 Key points (3–5)

The double-boundary problem: A slab (Region 2) between two semi-infinite media (Regions 1 and 3) creates two boundaries where waves reflect and transmit, requiring simultaneous solution of boundary conditions at both interfaces.
Equivalent wave impedance: The slab can be replaced by an equivalent single boundary with wave impedance η_eq that captures the combined effect of both interfaces and the slab thickness.
Complex impedance without loss: Even for lossless materials, η_eq is generally complex-valued; the phase represents standing-wave effects inside the slab, not material loss.
Common confusion: The complex value of η_eq does not indicate lossy material—it arises from the phase relationship between electric and magnetic fields due to reflections within the slab, analogous to transmission line input impedance.
Power conservation: In lossless slabs, all power not reflected from the first boundary is transmitted into Region 3, so transmitted power fraction equals 1 minus the squared magnitude of the equivalent reflection coefficient.

🏗️ Problem setup and field structure

🏗️ Geometry and regions

Three regions: Region 1 (semi-infinite, z ≤ −d), Region 2 (slab, −d ≤ z ≤ 0, thickness d), Region 3 (semi-infinite, z ≥ 0).
Boundaries: First boundary at z = −d (between Regions 1 and 2), second boundary at z = 0 (between Regions 2 and 3).
All media are "simple" and lossless: real-valued permittivity and permeability, with imaginary permittivity ε″ = 0.

📡 Wave components in each region

The excerpt identifies six wave components (three forward, three reflected):

Region	Forward wave	Reflected wave	Total field
Region 1	Incident: E_i, H_i (traveling −z)	Reflected: E_r, H_r (traveling +z, amplitude B)	Sum of incident and reflected
Region 2	Transmitted: E_t2, H_t2 (traveling −z, amplitude C)	Reflected: E_r2, H_r2 (traveling +z, amplitude D)	Sum of both components
Region 3	Transmitted: E_t, H_t (traveling −z, amplitude F)	None	Only transmitted wave

Unknowns: Four complex constants B, C, D, F must be determined.
Symmetry: The problem assumes the same polarization and direction structure as the single-boundary case.

🔗 Boundary conditions

Continuity of total electric and magnetic fields at each boundary yields four equations:

At z = 0: E_t2(0) + E_r2(0) = E_t(0) and H_t2(0) + H_r2(0) = H_t(0).
At z = −d: E_i(−d) + E_r(−d) = E_t2(−d) + E_r2(−d) and similarly for H.
These four equations form a system of simultaneous linear equations solvable for the four unknowns.

🔄 Equivalent single-boundary representation

🔄 Key insight: replacing the slab

The double-boundary problem can be replaced by an "equivalent" single-boundary problem where the region to the right of z = −d has an equivalent wave impedance η_eq.

Equivalence means: Fields in Region 1 (incident and reflected) are identical to the original problem.
Why useful: Most applications care only about reflection into Region 1 and transmission into Region 3, not the internal fields in Region 2.
Example: Instead of solving for all internal fields, we find one parameter (η_eq) that captures the slab's effect on reflection and transmission.

🧮 Deriving η_eq

The equivalent wave impedance is defined as the ratio of total electric to total magnetic field intensity at the z = −d boundary in Region 2:

η_eq = η_2 × [1 + Γ_23 × exp(−j 2 β_2 d)] / [1 − Γ_23 × exp(−j 2 β_2 d)]

where:

η_2 is the wave impedance in Region 2.
Γ_23 = (η_3 − η_2) / (η_3 + η_2) is the reflection coefficient at the Region 2–3 boundary.
β_2 = ω × sqrt(μ_2 ε_2) is the phase propagation constant in Region 2.
d is the slab thickness.

Sanity check: If Regions 2 and 3 are identical (η_2 = η_3), then Γ_23 = 0 and η_eq = η_2, as expected.

🌀 Complex impedance interpretation

Don't confuse with loss: η_eq is generally complex-valued even for lossless materials.
The imaginary part does not indicate material loss; it represents the phase shift between electric and magnetic fields due to standing waves inside the slab.
Analogy: This is precisely analogous to transmission line input impedance, which is complex even for lossless lines with real characteristic and load impedances.

📊 Reflection and transmission calculations

📊 Equivalent reflection coefficient

Once η_eq is known, define:

Γ_1,eq = (η_eq − η_1) / (η_eq + η_1)

This is used exactly like the single-boundary reflection coefficient Γ_12.
It determines the reflected fields in Region 1: the amplitude B = Γ_1,eq × E_i^0.

💡 Power density ratios

Reflected power fraction: S_r^ave / S_i^ave = |Γ_1,eq|², where S_r^ave and S_i^ave are reflected and incident power densities.
Transmitted power fraction (for lossless slab): S_t^ave / S_i^ave = 1 − |Γ_1,eq|², by conservation of power.
No power is dissipated in a lossless slab, so all power not reflected must be transmitted into Region 3.

📡 Example: WiFi through glass

The excerpt provides a worked example (Example 5.2 and 5.3):

Setup: WiFi signal at 2.45 GHz, glass pane 1 cm thick, ε_r = 4, lossless.
Regions: 1 and 3 are free space (η_1 = η_3 ≈ 376.7 Ω), Region 2 is glass (η_2 ≈ 188.4 Ω).
Calculation steps:
1. Γ_23 ≈ 0.3333.
2. β_2 ≈ 102.6 rad/m.
3. η_eq ≈ 117.9 − j78.4 Ω (complex, but glass is lossless).
4. Γ_1,eq ≈ −0.4859 − j0.2354.
5. Reflected power: |Γ_1,eq|² ≈ 29.2%.
6. Transmitted power: 1 − |Γ_1,eq|² ≈ 70.8%.
Interpretation: About 29% of WiFi power is reflected by the glass pane, and about 71% passes through.

🎯 Total transmission condition

🎯 Goal: zero reflection

The excerpt introduces a class of applications seeking "total transmission":

Definition: 100% of incident power transmitted into Region 3, 0% reflected back into Region 1.
This requires Γ_1,eq = 0, which means η_eq = η_1.

🛡️ Applications

Radome: A protective covering around an antenna that does not interfere with transmitted or received waves.
RF and optical filtering: Structures that pass or reject waves in narrow frequency ranges.

🔧 Conditions for total transmission

The excerpt states that conditions for total transmission will be described, involving:

Matching the equivalent wave impedance η_eq to the impedance of Region 1 (η_1).
This depends on the permittivity and permeability of all three regions, the slab thickness d, and the frequency (through β_2).

(Note: The excerpt cuts off before fully detailing these conditions in Section 5.3.)

Total Transmission Through a Slab

5.3 Total Transmission Through a Slab

🧭 Overview

🧠 One-sentence thesis

Total transmission (100% power passing through a slab with 0% reflection) can be achieved by designing the slab thickness and material properties to satisfy either half-wave matching or quarter-wave matching conditions.

📌 Key points (3–5)

Total transmission definition: 100% of incident power passes through the slab into Region 3, with 0% reflected back into Region 1.
Two design strategies: half-wave matching (when materials on both sides of the slab are the same) and quarter-wave matching (when materials differ but are related by a specific impedance relationship).
Half-wave matching: slab thickness must be an integer multiple of half-wavelengths; no restriction on slab material properties, only that the two outer regions have equal wave impedance.
Quarter-wave matching: slab thickness must be an odd multiple of quarter-wavelengths, and the slab's wave impedance must equal the geometric mean of the two outer regions' impedances.
Common confusion: both methods are narrowband—they achieve total transmission only at the design frequency; as frequency changes, reflection increases and transmission decreases.

🔍 Condition for total transmission

🔍 Zero reflection requirement

Total transmission requires that the equivalent reflection coefficient equals zero: Γ₁,eq = 0.

From the reflection coefficient formula, Γ₁,eq = 0 when the equivalent impedance η_eq equals the impedance of Region 1 (η₁).
The equivalent impedance depends on the slab impedance η₂, the thickness d, the phase propagation constant β₂, and the reflection coefficient at the Region 2–3 interface (Γ₂₃).
The excerpt derives that total transmission requires: η₁ = η₂ × [(1 + P)η₃ + (1 − P)η₂] / [(1 − P)η₃ + (1 + P)η₂], where P = exp(−j2β₂d).

🧮 Two categories of solutions

The excerpt identifies two general categories: half-wave matching and quarter-wave matching.
Both arise from different choices of the parameter P (which encodes the phase shift through the slab).
Each category has distinct requirements for material properties and thickness.

🌊 Half-wave matching

🌊 When to use half-wave matching

Applies when the material on either side of the slab is the same: η₁ = η₃.
The excerpt calls this common value η_ext (external impedance).
No restriction on the permittivity or permeability of the slab itself.
The only constraint on Regions 1 and 3 is that they have equal wave impedance.

📏 Thickness requirement

Setting P = +1 satisfies the total transmission condition.
This requires 2β₂d = 2πm, where m = 1, 2, 3, ...
Solving for thickness: d = (λ₂/2) × m, where λ₂ is the wavelength inside the slab.
In words: the slab thickness must be an integer number of half-wavelengths at the frequency of interest.

📡 Example: radome design

Scenario: A 60 GHz radar antenna needs weather protection; the radome panel material has relative permeability ≈ 1 and relative permittivity = 4; minimum thickness is 3 mm for mechanical integrity.

Solution steps:

Phase velocity in the slab: v_p = c / √ε_r ≈ 1.5 × 10⁸ m/s.
Wavelength in the slab: λ₂ = v_p / f ≈ 2.5 mm.
Minimum half-wave thickness: d = λ₂/2 ≈ 1.25 mm (too thin).
Next option: d = λ₂ ≈ 2.5 mm (still too thin).
Next option: d = 3λ₂/2 ≈ 3.75 mm (meets the 3 mm requirement).
Selected thickness: d ≈ 3.75 mm.

⚠️ Narrowband limitation

Half-wave matching designs are narrowband: total transmission occurs only at the design frequency.
As frequency increases or decreases from the design frequency, reflection increases and transmission decreases.
Don't confuse: the design works perfectly at one frequency but degrades at others.

🔷 Quarter-wave matching

🔷 When to use quarter-wave matching

Requires that the wave impedances in each region are different and related in a particular way.
The slab impedance must be the geometric mean of the two outer regions: η₂ = √(η₁η₃).
Unlike half-wave matching, the slab material properties are constrained by this impedance relationship.

📏 Thickness requirement

Setting P = −1 satisfies the total transmission condition.
This requires 2β₂d = π + 2πm, where m = 0, 1, 2, ...
Solving for thickness: d = λ₂/4 + (λ₂/2) × m, where λ₂ is the wavelength inside the slab.
In words: the slab must be one-quarter wavelength thick, or some integer number of half-wavelengths thicker if needed.

📡 Example: radome with embedded antenna

Scenario: A 60 GHz radar antenna is embedded in a lossless material with relative permeability ≈ 1 and relative permittivity = 2; the radome panel is placed between this material and free space; the panel must be lossless and non-magnetic; no minimum thickness requirement.

Solution steps:

Required slab impedance: η₂ = √(η₁η₃) = √[(η₀/√2) × η₀] ≈ 317 Ω.
Since the panel is non-magnetic, the relative permittivity must be: ε_r = (η₀/η₂)² ≈ 1.41.
Phase velocity in the slab: v_p = c / √ε_r ≈ 2.53 × 10⁸ m/s.
Wavelength in the slab: λ₂ = v_p / f ≈ 4.20 mm.
Minimum thickness: d = λ₂/4 ≈ 1.05 mm.
Design: relative permittivity ≈ 1.41, thickness ≈ 1.05 mm.

🔄 Comparison with half-wave matching

Feature	Half-wave matching	Quarter-wave matching
When to use	Same material on both sides (η₁ = η₃)	Different materials on each side
Slab material constraint	None (any permittivity/permeability)	Must satisfy η₂ = √(η₁η₃)
Minimum thickness	λ₂/2 (half-wavelength)	λ₂/4 (quarter-wavelength)
Thickness formula	d = (λ₂/2) × m, m = 1, 2, 3, ...	d = λ₂/4 + (λ₂/2) × m, m = 0, 1, 2, ...
Bandwidth	Narrowband (both methods)	Narrowband (both methods)

🎯 Applications

🎯 Radome design

Radome: a protective covering which partially or completely surrounds an antenna, but nominally does not interfere with waves being received by or transmitted from the antenna.

The excerpt provides two radome design examples using the two matching strategies.
The choice between half-wave and quarter-wave matching depends on whether the materials on both sides of the radome are the same or different.

🎯 RF and optical wave filtering

Another application mentioned is passing or rejecting waves falling within a narrow range of frequencies.
The narrowband nature of both matching methods makes them suitable for frequency-selective applications.

Propagation of a Uniform Plane Wave in an Arbitrary Direction

5.4 Propagation of a Uniform Plane Wave in an Arbitrary Direction

🧭 Overview

🧠 One-sentence thesis

Ray-fixed coordinates provide a unique, coordinate-system-independent way to describe uniform plane waves propagating in any direction with any polarization, solving the problem that arises when waves travel obliquely rather than along coordinate axes.

📌 Key points (3–5)

The coordinate-system problem: the same physical wave can be described by different mathematical expressions depending on how you orient your coordinate system, which complicates analysis.
Ray-fixed coordinates solution: a representation that depends only on the wave's own characteristics (direction of propagation and polarization), not on arbitrary coordinate choices.
Common confusion: changing the coordinate system makes the wave appear to change direction or polarization, but the physical wave itself remains identical—only the description changes.
Why it matters: oblique incidence problems (e.g., waves hitting surfaces at angles) become much easier to analyze when you can express everything in one consistent coordinate system.
Conversion procedure: ray-fixed representations can be converted back to standard Cartesian coordinates by decomposing the wave vector and polarization into components.

🔄 The coordinate system dependency problem

🔄 Same wave, different descriptions

The excerpt demonstrates that a single physical wave can be written in multiple ways depending on coordinate system orientation:

Original: electric field points in +x direction, wave travels in +z direction
After rotating x-axis to y-axis position: same wave now appears to point in -y direction
After rotating z-axis by 180°: same wave now appears to travel in opposite direction with +y polarization

Key insight: These are all the same physical wave—only the mathematical description changes with coordinate rotation.

🎯 When this becomes a real problem

The excerpt identifies a practical limitation:

When a wave propagates obliquely (not along a coordinate axis), it becomes impossible to select a single coordinate system orientation where the direction of propagation, reference polarization, and surface normal can all be described with one basis vector each.

Example: A wave hitting a surface at an angle requires describing three different directions simultaneously, which becomes awkward in standard Cartesian coordinates.

📐 Ray-fixed coordinates

📐 The core representation

Ray-fixed coordinates: a unique set of coordinates determined from the characteristics of the wave itself, rather than being determined arbitrarily and separately from the wave.

The ray-fixed expression is:

Electric field at position r equals unit polarization vector times amplitude times exponential of negative j times wave vector dot position
Where wave vector k equals unit direction vector times phase constant beta

Why "ray-fixed": The coordinate system is tied to the wave's own properties, not to an external reference frame.

🔑 Key components

Component	Symbol	Meaning
Position	r	The point where field is evaluated
Reference polarization	unit vector e	Direction the field points (when phase is between -π/2 and +π/2)
Wave vector	k	Direction of propagation times beta
Phase constant	β	Related to wavelength in the medium

✨ Uniqueness property

The excerpt emphasizes:

In ray-fixed coordinates, a wave can be represented by one—and only one—expression, which is the same expression regardless of the orientation of the "global" coordinate system.

This eliminates the ambiguity shown in the earlier examples where rotating coordinates changed the mathematical form.

🧩 Minimal basis requirement

Only two basis directions must be defined:

The propagation direction (unit vector k-hat)
The polarization direction (unit vector e-hat)

If a third direction is needed, use the cross product of these two (either k-hat cross e-hat or e-hat cross k-hat). The excerpt notes that one choice corresponds to the magnetic field polarization.

🔀 Converting ray-fixed to Cartesian coordinates

🔀 The general procedure

The excerpt provides a systematic conversion method:

Decompose the wave vector into Cartesian components:
- k equals k_x times x-hat plus k_y times y-hat plus k_z times z-hat
- Where k_x equals beta times (unit propagation direction dot x-hat)
- Similarly for k_y and k_z
Compute the dot product:
- Wave vector dot position equals k_x times x plus k_y times y plus k_z times z
Rewrite the field expression:
- Electric field becomes polarization vector times amplitude times three separate exponential terms (one for each coordinate)

🔀 Optional polarization decomposition

If needed, the polarization unit vector can also be broken into Cartesian components:

Unit vector e equals (e dot x-hat) times x-hat plus (e dot y-hat) times y-hat plus (e dot z-hat) times z-hat

📝 Worked example interpretation

The excerpt includes an example of a wave propagating away from the z-axis with z-direction polarization:

Polarization: unit vector e equals z-hat (field oscillates vertically)
Propagation direction: unit vector k-hat equals x-hat times cosine phi plus y-hat times sine phi (radially outward in the x-y plane)
Wave vector: k equals beta times (x-hat cosine phi plus y-hat sine phi)
Cartesian form: electric field equals z-hat times amplitude times exponential of negative j beta x cosine phi times exponential of negative j beta y sine phi

The angle phi specifies which radial direction: phi equals zero gives propagation in +x direction, phi equals π/2 gives +y direction, etc.

🎓 Reference polarization concept

🎓 Definition clarification

Reference polarization: the direction in which the electric field points when the real part of (amplitude times exponential phase term) is greater than or equal to zero; i.e., when the phase is between -π/2 and +π/2 radians.

Plain language: As the wave oscillates, the field vector flips back and forth. The reference polarization is the "positive" direction—the direction it points during the "positive half" of the oscillation cycle.

🎓 Coordinate-system independence

The excerpt stresses that reference polarization is a physical property of the wave:

When you rotate coordinates, the reference polarization appears to change mathematically
But the physical direction in space remains exactly the same
Only the coordinate system used to describe that direction has changed

Don't confuse: A change in the mathematical expression due to coordinate rotation versus an actual physical change in the wave itself.

Decomposition of a Wave into TE and TM Components

5.5 Decomposition of a Wave into TE and TM Components

🧭 Overview

🧠 One-sentence thesis

Decomposing an obliquely incident plane wave into transverse electric (TE) and transverse magnetic (TM) components simplifies the analysis of wave reflection and transmission at planar boundaries by separating the wave into two polarizations that can be analyzed independently and then summed.

📌 Key points (3–5)

Why decomposition is needed: For oblique incidence (non-perpendicular arrival), field vector directions change upon reflection/transmission, making direct analysis difficult; TE-TM decomposition simplifies this.
What TE and TM mean: TE has the electric field perpendicular to the plane of incidence; TM has the magnetic field perpendicular (equivalently, electric field parallel) to the plane of incidence.
How to decompose: Express the incident wave's polarization in a ray-fixed coordinate system using two basis vectors—one perpendicular and one parallel to the plane of incidence.
Common confusion: TE-TM decomposition is undefined for normal incidence (TEM case), where both electric and magnetic fields are already perpendicular to the boundary and no unique plane of incidence exists.
Key advantage: Analyze TE and TM components separately, then sum the results to obtain the behavior of the combined wave.

📐 The plane of incidence and ray-fixed coordinates

📐 Defining the plane of incidence

The plane of incidence is the plane in which both the normal to the surface (n-hat) and the direction of propagation (k-hat-i) lie.

This plane serves as the reference for defining "transverse" (perpendicular) and "parallel" directions.
The unit normal n-hat points into the region from which the wave is incident.
The unit vector k-hat-i indicates the direction the incident wave propagates.
Example: If the boundary is the x-y plane and the wave arrives from above at an angle, the plane of incidence might be the x-z plane containing both the propagation direction and the surface normal.

🧭 Ray-fixed coordinate system

The excerpt constructs a coordinate system attached to the incident ray using two basis vectors:

e-hat-perpendicular: perpendicular to the plane of incidence, defined as:
- e-hat-perpendicular = (k-hat-i cross n-hat) divided by the magnitude of (k-hat-i cross n-hat)
- This is perpendicular to both k-hat-i and n-hat.
e-hat-i-parallel: parallel to the plane of incidence, defined as:
- e-hat-i-parallel = e-hat-perpendicular cross k-hat-i
- This is perpendicular to both e-hat-perpendicular and k-hat-i, and lies in the plane of incidence.

These two basis vectors, together with k-hat-i, form a complete coordinate system for describing the incident wave.
The ray-fixed system simplifies expressing the wave's polarization relative to the plane of incidence.

🔀 Decomposing the electric field

🔀 Expressing polarization in ray-fixed coordinates

The incident electric field is initially written as:

E-tilde-i = e-hat-i times E-i-0 times exponential of (minus j times k-i dot r)

Where e-hat-i is the reference polarization unit vector. To decompose, express e-hat-i in the ray-fixed system:

e-hat-i = (e-hat-i dot e-hat-perpendicular) times e-hat-perpendicular + (e-hat-i dot e-hat-i-parallel) times e-hat-i-parallel + (e-hat-i dot k-hat-i) times k-hat-i

Since the electric field is always perpendicular to the propagation direction, e-hat-i dot k-hat-i = 0, leaving:

e-hat-i = (e-hat-i dot e-hat-perpendicular) times e-hat-perpendicular + (e-hat-i dot e-hat-i-parallel) times e-hat-i-parallel

⚡ TE and TM components defined

Substituting the decomposed polarization into the electric field expression yields:

E-tilde-i = e-hat-perpendicular times E-i-TE times exponential of (minus j times k-i dot r) + e-hat-i-parallel times E-i-TM times exponential of (minus j times k-i dot r)

Where:

E-i-TE = E-i-0 times (e-hat-i dot e-hat-perpendicular)
E-i-TM = E-i-0 times (e-hat-i dot e-hat-i-parallel)

First term (TE component):

The TE component is the component for which E-tilde-i is perpendicular to the plane of incidence.

"TE" stands for "transverse electric."
The electric field vector is perpendicular to the plane of incidence.

Second term (TM component):

The TM component is the component for which H-tilde-i is perpendicular to the plane of incidence; i.e., the component for which E-tilde-i is parallel to the plane of incidence.

"TM" stands for "transverse magnetic."
The magnetic field (not electric) is perpendicular to the plane of incidence.
Since the magnetic field is perpendicular to both the electric field and the propagation direction, having the magnetic field perpendicular to the plane of incidence means the electric field is parallel to it.

🧮 Superposition principle

The total wave is the sum of its TE and TM components.
You can analyze TE and TM separately, then add the results to get the combined wave behavior.
This is the key simplification: analyzing two special polarizations is easier than analyzing an arbitrary polarization directly.

🏷️ Terminology and special cases

🏷️ Alternative names

The excerpt notes that TE and TM are commonly but not universally used:

Term	TE alternative	TM alternative
Descriptive name	Perpendicular polarization	Parallel polarization
Subscript notation	⊥ or "s" (from German "senkrecht" = perpendicular)	‖ or "p"

🔄 Component vs mode

For a single plane wave, "TE component" and "TE mode" are synonymous (same for TM).
Don't confuse: In more complex scenarios (e.g., inside a waveguide), multiple unique TE modes may collectively form the TE component, and similarly for TM.
The excerpt emphasizes this distinction is important in general, though not for the single plane wave case being discussed.

🎯 Normal incidence (TEM case)

When the wave is normally incident (k-hat-i = minus n-hat):

The formula for e-hat-perpendicular becomes zero (the cross product of parallel vectors is zero).
The TE-TM decomposition is undefined because there is no unique plane of incidence.
Both electric and magnetic fields are already perpendicular to the boundary.

A wave which is normally-incident on a planar surface is said to be transverse electromagnetic (TEM) with respect to that boundary. There is no unique TE-TM decomposition in this case.

This is not a problem: the normal incidence case is handled separately (as in Section 5.1 of the text).
Example: A wave traveling straight down onto a horizontal surface has no preferred "plane of incidence"—any vertical plane containing the propagation direction would work equally well.

🎯 Why TE-TM decomposition matters

🎯 Simplifying oblique incidence analysis

The excerpt states the core motivation at the beginning:

Normal incidence (perpendicular arrival): Field vector directions of reflected and transmitted waves are the same as the incident wave (possibly with a sign change), making analysis straightforward.
Oblique incidence (non-perpendicular arrival): Field vector directions generally differ between incident, reflected, and transmitted waves, adding complexity.
Solution: Decompose the incident wave into TE and TM components, each with a specific polarization relative to the plane of incidence.

🔧 Practical workflow

Identify the plane of incidence (contains both propagation direction and surface normal).
Construct the ray-fixed coordinate system (e-hat-perpendicular and e-hat-i-parallel).
Project the incident wave's polarization onto these two basis vectors to find TE and TM amplitudes.
Analyze reflection and transmission for TE and TM separately (each is relatively simple).
Sum the TE and TM results to obtain the total reflected and transmitted fields.

The excerpt emphasizes that analyzing TE and TM cases is "relatively simple," whereas direct analysis of arbitrarily-polarized waves is "relatively difficult."
This decomposition is a standard technique for a "broad range of problems in electromagnetics" involving scattering at planar boundaries.

Plane Waves at Oblique Incidence on a Planar Boundary: TE Case

5.6 Plane Waves at Oblique Incidence on a Planar Boundary: TE Case

🧭 Overview

🧠 One-sentence thesis

When a TE-polarized plane wave strikes a planar boundary at an oblique angle, the reflected and transmitted waves maintain TE polarization, with reflection and transmission coefficients determined by the wave impedances and angles of incidence and transmission.

📌 Key points (3–5)

TE polarization maintained: The electric field remains perpendicular to the plane of incidence for incident, reflected, and transmitted waves because nothing in the problem causes polarization change.
Phase matching requirement: For boundary conditions to hold everywhere on the boundary, the phases of incident, reflected, and transmitted waves must match at every point along the boundary.
Reflection coefficient formula: The TE reflection coefficient depends on wave impedances (eta 1 and eta 2) and angles (psi i and psi t), reducing to the normal incidence formula when all angles are zero.
Common confusion: The angle of transmission (psi t) is not equal to the angle of incidence; it depends on the ratio of phase constants (beta 1 / beta 2) and can lead to unusual situations when the arcsin argument exceeds one.
Key angles: Angle of reflection equals angle of incidence (psi r = psi i), while angle of transmission follows a more complex relationship involving the phase constant ratio.

📐 Problem setup and field expressions

📐 Geometry and incident wave

The problem considers:

A planar boundary at z = 0 between two semi-infinite, lossless regions
Wave incident from Region 1
TE (transverse electric) polarization

TE incident electric field: The electric field intensity of the incident wave is given by y-hat times E i TE times exponential of negative j times k i dot r, where r is the position, k i is the wave vector (unit direction vector k-hat i times beta 1), and beta 1 = omega times square root of mu 1 epsilon 1 is the phase propagation constant in Region 1.

📐 Postulated reflected and transmitted fields

Based on symmetry and experience with normal incidence:

Reflected wave: Also TE-polarized (y-hat direction), with unknown amplitude B and wave vector k r = k-hat r times beta 1
Transmitted wave: Also TE-polarized (y-hat direction), with unknown amplitude C and wave vector k t = k-hat t times beta 2, where beta 2 = omega times square root of mu 2 epsilon 2

Why polarization is preserved: The excerpt states "there is nothing present in the problem that could account for a change in polarization." The boundary symmetry ensures reflected and transmitted fields maintain the same polarization as the incident field.

🔗 Boundary conditions and phase matching

🔗 Electric field boundary condition

The tangential component of total electric field must be continuous across the boundary at z = 0.

Total field in Region 1: sum of incident and reflected fields
Total field in Region 2: transmitted field only
All electric field components are already tangent to the boundary (all in y-hat direction)

Applying continuity at every point r 0 on the boundary (where z = 0): E i TE times exponential of negative j k i dot r 0 plus B times exponential of negative j k r dot r 0 equals C times exponential of negative j k t dot r 0

🔗 Phase matching criterion

Phase matching requirement: For the boundary condition equation to be true at every point r 0 on the boundary, it must be true that k i dot r 0 = k r dot r 0 = k t dot r 0.

Why this matters: The excerpt explains "we are requiring the phases of each field in Regions 1 and 2 to be matched at every point along the boundary. Any other choice will result in a violation of boundary conditions at some point along the boundary."

After enforcing phase matching, the boundary condition simplifies to: E i TE + B = C

This is one equation with two unknowns (B and C), so a second equation is needed.

🧲 Magnetic field analysis

🧲 Magnetic field components

Using plane wave relationships (cross product of electric and magnetic fields points in propagation direction):

Incident magnetic field:

Direction: z-hat times sin psi i minus x-hat times cos psi i
Magnitude factor: E i TE / eta 1, where eta 1 = square root of mu 1 / epsilon 1 is wave impedance in Region 1
Unit propagation vector: k-hat i = x-hat times sin psi i plus z-hat times cos psi i

Reflected magnetic field:

Direction: z-hat times sin psi r plus x-hat times cos psi r
Magnitude factor: B / eta 1
Unit propagation vector: k-hat r = x-hat times sin psi r minus z-hat times cos psi r (note the minus sign on z component)

Transmitted magnetic field:

Direction: z-hat times sin psi t minus x-hat times cos psi t
Magnitude factor: C / eta 2, where eta 2 = square root of mu 2 / epsilon 2
Unit propagation vector: k-hat t = x-hat times sin psi t plus z-hat times cos psi t

🧲 Magnetic field boundary condition

Since there is no current on the boundary, the tangential component of total magnetic field must be continuous.

The x-hat component is tangent to the boundary. Applying continuity and using phase matching: negative (cos psi i) times E i TE / eta 1 plus (cos psi r) times B / eta 1 equals negative (cos psi t) times C / eta 2

This provides the second equation needed to solve for B and C.

🔢 Solution for reflection and transmission coefficients

🔢 Solving for amplitude B

Combining the two boundary condition equations:

E i TE + B = C (from electric field)
The magnetic field equation above

Eliminating C from equation 2 using equation 1, then solving for B: B = [(eta 2 times cos psi i minus eta 1 times cos psi t) / (eta 2 times cos psi r plus eta 1 times cos psi t)] times E i TE

🔢 TE reflection coefficient

TE reflection coefficient: Gamma TE = (eta 2 times cos psi i minus eta 1 times cos psi t) / (eta 2 times cos psi i plus eta 1 times cos psi t), where the second form uses psi r = psi i.

Special case check: The excerpt notes "Equation 5.141 becomes the reflection coefficient for normal (TEM) incidence when psi i = psi r = psi t = 0, as expected." This confirms the formula reduces correctly.

🔢 Transmission coefficient

From the electric field boundary condition: C = (1 + Gamma TE) times E i TE

Summary of field solutions:

Reflected electric field: y-hat times Gamma TE times E i TE times exponential of negative j k r dot r
Transmitted electric field: y-hat times (1 + Gamma TE) times E i TE times exponential of negative j k t dot r

📏 Angles of reflection and transmission

📏 Angle relationships

The phase matching condition (Equation 5.122) determines the propagation directions.

Angle of reflection: psi r = psi i

The excerpt calls this "the unsurprising result that angle of reflection equals angle of incidence."

Angle of transmission: psi t = arcsin of [(beta 1 / beta 2) times sin psi i]

📏 Potential issue with transmission angle

Don't confuse: The excerpt warns "there is something fishy about this equation: It seems possible for the argument of arcsin to be greater than one." This oddity is addressed in a later section (5.8), suggesting special physical phenomena (like total internal reflection) can occur.

Simplified reflection coefficient: Using psi r = psi i allows eliminating psi r from the reflection coefficient formula, yielding the form shown above with only psi i and psi t.

💡 Practical example: air-to-glass interface

💡 Problem setup

Example 5.7 considers:

TE plane wave incident from air onto glass
Glass relative permittivity: 2.1
Angle of incidence: 30 degrees
Goal: determine power reflected and transmitted relative to incident power

💡 Calculation steps

Wave impedances:

Air: eta 1 approximately eta 0 approximately 376.7 ohms
Glass: eta 2 approximately eta 0 / square root of 2.1 approximately 260.0 ohms

Phase constant ratio: beta 1 / beta 2 approximately (omega times square root of mu 0 epsilon 0) / (omega times square root of mu 0 times 2.1 epsilon 0) approximately 0.690

Transmission angle: psi t = arcsin of (0.690 times sin 30 degrees) approximately 20.2 degrees

Reflection coefficient: Gamma TE approximately negative 0.2220

💡 Power results

Quantity	Formula	Result
Power reflected (fraction)	magnitude of Gamma TE squared	approximately 0.049 (4.9%)
Power transmitted (fraction)	1 minus magnitude of Gamma TE squared	approximately 0.951 (95.1%)

Interpretation: Most of the power (95.1%) is transmitted into the glass; only a small fraction (4.9%) is reflected back into air for this geometry and material combination.

Plane Waves at Oblique Incidence on a Planar Boundary: TM Case

5.7 Plane Waves at Oblique Incidence on a Planar Boundary: TM Case

🧭 Overview

🧠 One-sentence thesis

The reflection coefficient for a TM-polarized plane wave at an oblique angle depends on both the angle of incidence and the polarization orientation relative to the boundary, yielding different power transmission than the TE case for the same materials and angle.

📌 Key points (3–5)

TM wave setup: The magnetic field is transverse (parallel to the boundary), and the electric field has components both parallel and perpendicular to the boundary.
Reflection coefficient formula: Gamma-TM equals (negative eta-1 times cosine psi-i plus eta-2 times cosine psi-t) divided by (positive eta-1 times cosine psi-i plus eta-2 times cosine psi-t), where eta is wave impedance and psi is angle.
Angle relationships: The angle of reflection equals the angle of incidence; the transmission angle is determined by arcsin of (beta-1 divided by beta-2 times sine of psi-i).
Common confusion: TM versus TE polarization—the fraction of power reflected and transmitted depends on polarization; the same angle and materials give different results for TM and TE waves.
Sign convention matters: Choosing negative y-hat for the reflected magnetic field ensures the reflection coefficient converges to the normal-incidence result when the angle is zero.

📐 Problem geometry and field definitions

📐 Boundary and regions

The boundary lies at the z = 0 plane, separating two semi-infinite lossless regions.
Region 1 is where the wave originates; Region 2 is the transmission region.
The wave propagates obliquely, meaning it approaches the boundary at an angle (not perpendicular).

🧲 Incident TM wave

The incident magnetic field intensity is H-tilde-i-TM(r) = y-hat times H-i-TM times exponential of (negative j times k-i dot r).

k-i is the wave vector: k-i = k-hat-i times beta-1, where k-hat-i is the unit direction vector and beta-1 = omega times square root of (mu-1 times epsilon-1) is the phase propagation constant in Region 1.
The magnetic field points in the y direction (parallel to the boundary).
The electric field has components in both x and z directions, determined by the plane wave relationship: E-tilde-i = negative eta-1 times (k-hat-i cross H-tilde-i-TM).

🔄 Reflected and transmitted waves

Reflected wave: H-tilde-r(r) = negative y-hat times B times exponential of (negative j times k-r dot r), where B is an unknown constant and k-r = k-hat-r times beta-1.
Transmitted wave: H-tilde-t(r) = y-hat times C times exponential of (negative j times k-t dot r), where C is an unknown constant and k-t = k-hat-t times beta-2.
The choice of negative y-hat for the reflected wave is a sign convention that ensures consistency with the normal-incidence reflection coefficient when the angle is zero.
Don't confuse: The reflected wave uses negative y-hat, not positive y-hat; using the opposite sign would give a reflection coefficient with the wrong sign for normal incidence.

🔗 Boundary conditions and phase matching

🔗 Tangential magnetic field continuity

No impressed current at the boundary means the tangential component of the total magnetic field must be continuous across z = 0.
Total field in Region 1: H-tilde-1(r) = H-tilde-i-TM(r) + H-tilde-r(r).
Total field in Region 2: H-tilde-2(r) = H-tilde-t(r).
All magnetic field components are already tangent to the boundary (they point in the y direction).
At any point r-0 on the boundary (where z = 0, so r-0 = x-hat times x plus y-hat times y), continuity requires: H-i-TM times exponential of (negative j times k-i dot r-0) minus B times exponential of (negative j times k-r dot r-0) equals C times exponential of (negative j times k-t dot r-0).

🌊 Phase matching requirement

For boundary conditions to hold at every point on the boundary, the phases must match: k-i dot r-0 = k-r dot r-0 = k-t dot r-0.

This is the "phase matching" condition—it ensures the exponential factors are equal at every point along the boundary.
Without phase matching, boundary conditions would be violated at some points.
This condition determines the directions of propagation (k-hat-r and k-hat-t) for the reflected and transmitted waves.
After enforcing phase matching, the magnetic field boundary condition simplifies to: H-i-TM minus B equals C.

⚡ Tangential electric field continuity

The electric field components are derived from the magnetic fields using plane wave relationships.
Incident electric field: E-tilde-i(r) = (x-hat times cosine psi-i minus z-hat times sine psi-i) times eta-1 times H-i-TM times exponential of (negative j times k-i dot r).
Reflected electric field: E-tilde-r(r) = (x-hat times cosine psi-r plus z-hat times sine psi-r) times eta-1 times B times exponential of (negative j times k-r dot r).
Transmitted electric field: E-tilde-t(r) = (x-hat times cosine psi-t minus z-hat times sine psi-t) times eta-2 times C times exponential of (negative j times k-t dot r).
The x-component is tangent to the boundary; continuity requires: (cosine psi-i) times eta-1 times H-i-TM plus (cosine psi-r) times eta-1 times B equals (cosine psi-t) times eta-2 times C.
After applying phase matching, this becomes the second equation needed to solve for B and C.

🧮 Reflection coefficient derivation

🧮 Solving for the reflection coefficient

Two equations with two unknowns (B and C):
1. H-i-TM minus B equals C (from magnetic field continuity).
2. (cosine psi-i) times eta-1 times H-i-TM plus (cosine psi-r) times eta-1 times B equals (cosine psi-t) times eta-2 times C (from electric field continuity).
Eliminate C by substituting the first equation into the second.
Solve for B: B = [(negative eta-1 times cosine psi-i plus eta-2 times cosine psi-t) divided by (positive eta-1 times cosine psi-r plus eta-2 times cosine psi-t)] times H-i-TM.
Define the reflection coefficient: B = Gamma-TM times H-i-TM.

📊 Reflection coefficient formula

Gamma-TM = (negative eta-1 times cosine psi-i plus eta-2 times cosine psi-t) divided by (positive eta-1 times cosine psi-r plus eta-2 times cosine psi-t).

eta-1 and eta-2 are the wave impedances in Regions 1 and 2: eta = square root of (mu divided by epsilon).
psi-i, psi-r, and psi-t are the angles of incidence, reflection, and transmission, measured from the normal to the boundary.
Since the angle of reflection equals the angle of incidence (psi-r = psi-i), the formula simplifies to: Gamma-TM = (negative eta-1 times cosine psi-i plus eta-2 times cosine psi-t) divided by (positive eta-1 times cosine psi-i plus eta-2 times cosine psi-t).
This is the electric field reflection coefficient; the reflected electric field amplitude is Gamma-TM times the incident electric field amplitude.

🔢 Transmission coefficient

From the first boundary condition: C = H-i-TM minus B = (1 minus Gamma-TM) times H-i-TM.
Converting to electric field: C = (1 minus Gamma-TM) times E-i-TM divided by eta-1.
The transmitted electric field is: E-tilde-t(r) = (x-hat times cosine psi-t minus z-hat times sine psi-t) times (1 minus Gamma-TM) times (eta-2 divided by eta-1) times E-i-TM times exponential of (negative j times k-t dot r).

📐 Angles of reflection and transmission

📐 Angle of reflection

The angle of reflection equals the angle of incidence: psi-r = psi-i.

This is derived from the phase matching condition k-i dot r-0 = k-r dot r-0.
It is the familiar "law of reflection."
Example: If a wave hits the boundary at 30 degrees from the normal, it reflects at 30 degrees on the opposite side of the normal.

🌈 Angle of transmission (refraction)

The transmission angle is given by: psi-t = arcsin of [(beta-1 divided by beta-2) times sine of psi-i].

beta-1 and beta-2 are the phase propagation constants: beta = omega times square root of (mu times epsilon).
This is Snell's law of refraction.
The ratio beta-1 divided by beta-2 equals the square root of [(mu-1 times epsilon-1) divided by (mu-2 times epsilon-2)]; for non-magnetic materials (mu-1 = mu-2 = mu-0), this simplifies to the square root of (epsilon-1 divided by epsilon-2), which is the ratio of refractive indices.
Potential issue: The argument of arcsin can exceed one, which would make the angle undefined; this situation is addressed in a later section (not fully covered in this excerpt).

🔍 Example calculation

Scenario: TM wave incident from air (eta-1 approximately 376.7 ohms) onto glass (relative permittivity 2.1, so eta-2 approximately 260.0 ohms) at psi-i = 30 degrees.
Step 1: Calculate beta-1 divided by beta-2 approximately 0.690.
Step 2: psi-t = arcsin(0.690 times sine 30 degrees) approximately 20.2 degrees.
Step 3: Substitute into Gamma-TM formula: Gamma-TM approximately negative 0.1442.
Power reflection: magnitude-squared of Gamma-TM approximately 0.021, or about 2.1% of incident power is reflected.
Power transmission: 1 minus magnitude-squared of Gamma-TM approximately 0.979, or about 97.9% of incident power is transmitted.
Don't confuse: This result (2.1% reflected for TM) differs from the TE case (4.9% reflected for the same angle and materials); polarization matters.

🔄 Polarization dependence and sign convention

🔄 Polarization matters

Polarization	Reflection at 30° (air to glass)	Transmission at 30°
TE	~4.9%	~95.1%
TM	~2.1%	~97.9%

The fraction of power reflected and transmitted depends on the polarization of the incident wave relative to the boundary, as well as the angle of incidence.
For the same materials and angle, TE and TM waves give different reflection coefficients.
Any plane wave can be decomposed into TE and TM components, so understanding both cases is essential.

➖ Sign convention for reflected field

The reflected magnetic field is defined with negative y-hat (not positive y-hat).
Why this choice? When psi-i = psi-t = 0 (normal incidence), the TM reflection coefficient Gamma-TM reduces to the same formula as the normal-incidence reflection coefficient.
If positive y-hat were chosen, the reflection coefficient would have the opposite sign for normal incidence, which would be awkward and inconsistent.
Don't confuse: Some references use the opposite sign convention; be aware of which convention is in use when comparing results.
The choice of sign convention affects the form of the reflection coefficient but does not change the physical result (the magnitude is the same).

🌊 Phase matching as a universal condition

🌊 Phase matching across TE and TM

The phase matching requirement k-i dot r-0 = k-r dot r-0 = k-t dot r-0 emerges independently in both TE and TM cases.
It enforces continuity of the phase of electric and magnetic fields across the boundary.
Because any plane wave can be decomposed into TE and TM components, the phase matching condition is universal.
This condition determines the angles of reflection and transmission, which are the same for TE and TM waves (only the reflection and transmission coefficients differ).

Angles of Reflection and Refraction

5.8 Angles of Reflection and Refraction

🧭 Overview

🧠 One-sentence thesis

The angles at which waves reflect and refract at a boundary between two materials are governed by a phase-matching requirement that yields the law of reflection (angle of reflection equals angle of incidence) and Snell's law (which relates the angle of transmission to the material properties and angle of incidence).

📌 Key points (3–5)

Phase matching requirement: continuity of phase across the boundary forces a relationship among the incident, reflected, and transmitted wave vectors, independent of polarization.
Law of reflection: the angle of reflection equals the angle of incidence.
Snell's law (law of refraction): relates the angle of transmission to the material properties (permittivity and permeability) and the angle of incidence.
Common confusion: direction of refraction—when a wave enters a higher-permittivity medium, it bends toward the normal; when entering a lower-permittivity medium, it bends away from the normal.
Total internal reflection threshold: when the argument of the arcsine exceeds 1, the transmitted angle becomes undefined, leading to total internal reflection.

🔗 Phase matching and the boundary condition

🔗 The phase matching requirement

Phase matching requirement: the dot product of each wave vector (incident, reflected, transmitted) with any position vector on the boundary must be equal.

Mathematically: k_i · r_0 = k_r · r_0 = k_t · r_0 for any point r_0 on the boundary.
This enforces continuity of the phase of electric and magnetic fields across the boundary.
The requirement emerges independently in both TE and TM cases, so it applies to any incident plane wave regardless of polarization.

🧩 Why phase matching matters

Any plane wave can be decomposed into TE and TM components.
Since the same phase-matching condition appears in both cases, it must hold universally.
This single requirement is the key to deriving both the angle of reflection and the angle of transmission.

📐 Law of reflection

📐 Angle of reflection equals angle of incidence

From the phase-matching requirement and the geometry of the wave vectors, one finds: ψ_r = ψ_i.
Both angles are measured from the surface normal and are limited to the range from negative π/2 to positive π/2.
This result is independent of material properties and frequency.

Example: A light beam hits a mirror at 60° from the normal; it reflects at 60° on the opposite side of the normal.

🌈 Snell's law (law of refraction)

🌈 The general form of Snell's law

Snell's law (law of refraction): √(μ_r1 ε_r1) sin ψ_i = √(μ_r2 ε_r2) sin ψ_t, where μ_r and ε_r are the relative permeability and permittivity of each region.

Refraction is transmission with a change in the direction of propagation.
The angle of transmission ψ_t is given by: ψ_t = arcsin[(√(μ_r1 ε_r1) / √(μ_r2 ε_r2)) sin ψ_i].
The transmitted angle does not depend on frequency, except to the extent that the material properties might vary with frequency.

🔬 Special case: non-magnetic media

For non-magnetic media, assume μ_r1 = μ_r2 = 1.
Snell's law simplifies to: √(ε_r1) sin ψ_i = √(ε_r2) sin ψ_t.
In optics, permittivities are often expressed as indices of refraction: n_1 = √(ε_r1) and n_2 = √(ε_r2).
Optical form: n_1 sin ψ_i = n_2 sin ψ_t.
Explicit formula for non-magnetic media: ψ_t = arcsin[(√(ε_r1) / √(ε_r2)) sin ψ_i].

🎯 Direction of refraction: toward or away from the normal

Condition	Direction of refraction	Explanation
ε_r2 > ε_r1 (entering higher permittivity)	ψ_t < ψ_i	Transmitted wave bends toward the surface normal
ε_r2 < ε_r1 (entering lower permittivity)	ψ_t > ψ_i	Transmitted wave bends away from the surface normal

Example (air to glass): A light beam at 60° from the normal in air enters glass with relative permittivity 2.28; the transmitted angle is observed to be 35°, which is closer to the normal (toward the normal).

Example (glass to air): A wave traveling from glass to air refracts away from the normal, making underwater objects appear displaced to an observer above water.

Don't confuse: The direction of bending depends on whether you are entering a denser (higher permittivity) or less dense (lower permittivity) medium.

⚠️ Total internal reflection

⚠️ When the arcsine becomes undefined

When ε_r2 < ε_r1 (lower permittivity in the second medium), the ratio √(ε_r1 / ε_r2) sin ψ_i can exceed 1 for sufficiently large angles of incidence.
Since the sine function only yields values between -1 and +1, the arcsine function becomes undefined.
This condition leads to the phenomenon of total internal reflection.

⚠️ The threshold condition

When ε_r2 < ε_r1, the transmitted angle ψ_t can reach π/2 radians (90°), corresponding to propagation parallel to the boundary.
Beyond this threshold angle of incidence, the transmitted wave cannot propagate into the second medium, and all power is reflected back into the first medium.
The excerpt notes that unique physical considerations apply in this regime, addressed in a later section.

🔺 Application: prisms

🔺 How prisms work

A prism is a waveguiding device that uses refraction to change the direction of waves (commonly light, but also applicable to radio waves).
The triangular shape causes the wave to refract at each surface, bending the propagation direction.

🌈 Color separation by prisms

Prisms can separate white light into constituent colors (frequencies).
This separation occurs because the permittivity of the prism material is a function of frequency.
Since the angle of refraction depends on permittivity, each frequency refracts by a different amount.
Conversely, if the prism material's permittivity exhibits negligible variation with frequency, all colors refract by the same amount and no separation occurs.

Example: A glass prism splits white light into a rainbow because the refractive index of glass varies slightly with the wavelength of light.

TE Reflection in Non-Magnetic Media

5.9 TE Reflection in Non-Magnetic Media

🧭 Overview

🧠 One-sentence thesis

For TE waves in non-magnetic materials, the reflection coefficient can be expressed entirely in terms of the incident angle and the relative permittivities, and when the wave travels from lower to higher permittivity, the coefficient is always real-valued, negative, and approaches −1 at grazing incidence.

📌 Key points (3–5)

Non-magnetic simplification: when both materials have permeability equal to free space, the reflection coefficient depends only on relative permittivities and the incident angle.
Snell's law reduces: in non-magnetic media, the ratio of propagation constants simplifies to the square root of the ratio of relative permittivities.
Two regimes: when ε_r1 < ε_r2 (e.g., air to glass), Γ_TE is real-valued and negative; when ε_r1 > ε_r2 (e.g., glass to air), Γ_TE can become complex-valued, leading to total internal reflection.
Common confusion: the reflection coefficient depends on the direction of travel—traveling from lower to higher permittivity behaves differently from the reverse.
Grazing incidence behavior: as the incident angle approaches 90 degrees, Γ_TE approaches −1, and larger permittivity ratios make the boundary behave more like a perfect conductor.

🔧 Simplifications for non-magnetic materials

🧲 What "non-magnetic" means

Non-magnetic materials: materials with permeability not significantly different from the permeability of free space (μ₀).

The excerpt assumes μ₁ = μ₂ = μ₀ for both regions.
This assumption is practical because many real materials (glass, air, plastics) have permeability close to free space.

📐 Snell's law simplification

The general form of Snell's law relates the sine of the transmission angle to the sine of the incident angle via the ratio of propagation constants (β₁/β₂).
In non-magnetic media, the ratio β₁/β₂ simplifies:
- β₁/β₂ = (ω√(μ₁ε₁)) / (ω√(μ₂ε₂)) = √(ε₁/ε₂)
- Since permittivity ε = ε₀ × ε_r, this further reduces to √(ε_r1/ε_r2).
Result: sin(ψ_t) = √(ε_r1/ε_r2) × sin(ψ_i).

🔌 Wave impedance simplification

Wave impedance η = √(μ/ε).
In non-magnetic media:
- η₁ = √(μ₀/(ε_r1 ε₀)) = η₀/√(ε_r1)
- η₂ = √(μ₀/(ε_r2 ε₀)) = η₀/√(ε_r2)
- where η₀ is the wave impedance in free space.
This allows the reflection coefficient formula to be rewritten in terms of relative permittivities only.

🧮 Deriving the simplified reflection coefficient

🧮 Starting formula

The general TE reflection coefficient is:
- Γ_TE = (η₂ cos(ψ_i) − η₁ cos(ψ_t)) / (η₂ cos(ψ_i) + η₁ cos(ψ_t))
The goal is to express this entirely in terms of ψ_i and the permittivity ratio, eliminating the need to calculate ψ_t separately.

🔄 Substitution steps

Replace η₁ and η₂ with their non-magnetic forms: η₀/√(ε_r1) and η₀/√(ε_r2).
Multiply numerator and denominator by √(ε_r2)/η₀ to simplify.
Use the identity cos(ψ) = √(1 − sin²(ψ)) to express cos(ψ_t) in terms of sin(ψ_t).
Substitute Snell's law: sin(ψ_t) = √(ε_r1/ε_r2) × sin(ψ_i).
Result: cos(ψ_t) = √(ε_r2/ε_r1 − sin²(ψ_i)).

📝 Final simplified form

The reflection coefficient becomes:
- Γ_TE = (cos(ψ_i) − √(ε_r2/ε_r1 − sin²(ψ_i))) / (cos(ψ_i) + √(ε_r2/ε_r1 − sin²(ψ_i)))
Advantage: depends only on ψ_i and the ratio ε_r2/ε_r1; no need to compute ψ_t first.

🔍 Behavior for different material combinations

⚖️ Same media on both sides

When ε_r1 = ε_r2 (same material on both sides), Γ_TE = 0.
This is expected: no boundary means no reflection.

⬆️ Lower to higher permittivity (ε_r1 < ε_r2)

Example: wave traveling from air toward glass.
The term (ε_r2/ε_r1 − sin²(ψ_i)) is always positive because ε_r2/ε_r1 > 1 and sin²(ψ_i) ≤ 1.
Therefore, Γ_TE is always real-valued.
Γ_TE is negative for all angles of incidence.
As ψ_i approaches 90° (grazing incidence), Γ_TE approaches −1.
At any fixed angle, Γ_TE trends toward −1 as the ratio ε_r2/ε_r1 increases.

⬇️ Higher to lower permittivity (ε_r1 > ε_r2)

Example: wave traveling from glass toward air.
The term (ε_r2/ε_r1 − sin²(ψ_i)) can become negative when sin²(ψ_i) > ε_r2/ε_r1.
When this happens, Γ_TE becomes complex-valued.
This leads to total internal reflection, which the excerpt notes is addressed in another section.
Don't confuse: the direction of travel matters—air-to-glass and glass-to-air behave very differently.

📊 Observations from the reflection coefficient plot

📈 General trends

Figure 5.20 shows Γ_TE plotted for various permittivity ratios over all incident angles from 0° (normal incidence) to 90° (grazing incidence).
The curves are labeled with the ratio ε_r2/ε_r1.

📉 Key observations for ε_r1 < ε_r2

Observation	Description
Sign	Γ_TE is always negative
Real-valued	No complex component; no total internal reflection
Grazing limit	Γ_TE decreases to −1 as ψ_i approaches 90°
Ratio dependence	Larger ε_r2/ε_r1 makes Γ_TE more negative at any given angle

🔌 Analogy to perfect conductor

As ε_r2/ε_r1 → ∞, the behavior becomes increasingly similar to having a perfect conductor in Region 2.
This makes intuitive sense: a very high permittivity acts like a "wall" to the wave, reflecting it strongly.
Example: if Region 1 is air (ε_r1 ≈ 1) and Region 2 has ε_r2 = 100, the reflection is much stronger than if ε_r2 = 2.

TM Reflection in Non-Magnetic Media

5.10 TM Reflection in Non-Magnetic Media

🧭 Overview

🧠 One-sentence thesis

In non-magnetic media, the TM reflection coefficient can be reduced to depend only on relative permittivities and angle of incidence, and at Brewster's angle the TM component is completely transmitted with zero reflection, enabling polarization filtering.

📌 Key points (3–5)

Non-magnetic simplification: when permeabilities equal μ₀, the reflection coefficient depends only on relative permittivities (εᵣ₁ and εᵣ₂) and angle of incidence.
Brewster's angle (polarizing angle): the specific angle of incidence at which the TM reflection coefficient equals zero, so only TE components are reflected.
Behavior when εᵣ₁ < εᵣ₂: the TM reflection coefficient is real-valued, starts negative at normal incidence, crosses zero at Brewster's angle, and approaches +1 at grazing incidence.
Common confusion: TM vs TE behavior—TE reflection is always negative when εᵣ₁ < εᵣ₂, but TM reflection changes sign from negative to positive as angle increases.
Practical application: Brewster's angle can isolate TE and TM components by suppressing the TM reflection, regardless of the incident wave's polarization mix.

🔬 Simplifying the reflection coefficient for non-magnetic media

🔬 Starting point and assumptions

The general TM reflection coefficient is:

Γ_TM = (−η₁ cos ψᵢ + η₂ cos ψₜ) / (+η₁ cos ψᵢ + η₂ cos ψₜ)

where ψᵢ and ψₜ are angles of incidence and transmission, and η₁ and η₂ are wave impedances.

Non-magnetic assumption: many practical materials have permeability not significantly different from free space permeability μ₀, so μ₁ = μ₂ = μ₀.

🧮 Deriving Snell's law for non-magnetic media

When permeabilities are equal to μ₀, the ratio β₁/β₂ simplifies:

β₁/β₂ = √(ε₁/ε₂) = √(εᵣ₁/εᵣ₂)
Snell's law becomes: sin ψₜ = √(εᵣ₁/εᵣ₂) sin ψᵢ

This eliminates permeability from the equation entirely.

🧮 Expressing wave impedances

In non-magnetic media:

η₁ = η₀ / √εᵣ₁
η₂ = η₀ / √εᵣ₂

where η₀ is the wave impedance in free space.

🧮 Final simplified form

After substitutions and algebraic manipulation, the reflection coefficient becomes:

Γ_TM = (−(εᵣ₂/εᵣ₁) cos ψᵢ + √(εᵣ₂/εᵣ₁ − sin² ψᵢ)) / (+(εᵣ₂/εᵣ₁) cos ψᵢ + √(εᵣ₂/εᵣ₁ − sin² ψᵢ))

Key advantage: this expression depends only on ψᵢ (angle of incidence) and the ratio εᵣ₂/εᵣ₁; no need to first calculate ψₜ.

📊 Behavior patterns for different media combinations

📊 Same media (εᵣ₁ = εᵣ₂)

When both sides have the same permittivity, Γ_TM = 0 as expected (no boundary, no reflection).

📊 Wave traveling toward denser medium (εᵣ₁ < εᵣ₂)

Example: air toward glass.

Property	Behavior
Real vs complex	Always real-valued (√(εᵣ₂/εᵣ₁ − sin² ψᵢ) is always positive)
At normal incidence (ψᵢ = 0)	Negative value
As ψᵢ increases	Increases from negative, crosses zero, becomes positive
At grazing incidence (ψᵢ → π/2)	Approaches +1
As εᵣ₂/εᵣ₁ → ∞	Approaches −1 at any given angle (similar to perfect conductor)

Don't confuse with TE: TE reflection coefficient is always negative when εᵣ₁ < εᵣ₂, but TM changes sign.

📊 Wave traveling toward less dense medium (εᵣ₁ > εᵣ₂)

Example: glass toward air.

The term εᵣ₂/εᵣ₁ − sin² ψᵢ can become negative.
When negative, Γ_TM becomes complex-valued.
This leads to total internal reflection (addressed in another section).

🎯 Brewster's angle (polarizing angle)

🎯 Definition and significance

Brewster's angle ψᵢB: the angle of incidence at which Γ_TM = 0.

Also called the polarizing angle because at this angle:

The TM component has zero reflection (Γ_TM = 0).
The TE component still reflects normally.
The reflected wave becomes purely TE, regardless of the incident wave's polarization mix.

🎯 Physical mechanism

When a wave with both TE and TM components is incident at Brewster's angle:

The TE component scatters into reflected and transmitted TE waves.
The TM component transmits completely (no reflection).
The total reflected wave = TE only.

Practical use: this principle can isolate or suppress the TM component of a wave.

🎯 Deriving the formula

Γ_TM = 0 when the numerator equals zero. Let R = εᵣ₂/εᵣ₁:

Start: −R cos ψᵢB + √(R − sin² ψᵢB) = 0
Square both sides: R² cos² ψᵢB = R − sin² ψᵢB
Use cos² = 1 − sin²: R² (1 − sin² ψᵢB) = R − sin² ψᵢB
Rearrange: (1 − R²) sin² ψᵢB = R − R²
Solve: sin ψᵢB = √((R − R²)/(1 − R²))

Simpler form (using a right-triangle interpretation):

tan ψᵢB = √(εᵣ₂/εᵣ₁)

This is the most practical formula for calculating Brewster's angle.

🎯 Example calculation

Scenario: plane wave incident from air (εᵣ₁ = 1) onto glass (εᵣ₂ = 2.1), containing both TE and TM components. At what angle will the reflected wave be purely TE?

Solution:

tan ψᵢB = √(2.1/1) ≈ 1.449
ψᵢB ≈ 55.4°

At this angle, Γ_TM = 0, so the reflected wave contains no TM component and must be purely TE.

🔄 Comparison with TE and perfect conductors

🔄 Similarity to TE at extreme permittivity ratios

As εᵣ₂/εᵣ₁ → ∞:

Both TE and TM reflection coefficients trend toward −1 at any given angle.
The behavior becomes increasingly similar to reflection from a perfect conductor in Region 2.

🔄 Key difference from TE

Aspect	TE component (εᵣ₁ < εᵣ₂)	TM component (εᵣ₁ < εᵣ₂)
Sign at normal incidence	Negative	Negative
Sign at grazing incidence	Negative (approaches −1)	Positive (approaches +1)
Sign change?	No, always negative	Yes, crosses zero at Brewster's angle
Zero reflection?	Never	Yes, at ψᵢB

Don't confuse: the TE component never has zero reflection when εᵣ₁ < εᵣ₂, but the TM component does at Brewster's angle.

Total Internal Reflection

5.11 Total Internal Reflection

🧭 Overview

🧠 One-sentence thesis

Total internal reflection occurs when a wave incident from a denser medium exceeds a critical angle, causing complete reflection with zero power transmission into the second medium, and this phenomenon enables practical applications such as fiber optics.

📌 Key points (3–5)

What it is: complete reflection at a boundary between two media (neither being a perfect conductor) with no power transmitted into the second region.
When it happens: when the angle of incidence exceeds the critical angle, defined by the ratio of the media's electromagnetic properties.
Key threshold: the critical angle is the boundary condition where the transmitted wave would propagate parallel to the interface; beyond this angle, normal transmission theory breaks down.
Common confusion: although all power reflects, fields still exist on the opposite side of the boundary (evanescent waves) to satisfy boundary conditions—these fields carry no power.
Practical importance: total internal reflection is the enabling principle of fiber optics.

🔬 Physical mechanism and conditions

🔬 How waves bend at boundaries

When a wave crosses from one medium to another, Snell's law governs the relationship between incident angle and transmitted angle.
The excerpt states: angle of reflection equals angle of incidence.
When the product of relative permeability and permittivity in medium 1 exceeds that in medium 2, the transmitted wave bends away from the surface normal.
The transmitted angle can reach 90 degrees (parallel to the boundary) even when the incident angle is less than 90 degrees.

⚠️ What happens beyond the threshold

If the incident angle increases further, calculating the transmitted angle using Snell's law yields an arcsine argument greater than 1.
Since sine values range only between -1 and +1, the arcsine function becomes undefined.
This signals that the existing theory is inadequate and a new phenomenon occurs.

🎯 The critical angle

🎯 Definition and formula

Critical angle: the threshold angle of incidence at which the transmitted wave would propagate parallel to the boundary.

The critical angle is found by setting the transmitted angle to 90 degrees (π/2).
For non-magnetic materials, the formula simplifies to: critical angle = arcsine of the square root of (permittivity of region 2 divided by permittivity of region 1).
This formula requires that region 1 has higher permittivity than region 2 for total internal reflection to be possible.

📐 Behavior in three regimes

Condition	Transmitted angle	What happens
Incident angle < critical angle	Normal refraction	Wave transmits into region 2 with power
Incident angle = critical angle	90° (parallel to boundary)	Transition point
Incident angle > critical angle	Undefined (complex-valued)	Total internal reflection occurs

💫 Reflection coefficient behavior

💫 Above the critical angle

When the incident angle exceeds the critical angle, the reflection coefficient becomes complex-valued.
The excerpt shows that the square root term in the reflection coefficient formula becomes the square root of a negative number, yielding an imaginary result.
The reflection coefficient can be written as (A - jB) / (A + jB), where A and B are positive real numbers.
The numerator and denominator have equal magnitude, so the magnitude of the reflection coefficient equals 1.

🔄 Complete reflection

A reflection coefficient magnitude of 1 means all power is reflected.
No power is transmitted into the second medium.
This holds for both TE (transverse electric) and TM (transverse magnetic) polarization components.
The reflection coefficient has a phase that depends on the incident angle and material properties.

🌊 The Goos-Hänchen effect

🌊 Phase shift phenomenon

The phase of the reflection coefficient is zero when the incident angle is below the critical angle.
When the incident angle exceeds the critical angle, the phase becomes non-zero and varies with the incident angle.
This phase variation is known as the Goos-Hänchen effect.

📍 Observable displacement

The Goos-Hänchen phase shift appears as a spatial displacement between the point of incidence and the point of reflection.
Example: In a demonstration with laser light in a dielectric rod, the reflected beam appears to emerge from a point displaced from where the incident beam strikes the boundary.
This is a startling visual phenomenon that reveals the non-trivial phase behavior during total internal reflection.

🧪 Practical example

🧪 Glass-air boundary demonstration

A demonstration shows light incident from glass (relative permittivity ≈ 2.28) onto a boundary with air (relative permittivity ≈ 1).
The critical angle for this configuration is approximately 41.5 degrees.
In the demonstration, the incident angle is approximately 50 degrees, which exceeds the critical angle.
Result: total internal reflection is observed—all light reflects back into the glass with none transmitted into the air.

🔦 Why fiber optics work

Total internal reflection is the enabling principle of fiber optics.
Light traveling through an optical fiber (denser medium) strikes the fiber-air boundary at angles exceeding the critical angle.
The light remains trapped inside the fiber, reflecting repeatedly without power loss to the outside.
This allows light signals to propagate over long distances with minimal loss.

🤔 Apparent paradox and resolution

🤔 The boundary condition puzzle

The complete reflection seems to contradict electromagnetic boundary conditions.
Boundary conditions require tangential components of electric and magnetic fields to be continuous across the boundary.
Question: How can these field components be continuous if no power crosses the boundary?

✅ The resolution

There is a field on the opposite side of the boundary, but it carries zero power.
This field exists solely to satisfy the boundary conditions.
The excerpt indicates this is explained through the concept of evanescent waves (covered in the following section).
Don't confuse: the presence of a field does not necessarily mean power transmission—the evanescent field has no energy flow.

Key takeaway: Total internal reflection is not simply "bouncing off" a boundary; it is a threshold phenomenon governed by the critical angle, producing complete reflection with a complex-valued reflection coefficient and giving rise to non-propagating evanescent fields on the transmission side that satisfy boundary conditions without carrying power.

Evanescent Waves

5.12 Evanescent Waves

🧭 Overview

🧠 One-sentence thesis

When total internal reflection occurs, an evanescent wave exists on the transmitted side of the boundary to satisfy electromagnetic boundary conditions, but it carries no power and decays exponentially with distance from the boundary.

📌 Key points (3–5)

The paradox: Total internal reflection produces a complex reflection coefficient with magnitude 1, yet boundary conditions require continuous tangential field components across the boundary.
The solution: A field exists on the opposite side of the boundary—an evanescent wave—that satisfies boundary conditions but transmits zero power.
Mathematical approach: The angle of transmission becomes complex-valued when the angle of incidence exceeds the critical angle.
Common confusion: Unlike uniform plane waves, the evanescent wave is a surface wave that decays exponentially with distance into the second region and does not propagate independently.
Key characteristic: The evanescent field exists only while the incident wave illuminates the boundary; it vanishes instantly when the source is turned off because it contains no energy.

🔍 The boundary condition puzzle

🔍 The apparent contradiction

When total internal reflection occurs (angle of incidence greater than the critical angle):

The reflection coefficient has magnitude equal to 1, meaning all incident power reflects back.
Yet electromagnetic boundary conditions require tangential components of electric and magnetic fields to be continuous across the boundary.
Question: How can fields be continuous if no power crosses the boundary?

💡 The resolution

There must be a field on the opposite side of the boundary, but—somehow—it must have zero power.

This field is the evanescent wave, which:

Satisfies the mathematical boundary conditions.
Carries no energy or power.
Exists only to enforce continuity requirements.

🧮 Mathematical framework with complex angles

🧮 Complex-valued transmission angle

The excerpt introduces a complex transmission angle ψ_tc to describe the transmitted field:

Below critical angle: ψ_tc equals the real transmission angle ψ_t.
At critical angle: ψ_tc equals π/2 (parallel to boundary).
Above critical angle: ψ_tc = ψ' + jψ'', where ψ' = π/2 and ψ'' is a real positive number.

The excerpt notes that trigonometric identities allow sine and cosine to be computed for complex angles, even though the results may also be complex-valued.

🔢 Trigonometric identities for complex angles

When the angle has an imaginary component jψ'':

cos(jψ'') = cosh(ψ''), which is real-valued.
sin(jψ'') = j sinh(ψ''), which is imaginary-valued.

For the full complex angle when total internal reflection occurs (ψ' = π/2):

sin(ψ_tc) = cosh(ψ'')
cos(ψ_tc) = -j sinh(ψ'')

Both cosh and sinh of real arguments are real-valued, so these expressions are well-defined.

🌊 Properties of the evanescent wave

🌊 Propagation behavior

The wave in Region 2 propagates according to:

e^(-j k_t · r) = e^(-j(β₂ cosh ψ'') x) e^(-(β₂ sinh ψ'') z)

Breaking this down:

The first exponential describes propagation in the +x direction (parallel to the boundary).
The second exponential describes exponential decay in the +z direction (perpendicular to the boundary, into Region 2).
Both β₂ cosh ψ'' and β₂ sinh ψ'' are real-valued and positive constants.

📉 Exponential decay characteristic

The magnitude of the transmitted field decreases exponentially with increasing z; i.e., maximum at the boundary, asymptotically approaching zero with increasing distance from the boundary.

The field is strongest right at the boundary.
It becomes negligible beyond a few wavelengths from the boundary.
This is fundamentally different from uniform plane waves (incident, reflected, or normally transmitted waves).

🏷️ Classification as a surface wave

This wave may be described as a surface wave.

More specifically, it is an evanescent wave:

When total internal reflection occurs, the transmitted field is an evanescent wave; i.e., a surface wave which conveys no power and whose magnitude decays exponentially with increasing distance into Region 2.

⚡ The zero-power characteristic

⚡ Why no power is transmitted

The strangest feature: the evanescent field acts like a wave but conveys no power.

It exists solely to enforce electromagnetic boundary conditions at the boundary.
It does not exist independently of the incident and reflected fields.

🧪 Thought experiment: laser illumination

The excerpt describes an experiment to illustrate the zero-energy nature:

Setup: A laser illuminates a boundary where total internal reflection occurs.

All incident power reflects back.
An evanescent wave exists on the opposite side.

When the laser turns off:

The reflected light continues propagating to infinity (it carries energy).
The evanescent wave vanishes instantly at the moment illumination ceases.
Reason: there is no power—hence no energy—in the evanescent wave.

Example: Think of the evanescent field as a "shadow" of the incident field that must exist mathematically but contains no substance to propagate on its own.

⚠️ Don't confuse with transmitted waves

Feature	Evanescent wave (total internal reflection)	Transmitted wave (normal refraction)
Power	Zero	Non-zero
Spatial behavior	Exponential decay perpendicular to boundary	Uniform plane wave
Independence	Exists only while incident wave is present	Propagates independently
Distance traveled	Negligible beyond a few wavelengths	Propagates to infinity

Phase and Group Velocity

6.1 Phase and Group Velocity

🧭 Overview

🧠 One-sentence thesis

Phase velocity describes the speed of constant-phase points in a wave, but when waves carry information through modulation or dispersion, group velocity—the speed at which disturbances and information propagate—becomes the meaningful measure of propagation speed.

📌 Key points (3–5)

Phase velocity measures how fast points of constant phase move in a sinusoidal wave; it equals wavelength times frequency and depends only on material properties (μ and ε) for simple waves.
Why phase velocity has limits: it assumes perfect uniformity over all space and time, which cannot carry information; information requires non-uniformity (modulation).
Group velocity measures the speed of disturbances and information; it is the ratio of change in frequency to change in phase propagation constant.
Common confusion: phase velocity can exceed the speed of light in waveguides, but group velocity (information speed) always remains below c—no physical laws are violated because the universal speed limit applies to information, not phase.
When they differ: in uniform sinusoidal waves they are equal, but dispersion and modulation cause group velocity to differ from phase velocity.

🌊 Phase velocity fundamentals

🌊 What phase velocity measures

Phase velocity is the speed at which a point of constant phase travels as the wave propagates.

For a sinusoidal wave of form A cos(ωt − βz + ψ), phase velocity is straightforward to calculate.
At any given time, the distance between points of constant phase is one wavelength λ.
Therefore: phase velocity v_p = λf (wavelength times frequency).
Alternative expressions: v_p = ω/β, or v_p = 1/sqrt(με) for simple matter.

🧱 Phase velocity as a material property

Since v_p depends only on the constitutive properties μ (permeability) and ε (permittivity), it can be viewed as a property of the material itself.
This holds for waves in "simple matter" where β = ω sqrt(με).

⚠️ The uniformity limitation

Equations for phase velocity assume a wave with precisely the same behavior over all possible time (from −∞ to +∞) and all possible position z (from −∞ to +∞).
Critical constraint: this uniformity over all space and time precludes using such a wave to send information.
To transmit information, the source must vary at least one parameter as a function of time:
- A (amplitude modulation)
- ω (frequency modulation)
- ψ (phase modulation)
Information can only be transmitted by making the wave non-uniform in some respect.

📡 Group velocity and information propagation

📡 Why group velocity is needed

When parameters vary with time or position (through modulation, dispersion, or waveguide effects), the instantaneous distance between constant-phase points may differ greatly from λ.
The instantaneous frequency of variation may be very different from f.
In these cases, phase velocity equations may not provide a meaningful value for propagation speed.

📡 Definition of group velocity

Group velocity, v_g, is the ratio of the apparent change in frequency ω to the associated change in the phase propagation constant β; i.e., Δω/Δβ.

As Δβ becomes vanishingly small: v_g = ∂ω/∂β
Note the similarity to phase velocity (v_p = ω/β) but with a derivative instead of a simple ratio.

📡 What group velocity represents

Group velocity is the speed at which a disturbance in the wave propagates.
Information may be conveyed as meaningful disturbances relative to a steady-state condition.
Therefore, group velocity is the speed of information in a wave.

🔄 When phase and group velocity are equal

For uniform sinusoidal waves (form A cos(ωt − βz + ψ)), the group velocity calculation yields v_g = 1/sqrt(με) = v_p.
In other words, for these simple waves, group velocity equals phase velocity.

🔬 Dispersion example: square-law media

🔬 The dispersion scenario

The excerpt provides an example of a non-magnetic dispersive medium where relative permittivity ε_r varies as the square of frequency over a narrow range centered at ω₀:

ε_r = K(ω/ω₀)² where K is a real-valued positive constant.

🔬 Calculating both velocities

For this dispersive material:

Phase velocity: v_p = (ω₀/sqrt(K)) · (ω/sqrt(μ₀ε₀))
Group velocity calculation uses v_g = ∂ω/∂β = (∂β/∂ω)⁻¹
After working through the derivatives: v_g = (1/2)(ω/β) = (1/2)v_p

🔬 Key result

In this square-law dispersion case, group velocity is always half the phase velocity.

⚡ Waveguides and the speed of light

⚡ Phase velocity exceeding c

In waveguides (devices for guiding wave propagation), phase velocity may exceed the speed of light in vacuum, c.
This might seem to violate physical laws, but it does not.

⚡ Why no laws are violated

Quantity	Speed in waveguides	Physical constraint
Phase velocity	May exceed c	No universal limit on phase velocity
Group velocity	Remains less than c	Must obey universal speed limit
Information speed	Equals group velocity	Must be ≤ c

The universal "speed limit" c applies to information, not simply to points of constant phase.
Since information propagates at group velocity (not phase velocity), and group velocity remains below c, no physical laws are violated.

⚠️ Don't confuse

Phase velocity > c does not mean information travels faster than light.
The speed limit applies to what can carry information (disturbances, signals), not to mathematical phase points.

Markdown Copy Insert Apply

Parallel Plate Waveguide: Introduction

6.2 Parallel Plate Waveguide: Introduction

🧭 Overview

🧠 One-sentence thesis

The parallel plate waveguide serves as a fundamental model for analyzing wave propagation between two perfectly-conducting plates, with solutions obtained by decomposing the electric field into transverse electric (TE) and transverse magnetic (TM) components that satisfy the wave equation and boundary conditions.

📌 Key points (3–5)

What the structure is: two infinite perfectly-conducting plates separated by a lossless material, used to guide wave propagation.
Why it matters: serves as a rudimentary model applicable to engineering problems like microstrip line analysis and ionospheric radio wave propagation.
How to solve it: decompose the wave equation in Cartesian coordinates, then separate into TE and TM components to simplify the problem.
Common confusion: TE vs TM—TE has electric field parallel to the plates (only E_y non-zero), while TM has magnetic field parallel to the plates (only H_y non-zero).
Key simplification: restricting to two dimensions (x and z) with no y-variation eliminates partial derivatives with respect to y, making the equations more tractable.

📐 Physical structure and setup

📐 Geometry of the waveguide

Two perfectly-conducting plates located at z = 0 and z = a.
Plates are assumed infinite in extent, so no need to consider fields outside the region between them (z < 0 or z > a).
The region between plates contains an ideal lossless material with real-valued permeability μ and real-valued permittivity ε.

Parallel plate waveguide: a device for guiding the propagation of waves between two perfectly-conducting plates.

🎯 Applications

The excerpt lists two specific engineering problems where this model applies:

Analysis of fields within microstrip line.
Propagation of radio waves in the ionosphere.

Example: When radio waves travel through the ionosphere, the parallel plate model can approximate the behavior even though the physical structure differs—the model captures the essential wave-guiding physics.

🧮 Mathematical formulation

🧮 The governing wave equation

In phasor form, the electric field intensity is governed by:

Wave equation: del-squared of E-tilde plus beta-squared times E-tilde equals zero

Where beta equals omega times the square root of mu times epsilon.

This is a partial differential equation.
Combined with boundary conditions from the perfectly-conducting plates, it determines a unique solution.
Analysis is easiest in Cartesian coordinates.

📊 Cartesian decomposition

Express the electric field E-tilde as the sum of three components:

x-component: x-hat times E_x-tilde
y-component: y-hat times E_y-tilde
z-component: z-hat times E_z-tilde

This allows the wave equation to split into three separate equations, one for each component (E_x, E_y, E_z).

The Laplacian operator del-squared in Cartesian coordinates equals:

Second partial derivative with respect to x, plus
Second partial derivative with respect to y, plus
Second partial derivative with respect to z

🔧 Two-dimensional simplification

The excerpt restricts attention to scenarios with:

Complete description in two dimensions: x and z only.
No variation in y direction.

Why this matters: All partial derivatives with respect to y become zero, considerably simplifying the three component equations.

Don't confuse: This is not a requirement but a representative simplification for a broad class of relevant problems.

🔀 TE and TM decomposition

🔀 Why decompose

After reducing to two dimensions, the excerpt states a "more useful approach" is to:

Decompose the total electric field into TE and TM components.
Determine solutions for these components separately.
Sum them to get the total field.

⚡ Transverse Electric (TE) component

TE component: the electric field is parallel to the plates and transverse (perpendicular) to the plane shown in the figure.

Characteristics:

E_x-tilde = 0
E_z-tilde = 0
Only E_y-tilde may be non-zero

Physical meaning: The electric field points along the y-direction, parallel to the conducting plates.

🧲 Transverse Magnetic (TM) component

TM component: the magnetic field intensity (H-tilde) is parallel to the plates and transverse to the plane shown in the figure.

Characteristics:

H_x-tilde = 0
H_z-tilde = 0
Only H_y-tilde may be non-zero

Physical meaning: The magnetic field points along the y-direction, parallel to the conducting plates.

🔍 How to distinguish TE from TM

Component	What is transverse (parallel to plates)	What can be non-zero
TE	Electric field	Only E_y
TM	Magnetic field	Only H_y

Common confusion: Both TE and TM refer to fields being transverse to the propagation plane, but TE means the electric field is transverse, while TM means the magnetic field is transverse.

🎯 Boundary conditions and solution strategy

🎯 Boundary conditions at the plates

At the perfectly-conducting surfaces (z = 0 and z = a):

The tangent component of E-tilde must be zero.

Why this matters: These boundary conditions, combined with the wave equations, determine the unique solution for the electric field.

📝 Solution roadmap

The excerpt outlines the following approach:

Start with the wave equation for electric field.
Express in Cartesian coordinates and decompose into x, y, z components.
Simplify by assuming no y-variation (two-dimensional problem).
Further decompose into TE and TM components.
Solve each component separately subject to boundary conditions.

The excerpt states that solutions are presented in subsequent sections:

Section 6.3: Electric component of TE solution
Section 6.4: Magnetic component of TE solution
Section 6.5: Electric component of TM solution
Magnetic component of TM solution can be determined by straightforward variation (not presented)

🧩 TE electric field equation

For the TE case, since E_x-tilde = E_z-tilde = 0, only one equation remains:

Second partial derivative of E_y-tilde with respect to x, plus
Second partial derivative of E_y-tilde with respect to z, equals
Negative beta-squared times E_y-tilde

The general solution has the form of exponential terms involving:

Constants A, B, C, D (complex-valued)
Constants k_x and k_z (real-valued)
Exponential terms in both x and z directions with both positive and negative exponents

Physical interpretation: The constants k_x and k_z are given variable names "with advance knowledge of their physical interpretation," but at this stage they are simply unknowns to be determined by boundary conditions.

Parallel Plate Waveguide: TE Case, Electric Field

6.3 Parallel Plate Waveguide: TE Case, Electric Field

🧭 Overview

🧠 One-sentence thesis

The TE electric field in a parallel plate waveguide consists of discrete modes, each with its own cutoff frequency below which it cannot propagate, and the solution reveals that phase velocity exceeds the unbounded-space wave speed while information still travels slower than light.

📌 Key points (3–5)

What the TE electric field looks like: only the y-component is non-zero; it is a sum of sinusoidal modes in the x-direction multiplied by a propagating wave in the z-direction.
Modes and cutoff frequency: each mode m exists only above a cutoff frequency; below that frequency the mode does not propagate.
How to distinguish modes: higher-order modes have higher cutoff frequencies, smaller propagation constants, and more sinusoidal half-periods across the waveguide width.
Common confusion—phase vs group velocity: phase velocity in the waveguide exceeds the unbounded-space wave speed, but information (group velocity) still travels slower than light, so no physical laws are violated.
Why boundary conditions matter: the requirement that the tangential electric field vanish at perfectly conducting surfaces determines which modes can exist and their spatial structure.

🧮 Starting equation and general solution

🧮 The governing partial differential equation

The TE component has only E_y non-zero (E_x = E_z = 0), so the relevant equation is:

The second partial derivative of E_y with respect to x plus the second partial derivative of E_y with respect to z equals negative beta squared times E_y.

This is the wave equation in two dimensions (x and z).
Beta is the propagation constant in the unbounded medium.

🌊 General solution structure

The general solution is:

E_y = (exponential with negative j k_z z) times [A times exponential with negative j k_x x plus B times exponential with positive j k_x x] plus (exponential with positive j k_z z) times [C times exponential with negative j k_x x plus D times exponential with positive j k_x x].
The first term (with negative j k_z z) represents a wave propagating in the positive z direction.
The second term (with positive j k_z z) represents a wave propagating in the negative z direction.
A, B, C, D are complex constants; k_x and k_z are real constants.

🚫 Eliminating backward-traveling waves

If sources exist only on the left (z < 0) and there is no scattering structure on the right (z > 0), then there can be no wave components propagating in the negative z direction.

This means C = D = 0.
The solution simplifies to: E_y = (exponential with negative j k_z z) times [A times exponential with negative j k_x x plus B times exponential with positive j k_x x].

🔗 Relationship between k_x, k_z, and beta

By substituting the simplified solution back into the governing equation, the excerpt verifies that it is indeed a solution, but only if:

beta squared equals k_x squared plus k_z squared.

This confirms that k_x and k_z are the components of the propagation vector k.
The propagation vector is: beta times unit vector k-hat equals unit vector x-hat times k_x plus unit vector y-hat times k_y plus unit vector z-hat times k_z.
In this problem, k_y = 0 (no variation in the y-direction).

🔒 Applying boundary conditions

🔒 Boundary condition at x = 0

The component of E that is tangent to a perfectly conducting surface is zero.

Applied to this problem: E_y = 0 at x = 0 and E_y = 0 at x = a (where a is the plate separation).
At x = 0, the solution requires A + B = 0, so B = negative A.
Using a trigonometric identity, the solution becomes: E_y = E_y0 times (exponential with negative j k_z z) times sine of (k_x x).
E_y0 is a newly defined complex constant (equal to j times 2B).

🔒 Boundary condition at x = a

Applying E_y = 0 at x = a:

E_y0 times (exponential with negative j k_z z) times sine of (k_x a) = 0.
The exponential factor cannot be zero, and E_y0 = 0 yields only trivial solutions.
Therefore: sine of (k_x a) = 0.
This requires: k_x a = m pi, where m is an integer.

🚫 Excluding certain integer values

m = 0 is not of interest because it yields k_x = 0, which gives the trivial solution E_y = 0.
Each negative integer value of m is excluded because the associated solution differs from the corresponding positive value only in sign, which can be absorbed into the arbitrary constant E_y0.
Therefore, m = 1, 2, 3, ... are the valid mode numbers.

🎼 Modes and cutoff frequency

🎼 What a mode is

Each solution associated with a particular value of m is referred to as a mode, which has a particular value of k_x.

The family of solutions is given by: E_y = E_y0 times (exponential with negative j k_z z) times sine of (k_x x), with k_x = m pi / a.
Each mode m has its own k_x and k_z.

📏 Determining k_z for each mode

For mode m, k_z is obtained from the relationship beta squared = k_x squared + k_z squared:

k_z = square root of [beta squared minus k_x squared] = square root of [beta squared minus (m pi / a) squared].
Since k_z is specified to be real-valued, we require: beta squared minus (m pi / a) squared > 0.
This constrains beta: beta > m pi / a.

🚦 Cutoff frequency

Recall that beta = omega times square root of (mu epsilon), and omega = 2 pi f, where f is frequency.

Solving for f: f > m / (2 a times square root of mu epsilon).
Therefore, each mode exists only above a certain frequency, which is different for each mode.

The cutoff frequency f_c for mode m is: f_c^(m) = m / (2 a times square root of mu epsilon).

At frequencies below the cutoff frequency for mode m, that mode exhibits imaginary-valued k_z.
The propagation constant must have a real-valued component in order to propagate; therefore, modes below cutoff do not propagate and may be ignored.

📋 Summary of the solution

For the scenario (sources on the left, no scattering on the right), the electric field component of the TE solution is:

Unit vector y-hat times E_y = unit vector y-hat times the sum (from m = 1 to infinity) of E_y^(m).
Each modal component E_y^(m) is:
- 0 if f < f_c^(m) (below cutoff).
- E_y0^(m) times (exponential with negative j k_z^(m) z) times sine of (k_x^(m) x) if f ≥ f_c^(m) (at or above cutoff).
Where:
- k_z^(m) = square root of [beta squared minus (k_x^(m)) squared].
- k_x^(m) = m pi / a.
- E_y0^(m) is a complex constant depending on sources or boundary conditions to the left.

🔍 Examining specific modes

🥇 Lowest-order mode (m = 1)

For m = 1:

Cutoff frequency: f_c^(1) = 1 / (2 a times square root of mu epsilon).
This mode can exist if f > 1 / (2 a times square root of mu epsilon).
k_x^(1) = pi / a.
k_z^(1) = square root of [beta squared minus (pi / a) squared].
E_y^(1) = E_y0^(1) times (exponential with negative j k_z^(1) z) times sine of (pi x / a).

Spatial structure:

This mode has the form of a plane wave propagating in the positive z direction with phase propagation constant k_z^(1).
The wave is non-uniform, with magnitude proportional to sine of (pi x / a) within the waveguide.
Magnitude is zero at the perfectly conducting surfaces (x = 0 and x = a) and maximum in the center of the waveguide.

🥈 Second-order mode (m = 2)

For m = 2:

Cutoff frequency: f_c^(2) = 1 / (a times square root of mu epsilon).
This frequency is higher than f_c^(1), so the m = 1 mode can exist at any frequency at which the m = 2 mode exists.
k_x^(2) = 2 pi / a.
k_z^(2) = square root of [beta squared minus (2 pi / a) squared].
E_y^(2) = E_y0^(2) times (exponential with negative j k_z^(2) z) times sine of (2 pi x / a).

Spatial structure:

The wave propagates in the positive z direction with phase propagation constant k_z^(2), which is less than k_z^(1).
Magnitude is proportional to sine of (2 pi x / a) within the waveguide.
Magnitude is zero at the PEC surfaces; for m = 2, there are two maxima with respect to x, and the magnitude in the center of the waveguide is zero.

🔢 Pattern for higher-order modes

This pattern continues for higher-order modes:

Each successive mode exhibits higher cutoff frequency.
Each successive mode has a smaller propagation constant k_z.
Each successive mode shows an increasing integer number of sinusoidal half-periods in magnitude across the waveguide width.

📐 Example: single-mode TE propagation

Scenario: An air-filled parallel plate waveguide with plates separated by 1 cm. Determine the frequency range for which one (and only one) propagating TE mode is assured.

Solution:

Single-mode TE propagation is assured by limiting frequency f to greater than the cutoff frequency for m = 1, but lower than the cutoff frequency for m = 2.
Any frequency higher than the cutoff frequency for m = 2 allows at least 2 modes to exist.
Calculating the cutoff frequencies:
- f_c^(1) = 1 / (2 a times square root of mu_0 epsilon_0) ≈ 15.0 GHz.
- f_c^(2) = 2 / (2 a times square root of mu_0 epsilon_0) ≈ 30.0 GHz.
Therefore, the frequency range is: 15.0 GHz ≤ f ≤ 30.0 GHz.

🚀 Phase velocity and group velocity

🚀 Phase velocity in the waveguide

For the lowest-order mode m = 1, the phase velocity v_p is:

v_p = omega / k_z^(1) = omega / square root of [omega squared times mu epsilon minus (pi / a) squared].
Recall that the speed of an electromagnetic wave in unbounded space (not in a waveguide) is 1 / square root of (mu epsilon).
Example: the speed of light in free space is 1 / square root of (mu_0 epsilon_0) = c.

Key observation:

The phase velocity indicated by the formula is greater than 1 / square root of (mu epsilon).
For example, it is faster than light would travel in the same material (presuming it were transparent).

🤔 Why phase velocity can exceed the speed of light

At first glance, phase velocity exceeding the speed of light may seem impossible. However:

Information travels at the group velocity v_g, and not necessarily the phase velocity.

Although not demonstrated in the excerpt, the group velocity in the parallel plate waveguide is always less than 1 / square root of (mu epsilon).
Therefore, no physical laws are broken, and signals travel somewhat slower than the speed of light, as they do in any other structure used to convey signals.

🔄 Don't confuse phase velocity with information speed

Phase velocity: the speed at which the phase of the wave propagates; can exceed the speed of light in unbounded space.
Group velocity: the speed at which information (or energy) propagates; always less than the speed of light in the same material.
The excerpt notes that the speed of propagation is different for [each mode], though the sentence is incomplete.

Parallel Plate Waveguide: TE Case, Magnetic Field

6.4 Parallel Plate Waveguide: TE Case, Magnetic Field

🧭 Overview

🧠 One-sentence thesis

The magnetic field component of the TE solution in a parallel plate waveguide is derived from the electric field using Maxwell-Faraday's equation, revealing that current flows on the conducting surfaces and the magnetic field orientation varies with position inside the waveguide.

📌 Key points (3–5)

Derivation method: The TE magnetic field is obtained by applying the curl operator to the TE electric field via Maxwell-Faraday's equation.
Boundary behavior: At the perfectly-conducting surfaces, the magnetic field is non-zero and parallel to those surfaces, with the x-component zero and the z-component non-zero.
Surface currents: The discontinuity in the tangent component of the magnetic field at the conducting surfaces requires surface currents to flow on those surfaces.
Common confusion: The magnetic field direction changes with position—parallel to the conducting surfaces at the boundaries but perpendicular to them at locations corresponding to electric field maxima.
Modal structure: Each mode contributes both x and z components to the total magnetic field, with the solution expressed as a sum over all modes.

🧮 Deriving the magnetic field from the electric field

🧮 Starting point and governing equation

The excerpt begins with the TE electric field already determined in Section 6.3:

The TE component of the electric field: ŷ Ẽ_y = ŷ sum over m from 1 to infinity of Ẽ^(m)_y

The magnetic field is obtained from Maxwell-Faraday's equation:

∇ × Ẽ = −jωμ H̃

Rearranging: H̃ = (j/ωμ) ∇ × Ẽ = (j/ωμ) ∇ × (ŷ Ẽ_y)

🔧 Applying the curl operator

The curl operator in this geometry has 6 terms, but only 2 are non-zero because the x̂ and ẑ components of Ẽ are zero.
The two remaining terms are: −x̂ (∂Ẽ_y/∂z) and +ẑ (∂Ẽ_y/∂x).
Thus: H̃ = (j/ωμ) [−x̂ (∂Ẽ_y/∂z) + ẑ (∂Ẽ_y/∂x)]

📐 Mode-by-mode calculation

Because differentiation is a linear operator, the curl can be evaluated for each mode separately and then summed.
For mode m, the electric field is: Ẽ^(m)_y = E^(m)_y0 exp(−jk^(m)_z z) sin(k^(m)_x x)
Taking the z-derivative: ∂Ẽ^(m)_y/∂z = [E^(m)_y0 exp(−jk^(m)_z z) sin(k^(m)_x x)] (−jk^(m)_z)
Taking the x-derivative: ∂Ẽ^(m)_y/∂x = [E^(m)_y0 exp(−jk^(m)_z z) cos(k^(m)_x x)] (+k^(m)_x)

🧲 The magnetic field solution

🧲 Modal components

The total magnetic field is assembled as:

x̂ H̃_x + ẑ H̃_z = x̂ sum over m from 1 to infinity of H̃^(m)_x + ẑ sum over m from 1 to infinity of H̃^(m)_z

Where the modal components are:

H̃^(m)_x = −(k^(m)_z / ωμ) E^(m)_y0 exp(−jk^(m)_z z) sin(k^(m)_x x)
H̃^(m)_z = +j (k^(m)_x / ωμ) E^(m)_y0 exp(−jk^(m)_z z) cos(k^(m)_x x)

⚠️ Constraints and caveats

Modes may only exist at frequencies greater than the associated cutoff frequencies.
The same caveats about cutoff frequencies and the locations of sources and structures from the electric field solution continue to apply.

🔲 Behavior at the conducting surfaces

🔲 At the surface x = 0

At the perfectly-conducting (PEC) surface at x = 0:

H̃^(m)_x (x = 0) = 0
H̃^(m)_z (x = 0) = +j (k^(m)_x / ωμ) E^(m)_y0 exp(−jk^(m)_z z)

The magnetic field vector is non-zero and parallel to the PEC surface.

🔲 At the surface x = a

At the PEC surface at x = a:

H̃^(m)_x (x = a) = 0
H̃^(m)_z (x = a) = −j (k^(m)_x / ωμ) E^(m)_y0 exp(−jk^(m)_z z)

Again, the magnetic field is non-zero and parallel to the surface.

⚡ Surface currents

The excerpt draws an important conclusion:

Current flows on the PEC surfaces of the waveguide.

Why this happens:

The magnetic field is identically zero inside a PEC material.
Boundary conditions require that discontinuity in the tangent component of H must be supported by a surface current.
Therefore, surface currents must flow on the conducting plates.

Don't confuse with: This is similar to what happens in a coaxial transmission line, where signals can be described either in terms of potentials and currents on the conductors or electromagnetic fields between them. The parallel plate waveguide is only slightly more complicated because it uses combinations of TE and TM modes instead of a single TEM mode.

🔄 Magnetic field orientation inside the waveguide

🔄 Position-dependent direction

The magnetic field vector points in different directions depending on position:

Location	Orientation	Details
At conducting surfaces (x = 0 or x = a)	Parallel to surfaces	Only z-component is non-zero; x-component is zero
At m locations between x = 0 and x = a	Perpendicular to surfaces	These locations correspond to maxima in the electric field

🔄 Physical interpretation

The magnetic field is not uniform in direction throughout the waveguide.
At the boundaries, it must be tangent to the conducting surfaces (to support surface currents).
At interior locations where the electric field is strongest, the magnetic field reorients to be perpendicular to the surfaces.
This variation in orientation is a characteristic feature of TE modes in waveguides.

Parallel Plate Waveguide: TM Case, Electric Field

6.5 Parallel Plate Waveguide: TM Case, Electric Field

🧭 Overview

🧠 One-sentence thesis

The TM (transverse magnetic) mode in a parallel plate waveguide consists of a family of discrete modes indexed by integer m, where each mode propagates only above a specific cutoff frequency, and uniquely the m=0 mode can propagate at any frequency.

📌 Key points (3–5)

What TM means: the magnetic field vector is perpendicular to the plane of interest and parallel to the conducting surfaces, pointing only in the y-direction.
How modes work: each integer m (0, 1, 2, ...) defines a mode with its own cutoff frequency; only modes above their cutoff frequency can propagate.
Key difference from TE: the m=0 mode exists in the TM case but not in the TE case, and it has zero cutoff frequency so it propagates at any frequency and any plate separation.
Common confusion: m=0 yields k_x = 0 in both TE and TM, but in TE this produces zero field (trivial), whereas in TM the E_x component can still be nonzero.
Constraint on propagation: the relationship beta squared = k_x squared + k_z squared must hold, identical to the TE case.

🧲 TM field structure and governing equation

🧲 What transverse magnetic means

Transverse magnetic (TM): the magnetic field vector is perpendicular to the plane of interest and therefore parallel to the conducting surfaces.

In this geometry, the magnetic field has only a y-component: H-tilde = y-hat H-tilde_y.
There is no component in the x or z directions.
This is the defining characteristic that distinguishes TM from TE modes.

📐 The governing partial differential equation

The magnetic component of the TM field satisfies:

Second partial derivative with respect to x of H-tilde_y, plus second partial derivative with respect to z of H-tilde_y, equals negative beta squared times H-tilde_y.
This is the same form as in the TE case, but applied to the magnetic field component instead of the electric field component.

🌊 General solution form

The general solution is:

H-tilde_y = exp(negative j k_z z) times [A exp(negative j k_x x) + B exp(positive j k_x x)] + exp(positive j k_z z) times [C exp(negative j k_x x) + D exp(positive j k_x x)].
A, B, C, D are complex-valued constants; k_x and k_z are real-valued constants.
The first term (with exp(negative j k_z z)) represents a wave propagating in the positive z direction.
The second term (with exp(positive j k_z z)) represents a wave propagating in the negative z direction.

🚫 Simplification by source restriction

If sources exist only on the left (z < 0) side and there is no structure capable of reflection on the right (z > 0) side:

There can be no wave components propagating in the negative z direction.
Therefore C = 0 and D = 0.
The solution simplifies to: H-tilde_y = exp(negative j k_z z) times [A exp(negative j k_x x) + B exp(positive j k_x x)].

🔗 Propagation constraint and vector relationship

🔗 Verifying the solution

By computing the second partial derivatives and summing:

Second partial with respect to x yields: negative k_x squared times H-tilde_y.
Second partial with respect to z yields: negative k_z squared times H-tilde_y.
Sum: negative (k_x squared + k_z squared) times H-tilde_y.

Comparing to the governing equation:

This confirms the solution is valid under the constraint: beta squared = k_x squared + k_z squared.

🧭 Physical interpretation of k_x and k_z

The constraint confirms that k_x and k_z are components of the propagation vector:

k = beta times k-hat = x-hat k_x + y-hat k_y + z-hat k_z.
k-hat is the unit vector pointing in the direction of propagation.
In this problem, k_y = 0.
This is precisely the same constraint identified in the TE case.

⚡ Deriving the electric field from the magnetic field

⚡ Using Ampere's law

The electric field is obtained from the magnetic field using:

Curl of H-tilde = j omega epsilon times E-tilde.
Rearranging: E-tilde = (1 / j omega epsilon) times curl of H-tilde.

🔄 Applying the curl operator

Since H-tilde has only a y-component:

The curl expression (from Appendix B.2) has 6 terms, but only 2 are nonzero.
The two remaining terms are: negative x-hat times (partial of H-tilde_y with respect to z) and positive z-hat times (partial of H-tilde_y with respect to x).
Therefore: E-tilde = (1 / j omega epsilon) times [negative x-hat (partial H-tilde_y / partial z) + z-hat (partial H-tilde_y / partial x)].

📊 Components of the electric field

Using the derivatives of the simplified magnetic field solution:

Component	Expression
E-tilde_x	(k_z / omega epsilon) exp(negative j k_z z) [A exp(negative j k_x x) + B exp(positive j k_x x)]
E-tilde_z	(k_x / omega epsilon) exp(negative j k_z z) [negative A exp(negative j k_x x) + B exp(positive j k_x x)]

The solution is now reduced to finding constants A, B, and either k_x or k_z.
This is accomplished by enforcing boundary conditions.

🛡️ Boundary conditions and mode structure

🛡️ Tangential electric field at conducting surfaces

The component of the electric field tangent to a perfectly-conducting surface is zero:

At x = 0: E-tilde_z (x = 0) = 0.
At x = a: E-tilde_z (x = a) = 0.

🔢 Applying the first boundary condition (x = 0)

From the expression for E-tilde_z at x = 0:

(k_x / omega epsilon) exp(negative j k_z z) [negative A (1) + B (1)] = 0.
The exponential factor has unit magnitude and cannot be zero.
Instead of requiring k_x = 0 (too restrictive), we require A = B.

This allows rewriting E-tilde_z using a trigonometric identity:

E-tilde_z = (j 2 B k_x / omega epsilon) exp(negative j k_z z) sin(k_x x).

And E-tilde_x becomes:

E-tilde_x = (2 B k_z / omega epsilon) exp(negative j k_z z) cos(k_x x).
Define E_x0 = 2 B k_z / omega epsilon for convenience.
Then: E-tilde_x = E_x0 exp(negative j k_z z) cos(k_x x).

🔢 Applying the second boundary condition (x = a)

From E-tilde_z at x = a:

(j 2 B k_z / omega epsilon) exp(negative j k_z z) sin(k_x a) = 0.
Requiring B = 0 or k_z = 0 yields only trivial solutions.
Therefore: sin(k_x a) = 0, which requires k_x a = m pi where m is an integer.

🎯 Mode enumeration

Each integer m defines a mode:

k_x = m pi / a for m = 0, 1, 2, ...
Each mode is associated with a particular value of k_x.
Don't confuse with TE: in TE, m = 0 yields zero field (not of interest); in TM, m = 0 yields E-tilde_z = 0 but E-tilde_x is not necessarily zero, so m = 0 is a valid mode.

📡 Cutoff frequency and propagation conditions

📡 Determining k_z for each mode

For mode m, k_z is obtained from the constraint:

k_z = square root of (beta squared minus k_x squared) = square root of [beta squared minus (m pi / a) squared].
Since k_z is specified to be real-valued, we require: beta squared minus (m pi / a) squared > 0.

🚦 Cutoff frequency for each mode

The constraint on beta translates to a constraint on frequency:

beta > m pi / a.
Recall beta = omega times square root of (mu epsilon) and omega = 2 pi f.
Solving for f: f > m / (2 a square root of (mu epsilon)).

Cutoff frequency for mode m: f_c^(m) = m / (2 a square root of (mu epsilon)).

Each mode exists only above its cutoff frequency.
At frequencies below the cutoff, k_z becomes imaginary-valued and the mode does not propagate.

🌟 Special case: the m = 0 mode

The m = 0 mode has unique properties:

Cutoff frequency is zero: f_c^(0) = 0.
This mode is always able to propagate for any plate separation a > 0 and any frequency f > 0.
This is a remarkable difference from the TE case, where m = 0 is not available.

Example: A waveguide with any plate separation can support the m = 0 mode at any frequency, but higher modes (m = 1, 2, ...) require the frequency to exceed their respective cutoffs.

📋 Complete TM electric field solution

📋 Summary of the solution

The electric field component of the TM field is given by:

E-tilde = sum from m = 0 to infinity of [x-hat E-tilde_x^(m) + z-hat E-tilde_z^(m)].

Where for each mode m:

E-tilde_x^(m):
- 0 if f < f_c^(m).
- E_x0^(m) exp(negative j k_z^(m) z) cos(k_x^(m) x) if f ≥ f_c^(m).
E-tilde_z^(m):
- 0 if f < f_c^(m).
- j (k_x^(m) / k_z^(m)) E_x0^(m) exp(negative j k_z^(m) z) sin(k_x^(m) x) if f ≥ f_c^(m).

🔧 Mode parameters

For each mode m:

k_z^(m) = square root of [beta squared minus (k_x^(m)) squared].
k_x^(m) = m pi / a.
The coefficients E_x0^(m) depend on sources and/or boundary conditions to the left of the region of interest.

🎯 Assumptions

This solution presumes:

All sources lie to the left of the region of interest.
No additional sources or boundary conditions to the right of the region of interest.

🌊 The TM₀ mode: special properties

🌊 Why the m = 0 mode is special

The m = 0 mode, commonly referred to as the "TM₀" mode:

Is of particular importance in the analysis of microstrip transmission lines.
Has zero cutoff frequency, so it exists at any frequency and any plate separation.
For this mode: k_x^(0) = 0, k_z^(0) = beta.

✈️ Uniform plane wave behavior

The electric field for the TM₀ mode is:

E-tilde = x-hat E_x0^(0) exp(negative j beta z).
This has the form of a uniform plane wave propagating in the positive z direction (squarely between the plates).
The phase and group velocities are equal to each other and to omega / beta.
This is precisely as expected for a uniform plane wave in unbounded media, as if the plates did not exist.

Don't confuse: even though the plates impose boundary conditions for higher modes, the TM₀ mode propagates as if in free space, with no dependence on the plate separation a.

Parallel Plate Waveguide: The TM₀ Mode

6.6 Parallel Plate Waveguide: The TM0 Mode

🧭 Overview

🧠 One-sentence thesis

The TM₀ mode (m = 0) in a parallel plate waveguide behaves as a uniform plane wave propagating between the plates with no cutoff frequency, making it the only mode that can exist at any frequency and plate separation.

📌 Key points (3–5)

What makes TM₀ special: it is the lowest-order TM mode (m = 0), exists only in TM (not TE), and has zero cutoff frequency so it propagates at any frequency and any non-zero plate separation.
How it behaves: the electric field has the form of a uniform plane wave traveling squarely between the plates, as if the plates did not exist.
Phase and group velocities: both equal ω/β, exactly as expected for a uniform plane wave in unbounded media.
Common confusion: TM₀ is unique—higher-order modes (m ≥ 1) have non-zero cutoff frequencies and cannot propagate below those frequencies, but TM₀ always propagates.
Why it matters: TM₀ is the foundation for microstrip transmission lines and is often the only propagating mode in practical structures like printed circuit boards at lower frequencies.

🌊 The nature of the TM₀ mode

🔢 Definition and uniqueness

The TM₀ mode: the m = 0 mode of the TM field in a parallel plate waveguide.

It is the lowest-order mode in the TM decomposition.
Only exists in TM, not in TE—this is a key distinction from higher-order modes.
The cutoff frequency for this mode is zero, meaning it can propagate at any frequency and within any non-zero plate separation a.

📐 Mathematical form

For TM₀, the wave numbers are: k⁽⁰⁾ₓ = 0 and k⁽⁰⁾_z = β.
The electric field intensity is:
- E-tilde = x-hat times E⁽⁰⁾ₓ₀ times e to the power of (minus j β z)
This form is identical to a uniform plane wave propagating in the +z direction (squarely between the plates).
Example: imagine a wave traveling straight down the center line between two parallel metal plates, with the electric field pointing in the x direction and varying only with distance z.

🧲 Associated magnetic field

The magnetic field is derived using the plane wave relationship:
- H-tilde = (1/η) times k-hat cross E-tilde
Substituting the TM₀ electric field:
- H-tilde = y-hat times (E⁽⁰⁾ₓ₀ / η) times e to the power of (minus j β z)
The magnetic field points in the y direction, perpendicular to both the electric field (x direction) and the direction of propagation (z direction).
This confirms the plane wave character: E, H, and the propagation direction are mutually perpendicular.

🚀 Propagation characteristics

⚡ Phase and group velocities

Both phase velocity and group velocity equal ω/β.
This is precisely as expected for a uniform plane wave in unbounded media.
The plates do not alter the wave's velocity—it propagates as if the plates did not exist.
Don't confuse: higher-order modes (m ≥ 1) have dispersion and different phase/group velocities, but TM₀ behaves like a simple plane wave.

🔓 Zero cutoff frequency

The cutoff frequency for mode m is: f⁽ᵐ⁾_c = m / (2a times square root of (μ times ε)).
For m = 0, this gives f⁽⁰⁾_c = 0.
Implication: TM₀ can propagate at any frequency, no matter how low, and in any non-zero plate separation.
Example: if you have plates separated by 1 mm or 10 mm, and operate at 1 MHz or 10 GHz, TM₀ will always propagate.

🖥️ Practical application: printed circuit boards

🛠️ PCB as a parallel plate waveguide

A common PCB structure: a dielectric slab (e.g., 1.575 mm thick, relative permittivity ≈ 4.5) sandwiched between two copper planes.
If the copper planes have high enough conductivity, their inward-facing surfaces can be treated as perfectly conducting.
Deep inside the sandwich (away from edges), the PCB is well-modeled as a parallel plate waveguide.

📡 Mode analysis in the example

The electromagnetic field consists of a combination of TE and TM modes.
Which modes are active depends on:
- The source (a mode must be "stimulated" by the source to propagate).
- Modal cutoff frequencies.
For the given PCB (a = 1.575 mm, μ ≈ μ₀, ε ≈ 4.5 ε₀):
- Cutoff frequency for mode m: f⁽ᵐ⁾_c ≈ (44.9 GHz) times m.
- For m = 1, f⁽¹⁾_c ≈ 44.9 GHz, which is much greater than 10 GHz.
Conclusion: below 10 GHz, the only mode that can propagate is TM₀.
The field deep inside the PCB is a single plane wave with the TM₀ structure, propagating away from the source.

🏃 Phase velocity in the PCB

Phase velocity: v_p = 1 / square root of (μ times ε).
For the example: v_p ≈ 1.41 × 10⁸ m/s ≈ 0.47 c (where c is the speed of light in vacuum).
This is slower than the speed of light in vacuum because the dielectric has relative permittivity > 1.

🔗 Connection to microstrip transmission lines

The scenario described (TM₀ propagating in a PCB sandwich) is essentially a very rudimentary form of microstrip transmission line.
This highlights the practical importance of the TM₀ mode in real-world applications like signal transmission in electronic circuits.

🔍 Key distinctions and common confusions

🆚 TM₀ vs higher-order modes

Feature	TM₀ (m = 0)	Higher-order modes (m ≥ 1)
Cutoff frequency	Zero	Non-zero (increases with m)
Propagation	At any frequency	Only above cutoff frequency
Field structure	Uniform plane wave	More complex spatial variation
Phase/group velocity	Equal, like unbounded plane wave	Dispersive, frequency-dependent

🚫 TM₀ vs TE modes

TM₀ exists only in the TM case.
The m = 0 mode does not exist in the TE case.
Don't confuse: TE modes start at m = 1, but TM modes include m = 0.

🧩 Why TM₀ dominates in practice

In many practical structures (like the PCB example), the geometry and operating frequency are such that higher-order modes are below cutoff.
TM₀, with its zero cutoff frequency, is often the only propagating mode.
This simplifies analysis and design: the field can be treated as a single plane wave.

General Relationships for Unidirectional Waves

6.7 General Relationships for Unidirectional Waves

🧭 Overview

🧠 One-sentence thesis

For waves traveling in a single direction within waveguides, all field components can be determined from just the components in the direction of propagation, dramatically simplifying the analysis of electromagnetic waves in enclosed spaces.

📌 Key points (3–5)

Core simplification: Unidirectional wave assumption allows separate analysis of forward and backward waves, which can then be summed using superposition.
Key result: Knowing only the z-components of electric and magnetic fields (Ẽ_z and H̃_z) is sufficient to determine all other field components.
Critical parameter k_ρ: This quantity relates to field variation perpendicular to propagation direction and must be non-zero for the method to work.
Common confusion: These relationships do NOT work for uniform plane waves or TEM waves (where k_ρ = 0), only for non-TEM waveguide modes.
Further decomposition: Unidirectional waves can be split into TE (transverse electric, Ẽ_z = 0) and TM (transverse magnetic, H̃_z = 0) components for even simpler analysis.

🎯 Why unidirectional analysis matters

🎯 The difficulty of waveguide analysis

Analyzing electromagnetic waves in enclosed spaces and waveguides is "quite difficult" in general.
The task becomes "dramatically simplified" if the wave can be assumed to propagate in a single direction.

🔄 No loss of generality through superposition

Within a straight waveguide, waves can travel either "forward" or "backward."
The principle of superposition allows treating these two unidirectional cases separately.
The final solution is simply the sum of the two unidirectional results.
Example: Instead of solving for bidirectional waves simultaneously, solve for +z-traveling wave, solve for -z-traveling wave, then add them.

🧮 Mathematical decomposition strategy

📐 Starting point: Maxwell's curl equations

The derivation begins with Maxwell's curl equations in phasor form:

∇ × Ẽ = -jωμ H̃
∇ × H̃ = +jωϵ Ẽ

The strategy is derived in Cartesian coordinates for rectangular waveguides, but the underlying approach is generally applicable.

🔀 Component breakdown

Each field is expressed as three Cartesian components:

Ẽ = x̂ Ẽ_x + ŷ Ẽ_y + ẑ Ẽ_z
H̃ = x̂ H̃_x + ŷ H̃_y + ẑ H̃_z

This allows extracting separate equations for each component using the curl formula in Cartesian coordinates.

🌊 Exploiting unidirectional propagation

For a wave traveling in the +ẑ direction:

Each magnetic field component contains a factor e^(-jk_z z), where k_z is the phase propagation constant in the direction of travel.
The remaining factors depend only on x and y, not on z.
Components are decomposed as: H̃_x = h̃_x(x,y) e^(-jk_z z), and similarly for y and z components.

Key advantage: Partial derivatives with respect to z reduce to simple algebraic operations:

∂H̃_x/∂z = -jk_z H̃_x (multiplication instead of differentiation)

🔗 Simultaneous equations

After substitution and simplification, six simultaneous equations result (Equations 6.130–6.135 in the excerpt):

Three equations express Ẽ_x, Ẽ_y, Ẽ_z in terms of magnetic field components and their derivatives.
Three equations express H̃_x, H̃_y, H̃_z in terms of electric field components and their derivatives.

These represent Maxwell's curl equations specialized for a unidirectional +ẑ-traveling wave.

🎁 The main result: transverse components from longitudinal components

📊 Simplified field expressions

With algebraic manipulation, the transverse components (x̂ and ŷ) can be expressed using only the longitudinal components (ẑ):

Component	Depends on
Ẽ_x	Derivatives of Ẽ_z and H̃_z with respect to x and y
Ẽ_y	Derivatives of Ẽ_z and H̃_z with respect to x and y
H̃_x	Derivatives of Ẽ_z and H̃_z with respect to x and y
H̃_y	Derivatives of Ẽ_z and H̃_z with respect to x and y

All four transverse components are given by equations 6.136–6.139 in the excerpt.

🔑 The parameter k_ρ

k_ρ² is defined as: β² - k_z²

Physical interpretation:

β is the "overall" phase propagation constant.
k_z is the phase propagation constant for propagation in the ẑ direction.
The equation β² = k_ρ² + k_z² is an expression of the Pythagorean theorem.
k_ρ must be associated with field variation in directions perpendicular to ẑ.
In cylindrical coordinates, this perpendicular direction is ρ̂, hence the subscript "ρ."
k_ρ plays a special role in determining the structure of fields within the waveguide.

✅ When the method works

Summary: If you know the wave is unidirectional, then knowledge of the components of Ẽ and H̃ in the direction of propagation is sufficient to determine each of the remaining components of Ẽ and H̃.

Requirements:

The wave must be unidirectional.
k_ρ may not be zero (otherwise division by zero occurs).
For non-TEM waves in waveguides, either Ẽ_z or H̃_z must be non-zero, ensuring k_ρ is non-zero.

⚠️ Limitations and special cases

🚫 When the method fails: TEM waves

Don't confuse: These expressions don't work for uniform plane waves or other transverse electromagnetic (TEM) waves.

For a uniform plane wave:

Ẽ_z = H̃_z = 0 (no longitudinal components)
k_z = β (all propagation is in the z-direction)
Therefore k_ρ = 0 (from the definition k_ρ² = β² - k_z²)
This yields "zero divided by zero" for each remaining field component—nonsensical results.

The same issue arises for any TEM wave.

✨ TE and TM decomposition

For even greater simplicity, unidirectional waves can be decomposed into two types:

Mode type	Definition	Meaning
TE (Transverse Electric)	Ẽ_z = 0	Electric field has no component in propagation direction
TM (Transverse Magnetic)	H̃_z = 0	Magnetic field has no component in propagation direction

Advantage: For a wave that is either TE or TM, equations 6.136–6.139 reduce to just one term each, making calculations much simpler.

Example: For a TM mode (H̃_z = 0), all transverse field components depend only on Ẽ_z and its derivatives, with no magnetic field longitudinal component to consider.

📦 Application context: PCB example

📦 Printed circuit board waveguide

The excerpt includes an example of a parallel-plate waveguide formed by a PCB:

1.575 mm-thick dielectric slab (relative permittivity ≈ 4.5) sandwiched between two copper planes.
Copper surfaces treated as perfectly conducting.
Analysis limited to the field deep within the sandwich, away from edges.

🔢 Mode analysis results

Cutoff frequency for mode m: f_c^(m) = m/(2a√(μϵ))
For this PCB: f_c^(m) ≈ (44.9 GHz) × m
Since f_c^(1) is much greater than 10 GHz, only the TM₀ mode can propagate below 10 GHz.
The field can be interpreted as a single plane wave with TM₀ structure.
Phase velocity: v_p = 1/√(μϵ) ≈ 1.41 × 10⁸ m/s ≈ 0.47c (about 47% the speed of light).

This scenario represents a rudimentary form of microstrip transmission line.

Rectangular Waveguide: TM Modes

6.8 Rectangular Waveguide: TM Modes

🧭 Overview

🧠 One-sentence thesis

The TM modes in a rectangular waveguide are completely determined by solving the wave equation for the z-component of the electric field with boundary conditions imposed by perfectly-conducting walls, yielding discrete modes labeled by integer indices m and n.

📌 Key points (3–5)

What TM means: transverse magnetic modes have the magnetic field component in the propagation direction equal to zero (H̃z = 0), so the field is completely determined by the electric field's z-component.
How modes arise: applying boundary conditions (tangential electric field must be zero at perfectly-conducting walls) restricts the solution to discrete modes indexed by positive integers m and n.
Separation of variables technique: the partial differential equation is solved by expressing the field as a product of functions that each depend on only one spatial variable.
Common confusion: not all integer pairs (m,n) yield propagating waves—when m or n is zero, the field is trivial (zero-valued); when m or n is too large for a given frequency, the propagation constant becomes imaginary.
Mode notation: each valid solution is called a mode and labeled TM_mn, where m and n are positive integers determining the spatial variation in the x and y directions.

📐 Geometry and setup

📐 Waveguide structure

A rectangular waveguide is a conducting cylinder with rectangular cross-section.
Walls are located at x = 0, x = a, y = 0, and y = b, so the cross-sectional dimensions are a and b.
The interior is filled with a lossless material with real-valued permeability μ and permittivity ε.
Walls are assumed to be perfectly conducting.

🎯 Problem scope

Analysis is limited to regions free of sources.
Focus is on waves propagating in the +z direction (the other direction follows by symmetry).
The total field is a superposition of unidirectional waves in both directions.

🧲 TM mode definition

🧲 What transverse magnetic means

TM (transverse magnetic): the magnetic field component in the direction of propagation is zero, i.e., H̃z = 0.

"Transverse" means perpendicular to the propagation direction (z-axis).
For TM modes, the magnetic field lies entirely in the transverse plane (x-y plane).
The electric field has a z-component, so it is not transverse.

🔑 Why E_z determines everything

Once the z-component of the electric field (Ẽz) is known, all other field components can be calculated.
The excerpt provides formulas (Equations 6.151–6.154) that express Ex, Ey, Hx, and Hy in terms of derivatives of Ẽz.
This simplification reduces the problem to finding a single scalar function instead of six field components.

🧮 Mathematical solution method

🧮 Starting wave equation

The electric field intensity satisfies the wave equation: ∇²Ẽ + β²Ẽ = 0.
β = ω√(με), where ω is angular frequency.
In Cartesian coordinates, this splits into three separate equations for the x, y, and z components.

🌊 Propagation factor separation

The z-component is expressed as: Ẽz = ẽz(x,y) exp(-jkz·z).
The factor exp(-jkz·z) describes propagation along z with phase constant kz.
The function ẽz(x,y) describes how the field varies in the transverse plane.
Substituting this form into the wave equation yields a partial differential equation for ẽz in x and y only.

✂️ Separation of variables

The technique assumes ẽz(x,y) can be written as a product: X(x)·Y(y).
X depends only on x, and Y depends only on y.
Substituting this product form and dividing through yields an equation where terms depending only on x, only on y, and constants are separated.
This allows splitting into two ordinary differential equations, one for X(x) and one for Y(y).

📝 General solutions

The separated equations are: d²X/dx² + kx²X = 0 and d²Y/dy² + ky²Y = 0.
These are familiar one-dimensional equations with sinusoidal solutions.
General solutions: X = A·cos(kx·x) + B·sin(kx·x) and Y = C·cos(ky·y) + D·sin(ky·y).
The constants kx and ky are related by: kx² + ky² = kρ², where kρ² = β² - kz².

🚧 Boundary conditions and mode quantization

🚧 Perfectly-conducting wall requirement

The tangential component of the electric field must be zero at a perfectly-conducting wall.
The z-component of Ẽ is tangent to all four walls, so Ẽz must be zero at x = 0, x = a, y = 0, and y = b.
These four conditions translate into requirements on X and Y.

🔢 Determining constants

Applying Ẽz(x=0) = 0 requires X(0) = 0, which forces A = 0.
Applying Ẽz(x=a) = 0 then requires sin(kx·a) = 0, so kx = mπ/a for integer m.
Similarly, applying Ẽz(y=0) = 0 forces C = 0.
Applying Ẽz(y=b) = 0 requires sin(ky·b) = 0, so ky = nπ/b for integer n.

🚫 Trivial vs valid modes

When m = 0 or n = 0, the corresponding wavenumber (kx or ky) is zero, leading to zero-valued fields.
These are not physically interesting solutions.
Only positive integer values of m and n (m = 1, 2, 3, ... and n = 1, 2, 3, ...) yield non-trivial modes.
Don't confuse: m and n must both be at least 1 for TM modes; zero values are excluded.

📊 Final mode expressions

📊 Individual mode formula

Each mode is labeled TM_mn and given by:
- Ẽz^(m,n) = E0^(m,n) · sin(mπx/a) · sin(nπy/b) · exp(-jkz^(m,n)·z)
E0^(m,n) is an arbitrary amplitude constant.
The propagation constant for mode (m,n) is: kz^(m,n) = √[ω²με - (mπ/a)² - (nπ/b)²].

📊 Total field as superposition

The most general TM field is a sum over all modes:
- Ẽz = Σ(m=1 to ∞) Σ(n=1 to ∞) Ẽz^(m,n)
Each mode contributes independently.
The amplitude of each mode depends on the excitation conditions (not discussed in the excerpt).

📊 Mode naming convention

Modes are referred to as TM_mn, where m and n are the integer indices.
Example: TM_12 corresponds to m = 1 and n = 2.
This notation immediately tells you the spatial variation pattern in the transverse plane.

⚠️ Propagation constant behavior

⚠️ Real vs imaginary values

The propagation constant kz^(m,n) from the formula is not necessarily real-valued.
For a given frequency ω, if m or n is too large, the term under the square root becomes negative, making kz imaginary.
An imaginary kz corresponds to exponential decay (evanescent mode) rather than propagation.

⚠️ Frequency dependence

For any fixed m, increasing n eventually makes kz imaginary.
For any fixed n, increasing m eventually makes kz imaginary.
This means only a finite number of modes actually propagate at a given frequency.
The excerpt notes this phenomenon is common to both TM and TE modes and is addressed in a separate section (6.10).

Condition	kz^(m,n) character	Physical meaning
ω²με > (mπ/a)² + (nπ/b)²	Real	Propagating mode
ω²με < (mπ/a)² + (nπ/b)²	Imaginary	Evanescent (non-propagating) mode

Rectangular Waveguide: TE Modes

6.9 Rectangular Waveguide: TE Modes

🧭 Overview

🧠 One-sentence thesis

The TE (transverse electric) modes in a rectangular waveguide are completely determined by solving the wave equation with boundary conditions, yielding discrete modes labeled by integers m and n that describe how the magnetic field varies across the waveguide cross-section.

📌 Key points (3–5)

What TE modes are: waves in which the electric field component along the propagation direction (z-direction) is zero, so the electric field is perpendicular (transverse) to propagation.
How modes are found: by solving the wave equation using separation of variables and applying boundary conditions at the perfectly-conducting walls, which forces specific discrete values of wave numbers.
Mode labeling: each valid combination of integers m and n defines a distinct TE mode (written as TE_mn), and the general field is a sum over all these modes.
Common confusion: not all integer pairs are valid—the TE_00 mode has no non-zero electric field and is not of practical interest; also, the wave number k_z can be imaginary for certain m,n combinations, meaning those modes do not propagate.
Why it matters: understanding TE modes is essential for designing waveguides that transport radio frequency signals (especially 3–30 GHz and higher) by controlling which modes can propagate.

📐 Geometry and setup

📐 Waveguide structure

A rectangular waveguide is a conducting cylinder of rectangular cross-section used to guide the propagation of waves.

Walls are located at x = 0, x = a, y = 0, and y = b, so the cross-sectional dimensions are a (width) and b (height).
The interior is filled with a lossless material with real-valued permeability μ and permittivity ε.
Walls are assumed to be perfectly conducting.
The waveguide is commonly used for radio frequency signals in the SHF band (3–30 GHz) and higher.

🎯 Focus on TE modes

TE (transverse electric) component: defined by the property that the electric field component along z is zero (E_z = 0), meaning the electric field is transverse (perpendicular) to the direction of propagation.

The analysis considers waves propagating in the +z direction (the other direction follows by symmetry).
Once the z-component of the magnetic field (H_z) is known, all other field components can be calculated from it.
The region of interest is free of sources.

🧮 Mathematical solution method

🧮 Starting wave equation

The magnetic field intensity in phasor form satisfies the wave equation:

The Laplacian of H plus beta-squared times H equals zero.
Beta equals omega times the square root of μ times ε.
This partial differential equation, combined with boundary conditions from the perfectly-conducting walls, determines a unique solution.

🔀 Separation of variables technique

Separation of variables: a technique recognizing that the field component H_z(x,y) can be written as the product of a function X(x) that depends only on x and a function Y(y) that depends only on y.

The z-dependence is assumed to be of the form exp(-j k_z z), where k_z is the phase propagation constant.
After substituting H_z = X(x) Y(y) exp(-j k_z z) into the wave equation and simplifying, the problem splits into two one-dimensional differential equations: one for X(x) and one for Y(y).
Each equation has the form of a second derivative plus a constant times the function equals zero.
The constants k_x and k_y must satisfy: k_x squared plus k_y squared equals k_ρ squared, where k_ρ squared equals beta squared minus k_z squared.

📝 General solutions before boundary conditions

The one-dimensional differential equations have standard solutions:

X(x) = A cos(k_x x) + B sin(k_x x)
Y(y) = C cos(k_y y) + D sin(k_y y)
A, B, C, D are constants to be determined by boundary conditions.
At this stage, k_x and k_y are also unknown constants.

🚧 Applying boundary conditions

🚧 Boundary condition requirement

Electromagnetic boundary condition: any component of the electric field that is tangent to a perfectly-conducting wall must be zero.

At x = 0 and x = a: the y-component of E (E_y) must be zero.
At y = 0 and y = b: the x-component of E (E_x) must be zero.
Using the relationships between E and H for TE modes, these conditions translate into requirements on the derivatives of X and Y at the walls.

🔢 Determining the constants

After applying the boundary conditions:

The requirement that the derivative of X at x = 0 is zero forces B = 0.
The requirement that the derivative of Y at y = 0 is zero forces D = 0.
The remaining conditions force sin(k_x a) = 0 and sin(k_y b) = 0.
This means k_x = m π / a and k_y = n π / b, where m and n are non-negative integers (0, 1, 2, ...).

Don't confuse: Unlike TM modes (where A = 0 and C = 0, leading to sine functions), TE modes have B = 0 and D = 0, leading to cosine functions in the spatial variation.

🎼 TE mode structure

🎼 General TE field expression

The complete TE magnetic field component along z is a sum over all modes:

H_z = sum over m from 0 to infinity, sum over n from 0 to infinity of H_z^(m,n)
Each mode is: H_z^(m,n) = H_0^(m,n) cos(m π x / a) cos(n π y / b) exp(-j k_z^(m,n) z)
H_0^(m,n) is an arbitrary constant (consolidating A and C) determined by sources and other conditions.

📊 Mode parameters

Parameter	Expression	Meaning
k_x	m π / a	Wave number in x-direction; m = 0, 1, 2, ...
k_y	n π / b	Wave number in y-direction; n = 0, 1, 2, ...
k_z^(m,n)	square root of (ω² μ ε - (m π / a)² - (n π / b)²)	Phase propagation constant along z for mode (m,n)

🏷️ Mode notation

TE_mn notation: customary and convenient way to refer to TE modes in a rectangular waveguide, where m and n are the integer indices.

Example: TE_12 mode is given by the expression with m = 1 and n = 2.
Each pair (m, n) defines a distinct mode with its own spatial pattern and propagation constant.

⚠️ Special cases and limitations

⚠️ The TE_00 mode problem

Although the mathematics allows m = 0 and n = 0 (the TE_00 mode), this mode has no non-zero electric field components:

For TE_00, k_ρ = 0, so E_x would be constant.
But E_x must be zero at y = 0 and y = b to satisfy boundary conditions, so E_x is zero everywhere.
Similarly, E_y must be zero at x = 0 and x = a, so E_y is zero everywhere.
Therefore, TE_00 is not of practical interest.

🌊 Imaginary wave numbers

The propagation constant k_z^(m,n) is not necessarily real-valued:

For any given m, k_z^(m,n) will be imaginary for all n greater than some value.
Similarly, for any given n, k_z^(m,n) will be imaginary for all m greater than some value.
When k_z is imaginary, the mode does not propagate but instead decays exponentially along z.
This phenomenon (addressed in Section 6.10) is common to both TE and TM modes and relates to cutoff frequencies.

Don't confuse: An imaginary k_z does not mean the mode is invalid mathematically, but it does mean the mode is evanescent (non-propagating) rather than propagating.

🔗 Completing the solution

🔗 Finding other field components

Once H_z is known, the remaining non-zero field components can be determined using specific formulas:

E_x = -j (ω μ / k_ρ²) times the partial derivative of H_z with respect to y
E_y = +j (ω μ / k_ρ²) times the partial derivative of H_z with respect to x
H_x = -j (k_z / k_ρ²) times the partial derivative of H_z with respect to x
H_y = -j (k_z / k_ρ²) times the partial derivative of H_z with respect to y
These formulas are simplified versions of more general equations, valid when E_z = 0 (the TE condition).

📚 Summary of TE mode determination

The TE (E_z = 0) component of the unidirectional (+z-traveling) wave in a rectangular waveguide is completely determined by the sum over modes, with each mode defined by the cosine spatial variation, the propagation factor, and the discrete wave numbers.

The solution method: solve the wave equation using separation of variables, apply boundary conditions to find discrete allowed values of k_x and k_y, then sum over all valid modes.
The result is a complete description of how TE waves propagate in the waveguide.

Rectangular Waveguide: Propagation Characteristics

6.10 Rectangular Waveguide: Propagation Characteristics

🧭 Overview

🧠 One-sentence thesis

Rectangular waveguides exhibit cutoff behavior where modes only propagate above specific frequencies, and propagating modes travel with group velocities slower than unbounded media while displaying phase velocities that exceed the speed of light in the same medium.

📌 Key points (3–5)

Cutoff frequency: each mode (m,n) has a minimum frequency below which it cannot propagate and instead decays exponentially.
Phase vs group velocity: phase velocity exceeds the unbounded medium speed, but group velocity (the actual information speed) is always slower.
Common confusion: phase velocity greater than c does not violate physics—phase velocity is not the speed of information transfer, only the speed of constant-phase points.
Dispersion effects: both modal dispersion (different modes travel at different speeds) and chromatic dispersion (same mode's speed varies with frequency) occur.
Why it matters: understanding cutoff and propagation speeds is essential for designing waveguides that efficiently transfer power at intended frequencies.

🚫 Cutoff phenomenon

🚫 What cutoff means

A mode is cut off when its magnitude decreases exponentially with distance rather than maintaining constant amplitude during propagation.

Cutoff occurs when the wavenumber k(m,n) squared becomes negative.
The wave does not effectively convey power through the waveguide in this state.
Since waveguides are intended for efficient power transfer, avoiding cutoff is critical.

🔢 When cutoff happens

The mathematical condition for cutoff:

When omega squared times mu times epsilon is less than (m pi / a) squared plus (n pi / b) squared.
This translates to frequency being below a threshold value.
For higher values of m or n, the cutoff frequency increases.

📏 Cutoff frequency formula

Cutoff frequency f(mn): the lowest frequency for which mode (m,n) is able to propagate (not cut off).

The formula is: f(mn) equals v(pu) divided by 2, times the square root of (m/a) squared plus (n/b) squared.

Where:

v(pu) = 1 / square root of (mu times epsilon) is the phase velocity in unbounded medium with the same properties.
a and b are the waveguide dimensions.
m and n are the mode indices.

Example from WR-90 waveguide:

Air-filled with a = 22.86 mm and b = 10.16 mm.
Lowest cutoff frequency f(10) = 6.557 GHz (the minimum usable frequency).
Next cutoff frequencies: f(20) = 13.114 GHz, f(01) = 14.754 GHz.
Since a > b, the TE(10) mode has the lowest cutoff.

🏃 Phase velocity behavior

🏃 Phase velocity exceeds unbounded speed

Phase velocity v(p) in the waveguide is given by: omega divided by k(m,n).

After mathematical manipulation, this becomes:

v(p) = v(pu) divided by the square root of [1 minus (f(mn)/f) squared].
For any propagating mode, f > f(mn), so v(p) > v(pu).
In vacuum-filled waveguides, v(p) > c (speed of light).

🤔 Why this doesn't violate physics

Phase velocity is not the speed at which information travels.
It is merely the speed at which a point of constant phase moves.
The complex field structure creates phase points that travel faster than information can be conveyed.
To send information, you must create a disturbance in the sinusoidal excitation.
No fundamental physical principles are violated.

Don't confuse: Phase velocity with the actual speed of signal propagation—they are different quantities in waveguides.

📡 Group velocity and information speed

📡 What group velocity represents

Group velocity v(g): the speed at which information actually travels through the waveguide.

The formula is: v(g) = v(pu) times the square root of [1 minus (f(mn)/f) squared].

Key characteristics:

Always less than v(pu) for propagating modes.
In unbounded space, v(g) = v(p), but not in waveguides.
This is the physically meaningful speed for signal transmission.

Example from WR-90 at 10 GHz:

Only TE(10) mode propagates (f(10) = 6.557 GHz, f(20) = 13.114 GHz).
Group velocity calculated as approximately 2.26 × 10⁸ m/s.
This is about 75.5% of c (speed of light).

🌈 Two types of dispersion

Dispersion type	What it means	Cause
Modal dispersion	Different modes travel at different speeds	f(mn) varies by mode; higher-order modes propagate more slowly
Chromatic dispersion	Same mode's speed varies with frequency	Group velocity formula depends on frequency f

Both effects mean that:

Speed depends on the ratio f(mn)/f.
Generally decreases with increasing frequency for any given mode.
Signals may distort if they contain multiple modes or broad frequency content.

⚡ Practical speed summary

The excerpt emphasizes:

Signal speed in rectangular waveguide = group velocity of the associated mode.
This speed is less than propagation speed in unbounded media with same properties.
Speed depends on f(mn)/f ratio and generally decreases with increasing frequency for a given mode.

7.1 Parallel Wire Transmission Line

🧭 Overview

🧠 One-sentence thesis

Parallel wire transmission line offers a low-cost, low-loss alternative to coaxial cable for radio applications up to about 100 MHz, but trades self-shielding for differential symmetry and is best suited for differential signal sources and loads.

📌 Key points (3–5)

Physical structure: two wires separated by a dielectric spacer (commonly "twin lead"), with wire diameter d and center-to-center spacing D.
Key trade-offs vs coaxial: lower cost and lower loss, but lacks self-shielding—fields are exposed and can interact with nearby structures.
Differential nature: symmetric conductor geometry with no favored ground, making it ideal for differential sources/loads like dipole antennas and differential amplifiers.
Common confusion: parallel wire line is differential (symmetric, no ground preference), whereas coaxial line is single-ended (outer conductor is the datum/ground).
Characteristic impedance: tends to be large (e.g., 300 Ω) because the ratio D/d is typically large; increases with increasing wire spacing relative to diameter.

📐 Physical structure and design parameters

📐 Twin lead construction

A parallel wire transmission line consists of wires separated by a dielectric spacer.

Common implementation: "twin lead"—wires held in place by a mechanical spacer made of the same low-loss dielectric material that jackets each wire.
Why the spacer can be neglected: very little total energy from the electric and magnetic fields lies inside the jacket/spacer material, so it is usually ignored for analysis and electrical design.
Example: the spacer keeps the wires at a fixed distance but does not significantly affect the electromagnetic behavior.

🔧 Key design parameters (Figure 7.2)

Wire diameter: d
Center-to-center spacing: D
Typically D is much greater than d (D ≫ d), which simplifies calculations and leads to higher characteristic impedance.

⚡ Field structure and propagation

⚡ TEM mode

The associated field structure is transverse electromagnetic (TEM) and is therefore completely described by a single cross-section along the line.

The electric and magnetic fields lie entirely in the plane perpendicular to the direction of propagation (Figure 7.3).
Because it is TEM, a single cross-section captures the entire field pattern.
Expressions for these fields exist but are complex and not particularly useful except for calculating other parameters like characteristic impedance.

🚀 Phase velocity

Under the assumption that the wire jacket/spacer material has negligible effect and the line is suspended in air (relative permittivity ε_r ≈ 1), the phase velocity v_p is approximately c (the speed of light in free space).
In practical twin-lead: the plastic jacket/spacer reduces phase velocity by a few percent up to about 20%, so v_p ≈ 0.8c to 0.9c depending on materials and construction details.

🔌 Characteristic impedance

🔌 Lumped element model

The characteristic impedance is determined using the lumped element transmission line model:

Z₀ = square root of [(R′ + jωL′) / (G′ + jωC′)]

where R′, G′, C′, and L′ are resistance, conductance, capacitance, and inductance per unit length, respectively.

🧮 Low-loss approximation

Assuming "low-loss" conditions: R′ ≪ ωL′ and G′ ≪ ωC′
Then the characteristic impedance simplifies to:

Z₀ ≈ square root of (L′ / C′)

The problem reduces to determining inductance and capacitance per unit length.

📏 Formulas for L′ and C′

Inductance per unit length:
L′ = (μ₀ / π) × ln[(D/d) + square root of ((D/d)² − 1)]
Capacitance per unit length:
C′ = (πε) / ln[(D/d) + square root of ((D/d)² − 1)]
Simplified for D ≫ d: Because wire separation D is typically much greater than wire diameter d, the square root term simplifies to D/d, yielding:
- L′ ≈ (μ₀ / π) × ln(2D/d)
- C′ ≈ (πε) / ln(2D/d)

📊 Final expression for Z₀

Combining the simplified inductance and capacitance:

Z₀ ≈ (1/π) × (η₀ / square root of ε_r) × ln(2D/d)

where η₀ = square root of (μ₀ / ε₀) is the intrinsic impedance of free space, and ε = ε_r × ε₀.

Key observation: characteristic impedance increases with increasing D/d ratio.
Because this ratio is large, parallel wire line tends to have large characteristic impedance relative to other TEM transmission lines (e.g., coaxial line, microstrip line).

🔢 Example: 300 Ω twin-lead

Common implementation: 300 Ω twin-lead
Typical dimensions: wire diameter ≈ 1 mm, wire spacing ≈ 6 mm
Relative permittivity: ε_r ≈ 1 (jacket/spacer have small effect on fields)
Calculated impedance: using the formula, Z₀ ≈ 298 Ω, as expected.

🆚 Comparison with coaxial line

🆚 Advantages and disadvantages

Feature	Parallel wire line	Coaxial line
Cost	Lower	Higher
Loss (up to ~100 MHz)	Lower	Higher
Self-shielding	No—fields are exposed and prone to interaction with nearby structures	Yes—outer conductor isolates fields
Signal type	Differential (symmetric, no favored ground)	Single-ended (outer conductor is datum/ground)
Typical applications	Dipole antennas, differential amplifiers	General-purpose, where shielding is needed

🔄 Differential vs single-ended

Parallel wire line is differential: conductor geometry is symmetric, and neither conductor is favored as a signal datum ("ground").
Coaxial line is single-ended: conductors have different cross-sections, and the outer conductor is favored as the datum.
Don't confuse: differential does not mean "two signals"; it means the two conductors are symmetric and neither is designated as ground.
Example: parallel wire line is commonly used with differential signal sources/loads (e.g., dipole antenna, differential amplifier), where symmetry is important.

🚫 Limitation: lack of self-shielding

The electromagnetic fields of parallel wire line are exposed and prone to interaction with nearby structures and devices.
This prevents the use of parallel wire line in many applications where isolation is required.
Example: in environments with many nearby electronic devices, coaxial line is preferred to avoid interference.

🎯 Typical applications

🎯 Frequency range and use cases

Frequency range: often employed in radio applications up to about 100 MHz
Why this range: lower cost and lower loss than coaxial line in this frequency range
Common applications:
- Dipole antennas (differential structure matches the line's differential nature)
- Differential amplifiers (symmetric signal handling)
- Any application where signal sources and/or loads are also differential

Microstrip Line Redux

7.2 Microstrip Line Redux

🧭 Overview

🧠 One-sentence thesis

The characteristic impedance of microstrip transmission lines depends primarily on the ratio of substrate height to trace width (h/W) and the dielectric's relative permittivity, with different approximation methods applicable depending on whether the trace is wide, narrow, or intermediate relative to the substrate thickness.

📌 Key points (3–5)

What microstrip is: a narrow metallic trace separated from a ground plane by a dielectric slab, commonly used in printed circuit boards; operates in TM₀ mode with TEM field structure.
How Z₀ depends on geometry: characteristic impedance scales roughly in proportion to h/W (substrate height over trace width) and inversely with the square root of relative permittivity.
Three analysis regimes: wide (W ≫ h), narrow (W ≪ h), and intermediate (W ~ h) cases require different approximation methods.
Common confusion: microstrip vs stripline—these are distinct transmission line types; microstrip is single-ended with asymmetric geometry and one conductor serving as ground.
Effective permittivity concept: because fields exist in both the dielectric and air above, wavelength and phase velocity are determined by an effective relative permittivity (roughly the average of the dielectric and air).

🏗️ Structure and operating principles

🏗️ Physical structure

Components: narrow metallic trace (width W, thickness t), dielectric slab (height h, relative permittivity εᵣ typically 2–10), and metallic ground plane.
The dielectric typically has permeability approximately equal to free space (μ ≈ μ₀).
The structure resembles a parallel plate waveguide but with one "plate" (the trace) having finite width.

⚡ Field structure and TEM operation

Microstrip lines nominally exhibit transverse electromagnetic (TEM) field structure, operating below the cutoff frequency of all but the TM₀ mode (whose cutoff frequency is zero).

TM₀ mode: takes the form of a uniform plane wave with electric field perpendicular to the plates and magnetic field parallel to the plates.
The propagation direction E × H always points along the transmission line axis.
Fringing fields: the limited trace width W causes significant field deviations beyond the trace edges and above the trace; these fringing fields can significantly affect characteristic impedance Z₀.

🔌 Single-ended configuration

The conductor geometry is asymmetric.
The ground plane serves as ground for both the transmission line and the source/load.
This distinguishes microstrip from other transmission line types like stripline.

📏 Wide microstrip case (W ≫ h)

📐 Field distribution

Most energy lies directly underneath the trace.
Fringing fields are relatively negligible.
The structure resembles a parallel-plate capacitor with plate spacing d = h and area A = W·l (where l is length).

🧮 Deriving Z₀ for wide lines

For low-loss transmission lines, the characteristic impedance simplifies to:

Z₀ ≈ √(L′/C′)

where L′ and C′ are inductance and capacitance per unit length.

Capacitance per unit length:

Treating the structure as a parallel-plate capacitor: C ≈ εA/d
For microstrip: C′ ≈ εW/h (when W ≫ h)

Inductance per unit length:

Using Ampere's law and boundary conditions, the magnetic flux between trace and ground plane is Φ ≈ μ₀Hhl
The boundary condition requires surface current density Jₛ ≈ H, so current I ≈ HW
Inductance L = Φ/I ≈ μ₀hl/W
Therefore L′ ≈ μ₀h/W (when W ≫ h)

Final result:

Z₀ ≈ (η₀/√εᵣ) · (h/W) for W ≫ h

where η₀ is the free-space wave impedance (√(μ₀/ε₀)).

📊 Key insight

Characteristic impedance is proportional to h/W.
This h/W dependence appears in all three cases (wide, narrow, intermediate).

📏 Narrow microstrip case (W ≪ h)

🌊 Field distribution

Much of the energy lies beyond and above the trace, not directly underneath.
The field structure is complex but can be modeled by considering boundary conditions: tangential E must be zero on trace and ground plane surfaces.

🔗 Parallel wire line analogy

The fields in narrow microstrip resemble those of a parallel wire transmission line with:

Wire diameter d = W
Wire spacing D = 2h

Modeling approach:

The fields in the dielectric spacer are similar to those between the wires of a parallel wire line.
Introducing the ground plane does not perturb the upper half-space fields, since they already satisfy the required boundary conditions.
This analogy provides a rough guide to field structure, at least in the dielectric region.

🧮 Approximate Z₀ for narrow lines

Using the parallel wire line analogy:

Z₀ ~ (1/π) · (η₀/√εᵣ) · ln(4h/W) for W ≪ h

Accuracy note:

This estimate provides only "order of magnitude" accuracy (hence the "~" symbol).
About half the relevant field space is ignored in this approximation.
However, it accurately captures the dependence on εᵣ and h/W, useful for design adjustments.

📏 Intermediate case (W ~ h)

🎯 Wheeler 1977 formula

For the intermediate case where h and W are equal to within an order of magnitude, a widely-accepted formula provides accurate estimates across the full range of h/W:

Z₀ ≈ (42.4 Ω / √(εᵣ + 1)) × ln[1 + (4h/W′) · (K + √(K² + (1 + 1/εᵣ)²π²/2))]

where:

K = (14 + 8/εᵣ) / 11 · (4h/W′)
W′ is W adjusted for trace thickness t (typically W′ ≈ W when t ≪ W and t ≪ h)

🔍 Formula characteristics

Contains the factor 4h/W, consistent with the narrow approximation.
Shows Z₀ increases with increasing h/W and decreasing εᵣ, as in both wide and narrow cases.
Valid over the full range of h/W, not just the intermediate case.

📊 Comparison of approximations

Case	Accuracy	Best use
Wide approximation	Accurate only for h/W < 0.1 or so	Understanding h/W dependence
Narrow approximation	Overestimates by ~40%, but correct trend	Understanding scaling behavior
Wheeler 1977	Accurate across full range	Precise design calculations

Don't confuse: The wide and narrow approximations are not meant for precise calculations in the intermediate range; they serve as useful guides for understanding how Z₀ changes with parameters, especially when making adjustments to an existing design.

🔧 Practical design example: FR4 circuit boards

📐 FR4 parameters

FR4 printed circuit boards have:

Substrate thickness h ≈ 1.575 mm
Relative permittivity εᵣ ≈ 4.5

🎯 50 Ω design rule

In FR4 construction (substrate thickness 1.575 mm, relative permittivity ≈ 4.5, negligible trace thickness), Z₀ ≈ 50 Ω requires a trace width of about 3 mm.

Z₀ scales roughly in proportion to h/W around this value.
This is a commonly-used result for practical microstrip design.

Example: For h/W ≈ 0.5 in FR4, the characteristic impedance is approximately 50 Ω, meaning trace width is about 3 mm.

📊 Approximation performance in FR4

Wide approximation: accurate only for h/W < 0.1
Narrow approximation: overestimates by factor of ~1.4 but shows correct rate of increase with h/W
Wheeler 1977: accurate across the entire range

🌊 Wavelength and phase velocity

🔄 Effective relative permittivity concept

Because the guided wave exists in both the dielectric spacer (εᵣ) and the air above (εᵣ = 1), an effective relative permittivity is used:

εᵣ,eff ≈ (εᵣ + 1) / 2

This is a crude approximation—the average of the dielectric's relative permittivity and free space.

Why this matters: The electromagnetic field structure in microstrip exhibits TEM behavior similar to uniform plane waves, but the wave exists in two different media.

📏 Phase propagation constant

The phase propagation constant β can be approximated as:

β ≈ β₀ · √εᵣ,eff

where β₀ = ω√(μ₀ε₀) is the free-space phase propagation constant.

In other words, β in microstrip equals the free-space value times a correction factor √εᵣ,eff.

🌊 Wavelength formula

λ = λ₀ / √εᵣ,eff

where λ₀ is the free-space wavelength c/f.

Wavelength in microstrip is shorter than in free space by a factor of √εᵣ,eff.

🚀 Phase velocity formula

vₚ = c / √εᵣ,eff

Phase velocity in microstrip is slower than c by a factor of √εᵣ,eff.

📊 FR4 example

For FR4 with εᵣ ≈ 4.5:

εᵣ,eff ≈ (4.5 + 1) / 2 = 2.75
Estimated phase velocity: c / √2.75 ≈ 60% of c
Estimated wavelength: 60% of free-space wavelength
In practice: values are typically slightly less (50%–55%) due to the crude approximation of εᵣ,eff

Don't confuse: The effective permittivity is not the actual permittivity of either material; it is a modeling tool to account for the wave existing in two different media simultaneously. Manufacturing variations in εᵣ typically make more precise estimates of εᵣ,eff less relevant in practice.

Attenuation in Coaxial Cable

7.3 Attenuation in Coaxial Cable

🧭 Overview

🧠 One-sentence thesis

Attenuation in coaxial cable arises from conductor resistance and spacer conductance, and can be minimized by choosing specific ratios of inner-to-outer conductor radii (between 2.72 and 3.59) that correspond to characteristic impedances in the range of approximately 40–77 Ω depending on the dielectric.

📌 Key points (3–5)

Two sources of loss: resistance per unit length R′ (from inner and outer conductors) and conductance per unit length G′ (from current leaking through the spacer material).
Total attenuation splits into two components: α = α_R + α_G, where α_R comes from conductor resistance and α_G comes from spacer conductance.
Geometry matters for α_R but not α_G: the ratio b/a (outer-to-inner radius) affects conductor resistance loss but has no effect on spacer conductance loss.
Common confusion—dielectric permittivity tradeoff: minimizing α_R requires minimizing ε_r, but minimizing α_G requires maximizing ε_r, creating a frequency-dependent tradeoff.
Optimum geometry exists: the ratio b/a that minimizes attenuation lies between e ≈ 2.72 and 3.59, yielding characteristic impedances roughly 40–77 Ω (scaled by √ε_r).

🔧 Physical parameters and structure

🔧 Coaxial cable geometry

The excerpt defines the following parameters for coaxial cable design:

Symbol	Meaning
a	Radius of the inner conductor
b	Radius of the outer conductor
σ_ic	Conductivity of the inner conductor (S/m)
σ_oc	Conductivity of the outer conductor (S/m)
σ_s	Conductivity of the spacer (dielectric) material (S/m)
ε_r	Relative permittivity of the spacer
μ	Permeability (assumed μ₀ for non-magnetic conductors)

The conductors are assumed non-magnetic.
The spacer is a lossy dielectric with both permittivity and conductivity.

🔧 Lumped-element interpretation

In the lumped-element equivalent circuit model, R′ represents the physical resistance in the inner and outer conductors, whereas G′ represents loss due to current flowing directly between the conductors through the spacer material.

R′ = resistance per unit length (Ω/m).
G′ = conductance per unit length (S/m).
These two parameters capture the two physical mechanisms of attenuation.

⚡ Resistance per unit length (R′)

⚡ How R′ arises from skin effect

At high frequencies, current flows only in a thin layer near the conductor surface (skin depth δ).

Inner conductor: R′_ic ≈ 1 / (2πa · δ_ic · σ_ic), valid when δ_ic ≪ a.
Outer conductor: R′_oc ≈ 1 / (2πb · δ_oc · σ_oc), valid when δ_oc ≪ t (thickness of outer conductor).
Total: R′ = R′_ic + R′_oc.

The skin depth depends on conductivity:

δ_ic = √(2 / ω μ₀ σ_ic)
δ_oc = √(2 / ω μ₀ σ_oc)

Don't confuse: the effective cross-sectional area is the circumference times the skin depth, not the full conductor cross-section, because current concentrates near the surface at high frequency.

⚡ Two design cases

The excerpt identifies two cases based on the relative conductivity of the outer conductor:

Case	Condition	Simplified R′	Constant C
Case I	σ_oc ≫ σ_ic	R′ ≈ (1 / 2πδ_ic σ_ic) · (1/a)	C = 0
Case II	σ_oc = σ_ic	R′ ≈ (1 / 2πδ_ic σ_ic) · (1/a + 1/b)	C = 1

Both cases can be written as R′ ≈ (1 / 2πδ_ic σ_ic) · (1/a + C/b).
Example: if the outer conductor is much more conductive (e.g., a thick copper braid), Case I applies and the outer conductor contributes negligibly to R′.

🌊 Conductance per unit length (G′)

🌊 G′ from spacer leakage

The conductance per unit length of coaxial cable is simply that of the associated coaxial structure at DC.

G′ = 2π σ_s / ln(b/a).
Unlike resistance, conductance is independent of frequency (as long as σ_s does not vary with frequency).
This represents current leaking radially through the lossy dielectric spacer between the conductors.

🌊 Why geometry matters less for G′

The formula shows G′ depends on the ratio b/a through the logarithm, but the excerpt later shows that when computing α_G, the ln(b/a) terms cancel out.
Result: α_G ≈ (η₀ / 2) · (σ_s / √ε_r), which is independent of a and b.
Example: doubling both a and b leaves G′ nearly unchanged because the ratio b/a stays constant.

📉 Attenuation constant (α)

📉 Splitting α into two components

The attenuation factor for voltage and current waves is e^(−αz), where z is distance along the cable.

The excerpt defines α_R associated with R′ and α_G associated with G′.
Key relationship: α = α_R + α_G.
This split allows independent analysis of conductor loss and spacer loss.

📉 Approximate formulas

The excerpt postulates (and validates via Example 7.4 for RG-59 cable):

α_R ≈ K_R · R′ / Z₀
α_G ≈ K_G · G′ · Z₀

where Z₀ is the characteristic impedance:

Z₀ ≈ (η₀ / 2π) · (1 / √ε_r) · ln(b/a) (low-loss approximation).

Validation: the example shows that K_R = K_G = 1/2 gives excellent agreement with the exact formula from the transmission-line equations.

Final result:

α ≈ α_G + α_R, where α_G = R′ / (2 Z₀) and α_R = G′ Z₀ / 2.

📉 Frequency dependence

α_R increases with frequency (because skin depth decreases, raising R′).
α_G is frequency-independent (because G′ does not depend on frequency).
At sufficiently high frequency, α_R dominates over α_G (as shown in Figure 7.12 for RG-59).

Don't confuse: at low frequency, spacer loss (α_G) may be significant; at high frequency, conductor loss (α_R) dominates.

🎯 Minimizing attenuation

🎯 Minimizing α_G (spacer loss)

Expanding α_G:

α_G ≈ (η₀ / 2) · (σ_s / √ε_r).
To minimize α_G: minimize the ratio σ_s / √ε_r.
Interestingly, the physical dimensions a and b have no discernible effect on α_G.

🎯 Minimizing α_R (conductor loss)

Expanding α_R:

α_R = (1 / 2√2) · η₀ · √(ω μ₀ ε_r / σ_ic) · [(1/a + C/b) / ln(b/a)].
To minimize α_R: minimize ε_r / σ_ic and choose the optimal geometry (ratio b/a).
Maximizing σ_ic (conductor conductivity) is expected; minimizing ε_r is less obvious.

🎯 The dielectric tradeoff

α_G is minimized by maximizing ε_r.
α_R is minimized by minimizing ε_r.
Common confusion: there is no single "best" ε_r; the optimum depends on frequency (since α_R dominates at high frequency, minimizing ε_r is preferred for high-frequency applications).
The excerpt notes that σ_s may vary with ε_r, so a general conclusion about optimum σ_s and ε_r is not appropriate without knowing material properties.

🎯 Optimal geometry (b/a ratio)

To find the geometry that minimizes α_R, the excerpt takes the derivative of α_R with respect to a, sets it to zero, and solves:

Result: ln(b/a) = 1 + C·(b/a).
Case I (C = 0, σ_oc ≫ σ_ic): b/a = e ≈ 2.72.
Case II (C = 1, σ_oc = σ_ic): b/a ≈ 3.59.

Interpretation: the optimum ratio of outer-to-inner radius lies between 2.72 and 3.59, depending on the relative conductivity of the two conductors.

🎯 Corresponding characteristic impedances

Substituting the optimal b/a values into the formula for Z₀:

Z₀ ≈ 59.9 Ω / √ε_r to 76.6 Ω / √ε_r.
For air-filled cable (ε_r = 1), the optimum Z₀ is roughly 60–77 Ω.
For dielectric-filled cable (ε_r > 1), the optimum impedance is lower.

Example: RG-59 cable has ε_r ≈ 2.25 and C = 0, so the attenuation-optimized impedance would be Z₀ ≈ 39.9 Ω. The actual RG-59 impedance is about 75 Ω, indicating that RG-59 is not optimized for attenuation; other considerations (power handling, standard values) apply.

📐 Mathematical details (addendum)

📐 Derivative used in optimization

To evaluate the derivative ∂α_R / ∂a, the excerpt needs:

∂/∂a [a² ln(b/a)].

Using the chain rule:

∂/∂a [a² ln(b/a)] = [∂/∂a a²] · ln(b/a) + a² · [∂/∂a ln(b/a)].
∂/∂a a² = 2a.
∂/∂a ln(b/a) = ∂/∂a [ln(b) − ln(a)] = −1/a.
Result: ∂/∂a [a² ln(b/a)] = 2a ln(b/a) − a.

This result is substituted into the optimization equation to find the condition ln(b/a) = 1 + C·(b/a).

📐 Setting the derivative to zero

After substitution, the factor in square brackets must equal zero:

[−(1/a²) ln(b/a) + (1/a + C/b) / (a ln²(b/a))] = 0.
Simplifying yields ln(b/a) = 1 + C·(b/a).
This transcendental equation is solved numerically or by trial and error for Case II (C = 1), yielding b/a ≈ 3.59.

Don't confuse: the derivative is with respect to a, but the result is expressed as a ratio b/a because that ratio determines both Z₀ and the geometry.

Power Handling Capability of Coaxial Cable

7.4 Power Handling Capability of Coaxial Cable

🧭 Overview

🧠 One-sentence thesis

The power handling capability of coaxial cable is maximized when the ratio of the outer conductor radius to the inner conductor radius is approximately 1.65, corresponding to a characteristic impedance of about 30 Ω for air-filled cables.

📌 Key points (3–5)

What limits power handling: electric field breakdown and arcing in the dielectric when the field becomes too large.
Optimal geometry: the ratio b/a (outer radius to inner radius) of approximately 1.65 maximizes safe power transfer for a given maximum electric field.
Optimal impedance: for air-filled cables, the power-optimized impedance is about 30 Ω; for dielectric-filled cables it is lower (30 Ω divided by the square root of the relative permittivity).
Common confusion: higher permittivity might seem better for power handling from the formula, but materials with higher permittivity often have higher conductivity in the spacer, causing more ohmic heating—so high-power RF cables often use air despite its lowest permittivity.
Trade-offs in design: coaxial cables are not always optimized for power handling; other considerations include attenuation minimization and standard impedance values.

⚡ What power handling means

⚡ Definition and limitation

Power handling: the maximum power that can be safely transferred by a transmission line.

Power is limited because excessive electric field causes dielectric breakdown and arcing.
Breakdown can damage the line and connected devices, so it must be avoided.
The design question: how to maximize the safe power P_max for a given maximum safe electric field E_pk?

🔬 The design approach

The analysis starts by finding the electric potential V within the cable using Laplace's equation (the derivative of V with respect to angle φ and position z are both zero due to symmetry and uniformity).
The solution shows that the electric field intensity E is radial and inversely proportional to the radial distance ρ.
The maximum electric field occurs at the surface of the inner conductor (at radius a).

📐 Deriving the optimal geometry

📐 Electric field at maximum

The maximum electric field intensity is:

E_pk = V₀ / (a · ln(b/a))
where V₀ is the voltage between inner and outer conductors, a is the inner conductor radius, and b is the outer conductor radius.

🔋 Maximum safe power

When source and load impedances are matched to the characteristic impedance Z₀, the power transferred is V₀² / (2Z₀).

The maximum safe power is:

P_max = (π · E²_pk · a² · ln(b/a)) / (η₀ / √ε_r)
where η₀ is the intrinsic impedance of free space and ε_r is the relative permittivity of the dielectric.

🎯 Finding the optimum

To maximize P_max, take the derivative with respect to a and set it to zero:

The derivative yields: 2a · ln(b/a) − a = 0
Simplifying: 2 ln(b/a) − 1 = 0
Solving for the ratio: b/a = √e ≈ 1.65

Example: If the inner conductor has radius 1 mm, the optimal outer conductor radius for power handling is about 1.65 mm.

🔌 Optimal impedance for power handling

🔌 Impedance at optimal ratio

Substituting b/a = √e into the characteristic impedance formula:

Z₀ ≈ 30.0 Ω / √ε_r

For air-filled cables (ε_r = 1):

Z₀ ≈ 30 Ω

For dielectric-filled cables (ε_r ≥ 1):

The optimum impedance is less than 30 Ω.

📊 Summary of power-handling optimization

Parameter	Optimal value	Notes
Radius ratio b/a	≈ 1.65	Maximizes safe power transfer
Impedance (air-filled)	≈ 30 Ω	Typical for high-power applications
Impedance (dielectric-filled)	< 30 Ω	Decreases with √ε_r

⚠️ Practical considerations and caveats

⚠️ Why higher permittivity isn't always better

The formula suggests maximizing ε_r to maximize power handling, but:

Materials with higher ε_r may also have higher conductivity σ_s in the spacer.
Higher spacer conductivity means more current flows through the spacer, causing more ohmic heating.
This problem is so severe that high-power RF cables often use air as the spacer, even though air has the lowest possible ε_r.

🔥 Factors not fully constrained

The conductivities of the inner and outer conductors (σ_ic and σ_oc) do not appear in the power-handling formula.
However, limited conductivity can cause significant ohmic heating in the conductors, which this analysis does not consider.
Don't confuse: the analysis optimizes for dielectric breakdown, not for thermal limits from conductor heating.

🎯 Actionable finding

The main actionable result is the ratio of radii (b/a ≈ 1.65); other parameters require additional analysis beyond dielectric breakdown considerations.

Why 50 Ohms?

7.5 Why 50 Ohms?

🧭 Overview

🧠 One-sentence thesis

50 Ω emerged as the standard characteristic impedance for transmission lines and RF components because it represents a practical compromise among competing technical requirements—minimizing attenuation, maximizing power handling, and matching common antenna impedances—rather than being inherently superior to other values.

📌 Key points (3–5)

The short answer: Nothing is inherently special about 50 Ω; it is a round-number compromise chosen from a range of useful impedances (10s of ohms) to avoid manufacturing products for every possible value.
Attenuation optimization: Coaxial cable attenuation is minimized at 60–77 Ω for air-filled cable and 40–50 Ω for plastic-filled cable (relative permittivity ≈ 2.25).
Power handling optimization: Power handling capability peaks at about 30 Ω for air-filled coaxial cable (outer-to-inner conductor radius ratio ≈ 1.65).
Common confusion: 50 Ω vs 75 Ω—both are standards, but 75 Ω is preferred when attenuation is the primary concern and for certain antenna applications.
Antenna compatibility: 50 Ω is reasonably close to quarter-wave monopole impedance (≈36 + j21 Ω) and half-wave dipole impedance (≈73 + j42 Ω), making it a balanced choice for both.

🎯 The compromise nature of 50 Ω

🎯 Why no single "perfect" impedance exists

The excerpt emphasizes that 50 Ω is not optimal for any single technical criterion.
Different applications prioritize different factors: attenuation, power handling, or antenna matching.
Engineers settled on 50 Ω as a round number near the middle of the useful range (10s of ohms).
It is not practical to manufacture products for every impedance in the useful range, so a few standard values were chosen.

🔄 The trade-off between attenuation and power handling

Attenuation-optimized impedance: 60–77 Ω for air-filled cable.
Power-handling-optimized impedance: about 30 Ω for air-filled cable.
50 Ω sits between these two extremes, making it a compromise for air-filled coaxial cables.
Example: If you need both low attenuation and high power handling, neither 30 Ω nor 77 Ω is ideal—50 Ω balances both needs.

📉 Attenuation considerations

📉 Minimum attenuation range

Attenuation in coaxial cable is minimized for characteristic impedances in the range (60 Ω) / √ǫᵣ to (77 Ω) / √ǫᵣ, where ǫᵣ is the relative permittivity of the spacer material.

For air-filled cable (ǫᵣ ≈ 1): optimal range is 60–77 Ω.
For plastic-filled cable (ǫᵣ ≈ 2.25): optimal range is about 40–50 Ω.
50 Ω is clearly reasonable if a single standard value is to be established for all such cable.

🧪 Why 75 Ω is a secondary standard

75 Ω is closer to the attenuation-optimized range (60–77 Ω) for air-filled cables.
The excerpt notes that one can make a case for 75 Ω especially for applications where attenuation is the primary consideration.
Don't confuse: 50 Ω is the general-purpose standard; 75 Ω is preferred when loss minimization is critical.

⚡ Power handling considerations

⚡ Optimal power handling impedance

Power handling capability of coaxial cable is optimized when the ratio of radii of the outer to inner conductors b/a is about 1.65. For air-filled cables typically used in high-power applications, this corresponds to a characteristic impedance of about 30 Ω.

30 Ω is significantly less than the 60–77 Ω that minimizes attenuation.
High-power applications often use air-filled cables, where the trade-off is most visible.
50 Ω can be viewed as a compromise: it sacrifices some power handling (compared to 30 Ω) and some attenuation performance (compared to 60–77 Ω).

📡 Antenna compatibility

📡 Common antenna impedances

The excerpt lists three commonly-encountered antennas and their impedances:

Antenna type	Impedance	Closest standard
Half-wave dipole	≈73 + j42 Ω	75 Ω
Quarter-wave monopole	≈36 + j21 Ω	50 Ω
Folded half-wave dipole	≈300 Ω	300 Ω (balanced) or 75 Ω (via balun)

📡 Why 50 Ω is a balanced choice

50 Ω is reasonably close to the quarter-wave monopole impedance (≈36 + j21 Ω).
75 Ω is very close to the half-wave dipole impedance (≈73 + j42 Ω).
If you desire a single characteristic impedance equally convenient for applications involving either type of antenna, then 50 Ω is a reasonable choice.
Example: A system that might connect to either a monopole or a dipole benefits from 50 Ω as a middle ground.

📡 The folded dipole and 75 Ω

Folded half-wave dipoles have an impedance of about 300 Ω and are balanced (not single-ended).
They are commonly used in FM, TV, and land mobile radio base stations.
A balun (balanced-to-unbalanced converter) can step down impedance by a factor of 4 (from 300 Ω to 75 Ω) easily and inexpensively.
This creates an additional application for 75 Ω coaxial line.

🛠️ Practical manufacturing considerations

🛠️ Microstrip transmission line

It is quite simple to implement microstrip transmission line with characteristic impedance in the range 30–75 Ω.
Example: 50 Ω on commonly-used 1.575 mm FR4 (a printed circuit board material) requires a width-to-height ratio of about 2, so the trace is about 3 mm wide.
This is a very manageable size and easily implemented in printed circuit board designs.
Don't confuse: This is about ease of fabrication, not electrical performance—50 Ω happens to correspond to practical physical dimensions.

🛠️ Why standardization matters

Manufacturing and selling products for every possible impedance in the useful range is not practical or efficient.
Engineers settled on 50 Ω and a few other values (75 Ω, 300 Ω) to accommodate the smaller number of applications with specific compelling considerations.
Standardization enables interoperability: signal sources, amplifiers, filters, antennas, and other RF components can all specify the same port impedance.

Optical Fiber: Method of Operation

8.1 Optical Fiber: Method of Operation

🧭 Overview

🧠 One-sentence thesis

Optical fiber confines light through total internal reflection, which occurs when light traveling in a higher-permittivity core strikes the boundary with lower-permittivity cladding at a sufficiently shallow angle.

📌 Key points (3–5)

Basic structure: optical fiber consists of a transparent dielectric core surrounded by cladding with lower permittivity.
Key mechanism: total internal reflection traps light within the fiber when the angle of incidence meets a critical threshold.
Material requirement: the fiber core must have higher permittivity (higher index of refraction) than the cladding.
Critical angle criterion: light must approach the boundary at an angle greater than or equal to arcsin(n_c / n_f) to remain confined.
Common confusion: the fiber appears as an unbounded half-space to the light ray because the fiber diameter is about 10,000 times larger than the optical wavelength.

🏗️ Structure and materials

🔧 Physical construction

Optical fiber in its simplest form: concentric regions of dielectric material with a circular transparent core surrounded by a jacket called cladding.

The core is the central circular region through which light propagates.
The cladding is the outer dielectric jacket surrounding the core.
Both regions are transparent dielectric materials.

📐 Index of refraction

Index of refraction (n): the square root of relative permittivity.

The excerpt defines n_f = √ε_r,f for the fiber core.
Similarly, n_c = √ε_r,c for the cladding.
The design requirement is n_f > n_c (fiber index greater than cladding index).
This inequality creates the conditions necessary for total internal reflection.

Example: Typical values are n_f = 1.52 and n_c = 1.49.

🔄 Total internal reflection mechanism

💡 How light is confined

Light rays in the fiber strike the boundary with the cladding.
When the angle of incidence satisfies the critical angle criterion, the light reflects back into the fiber instead of escaping into the cladding.
This reflection mechanism "contains the light within the fiber."
Power remains in the fiber and is "reflected onward" as long as the criterion is met.
If the criterion is not satisfied, power is lost into the cladding.

📏 Critical angle criterion

The condition for total internal reflection is:

θ_i ≥ arcsin(√(ε_r,c / ε_r,f)) = arcsin(n_c / n_f)

θ_i is the angle of incidence (the angle at which the light ray approaches the boundary).
The right side gives the minimum angle required.
Rays approaching from angles that satisfy this criterion remain confined.

Example: For n_f = 1.52 and n_c = 1.49, the critical angle is approximately 78.8°. This is "quite reasonable" because light should travel approximately parallel to the fiber axis (θ_i ≈ 90°) anyway.

🚫 Don't confuse: angle measurement

The angle θ_i is measured relative to the normal (perpendicular) to the boundary.
A larger θ_i means the ray is traveling more parallel to the fiber axis.
The excerpt notes that 90° corresponds to light traveling parallel to the axis, which is the desired direction.

🔬 Why the uniform plane wave approximation works

📊 Scale comparison

Quantity	Typical size	Implication
Optical wavelength in free space	120 nm to 700 nm	Very small
Wavelength in fiber	Slightly shorter (by √ε_r,f)	Still very small
Fiber diameter	Order of millimeters	About 10,000× larger than wavelength

🌐 Unbounded half-space perspective

The fiber diameter is "about 4 orders of magnitude greater than the wavelength."
From the perspective of the light ray, the fiber "appears to be an unbounded half-space."
The cladding also "appears to be an unbounded half-space."
The boundary between them appears planar (flat).
This justifies treating the light ray as a uniform plane wave at a planar boundary.

⚠️ Practical constraint: bending radius

🔀 Curvature limitation

Total internal reflection imposes a limit on how sharply fiber optic cable can be bent.
If the radius of curvature is too small, the critical angle will be exceeded at the bend.
When the critical angle is exceeded, light escapes into the cladding even if it was initially traveling parallel to the fiber axis.
This loss occurs "even for light rays which are traveling" in the desired direction before the bend.

Acceptance Angle

8.2 Acceptance Angle

🧭 Overview

🧠 One-sentence thesis

Light must arrive within a specific cone of angles (the acceptance angle) to achieve total internal reflection and propagate without loss through an optical fiber.

📌 Key points (3–5)

The core constraint: only light incident within a certain range of angles will undergo total internal reflection inside the fiber and propagate without loss.
Acceptance angle definition: the maximum angle of incidence (relative to the fiber axis) that still permits total internal reflection at the cladding boundary.
Numerical aperture (NA): a related parameter commonly used in datasheets, defined as the sine of the acceptance angle when light arrives from air.
Common confusion: the acceptance angle is measured from the fiber axis, not from the normal to the entrance surface; it defines a cone, not a single ray direction.
Why it matters: to effectively launch light into a fiber, the source must emit within the cone of acceptance; otherwise, light will leak out at the cladding.

🔍 The injection problem

🔍 What happens when light enters the fiber

Light arrives from a medium with refractive index n₀ at an incident angle θᵢ.
It refracts into the fiber core (index nf) at transmission angle θ₂.
Inside the fiber, the light hits the cladding boundary at angle θ₃.
For lossless propagation, total internal reflection must occur at the cladding, which requires:

sin θ₃ ≥ nc / nf where nc is the cladding index and nf is the fiber core index.

🧮 Relating the angles

The angle at the cladding boundary θ₃ is geometrically related to the transmission angle θ₂:
- θ₃ = π/2 − θ₂
- Therefore sin θ₃ = cos θ₂.
Using trigonometric identities and Snell's law at the entrance (sin θ₂ = (n₀ / nf) sin θᵢ), the excerpt derives a constraint on the incident angle θᵢ.
The final result is:

sin θᵢ ≤ (1 / n₀) × square root of (nf² − nc²)

🎯 The cone of acceptance

Only light arriving within a cone around the fiber axis will satisfy the constraint.
The half-angle of this cone is the acceptance angle θₐ.
Example: if the acceptance angle is 17.5°, light must arrive from within 17.5° of the fiber axis to ensure total internal reflection.

📐 Acceptance angle and numerical aperture

📐 Acceptance angle θₐ

Acceptance angle θₐ: the maximum angle of incidence (measured from the fiber axis) that results in total internal reflection within the fiber.

Defined as:
- θₐ = arcsin [(1 / n₀) × square root of (nf² − nc²)]
This is the largest angle at which light can enter and still propagate without loss.
The associated cone of acceptance is illustrated in the excerpt's Figure 8.4.

🔢 Numerical aperture (NA)

Numerical aperture NA: a dimensionless parameter equal to (1 / n₀) × square root of (nf² − nc²), commonly used in fiber optic datasheets.

NA = sin θₐ when light arrives from air (n₀ ≈ 1).
Because n₀ is typically very close to 1, datasheets often define NA simply as the square root of (nf² − nc²).
NA is preferred over the acceptance angle in specifications because it is a single number that captures the fiber's light-gathering ability.

Parameter	Definition	Typical use
Acceptance angle θₐ	Maximum incident angle for total internal reflection	Describes the cone geometry
Numerical aperture NA	sin θₐ (for n₀ ≈ 1)	Datasheet specification

🧪 Worked example from the excerpt

Typical fiber: nf = 1.52, nc = 1.49, n₀ = 1 (air).
NA = square root of (1.52² − 1.49²) ≈ 0.30.
Since sin θₐ = NA, θₐ = 17.5°.
Interpretation: light must arrive from within 17.5° of the fiber axis to ensure total internal reflection.

🧷 Why the acceptance angle matters

🧷 Effective light launching

To inject light efficiently into the fiber, the source must emit within the cone of acceptance.
Light arriving at steeper angles (larger than θₐ) will not undergo total internal reflection and will leak into the cladding.
This constraint affects the design of fiber optic connectors and light sources.

🧷 Relationship to fiber parameters

The acceptance angle depends on the difference between the core and cladding indices (nf and nc).
A larger difference (nf² − nc²) increases NA and θₐ, making the fiber easier to couple light into.
Don't confuse: the acceptance angle is not determined by the fiber diameter or length; it is purely a function of the refractive indices.

Dispersion in Optical Fiber

8.3 Dispersion in Optical Fiber

🧭 Overview

🧠 One-sentence thesis

Dispersion in optical fiber—caused by light traveling different path lengths—smears pulses at the output and fundamentally limits the maximum data rate, independent of any signal loss.

📌 Key points (3–5)

What causes dispersion: light follows many different paths through the fiber, each with a different length and propagation time.
How dispersion distorts signals: pulses "smear" at the output; when delay spread becomes too large relative to pulse width, adjacent pulses overlap and become undetectable.
Key quantity—delay spread: the difference between maximum and minimum propagation times (τ_max − τ_min); it increases linearly with cable length.
Fundamental limit: dispersion imposes a maximum data rate that decreases linearly as cable length increases, even before considering any media loss.
Common confusion: longer cables don't just attenuate signals—they also spread pulses in time, which is a separate, fundamental constraint on data rate.

🌈 Why light takes multiple paths

🌈 The variety of paths

Light does not travel only along the fiber axis; it bounces within the fiber at various angles.
Shortest path (minimum time): light travels parallel to the axis (Figure 8.5(a)).
Longest path (maximum time): light bounces at the critical angle for total internal reflection (Figure 8.5(c)); any steeper angle would not be completely reflected and would likely not survive to the end.
Intermediate paths: a continuum of possibilities exists between the two extremes (Figure 8.5(b)).

🔄 Why all paths coexist

Even if light is inserted carefully, fiber is rarely installed in a straight line.
Each curve in the fiber creates new angles of incidence at the core-cladding boundary.
Result: all possible paths depicted in the figures are likely to exist simultaneously.

📉 How dispersion smears pulses

📉 The smearing effect

Dispersion: the phenomenon where light following different paths arrives at different times, distorting signals.

Consider a digital input signal with period T and pulse width t_on (Figure 8.6).
Light following the shortest path arrives first (time τ_min).
Light following the longest path arrives last (time τ_max).
If the difference (τ_max − τ_min) is much smaller than the pulse width t_on, degradation is small.
Otherwise, pulses "smear" at the output (Figure 8.7): the top waveform corresponds to minimum path length, the middle to maximum path length, and the bottom shows the sum of all arriving waveforms.

⚠️ The overlap problem

As (τ_max − τ_min) becomes a larger fraction of t_on, adjacent pulses can overlap.
Overlapping pulses present a serious challenge for detecting individual pulses at the receiver.
Condition to avoid overlap: τ_max − τ_min + t_on must be less than T (the period).
Don't confuse: the issue is not just that pulses are wider—it's that they can merge with neighboring pulses, making detection impossible.

🧮 Calculating delay spread

🧮 Delay spread definition

Delay spread (τ): the difference between maximum and minimum propagation times, τ_max − τ_min.

This quantity determines the minimum period T and thus the maximum data rate.
Requirement for overlap-free transmission: τ < T − t_on.
The delay spread imposes a minimum value on T, which in turn imposes a maximum value on the transmission rate.

⏱️ Minimum propagation time

Minimum time τ_min is the cable length l divided by the phase velocity v_p of light in the core.
Phase velocity is the speed of light in free space c divided by the refractive index n_f of the core.
Result: τ_min = (l × n_f) / c.

⏱️ Maximum propagation time

Maximum time τ_max is different because the maximum path length is different.
The maximum path length is l / cos(θ₂), where θ₂ is the angle between the axis and the direction of travel.
For the longest path, θ₂ is determined by the threshold angle for total internal reflection: cos(θ₂) = n_c / n_f (where n_c is the cladding refractive index).
Result: τ_max = (l × n_f²) / (c × n_c).

📐 Delay spread formula

Delay spread τ = (l × n_f / c) × (n_f / n_c − 1).
Key insight: τ increases linearly with cable length l.
Therefore, the maximum supportable data rate decreases linearly as cable length increases.
This is a fundamental limitation from dispersion, independent of any media loss.

📊 Practical implications and solutions

📊 Example calculation

Scenario: A 1 m multimode fiber with n_f = 1.52 and n_c = 1.49, transmitting data with t_on = T / 2.

Quantity	Value	Implication
Delay spread τ	~102 ps	Time difference between fastest and slowest paths
Minimum period T	>204 ps	To avoid overlap, T > 2τ
Maximum data rate	~4.9 Gb/s	One bit per period: 1/T

What happens at 1 km: delay spread increases by a factor of 1000 (linear with length), so maximum data rate drops to ~4.9 Mb/s—a dramatic reduction.

🔧 Solutions to dispersion

Repeaters: divide the link into smaller segments separated by devices that receive the dispersed signal, demodulate it, regenerate the original signal, and transmit the restored signal to the next repeater.
Single-mode fiber: has intrinsically less dispersion than the multimode fiber assumed in the analysis above.
Don't confuse: these solutions address dispersion (pulse spreading), not signal attenuation—they are separate problems.

🎯 Key takeaway

Dispersion is a fundamental constraint on data rate, independent of signal loss.
The maximum supportable data rate decreases linearly with increasing cable length due to dispersion alone.
This is why fiber optic systems must carefully manage dispersion, especially over long distances.

9.1 Radiation from a Current Moment

🧭 Overview

🧠 One-sentence thesis

The current moment—a simple, point-like current distribution at the origin—produces a spherical wave whose electric field is proportional to the current magnitude, inversely proportional to distance, and varies with angle as sine theta, providing a fundamental building block for analyzing more complex current distributions.

📌 Key points (3–5)

What a current moment is: the simplest possible volume current density, existing only at the origin and defined using the volumetric sampling function delta(r).
Why it matters: although practical currents rarely exist in this exact form, the current moment serves as a building block for constructing realistic current distributions via superposition.
Key field behavior: far from the source, the radiated electric field is proportional to 1/r, contains the phase factor exp(-j beta r), points in the theta-hat direction, and varies as sin theta.
Common confusion: the solution is approximate and valid only far from the source (distances much greater than a wavelength, r >> lambda), not everywhere.
Equivalence: the current moment analyzed here is effectively the same as a Hertzian dipole in practical engineering applications.

🔬 Defining the current moment

🔬 Mathematical definition

Current moment: a current density distribution Delta J(r) = z-hat I Delta l delta(r), where I Delta l is the scalar current moment (units: ampere times meter) and delta(r) is the volumetric sampling function.

The volumetric sampling function delta(r) is zero everywhere except at the origin (r = 0).
When integrated over any volume containing the origin, delta(r) yields 1.
Units: delta(r) has units of inverse cubic meters (m^-3), so Delta J(r) has units of A/m^2, confirming it is a volume current density.
Key insight: this is the simplest possible form of volume current density because it exists only at a single point.

🧱 Why it's a building block

Practical current distributions generally do not exist in precisely this point-like form.
However, complex current distributions can be constructed from many current moments placed at different locations.
Radiation from the complex distribution is calculated by summing the radiation from each constituent current moment (principle of superposition).
Example: a realistic antenna current can be modeled as many tiny current moments distributed along its length.

📡 Physical expectations before solving

📡 Proportionality to source strength

Electric fields are proportional to the currents that create them.
Therefore, Delta tilde E(r) must be proportional to the magnitude of tilde I Delta l.

📡 Spherical wave behavior

Sufficiently far from the origin, the field should behave like a spherical wave.
Power density in a spherical wave is proportional to 1/r^2 (spreading over larger spheres).
Since time-average power density is proportional to the square of the electric field magnitude, the field itself must be proportional to 1/r.
Phase should change at rate beta (the phase propagation constant, equal to 2 pi / lambda), so the field contains the factor exp(-j beta r).

📡 Direction and symmetry

Magnetic field direction: Ampere's law indicates that a z-hat-directed current at the origin produces a phi-hat-directed magnetic field in the z = 0 plane.
Electric field direction: Poynting's theorem requires the cross product of electric and magnetic fields to point in the direction of power flow (away from the source, i.e., +r-hat direction). Therefore, in the z = 0 plane, Delta tilde E points in the -z-hat direction. In general, Delta tilde E points in the theta-hat direction.
Angular variation: the field must be zero along the z axis (theta = 0 and theta = pi). Symmetry suggests maximum magnitude at theta = pi/2. The simplest variation satisfying these constraints is proportional to sin theta.
Azimuthal symmetry: radial symmetry means the field does not depend on phi.

🧮 The solution for radiated electric field

🧮 Assembled form

Combining all the physical expectations, the radiated electric field has the form:

Delta tilde E(r) approximately equals theta-hat C (tilde I Delta l) (sin theta) exp(-j beta r) / r

where C is a constant accounting for all proportionality factors.

🧮 Determining the constant

The units of Delta tilde E(r) are volts per meter (V/m).
Therefore, C must have units of ohms per meter (Ω/m).
The wave impedance of the medium eta has units of ohms (Ω), so C is likely proportional to eta.
The rigorous solution (presented later in the text) gives: C = j eta beta / (4 pi).
Final result: Delta tilde E(r) approximately equals theta-hat (j eta beta / 4 pi) (tilde I Delta l) (sin theta) exp(-j beta r) / r.

🧮 Validation

This solution satisfies the wave equation: nabla-squared Delta tilde E(r) + beta-squared Delta tilde E(r) = 0.
The excerpt notes that confirming this is straightforward (substitute and evaluate).

⚠️ Limitations and applicability

⚠️ Approximation and distance requirement

The solution is approximate (hence the "approximately equals" symbol).
The presumption of a simple spherical wave is valid only at distances far from the source.
Distance criterion: the solution is valid for distances much greater than a wavelength (r >> lambda).
Don't confuse: this is a far-field approximation; the solution may not be accurate close to the source.

⚠️ AC vs DC distinction

If the current is steady (DC), the problem falls within magnetostatics: the outcome is completely described by the magnetic field, and there can be no radiation.
The analysis limits attention to the AC case, for which radiation is possible.
Phasor representation is used: Delta tilde J(r) = z-hat tilde I Delta l delta(r), where tilde I Delta l is the scalar current moment expressed as a phasor.

⚠️ Hertzian dipole equivalence

Hertzian dipole: typically defined as a straight, infinitesimally-thin filament of current with length very small relative to a wavelength, but not precisely zero.

This interpretation does not change the solution obtained for the current moment.
In practical engineering applications, the current moment and the Hertzian dipole are effectively the same.

Magnetic Vector Potential

9.2 Magnetic Vector Potential

🧭 Overview

🧠 One-sentence thesis

The magnetic vector potential reduces the problem of finding electromagnetic fields radiated by a current distribution from a system of four coupled Maxwell equations to a single inhomogeneous wave equation that can be solved more easily.

📌 Key points (3–5)

What the magnetic vector potential is: a single vector field that represents both electric and magnetic fields simultaneously, simplifying the solution process.
How it simplifies the problem: instead of solving four coupled partial differential equations (Maxwell's equations), you solve one inhomogeneous wave equation for the vector potential, then derive the fields from it.
The Lorenz gauge condition: a constraint (divergence of A plus jωμε times V equals zero) that dramatically simplifies the wave equation and leads to the classical interpretation of V as scalar electric potential.
Common confusion: the scalar field V initially appears arbitrary, but choosing it to satisfy the Lorenz gauge condition is not arbitrary—it has deep physical meaning and simplifies the mathematics.
Why it matters: this approach provides a systematic procedure to determine fields radiated by any specified current distribution, with the source current appearing explicitly in the inhomogeneous wave equation.

🎯 The radiation problem and setup

🎯 What problem we're solving

Goal: determine the electromagnetic fields radiated by a specified current distribution.
The known quantity is the current distribution (denoted with tilde J).
The desired quantities are the electric and magnetic fields (denoted with tilde E and tilde H).
The excerpt uses phasor representation for time-harmonic (sinusoidally-varying) currents.
Any other time-domain variation can be represented using sums of time-harmonic solutions via the Fourier transform, so there is no loss of generality.

📐 The starting equations

The appropriate equations are Maxwell's equations in phasor form:

Divergence of tilde E equals tilde rho_v divided by epsilon (Equation 9.7)
Curl of tilde E equals negative j omega mu times tilde H (Equation 9.8)
Divergence of tilde H equals zero (Equation 9.9)
Curl of tilde H equals tilde J plus j omega epsilon times tilde E (Equation 9.10)

Where tilde rho_v is the volume charge density.

🌐 Scope limitation

The excerpt limits scope to problems where tilde rho_v equals zero (neutral charge).
This covers most engineering problems concerned with propagation through homogeneous media like free space.
A counter-example would be propagation through a plasma, which by definition consists of non-zero net charge.
With this simplification, Equation 9.7 becomes: divergence of tilde E equals zero (Equation 9.11).

Don't confuse: the limitation to neutral charge is not a fundamental restriction but a practical choice covering most engineering scenarios; the method can be extended to charged media.

🧲 Defining the magnetic vector potential

🧲 The core definition

The magnetic vector potential tilde A is defined by: tilde B equals curl of tilde A (Equation 9.12), where tilde B equals mu times tilde H is the magnetic flux density.

This definition says the magnetic flux density is the curl of the vector potential.
For this to be reasonable, substituting curl of tilde A for mu times tilde H in Maxwell's equations must yield consistent results.
The excerpt checks consistency with three equations where the magnetic field appears.

✅ Consistency check 1: Gauss's law for magnetic fields

Substituting into Equation 9.9 gives: divergence of (curl of tilde A) equals zero (Equation 9.13).
This is a mathematical identity that applies to any vector field (see Equation B.24 in Appendix B.3).
Therefore, Equation 9.12 is consistent with Gauss's law for magnetic fields automatically.

✅ Consistency check 2: Faraday's law

Substituting into Equation 9.8: curl of tilde E equals negative j omega times (curl of tilde A) (Equation 9.14).
Gathering terms: curl of (tilde E plus j omega times tilde A) equals zero (Equation 9.15).
Now define a new scalar field tilde V such that: negative gradient of tilde V equals tilde E plus j omega times tilde A (Equation 9.16).
Using this definition, Equation 9.15 becomes: curl of (negative gradient of tilde V) equals zero, or curl of gradient of tilde V equals zero (Equation 9.18).
This is another mathematical identity that applies to any vector field (see Equation B.25 in Appendix B.3).
Therefore, tilde V can be any mathematically-valid scalar field, and Equation 9.12 is consistent with Equation 9.8 for any choice of tilde V.

🔌 Connection to electrostatics

Equation 9.16 is very similar to E equals negative gradient of V from electrostatics, where V is the scalar electric potential field.
Equation 9.16 is an enhanced version that accounts for the coupling with H (represented by A) in the time-varying (non-static) case.
However, at this stage, we are not yet compelled to make any particular choice for tilde V—this freedom will be exploited later.

Don't confuse: the scalar field V here is not exactly the same as the electrostatic potential; it's a generalization for time-varying fields, but the connection becomes clear when we apply the Lorenz gauge condition.

🌊 Deriving the wave equation

🌊 Consistency check 3: Ampère's law with Maxwell correction

Starting with Equation 9.10 and substituting:

Curl of (one over mu times curl of tilde A) equals tilde J plus j omega epsilon times tilde E (Equation 9.19).
Multiplying both sides by mu: curl of curl of tilde A equals mu times tilde J plus j omega mu epsilon times tilde E (Equation 9.20).
Use Equation 9.16 to eliminate tilde E: curl of curl of tilde A equals mu times tilde J plus j omega mu epsilon times (negative gradient of tilde V minus j omega times tilde A) (Equation 9.21).
After algebra: curl of curl of tilde A equals omega squared mu epsilon times tilde A minus j omega mu epsilon times gradient of tilde V plus mu times tilde J (Equation 9.22).

🔄 Applying a vector identity

Replace the left side using vector identity B.29: curl of curl of tilde A equals gradient of (divergence of tilde A) minus Laplacian of tilde A (Equation 9.23).
Equation 9.22 becomes: gradient of (divergence of tilde A) minus Laplacian of tilde A equals omega squared mu epsilon times tilde A minus j omega mu epsilon times gradient of tilde V plus mu times tilde J (Equation 9.24).
Multiplying both sides by negative one and rearranging: Laplacian of tilde A plus omega squared mu epsilon times tilde A equals gradient of (divergence of tilde A) plus j omega mu epsilon times gradient of tilde V minus mu times tilde J (Equation 9.25).
Combining terms on the right: Laplacian of tilde A plus omega squared mu epsilon times tilde A equals gradient of (divergence of tilde A plus j omega mu epsilon times tilde V) minus mu times tilde J (Equation 9.26).

🎯 The Lorenz gauge condition

Consider the expression (divergence of tilde A plus j omega mu epsilon times tilde V) appearing in parentheses on the right side.
We established earlier that tilde V can be essentially any scalar field—we are free to choose.
Invoking this freedom, we now require tilde V to satisfy: divergence of tilde A plus j omega mu epsilon times tilde V equals zero (Equation 9.27).

The Lorenz gauge condition: divergence of tilde A plus j omega mu epsilon times tilde V equals zero (Equation 9.27).

This choice dramatically simplifies Equation 9.26 to: Laplacian of tilde A plus omega squared mu epsilon times tilde A equals negative mu times tilde J (Equation 9.28).

🌊 The resulting wave equation

The wave equation for tilde A: Laplacian of tilde A plus omega squared mu epsilon times tilde A equals negative mu times tilde J (Equation 9.28).

This is the same wave equation that determines tilde E and tilde H in source-free regions, except the right-hand side is not zero.
Using mathematical terminology, this is an inhomogeneous partial differential equation, where the inhomogeneous part includes the source current tilde J.

Don't confuse: "inhomogeneous" here is a mathematical term meaning the equation has a non-zero right-hand side (the source term), not a statement about the medium's properties.

🛠️ The solution procedure

🛠️ Three-step process

Now we have what we need to find the electromagnetic fields radiated by a current distribution. The procedure is:

Solve for tilde A: Solve the partial differential Equation 9.28 for tilde A along with the appropriate electromagnetic boundary conditions.
Find tilde H: tilde H equals (one over mu) times curl of tilde A (from Equation 9.12).
Find tilde E: tilde E may now be determined from tilde H using Equation 9.10.

📦 Summary definition

The magnetic vector potential tilde A is a vector field, defined by Equation 9.12, that is able to represent both the electric and magnetic fields simultaneously.

Procedure summary: To determine the electromagnetic fields radiated by a current distribution tilde J, one may solve Equation 9.28 for tilde A and then use Equation 9.12 to determine tilde H and subsequently tilde E.

Specific techniques for performing this procedure—in particular, for solving the differential equation—vary depending on the problem and are discussed in other sections of the book.

🔬 Physical interpretation and deeper meaning

🔬 The Lorenz gauge is not arbitrary

The Lorenz gauge condition (Equation 9.27) is not quite as arbitrary as the derivation implies.
There is some deep physics at work here.
Specifically, the Lorenz gauge leads to the classical interpretation of tilde V as the familiar scalar electric potential, as noted previously in this section.

🌌 Unification of electric and magnetic fields

It should be clear that the electric and magnetic fields are not merely coupled quantities, but in fact two aspects of the same field; namely, the magnetic vector potential.
Modern physics (quantum mechanics) yields the magnetic vector potential as a description of the "electromagnetic force."
This is a single entity which constitutes one of the four fundamental forces recognized in modern physics.
The other three fundamental forces are: gravity, the strong nuclear force, and the weak nuclear force.

Don't confuse: the magnetic vector potential is not just a mathematical convenience—it has fundamental physical significance in quantum mechanics, where it is more fundamental than the electric and magnetic fields themselves.

📚 Additional context

The excerpt references:

"Lorenz gauge condition" on Wikipedia
"Magnetic potential" on Wikipedia
PBS Space Time video "Quantum Invariance & The Origin of The Standard Model," available on YouTube

🔧 Connection to the Hertzian dipole

🔧 Context from the preceding section

The excerpt begins with a note about the Hertzian dipole from the previous section:

A Hertzian dipole is typically defined as a straight infinitesimally-thin filament of current with length which is very small relative to a wavelength, but not precisely zero.
This interpretation does not change the solution obtained, thus the current moment and the Hertzian dipole are effectively the same in practical engineering applications.
The radiated electric field formula is approximate (hence the "approximately equals" symbol), valid at distances much greater than a wavelength (r much greater than lambda).

Don't confuse: the approximation is due to the presumption of a simple spherical wave, which is only valid in the "far field" (distances much greater than a wavelength from the source).

Solution of the Wave Equation for Magnetic Vector Potential

9.3 Solution of the Wave Equation for Magnetic Vector Potential

🧭 Overview

🧠 One-sentence thesis

The magnetic vector potential due to current distributions can be solved systematically by starting with a single-point current moment and generalizing to arbitrary current distributions through superposition.

📌 Key points (3–5)

The wave equation: the magnetic vector potential ˜A satisfies a wave equation driven by current density ˜J, with propagation constant γ.
Building-block strategy: solve first for a current moment (current at a single point), then extend to general distributions.
Radiation condition: physical reasoning requires the field magnitude to decay with distance (not grow), which constrains the sign of the exponent in the solution.
Common confusion: the solution contains e^(±γr) initially, but only the negative sign is physical because fields must diminish away from the source, not amplify.
Generalization path: from a point source → displaced point source → line current filament, using superposition in linear media.

🌊 The wave equation and propagation constant

🌊 Wave equation for magnetic vector potential

The magnetic vector potential ˜A due to current density ˜J satisfies: ∇²˜A − γ²˜A = −μ˜J

This is an inhomogeneous partial differential equation.
The term −μ˜J acts as the "driving" or "source" term.
The equation can be solved given appropriate boundary conditions.

🔢 Propagation constant γ

γ² is defined as −ω²με, where ω is angular frequency, μ is permeability, and ε is permittivity.

In lossy media, γ can be written as γ = α + jβ, where:
- α is the attenuation constant (real, positive).
- β is the phase propagation constant (real, positive).
The excerpt assumes homogeneous and time-invariant media (μ and ε constant in space and time).

🎯 Current moment: the point-source solution

🎯 What is a current moment

A current moment is a distribution of current that exists at a single point, expressed as ˜J(r) = ˆl ˜I Δl δ(r).

˜I has units of current (amperes).
Δl has units of length (meters).
ˆl is the direction of current flow.
δ(r) is the volumetric sampling function (Dirac delta function).

📐 The volumetric sampling function δ(r)

δ(r) = 0 for r ≠ 0.
The integral of δ(r) over any volume V that includes the origin equals 1.
δ(r) has SI base units of inverse volume (m⁻³).
This ensures ˜J(r) has units of A/m², consistent with volume current density.
The current moment exists only at the origin and nowhere else.

✅ General solution for a current moment at the origin

The solution to the wave equation for a current moment at the origin is: ˜A(r) = ˆl μ ˜I Δl e^(±γr) / (4πr)

The ± sign initially gives two mathematical solutions.
To confirm this is a solution, substitute it back into the wave equation and verify the equality holds.
Physical reasoning (the radiation condition) will constrain the sign.

🚫 The radiation condition: choosing the physical solution

🚫 Why the negative sign is required

Consider the factor e^(±αr)/r, which determines how magnitude depends on distance r.
Negative sign (e^(−αr)): the factor decays exponentially with increasing r, reaching zero as r → ∞. This matches physical expectation: radiated field magnitude diminishes with distance from the source.
Positive sign (e^(+αr)): the factor increases to infinity as r → ∞. This is unphysical because it implies energy is introduced independently from the source.

The radiation condition: field magnitude must diminish to zero as distance from the source increases to infinity.

This is essentially a boundary condition at r → ∞.
Invoking the radiation condition, the solution becomes: ˜A(r) = ˆl μ ˜I Δl e^(−γr) / (4πr)

🔋 Loss-free (lossless) case

When α = 0 (no attenuation), the radiation condition alone cannot constrain the sign of γ.
However, in practical engineering there is always some media loss (α might be negligible but not exactly zero).
Therefore, the solution for lossless conditions (including free space) is assumed to be: ˜A(r) = ˆl μ ˜I Δl e^(−jβr) / (4πr)
Don't confuse: even in "lossless" problems, we use the negative sign based on the limiting behavior from lossy cases.

🌐 Spherical wave interpretation

The factor e^(−jβr) determines the phase dependence on distance r.
Surfaces of constant phase correspond to spherical shells concentric with the source.
Thus, ˜A(r) is a spherical wave.

🔄 Generalizing the solution: displaced sources and superposition

🔄 Current moment displaced from the origin

If the current moment is located at r′ (not the origin), the current distribution is: ˜J(r) = ˆl ˜I Δl δ(r − r′)
The solution depends only on the distance between the "field point" r (where we observe ˜A) and the "source point" r′ (where the current moment lies).
Replace r with |r − r′| in the solution: ˜A(r) = ˆl μ ˜I Δl e^(−γ|r−r′|) / (4π|r − r′|)
The potential does not depend on angles θ or φ, only on the distance |r − r′|.

🧵 Filament of current along a path C

A current filament following a path C through space can be viewed as a collection of many discrete current moments distributed along the path.
The contribution of the nth current moment at rₙ is: Δ˜A(r; rₙ) = ˆl(rₙ) μ ˜I(rₙ) Δl e^(−γ|r−rₙ|) / (4π|r − rₙ|)
Both the current ˜I and direction ˆl may vary with position along C.

➕ Superposition in linear media

Because the medium is linear, superposition applies.
Add the contributions from all N current moments: ˜A(r) ≈ Σ (from n=1 to N) Δ˜A(r; rₙ)
This gives: ˜A(r) ≈ (μ/4π) Σ (from n=1 to N) ˆl(rₙ) ˜I(rₙ) e^(−γ|r−rₙ|) / |r − rₙ| Δl

🔗 Continuum limit: line integral

Let Δl → 0 so that Δl becomes the differential length dl.
Replace the discrete sum with a line integral over the path C.
Replace rₙ with r′ (the continuum of source locations).

The magnetic vector potential for a line distribution of current is: ˜A(r) = (μ/4π) ∫_C ˆl(r′) ˜I(r′) e^(−γ|r−r′|) / |r − r′| dl

Given ˜A(r), the magnetic and electric fields can be determined using the procedure from Section 9.2.
This formula is the key result for arbitrary line current distributions.

📚 Summary table: solution generalization

Configuration	Current distribution	Solution for ˜A(r)
Current moment at origin	˜J(r) = ˆl ˜I Δl δ(r)	ˆl μ ˜I Δl e^(−γr) / (4πr)
Current moment at r′	˜J(r) = ˆl ˜I Δl δ(r − r′)	ˆl μ ˜I Δl e^(−γ\|r−r′\|) / (4π\|r − r′\|)
Line current along path C	Continuum of moments along C	(μ/4π) ∫_C ˆl(r′) ˜I(r′) e^(−γ\|r−r′\|) / \|r − r′\| dl

Each step builds on the previous one using superposition.
The strategy is: solve the simplest case (point source), then generalize by replacing distances and summing/integrating contributions.

Radiation from a Hertzian Dipole

9.4 Radiation from a Hertzian Dipole

🧭 Overview

🧠 One-sentence thesis

The Hertzian dipole—a zero-length current moment—produces spherical electromagnetic waves whose magnitude depends on electrical length and varies with direction (maximum perpendicular to the current, zero along it), and can be rigorously derived using magnetic vector potential.

📌 Key points (3–5)

What a Hertzian dipole is: an electrically-short, infinitesimally-thin straight filament of uniform current, modeled mathematically as a current moment at a single point.
Why it matters: serves as a "building block" for modeling physically-realizable antennas by summing contributions from multiple Hertzian dipoles with different positions, magnitudes, and phases.
Far-field behavior: at distances much greater than a wavelength (r ≫ λ), the radiated field behaves as a spherical wave with magnitude inversely proportional to distance and phase fronts that appear locally planar.
Directional pattern: field magnitude is zero along the current direction and maximum in the plane perpendicular to it (proportional to sin θ).
Common confusion: the Hertzian dipole can be represented either as constant current over finite length or as a current moment at a point—these are mathematically equivalent but the point representation is more versatile for calculations.

📐 Mathematical representation

📐 Current moment formulation

Current moment: the product Ĩ Δl (SI base units of A·m), representing the strength of the Hertzian dipole.

The excerpt replaces "constant current over finite length" with a current moment located at a single point:

ΔJ̃(r) = l̂ Ĩ Δl δ(r)

Where:

l̂ is the direction of current flow
Ĩ is the current
Δl is the length
δ(r) is the volumetric sampling function (zero everywhere except at the origin; integrates to 1 over any volume containing the origin)

Why this representation: it is "mathematically more versatile" for constructing complex current distributions as sums of individual dipoles.

🧲 Magnetic vector potential

For a ẑ-directed Hertzian dipole at the origin in lossless media:

Ã(r) = ẑ μ Ĩ Δl e^(-jβr) / (4πr)

Where:

μ is permeability
β is the phase constant (2π/λ)
r is distance from the origin
The exponential and 1/r factors indicate a spherical wave

This potential is the starting point for deriving the electric and magnetic fields.

🌊 Far-field approximation

🌊 When it applies

The far-field approximation is valid when:

r ≫ λ (distance much greater than wavelength)
The medium has low loss

Why distance matters: at r ≫ λ, the term (jβ + 1/r) simplifies to just jβ because 1/r becomes negligible compared to 2π/λ.

🧭 Magnetic field in the far field

H̃ ≈ φ̂ j Ĩ · β Δl / (4π) · (sin θ) · e^(-jβr) / r

Key features:

Direction: always φ̂ (azimuthal direction in spherical coordinates), consistent with the Biot-Savart law
Electrical length: the factor β Δl has units of radians, showing that field magnitude depends on electrical length, not just physical length
Spherical wave: the e^(-jβr)/r factor means constant-phase surfaces are concentric spheres, and magnitude decreases as 1/r
Angular dependence: sin θ factor means zero field along the current direction (θ = 0 or π) and maximum in the perpendicular plane (θ = π/2)

Example: If you place a ẑ-directed dipole at the origin, the magnetic field is strongest in the x-y plane and vanishes along the z-axis.

⚡ Electric field in the far field

Ẽ ≈ θ̂ j η Ĩ · β Δl / (4π) · (sin θ) · e^(-jβr) / r

Where η is the wave impedance.

How it's derived: The excerpt uses two methods:

Direct method: apply Ampere's law (Ẽ = (1/jωε) ∇ × H̃)
Plane wave approximation: use Ẽ = -η r̂ × H̃

Why plane wave relationships work: Far from the dipole, the radius of curvature of spherical phase fronts is very large, so they "appear to be locally planar" to an observer—the arriving wave looks like a plane wave.

🔄 Relationship between E and H

The electric and magnetic fields are:

Perpendicular to each other (Ẽ in θ̂ direction, H̃ in φ̂ direction)
Both perpendicular to the direction of propagation (r̂)
Related by the wave impedance η
Have the same angular dependence (sin θ)

Don't confuse: "locally planar" does not mean the wave is actually a plane wave—it's still spherical, but the curvature is negligible over small regions far from the source.

🧱 Building-block concept

🧱 Why use Hertzian dipoles

The excerpt emphasizes that the Hertzian dipole is a "building block" for modeling real antennas:

Method:

Model complex current distributions as the sum of multiple Hertzian dipoles
Each dipole has its own position, magnitude, and phase
Sum the contributions of all individual dipoles
Superposition applies because the medium is linear

Advantage: reduces the problem of analyzing a complex antenna to summing simpler, well-understood components.

🔧 Mathematical versatility

The current-moment representation (using the delta function) is "essentially equivalent" to the finite-length constant-current representation but is "mathematically more versatile."

Why: The point-source formulation makes it easier to:

Position dipoles at arbitrary locations
Sum contributions from multiple dipoles
Apply superposition systematically

Example: A wire antenna can be modeled as a line of Hertzian dipoles, each with different current magnitude and phase, then integrated along the wire length.

📊 Summary of far-field results

Quantity	Expression	Key features
Magnetic field	H̃ ≈ φ̂ j Ĩ β Δl (sin θ) e^(-jβr) / (4πr)	Direction: φ̂; depends on electrical length β Δl
Electric field	Ẽ ≈ θ̂ j η Ĩ β Δl (sin θ) e^(-jβr) / (4πr)	Direction: θ̂; related to H̃ by impedance η
Validity	r ≫ λ	Far from source; low-loss media
Angular pattern	sin θ	Zero along current axis; max perpendicular
Wave type	Spherical	Magnitude ∝ 1/r; phase fronts are spheres

Final note from excerpt: "The electric and magnetic fields far (i.e., ≫ λ) from a ẑ-directed Hertzian dipole having constant current Ĩ over length Δl, located at the origin, are given by Equations 9.67 and 9.64, respectively."

Radiation from an Electrically-Short Dipole

9.5 Radiation from an Electrically-Short Dipole

🧭 Overview

🧠 One-sentence thesis

The electrically-short dipole (ESD) radiates fields that are identical in pattern to a Hertzian dipole but with half the magnitude, because its triangular current distribution integrates to half the value of the Hertzian dipole's uniform current.

📌 Key points (3–5)

What defines an ESD: current aligned along a straight line with length L much less than λ/2, and a triangular magnitude distribution with constant phase.
How it differs from a Hertzian dipole: the Hertzian dipole has constant current over its length (rarely seen in practice), while the ESD has a triangular distribution (commonly encountered in real antennas).
Why the field is half that of a Hertzian dipole: the integral over the triangular current distribution is exactly one-half the integral over the uniform distribution.
Common confusion: ESDs are physically realizable, whereas Hertzian dipoles are idealized models not found in practice.
Radiation pattern: field magnitude is zero along the dipole axis (z-direction), maximum in the perpendicular plane, with power flowing radially outward.

📐 Defining the electrically-short dipole

📏 Two defining characteristics

An electrically-short dipole (ESD) is characterized by:

Alignment: current flows along a straight line.
Electrical shortness: length L is much less than half a wavelength (L ≪ λ/2).

The term "electrically-short" refers to the length being small compared to the wavelength, not physical size.

📊 Current distribution shape

The current distribution of an ESD is approximately triangular in magnitude and approximately constant in phase.

Why triangular?

Current distributions cannot change rapidly over distances much less than a wavelength (similar to transmission line behavior with λ/2 period).
Current must be zero at the ends of the dipole (because the dipole ends there).
These constraints lead to a triangular shape.

Mathematically: Ĩ(z) ≈ I₀(1 - 2|z|/L), where I₀ is the maximum current magnitude and phase.

🔍 Common confusion: ESD vs Hertzian dipole

Feature	Hertzian Dipole	Electrically-Short Dipole (ESD)
Current distribution	Constant over length	Triangular in magnitude
Physical realizability	Idealized, rarely/approximately seen	Good approximation to real short antennas
Field magnitude	Reference value	Exactly half the Hertzian dipole

Don't confuse: both are "electrically-short dipoles," but the term "ESD" in this context specifically refers to the triangular distribution unless noted otherwise.

🧮 Deriving the radiated field

🧩 Approximation method: sum of Hertzian dipoles

Instead of direct integration of the magnetic vector potential, the excerpt uses a simpler approach:

Imagine the ESD as many short segments radiating independently.
Each segment is so short that its current is approximately constant → treat each as a Hertzian dipole.
Sum the contributions from all segments.

Advantage: we already have the solution for each Hertzian dipole segment from Section 9.4.

📝 Starting from the Hertzian dipole solution

For a z-directed Hertzian dipole at the origin, the radiated electric field is:

Ẽ(r) ≈ θ̂ jη Ĩ · β Δl / (4π) (sin θ) e^(-jβr) / r

where:

η is wave impedance (≈377 Ω for free space)
β = 2π/λ is the phase propagation constant
Assumes lossless media and field points far from the dipole (r ≫ λ)

🔧 Building the integral

For a differential segment at position z′:

dẼ(r; z′) ≈ θ̂′ jηβ Ĩ(z′) / (4π) (sin θ′) e^(-jβ|r - ẑz′|) / |r - ẑz′| dz′

Integrate from -L/2 to +L/2 to get the total field from the ESD.

🎯 Simplifying assumptions for far fields

📏 Parallel ray approximation

Since r ≫ L (distance to field point is much greater than dipole length):

θ′ ≈ θ (angle is approximately constant)
θ̂′ ≈ θ̂ (unit vector is approximately constant)
These can be moved outside the integral.

📐 Distance approximation

The vector r is approximately parallel to r - ẑz′, so:

|r - ẑz′| ≈ r - r̂ · ẑz′

For magnitude: |r - ẑz′| ≈ r (since r ≫ L)

For phase: More careful treatment needed because phase is sensitive.

Using e^(-jβ|r - ẑz′|) ≈ e^(-jβr) e^(+jβ r̂·ẑz′)
Worst case: field points along z-axis where r̂·ẑ = ±1
Phase variation β r̂·ẑz′ varies by much less than π radians (since L ≪ λ)
Therefore: e^(-jβ|r - ẑz′|) ≈ e^(-jβr)

Example: Even in the worst case along the z-axis, the phase difference across the dipole is negligible because the dipole is electrically short.

🧮 Evaluating the integral

After simplifications, the integral reduces to:

Ẽ(r) ≈ θ̂ jηβ/(4π) (sin θ) e^(-jβr)/r ∫_{-L/2}^{+L/2} Ĩ(z′) dz′

The integral over the triangular distribution equals I₀L/2 (from inspection of the triangular shape).

📡 Final field expressions and characteristics

⚡ Electric field

Ẽ(r) ≈ θ̂ jη I₀ · βL/(8π) (sin θ) e^(-jβr)/r

Or equivalently, using β = 2π/λ:

Ẽ(r) ≈ θ̂ j ηI₀/4 · L/λ (sin θ) e^(-jβr)/r

Valid for r ≫ λ (far from the dipole).

🧲 Magnetic field

Since the wave appears locally planar at far distances, use the plane wave relationship H̃ = (1/η) r̂ × Ẽ:

H̃(r) ≈ φ̂ j I₀/4 · L/λ (sin θ) e^(-jβr)/r

🎯 Why exactly half the Hertzian dipole magnitude

The ESD field magnitude is exactly one-half that of a Hertzian dipole with current moment ẑ I₀L/2.

Reason: The integral over the triangular current distribution is one-half the integral over the uniform current distribution of a Hertzian dipole.

Triangular distribution: area under triangle = (1/2) × base × height
Uniform distribution: area = base × height
Ratio = 1/2

Don't confuse: The variation in magnitude, phase, and polarization with field point location is identical for both; only the overall magnitude differs by a factor of 2.

📊 Spatial radiation pattern

In planes of constant φ (Figures 9.9–9.10):

Field magnitude follows sin θ pattern: zero along z-axis, maximum in perpendicular plane.
Electric field oriented in θ̂ direction, magnetic field in φ̂ direction.

In the z = 0 plane (Figures 9.11–9.12):

Uniform magnitude in all directions (circular pattern).
Field orientations confirm radial power flow.

Power flow: The Poynting vector Ẽ × H̃ is always directed radially outward from the dipole, confirming that power flows away from the source in all directions.

Example: An observer on the z-axis sees zero field, while an observer in the xy-plane (perpendicular to the dipole) sees maximum field strength.

Far-Field Radiation from a Thin Straight Filament of Current

9.6 Far-Field Radiation from a Thin Straight Filament of Current

🧭 Overview

🧠 One-sentence thesis

The electric field radiated by a thin straight current filament can be calculated by treating the filament as a collection of many short Hertzian dipoles and summing their contributions, which simplifies dramatically when field points are far from the source.

📌 Key points (3–5)

What "thin straight filament" means: a current distribution aligned along a straight line whose cross-section is much smaller than a wavelength (electrically thin).
The key simplification method: approximate the filament as many short segments (Hertzian dipoles) radiating independently, then sum their fields.
Far-field approximations: when distance r is much greater than both the filament length L and wavelength λ, angles and unit vectors become approximately constant, and the parallel-ray approximation applies.
Common confusion: the distance |r - ẑz'| must be treated differently for magnitude (use ≈ r) versus phase (use ≈ r - z' cos θ) because phase is more sensitive to small differences.
Final result: the radiated electric field is given by an integral over the current distribution, with magnitude proportional to 1/r and phase varying as e^(-jβr).

🔧 Defining the current distribution

🔧 What is a thin straight filament

A thin straight current filament: a current distribution aligned along a straight line, with maximum cross-sectional dimension much less than a wavelength (electrically thin).

The current magnitude and phase along the filament are described by the phasor Ĩ(z) (units: amperes).
The only general constraint is that current must be zero at the ends: Ĩ(z) = 0 for |z| ≥ L/2.
Examples mentioned: electrically-short dipole (Section 9.5) and half-wave dipole (Section 9.7).
The excerpt suggests reviewing the electrically-short dipole section first for a gentler introduction.

📐 Why this class matters

Despite different lengths L and current distributions, members of this class share common radiation characteristics.
These commonalities become especially apparent when examining field points far from the current.
The excerpt focuses on the far-field case to exploit these simplifications.

🧩 The Hertzian dipole approximation method

🧩 Why use Hertzian dipoles

Alternative approach: calculate magnetic vector potential Ã by integration, then derive H̃ and Ẽ through curl operations and Ampere's law.
Simpler approach chosen: treat the filament as many short segments radiating independently.
Each segment is so short (relative to both filament length and wavelength) that current over it can be approximated as constant.
This makes each segment effectively a Hertzian dipole, for which a solution already exists.

📝 Starting from the known solution

The excerpt recalls that a ẑ-directed Hertzian dipole at the origin radiates:

Ẽ(r) ≈ θ̂ jη Ĩ (β Δl) / (4π) (sin θ) e^(-jβr) / r

Where:

η is the wave impedance of the medium (≈ 377 Ω for free space)
β = 2π/λ is the phase propagation constant
The medium is assumed lossless (attenuation constant α ≈ 0)
Field points are far from the filament (r ≫ λ)

🔄 Adapting to a segment at position z'

For a segment at the origin (z' = 0), the field is written with current Ĩ(0).
Shrinking segment length Δl to differential length dz' gives the differential contribution dẼ.
For any segment at position z', the contribution becomes:

dẼ(r; z') ≈ θ̂' jηβ Ĩ(z') / (4π) (sin θ') e^(-jβ|r - ẑz'|) / |r - ẑz'| dz'

Note: θ is replaced by θ' because the ray r - ẑz' forms a different angle with respect to ẑ.
Similarly, θ̂ is replaced by θ̂', which varies with z'.

➕ Integrating over all segments

The total electric field is obtained by integrating contributions from all segments:

Ẽ(r) ≈ ∫[from -L/2 to +L/2] dẼ(r̂; z')

Expanded form:

Ẽ(r) ≈ j (ηβ)/(4π) ∫[from -L/2 to +L/2] θ̂' Ĩ(z') (sin θ') e^(-jβ|r - ẑz'|) / |r - ẑz'| dz'

🌐 Far-field approximations

🌐 Simplifying angles and unit vectors

Key assumption: r ≫ L (distance to field points is much greater than filament length).

Consequences:

θ' ≈ θ (angle is approximately constant along the filament)
θ̂' ≈ θ̂ (unit vector is approximately constant)
These can be moved outside the integral:

Ẽ(r) ≈ θ̂ j (ηβ)/(4π) (sin θ) ∫[from -L/2 to +L/2] Ĩ(z') e^(-jβ|r - ẑz'|) / |r - ẑz'| dz'

📏 The parallel ray approximation

Geometric insight: Since r ≫ L, the vector r is approximately parallel to r - ẑz'.

This leads to: |r - ẑz'| ≈ r - r̂ · ẑz' ≈ r - z' cos θ

Don't confuse: This approximation must be applied differently depending on whether it affects magnitude or phase.

⚖️ Different treatments for magnitude vs phase

Context	Approximation	Reason
Magnitude	\|r - ẑz'\| ≈ r	The magnitude of r - r̂ · ẑz' must be approximately r since r ≫ L
Phase	\|r - ẑz'\| ≈ r - z' cos θ	Phase is more sensitive; the term z' cos θ matters in the exponential

For the phase-dependent part: e^(-jβ|r - ẑz'|) ≈ e^(-jβr) e^(+jβz' cos θ)

The first exponential e^(-jβr) can be factored out of the integral; the second e^(+jβz' cos θ) remains inside.

🎯 Why these are called far-field approximations

Far field approximation: simplifications valid only for distances "far" from the source, specifically when r ≫ L and r ≫ λ.

These assumptions allow treating angles and unit vectors as constant.
They enable the parallel-ray approximation for distance calculations.
Without these, the integral would be much more complex.

📐 The final formula

📐 Applying all simplifications

After applying magnitude and phase approximations to the integral:

Ẽ(r) ≈ θ̂ j (ηβ)/(4π) e^(-jβr)/r (sin θ) · ∫[from -L/2 to +L/2] Ĩ(z') e^(+jβz' cos θ) dz'

Eliminating the factor β using β = 2π/λ:

Ẽ(r) ≈ θ̂ j (η)/(2λ) e^(-jβr)/r (sin θ) · ∫[from -L/2 to +L/2] Ĩ(z') e^(+jβz' cos θ) dz'

📋 Summary of the result

The electric field radiated by a thin, straight, ẑ-directed current filament of length L located at the origin and aligned along the z axis is given by the formula above. This expression is valid for r ≫ L and r ≫ λ.

Structure of the formula:

Direction: θ̂ (perpendicular to the radial direction, in the plane containing ẑ and r̂)
Magnitude decay: proportional to 1/r
Phase: e^(-jβr) represents outward-propagating wave
Angular dependence: (sin θ) factor, plus the integral which also depends on θ
Current distribution: encoded in the integral over Ĩ(z')

🧲 Finding the magnetic field

At field points satisfying r ≫ L and r ≫ λ:

The wave appears locally planar.
Therefore, the plane wave relationship applies: H̃ = (1/η) r̂ × Ẽ
This avoids needing separate calculation of the magnetic field.

✅ Verification

The excerpt notes that one can verify Equation 9.96 yields the expected result for the electrically-short dipole (Section 9.5), confirming the method's consistency.

Far-Field Radiation from a Half-Wave Dipole

9.7 Far-Field Radiation from a Half-Wave Dipole

🧭 Overview

🧠 One-sentence thesis

The half-wave dipole produces a radiation pattern similar to the electrically-short dipole but with greater maximum field magnitude for the same terminal current, and its far-field can be calculated using a cosine current distribution and far-field approximations.

📌 Key points (3–5)

Far-field approximation: when the observation distance r is much greater than both the antenna length L and wavelength λ, magnitude and phase can be simplified separately.
Half-wave dipole current distribution: a cosine pulse distribution where current is maximum at the center and zero at the ends, resembling a standing wave.
Common confusion: magnitude vs phase approximations—for magnitude calculations use r alone, but for phase calculations must account for the extra path-length term involving cos θ.
Comparison with ESD: the HWD and electrically-short dipole have identical polarization and zero radiation along the axis, but the HWD radiates stronger fields in the maximum direction for the same terminal current.
Plane wave relationship: at far distances, the wave appears locally planar, so the magnetic field can be found from the electric field using the simple cross-product relationship.

📏 Far-field approximation mechanics

📏 Distance requirements

The far-field approximations are valid only when:

r ≫ L (observation distance much greater than antenna length)
r ≫ λ (observation distance much greater than wavelength)

These conditions define what "far" means—the observer must be distant compared to both the physical size of the antenna and the electromagnetic wavelength.

🔀 Magnitude vs phase approximations

The excerpt distinguishes two different approximations for the distance term:

Purpose	Approximation	Reason
Magnitude	Use r alone	The small correction term is negligible for amplitude
Phase	Use r - z′ cos θ	Phase is sensitive to path-length differences

Why this matters:

The phase term appears in an exponential (e to the power of -jβ times distance), so even small distance differences create noticeable phase shifts.
The magnitude term appears in a denominator (1/r), so small corrections are less significant.

Example: The phase approximation splits into two exponential factors: one with r (the main propagation) and one with z′ cos θ (the correction for different source points along the antenna).

🌊 Locally planar waves

At far distances satisfying the conditions above:

The wave appears locally planar (flat) rather than spherical.
This justifies using the plane wave relationship: magnetic field equals (1/η) times the unit radial vector crossed with the electric field.
This is much simpler than applying Ampere's law directly.

🎯 Half-wave dipole characteristics

🎯 Current distribution definition

Half-wave dipole current distribution: ˜I(z) ≈ I₀ cos(π z/L) for |z| ≤ L/2, where L = λ/2.

Key features:

Cosine pulse shape: maximum at the center (z = 0), zero at the ends (z = ±L/2).
I₀ is a complex constant: indicates both the maximum magnitude and phase of the current.
Physical interpretation: resembles the sinusoidal variation of current in a standing wave on a wire.
The distribution is "very similar to the triangular distribution of the ESD" (electrically-short dipole).

📐 Mathematical forms

The current distribution can be written two equivalent ways:

In terms of antenna length: I₀ cos(π z/L)
In terms of wavelength: I₀ cos(2π z/λ)

Both are valid because L = λ/2 for the half-wave dipole.

⚡ Electric field result

The far-field electric field is:

Direction: θ-hat (tangent to the sphere, perpendicular to the radial direction)
Contains a factor: cos[(π/2) cos θ] / sin θ
Multiplied by: (j η I₀)/(2π) times (e to the power of -jβr)/r

The quantity in the numerator involving cos and sin of θ determines the angular radiation pattern.

🔄 Comparison with electrically-short dipole

🔄 Similarities

Both the HWD and ESD share:

Identical polarization characteristics: the electric field points in the same direction for both.
Zero radiation along the axis: both produce zero field magnitude along the z-axis (the dipole axis).
Maximum in the perpendicular plane: both have maximum field magnitude in the z = 0 plane.

🔄 Key difference

For the same terminal current I₀:

The HWD produces greater maximum magnitude than the ESD.
This means the half-wave dipole is more efficient at radiating power.

Don't confuse: the comparison is for the same terminal current; if comparing different current values, the relationship would change.

📊 Radiation pattern comparison

The excerpt references Figure 9.17, which shows:

A comparison in "any radial plane containing the z-axis" (a vertical slice through the radiation pattern).
Both patterns have the same general shape but different magnitudes.
Both go to zero at θ = 0° and θ = 180° (along the dipole axis).

🧮 Calculation method

🧮 Integration approach

To find the radiated field:

Start with the general formula for a current filament (Equation 9.99).
Substitute the specific current distribution ˜I(z′) for the HWD.
Evaluate the integral over the antenna length from -λ/4 to +λ/4.

The excerpt notes:

"The evaluation of this integral is straightforward, but tedious."
The integral reduces to: (I₀/π) times cos[(π/2) cos θ] / sin² θ.

🧲 Finding the magnetic field

Two methods are available:

Complex method: Apply Ampere's law directly.
Simpler method: Use the plane wave relationship ˜H = (1/η) times r-hat crossed with ˜E.

The simpler method works because the wave is locally planar in the far field. This indicates the magnetic field is φ-hat directed (circumferential direction in spherical coordinates).

Radiation from Surface and Volume Distributions of Current

9.8 Radiation from Surface and Volume Distributions of Current

🧭 Overview

🧠 One-sentence thesis

Any current distribution—whether confined to a line, spread over a surface, or flowing through a volume—can be analyzed by treating it as a superposition of differential current moments, each radiating according to the same fundamental formula.

📌 Key points (3–5)

Core idea: All current distributions (line, surface, volume) radiate electromagnetic fields that can be calculated from the magnetic vector potential.
Unifying principle: A differential current moment can be expressed equivalently as current×length, surface-current-density×area, or volume-current-density×volume—all have the same units (A·m).
Method: Start with the solution for a point current moment, then integrate over the entire distribution using superposition.
Common confusion: Don't confuse the type of distribution (line/surface/volume) with the physics—the underlying current moment is the same, only the integration domain changes.
Practical outcome: Two key formulas (Equations 9.112 and 9.113) give the magnetic vector potential for surface and volume currents; from there, electric and magnetic fields follow.

🔄 From point sources to distributed currents

🔄 The starting point: differential current moment

The excerpt begins by recalling the solution for a single point current:

Current moment: d˜J(r) = ˆl ˜I dl δ(r − r′)

ˆl: direction of current flow
˜I: current magnitude (units: amperes, A)
dl: differential length (units: meters, m)
δ(r − r′): Dirac delta function (units: per cubic meter, m⁻³)

The product ˜I dl has units of A·m (ampere-meters), which is the fundamental quantity describing a current moment.

🧲 Magnetic vector potential from a point

The magnetic vector potential at field point r due to this differential current moment at source point r′ is:

d˜A(r; r′) = ˆl (μ ˜I dl / 4π) · [e^(−γ|r−r′|) / |r−r′|]

μ: permeability of the medium
γ: propagation constant
The exponential and distance terms describe how the field propagates and attenuates.

🔁 Superposition principle

Any extended current distribution is a collection of these differential moments. By superposition:

˜A(r) = ∫ d˜A(r; r′)

The integral sums contributions from all source points r′.

🔀 Three equivalent descriptions of the same current moment

🔀 Line, surface, and volume formulations

The excerpt emphasizes that the same differential current moment can be written three ways:

Type	Formula	Current quantity	Differential element	Product units
Line	˜I dl	Current ˜I (A)	Length dl (m)	A·m
Surface	˜J_s ds	Surface current density ˜J_s (A/m)	Area ds (m²)	A·m
Volume	˜J dv	Volume current density ˜J (A/m²)	Volume dv (m³)	A·m

Key insight: All three expressions equal the same current moment:

˜I dl = ˜J_s ds = ˜J dv

🧩 Why this matters

The physics is identical—only the geometric description changes.
You choose the formulation that matches how the current is distributed in your problem.
Example: Current flowing on a thin wire → use line current. Current flowing on a metal sheet → use surface current. Current flowing through a plasma or conductor volume → use volume current.

⚠️ Don't confuse

Surface current density ˜J_s (A/m) is not the same as volume current density ˜J (A/m²)—they have different units and apply to different geometries.
The differential element (dl, ds, dv) must match the type of current density you're using.

📐 Formulas for surface and volume distributions

📐 Surface current distribution

By substituting ˜I dl → ˜J_s ds into the integral, the excerpt derives:

˜A(r) = (μ / 4π) ∫_S [˜J_s(r′) · e^(−γ|r−r′|) / |r−r′|] ds

˜J_s(r′): surface current density at source point r′ (a vector: direction ˆl times magnitude)
S: the surface over which current flows
ds: differential area element on that surface

📐 Volume current distribution

Similarly, substituting ˜I dl → ˜J dv gives:

˜A(r) = (μ / 4π) ∫_V [˜J(r′) · e^(−γ|r−r′|) / |r−r′|] dv

˜J(r′): volume current density at source point r′
V: the volume in which current flows
dv: differential volume element

🔧 Next steps

Once you have the magnetic vector potential ˜A(r), you can find the electric and magnetic fields using the procedure from Section 9.2 (referenced but not detailed in this excerpt).

🛠️ Method summary

🛠️ The "recycling" approach

The excerpt describes the derivation as "recycling" the line-current solution:

Start with the known solution for a differential line current moment.
Recognize that the same current moment can be expressed for surface or volume distributions.
Replace ˜I dl with ˜J_s ds or ˜J dv as appropriate.
Integrate over the surface S or volume V where current flows.

🛠️ Why this works

The principle of superposition allows you to add up contributions from infinitesimal current moments.
Each infinitesimal moment radiates independently according to the same fundamental law.
The integral accounts for the phase and amplitude variations across the distribution.

🛠️ Practical use

For a given current distribution ˜J_s(r′) or ˜J(r′), evaluate the integral (often numerically or with approximations).
The result is the magnetic vector potential everywhere in space.
From ˜A, derive ˜E and ˜H to fully characterize the radiated electromagnetic field.

Example: If you have a rectangular metal plate with a known surface current pattern, use Equation 9.112 to find the radiated field. If you have a conducting sphere with volume current, use Equation 9.113.

How Antennas Radiate

10.1 How Antennas Radiate

🧭 Overview

🧠 One-sentence thesis

Antennas radiate because they support time-varying current distributions that do not cancel out, unlike ideal transmission lines where opposing currents on closely-spaced conductors produce negligible far-field radiation.

📌 Key points (3–5)

What an antenna does: converts conductor-bound voltage and current signals into electromagnetic waves (or vice versa).
Why transmission lines don't radiate: currents on the two conductors are equal in magnitude but opposite in direction, so their radiation cancels at far distances.
How to make a transmission line radiate: modify the geometry (e.g., bend the ends) so that the current distribution no longer cancels—this creates an antenna.
Common confusion: time-varying currents generally radiate, but in transmission lines the opposing currents cancel each other's fields in the far field; antennas work by breaking this cancellation.
General principle: any antenna supports a time-varying current distribution that radiates away from the device.

🔌 What an antenna is

🔌 Antenna as a transducer

An antenna is a transducer; that is, a device which converts signals in one form into another form.

The two forms are:
1. Conductor-bound voltage and current signals
2. Electromagnetic waves
Traditional passive antennas work in both directions (transmit and receive).
This section focuses on the transmit case: converting a bound signal into a radiating electromagnetic wave.

⚡ The source of radiation

Radiation from an antenna is due to time-varying current excited by the electrical signal applied to the antenna terminals.
Time-varying currents generally give rise to radiation.
The key question: why do some structures radiate while others (like transmission lines) do not?

🚫 Why ideal transmission lines don't radiate

🚫 The twin lead scenario

Consider a sinusoidal source applied to an ideal twin lead transmission line terminated in an open circuit.
Key properties:
- Spacing between conductors is much less than a wavelength.
- Current must be zero at the open-circuit end.
- At any position, the current on each conductor is equal in magnitude and flows in opposite directions.
The source varies periodically, so the sign of the current changes every half-period.

🔬 Analyzing radiation using Hertzian dipoles

The current can be interpreted as a collection of Hertzian dipoles.
Superposition applies: total radiation is the sum of radiation from individual Hertzian dipoles.
Each Hertzian dipole on one conductor has a corresponding dipole on the other conductor:
- They are only a tiny fraction of a wavelength apart.
- They are identical in magnitude but opposite in sign.
Result: the radiated field from any such pair is approximately zero at distances sufficiently far from the transmission line.

📏 Far-field cancellation

The radiation from an ideal twin lead transmission line with open circuit termination is negligible at distances much greater than the separation between the conductors.

At distances not large relative to conductor separation, the situation is different:
- Hertzian dipole pairs do not appear precisely collocated.
- Cancellation is less precise.
- Resulting fields are not negligible and depend on conductor separation.
Don't confuse: near-field vs far-field behavior—cancellation is effective only in the far field.

🎯 Creating an antenna from a transmission line

🎯 The modified twin lead

Modify the twin lead by bending the ends into right angles.
The new section has an overall length much less than one-half wavelength.
Current distribution on the modified section:
- Current must still be zero at the ends of the conductors.
- Since the section is much shorter than one-half wavelength, the current distribution is very simple.
- It exhibits the pattern of an electrically-short dipole (ESD): maximum at the center, decreasing approximately linearly to zero at the ends.
- Current is continuous at the junction between unmodified and modified sections.

📡 Why the modified section radiates

The modified section exhibits the current distribution of an ESD.
The unmodified transmission line section does not radiate (as established earlier).
Conclusion: this system radiates precisely as an ESD does.
Alternative view: instead of a "modified transmission line," view it as an unmodified transmission line attached to an antenna (the ESD).
Example: by bending the ends, you break the cancellation that prevented radiation—the currents at the ends no longer have opposing partners nearby.

🌐 The general principle

🌐 Any antenna works this way

The insight developed for the ESD applies generally to all antennas, not just dipoles.
Any antenna can be viewed as a structure that supports a current distribution that radiates.

🌐 Definition of a transmitting antenna

A transmitting antenna is a device that, when driven by an appropriate source, supports a time-varying distribution of current resulting in an electromagnetic wave that radiates away from the device.

This is a restatement of the initial definition, but now we understand:
- How this happens: by supporting a current distribution that does not cancel.
- Why transmission lines typically aren't antennas: their opposing currents cancel in the far field.
The key difference: antennas break the symmetry that causes cancellation in transmission lines.

Power Radiated by an Electrically-Short Dipole

10.2 Power Radiated by an Electrically-Short Dipole

🧭 Overview

🧠 One-sentence thesis

The power radiated by an electrically-short dipole antenna increases with the square of both the terminal current and the electrical length of the antenna, and can be calculated by integrating the power density over a closed surface surrounding the antenna.

📌 Key points (3–5)

What is derived: total power radiated by an electrically-short dipole (ESD) in response to sinusoidal current at the terminals.
How it's calculated: by integrating time-average power density over a closed surface enclosing the antenna, using far-field approximations.
Key dependencies: radiated power is proportional to the square of terminal current magnitude and the square of electrical length (βL).
Common confusion: electrical length vs physical length—electrical length is physical length expressed in radians where 2π radians equals one wavelength, so the same physical antenna radiates more power at higher frequencies.
Why it matters: this result is both useful on its own and necessary for determining the impedance of the ESD.

📡 Starting point and assumptions

📡 Electric field in the far field

The analysis begins with the electric field intensity for a z-oriented ESD at the origin:

E(r) ≈ (θ-hat) × j × η × I₀ × (βL / 8π) × sin(θ) × e^(-jβr) / r

Where:

r = field point location
I₀ = complex number representing peak magnitude and phase of terminal current
L = length of the ESD
β = phase propagation constant = 2π/λ (wavelength)
η = wave impedance of the medium

🔍 Validity conditions

This equation is valid only under specific constraints:

Electrically-short: L ≪ λ (length much smaller than wavelength)
Far-field conditions: r ≫ L and r ≫ λ (observation point far from antenna)
Medium properties: simple media (linear, homogeneous, time-invariant, isotropic) with negligible loss

Don't confuse: "far field" means both distance conditions must be satisfied—far compared to both the antenna length and the wavelength.

⚡ Power density and integration setup

⚡ Approximating as a plane wave

In the far field, the electromagnetic field at each point r can be approximated as a plane wave propagating radially outward (in the r-hat direction).

Under this assumption, the time-average power density is:

S(r) = r-hat × |E(r)|² / (2η)

This represents power per unit area (W/m²) flowing radially outward.

🔢 Total radiated power integral

The total power radiated by the antenna is found by integrating power density over any closed surface S that encloses the antenna:

P_rad = ∮_S S(r) · ds

Where ds is the outward-facing differential element of surface area.

The physical meaning: power density (W/m²) integrated over area (m²) gives total power (W).

🌐 Spherical coordinates

For this problem, spherical coordinates are natural. The differential surface element is:

ds = r-hat × r² × sin(θ) dθ dφ

This leads to the double integral:

P_rad = (1 / 2η) ∫₀^π ∫₀^(2π) |E(r)|² × r² × sin(θ) dθ dφ

🧮 Carrying out the calculation

🧮 Magnitude of the electric field

From the far-field expression, the magnitude squared is:

|E(r)|² ≈ η² × |I₀|² × (βL)² / (64π²) × sin²(θ) × (1/r²)

Notice the 1/r² dependence, which will cancel with the r² in the surface element.

📐 Simplifying the integral

Substituting into the power integral:

P_rad ≈ (η × |I₀|² × (βL)²) / (128π²) × ∫₀^π ∫₀^(2π) sin³(θ) dθ dφ

The integral can be factored into separate integrals over θ and φ:

The φ integral is simply 2π (integrating around the full circle)
This leaves: P_rad ≈ (η × |I₀|² × (βL)²) / (64π) × ∫₀^π sin³(θ) dθ

✅ Final result

The remaining integral over θ equals 4/3, yielding:

P_rad ≈ (η × |I₀|² × (βL)²) / (48π)

This is the power radiated by an electrically-short dipole in response to current I₀ at the terminals.

🔬 Units check

β has units of rad/m, so βL is in radians (dimensionless in SI base units)
This leaves η × |I₀|² with units of Ω × A² = W (watts), as expected

📊 Physical interpretation and dependencies

📊 Current dependence

Radiated power is proportional to the square of the terminal current magnitude:

Doubling the current quadruples the radiated power
This is consistent with power being proportional to current squared in resistive elements

📏 Electrical length dependence

The product βL = 2πL/λ is the electrical length of the antenna:

Electrical length: the physical length of the antenna expressed in radians, where 2π radians equals one wavelength.

Key insight: radiated power increases as the square of electrical length.

This means:

Longer antennas (relative to wavelength) radiate more power for the same current
The same physical antenna radiates more power at higher frequencies (shorter wavelengths)

Don't confuse: a 10 cm antenna is electrically short at 30 MHz (λ ≈ 10 m) but might not be at 3 GHz (λ ≈ 10 cm).

🧪 Worked example

Problem: A dipole is 10 cm long, surrounded by free space. A 30 MHz sinusoidal current with peak magnitude 100 mA is applied. What power is radiated?

Solution steps:

Calculate wavelength: λ = c/f ≈ 10 m
Check if electrically short: L ≈ 0.01λ ✓ (qualifies as electrically short)
Identify parameters:
- η ≈ 376.7 Ω (free space wave impedance)
- I₀ = 100 mA = 0.1 A
- β = 2π/λ ≈ 0.628 rad/m
Apply formula: P_rad ≈ 98.6 μW

Note: The solution assumes no power is dissipated within the antenna (all delivered power is radiated).

Power Dissipated by an Electrically-Short Dipole

10.3 Power Dissipated by an Electrically-Short Dipole

🧭 Overview

🧠 One-sentence thesis

Power delivered to an electrically-short dipole antenna is partly dissipated as heat within the antenna due to the finite conductivity of the wire, and this loss can be quantified as a resistance perceived by the source at the terminals.

📌 Key points (3–5)

What is being calculated: the power dissipated inside the antenna (not radiated) due to finite conductivity of the wire material.
Method: model the antenna as many tiny current segments, each with resistance; sum the power dissipated in all segments.
Key result: the loss resistance R_loss depends on wire length L, radius a, conductivity σ, permeability μ, and frequency f.
Common confusion: R_loss is not the total impedance of the antenna—it is only the resistive contribution from internal losses; radiation resistance and reactance are separate contributions.
Why it matters: knowing dissipated power allows calculation of radiation efficiency and helps determine the antenna's impedance.

🔬 Modeling power dissipation in the antenna

🔬 Current distribution in the electrically-short dipole

The current distribution along an electrically-short dipole (ESD) aligned on the z-axis is approximately: ˜I(z) ≈ I₀ (1 − 2|z|/L), where I₀ is the peak current magnitude and phase at the terminals, and L is the antenna length.

The current is maximum at the center (z = 0) and decreases linearly to zero at the ends (z = ±L/2).
This distribution can be interpreted as a set of many very short segments ("Hertzian dipoles"), each carrying a constant current magnitude ˜I(zₙ) at position zₙ.
Example: if the antenna is 10 cm long and the peak current is 100 mA, the current at the center is 100 mA and drops to zero at the tips.

⚡ Power dissipated in each segment

Each tiny segment of length Δl at position zₙ dissipates power according to: P_seg(zₙ) = (1/2) |˜I(zₙ)|² R_seg.
R_seg is the resistance of one segment, which depends on the wire's material and geometry.
The factor 1/2 comes from the time-average power for sinusoidal currents.

🧮 Segment resistance for a good conductor

Assume the wire has circular cross-section with radius a and is made of a good conductor (conductivity σ, permeability μ).
The resistance of a segment of length Δl is: R_seg ≈ (1/2) √(μf / πσ) · (Δl / a), where f is frequency.
This formula comes from the skin effect in good conductors at high frequencies.
Substituting into the power expression: P_seg(zₙ) ≈ (1 / 4a) √(μf / πσ) |˜I(zₙ)|² Δl.

🧩 Total power loss and loss resistance

🧩 Summing over all segments

Total power dissipated P_loss is the sum of power in all N segments: P_loss ≈ Σ [(1 / 4a) √(μf / πσ) |˜I(zₙ)|² Δl].
Factor out constants: P_loss ≈ (1 / 4a) √(μf / πσ) Σ |˜I(zₙ)|² Δl.
As Δl becomes infinitesimally small (differential length dz′), the sum becomes an integral from z′ = −L/2 to z′ = +L/2.
Important: this integral form applies to any straight wire antenna of length L, not just the ESD.

📐 Evaluating the integral for the ESD

Substitute the ESD current distribution: P_loss ≈ (1 / 4a) √(μf / πσ) ∫ |I₀ (1 − 2|z′|/L)|² dz′.
Factor out |I₀|²: P_loss ≈ (1 / 4a) √(μf / πσ) |I₀|² ∫ |1 − 2|z′|/L|² dz′.
The integral evaluates to L/3 (the excerpt notes this is "straightforward to solve, albeit a bit tedious").
Final result: P_loss ≈ (L / 12a) √(μf / πσ) |I₀|².

🔌 Loss resistance perceived by the source

A current source driving the antenna does not "see" the varying current distribution or power along the antenna.
Instead, it perceives a net resistance R_loss such that P_loss = (1/2) |I₀|² R_loss.
Comparing the two expressions for P_loss: R_loss ≈ (L / 6a) √(μf / πσ).
This is the loss resistance of the ESD.

Quantity	Formula	Meaning
P_loss	(L / 12a) √(μf / πσ) \|I₀\|²	Power dissipated inside the antenna
R_loss	(L / 6a) √(μf / πσ)	Resistance due to internal losses
Power formula	(1/2) \|I₀\|² R_loss	Power in terms of terminal current and loss resistance

⚠️ What loss resistance is and is not

⚠️ Not the total impedance

R_loss is not the impedance of the ESD. R_loss is merely the contribution of internal loss to the impedance of the ESD.

The total impedance of the antenna has three contributions:
1. Loss resistance R_loss (real-valued, from internal dissipation).
2. Radiation resistance (real-valued, from radiated power).
3. Reactance (imaginary-valued, from energy storage).
Don't confuse: R_loss accounts only for power turned into heat inside the wire, not for power radiated away.
Example: if R_loss = 9.49 mΩ and radiation resistance is much larger, most power is radiated, not dissipated.

🧪 Example calculation

Problem: A 10 cm dipole, 1 mm radius, aluminum (conductivity ≈ 3.7 × 10⁷ S/m, μ ≈ μ₀), in free space. Frequency 30 MHz, peak current 100 mA. What is the power dissipated?

Solution:

Wavelength λ = c/f ≈ 10 m, so L = 10 cm ≈ 0.01λ → qualifies as electrically-short.
Use R_loss ≈ (L / 6a) √(μf / πσ) with L = 0.1 m, a = 0.001 m, σ ≈ 3.7 × 10⁷ S/m.
Result: R_loss ≈ 9.49 mΩ.
Power dissipated: P_loss = (1/2) |I₀|² R_loss ≈ 47.5 μW.

🛠️ Other sources of loss

The analysis focuses on limited conductivity of the wire.
Additional loss mechanisms may exist:
- Coatings on the antenna.
- Mechanical support materials near the terminals.
These materials may absorb and dissipate power that would otherwise be radiated, increasing the effective loss resistance.

Reactance of the Electrically-Short Dipole

10.4 Reactance of the Electrically-Short Dipole

🧭 Overview

🧠 One-sentence thesis

The reactance of an electrically-short dipole is always large and negative (approaching negative infinity as length approaches zero), which makes these antennas difficult to match with typical transmission lines in practical transmit applications.

📌 Key points

What reactance represents: the imaginary component of antenna impedance, representing energy storage (like capacitors or inductors store energy).
Zero-length extreme case: a zero-length dipole behaves as an open circuit with impedance 0 − j∞ (zero real part, negative infinite imaginary part).
How reactance changes with length: as dipole length increases from zero, reactance remains large and negative but becomes less negative monotonically.
Common confusion: the ESD behaves like a capacitor (negative reactance), but the frequency dependence is different—the capacitor analogy is valid only at one frequency at a time.
Practical challenge: the reactance is typically orders of magnitude larger than transmission line impedances (tens to hundreds of ohms), making ESDs hard to use in transmit applications.

🔌 Impedance fundamentals for antennas

🔌 What antenna impedance means

Impedance: the ratio of time-varying voltage across the antenna terminals to the time-varying current that flows in response.

Any antenna can be characterized by impedance, just like resistors, capacitors, or inductors.
The impedance has two components:
- Real part: accounts for power radiated away and power dissipated (lost) within the antenna.
- Imaginary part (reactance): represents energy storage within the antenna.

⚡ Energy storage analogy

Reactance in antennas works the same way as in circuit elements:
- Capacitor reactance → electrical energy storage.
- Inductor reactance → magnetic energy storage.
- Antenna reactance → energy storage within the antenna structure.

🔬 The zero-length dipole thought experiment

🔬 Why consider zero length

A zero-length dipole is an extreme case of an electrically-short dipole.
It provides a boundary condition to deduce useful information about real short dipoles.
A zero-length dipole is equivalent to an open circuit.

🔍 Impedance of an open circuit

The impedance is 0 − j∞, not simply "infinite":

Component	Value	Reason
Real part	0	No power transfer into the termination
Imaginary magnitude	∞	Current is zero regardless of voltage
Imaginary sign	Negative	Reflected current changes sign (total current = 0), but reflected voltage does not change sign

Don't confuse: "infinite impedance" must be broken into real and imaginary parts with correct signs.
The zero-length dipole impedance is 0 − j∞.

📏 How reactance changes with dipole length

📏 Behavior as length increases from zero

Starting from zero length (reactance = −j∞), as the dipole gets longer:
- Reactance remains large and negative.
- Reactance increases monotonically (becomes less negative).
- This trend continues until the dipole is no longer electrically short.

🔋 Capacitor-like behavior

The reactance is negative and increases toward zero with increasing frequency, similar to a capacitor.
Circuit diagrams sometimes represent ESD reactance as a capacitor.
Important limitation: the specific frequency dependence differs from a true capacitor, so this model is valid only for analysis at one frequency at a time.

Example: In the worked example, a 10 cm dipole at 30 MHz has reactance approximately −11.1 kΩ, which is "approximately the same reactance as a 0.47 pF capacitor at the same frequency"—but only at that frequency.

🧮 Approximate formula for ESD reactance

🧮 The expression

The reactance X_A of an electrically-short dipole can be estimated by:

X_A ≈ −(120 Ω / (π L / λ)) × [ln(L / (2a)) − 1]

Where:

L = dipole length
a = radius of the wire (a ≪ L)
λ = wavelength

✅ Formula behavior check

As L → 0, X_A → −∞ (matches the zero-length case).
As L increases from zero, X_A increases monotonically (becomes less negative).
This confirms the expected physical behavior.

📐 Worked example

Given: dipole length L = 10 cm, radius a = 1 mm, frequency = 30 MHz in free space.

Wavelength λ = c / f ≈ 10 m.
L = 10 cm ≈ 0.01λ → qualifies as electrically-short.
Using the formula: X_A ≈ −11.1 kΩ.
For comparison: this is about the same reactance as a 0.47 pF capacitor at 30 MHz.

⚠️ Practical implications

⚠️ Magnitude comparison

The reactance of an ESD is typically orders of magnitude larger than:
- The real part of the ESD's own impedance.
- The characteristic impedance of typical transmission lines (usually tens to hundreds of ohms).

🚧 Difficulty in transmit applications

The huge reactance mismatch makes ESDs quite difficult to use in practical transmit applications.
Matching the antenna to the transmission line becomes a major challenge.
Don't confuse: the problem is not that the antenna doesn't work—it's that the impedance mismatch makes efficient power transfer very difficult.

Equivalent Circuit Model for Transmission; Radiation Efficiency

10.5 Equivalent Circuit Model for Transmission; Radiation Efficiency

🧭 Overview

🧠 One-sentence thesis

An antenna in transmit mode can be modeled as a passive impedance circuit whose components represent radiated power, dissipated power, and stored energy, enabling analysis using conventional circuit theory.

📌 Key points (3–5)

Passive antenna principle: the antenna does not add power; delivered power is either radiated, dissipated, or stored.
Impedance decomposition: antenna impedance Z_A consists of radiation resistance R_rad (power radiated), loss resistance R_loss (power dissipated), and reactance X_A (energy stored).
Circuit formalism utility: modeling the antenna as an impedance allows use of standard circuit and transmission line theory for impedance matching and power transfer analysis.
Common confusion: reactance X_A does not directly account for power disposition—it only affects the relationship between voltage and current, just like capacitors and inductors.
Radiation efficiency: when loss is non-negligible, efficiency e_rad quantifies the fraction of delivered power that is actually radiated.

🔌 Antenna as a passive circuit element

🔌 Conservation of power principle

The excerpt emphasizes that the antenna is passive—it does not add power. By conservation of power, only three outcomes are possible for power delivered to the antenna:

Power converted to a propagating electromagnetic wave (the desired outcome)
Power dissipated within the antenna
Energy stored by the antenna, analogous to storage in a capacitor or inductor

These outcomes can occur in any combination.

🧮 Impedance definition

Antenna impedance Z_A: the ratio of voltage Ṽ_A to current Ĩ_A at the terminals.

In the phasor domain, Z_A is complex-valued and has both real and imaginary components:

Real component: represents power transfer
Imaginary component: represents energy storage

This leads to the form: Z_A = R_A + j X_A, where R_A represents power transferred and X_A represents energy stored.

🔍 Full impedance breakdown

The real component R_A further divides into two parts:

Z_A = R_rad + R_loss + j X_A

where:

R_rad (radiation resistance): power transferred and subsequently radiated
R_loss (loss resistance): power transferred and subsequently dissipated
X_A (reactance): energy stored by the antenna

⚡ Power transfer and component roles

⚡ Time-average power calculation

When voltage is applied across antenna terminals, the time-average power P_A transferred to the antenna is:

P_A = (1/2) Re{Ṽ_A Ĩ*_A}

(assuming peak units for voltage and current).

Substituting Z_A = R_rad + R_loss + j X_A and Ṽ_A = Z_A Ĩ_A, this reduces to:

P_A = (1/2)|Ĩ_A|² R_rad + (1/2)|Ĩ_A|² R_loss

📡 Radiated vs dissipated power

The power splits into two components:

Component	Formula	Meaning
P_rad	(1/2)\|Ĩ_A\|² R_rad	Power transferred to the radiating electromagnetic field
P_loss	(1/2)\|Ĩ_A\|² R_loss	Power dissipated within the antenna

🔋 Role of reactance

The reactance X_A plays a role in determining current Ĩ_A given voltage Ṽ_A (and vice versa), but does not by itself account for disposition of power.

Don't confuse: Reactance affects the voltage-current relationship but does not directly represent power loss or radiation—this is exactly analogous to the role played by inductors and capacitors in a circuit.

🛠️ Practical applications of the circuit model

🛠️ Impedance matching

The equivalent circuit formalism allows treating the antenna like any other component, facilitating analysis using conventional electric circuit theory and transmission line theory.

Example applications:

Minimize reflection: Choose transmitter output impedance Z_S = Z_A so that the voltage reflection coefficient Γ = (Z_A - Z_S)/(Z_A + Z_S) = 0
Maximize power transfer: Choose Z_S = Z*_A (conjugate matching)

🔢 Determining component values

To use this formalism fully, values for R_rad, R_loss, and X_A are required:

Radiation resistance R_rad:

Defined by: R_rad = 2 P_rad / |Ĩ_A|⁻²
Procedure: Apply current Ĩ_A to antenna terminals, determine total radiated power P_rad, then calculate R_rad
The excerpt references Section 10.2 for an example with an electrically-short dipole

Loss resistance R_loss:

Represents dissipation within the antenna, usually from materials comprising or surrounding it
For antennas made from good conductors (metals), R_loss is often very low compared to R_rad and may be neglected
For electrically-short dipoles, both R_loss and R_rad are very small, so both must be considered
Antennas with lossy materials (e.g., microstrip patch antennas on printed circuit boards) have non-negligible R_loss

Antenna reactance X_A:

Accounts for energy stored by the antenna
May be due to internal reflections or non-propagating electric and magnetic fields surrounding the antenna
Significant reactance (|X_A| comparable to or greater than |R_A|) complicates impedance matching to the source
The excerpt references Section 10.4 for an example with an electrically-short dipole

📊 Radiation efficiency

📊 Efficiency definition

When R_loss is non-negligible, antennas are characterized by their radiation efficiency:

Radiation efficiency e_rad: the fraction of power which is radiated compared to the total power delivered to the antenna.

e_rad = P_rad / P_A

🧮 Efficiency in terms of resistance

Using the power equations, efficiency can be expressed as:

e_rad = R_rad / (R_rad + R_loss)

This formulation demonstrates the utility of the equivalent circuit formalism: efficiency is directly calculable from the resistance components.

Interpretation:

Higher R_rad relative to R_loss → higher efficiency
When R_loss is negligible, e_rad approaches 1 (100% efficiency)
When R_loss is significant, a substantial fraction of delivered power is dissipated rather than radiated

Impedance of the Electrically-Short Dipole

10.6 Impedance of the Electrically-Short Dipole

🧭 Overview

🧠 One-sentence thesis

The electrically-short dipole (ESD) has very low radiation resistance (proportional to the square of length-to-wavelength ratio) and large negative reactance, requiring impedance matching to efficiently transfer power from typical transmission lines.

📌 Key points (3–5)

Impedance components: antenna impedance consists of radiation resistance (power radiated), loss resistance (power dissipated internally), and reactance (stored energy).
ESD radiation resistance is tiny: for an ESD, R_rad ≈ 20π²(L/λ)² Ω, which is very small because L ≪ λ, but grows with the square of antenna length.
Common confusion: the Hertzian dipole (ideal, uniform current) has 4× higher radiation resistance than the practical ESD (triangular current distribution); nearly all real ESDs follow the ESD model, not the Hertzian model.
Large reactance problem: the ESD exhibits very large negative reactance (capacitive), making impedance matching essential to avoid power reflection.
Radiation efficiency: even when loss resistance is small, both R_rad and R_loss must be considered for ESDs because R_rad itself is very small.

🔌 Three components of antenna impedance

🔌 General impedance formula

Antenna impedance: Z_A = R_rad + R_loss + jX_A, where R_rad is radiation resistance, R_loss is loss resistance, and X_A is reactance.

This formula applies to any antenna, not just the ESD.
Each term represents a different physical mechanism.

⚡ Radiation resistance (R_rad)

What it represents: the power radiated away from the antenna.
How to find it: R_rad = 2 P_rad / |I_0|², where I_0 is the current at the antenna terminals and P_rad is total radiated power.
The equivalent circuit formalism treats radiated power as if it were dissipated in a resistor.
Example: if an antenna radiates 60 mW with 20 mA (rms) current, R_rad ≈ 150 Ω.

🔥 Loss resistance (R_loss)

What it represents: power dissipated within the antenna or surrounding materials.
When it matters:
- For good conductors (metals), R_loss is often negligible compared to R_rad.
- For ESDs, both R_rad and R_loss are very small, so both must be considered.
- For antennas with lossy dielectrics (e.g., microstrip patch on a circuit board), R_loss can be significant.
Don't confuse: R_loss is not the same as R_rad; one is dissipated as heat, the other is radiated.

🔄 Reactance (X_A)

What it represents: energy stored by the antenna, either from internal reflections or non-propagating electric and magnetic fields around the antenna.
Why it matters: significant reactance (|X_A| comparable to or greater than |R_A|) complicates impedance matching to the source.
Example: if voltage and current are in-phase at the antenna terminals, X_A = 0.

📏 Radiation resistance of the ESD

📏 Derivation from radiated power

Starting from the ESD radiated power formula (Section 10.2): P_rad ≈ η |I_0|² (βL)² / (48π).
Substituting into R_rad = 2 P_rad / |I_0|² gives: R_rad ≈ η (βL)² / (24π).
Using βL = 2π L/λ (where L/λ is antenna length in wavelengths), this becomes: R_rad ≈ (η π / 6) (L/λ)².

🌐 Free-space formula

Assuming free-space conditions (η ≈ 376.7 Ω ≈ 120π Ω):

R_rad ≈ 20π² (L/λ)² Ω for a practical ESD in free space.

This remarkably simple expression shows:
- R_rad is very small because L ≪ λ for an ESD.
- R_rad increases as the square of the length.
Example: for L = 0.1λ, R_rad ≈ 1.97 Ω.

⚠️ Hertzian dipole vs practical ESD

Dipole type	Current distribution	Radiation resistance	Realizability
Hertzian (ideal)	Uniform magnitude over length	80π² (L/λ)² (4× higher)	Quite difficult in practice
ESD (practical)	Maximum at center, linearly to zero at ends	20π² (L/λ)²	Nearly all practical ESDs

Common confusion: some sources quote 80π² (L/λ)² for "electrically-short dipole," but this is for the Hertzian dipole, not the physically-realizable ESD.
The factor-of-4 difference is a straightforward consequence of the different current distributions.

🔧 Loss resistance and reactance of the ESD

🔧 Loss resistance formula

Derived in Section 10.3:

R_loss ≈ (L / 6a) √(μ f π / σ) for an ESD.

Where:
- a = radius of the wire
- μ = permeability of the wire
- σ = conductivity of the wire
- f = frequency
Example: for a 1 m aluminum dipole (a = 1 mm, σ ≈ 3.7×10⁷ S/m) at 30 MHz, R_loss ≈ 94.9 mΩ.

🔄 Reactance formula

Addressed in Section 10.4; a suitable expression is:

X_A ≈ −120 Ω / (πL/λ) [ln(L / 2a) − 1] for an ESD.

Assumes a ≪ L.
The negative sign indicates capacitive reactance.
Example: for L = 1 m, a = 1 mm, λ = 10 m, X_A ≈ −1991.8 Ω (very large).

📊 Radiation efficiency and practical implications

📊 Efficiency definition and formula

Radiation efficiency e_rad = P_rad / P_A, the fraction of delivered power that is radiated.

Using the equivalent circuit: e_rad = R_rad / (R_rad + R_loss).
Example: if R_rad ≈ 1.97 Ω and R_loss ≈ 94.9 mΩ, then e_rad ≈ 95.4%.
The equivalent circuit formalism proves useful for calculating efficiency.

🔌 Impedance matching challenges

Problem 1 (real part): the real part of ESD impedance (R_rad + R_loss) is much less than typical transmission line impedances (tens to hundreds of ohms).
Problem 2 (reactance): the ESD has very large negative reactance.
Consequence: without impedance matching, a large fraction of incident power will be reflected at the antenna terminals.

🛠️ Common solution

Step 1: insert series inductance to reduce (nominally cancel) the negative reactance.
- Example: for X_A ≈ −1991.8 Ω at 30 MHz, about 10 μH of inductance is needed.
Step 2: use common "real-to-real" impedance matching techniques to handle the remaining mismatch in the real-valued components.
This approach enables efficient power transfer from transmitter or transmission line to the ESD.

🧮 Worked example: ESD impedance calculation

🧮 Problem setup

A thin, straight dipole operates at 30 MHz in free space.
Length L = 1 m, radius a = 1 mm.
Material: aluminum (σ ≈ 3.7×10⁷ S/m, μ ≈ μ₀).
Free-space wavelength: λ = c/f ≈ 10 m, so L ≈ 0.1λ (confirms it is an ESD).

🧮 Calculation steps

Radiation resistance: using R_rad ≈ 20π² (L/λ)², we get R_rad ≈ 1.97 Ω.
Loss resistance: using R_loss ≈ (L / 6a) √(μ f π / σ), we get R_loss ≈ 94.9 mΩ.
Reactance: using X_A ≈ −120 Ω / (πL/λ) [ln(L / 2a) − 1], we get X_A ≈ −1991.8 Ω.
Total impedance: Z_A = R_rad + R_loss + jX_A ≈ 2.1 − j1991.8 Ω.
Radiation efficiency: e_rad = R_rad / (R_rad + R_loss) ≈ 95.4%.

🧮 Interpretation

The radiation efficiency is respectable (95.4%), but the impedance is highly mismatched to typical transmission lines.
The large negative reactance dominates the impedance and must be addressed with series inductance.

Directivity and Gain

10.7 Directivity and Gain

🧭 Overview

🧠 One-sentence thesis

Directivity quantifies how much more power an antenna radiates in a particular direction compared to the average over all directions, and gain adjusts this measure to account for losses within the antenna itself.

📌 Key points (3–5)

What directivity measures: the ratio of power density in a specified direction to the average power density over all directions at the same distance from the antenna.
Distance independence: directivity does not depend on distance from the antenna because both numerator and denominator scale with distance in the same way (proportional to r⁻²).
Maximum directivity: often "directivity" informally refers to the maximum value over all directions, reported as a single number.
Common confusion: directivity vs gain—gain is directivity modified by radiation efficiency to account for losses within the antenna, so gain is always less than or equal to directivity.
Reciprocity: directivity (and gain) are the same for transmit and receive modes.

📐 Understanding directivity

📐 Why spatial power density matters

Power radiated in a single direction is formally zero because a single direction corresponds to zero solid angle, which intercepts zero area.
To describe the spatial distribution of power flow, we need a different metric: spatial power density (power per unit area, W/m²).
Directivity is defined in terms of spatial power density, not total power in a direction.

📏 Definition of directivity

Directivity in the direction (θ, φ) is: D(θ, φ) = S(r) / S_ave(r)

S(r): power density at distance r in direction (θ, φ).
S_ave(r): average power density at distance r, averaged over all possible directions.
Directivity is unitless because it is a ratio of power densities.

In plain language: Directivity is the ratio of power density in a specified direction to the power density averaged over all directions at the same distance from the antenna.

🔄 Distance independence

Even though both S(r) and S_ave(r) vary with distance r, directivity does not depend on r.
Both numerator and denominator are proportional to r⁻², so the distance dependence cancels.
Why this matters: Directivity is a convenient way to characterize an antenna because it does not change with distance from the antenna.

Example: Whether you measure 1 meter or 100 meters from the antenna, the directivity in a given direction remains the same.

🎯 Maximum directivity

Directivity is generally a function of direction (θ, φ).
Often, only the maximum directivity is of interest—the directivity in the direction where it is largest.
Informally, "directivity" often refers to this maximum value when stated as a single number.
Directivity is commonly expressed in decibels: 10 log₁₀(D) dB (using "10 log₁₀" because directivity is a ratio of power-like quantities).

🔌 Electrically-short dipole example

🔌 ESD directivity calculation

The excerpt provides a worked example for an electrically-short dipole (ESD) with length L ≪ λ/2.

Given information:

Power density: S(r) ≈ η |I₀|² (βL)² / (128π²) · (sin θ)² · (1/r²)
Total radiated power: P_rad ≈ η |I₀|² (βL)² / (48π)
Average power density: S_ave = P_rad / (4πr²) ≈ η |I₀|² (βL)² / (192π²r²)

Result:

Directivity: D(θ, φ) = S(r) / S_ave(r) ≈ 1.5 (sin θ)²
Maximum directivity occurs at θ = π/2 (the equatorial plane).
Maximum directivity = 1.5, meaning the maximum power density is 1.5 times greater than the average over all directions.
In decibels: 10 log₁₀(1.5) ≈ 1.76 dB.

Don't confuse: The directivity value (1.5) is not the power density itself; it is the ratio of the actual power density to the average.

⚡ Gain: accounting for losses

⚡ Definition of gain

Gain G(θ, φ) is: G(θ, φ) = S(r) for actual antenna / S_ave(r) for identical but lossless antenna

The numerator is the actual power density radiated by the antenna (reduced by losses).
The denominator is the average power density for an identical but lossless antenna.
Since the actual antenna radiates less power than a lossless version, gain is always less than directivity.

🔗 Relationship between gain and directivity

An equivalent definition:

G(θ, φ) = e_rad · D(θ, φ)

where e_rad is the radiation efficiency of the antenna (Section 10.5).

In plain language: Gain is directivity times radiation efficiency; that is, directivity modified to account for loss within the antenna.

Quantity	What it measures	Accounts for losses?
Directivity D(θ, φ)	Ratio of power density to average, assuming lossless antenna	No
Gain G(θ, φ)	Ratio of actual power density to average for lossless antenna	Yes

Example: If an antenna has directivity 1.5 and radiation efficiency 95.4%, then gain = 0.954 × 1.5 ≈ 1.43.

📡 Transmit and receive reciprocity

📡 Directivity in receive mode

The preceding discussion assumes a transmitting antenna.
Directivity can also be defined for the receive case, where it quantifies the effectiveness of the antenna in converting power in an incident wave to power in a load attached to the antenna.
Receive directivity is formally introduced in Section 10.13 ("Effective Aperture").
When receive directivity is defined as specified in Section 10.13, it is equal to transmit directivity as defined in this section.

Key takeaway: The directivity of an antenna is the same for receive and transmit.

Don't confuse: This reciprocity applies to directivity (and gain), not to all antenna parameters—it is a specific property of these spatial distribution measures.

Radiation Pattern

10.8 Radiation Pattern

🧭 Overview

🧠 One-sentence thesis

The radiation pattern describes how the magnitude and polarization of the field radiated by an antenna vary with angle, extending the concept of directivity by explicitly considering polarization and how well a receiver can capture the transmitted power.

📌 Key points (3–5)

What radiation pattern adds: it extends directivity by explicitly considering polarization, not just power density.
Co-pol vs cross-pol: co-pol is the intended or nominal polarization; cross-pol measures deviation from that nominal polarization in the orthogonal plane.
E-plane and H-plane: E-plane contains the nominal electric field vector; H-plane is perpendicular to both E-plane and propagation direction.
Common confusion: "omnidirectional" does not mean constant pattern in all directions—it means constant maximum directivity in one plane; only an isotropic antenna (not physically realizable) has constant pattern everywhere.
Pattern scaling: normalized pattern scales maximum to 1; directivity-normalized pattern scales maximum to the maximum directivity value.

📡 Why polarization matters

📡 Power capture depends on polarization alignment

Directivity tells you the power density delivered to a receiver, but the receiver must be co-polarized with the arriving wave to capture all that power.
A receiver that is not fully co-polarized will capture less power.
Example: for a z-oriented electrically-short dipole (ESD), the field is θ-polarized; a receiver that is φ-polarized relative to the transmitter will capture zero power.

🔄 Co-pol and cross-pol definitions

Co-pol is commonly defined to be the intended or nominal polarization for a particular application, which is not necessarily the actual polarization radiated by the antenna under consideration.

Cross-pol measures polarization in the orthogonal plane; i.e., deviation from the presumed co-pol.

Cross-pol is non-zero when the actual polarization differs from the intended or nominal polarization.
For an ideal ESD, the electric field is always θ-polarized, so that is the nominal polarization and cross-pol is zero.
Cross-pol can also quantify effects from errors in position, orientation, or undesired field modification by nearby materials.

🧭 E-plane and H-plane patterns

🧭 E-plane definition and example

The E-plane is any plane in which the nominal or intended vector Ẽ lies.

For the z-oriented ESD, the field depends only on θ, not φ, so the co-pol pattern is the same in any plane containing the z-axis.
The E-plane co-pol pattern of the ESD is proportional to sin θ divided by distance r (unnormalized).
This pattern is shown in a polar plot where the dashed circle indicates the maximum value.

🧭 H-plane definition and example

The H-plane is any plane in which the nominal or intended vector H̃ lies, and so is perpendicular to both the E-plane and the direction of propagation.

For the ESD, the one and only H-plane is the z = 0 plane.
The H-plane pattern is constant (circular) because the ESD field does not depend on φ.
Don't confuse: the ESD is omnidirectional in the H-plane, but it has nulls along the z-axis, so it is not isotropic.

📏 Pattern normalization and scaling

📏 Normalized pattern (maximum = 1)

A normalized pattern is scaled to a maximum magnitude of 1, using the definition F(θ, φ) = |ê · Ẽ(r)| / |ê · Ẽ(r)|_max.

ê is the co-pol reference direction.
The value of distance r is irrelevant because numerator and denominator both scale with r in the same way.
For the ESD, the normalized co-pol pattern is simply sin θ.
In this scaling, the maximum value circle has radius 1, which is 0 dB.

📏 Directivity-normalized pattern (maximum = D_max)

This scaling sets the maximum value equal to maximum directivity.
Directivity is proportional to power density, which is proportional to |Ẽ(r)|².
The pattern is expressed as D_max |F(θ, φ)|².
For the ESD, D_max = 1.5, so the co-pol pattern is 1.5 sin² θ.
In this scaling, the maximum value circle has radius 1.5, which is 1.76 dB.

Scaling type	Maximum value	ESD pattern	Maximum in dB
Normalized	1	sin θ	0 dB
Directivity-normalized	D_max	1.5 sin² θ	1.76 dB

🎯 Pattern structure and metrics

🎯 Main lobe, nulls, and sidelobes

Nearly all antennas in common use exhibit directivity greater than that of the ESD, and their patterns exhibit more complex structure.
Main lobe: the region around the direction of maximum directivity.
Null: the main lobe is bounded on each side by a null, where the magnitude reaches a local minimum (perhaps zero).
Backlobe: a lobe in the opposite direction from the main lobe (many antennas exhibit a null in this direction instead).
Sidelobes: lobes between the main lobe and the backlobe.

📊 Common pattern metrics

First sidelobe level: the ratio of the maximum magnitude of the sidelobe closest to the main lobe to that of the main lobe.
Front-to-back ratio: the ratio of the maximum magnitude of the backlobe to that of the main lobe.
Half-power beamwidth (HPBW): the width of the main lobe measured between two points at which the directivity is one-half its maximum value.
- HPBW is used when the main lobe is narrow.
- HPBW depends on the plane in which it is measured; it may differ in E-plane and H-plane.
- Example: the E-plane HPBW of the ESD is 90°, whereas the H-plane HPBW is undefined because the pattern is constant in the H-plane.

🌐 Omnidirectional vs isotropic antennas

🌐 Omnidirectional antenna

An omnidirectional antenna, such as the ESD, exhibits constant and nominally maximum directivity in one plane.

The ESD is omnidirectional in the H-plane (the z = 0 plane).
The term "omnidirectional" does not indicate constant pattern in all directions.
Example: the ESD exhibits pattern nulls in two directions (along the z-axis), so it is not constant everywhere.

🌐 Isotropic antenna

An isotropic antenna exhibits constant directivity in every direction. Such an antenna is not physically realizable, but is nevertheless useful as a baseline for describing other antennas.

No physically-realizable antenna is isotropic; the ESD is about as close as one can get.
Since the pattern of an isotropic antenna is constant with direction, the power density in any direction equals the average over all directions.
The directivity of an isotropic antenna is exactly 1.
Directivity of other antennas may be expressed in "dBi" (dB relative to isotropic).
Example: the maximum directivity of the ESD is 1.5, which is 10 log₁₀(1.5/1) = 1.76 dBi.

🔍 How to distinguish omnidirectional from isotropic

Omnidirectional: constant pattern in one plane (e.g., H-plane for ESD); may have nulls in other directions.
Isotropic: constant pattern in every direction; directivity = 1 everywhere; not physically realizable.

Equivalent Circuit Model for Reception

10.9 Equivalent Circuit Model for Reception

🧭 Overview

🧠 One-sentence thesis

A receiving antenna can be modeled as a Thévenin equivalent circuit with an open-circuit voltage determined by vector effective length and an output impedance equal to the transmit input impedance, enabling calculation of power delivered to a load.

📌 Key points

Thévenin model components: The equivalent circuit consists of an open-circuit voltage (V_OC) and a series impedance (Z_A) that together determine power delivery to a load.
Vector effective length concept: V_OC is proportional to the incident electric field through a vector quantity (l_e) that captures both antenna size and polarization alignment.
Reciprocity principle: The output impedance Z_A in receive mode equals the input impedance in transmit mode, greatly simplifying analysis.
Common confusion: Vector effective length is not uniquely determined by geometry alone—its direction is defined by the polarization of the field the antenna would transmit, not just its physical orientation.
Practical application: The model enables calculation of received power when the antenna is terminated with a load, especially useful for conjugate-matched loads.

🔌 The Thévenin equivalent circuit

🔌 Circuit structure

The receiving antenna is represented as:

A voltage source V_OC: the potential at the antenna terminals when open-circuited (no load connected)
A series impedance Z_A: the output impedance that determines current magnitude and phase once a load is connected

🎯 Why this model matters

Given V_OC and the current through the circuit, you can determine the power delivered to any load
The model is useful only if both V_OC and Z_A can be determined
This section provides an informal derivation sufficient for understanding effective aperture and impedance matching

📏 Vector effective length

📏 What it represents

Vector effective length l_e = (unit vector l-hat) × (scalar l_e) is defined such that the unit vector l-hat corresponds to the polarization of the electric field that would be transmitted from the antenna in the far field, and the effective length l_e converts the co-polarized component of the incident field into V_OC.

The relationship is:

V_OC = (incident electric field E_i) · l_e

🧮 How it's determined

The scalar effective length is defined as:

l_e = V_OC / (E_i · l-hat)

Where:

E_i · l-hat extracts the component of the incident field that is co-polarized with the antenna's transmit polarization
l_e is the conversion factor from this field component to the open-circuit voltage

🔍 Properties deduced without derivation

Linearity: V_OC must be proportional to the magnitude of the incident electric field E_i
Units: Since E_i has units of V/m and V_OC has units of V, the effective length l_e must have units of meters (length)
Size dependence: V_OC increases as antenna size increases, so the magnitude of l_e likely increases with antenna size
Orientation: The direction of l_e relates to how the incident electric field is oriented relative to the antenna

⚠️ Non-uniqueness clarification

There are multiple valid definitions of l_e that can reduce the vector E_i to the scalar V_OC via the dot product
The definition used here (vector effective length) is the most common
The direction l-hat is arbitrarily set to match the transmit polarization direction—this is a choice, not a derivation

Don't confuse: The direction of l_e is not simply the physical orientation of the antenna; it's defined by the polarization of the field the antenna would radiate when transmitting.

📡 Electrically-short dipole (ESD) example

📡 ESD effective length formula

For a z-directed ESD of length L:

l_e ≈ (theta-hat) × (L/2) × sin(theta)

Where theta is the angle from the z-axis.

🔬 Key observations

Arrival direction	Field orientation	Effective length	Explanation
Broadside (θ = π/2)	Points in -z direction	l_e ≈ L/2	Maximum coupling; l_e proportional to physical length L
Axial (θ = 0 or π)	Perpendicular to ESD axis	l_e = 0	No coupling when field is perpendicular to antenna

The sin(theta) factor captures how coupling varies with arrival angle.

📊 Worked example: Induced potential

Scenario: 10 cm dipole at origin, z-aligned; 30 MHz plane wave from direction (θ = π/4, φ = π/2); field magnitude 10 μV/m (rms).

Wavelength: c/f ≈ 10 m, so the dipole is electrically-short.

Effective length: l_e ≈ (theta-hat) × (10 cm / 2) × sin(π/4) ≈ (theta-hat) × 3.54 cm

Case (a) - theta-polarized field:

|V_OC| = |E_i · l_e| ≈ (10 μV/m) × (theta-hat · theta-hat) × (3.54 cm) ≈ 354 nV rms
The dot product equals 1 because polarizations align

Case (b) - phi-polarized field:

|V_OC| ≈ (10 μV/m) × (phi-hat · theta-hat) × (3.54 cm) ≈ 0
The dot product equals 0 because polarizations are orthogonal
This result holds for any angle of incidence when the field is phi-polarized

Key insight: Only the field component co-polarized with the antenna's transmit polarization contributes to V_OC.

⚡ Output impedance

⚡ The reciprocity result

Output impedance Z_A of the equivalent circuit for an antenna in receive mode equals the input impedance of the same antenna in transmit mode.

This is a consequence of the reciprocity property of antenna systems
It greatly simplifies receive antenna analysis—no separate calculation needed
Formal derivation is deferred to a later section (10.12)

🔧 Practical application

For a lossless antenna:

Z_A = R_A + j×X_A
R_A = R_rad (radiation resistance) + loss resistance
Loss resistance = 0 for lossless antenna
For an ESD: R_rad ≈ 20×π² × (L/λ)²

🔋 Power delivery calculation

🔋 Conjugate-matched load scenario

When the antenna is terminated with a conjugate-matched load (impedance Z_A*):

Voltage across load:

V_L = V_OC × Z_A* / (Z_A + Z_A*) = V_OC × Z_A* / (2×R_A)

Current through load:

I_L = V_OC / (Z_A + Z_A*) = V_OC / (2×R_A)

Power delivered to load (taking V_OC as RMS):

P_L = Real{V_L × I_L*} = |V_OC|² / (4×R_A)

📊 Worked example: Power captured

Continuing the previous example (theta-polarized, 354 nV rms open-circuit voltage):

Radiation resistance:

R_rad ≈ 20×π² × (0.1 m / 10 m)² ≈ 4.93 mΩ
R_A = R_rad (lossless antenna)

Power delivered:

P_L ≈ (354 nV)² / (4 × 4.93 mΩ) ≈ 6.33 pW

Note: The reactive part X_A cancels out in conjugate matching, so it doesn't need to be calculated.

🔄 Reciprocity preview

🔄 What reciprocity means

Reciprocity refers to theorems that relate inputs and outputs of a linear system to those of an identical system with inputs and outputs swapped
In electromagnetics, it simplifies problems that would otherwise be difficult to solve
Example application: deriving receiving properties of antennas from their transmit properties

🔄 Circuit theory example

For a two-port device:

The device is reciprocal if voltage v₂ at port 2 due to current applied at port 1 equals voltage v₁ when the same current is applied at port 2
Normally true for linear passive devices (resistors, capacitors, inductors)
The fundamental requirement is linearity

Why it matters for antennas: Reciprocity allows us to use easily-measured transmit properties to determine harder-to-calculate receive properties.

Reciprocity

10.10 Reciprocity

🧭 Overview

🧠 One-sentence thesis

The Lorentz reciprocity theorem establishes that in a linear electromagnetic system, the relationship between current distributions and resulting electric fields is symmetric—swapping inputs and outputs yields the same mathematical relationship—which simplifies analysis of antenna receiving properties and two-port systems.

📌 Key points (3–5)

What reciprocity means: a class of theorems relating inputs and outputs of a linear system to an identical system where inputs and outputs are swapped.
Core requirement: the system must be linear (outputs proportional to inputs, superposition applies) and time-invariant.
The Lorentz reciprocity theorem: for two different current distributions in the same linear medium, the integral of E₁·J₂ equals the integral of E₂·J₁ over all space.
Common confusion: reciprocity does not distinguish "inputs" from "outputs"—the relationship is symmetric when subscripts are swapped.
Practical result for antennas: the potential induced at antenna 1's terminals by a current at antenna 2 equals the potential induced at antenna 2 by the same current at antenna 1.

🔄 What reciprocity is and why it matters

🔄 Definition and scope

Reciprocity: a class of theorems that relate the inputs and outputs of a linear system to those of an identical system in which the inputs and outputs are swapped.

The importance in electromagnetics: it simplifies problems that would otherwise be relatively difficult to solve.
Example application: deriving the receiving properties of antennas (addressed in other sections using results from this section).

🔌 Circuit-theory analogy

A familiar special case from basic circuit theory: a two-port device.
The two-port is reciprocal if voltage v₂ at port 2 due to a current applied at port 1 equals v₁ when the same current is applied instead at port 2.
This normally holds when the two-port consists exclusively of linear passive devices (ideal resistors, capacitors, inductors).
Key requirement: linearity—outputs must be proportional to inputs, and superposition must apply.
The circuit-theory result is actually a special case of a more general electromagnetic theorem.

📐 Deriving the Lorentz reciprocity theorem

📐 Setting up two scenarios

The excerpt describes two scenarios in the same volume V with the same distribution of linear time-invariant matter (permittivity ε, permeability μ, conductivity σ are constants that may vary with position but not time):

Scenario	Input (current)	Output (field)	Governing equations
1	J̃₁ (volume current density, A/m²)	Ẽ₁ (electric field, V/m)	Maxwell's equations in phasor form
2	J̃₂ (different current distribution)	Ẽ₂ (different electric field)	Same Maxwell's equations, same boundary conditions

Both scenarios involve precisely the same electromagnetic system (same governing equations, same matter distribution).
The input current and output field in scenario 2 are generally completely different from scenario 1.
Question posed: knowing only Ẽ₁ in response to J̃₁, what can you infer about Ẽ₂ given J̃₂, or vice-versa?
At first glance, the answer might seem to be "nothing," since you lack a description of the system—but reciprocity provides a surprising bit more information.

🧮 Mathematical derivation steps

The derivation involves pure mathematics using Maxwell's equations in phasor form:

Take dot products of fields from one scenario with curl equations from the other scenario.
Subtract the resulting equations and apply the vector identity: ∇·(A × B) = B·(∇ × A) − A·(∇ × B).
After algebraic manipulation, obtain the differential form of the Lorentz reciprocity theorem:

∇·(H̃₂ × Ẽ₁ − H̃₁ × Ẽ₂) = Ẽ₁·J̃₂ − Ẽ₂·J̃₁
Integrate both sides over volume V and apply the divergence theorem to transform the left side into a surface integral over the bounding surface S.
This yields the integral form:

∮ₛ(H̃₂ × Ẽ₁ − H̃₁ × Ẽ₂)·ds = ∫ᵥ(Ẽ₁·J̃₂ − Ẽ₂·J̃₁)dv

🌐 Infinite-volume case

A particularly useful feature of the integral form:

Confine the sources to a finite region while allowing V to grow infinitely large.
The closest distance between any non-zero source current and the bounding surface S becomes infinite.
Because field magnitude diminishes with distance from the source, all fields (Ẽ₁, H̃₁, Ẽ₂, H̃₂) are effectively zero on S.
The left side of the integral form becomes zero, yielding:

∫ᵥ(Ẽ₁·J̃₂ − Ẽ₂·J̃₁)dv = 0

for any volume V containing all the current.

🔁 Why "reciprocity"?

The expression is identical when subscripts "1" and "2" are swapped.
The relationship does not recognize a distinction between "inputs" and "outputs"—there are only "ports."
This symmetry is the essence of reciprocity.

🔌 Spatially disjoint currents and antenna two-ports

🔌 Spatially disjoint case

Spatially disjoint: there is no point in space at which both J̃₁ and J̃₂ are non-zero; the distributions do not overlap.

Rewrite the theorem as: ∫ᵥ Ẽ₁·J̃₂ dv = ∫ᵥ Ẽ₂·J̃₁ dv
Let V₁ be the volume where J̃₁ is non-zero, and V₂ be the volume where J̃₂ is non-zero.
Then: ∫{V₂} Ẽ₁·J̃₂ dv = ∫{V₁} Ẽ₂·J̃₁ dv
Utility: the region of integration is reduced to just those regions where the current exists.

📡 Two-port consisting of dipole antennas

The excerpt demonstrates reciprocity for a two-port consisting of two thin dipole antennas:

Port 1 is defined by terminal quantities (Ṽ₁, Ĩ₁) of the left antenna.
Port 2 is defined by terminal quantities (Ṽ₂, Ĩ₂) of the right antenna.
Quantities are defined with respect to a small gap of length Δl between the perfectly-conducting arms of the dipole.
Either antenna may transmit or receive, so (Ṽ₁, Ĩ₁) depends on (Ṽ₂, Ĩ₂), and vice-versa.

🧩 Simplifying the integral for a dipole

For the transmitting port 1 antenna:

Select V₁ to be the cylindrical volume defined by the exterior surface of the dipole, including the gap.
On perfectly-conducting dipole arms, all current flows on the surface, so J̃₁ = 0 everywhere except on the surface.
The direction of J̃₁ is always tangent to the surface.
The tangent component of Ẽ₂ is zero on a perfectly-conducting surface (boundary condition).
Therefore, Ẽ₂·J̃₁ = 0 everywhere J̃₁ is non-zero, except at the gap (where current is non-zero but there is no conductor).
The integral reduces to a line integral across the gap: ∫{V₁} Ẽ₂·J̃₁ dv = ∫{gap} Ẽ₂·Ĩₜ₁ ˆl₁ dl
Assuming current Ĩₜ₁ is constant over the gap, factor it out: ∫{V₁} Ẽ₂·J̃₁ dv = Ĩₜ₁ ∫{gap} Ẽ₂·ˆl₁ dl
Recall that potential between two points is the integral of the electric field over any path between them.
The open-circuit potential Ṽʳ₁ = −∫_{gap} Ẽ₂·ˆl₁ dl
Result: ∫_{V₁} Ẽ₂·J̃₁ dv = −Ĩₜ₁ Ṽʳ₁

🎯 Final reciprocity result for antennas

Applying the same procedure for port 2 (or by swapping subscripts):

∫_{V₂} Ẽ₁·J̃₂ dv = −Ĩₜ₂ Ṽʳ₂
Substituting into the spatially disjoint form yields:

Ĩₜ₁ Ṽʳ₁ = Ĩₜ₂ Ṽʳ₂

Verification of reciprocity:

Set Ĩₜ₁ = I₀, resulting in Ṽʳ₂ = V₀.
Subsequently set Ĩₜ₂ = I₀.
Then Ṽʳ₁ = (Ĩₜ₂ Ṽʳ₂)/Ĩₜ₁ = (I₀ V₀)/I₀ = V₀.
This confirms: the potential induced at port 2 by current I₀ at port 1 equals the potential induced at port 1 by the same current I₀ at port 2.
This is the familiar definition of reciprocity from basic circuit theory.

🌍 Generalization

The analysis presumed pairs of straight, perfectly conducting dipoles of arbitrary length.
The result is readily generalized—and is the same—for any pair of passive antennas in linear time-invariant media.

Summary: The potential induced at the terminals of one antenna due to a current applied to a second antenna is equal to the potential induced in the second antenna by the same current applied to the first antenna.

Potential Induced in a Dipole

10.11 Potential Induced in a Dipole

🧭 Overview

🧠 One-sentence thesis

The potential induced at the terminals of a receiving dipole by an incident electromagnetic wave can be determined from the dipole's transmit current distribution using the principle of reciprocity, yielding the result that open-circuit potential equals the co-polarized component of the incident electric field times the vector effective length.

📌 Key points

Core problem: An incident plane wave induces a potential at the antenna terminals, but the electric field in the terminal gap is modified by the antenna structure and is not directly known.
Reciprocity strategy: Instead of directly calculating the gap field, reciprocity relates two scenarios—the dipole of interest transmitting to a Hertzian dipole, and vice versa—to derive the induced potential.
Vector effective length emerges: The derivation naturally produces the vector effective length l_e, which encodes the antenna's receiving properties from its transmit current distribution.
Key result: Open-circuit potential V_OC = E_i · l_e, where E_i is the incident electric field and l_e is the vector effective length.
Common confusion: The electric field in the terminal gap is not simply the incident field E_i; the antenna structure modifies it, so a direct integration approach fails without reciprocity.

🚧 The direct approach obstacle

🚧 Why direct calculation fails

Potential is equal to the integral of the electric field intensity over a path.

The natural approach: integrate the electric field E_gap across the gap between the "−" and "+" terminals:
- V_OC = − ∫_gap E_gap · dl
The problem: E_gap is not readily available.
E_gap is not simply the incident field E_i, because the antenna structure and electromagnetic boundary conditions modify the field near the antenna.
Don't confuse: If E_gap were just E_i, the antenna geometry wouldn't matter—only terminal spacing and orientation would, which is physically incorrect.

🔄 Reciprocity-based solution

🔄 The reciprocity principle for antennas

The potential induced at the terminals of one antenna due to a current applied to a second antenna is equal to the potential induced in the second antenna by the same current applied to the first antenna.

Mathematically: (I_t1 / V_r1) = (I_t2 / V_r2), or equivalently V_r1 = (I_t2 / I_t1) · V_r2.
This reciprocity property holds for any pair of passive antennas in linear time-invariant media.
Strategy: Establish a relationship between two scenarios within the same electromagnetic system, then use reciprocity to connect them.

📡 Scenario 1: Dipole transmits, Hertzian dipole receives

Setup: The dipole of interest is driven by current I_t1, producing current distribution I(z) along its length.
The dipole radiates electric field E_1(r) in the far field (proportional to an integral of I(z) over the dipole length).
A Hertzian dipole (vanishingly small, θ-aligned) is placed in the far field and receives E_1.
Why a Hertzian dipole? For a Hertzian dipole, the gap field is simply the incident field (negligible structure to modify it), so V_r2 = − ∫_gap E_1 · dl is straightforward.
Since the Hertzian dipole is very short and far away, E_1 is essentially constant over the gap, and aligned with the dipole:
- V_r2 ≈ − E_1(r_2) · θ̂ Δl
Substituting the expression for E_1 from the transmit dipole:
- V_r2 involves an integral of I(z′) e^(jβz′ cos θ) over the dipole length, times Δl and other factors.

📡 Scenario 2: Hertzian dipole transmits, dipole receives

Setup: The Hertzian dipole is driven by current I_t2, radiating electric field E_2(r = 0) at the origin (where the dipole of interest is located).
E_2(r = 0) is proportional to I_t2, and may be interpreted as a plane wave in the region of the receiving dipole.
The unknown: What is the induced potential V_r1 in the dipole of interest?
Direct calculation via ∫ E_gap · dl is still not feasible (gap field unknown).
Key insight: V_r1 must be proportional to E_2(r = 0) (linear system), so there exists some vector l_e such that:
- V_r1 = E_2(r = 0) · l_e
The unit vector direction is chosen to match the polarization direction of the transmitted field from the dipole of interest (standard definition of vector effective length).
For the dipole of interest (z-aligned), this direction is −θ̂, so:
- V_r1 = − E_2(r = 0) · θ̂ l_e

🔗 Connecting the scenarios via reciprocity

Reciprocity states: V_r1 = (I_t2 / I_t1) · V_r2.
Substitute the expressions for V_r2 (from Scenario 1) and V_r1 (from Scenario 2).
Equate the two expressions for V_r1 and solve for l_e.
Result:
- l_e ≈ [1 / I_t1 ∫_{−L/2}^{+L/2} I(z′) e^(jβz′ cos θ) dz′] · sin θ
This is the vector effective length (magnitude l_e, direction ˆl = −θ̂ for the z-aligned dipole).

📐 Calculating vector effective length

📐 Step-by-step procedure

Apply a current I_t1 to the dipole of interest.
Determine the resulting current distribution I(z) along the dipole length (this is done for specific dipoles in other sections, e.g., electrically-short dipole or half-wave dipole).
Integrate I(z) over the dipole length as indicated:
- Integral of I(z′) e^(jβz′ cos θ) dz′ from −L/2 to +L/2.
- Divide ("normalize") by I_t1 (which is simply I(0), the terminal current).
- The result is independent of the excitation I_t1 (linear system property).
Multiply by sin θ (angle between dipole axis and direction of incidence).

🧮 Example: Electrically-short dipole (ESD)

Current distribution for a thin ESD of length L:
- I(z) ≈ I_0 (1 − 2|z| / L)
- I_0 is the terminal current.
Apply the formula:
- l_e ≈ [1 / I_0 ∫_{−L/2}^{+L/2} I_0 (1 − 2|z′| / L) e^(jβz′ cos θ) dz′] · sin θ
Simplification: Since the dipole is electrically short (z′ ≪ λ over the entire integral), e^(jβz′ cos θ) ≈ 1.
The integral becomes:
- ∫_{−L/2}^{+L/2} (1 − 2|z′| / L) dz′
- This is the "area under the curve": (1/2) · base · height = (1/2) · L · 1 = L/2.
Result:
- l_e ≈ (L / 2) sin θ

🎯 Final result and interpretation

🎯 Open-circuit potential formula

V_OC = E_i · l_e

E_i: incident electric field (vector).
l_e: vector effective length (magnitude l_e, direction ˆl).
The dot product extracts the co-polarized component of the incident field.

🎯 Plain-language interpretation

The potential induced in a dipole is:
- The co-polarized component of the incident electric field
- times a normalized integral of the transmit current distribution over the dipole length
- times sine of the angle between the dipole axis and the direction of incidence.
Key insight: Reciprocity allows the receiving property to be determined from the transmit characteristics of the antenna.

🎯 Why this result is remarkable

Transmit-receive duality: You don't need to solve the difficult receiving problem directly; instead, solve the (often easier) transmit problem and use reciprocity.
Vector effective length: This quantity naturally emerges from the derivation and encodes how the antenna couples to incident fields.
Generality: Although derived for a straight thin dipole, the concepts (effective length, reciprocity) apply to antennas generally.

10.12 Equivalent Circuit Model for Reception, Redux

🧭 Overview

🧠 One-sentence thesis

The source impedance in the Thévenin equivalent circuit for a receiving antenna is rigorously derived using two-port network theory and reciprocity, proving that it equals the antenna's impedance when transmitting.

📌 Key points (3–5)

What this section completes: the formal derivation of the receiving antenna equivalent circuit by deriving the source impedance Z_A, which was missing from the informal model in Section 10.9.
Core method: represent a pair of antennas as a passive linear time-invariant two-port network, then apply reciprocity to show Z₁₂ = Z₂₁.
Key result: the series impedance Z_A in the Thévenin equivalent circuit equals Z₂₂, which is the impedance of the antenna when transmitting.
Common confusion: the derivation uses open-circuit and short-circuit conditions to find impedances, but the resulting equality Z₁₂ = Z₂₁ holds generally for all port potentials and currents, not just those special cases.
Practical caveat: mutual coupling between antennas becomes significant in dense arrays (antennas separated by less than a few wavelengths) and requires generalized analysis.

🔌 Two-port network foundation

🔌 Why a two-port model

The excerpt starts with a two-port model (Figure 10.19) where port 1 represents one antenna and port 2 represents another antenna.
For a passive, linear, time-invariant two-port, the potential v₂ is a linear function of potentials and currents at both ports.
Because v₁ must be proportional to i₁ and v₂ must be proportional to i₂, any pair of "inputs" (either potentials or currents) completely determines the remaining potentials or currents.

📐 The impedance equations

The two-port relationships are written as:

v₂ = Z₂₁ i₁ + Z₂₂ i₂
v₁ = Z₁₁ i₁ + Z₁₂ i₂

where Z₁₁, Z₁₂, Z₂₁, and Z₂₂ are constants of proportionality with units of ohms (Ω).

🔍 Defining Z₁₂ and Z₂₁ through special conditions

The excerpt defines these impedances using open-circuit conditions:

Z₁₂ is defined as: v₁ / i₂ when i₁ = 0

To make i₁ = 0, leave port 1 open-circuited.
Using the notation from previous sections: Ĩ_t² is i₂ (the sole "input," superscript "t" for "transmitter") and Ṽ_r¹ is v₁ (the "output," superscript "r" for "receiver," port 1 open-circuited).
Therefore: Z₁₂ = Ṽ_r¹ / Ĩ_t²

Similarly:

Z₂₁ = Ṽ_r² / Ĩ_t¹

🔄 Reciprocity and impedance equality

🔄 Applying reciprocity

Section 10.10 established that for a pair of antennas represented as a passive linear time-invariant two-port:

Ĩ_t¹ Ṽ_r¹ = Ĩ_t² Ṽ_r²

Rearranging:

Ṽ_r¹ / Ĩ_t² = Ṽ_r² / Ĩ_t¹

⚖️ The key equality

Referring to the definitions of Z₁₂ and Z₂₁, the reciprocity relationship requires:

Z₁₂ = Z₂₁

Important: Even though this equality was derived by open-circuiting ports one at a time, it must hold generally because the impedance equations (v₂ = Z₂₁ i₁ + Z₂₂ i₂ and v₁ = Z₁₁ i₁ + Z₁₂ i₂) must apply with the same values of Z₁₂ and Z₂₁ regardless of the particular values of the port potentials and currents.

🔋 Deriving the Thévenin equivalent circuit

🔋 The voltage source

Set up the scenario:

Port 1 corresponds to the transmitting antenna (i₁ is Ĩ_t¹).
Port 2 corresponds to an open-circuited receiving antenna (i₂ = 0, v₂ is Ṽ_r²).

Applying the impedance equation:

v₂ = Z₂₁ i₁ + Z₂₂ i₂
= (Ṽ_r² / Ĩ_t¹) Ĩ_t¹ + Z₂₂ · 0
= Ṽ_r²

From Section 10.11, the open-circuit potential was determined electromagnetically to be:

Ṽ_r² = Ẽ_i · l_e

where Ẽ_i is the incident electric field and l_e is the vector effective length.

Result: The voltage source in the Thévenin equivalent circuit is Ẽ_i · l_e, as shown in Figure 10.18.

🔧 The series impedance

From basic circuit theory, the Thévenin series impedance is the ratio of:

v₂ when port 2 is open-circuited (i.e., Ṽ_r²)
to i₂ when port 2 is short-circuited

Finding the short-circuit current:

Using the impedance equation with v₂ = 0:

0 = Z₂₁ Ĩ_t¹ + Z₂₂ i₂
Therefore: i₂ = - (Z₂₁ / Z₂₂) Ĩ_t¹

Using Z₂₁ = Ṽ_r² / Ĩ_t¹ to eliminate Ĩ_t¹:

i₂ = - Ṽ_r² / Z₂₂

Reference direction correction:

The reference direction for i₂ in Figure 10.19 is opposite the reference direction for short-circuit current. Given the polarity of v₂, the reference direction of current flow through a passive load is from "+" to "−" through the load.

Therefore, the source impedance (ratio of open-circuit potential to short-circuit current) is:

Ṽ_r² / (+ Ṽ_r² / Z₂₂) = Z₂₂

🎯 What is Z₂₂?

To determine Z₂₂, apply a current i₂ = Ĩ_t² to port 2 (antenna 2):

v₂ = Z₂₁ i₁ + Z₂₂ Ĩ_t²

Solving for Z₂₂:

Z₂₂ = (v₂ / Ĩ_t²) - Z₂₁ (i₁ / Ĩ_t²)

Interpretation:

Term	Meaning
First term: v₂ / Ĩ_t²	Precisely the impedance of antenna 2 in transmission
Second term: Z₂₁ (i₁ / Ĩ_t²)	Contribution to Z₂₂ from antenna 1

Resolving the contribution from antenna 1:

The immediate interest is the equivalent circuit for reception of an electric field Ẽ_i in the absence of any other antenna. The excerpt resolves this by imagining:

Ẽ_i is generated by antenna 1, but
Antenna 1 is far enough away to make Z₂₁ (the factor determining the effect of antenna 1 on antenna 2) negligible.

Then Z₂₂ is simply the impedance of antenna 2 when transmitting.

📦 Summary and practical considerations

📦 The complete result

The Thévenin equivalent circuit for an antenna in the presence of an incident electric field Ẽ_i (shown in Figure 10.18):

Voltage source: Ẽ_i · l_e

Series impedance Z_A: equal to the impedance of the antenna in transmission

⚠️ Mutual coupling caveat

What if antenna 2 is present and not sufficiently far away that Z₂₁ can be assumed negligible?

Antennas 1 and 2 are referred to as being "coupled."
The effect of the presence of antenna 1 on antenna 2 is called coupling or mutual coupling (since the coupling affects both antennas reciprocally).

When is coupling significant?

Scenario	Coupling significance
Antennas on opposite ends of a radio link	Rarely significant (changes to a receive antenna do not normally affect the electric field incident at other locations)
Dense array (multiple antennas separated by less than a few wavelengths)	Commonly significant; requires generalized analysis

Don't confuse: The derivation assumes negligible Z₂₁ (far-field condition), but the equality Z₁₂ = Z₂₁ itself is general and always holds for passive linear time-invariant two-ports.

Effective Aperture

10.13 Effective Aperture

🧭 Overview

🧠 One-sentence thesis

Effective aperture quantifies how much power a receiving antenna delivers to a matched load from incident electromagnetic waves, and it equals the transmit directivity scaled by wavelength squared, demonstrating fundamental reciprocity between transmit and receive antenna behavior.

📌 Key points (3–5)

What effective aperture measures: the ratio of power delivered to a conjugate-matched load to the incident co-polarized power density (units: m²).
Three calculation methods: via effective length (for wire antennas), via thermodynamics (for theoretical limits), and via reciprocity (using transmit directivity).
Key reciprocity insight: receive directivity equals transmit directivity, so the same radiation patterns apply in both cases.
Common confusion: effective aperture is not just a receive concept—it can be defined for transmitting antennas using the same directivity relationship.
Minimum possible value: an isotropic antenna has effective aperture equal to wavelength squared divided by 4π, which is the theoretical minimum maximum effective aperture for any antenna.

📐 Definition and basic relationship

📐 What effective aperture is

Effective aperture (SI base units of m²) is the ratio of power delivered by an antenna to a conjugate matched load, to the incident co-polarized power density.

The formal relationship: P_R,max = S_i_co × A_e
- P_R,max: power delivered to a load that is conjugate matched to the antenna impedance
- S_i_co: incident power density (W/m²) that is co-polarized with the antenna
- A_e: effective aperture
This parameter relates incident power (not electric field) directly to delivered power, making system calculations convenient.

🔌 Conjugate matching condition

Maximum power transfer occurs when the load impedance Z_L equals the complex conjugate of the antenna impedance: Z_L = Z_A*
If Z_A = R_A + jX_A, then the matched load is Z_L = R_A - jX_A
The effective aperture definition assumes this optimal matching condition.

🔧 Method 1: Calculation via effective length

🔧 Derivation from equivalent circuit

The excerpt derives effective aperture using the Thévenin equivalent circuit model for a receiving antenna:

The voltage source in the model is determined by the incident electric field E_i and the vector effective length l_e of the antenna.
For a co-polarized plane wave with power density S_i_co = |E_i|² / (2η), where η is the wave impedance (≈377 Ω in free space).
Through circuit analysis with conjugate matching, the maximum received power is: P_R,max = |E_i × l_e|² / (8 R_rad)
Combining with the definition yields: A_e = η |l_e|² / (4 R_rad)

📏 Wire antenna examples

Electrically-short dipole (ESD):

For an ESD of length L: l_e ≈ (L/2) sin θ and R_rad ≈ 20π² (L/λ)²
Result in free space: A_e ≈ 0.119 λ² |sin θ|²
Remarkable finding: effective aperture does not depend on the dipole's physical length, only on frequency.
Maximum effective aperture (at θ = π/2): A_e ≈ 0.119 λ²

Half-wave dipole:

Radiation resistance ≈ 73 Ω, effective length = λ/π
Maximum effective aperture: A_e ≈ 0.131 λ²
Slightly larger than the ESD, but still primarily wavelength-dependent.

⚠️ When this method applies

Effective length and radiation resistance are easy to calculate for wire antennas.
For antennas not well-described as current distributions along a line, this method becomes difficult.
Non-electrically-short dipoles show radiation resistance proportional to L^p where p > 2, so effective aperture does increase with length, but not dramatically unless L is significantly greater than λ/2.

🌡️ Method 2: Thermodynamic analysis

🌡️ The thought experiment

The excerpt presents a scenario where:

An antenna and its conjugate-matched load are each enclosed in separate chambers at uniform temperature T.
Heat causes random charge acceleration, producing Johnson-Nyquist noise power in the load: P_load = kTB
- k ≈ 1.38 × 10⁻²³ J/K (Boltzmann's constant)
- B: measurement bandwidth
The antenna chamber walls emit thermal radiation with power density (2kT/λ²)B per steradian.
The antenna captures only one polarization, so accessible power density is half that amount.

⚖️ Thermodynamic equilibrium

Since both chambers are at the same temperature, the system is in thermodynamic equilibrium.
Power captured by the antenna and delivered to the load must equal power generated by the load and delivered to the antenna: P_ant = P_load
Integrating effective aperture over all 4π steradians of solid angle and setting equal to load noise power yields: ∮ A_e(θ', φ') sin θ' dθ' dφ' = λ²

🔽 Mean effective aperture and isotropic limit

The mean effective aperture (averaged over all directions): ⟨A_e⟩ = λ² / (4π)
For an isotropic antenna (same effective aperture in every direction), the effective aperture equals the mean: A_e = λ² / (4π) ≈ 0.080 λ²
This is the minimum possible value of maximum effective aperture for any antenna.
Don't confuse: this is a theoretical benchmark; real antennas cannot be truly isotropic but must have maximum effective aperture ≥ this value.

🔄 Method 3: Reciprocity and directivity

🔄 Relating effective aperture to directivity

The excerpt introduces a unitless constant D to compare any antenna to the isotropic benchmark:

A_e = D × (λ² / 4π) for any antenna
D must be ≥ 1 (since isotropic is the minimum).

Key discovery: D equals the directivity of the antenna when transmitting.

Example: For the ESD with A_e ≈ 0.119 λ² (maximum), D = A_e × 4π / λ² ≈ 1.50, which matches the known transmit directivity of an ESD.
This result is general: effective aperture in any direction equals the transmit directivity in that direction, scaled by λ² / (4π).

🔁 Reciprocity principle

As long as the conditions required for formal reciprocity are satisfied, the directivity of a receiving antenna (defined via the effective aperture equation) is equal to directivity of the same antenna when transmitting, and patterns describing receive directivity are equal to those for transmit directivity.

Directivity is not exclusively transmit or receive—it is a single characteristic applying to both.
Radiation patterns measured in transmit mode directly describe receive performance.
This principle can be derived using thermodynamic analysis or the reciprocity theorem (developed in Section 10.10).

🛠️ Practical utility

Calculating effective aperture via effective length is limited to wire antennas or direct measurement.
It is usually far easier to calculate transmit directivity and then use A_e = D × (λ² / 4π) to obtain receive effective aperture.
This is an enormously useful tool in antenna engineering, especially for complex antenna geometries.

🔀 Effective aperture for transmitting antennas

The equivalence allows defining effective aperture for the transmit case: effective aperture of a transmitting antenna = (λ² / 4π) × directivity.
No new physics—just leveraging the fact that effective aperture and directivity describe essentially the same antenna characteristic in both modes.

📊 Summary comparison of methods

Method	Best for	Key formula	Insight
Effective length	Wire antennas	A_e = η \|l_e\|² / (4 R_rad)	Shows dependence on antenna geometry and radiation resistance
Thermodynamics	Theoretical limits	⟨A_e⟩ = λ² / (4π)	Establishes minimum possible effective aperture (isotropic antenna)
Reciprocity	General antennas	A_e = D × (λ² / 4π)	Connects receive performance to easily-calculated transmit directivity

🧩 Additional context

🧩 Mutual coupling caveat

The derivation assumes antenna 2 is far enough from antenna 1 that mutual impedance Z_21 is negligible.
When antennas are close (e.g., in a dense array with spacing less than a few wavelengths), coupling becomes significant and affects both antennas reciprocally.
Such arrays require generalized analysis beyond the single-antenna model.

🧩 Assumptions in the thermodynamic derivation

The Rayleigh-Jeans law expression for thermal radiation power density is an approximation, but very accurate at radio frequencies.
The analysis assumes the antenna is lossless (R_A ≈ R_rad) for simplicity.

Friis Transmission Equation

10.14 Friis Transmission Equation

🧭 Overview

🧠 One-sentence thesis

The Friis transmission equation calculates the power delivered to a receiver from a distant transmitter by accounting for antenna gains, distance, wavelength, and the spreading of the transmitted wave under free space conditions.

📌 Key points (3–5)

What the equation does: gives the received power for a conjugate-matched receiver in response to a distant transmitter, assuming co-polarized antennas and free space conditions.
Key factors: transmitted power, transmit antenna gain, receive antenna gain, wavelength, and distance between antennas.
Free space path loss: the factor (λ / 4πR)⁻² describes how power decreases with distance and wavelength; it combines spreading and the wavelength-squared dependence of effective aperture.
Common confusion: path loss is not just spreading loss—spreading loss is frequency-independent, but path loss depends on frequency because effective aperture depends on wavelength squared.
Why it matters: enables prediction of received power in radio systems; can be adapted for non-free-space conditions by using different path loss values.

📡 The transmission scenario

📡 Setup and assumptions

The scenario involves:

A transmitter delivering power P_T to an antenna with gain G_T in the direction of the receiver.
A receiver's antenna with gain G_R at distance R from the transmitter.
Antenna gain equals directivity times radiation efficiency, so G_T and G_R account for losses internal to the antenna but not impedance mismatch losses.

Key assumptions:

Free space conditions: negligible absorption, reflection, or scattering by the intervening terrain.
Co-polarized antennas: the receive antenna is aligned with the incident electric field.
Conjugate-matched receiver: the receiver is impedance-matched to the antenna for maximum power transfer.

🌐 Power density at the receiver

Starting from an isotropic radiator, the spatial power density at range R would be:

Total transmitted power divided by the area of a sphere of radius R: P_T / (4πR²).
This represents power flowing through a sphere of radius R.

The actual power density S_i at the receiver location is:

S_i = (P_T / 4πR²) × G_T
The transmit antenna gain G_T increases the power density in the direction of the receiver.

🔢 Deriving the equation

🔢 Maximum received power

The maximum received power is calculated as:

P_R,max = A_e × S_i
Where A_e is the effective aperture of the receive antenna.

Substituting the power density:

P_R,max = A_e × (P_T / 4πR²) × G_T

🔄 Expressing aperture in terms of gain

The effective aperture can be expressed in terms of the receive antenna gain:

A_e = (λ² / 4π) × G_R
This relationship connects the physical concept of aperture to the gain parameter.

Substituting this into the received power expression yields:

Friis transmission equation: P_R,max = P_T × G_T × (λ / 4πR)² × G_R

This gives the power delivered to a conjugate-matched receiver in response to a distant transmitter, assuming co-polarized antennas and free space conditions.

📐 Alternative form with path loss

The factor (λ / 4πR)² is called free space path gain.

More commonly, the reciprocal is used:

L_p = (λ / 4πR)⁻²
This is known as free space path loss.

The Friis equation can then be written as:

P_R,max = P_T × G_T × L_p⁻¹ × G_R

Utility: The concept of path loss allows the equation to be adapted for non-free-space conditions by using the appropriate (and possibly very different) value of L_p.

🚫 Common misconceptions

🚫 Path loss vs spreading loss

The misconception: Path loss equals the reduction in power density due to spreading, and therefore increases with frequency.

The reality:

Spreading loss between any two distances R₁ < R₂ is: (R₁ / R₂)²
This ratio is clearly independent of frequency.

What path loss actually is:

Path loss L_p depends on both total distance R and frequency (through wavelength λ).
The frequency dependence reflects the λ² dependence of effective aperture.
Path loss is not loss in the traditional sense; it accounts for a combination of spreading and the wavelength-squared dependence of effective aperture that is common to all receiving antennas.

Example: The same spreading occurs at all frequencies, but higher frequencies (shorter wavelengths) result in smaller effective apertures, leading to greater path loss.

🔧 Alternative forms

The equation presented is the simplest form. Commonly encountered alternatives include:

Forms where G_T and/or G_R are represented by their associated effective apertures.
Forms that account for antenna impedance mismatch.
Forms that account for cross-polarization effects.

📊 Practical example

📊 6 GHz point-to-point link

Scenario: Terrestrial telecommunications systems use radio links between dish antennas on tall towers operating around 6 GHz.

Given parameters:

Antenna gain: G_T = G_R = 27 dBi (about 501 in linear units)
Frequency: 6 GHz → wavelength λ ≈ 5.00 cm
Range: R = 30 km
Minimum acceptable receive power: P_R ≥ -120 dBm (10⁻¹⁵ W)

Additional assumptions:

Height and high directivity yield conditions close to free space.
Conjugate-matching at the receiver.
Co-polarized antennas.

Solution approach: Under these conditions, P_R = P_R,max and the Friis equation applies:

P_T ≥ P_R,max / [G_T × (λ / 4πR)² × G_R]
Result: P_T ≥ 2.26 × 10⁻⁷ W ≈ -36.5 dBm

This shows that relatively low transmit power (about 0.23 microwatts) is sufficient when using high-gain antennas over a 30 km link.