Precalculus

Sets of Real Numbers and the Cartesian Coordinate Plane

1.1 Sets of Real Numbers and the Cartesian Coordinate Plane

🧭 Overview

🧠 One-sentence thesis

This section establishes the foundational vocabulary and notation for sets of real numbers—including interval notation and set operations—that underpin the study of Precalculus.

📌 Key points (3–5)

What a set is: a well-defined collection of objects (elements) where membership can be determined without ambiguity.
Three ways to describe sets: verbal (sentence), roster (list in braces), and set-builder (using a dummy variable and a condition).
Nested number systems: natural numbers ⊂ whole numbers ⊂ integers ⊂ rational numbers ⊂ real numbers ⊂ complex numbers; each larger set contains all the previous ones.
Interval notation: a compact way to describe continuous segments of the real number line using brackets, parentheses, and infinity symbols.
Common confusion: intersection vs union—intersection is the overlap (elements in both sets), union is the totality (elements in either set or both).

📚 What sets are and how to describe them

📚 Definition of a set

Set: a well-defined collection of objects called elements, where "well-defined" means it is possible to determine membership without prejudice.

The collection of letters in "smolko" is a set (you can tell if a letter belongs).
The collection of "worst math teachers in the world" is not a set (membership is subjective and not well-defined).
Membership notation: if m is in set S, write m ∈ S; if q is not in S, write q ∉ S.

📝 Three methods to describe sets

The excerpt presents three equivalent ways to specify the same set:

Method	Description	Example for letters in "smolko"
Verbal	Use a sentence	"the set of letters that make up the word 'smolko'"
Roster	List elements in braces, each once	{s, m, o, l, k}
Set-Builder	Use a dummy variable and condition	{x \| x is a letter in the word "smolko"}

Roster method details: list each element only once (even if it appears multiple times in the original context); order does not matter, so {k, l, m, o, s} is the same set.
Set-builder method: read as "the set of elements x such that x satisfies [condition]."
All three methods can be connected with an equals sign: S = {s, m, o, l, k} = {x | x is a letter in "smolko"}.

🔢 Famous sets of numbers

🔢 The hierarchy of number systems

The excerpt lists eight important sets, each nested within the next:

Set	Symbol	Definition / Description
Empty Set	∅	{} or {x \| x ≠ x}; the set with no elements
Natural Numbers	N	{1, 2, 3, ...}
Whole Numbers	W	{0, 1, 2, ...}
Integers	Z	{..., −3, −2, −1, 0, 1, 2, 3, ...}
Rational Numbers	Q	{a/b \| a ∈ Z and b ∈ Z (b ≠ 0)}; equivalently, numbers with repeating or terminating decimal representations
Real Numbers	R	{x \| x possesses a decimal representation}
Irrational Numbers	P	{x \| x is a non-rational real number}; decimals that neither repeat nor terminate (e.g., π, √2, 0.101001000100001...)
Complex Numbers	C	{a + bi \| a, b ∈ R and i = √−1}

🪆 Nested structure (Matryoshka dolls)

Every natural number is a whole number.
Every whole number is an integer.
Every integer is a rational number (take b = 1 in the definition of Q).
Every rational number is a real number (they have decimal representations).
Every real number is a complex number (take b = 0 in the definition of C).
The sets N, W, Z, Q, R, and C are nested like Russian dolls, each contained in the next.

🧮 Key distinctions

Rational vs irrational: rational numbers have repeating or terminating decimals; irrational numbers have non-repeating, non-terminating decimals.
Real vs complex: real numbers are a special case of complex numbers (when the imaginary part is zero).
Empty set: plays a vital role in mathematics, analogous to the number 0.

📏 Interval notation for real numbers

📏 What interval notation does

Interval notation: a compact way to describe segments (intervals) of the real number line.

The real numbers R can be visualized as a line.
Segments of this line are called intervals.
Interval notation uses brackets, parentheses, and infinity symbols to describe these segments concisely.

🔲 Brackets vs parentheses

Square brackets [ or ]: the endpoint is included in the interval; shown with a filled-in (closed) dot on the number line.
Parentheses ( or ): the endpoint is not included; shown with an open circle on the number line.
Infinity symbols: −∞ (extends indefinitely to the left) and ∞ (extends indefinitely to the right) always use parentheses because infinity is a concept, not a number.

📊 Interval notation table

For real numbers a and b with a < b:

Set-builder notation	Interval notation	Meaning
{x \| a < x < b}	(a, b)	Open interval: neither endpoint included
{x \| a ≤ x < b}	[a, b)	Left endpoint included, right excluded
{x \| a < x ≤ b}	(a, b]	Left endpoint excluded, right included
{x \| a ≤ x ≤ b}	[a, b]	Closed interval: both endpoints included
{x \| x < b}	(−∞, b)	All numbers less than b
{x \| x ≤ b}	(−∞, b]	All numbers less than or equal to b
{x \| x > a}	(a, ∞)	All numbers greater than a
{x \| x ≥ a}	[a, ∞)	All numbers greater than or equal to a
R	(−∞, ∞)	All real numbers

🎯 Examples from the excerpt

{x | 1 ≤ x < 3} = [1, 3): includes 1, excludes 3.
{x | −1 ≤ x ≤ 4} = [−1, 4]: includes both −1 and 4.
{x | x ≤ 5} = (−∞, 5]: all numbers up to and including 5.
{x | x > −2} = (−2, ∞): all numbers greater than −2.

🔗 Combining sets: intersection and union

🔗 Definitions

Intersection of A and B: A ∩ B = {x | x ∈ A and x ∈ B}
Union of A and B: A ∪ B = {x | x ∈ A or x ∈ B (or both)}

Intersection: the overlap of two sets; elements that the sets have in common.
Union: the totality of elements in each set, collected together.

🔍 How to distinguish intersection from union

Intersection (∩): "and"—an element must be in both sets.
Union (∪): "or"—an element can be in either set (or both).
Example with finite sets: if A = {1, 2, 3} and B = {2, 4, 6}, then A ∩ B = {2} (only 2 is in both) and A ∪ B = {1, 2, 3, 4, 6} (all elements from both sets, listed once).

📐 Graphical method for intervals

The excerpt demonstrates finding intersection and union by shading on the number line:

Example: A = [−5, 3) and B = (1, ∞).
Intersection A ∩ B: shade the overlap → (1, 3).
Union A ∪ B: shade each of A and B, then describe the combined shaded region → [−5, ∞).

🧩 Why union matters more in this text

Most sets of real numbers in Precalculus are either intervals or unions of intervals.
The excerpt notes that union is used more often than intersection in the text.

🧮 Expressing sets with interval notation

🧮 Strategy: graph first, then write

The excerpt recommends graphing the set on the number line and gleaning the interval notation from the graph.

🎯 Example patterns

The excerpt begins Example 1.1.1 with four sets to express in interval notation:

{x | x ≤ −2 or x ≥ 2}: This is a union of two intervals.
- x ≤ −2 corresponds to (−∞, −2].
- x ≥ 2 corresponds to [2, ∞).
- The word "or" signals union.
- (The excerpt does not complete the solution, but the pattern is clear: graph each inequality, then combine with union notation.)
{x | x ≠ 3}: All real numbers except 3.
- This is the union of (−∞, 3) and (3, ∞).
{x | x ≠ ±3}: All real numbers except 3 and −3.
- This is the union of (−∞, −3), (−3, 3), and (3, ∞).
{x | −1 < x ≤ 3 or x = 5}: A union of an interval and a single point.
- The interval (−1, 3] combined with the singleton set {5}.

⚠️ Don't confuse

"or" in set-builder notation → union (∪) in interval notation.
"and" in set-builder notation → intersection (∩) in interval notation.
A single inequality → one interval; multiple inequalities connected by "or" → union of intervals.

Relations

1.2 Relations

🧭 Overview

🧠 One-sentence thesis

Set operations (intersection and union) and the Cartesian coordinate plane provide the foundational tools for combining sets of real numbers and visualizing algebraic relationships geometrically.

📌 Key points (3–5)

Intersection and union: intersection captures the overlap (elements in both sets), union captures the totality (elements in either set or both).
Interval notation for unions: sets of real numbers that exclude certain points or combine multiple ranges are expressed as unions of intervals.
Cartesian coordinates: ordered pairs (x, y) locate points in the plane by projecting to the x-axis and y-axis; order matters.
Common confusion: union vs intersection—union uses "or" (either/both), intersection uses "and" (must be in both); also, (a, b) and (b, a) are different points unless a = b.
Symmetry: points can be symmetric about the x-axis, y-axis, or origin by negating the appropriate coordinate(s).

🔗 Set operations: intersection and union

🔗 Intersection

The intersection of A and B: A ∩ B = {x | x ∈ A and x ∈ B}

Intersection is the overlap of two sets—the elements the sets have in common.
Both conditions must hold: x must be in A and in B.
Example: if A = {1, 2, 3} and B = {2, 4, 6}, then A ∩ B = {2} (only 2 appears in both).
For intervals: if A = [−5, 3) and B = (1, ∞), shade the overlap to get A ∩ B = (1, 3).

🔗 Union

The union of A and B: A ∪ B = {x | x ∈ A or x ∈ B (or both)}

Union is the totality of elements in each set, collected together.
At least one condition must hold: x can be in A, in B, or in both.
Example: if A = {1, 2, 3} and B = {2, 4, 6}, then A ∪ B = {1, 2, 3, 4, 6}.
For intervals: if A = [−5, 3) and B = (1, ∞), shade both regions to get A ∪ B = [−5, ∞).
Don't confuse: union uses "or" (inclusive), intersection uses "and" (both required).

📝 Why union is emphasized

Most sets of real numbers in this text are either intervals or unions of intervals.
Union is used more frequently than intersection for describing real-number sets.

📐 Expressing sets with interval notation

📐 Excluding single points

When a set excludes one value, it splits the number line into two intervals joined by union.
Example: {x | x ≠ 3} excludes x = 3, so shade everything except that point: (−∞, 3) ∪ (3, ∞).
Example: {x | x ≠ ±3} excludes both x = 3 and x = −3, breaking the line into three intervals: (−∞, −3) ∪ (−3, 3) ∪ (3, ∞).

📐 Combining inequalities with "or"

"Or" in set-builder notation translates to union in interval notation.
Example: {x | x ≤ −2 or x ≥ 2} becomes (−∞, −2] ∪ [2, ∞).
Example: {x | −1 < x ≤ 3 or x = 5} becomes (−1, 3] ∪ {5} (a single point can be written as a set).

🔍 Graphing first

The excerpt recommends graphing the set on the number line first, then reading off the interval notation.
This visual approach helps identify how many intervals are needed and where the boundaries are.

🗺️ The Cartesian coordinate plane

🗺️ What it is

The Cartesian coordinate plane unites algebra and geometry by allowing us to draw algebraic objects.
It consists of two perpendicular real number lines: the horizontal x-axis and the vertical y-axis, crossing at the origin (0, 0).
The plane is the set of all possible ordered pairs (x, y) where x and y are real numbers.

📍 Ordered pairs and coordinates

An ordered pair (x, y) locates a point by projecting to the x-axis (abscissa or x-coordinate) and y-axis (ordinate or y-coordinate).

The first number (x-coordinate) tells how far to move horizontally from the origin (right if positive, left if negative).
The second number (y-coordinate) tells how far to move vertically (up if positive, down if negative).
Order matters: (2, −4) and (−4, 2) are different points.
Example: to plot (2, −4), move 2 units right and 4 units down from the origin.

📍 Important facts about coordinates

Fact	Meaning
(a, b) = (c, d) if and only if a = c and b = d	Two points are the same only if both coordinates match
(x, y) lies on the x-axis ⟺ y = 0	Points on the x-axis have zero y-coordinate
(x, y) lies on the y-axis ⟺ x = 0	Points on the y-axis have zero x-coordinate
Origin is (0, 0)	The only point on both axes

🧭 Quadrants

The axes divide the plane into four regions (quadrants), numbered counterclockwise with Roman numerals:

Quadrant	Signs	Example
I	x > 0, y > 0	(1, 2)
II	x < 0, y > 0	(−1, 2)
III	x < 0, y < 0	(−1, −2)
IV	x > 0, y < 0	(1, −2)

Points on the axes (other than the origin) do not belong to any quadrant; they lie on the positive or negative x-axis or y-axis.
Example: (0, 4) lies on the positive y-axis; (−117, 0) lies on the negative x-axis.

🪞 Symmetry and reflections

🪞 Three types of symmetry

Two points are symmetric about the x-axis if they have the same x-coordinate and opposite y-coordinates; about the y-axis if opposite x-coordinates and same y-coordinate; about the origin if both coordinates are opposite.

Symmetry type	Condition	How to find
About x-axis	a = c and b = −d	Replace y with −y
About y-axis	a = −c and b = d	Replace x with −x
About origin	a = −c and b = −d	Replace both x with −x and y with −y

🪞 Reflection process

Think of the axis or origin as a mirror.
Example: starting with (−2, 3):
- Reflect across x-axis: (−2, −3) (negate y).
- Reflect across y-axis: (2, 3) (negate x).
- Reflect across origin: (2, −3) (negate both).
Reflecting across the x-axis then the y-axis is equivalent to reflecting across the origin.

🪞 Visualizing symmetry

In a diagram, if P and Q are symmetric about the x-axis, they are mirror images across the horizontal axis.
If P and R are symmetric about the origin, they are diagonally opposite through (0, 0).
Don't confuse: symmetry about the y-axis (flip horizontally) vs symmetry about the x-axis (flip vertically) vs symmetry about the origin (flip both).

Introduction to Relations and Graphing Equations

1.3 Introduction to Functions

🧭 Overview

🧠 One-sentence thesis

Relations are sets of points in the plane that can be described verbally, algebraically, or graphically, and understanding intercepts and symmetry helps us sketch their graphs systematically.

📌 Key points (3–5)

What a relation is: a set of points in the plane that can be described using roster method, set-builder notation, or equations.
The Fundamental Graphing Principle: a point (x, y) is on the graph of an equation if and only if substituting those values makes the equation true.
Intercepts: x-intercepts are found by setting y = 0; y-intercepts are found by setting x = 0; both are written as ordered pairs.
Symmetry testing: substitute (−x, y) for y-axis symmetry, (x, −y) for x-axis symmetry, or (−x, −y) for origin symmetry—if the result matches the original equation, that symmetry exists.
Common confusion: not every x-value produces a real y-value on a graph; some substitutions may yield non-real results, meaning no point exists at that x-coordinate.

📍 What relations are and how to describe them

📍 Definition and methods

Relation: a set of points in the plane.

Since relations are sets, we can describe them in multiple ways:
- Verbally: in words.
- Roster method: listing the points explicitly, e.g., R = {(−2, 1), (4, 3), (0, −3)}.
- Set-builder notation: using a rule, e.g., {(x, 3) | −2 ≤ x ≤ 4}.
- Graphically: plotting the points.
- Algebraically: using an equation, e.g., x = 3 or y = −2.

🖊️ Graphing relations from set-builder notation

Example: HLS₁ = {(x, 3) | −2 ≤ x ≤ 4} means "all points with y-coordinate 3 and x-coordinate between −2 and 4, inclusive."
This describes a horizontal line segment from (−2, 3) to (4, 3), including both endpoints.
Don't confuse: HLS₂ = {(x, 3) | −2 ≤ x < 4} looks similar but excludes the point (4, 3)—we draw an open circle at (4, 3) to show it's not included.
Why the open circle? There is no real number "immediately before" 4, so we use the open circle to indicate the segment extends arbitrarily close to (4, 3) without including it.

📏 Vertical and horizontal lines

Equation	Description	Example
x = a	Vertical line through (a, 0)	V = {(3, y) \| y is a real number} gives the vertical line x = 3
y = b	Horizontal line through (0, b)	H = {(x, y) \| y = −2} gives the horizontal line y = −2

For vertical and horizontal lines, one coordinate is fixed and the other is "free" (unrestricted).
We use arrows on the graph to indicate the line extends indefinitely in both directions.

🌐 Regions in the plane

Example: R = {(x, y) | 1 < y ≤ 3} describes all points where y is strictly greater than 1 but less than or equal to 3, with x unrestricted.
This is the region between the horizontal lines y = 1 and y = 3.
The line y = 1 is dashed (not included); the line y = 3 is solid (included).

🔗 The Fundamental Graphing Principle

🔗 What it means

The Fundamental Graphing Principle: The graph of an equation is the set of points which satisfy the equation. A point (x, y) is on the graph if and only if x and y satisfy (make true) the equation.

"Satisfy the equation" means substituting x and y into the equation produces a true statement.
Example: To check if (2, −1) is on the graph of x² + y³ = 1, substitute: (2)² + (−1)³ = 4 − 1 = 3 ≠ 1, so (2, −1) is not on the graph.

🧮 Generating points systematically

Instead of guessing, solve the equation for one variable (usually y) in terms of the other (usually x).
Example: From x² + y³ = 1, solve for y: y = ³√(1 − x²).
Substitute various x-values, compute the corresponding y-values, and plot the points (x, y).
Limitation: Some x-values may produce non-real y-values (e.g., square root of a negative number), meaning no point exists at that x-coordinate—just move on and try another value.

🎨 Connecting the dots

After plotting enough representative points, sketch a smooth curve through them.
The excerpt notes that precise graphing is a topic for Calculus; for now, a rough sketch is acceptable.

🎯 Intercepts: where graphs cross the axes

🎯 Definitions

x-intercept: a point on the graph that is also on the x-axis (has the form (x, 0)).

y-intercept: a point on the graph that is also on the y-axis (has the form (0, y)).

A graph can have any number of intercepts—zero, one, two, or more.

🔍 How to find intercepts

Intercept type	How to find it	Why
x-intercept	Set y = 0 and solve for x	x-intercepts lie on the x-axis, where y = 0
y-intercept	Set x = 0 and solve for y	y-intercepts lie on the y-axis, where x = 0

Important: Intercepts are points, so write them as ordered pairs, not just numbers.
Example: For (x − 2)² + y² = 1, setting y = 0 gives (x − 2)² = 1, so x − 2 = ±1, yielding x = 3 or x = 1. The x-intercepts are (1, 0) and (3, 0).
Setting x = 0 gives (0 − 2)² + y² = 1, so 4 + y² = 1, which means y² = −3. Since no real number squares to a negative, there are no y-intercepts.

🔄 Testing for symmetry

🔄 Why symmetry matters

If a graph has symmetry, we can reduce the amount of plotting: whatever happens on one side is mirrored on the other.
Symmetry is tested analytically by substituting transformed coordinates into the equation.

🧪 The three symmetry tests

Symmetry type	Substitute	If result matches original equation…
About the y-axis	(−x, y)	Graph is symmetric about the y-axis
About the x-axis	(x, −y)	Graph is symmetric about the x-axis
About the origin	(−x, −y)	Graph is symmetric about the origin

🧩 Example: y-axis symmetry

For x² + y³ = 1, substitute (−x, y): (−x)² + (y)³ = x² + y³ = 1.
This matches the original equation, so the graph is symmetric about the y-axis.
Consequence: If (x, y) is on the graph, so is (−x, y).

🧩 Example: x-axis symmetry

For (x − 2)² + y² = 1, substitute (x, −y): (x − 2)² + (−y)² = (x − 2)² + y² = 1.
This matches the original equation, so the graph is symmetric about the x-axis.
Consequence: We only need to plot points below (or above) the x-axis; the other half is a reflection.

⚠️ Ruling out symmetry

If a graph has an x-intercept at (1, 0) but not at (−1, 0), it cannot be symmetric about the y-axis or the origin.
Why? y-axis symmetry would require (−1, 0) to also be on the graph; origin symmetry would require (−1, 0) as well.

🛠️ Putting it all together: graphing strategy

🛠️ Step-by-step approach

Find intercepts: Set y = 0 for x-intercepts; set x = 0 for y-intercepts.
Test for symmetry: Check y-axis, x-axis, and origin symmetry using substitution.
Solve for y (if possible): Express y in terms of x to make point generation easier.
Plot additional points: Choose x-values, compute y-values, and plot (x, y).
Use symmetry: If symmetry exists, reflect points accordingly to reduce work.
Sketch the curve: Connect the dots smoothly, keeping in mind the overall shape.

🧭 Example walkthrough

For (x − 2)² + y² = 1:
- Intercepts: x-intercepts at (1, 0) and (3, 0); no y-intercepts.
- Symmetry: Symmetric about the x-axis (not about y-axis or origin).
- Additional points: Solve for y: y = ±√(1 − (x − 2)²). Choose x-values between 1 and 3, compute y, and plot.
- Reflect: Since the graph is symmetric about the x-axis, points below the x-axis mirror those above.
- Result: A closed curve (circle) centered at (2, 0) with radius 1.

💡 Practical tips

Not every x-value will yield a real y-value—if you encounter a square root of a negative number, skip that x and try another.
Graphing calculators can help visualize, but understanding the analytical process builds deeper insight.
The excerpt emphasizes that this "plug and plot" approach is introductory; more sophisticated techniques come later (in Calculus).

Function Notation

1.4 Function Notation

🧭 Overview

🧠 One-sentence thesis

Relations are sets of points in the plane that can be described verbally, algebraically, or graphically, and understanding how to graph equations and identify their intercepts forms the foundation for analyzing these point sets systematically.

📌 Key points (3–5)

What a relation is: a set of points in the plane that can be described using roster method, set-builder notation, verbally, graphically, or algebraically.
How to graph relations: plot points that satisfy the given description, whether listed explicitly or defined by conditions on x and y coordinates.
Intercepts are special points: x-intercepts occur where the graph crosses the x-axis (set y = 0), and y-intercepts occur where it crosses the y-axis (set x = 0).
Common confusion: distinguishing between strict and non-strict inequalities—use open circles for strict inequalities (less than or greater than) and closed circles or solid points for inclusive inequalities (less than or equal to, greater than or equal to).
The Fundamental Graphing Principle: a point (x, y) is on the graph of an equation if and only if x and y satisfy (make true) that equation.

📍 What relations are and how to describe them

📍 Definition of a relation

A relation is a set of points in the plane.

Since relations are sets, they can be described using multiple methods from set theory.
The same relation can be represented in different ways depending on convenience.

🔤 Five ways to describe relations

Method	Description	Example from excerpt
Roster	List all points explicitly	R = {(−2, 1), (4, 3), (0, −3)}
Set-builder	Use conditions with variables	{(x, 3) \| −2 ≤ x ≤ 4}
Verbal	Describe in words	"Points with x-coordinate 3"
Graphical	Plot points visually	Plotting points on coordinate plane
Algebraic	Use equations	x = 3 or y = −2

🎯 Why the word "relation"

When an equation contains both x and y, the equation itself is what relates the two variables.
Example: In the relation R = {(x, y) | x squared + y cubed = 1}, the pairs of x and y are related by making the equation true.

📊 Graphing specific types of relations

📊 Discrete point sets

Simply plot each point listed in the roster.
Example: A = {(0, 0), (−3, 1), (4, 2), (−3, 2)} is graphed by plotting these four points.

➖ Horizontal and vertical lines

Equations of Vertical and Horizontal Lines:

The graph of x = a is a vertical line through (a, 0).
The graph of y = b is a horizontal line through (0, b).

Why this matters:

When one coordinate is fixed and the other is free, you get a line.
Example: V = {(3, y) | y is a real number} produces a vertical line at x = 3 with arrows indicating it extends indefinitely.

📏 Line segments with inequalities

When variables are restricted by inequalities, you get segments rather than full lines.
Inclusive inequality (≤ or ≥): use a closed/filled circle or include the endpoint.
Strict inequality (< or >): use an open circle to show the endpoint is not included.

Example from excerpt:

HLS₁ = {(x, 3) | −2 ≤ x ≤ 4} includes both (−2, 3) and (4, 3)—draw a solid segment between them.
HLS₂ = {(x, 3) | −2 ≤ x < 4} includes (−2, 3) but not (4, 3)—draw an open circle at (4, 3).

Don't confuse: You cannot just stop at the last integer before 4 (like stopping at 3); there are infinitely many points between 3 and 4 (like 3.1, 3.9, 3.99), so an open circle is required.

🌐 Regions in the plane

When both coordinates have restrictions, you get a region.
Example: R = {(x, y) | 1 < y ≤ 3} means x is free (any real number) but y is between 1 and 3.
Graph: the region between horizontal lines y = 1 and y = 3, with y = 1 dashed (not included) and y = 3 solid (included).

🔍 The Fundamental Graphing Principle

🔍 Core principle

The graph of an equation is the set of points which satisfy the equation. That is, a point (x, y) is on the graph of an equation if and only if x and y satisfy the equation.

"Satisfy" means "make the equation true."
This principle connects algebra (equations) with geometry (graphs).

✅ Testing if a point is on a graph

Method: Substitute the x and y values into the equation and check if the result is true.

Example from excerpt:

Question: Is (2, −1) on the graph of x squared + y cubed = 1?
Substitute: (2) squared + (−1) cubed = 4 + (−1) = 3
Since 3 does not equal 1, the point (2, −1) is NOT on the graph.

🗺️ Generating points systematically

Instead of random guessing, solve the equation for one variable in terms of the other.

Example from excerpt:

Equation: x squared + y cubed = 1
Solve for y: y = cube root of (1 − x squared)
Now substitute various x values to find corresponding y values.
Create a table of (x, y) pairs and plot them.

Note: The points you plot are only a sample; connecting them gives an approximate curve shape.

🎯 Intercepts: where graphs meet the axes

🎯 What intercepts are

An x-intercept is a point on a graph which is also on the x-axis.

A y-intercept is a point on a graph which is also on the y-axis.

Intercepts are special because they show where the graph crosses or touches the coordinate axes.
A graph can have any number of intercepts (zero, one, many).

🔢 How to find intercepts algebraically

Intercept type	Form	How to find
x-intercept	(x, 0)	Set y = 0 in the equation and solve for x
y-intercept	(0, y)	Set x = 0 in the equation and solve for y

Important: Intercepts are points (ordered pairs), not just single numbers.

Example from excerpt:

The graph of x squared + y cubed = 1 has two x-intercepts: (−1, 0) and (1, 0)
It has one y-intercept: (0, 1)

⚠️ Common mistake

Don't write intercepts as single numbers like "x-intercept = 3."
Always write them as ordered pairs: "x-intercept is (3, 0)."

Intercepts and Symmetry of Graphs

1.5 Function Arithmetic

🧭 Overview

🧠 One-sentence thesis

Intercepts and symmetry are two analytical tools that help us sketch and understand the graph of an equation without relying solely on plotting many points.

📌 Key points (3–5)

Intercepts: points where the graph crosses or touches the axes—x-intercepts lie on the x-axis (form (x, 0)), y-intercepts lie on the y-axis (form (0, y)).
How to find intercepts: set y = 0 to find x-intercepts; set x = 0 to find y-intercepts; intercepts are always written as ordered pairs.
Symmetry tests: substitute specific coordinate transformations into the equation—if the result is equivalent to the original equation, the graph has that symmetry.
Common confusion: a point may fail to exist on the graph (e.g., when solving yields no real number); also, failing a symmetry test algebraically must be confirmed by finding a counterexample point.
Why it matters: intercepts and symmetry reduce the amount of plotting needed and reveal structural properties of the graph.

📍 Intercepts: where graphs meet the axes

📍 What intercepts are

x-intercept: a point on a graph which is also on the x-axis.

y-intercept: a point on a graph which is also on the y-axis.

Intercepts are special points because they show where the graph crosses or touches the coordinate axes.
A graph can have any number of intercepts—many, one, or none at all.
Intercepts must always be written as ordered pairs (points), not just single numbers.

🔍 How to find intercepts

Type	Form	Procedure
x-intercept	(x, 0)	Set y = 0 in the equation and solve for x
y-intercept	(0, y)	Set x = 0 in the equation and solve for y

Example: For the equation (x − 2)² + y² = 1, setting y = 0 gives (x − 2)² = 1, so x − 2 = ±1, yielding x = 3 or x = 1. The x-intercepts are (1, 0) and (3, 0).
Example: For the same equation, setting x = 0 gives 4 + y² = 1, so y² = −3. Since no real number squares to a negative number, there are no y-intercepts.

⚠️ When intercepts don't exist

Sometimes solving for an intercept yields no real solution (e.g., the square root of a negative number).
This simply means the graph does not cross that axis at any point.
Don't confuse: "no solution" is a valid answer—it tells you something about the graph's behavior.

🔄 Symmetry: mirror properties of graphs

🔄 Three types of symmetry

Symmetry type	Test substitution	What it means
About the y-axis	Replace (x, y) with (−x, y)	Left and right halves are mirror images
About the x-axis	Replace (x, y) with (x, −y)	Top and bottom halves are mirror images
About the origin	Replace (x, y) with (−x, −y)	180° rotational symmetry around (0, 0)

For each test: substitute the transformed coordinates into the equation and simplify.
If the result is equivalent to the original equation, the graph has that symmetry.
If the result is not equivalent, the graph does not have that symmetry.

🧪 How to perform a symmetry test

Step-by-step procedure:

Assume (x, y) is a generic point on the graph (i.e., it satisfies the equation).
Substitute the appropriate transformation: (−x, y) for y-axis, (x, −y) for x-axis, or (−x, −y) for origin.
Simplify the resulting equation.
Compare: does it match the original equation?

Example: For x² + y³ = 1, test y-axis symmetry by substituting (−x, y):

(−x)² + (y)³ = x² + y³ = 1 ✓
This matches the original equation, so the graph is symmetric about the y-axis.

Example: For (x − 2)² + y² = 1, test x-axis symmetry by substituting (x, −y):

(x − 2)² + (−y)² = (x − 2)² + y² = 1 ✓
This matches the original equation, so the graph is symmetric about the x-axis.

🚫 Proving lack of symmetry

If the algebraic test fails (the simplified equation does not match the original), you should confirm by finding a counterexample point.
A counterexample: a point (x, y) on the graph whose reflection is not on the graph.

Example: For (x − 2)² + y² = 1, test y-axis symmetry by substituting (−x, y):

(−x − 2)² + y² = (x + 2)² + y² ✗
This does not match the original equation.
Confirm: the point (1, 0) is on the graph (it's an x-intercept), but substituting (−1, 0) gives (−1 − 2)² + 0² = 9 ≠ 1, so (−1, 0) is not on the graph.
Therefore, the graph is not symmetric about the y-axis.

🔗 Using symmetry to reduce work

If a graph has x-axis symmetry, whatever happens below the x-axis is reflected above it, and vice versa.
This means you can plot points on one side of the axis and automatically know the corresponding points on the other side.
Example: If (2, 1) is on a graph with x-axis symmetry, then (2, −1) is also on the graph without additional computation.

🛠️ Practical graphing strategy

🛠️ Combining intercepts and symmetry

Recommended workflow:

Find all x- and y-intercepts (if any exist).
Test for symmetry (y-axis, x-axis, origin).
Plot the intercepts.
Use symmetry to reduce the number of additional points you need to plot.
Choose a few strategic x-values, solve for y, and plot those points.
Reflect points across the axis or origin if symmetry exists.
Sketch the graph by connecting the points smoothly.

⚠️ Common pitfalls

Not all x-values yield real y-values: when you substitute an x-value and solve for y, you may get the square root of a negative number or another undefined expression. This means there is no point on the graph with that x-coordinate—move on and try another value.
Intercepts are points, not numbers: always write intercepts as ordered pairs, e.g., (3, 0), not just "3".
Symmetry failure requires a counterexample: if the algebraic test suggests no symmetry, verify by finding a point on the graph whose reflection is not on the graph.

🧩 Why this approach matters

The excerpt emphasizes that "tedious computation and plotting" can be reduced by using intercepts and symmetry.
These tools allow you to understand the structure of the graph analytically, rather than relying on a graphing calculator or plotting many points blindly.
The excerpt notes that much of the remainder of the book will develop techniques to graph entire families of equations quickly, building on these foundational ideas.

Graphs of Functions

1.6 Graphs of Functions

🧭 Overview

🧠 One-sentence thesis

The Vertical Line Test provides a geometric criterion to determine whether a relation represents y as a function of x by checking that no two points share the same x-coordinate.

📌 Key points (3–5)

What a function is: a special relation where each x-coordinate matches with only one y-coordinate.
How to test graphically: use the Vertical Line Test—if any vertical line crosses the graph more than once, it's not a function.
Common confusion: multiple points can share the same y-coordinate and still be a function; what matters is that x-coordinates don't repeat.
Why it matters: the function concept is core to algebra and connects algebraic definitions to geometric visualization.

📐 Defining functions as relations

📐 What makes a relation a function

A relation in which each x-coordinate is matched with only one y-coordinate is said to describe y as a function of x.

A function is not just any relation—it has a special restriction.
The key requirement: one x-coordinate → one y-coordinate only.
This is an algebraic definition based on ordered pairs.

🔍 Testing with ordered pairs

When given a set of points, scan for repeated x-coordinates:

If the same x appears with different y-values → not a function.
If every x appears only once (or always with the same y) → is a function.

Example from the excerpt:

R₁ = {(−2, 1), (1, 3), (1, 4), (3, −1)}: the x-coordinate 1 appears twice with different y-values (3 and 4), so not a function.
R₂ = {(−2, 1), (1, 3), (2, 3), (3, −1)}: every x appears only once, so is a function.

Don't confuse: R₂ has two points with the same y-coordinate (both 1 and 2 map to 3), but that's allowed—only repeated x-coordinates matter.

🖼️ The Vertical Line Test

🖼️ Geometric interpretation

The Vertical Line Test: A set of points in the plane represents y as a function of x if and only if no two points lie on the same vertical line.

This translates the algebraic definition into a visual check.
A vertical line has the form x = constant.
If two points lie on the same vertical line (e.g., x = 1), they share the same x-coordinate but have different y-coordinates → violates the function definition.

✅ How to apply the test

Look at the graph of the relation.
Imagine (or draw) vertical lines sweeping across the graph.
If any vertical line crosses the graph at more than one point → not a function.
If every vertical line crosses at most once → is a function.

Example from the excerpt:

Graph of R₁: points (1, 3) and (1, 4) both lie on the vertical line x = 1 → fails the test.
Graph of R₂: no vertical line crosses more than one point → passes the test.

🧠 Why the test works

The vertical line x = c represents all points with that x-coordinate.
If the graph crosses that line twice, it means the relation assigns two different y-values to the same x-value.
This directly violates the "one x → one y" rule.

🔄 Common confusions

🔄 Same y-coordinate is okay

Multiple x-values can map to the same y-value without breaking the function rule.
Example: (1, 3) and (2, 3) are both in R₂—this is fine because the x-coordinates are different.
Horizontal lines (same y, different x) are allowed; vertical repetition (same x, different y) is not.

🔄 Function of x vs function of y

The excerpt notes that the same relation can behave differently depending on which variable is the "input."
R₁ does not represent y as a function of x, but it does represent x as a function of y (each y appears only once).
The direction matters: "y as a function of x" means x is the independent variable.

Note: The excerpt includes exercise lists, answer keys, and references to external curves (Folium of Descartes, etc.), which provide practice but do not add new conceptual content beyond the core definition and Vertical Line Test.

Transformations

1.7 Transformations

🧭 Overview

🧠 One-sentence thesis

This excerpt does not contain substantive content on transformations; it consists entirely of exercise problems, answer keys, and introductory material on functions and relations.

📌 Key points (3–5)

The excerpt is primarily a collection of exercises (graphing equations, testing symmetry, determining functions) and their numerical/graphical answers.
A small portion introduces the concept of a function as a special relation where each x-coordinate matches exactly one y-coordinate.
The Vertical Line Test is presented as a geometric method to determine whether a relation represents y as a function of x.
The excerpt briefly introduces function notation f(x) and the idea of domain (inputs) and range (outputs).
Common confusion: The excerpt does not address transformations of functions (shifts, stretches, reflections); the title "1.7 Transformations" does not match the content provided.

📚 What the excerpt actually contains

📚 Exercise sets and answers

The bulk of the excerpt consists of:
- Numbered exercises asking students to graph equations, test for symmetry, determine whether relations are functions, and find domains and ranges.
- Answer keys with tables of values, graphs, and statements about symmetry.
These exercises cover relations, graphs of equations (parabolas, cubics, circles, hyperbolas, lines, square roots), and symmetry tests (x-axis, y-axis, origin).
No instructional content on transformations (e.g., shifting, scaling, reflecting graphs) is present.

📚 Brief function concepts (Section 1.3)

The excerpt includes a short introduction to functions:

Definition 1.6: A relation in which each x-coordinate is matched with only one y-coordinate is said to describe y as a function of x.

Vertical Line Test (Theorem 1.1): A set of points represents y as a function of x if and only if no two points lie on the same vertical line.
Domain and range (Definition 1.7):
- Domain = the set of x-coordinates of points in the function.
- Range = the set of y-coordinates of points in the function.
Examples show how to apply the Vertical Line Test to graphs and how to solve equations to determine if y is a function of x.

📚 Function notation (Section 1.4, partial)

A brief introduction to function notation appears at the end:

Function notation: f(x) denotes the output when the process f is applied to input x.
Independent variable: x (the input).
Dependent variable: y (the output, which depends on x).
Example: if f multiplies by 3 then adds 4, then f(x) = 3x + 4.

⚠️ Mismatch with title

⚠️ No transformation content

The title "1.7 Transformations" suggests the section should cover transformations of functions (e.g., vertical/horizontal shifts, stretches, compressions, reflections).
The excerpt provided contains no such material.
Instead, it includes exercises on graphing, symmetry, and basic function definitions from earlier sections (1.2 Relations, 1.3 Introduction to Functions, 1.4 Function Notation).

⚠️ What is missing

To fulfill the title, the excerpt would need to explain:

How to shift graphs vertically (f(x) + k) or horizontally (f(x − h)).
How to stretch or compress graphs (af(x) or f(bx)).
How to reflect graphs (−f(x) or f(−x)).
None of these topics appear in the provided text.

🔍 Key concepts that are present

🔍 Function definition

A relation describes y as a function of x if each x-coordinate is matched with only one y-coordinate.

This means: for every input x, there is exactly one output y.
Example from the excerpt: R₁ = {(−2, 1), (1, 3), (1, 4), (3, −1)} is not a function because x = 1 is paired with both y = 3 and y = 4.
Example: R₂ = {(−2, 1), (1, 3), (2, 3), (3, −1)} is a function because each x appears only once (even though y = 3 appears twice, which is allowed).

🔍 Vertical Line Test

Geometric criterion: A graph represents y as a function of x if and only if every vertical line crosses the graph at most once.
If any vertical line crosses the graph more than once, the relation fails the test and is not a function.
Example: A circle fails the Vertical Line Test (a vertical line through the middle crosses it twice).

🔍 Domain and range

Domain: all x-values that appear in the function.
Range: all y-values that appear in the function.
For a graph, project the curve onto the x-axis to find the domain; project onto the y-axis to find the range.
Example from the excerpt: For F = {(−3, 2), (0, 1), (4, 2), (5, 2)}, domain = {−3, 0, 4, 5} and range = {1, 2}.

🔍 Determining functions algebraically

To check if an equation represents y as a function of x, solve for y in terms of x.
If solving produces a ± (plus-or-minus) in front of y, the equation is not a function (one x gives two y-values).
Example: x³ + y² = 1 → y² = 1 − x³ → y = ±√(1 − x³). The ± means not a function.
Example: x² + y³ = 1 → y³ = 1 − x² → y = ³√(1 − x²). Only one y for each x, so it is a function.

📝 Summary

The excerpt does not cover transformations of functions. It is a collection of exercises and answers on:

Graphing relations and equations.
Testing for symmetry (x-axis, y-axis, origin).
Determining whether a relation is a function (using the Vertical Line Test and algebraic methods).
Finding domain and range.
Introducing function notation f(x).

To study transformations, you would need a different section of the textbook that explains how to shift, stretch, compress, and reflect function graphs.

Introduction to Functions

2.1 Linear Functions

🧭 Overview

🧠 One-sentence thesis

A function is a special kind of relation where each x-coordinate is matched with exactly one y-coordinate, ensuring that no x-value maps to multiple y-values.

📌 Key points (3–5)

What a function is: a relation in which each x-coordinate is matched with only one y-coordinate.
How to check if a relation is a function: scan all points to see if any x-coordinate appears more than once with different y-coordinates.
Common confusion: different x-coordinates can share the same y-coordinate and still be a function; the restriction is only that one x cannot map to multiple y-values.
Why it matters: functions are a core concept in College Algebra and distinguish a special subset of all relations.

🔍 Defining functions as relations

🔍 What a function is

A relation in which each x-coordinate is matched with only one y-coordinate is said to describe y as a function of x.

A function is not just any relation; it is a relation with a specific restriction.
The restriction: each x-coordinate must correspond to exactly one y-coordinate.
The excerpt emphasizes "only one," meaning no x-value can be paired with two or more different y-values.

🧪 How to test whether a relation is a function

Examine all ordered pairs in the relation.
Look for any x-coordinate that appears more than once.
If the same x-coordinate appears with different y-coordinates, the relation is not a function.
If every x-coordinate appears only once (or appears multiple times but always with the same y-coordinate), the relation is a function.

📋 Worked example: distinguishing functions from non-functions

📋 Example 1: R₁ is not a function

Given: R₁ = {(−2, 1), (1, 3), (1, 4), (3, −1)}
Analysis: The x-coordinate 1 is matched with two different y-coordinates: 3 and 4.
Conclusion: In R₁, y is not a function of x.
Why: The definition requires each x to have only one y; here x = 1 has two different y-values.

✅ Example 2: R₂ is a function

Given: R₂ = {(−2, 1), (1, 3), (2, 3), (3, −1)}
Analysis: Every x-coordinate occurs only once in the set.
Conclusion: R₂ does represent y as a function of x.
Don't confuse: Notice that the y-coordinate 3 appears twice (paired with x = 1 and x = 2). This is allowed; the restriction is only on x-coordinates, not y-coordinates.

🧷 Common confusion: repeated y-coordinates

🧷 Multiple x-values can share the same y-value

The excerpt notes that R₂ "contained two different" points with the same y-coordinate (the sentence is incomplete, but the context is clear from the example).
Key distinction:
- Not allowed: one x matched with multiple y-values (breaks the function rule).
- Allowed: multiple x-values matched with the same y-value (does not break the function rule).
Example: In R₂, both (1, 3) and (2, 3) have y = 3, but this is fine because each x (1 and 2) has only one y.

Absolute Value Functions

2.2 Absolute Value Functions

🧭 Overview

🧠 One-sentence thesis

Absolute value functions can be graphed and solved by rewriting them as piecewise-defined functions, which reveals their characteristic "V" shape and allows systematic analysis of their behavior.

📌 Key points (3–5)

Definition of absolute value: |x| is defined piecewise: it equals −x when x < 0 and equals x when x ≥ 0, effectively converting negative numbers to their positive counterparts while leaving positive numbers unchanged.
Solving absolute value equations: use equality properties (e.g., |x| = c means x = c or x = −c when c > 0) or break into cases using the piecewise definition when the variable appears both inside and outside the absolute value.
Graphing strategy: rewrite the absolute value function as a piecewise function using the definition, then graph each piece separately; alternatively, apply transformations to the basic graph of f(x) = |x|.
Common confusion: the equation |x| = c has no solution when c < 0 (absolute value is never negative); when solving equations with x both inside and outside absolute values, you must use the case-by-case method, not just the equality properties.
Characteristic shape: most simple absolute value functions produce a "V" or inverted "V" shape with a corner point, though more complex functions involving multiple absolute values may not follow this pattern.

📐 Definition and properties

📐 What absolute value means

Absolute value of a real number x, denoted |x|, is given by: |x| = { −x, if x < 0; x, if x ≥ 0 }

This is a piecewise-defined function.
Interpretation: |x| takes negative real numbers and assigns them to their positive counterparts while leaving positive numbers alone.
Alternative definitions exist (distance from zero on the number line, or square root of x squared), but the piecewise definition is adopted here.
Example: |5| = 5 because 5 ≥ 0, so we use the rule |x| = x. For |−5|, since −5 < 0, we use |x| = −x, giving |−5| = −(−5) = 5.
Don't confuse: "−x" means "the opposite of x," not "negative x"—when x is already negative, −x is positive.

🔧 Properties of absolute value

The excerpt lists these key properties:

Property	Statement	Condition
Product Rule	\|ab\| = \|a\|\|b\|	Always
Power Rule	\|aⁿ\| = \|a\|ⁿ	Whenever aⁿ is defined
Quotient Rule	\|a/b\| = \|a\|/\|b\|	b ≠ 0
Zero property	\|x\| = 0 if and only if x = 0	Always
Positive c	\|x\| = c means x = c or −x = c	c > 0
Negative c	\|x\| = c has no solution	c < 0

The proofs involve checking cases (both positive, both negative, one positive and one negative, one or both zero).
Example proof sketch for product rule: if a and b are both positive, then |ab| = ab and |a||b| = ab, so the equation holds; if both are negative, |ab| = ab (product is positive) and |a||b| = (−a)(−b) = ab, so it holds; similar reasoning for mixed signs and zero cases.

🧮 Solving absolute value equations

🧮 Using equality properties (simple cases)

When the equation has the form |expression| = c:

If c > 0: set expression = c OR expression = −c, then solve both.
If c < 0: no solution (absolute value is never negative).
If c = 0: set expression = 0 and solve.

Example: |3x − 1| = 6

Since 6 > 0, write: 3x − 1 = 6 or 3x − 1 = −6
Solving: x = 7/3 or x = −5/3

Example: 3 − |x + 5| = 1

Isolate: −|x + 5| = −2, so |x + 5| = 2
Then: x + 5 = 2 or x + 5 = −2
Solutions: x = −3 or x = −7

Example: 4 − |5x + 3| = 5

Isolate: |5x + 3| = −1
No solution (absolute value cannot equal a negative number).

🔀 Case-by-case method (complex cases)

When the variable appears both inside and outside the absolute value, use the piecewise definition:

Example: |x| = x² − 6

Case 1 (x < 0): |x| = −x, so −x = x² − 6, giving x² + x − 6 = 0, so (x + 3)(x − 2) = 0. Solutions x = −3 or x = 2; keep only x = −3 (satisfies x < 0).
Case 2 (x ≥ 0): |x| = x, so x = x² − 6, giving x² − x − 6 = 0, so (x − 3)(x + 2) = 0. Solutions x = 3 or x = −2; keep only x = 3 (satisfies x ≥ 0).
Final solutions: x = −3 and x = 3.

Example: |x − 2| + 1 = x

Isolate: |x − 2| = x − 1
Rewrite |x − 2| using the definition: |x − 2| = { −(x − 2) if x < 2; (x − 2) if x ≥ 2 } = { −x + 2 if x < 2; x − 2 if x ≥ 2 }
Case 1 (x < 2): −x + 2 = x − 1, so 2x = 3, giving x = 3/2. Since 3/2 < 2, keep this solution.
Case 2 (x ≥ 2): x − 2 = x − 1, which simplifies to −2 = −1 (false), so no solution here.
Final solution: x = 3/2.

Don't confuse: You cannot always use the equality properties when x appears both inside and outside the absolute value—you risk introducing extraneous solutions or losing valid ones.

📊 Graphing absolute value functions

📊 Basic graph and piecewise approach

The basic function f(x) = |x|:

Rewrite using definition: f(x) = { −x if x < 0; x if x ≥ 0 }
For x < 0: graph the line y = −x (with open circle at origin)
For x ≥ 0: graph the line y = x (fills in the point at origin)
Result: characteristic "V" shape with vertex at (0, 0)
Domain: all real numbers; Range: [0, ∞)
Decreasing on (−∞, 0], increasing on [0, ∞)
Absolute minimum at (0, 0); no maximum

🔄 Using transformations

You can graph absolute value functions by transforming the basic graph of f(x) = |x|:

Example: g(x) = |x − 3|

This is f(x − 3), so shift the graph of f(x) = |x| right 3 units
Moves (−1, 1) to (2, 1), (0, 0) to (3, 0), (1, 1) to (4, 1)
Vertex at (3, 0); domain: all real numbers; range: [0, ∞)

Example: h(x) = |x| − 3

This is f(x) − 3, so shift the graph of f(x) = |x| down 3 units
Moves (−1, 1) to (−1, −2), (0, 0) to (0, −3), (1, 1) to (1, −2)
Vertex at (0, −3); domain: all real numbers; range: [−3, ∞)

Example: i(x) = 4 − 2|3x + 1|

Rewrite as i(x) = −2f(3x + 1) + 4
Inside changes: subtract 1 from x-values (shift left 1), then divide by 3 (horizontal shrink by factor of 3)
Outside changes: multiply y-values by −2 (vertical stretch by 2 and reflect across x-axis), then add 4 (shift up 4)
Result: inverted "V" shape with vertex at (−1/3, 4)

🧩 Functions without the "V" shape

Not all functions with absolute values have the characteristic "V" shape:

Example: f(x) = |x|/x

Domain: all real numbers except 0
Rewrite: f(x) = { −x/x if x < 0; x/x if x > 0 } = { −1 if x < 0; 1 if x > 0 }
Graph: two horizontal lines (y = −1 for x < 0, y = 1 for x > 0) with open circles at (0, −1) and (0, 1)
Range: {−1, 1}; constant on each piece
Every point is simultaneously a relative maximum and minimum

Example: g(x) = |x + 2| − |x − 3| + 1

Must break into multiple cases because there are two absolute value terms
Rewrite by applying the definition to each absolute value separately:
- For x < −2: g(x) = −4
- For −2 ≤ x < 3: g(x) = 2x
- For x ≥ 3: g(x) = 6
Graph consists of two horizontal segments connected by a slanted line segment
Domain: all real numbers; range: [−4, 6]

🔍 Analyzing graphs

🔍 Finding intercepts and zeros

Zeros (where f(x) = 0):

Set the function equal to zero and solve
These give the x-coordinates of x-intercepts

y-intercept:

Set x = 0 and evaluate f(0)
Functions can have at most one y-intercept

Example: For g(x) = |x − 3|, setting g(x) = 0 gives |x − 3| = 0, so x = 3. The x-intercept is (3, 0). Setting x = 0 gives y = |0 − 3| = 3, so the y-intercept is (0, 3).

Example: For f(x) = |x|/x, setting f(x) = 0 gives |x| = 0, so x = 0. But x = 0 is not in the domain, so there are no zeros and no x-intercepts. There is also no y-intercept since f(0) is undefined.

📈 Domain, range, and behavior

For each function, determine:

Domain: all x-values for which the function is defined (project graph onto x-axis)
Range: all y-values the function takes (project graph onto y-axis)
Increasing/decreasing/constant intervals: where the function rises, falls, or stays level as x increases
Extrema:
- Relative (local) minimum/maximum: lowest/highest point in a neighborhood
- Absolute (global) minimum/maximum: lowest/highest point overall

Example: For h(x) = |x| − 3:

Domain: all real numbers
Range: [−3, ∞)
Decreasing on (−∞, 0], increasing on [0, ∞)
Relative and absolute minimum: −3 at (0, −3)
No relative or absolute maximum

Don't confuse: A function that is constant on an interval can have every point be both a relative maximum and minimum simultaneously (this occurs when the function value doesn't change in any neighborhood of the point).

Quadratic Functions

2.3 Quadratic Functions

🧭 Overview

🧠 One-sentence thesis

Quadratic functions can be transformed and analyzed using either standard form (which reveals the vertex directly) or general form (which can be converted to standard form by completing the square), enabling us to solve optimization problems and understand parabolic behavior.

📌 Key points (3–5)

Two forms: Quadratic functions appear in general form f(x) = ax² + bx + c or standard form f(x) = a(x − h)² + k, each useful for different purposes.
The vertex reveals extrema: The vertex (h, k) is either the maximum or minimum point of the parabola, determined by whether a is negative (opens down) or positive (opens up).
Completing the square converts forms: Any quadratic in general form can be rewritten in standard form through the completing-the-square process.
Common confusion: The coefficient 'a' in both forms is the same number—it controls whether the parabola opens upward (a > 0) or downward (a < 0), not the vertex location.
Real-world optimization: Quadratic functions model profit, area, projectile motion, and other scenarios where finding maximum or minimum values is essential.

📐 Definition and basic shape

📐 What is a quadratic function

Quadratic function: A function of the form f(x) = ax² + bx + c, where a, b, and c are real numbers with a ≠ 0. The domain is always (−∞, ∞).

The simplest quadratic is f(x) = x², whose graph is a parabola.
The parabola has a characteristic U-shape (or inverted U if a < 0).
The turning point is called the vertex.

🔄 Parabola orientation

Coefficient a	Parabola opens	Vertex is
a > 0	Upward	Minimum point
a < 0	Downward	Maximum point

Example: f(x) = x² opens upward; vertex (0, 0) is the minimum.
The parabola is symmetric about a vertical line through the vertex, called the axis of symmetry.

🎯 Standard form and transformations

🎯 Standard form definition

Standard form: f(x) = a(x − h)² + k, where a, h, and k are real numbers with a ≠ 0.

This form makes the vertex immediately visible: (h, k).
The axis of symmetry is the vertical line x = h.
Standard form connects directly to transformations of the basic parabola y = x².

🔧 Using transformations to graph

Start with the graph of f(x) = x² (vertex at origin).
The expression a(x − h)² + k applies transformations in sequence:
- Multiply y-values by a (vertical stretch/compression and possible reflection).
- Shift horizontally by h units (right if h > 0, left if h < 0).
- Shift vertically by k units (up if k > 0, down if k < 0).
Example: g(x) = (x + 2)² − 3 shifts the basic parabola left 2 units and down 3 units; vertex moves to (−2, −3).

🧮 Finding intercepts from standard form

x-intercepts: Set y = 0 and solve a(x − h)² + k = 0. Extract square roots after isolating (x − h)².
y-intercept: Set x = 0 and compute f(0).
Example: For g(x) = (x + 2)² − 3, setting y = 0 gives (x + 2)² = 3, so x = −2 ± √3.

🔄 Converting between forms

🔄 Completing the square

Purpose: Convert general form f(x) = ax² + bx + c into standard form f(x) = a(x − h)² + k.
Process:
1. Factor out the coefficient of x² from the x² and x terms (if a ≠ 1).
2. Take half the coefficient of x (inside the parentheses), square it, then add and subtract that value.
3. Group the perfect square trinomial and factor it.
4. Simplify the constant terms outside.
Example: f(x) = x² − 4x + 3
- Compute half of −4: (−4)/2 = −2; square it: 4.
- Rewrite: f(x) = (x² − 4x + 4 − 4) + 3 = (x − 2)² − 1.
- Vertex is (2, −1).

🔄 When a ≠ 1

First factor out a from the x² and x terms only.
Complete the square inside the parentheses.
Distribute the factored 'a' back through when simplifying.
Example: g(x) = −x² − x + 6
- Factor: (−1)(x² + x) + 6
- Half of 1 is 1/2; square: 1/4.
- (−1)(x² + x + 1/4 − 1/4) + 6 = −(x + 1/2)² + 1/4 + 6 = −(x + 1/2)² + 25/4.
- Vertex: (−1/2, 25/4).

📍 Vertex formulas

📍 Vertex from standard form

If f(x) = a(x − h)² + k, the vertex is (h, k) directly.
No calculation needed—just read off h and k.
Watch for sign: f(x) = (x + 2)² − 3 means h = −2 (not +2).

📍 Vertex from general form

If f(x) = ax² + bx + c, the x-coordinate of the vertex is x = −b/(2a).
Substitute this x-value back into f to find the y-coordinate: y = f(−b/(2a)).
Example: f(x) = x² − 4x + 3 has a = 1, b = −4, so x = −(−4)/(2·1) = 2; then f(2) = 4 − 8 + 3 = −1. Vertex: (2, −1).

🔍 Why the vertex formula works

Completing the square on ax² + bx + c yields a(x + b/(2a))² + (4ac − b²)/(4a).
The squared term (x + b/(2a))² equals zero when x = −b/(2a), making that the vertex's x-coordinate.
The minimum (or maximum) value occurs there because the squared term is always ≥ 0 (or ≤ 0 if a < 0).

🧪 The quadratic formula and discriminant

🧪 Quadratic formula

Quadratic formula: The solutions to ax² + bx + c = 0 are x = (−b ± √(b² − 4ac))/(2a).

Derived by completing the square on the general form and solving for x.
Provides the x-intercepts (zeros) of the quadratic function.

🔬 The discriminant

Discriminant: The quantity b² − 4ac in the quadratic formula.

The discriminant determines the nature of the solutions:

Discriminant	Number of real solutions	Graph behavior
b² − 4ac < 0	No real solutions	Parabola does not cross x-axis
b² − 4ac = 0	Exactly one real solution	Parabola touches x-axis at vertex
b² − 4ac > 0	Two distinct real solutions	Parabola crosses x-axis twice

Example: For x² − 4x + 3 = 0, discriminant = (−4)² − 4(1)(3) = 16 − 12 = 4 > 0, so two real solutions exist.

🔍 Don't confuse

The discriminant tells you how many real zeros exist, not their values.
A negative discriminant means the parabola stays entirely above (a > 0) or below (a < 0) the x-axis.

💼 Applications and optimization

💼 Profit maximization

Profit P(x) = Revenue R(x) − Cost C(x).
Often R(x) is quadratic (from price-demand) and C(x) is linear, making P(x) quadratic.
The vertex of P(x) gives the production level x that maximizes profit.
Example: If P(x) = −1.5x² + 170x − 150, vertex at x = −170/(2·(−1.5)) = 170/3 ≈ 56.67. Since x must be a whole number, compare P(56) and P(57) to find the maximum.

💼 Break-even points

Break-even occurs when P(x) = 0 (profit equals zero).
These are the x-intercepts of the profit function.
Profit is positive between the two break-even points (if the parabola opens downward).

📦 Area and perimeter problems

Typical setup: maximize area given a fixed perimeter.
Express area A as a function of one variable using the perimeter constraint.
Find the vertex of the resulting quadratic to locate the maximum area.
Example: Rectangular pasture with 200 feet of fencing along three sides (one side is a river). Let width = w, length = l. Then l + 2w = 200, so l = 200 − 2w. Area A(w) = w·l = w(200 − 2w) = 200w − 2w². Vertex at w = −200/(2·(−2)) = 50 feet; maximum area = 5000 square feet.

🚀 Projectile motion

Height h(t) of a projectile is often modeled by h(t) = −at² + v₀t + s₀, where a > 0 (gravity), v₀ is initial velocity, s₀ is initial height.
This is a downward-opening parabola (since coefficient of t² is negative).
Maximum height occurs at the vertex.
The object hits the ground when h(t) = 0.

📊 Graphing and symmetry

📊 Axis of symmetry

Every parabola is symmetric about a vertical line through the vertex.
For f(x) = a(x − h)² + k, the axis of symmetry is x = h.
For f(x) = ax² + bx + c, the axis of symmetry is x = −b/(2a).
Points equidistant from the axis have the same y-value.

📊 Sketching strategy

Identify the vertex (using standard form or the vertex formula).
Determine whether the parabola opens up or down (sign of a).
Find the y-intercept by setting x = 0.
Find x-intercepts (if any) by solving f(x) = 0.
Use symmetry to plot additional points if needed.
Draw a smooth parabolic curve through the points.

🔍 Range determination

If a > 0 (opens upward), range is [k, ∞) where k is the y-coordinate of the vertex.
If a < 0 (opens downward), range is (−∞, k].
The vertex y-value is the absolute minimum (a > 0) or absolute maximum (a < 0).

🎨 Absolute value of quadratics

🎨 Graphing f(x) = |g(x)| for quadratic g

Start by graphing y = g(x) normally.
Wherever g(x) < 0 (graph is below the x-axis), reflect that portion upward across the x-axis.
Portions where g(x) ≥ 0 remain unchanged.
Example: f(x) = |x² − x − 6|
- First graph g(x) = x² − x − 6 (parabola with x-intercepts at x = −2 and x = 3).
- Between x = −2 and x = 3, g(x) is negative, so reflect that portion upward.
- Result: a W-shaped graph with vertices at the original x-intercepts.

🔍 Why this works

By definition, |g(x)| = g(x) when g(x) ≥ 0 and |g(x)| = −g(x) when g(x) < 0.
Multiplying by −1 reflects the graph across the x-axis.
This is a general technique for graphing absolute values of any function.

Inequalities with Absolute Value and Quadratic Functions

2.4 Inequalities with Absolute Value and Quadratic Functions

🧭 Overview

🧠 One-sentence thesis

Inequalities involving absolute values and quadratic functions can be solved both analytically (using algebraic techniques and sign diagrams) and graphically (by comparing where one function's graph lies above or below another's), with both methods revealing the same solution intervals.

📌 Key points (3–5)

Graphical interpretation: Solutions to f(x) < g(x) correspond to x-values where the graph of f is below the graph of g; solutions to f(x) > g(x) are where f is above g; solutions to f(x) = g(x) are intersection points.
Absolute value inequalities: For c > 0, |x| < c means -c < x < c (x is within c units of zero), while |x| > c means x < -c or x > c (x is more than c units from zero).
Sign diagram method for quadratics: Find zeros of the quadratic, place them on a number line, test values in each interval to determine the sign, then select intervals matching the inequality.
Common confusion: When variables appear both inside and outside absolute values (e.g., |x + 1| ≥ x), use case-by-case analysis based on the definition of absolute value rather than standard theorems.
Practical applications: Inequalities model real-world tolerances, such as manufacturing specifications where measurements must fall within acceptable ranges of target values.

📊 Graphical interpretation of inequalities

📍 Connecting equations and graphs

The Fundamental Graphing Principle: point (a, b) is on the graph of f if and only if f(a) = b.

A generic point on y = f(x) is written (x, f(x)).
When solving f(x) = g(x), we seek x-values where both functions produce the same y-value.
Graphically, these are the x-coordinates of intersection points.

📈 Reading inequalities from graphs

The excerpt establishes three key graphical rules:

Inequality	Graphical meaning
f(x) = g(x)	x-values where graphs intersect
f(x) < g(x)	x-values where graph of f is below graph of g
f(x) > g(x)	x-values where graph of f is above graph of g

Example: If f(x) = 2x - 1 and g(x) = 5, they intersect at x = 3. For x < 3, the line f is below the horizontal line g; for x > 3, f is above g.

🔍 Visual verification

After solving algebraically, you can check by graphing both functions.
The solution intervals should match where one graph is positioned relative to the other.
Don't confuse: "below" means smaller y-values, not physical position on the page.

🔢 Absolute value inequalities

📏 Basic absolute value theorem

The excerpt provides Theorem 2.4 with six parts for different cases:

For c > 0:

|x| < c is equivalent to -c < x < c
|x| ≤ c is equivalent to -c ≤ x ≤ c
|x| > c is equivalent to x < -c or x > c
|x| ≥ c is equivalent to x ≤ -c or x ≥ c

For c ≤ 0:

|x| < c has no solution (absolute value is never negative)
|x| > c and |x| ≥ c are true for all real numbers when c < 0

🧮 Graphical understanding

The graph of y = |x| is a V-shape with vertex at the origin.
For |x| < c (where c > 0), we want x-values where the V-shape is below the horizontal line y = c.
This occurs between the two intersection points at x = -c and x = c.
The excerpt notes this can be understood using the increasing property of square root: if 0 ≤ a ≤ b, then √a ≤ √b.

⚠️ Variables inside and outside

When variables appear both inside and outside the absolute value (e.g., |x + 1| ≥ (x + 4)/2):

Standard theorems don't apply directly.
Use Definition 2.4: break into cases based on when the expression inside is positive or negative.
Example: For |x + 1|, consider x < -1 (where |x + 1| = -(x + 1)) and x ≥ -1 (where |x + 1| = x + 1) separately.
Solve each case, then combine solutions that fall within the appropriate domain.

🔗 Compound inequalities

For inequalities like 2 < |x - 1| ≤ 5:

Rewrite as two separate inequalities connected by "and."
Solve |x - 1| > 2 to get x < -1 or x > 3.
Solve |x - 1| ≤ 5 to get -4 ≤ x ≤ 6.
Take the intersection: [-4, -1) ∪ (3, 6].

📐 Quadratic inequalities and sign diagrams

🎯 The sign diagram method

The excerpt presents a five-step procedure:

Rewrite: Get quadratic function f(x) on one side, 0 on the other.
Find zeros: Solve f(x) = 0 (by factoring, quadratic formula, etc.).
Mark intervals: Place zeros on a number line; they divide it into intervals.
Test each interval: Choose one test value in each interval; evaluate f at that value to determine sign.
Select intervals: Choose intervals with the correct sign (+ for >, - for <).

🔑 Key property of quadratics

If a quadratic is positive at one point and negative at another, it must have at least one zero in between.
Graphically: a parabola can't be above the x-axis at one point and below at another without crossing.
This means testing just one value per interval is sufficient to determine the sign throughout that interval.

📝 Sign diagram notation

Example for f(x) = x² - x - 6 with zeros at x = -2 and x = 3:

    -2        3
(+)  0  (-)  0  (+)

(+) means f(x) > 0 in that interval
(-) means f(x) < 0 in that interval
0 marks the zeros themselves

🧪 Handling non-factorable quadratics

When the quadratic doesn't factor nicely:

Use the quadratic formula to find zeros.
Approximate the zeros to help choose test values.
Example: For x² - 2x - 1 = 0, zeros are x = 1 ± √2 ≈ -0.4 and 2.4.

🎲 Special cases

One zero only: The parabola touches the x-axis at one point (vertex on x-axis). Example: x² - 2x + 1 = 0 has only x = 1. The sign diagram shows (+) on both sides of this point.
No real zeros: The parabola doesn't cross the x-axis. Example: x² + 1 ≤ 0 has no solution because x² + 1 is always positive.

🏭 Applications and regions

📦 Tolerance problems

The excerpt demonstrates using absolute value inequalities to model manufacturing tolerances:

Setup: If a measurement x must be within d units of target c, write |x - c| ≤ d.
Interpretation: |x - c| represents the distance from x to c.
Example: A particle board with area A(x) = x² must have area within 0.25 square inches of 576 square inches. This gives |x² - 576| ≤ 0.25.

🔧 Solving tolerance problems

The excerpt shows a technique using the increasing property of square root:

From |x² - 576| ≤ 0.25, get -0.25 ≤ x² - 576 ≤ 0.25.
Add 576: 575.75 ≤ x² ≤ 576.25.
Take square roots: √575.75 ≤ |x| ≤ √576.25.
Solve using Theorem 2.4 and intersect the solution sets.
Discard negative values if x represents a physical measurement.

🗺️ Graphing regions in the plane

Relations can describe regions using inequalities:

R = {(x, y) : y > |x|}: All points above the V-shaped graph of y = |x|. Use dotted line for the boundary since points on y = |x| are not included.
S = {(x, y) : y ≤ 2 - x²}: All points on or below the parabola y = 2 - x². Use solid curve since boundary points are included.
Combined conditions: For T = {(x, y) : |x| < y ≤ 2 - x²}, shade the region between the two graphs, including the parabola boundary but not the absolute value boundary.

🔍 Finding boundary intersections

To accurately graph combined regions:

Set the two boundary functions equal to find intersection points.
Example: |x| = 2 - x² requires case analysis, yielding x = -1 and x = 1.
These points help define where the shaded region begins and ends.

Regression

2.5 Regression

🧭 Overview

🧠 One-sentence thesis

Regression analysis uses statistical methods to find lines or curves that best fit real-world data by minimizing the total squared error, enabling predictions and trend analysis even when data points don't fall perfectly on a mathematical curve.

📌 Key points (3–5)

What regression does: finds a "best fit" line or curve for data points that don't lie perfectly on one path, measured by minimizing total squared error.
Linear vs quadratic models: linear regression finds the best line; quadratic regression finds the best parabola; choice depends on whether data shows constant change or curved behavior.
Goodness of fit indicators: the correlation coefficient r (closer to ±1 is better for linear) and R² coefficient of determination measure how well the model matches the data.
Common confusion: a mathematically good fit (high r or R²) doesn't guarantee the model is appropriate—always check if the model type (linear, quadratic, etc.) matches the real-world phenomenon.
Practical use: regression models enable predictions (future values) and interpretation (e.g., slope = rate of change), but they are approximations and may not work far beyond the original data range.

📏 How regression measures closeness

📏 Total squared error

Total squared error E: the sum of the squares of the vertical distances between each data point and the corresponding point on the line or curve.

For each data point, find the point on the line/curve with the same x-coordinate.
Measure the vertical distance (difference in y-coordinates).
Square each distance and add them all up.
Example: Three points {(1, 2), (3, 1), (4, 3)} with line y = (1/2)x + (1/2) gives E = 9/4.

🎯 Least squares regression line

Least squares regression line (or "line of best fit"): the line that produces the lowest possible total squared error for a given data set.

Advanced mathematics (Calculus and Linear Algebra) can find this optimal line.
Graphing calculators have built-in features to compute it automatically.
The calculator outputs the line as y = ax + b, where a is the slope and b is the y-intercept.

📊 Measuring fit quality

📊 Correlation coefficient (r)

Measures how close data points are to being perfectly linear.
Range: r is between -1 and +1.
Interpretation: |r| closer to 1 means better linear fit; r ≈ 0.327 indicates data "aren't close to being linear."
Don't confuse: r only measures linear fit; data might fit a curve well but have low r.

📊 Coefficient of determination (R²)

Also measures goodness of fit.
For quadratic regression, the excerpt uses R² instead of r.
Closer to 1 indicates a better fit.
Example: R² reasonably close to 1 suggests the quadratic model is decent.

🔢 Linear regression in practice

🔢 Energy consumption example

The excerpt walks through US energy usage from 1950–2000:

Step	What happens	Result
Plot data	Points appear linear	Suggests linear model is appropriate
Find regression line	Calculator gives y = 1.287x − 2473.890	High r indicates good fit
Interpret slope	Slope = 1.287 Quads per year	Energy usage increasing at this rate
Predict future	Substitute x = 2013	Predicted usage ≈ 116.841 Quads
Predict timing	Set y = 120 and solve for x	Usage reaches 120 Quads between 2015–2016

🔢 Slope interpretation

Slope = rate of change of y with respect to x.
Units matter: if y is in Quads and x is in years, slope of 1.287 means "1.287 Quads per year increase."
Positive slope = increasing trend; negative slope = decreasing trend.

📈 Quadratic regression

📈 When to use quadratic models

Data that starts one way then changes direction suggests a parabola.
Example: Temperature rises in morning, peaks, then falls in afternoon—this curved behavior fits a parabola better than a line.

📈 Temperature example

Hourly temperature data for one day:

Plotting shows temperatures rise then fall.
Linear regression wouldn't capture the peak.
Quadratic regression finds best-fit parabola: y = −0.321x² + 9.464x − 45.857.
The vertex formula (x = −b/(2a)) locates the maximum temperature and when it occurs.

📈 Process

Quadratic regression: finds the parabola that minimizes the least square error between data points and the curve.

Calculator has built-in quadratic regression feature.
Outputs y = ax² + bx + c and R² value.
Use vertex formula to find maximum or minimum points.

⚠️ Model limitations and cautions

⚠️ Models are approximations

The quadratic temperature model predicted 23.899°F maximum, but actual data showed 24°F.
Regression captures trends but doesn't perfectly match every point.
Models work best within the range of the original data.

⚠️ Choosing the right model type

Just because you can fit a line or parabola doesn't mean you should.
Example: Cat population growth over 10 years—quadratic regression gives R² = 0.70888 but predicts 24 million cats at year 0 when actual starting value is 2.
Example: Hours of daylight through the year—quadratic model gives R² = 0.92295 but predicts negative daylight hours for month 13.
Don't confuse: high R² doesn't mean the model type is correct for the phenomenon.

⚠️ Understanding why matters

Observing trends and fitting models is useful, but understanding the underlying reasons why data should be linear, quadratic, or something else is crucial.
This deeper investigation "is usually the business of scientists."
Predictions far beyond the data range are especially unreliable.

⚠️ Data selection and honesty

The excerpt includes a cautionary note about presenting only selected data:

Same person's weight measured on Thursdays vs Saturdays gave very different regression results.
Thursday data: good linear fit (r² = 0.9155).
Saturday data: terrible linear fit (r² = 0.0000807).
Both used real, accurate data—but selective presentation can be "disingenuous."
Lesson: even without lying or using bad math, careful data selection can manipulate conclusions.

Graphs of Polynomials

3.1 Graphs of Polynomials

🧭 Overview

🧠 One-sentence thesis

The behavior of polynomial graphs is determined by their degree, leading coefficient, multiplicity of zeros, and the fact that they are smooth and continuous everywhere, allowing us to predict end behavior and sketch graphs without detailed calculations.

📌 Key points (3–5)

What polynomials are: functions of the form f(x) = aₙxⁿ + aₙ₋₁xⁿ⁻¹ + ... + a₁x + a₀ with natural number degree n ≥ 1 and domain (−∞, ∞).
End behavior rule: a polynomial's end behavior matches that of its leading term aₙxⁿ; even-degree polynomials go the same direction on both ends, odd-degree polynomials go opposite directions.
Multiplicity determines crossing: zeros with odd multiplicity cross the x-axis; zeros with even multiplicity touch and rebound.
Common confusion: not all functions that look polynomial-like are polynomials—domain restrictions, absolute values, and fractional exponents disqualify functions.
Why smoothness matters: polynomials are continuous (no breaks) and smooth (no sharp turns), which guarantees the Intermediate Value Theorem applies and enables sign-diagram construction.

📐 Defining polynomial functions

📐 What counts as a polynomial

Polynomial function: a function of the form f(x) = aₙxⁿ + aₙ₋₁xⁿ⁻¹ + ... + a₂x² + a₁x + a₀, where a₀, a₁, ..., aₙ are real numbers and n ≥ 1 is a natural number. The domain is (−∞, ∞).

The subscripts on coefficients indicate which power of x they belong to (a₅ is the coefficient of x⁵).
The natural number restriction (n = 1, 2, 3, ...) keeps us focused on well-behaved algebraic functions.
Domain is crucial: even if a function simplifies to polynomial form, if its original domain excludes any real numbers, it is not a polynomial.

📐 What disqualifies a function

The excerpt provides several examples of non-polynomials:

Domain restrictions: g(x) = (x³ + 4)/x excludes x = 0, so not a polynomial.
Fractional exponents: f(x) = ³√x = x^(1/3) has exponent 1/3, which is not a natural number.
Absolute value: h(x) = |x| cannot be written as a combination of powers of x (it has a sharp corner).
However: q(x) = (x³ + 4x)/(x² + 4) simplifies to x with domain all real numbers, so it qualifies as a polynomial.

Example: A function that simplifies to polynomial form but originally had domain restrictions is not a polynomial; check the domain before simplifying.

📐 Polynomial terminology

Degree: the natural number n in the leading term (when aₙ ≠ 0).
Leading term: aₙxⁿ (the term with highest power).
Leading coefficient: aₙ (the coefficient of the leading term).
Constant term: a₀ (the term with no x).

Special cases:

If f(x) = a₀ with a₀ ≠ 0, the degree is 0.
If f(x) = 0, the function has no degree.

Finding these without expanding: For a factored polynomial like p(x) = (2x − 1)³(x − 2)(3x + 2), multiply only the leading terms of each factor: (2x)³(x)(3x) = 24x⁵, so degree is 5 and leading coefficient is 24. For the constant term, multiply the constant terms: (−1)³(−2)(2) = 4.

🔚 End behavior patterns

🔚 What end behavior means

End behavior: what happens to y-values as x → −∞ (far left) and as x → ∞ (far right).
Notation: "as x → −∞, f(x) → ∞" means as x becomes a large negative number, f(x) becomes a large positive number.

🔚 Even-degree polynomials

For f(x) = axⁿ where n is even:

Leading coefficient	Left end (x → −∞)	Right end (x → ∞)	Shape
a > 0	f(x) → ∞	f(x) → ∞	Both ends up
a < 0	f(x) → −∞	f(x) → −∞	Both ends down

Examples: y = x², y = x⁴, y = x⁶ all have both ends going up.
As exponent increases, the bottom becomes "flatter" and sides become "steeper."
All even-power functions are symmetric (even functions).

🔚 Odd-degree polynomials

For f(x) = axⁿ where n ≥ 3 is odd:

Leading coefficient	Left end (x → −∞)	Right end (x → ∞)	Shape
a > 0	f(x) → −∞	f(x) → ∞	Down left, up right
a < 0	f(x) → ∞	f(x) → −∞	Up left, down right

Examples: y = x³, y = x⁵, y = x⁷ all go down on the left and up on the right.
These functions are odd functions (symmetric about the origin).

🔚 Why only the leading term matters

Key theorem: The end behavior of f(x) = aₙxⁿ + aₙ₋₁xⁿ⁻¹ + ... + a₀ matches the end behavior of y = aₙxⁿ.

Why: For large |x|, terms with lower powers become negligible. For example, in f(x) = 4x³ − x + 5, as x → ±∞, the terms −1/(4x²) and 5/(4x³) approach 0, leaving f(x) ≈ 4x³.

Graphically: If you zoom out far enough on a graphing calculator, the graph of a polynomial becomes indistinguishable from the graph of its leading term.

🎯 Zeros and multiplicity

🎯 What multiplicity means

Multiplicity: If (x − c)ᵐ is a factor of f(x) but (x − c)^(m+1) is not, then x = c is a zero of multiplicity m.

Example: For f(x) = x³(x − 3)²(x + 2) = (x − 0)³(x − 3)²(x − (−2))¹:

x = 0 has multiplicity 3
x = 3 has multiplicity 2
x = −2 has multiplicity 1

🎯 How multiplicity affects the graph

Key theorem:

Even multiplicity: the graph touches and rebounds from the x-axis at (c, 0).
Odd multiplicity: the graph crosses through the x-axis at (c, 0).

Why this happens: When testing values on either side of a zero, the factor (x − c)ᵐ changes sign only if m is odd. If m is even, squaring (or raising to any even power) keeps the sign positive on both sides.

Example: For f(x) = x³(x − 3)²(x + 2):

At x = −2 (multiplicity 1, odd): graph crosses
At x = 0 (multiplicity 3, odd): graph crosses
At x = 3 (multiplicity 2, even): graph touches and rebounds

Don't confuse: All odd multiplicities cause crossing, but higher odd multiplicities (3, 5, 7, ...) show "flattening" near the zero, unlike multiplicity 1 which crosses more directly.

🧮 Continuity and smoothness

🧮 What these properties mean

Continuous: the graph has no breaks or holes.
Smooth: the graph has no sharp turns (corners or cusps).

Polynomials are continuous and smooth everywhere. These properties are inherited when functions are added together.

🧮 Why this matters: the Intermediate Value Theorem

Intermediate Value Theorem (Zero Version): If f is continuous on an interval containing x = a and x = b (with a < b), and f(a) and f(b) have different signs, then f has at least one zero between x = a and x = b.

In plain language: A continuous function cannot go from above the x-axis to below it (or vice versa) without crossing the x-axis somewhere in between.

Example application: For f(x) = x² − 2, we have f(1) = −1 and f(3) = 7. Since these have different signs, there must be a real number c between 1 and 3 where f(c) = 0. This proves √2 is a real number.

🧮 Sign diagrams

Because polynomials are continuous, they maintain the same sign (all positive or all negative) on intervals between zeros.

Algorithm for constructing sign diagrams:

Find all zeros and place them on a number line with 0 above them.
Choose a test value in each interval between zeros.
Determine the sign of f(x) at each test value and write that sign above the interval.

Shortcut using end behavior and multiplicity: If you know the end behavior and multiplicity of each zero, you can sketch the graph without computing test values—just follow the pattern of crossing (odd multiplicity) or rebounding (even multiplicity) while respecting the end behavior.

📦 Applied example: optimization

📦 Box volume problem

The excerpt shows how polynomials arise naturally in geometry:

Problem: Make a box from a 10 in × 12 in piece of cardboard by cutting x-inch squares from each corner and folding up the sides.

Solution approach:

Height = x
Width = 10 − 2x (removed x from each side)
Depth = 12 − 2x
Volume V(x) = x(10 − 2x)(12 − 2x) = 4x³ − 44x² + 120x
Applied domain: 0 < x < 5 (must have x > 0 and 10 − 2x > 0)

Using a graphing calculator to find the maximum on this domain gives x ≈ 1.81 inches for maximum volume ≈ 96.77 cubic inches.

This illustrates why understanding polynomial graphs matters: real-world optimization problems often reduce to finding maxima or minima of polynomial functions.

The Factor Theorem and the Remainder Theorem

3.2 The Factor Theorem and the Remainder Theorem

🧭 Overview

🧠 One-sentence thesis

The Factor Theorem and Remainder Theorem establish that dividing a polynomial by (x − c) yields a remainder equal to p(c), and that c is a zero of the polynomial if and only if (x − c) is a factor, providing a systematic method to find all real zeros.

📌 Key points (3–5)

The Remainder Theorem: when you divide polynomial p(x) by (x − c), the remainder is exactly p(c).
The Factor Theorem: c is a zero of p if and only if (x − c) is a factor of p(x)—this connects zeros to factorization.
Synthetic division: a streamlined algorithm for dividing polynomials by divisors of the form (x − c), much faster than long division.
Common confusion: synthetic division works only for divisors of the form (x − c); for higher-degree divisors you must use polynomial long division.
Upper bound on zeros: a polynomial of degree n has at most n real zeros (counting multiplicities).

📐 Polynomial division framework

📐 The division algorithm (Theorem 3.4)

Polynomial Division: Suppose d(x) and p(x) are nonzero polynomials where the degree of p is greater than or equal to the degree of d. There exist two unique polynomials, q(x) and r(x), such that p(x) = d(x) q(x) + r(x), where either r(x) = 0 or the degree of r is strictly less than the degree of d.

Terminology: p is the dividend, d is the divisor, q is the quotient, r is the remainder.
Factor definition: if r(x) = 0, then d is called a factor of p.
Uniqueness: there is only one way to decompose p(x) into this form for a given divisor d(x).
Example: dividing x³ + 4x² − 5x − 14 by x − 2 gives quotient x² + 6x + 7 and remainder 0, so x³ + 4x² − 5x − 14 = (x − 2)(x² + 6x + 7).

📏 Degree behavior

When you divide a polynomial of degree n by a polynomial of degree 1 (like x − c), the quotient has degree exactly n − 1.
The remainder must have degree strictly less than the divisor's degree.
For divisor x − c (degree 1), the remainder must be degree 0, i.e., a constant (possibly zero).

🔑 The two core theorems

🔑 The Remainder Theorem (Theorem 3.5)

The Remainder Theorem: Suppose p is a polynomial of degree at least 1 and c is a real number. When p(x) is divided by x − c the remainder is p(c).

Why it works: write p(x) = (x − c) q(x) + r where r is a constant. Substitute x = c: p(c) = (c − c) q(c) + r = 0 · q(c) + r = r.
Practical use: to find p(c), you can either substitute c directly into p(x) or divide p(x) by x − c and read off the remainder.
Example: for p(x) = 2x³ − 5x + 3, dividing by x − (−2) gives remainder −3, so p(−2) = −3. Direct substitution confirms: 2(−8) − 5(−2) + 3 = −16 + 10 + 3 = −3.

🔑 The Factor Theorem (Theorem 3.6)

The Factor Theorem: Suppose p is a nonzero polynomial. The real number c is a zero of p if and only if (x − c) is a factor of p(x).

Two directions:
- If (x − c) is a factor, then p(x) = (x − c) q(x), so p(c) = 0 · q(c) = 0.
- If c is a zero (p(c) = 0), the Remainder Theorem says the remainder when dividing by x − c is p(c) = 0, so (x − c) is a factor.
Fundamental connection: this theorem links zeros (solutions to p(x) = 0) with factors (algebraic structure of p(x)).
Example: if x = 2 is a zero of p(x) = x³ + 4x² − 5x − 14, then (x − 2) must be a factor. Division confirms: x³ + 4x² − 5x − 14 = (x − 2)(x² + 6x + 7).

⚡ Synthetic division

⚡ Why synthetic division

Polynomial long division is tedious and error-prone.
When the divisor is x − c (a very common case), most of the written work is redundant.
Synthetic division strips away all redundant notation and keeps only the essential coefficients.

⚡ How synthetic division works

Setup: write c (from divisor x − c) on the left; write all coefficients of the dividend in a row (include 0 for missing powers).
Algorithm:
1. Bring down the first coefficient.
2. Multiply it by c, write the result under the next coefficient, then add.
3. Repeat: multiply the new sum by c, write under the next coefficient, add.
4. Continue until you reach the last coefficient.
Reading the result: the last number in the bottom row is the remainder; the other numbers are the coefficients of the quotient (which has degree one less than the dividend).
Example: to divide 5x³ − 2x² + 0x + 1 by x − 3, write:
```
3 | 5  −2   0   1
  ↓   15  39 117
  ─────────────
  5  13  39 118
```
Quotient: 5x² + 13x + 39; remainder: 118.

⚡ Important restrictions

Synthetic division works only for divisors of the form x − c.
For divisors like 2x − 3, first factor out the leading coefficient: 2x − 3 = 2(x − 3/2), divide the dividend by 2, then use synthetic division with c = 3/2.
For higher-degree divisors (e.g., x² + 4), you must use polynomial long division.

⚡ Repeated division for multiple zeros

If c is a zero of multiplicity 2 or more, you can continue the synthetic division tableau and divide by x − c again.

Example: if x = 1/2 is a zero of multiplicity 2 of p(x) = 4x⁴ − 4x³ − 11x² + 12x − 3, divide twice:

1/2 | 4  −4  −11   12  −3
    ↓   2   −1   −6   3
    ─────────────────────
    4  −2  −12    6   0
    ↓   2    0   −6
    ─────────────────
    4   0  −12    0

Result: p(x) = (x − 1/2)²(4x² − 12).

🔍 Finding all real zeros

🔍 Strategy: peel off zeros one at a time

Use the Factor Theorem: if you know one zero c, divide by x − c to get p(x) = (x − c) q(x).
Find the zeros of the quotient q(x), which has degree one less.
Repeat until you reach a quadratic, then use the quadratic formula.
Example: for p(x) = 2x³ − 5x + 3 with known zero x = 1, divide to get p(x) = (x − 1)(2x² + 2x − 3). Solve 2x² + 2x − 3 = 0 using the quadratic formula: x = (−1 ± √7)/2.

🔍 Handling irrational zeros

If x = √3 is a zero, then (x − √3) is a factor by the Factor Theorem.
You can continue synthetic division with c = √3 (or c = −√3) even though it's irrational.
Example: dividing 4x² − 12 by x − √3 gives quotient 4x + 4√3 and remainder 0, confirming the factorization.

🔍 Upper bound on the number of zeros (Theorem 3.7)

Theorem 3.7: Suppose f is a polynomial of degree n ≥ 1. Then f has at most n real zeros, counting multiplicities.

Why: each zero c gives a factor (x − c). Since f has degree n, there can be at most n such factors.
Don't confuse: "at most n" means there can be fewer (e.g., x² + 1 has degree 2 but zero real zeros).
Multiplicities count: if x = 2 is a zero of multiplicity 3, it counts as three zeros toward the total.

🔗 Connections between zeros, factors, and graphs

🔗 Five equivalent statements

For a polynomial p of degree n ≥ 1, the following are all equivalent (they mean the same thing):

Statement	Meaning
c is a zero of p	p(c) = 0
x = c is a solution to p(x) = 0	Solving the polynomial equation
(x − c) is a factor of p(x)	Algebraic factorization
(c, 0) is an x-intercept of y = p(x)	Graphical interpretation

Why this matters: you can move freely between algebraic (factors), numerical (zeros), equation-solving (solutions), and geometric (intercepts) perspectives.
Example: knowing x = 2 is a zero tells you (x − 2) is a factor, (2, 0) is on the graph, and x = 2 solves p(x) = 0.

🔗 Don't confuse: zero vs root vs solution vs intercept

These terms refer to the same underlying concept but emphasize different contexts.
"Zero of p" and "root of p(x) = 0" are most common in pure algebra.
"Solution to p(x) = 0" emphasizes equation-solving.
"x-intercept" emphasizes the graph.

Real Zeros of Polynomials

3.3 Real Zeros of Polynomials

🧭 Overview

🧠 One-sentence thesis

Finding the real zeros of a polynomial can be accomplished either by combining mathematical theorems with graphing technology or by relying solely on a sequence of mathematical theorems to narrow down candidates and test them systematically.

📌 Key points (3–5)

Two approaches exist: one uses theorems plus graphing calculators; the other uses theorems alone (the "purist" approach).
Cauchy's Bound tells you an interval that contains all real zeros; the Rational Zeros Theorem gives a list of possible rational zeros to test.
Descartes' Rule of Signs estimates how many positive and negative real zeros exist (counting multiplicity); the Upper and Lower Bounds Theorem helps refine the search interval during synthetic division.
Common confusion: Descartes' Rule counts multiplicities and gives a range (e.g., 3, 1, or 0 negative zeros), not exact values; it does not tell you which numbers are zeros, only how many to expect.
When theorems fail: some polynomials have irrational or no real zeros; in such cases, approximation methods like the Bisection Method or calculator commands are necessary.

🧮 Foundational theorems

🎯 Cauchy's Bound

Cauchy's Bound: For a polynomial f(x) = aₙxⁿ + aₙ₋₁xⁿ⁻¹ + … + a₁x + a₀ of degree n ≥ 1, let M be the largest of the absolute values |a₀|/|aₙ|, |a₁|/|aₙ|, …, |aₙ₋₁|/|aₙ|. Then all real zeros lie in the interval [−(M + 1), M + 1].

How to apply: divide the absolute value of each coefficient (except the leading one) by the absolute value of the leading coefficient; take the largest result as M; all real zeros are guaranteed to lie between −(M + 1) and M + 1.
Why it matters: gives you a starting window for graphing or a boundary for testing candidates.
Example: For f(x) = 2x⁴ + 4x³ − x² − 6x − 3, the largest coefficient in absolute value (other than the leading 2) is |−6| = 6, so M = 6/2 = 3, and all real zeros lie in [−4, 4].

🔢 Rational Zeros Theorem

Rational Zeros Theorem: If f(x) = aₙxⁿ + … + a₁x + a₀ has integer coefficients and r is a rational zero of f, then r = ±p/q, where p is a factor of the constant term a₀ and q is a factor of the leading coefficient aₙ.

How to apply: list all factors of the constant term (±p) and all factors of the leading coefficient (q); form all fractions ±p/q.
What it tells you: a complete list of possible rational zeros—if the polynomial has a rational zero, it must be on this list.
What it does not tell you: whether any of these candidates are actually zeros, or whether the polynomial has irrational or no real zeros.
Example: For f(x) = 2x⁴ + 4x³ − x² − 6x − 3, factors of −3 are ±1, ±3; factors of 2 are 1, 2; possible rational zeros are ±1/2, ±1, ±3/2, ±3.
Why it works: If c = p/q (in lowest terms) is a zero, then substituting into f and clearing denominators shows aₙpⁿ is a multiple of both p and q; since p and q share no common factors, aₙ must be a multiple of q and a₀ must be a multiple of p.

📊 Estimating the number and sign of zeros

🔀 Descartes' Rule of Signs

Descartes' Rule of Signs: For a polynomial f(x) written in descending powers of x, let P = number of sign changes in f(x) and N = number of sign changes in f(−x). Then the number of positive real zeros (counting multiplicity) is one of {P, P−2, P−4, …}, and the number of negative real zeros (counting multiplicity) is one of {N, N−2, N−4, …}.

How to count sign changes: read the nonzero coefficients from left to right; count each time the sign switches from + to − or − to +.
Counting multiplicity: a zero of multiplicity 2 counts as two zeros in this rule.
What it guarantees: if P = 1, there is exactly 1 positive real zero (counting multiplicity); if N = 3, there are 3, 1, or 0 negative real zeros.
Example: For f(x) = 2x⁴ + 4x³ − x² − 6x − 3, the signs are (+, +, −, −, −), so 1 sign change → exactly 1 positive real zero. For f(−x) = 2x⁴ − 4x³ − x² + 6x − 3, the signs are (+, −, −, +, −), so 3 sign changes → 3, 1, or 0 negative real zeros.
Don't confuse: this rule does not identify which numbers are zeros; it only estimates how many to expect in each region.

📏 Upper and Lower Bounds Theorem

Upper and Lower Bounds Theorem: If c > 0 is synthetically divided into f and all numbers in the final line have the same sign, then c is an upper bound (no real zero is greater than c). If c < 0 is synthetically divided into f and the numbers in the final line alternate signs, then c is a lower bound (no real zero is less than c). (Zero can be treated as + or − as needed.)

How to use: perform synthetic division with a candidate c; inspect the bottom row.
Upper bound: if c is positive and the bottom row is all nonnegative (or all nonpositive), stop testing larger candidates.
Lower bound: if c is negative and the bottom row alternates signs, stop testing smaller (more negative) candidates.
Example: Dividing f(x) = 2x⁴ + 4x³ − x² − 6x − 3 by x − 3/2 yields a bottom row of all positive numbers, so 3/2 is an upper bound; no need to test 3.
Why it works: the theorem relies on the division formula f(x) = (x − c)q(x) + r; if c > 0 and all coefficients of q and r are nonnegative, then for any b > c, f(b) > 0, so b cannot be a zero.

🖥️ Approach 1: Using a graphing calculator

📈 Workflow with technology

Find the interval using Cauchy's Bound.
List possible rational zeros using the Rational Zeros Theorem.
Graph y = f(x) on the calculator within the interval from step 1.
Shorten the candidate list by inspecting the graph: eliminate candidates that clearly do not match where the graph crosses the x-axis.
Test remaining candidates with synthetic division to confirm zeros and find multiplicities.
Use the quadratic formula if the quotient polynomial is quadratic.

🔍 Example walkthrough

For f(x) = 2x⁴ + 4x³ − x² − 6x − 3, the graph suggests no positive rational zeros and that −1 might be a zero.
Synthetic division with −1 succeeds; dividing again by −1 succeeds again, giving quotient 2x² − 3.
Setting 2x² − 3 = 0 yields x = ±√(3/2) = ±√6/2.
Multiplicities: −1 has multiplicity 2 (confirmed by two successful divisions); ±√6/2 each have multiplicity 1 (since the quotient is quadratic with two distinct roots, and the original polynomial is degree 4).
Graph refinement: the calculator initially suggested a "flattened crossing" at x = −1, but the multiplicity analysis reveals a relative maximum there instead.

⚠️ When technology is not enough

Example: f(x) = x⁴ + x² − 12 has no rational zeros (the graph shows two real zeros, one positive and one negative, but none of the rational candidates work).
Solution: recognize the polynomial as "quadratic in disguise": (x²)² + (x²) − 12 = 0; substitute u = x², factor as (u − 3)(u + 4) = 0, giving x² = 3 (so x = ±√3) or x² = −4 (no real solutions).
This technique (u-substitution) works when there are exactly three terms and the exponent of the first term is exactly twice that of the second.

🧪 Approach 2: Theorems only (no calculator)

🔬 Workflow without technology

Find the interval using Cauchy's Bound.
List possible rational zeros using the Rational Zeros Theorem.
Estimate the number of positive and negative zeros using Descartes' Rule of Signs.
Test candidates systematically with synthetic division, using the Upper and Lower Bounds Theorem to stop early when a bound is found.
Factor the quotient or apply the quadratic formula when the quotient is quadratic.

🧩 Example walkthrough

For f(x) = 2x⁴ + 4x³ − x² − 6x − 3, Descartes' Rule guarantees exactly 1 positive real zero and 3 or 1 negative real zeros.
Test positive candidates: 1/2 and 1 are not zeros; 3/2 is not a zero but is an upper bound (all positive in the bottom row), so stop testing positive rationals.
Conclusion: the positive zero is irrational, between 1 and 3/2 (by the Intermediate Value Theorem, since f(1) < 0 and f(3/2) > 0).
Test negative candidates: −1/2 is not a zero; −1 is a zero; dividing again by −1 succeeds; quotient is 2x² − 3, giving x = ±√6/2.
Multiplicities: Descartes' Rule says the positive zero has multiplicity 1 and the total multiplicity of negative zeros is 3; since −1 worked twice and −√6/2 is the third negative zero, −1 has multiplicity 2 and −√6/2 has multiplicity 1.

🔁 When theorems are not enough: Bisection Method

Some polynomials have real zeros that cannot be found exactly with these techniques (e.g., f(x) = x⁵ − x − 1).
Bisection Method: find an interval [a, b] where f changes sign; test the midpoint c = (a + b)/2; if f(c) has the same sign as f(a), replace a with c; otherwise replace b with c; repeat until the interval is smaller than the desired accuracy.
Example: f(1) = −1 and f(2) = 29, so a zero lies in [1, 2]; f(1.5) ≈ 5.09, so the zero is in [1, 1.5]; f(1.25) ≈ 0.80, so the zero is in [1, 1.25]; continue until the interval is sufficiently narrow.
Why it works: the Intermediate Value Theorem guarantees a zero in any interval where f changes sign; bisection systematically narrows the interval.

🧷 Applications: Equations and inequalities

🔧 Solving polynomial equations

Goal: find all real solutions to an equation like 2x⁵ + 6x³ + 3 = 3x⁴ + 8x².
Method: rewrite as 2x⁵ − 3x⁴ + 6x³ − 8x² + 3 = 0; find the real zeros of p(x) = 2x⁵ − 3x⁴ + 6x³ − 8x² + 3 using the techniques above.
Example: synthetic division yields x = −1/2 and x = 1; the quotient is 2x² + 6, which has no real zeros; solutions are x = −1/2 and x = 1.

📉 Solving polynomial inequalities

Goal: solve an inequality like 2x⁵ + 6x³ + 3 ≤ 3x⁴ + 8x².
Method: rewrite as p(x) ≤ 0; find the real zeros of p(x); construct a sign diagram by testing values in each interval between zeros; identify where p(x) < 0 and where p(x) = 0.
Example: zeros are x = −1/2 and x = 1; test values show p(x) < 0 on (−∞, −1/2); solution to p(x) ≤ 0 is (−∞, −1/2] ∪ {1}.
Graphical interpretation: the solution is the set of x where the graph of f(x) = 2x⁵ + 6x³ + 3 is below or touching the graph of g(x) = 3x⁴ + 8x².

💼 Applied example: Profit model

Problem: Profit P(x) = −5x³ + 35x² − 45x − 25 (in thousands of dollars) from producing x hundred TVs, 0 ≤ x ≤ 10.07. How many TVs to make a profit?
Method: solve P(x) > 0; factor out −5 to get −5(x³ − 7x² + 9x + 5) > 0, so focus on f(x) = x³ − 7x² + 9x + 5.
Find zeros: x = 5 is rational; factor as (x − 5)(x² − 2x − 1); quadratic formula gives x = 1 ± √2.
Only x = 1 + √2 ≈ 2.414 and x = 5 lie in [0, 10.07]; sign diagram shows P(x) > 0 on (1 + √2, 5).
Interpretation: x is in hundreds, so 1 + √2 ≈ 2.414 hundred = 241.4 TVs; since we can't produce a fraction, test P(2.41) < 0 and P(2.42) > 0, so minimum is 242 TVs. At x = 5 (500 TVs), take the next smaller integer, 499 TVs, to ensure profit. Answer: produce at least 242 but no more than 499 TVs.

🛠️ Special techniques

🔄 u-substitution for "quadratic in disguise"

When to use: the polynomial has exactly three terms, and the exponent of the first term is exactly twice that of the second.
Example: x⁴ + x² − 12 = 0 can be rewritten as (x²)² + (x²) − 12 = 0; let u = x², so u² + u − 12 = 0; factor as (u + 4)(u − 3) = 0; u = −4 (no real x) or u = 3 (x = ±√3).
Don't confuse: this only works when the exponent relationship is exact (e.g., x⁵ and x terms do not fit this pattern).

🔢 Handling non-integer coefficients

Problem: Rational Zeros Theorem requires integer coefficients.
Solution: if f(x) has fractional coefficients, multiply by a common denominator N to get q(x) = N · f(x) with integer coefficients; f and q have the same real zeros (since f(x) = 0 if and only if q(x) = 0).
Example: f(x) = x³ − (1/12)x² − (7/72)x + 1/72; multiply by 72 to get q(x) = 72x³ − 6x² − 7x + 1; find zeros of q, which are also zeros of f.

🎯 Multiplicity determination

From synthetic division: if a candidate c works multiple times in a row, its multiplicity is at least the number of successful divisions.
From Descartes' Rule: the total multiplicity of positive (or negative) zeros is constrained by the rule; use this to confirm exact multiplicities.
From the Factor Theorem and degree: if f is degree n and you have found k distinct zeros with known multiplicities summing to m < n, the remaining zeros must account for n − m multiplicities.
Example: f(x) = 2x⁴ + 4x³ − x² − 6x − 3 has degree 4; −1 divides successfully twice, and the quotient yields two more zeros ±√6/2; Descartes' Rule says the positive zero has multiplicity 1 and negative zeros have total multiplicity 3, so −1 has multiplicity exactly 2 and −√6/2 has multiplicity 1.

🚧 Limitations and fallback strategies

❌ When no technique works exactly

Some polynomials have real zeros that are not rational and cannot be expressed using radicals (e.g., x⁵ − x − 1).
Fallback 1 (with calculator): use the "Zero" command to approximate.
Fallback 2 (without calculator): use the Bisection Method to approximate to any desired accuracy.

🔍 Recognizing irrational zeros

If all rational candidates fail but the graph (or Descartes' Rule) guarantees a real zero exists, the zero must be irrational.
The Intermediate Value Theorem can locate the zero between two test values where f changes sign.

📐 Comparison of the two approaches

Aspect	With calculator	Without calculator
Speed	Faster for complex polynomials	Slower; more computation
Theorems needed	Cauchy's Bound, Rational Zeros Theorem	All of the above plus Descartes' Rule, Upper/Lower Bounds
Accuracy	Approximations from graph; exact via algebra	Exact when possible; Bisection for approximation
Insight	Graph reveals behavior (crossings, turning points)	Sign diagram and multiplicity analysis reveal behavior
When it fails	Still need algebra for exact zeros	Still need Bisection or admit defeat

Common confusion: The calculator approach is not "less mathematical"—it still requires the same algebraic techniques (synthetic division, factoring, quadratic formula) to confirm zeros and find multiplicities; the graph only helps narrow the candidate list and visualize behavior.

Complex Zeros and the Fundamental Theorem of Algebra

3.4 Complex Zeros and the Fundamental Theorem of Algebra

🧭 Overview

🧠 One-sentence thesis

Every polynomial of degree n ≥ 1 has exactly n complex zeros (counting multiplicity), which means polynomials can always be completely factored over the complex numbers even when they have no real zeros.

📌 Key points (3–5)

The imaginary unit i: defined by i² = −1, allowing us to find zeros of polynomials like x² + 1 = 0.
Complex numbers include real numbers: every number of the form a + bi (where a, b are real) is complex, including real numbers (when b = 0).
Fundamental Theorem of Algebra: guarantees at least one complex zero for any polynomial of degree ≥ 1 with complex coefficients.
Conjugate Pairs Theorem: for polynomials with real coefficients, nonreal zeros always come in conjugate pairs (a + bi and a − bi).
Common confusion: don't apply general radical rules to even roots of negative numbers; √−c = i√c only when c ≥ 0.

🔢 The imaginary unit and complex numbers

🔢 What is i?

The imaginary unit i satisfies: (1) i² = −1, and (2) if c is a real number with c ≥ 0, then √−c = i√c.

The letter i represents a square root of −1, solving equations like x² = −1.
Property 1 establishes that i behaves as a square root of −1.
Property 2 defines how to take square roots of negative real numbers.
Warning: The restriction c ≥ 0 in property 2 is critical; √−(−4) ≠ i√−4, otherwise contradictions arise.

🔢 Complex number definition

A complex number is a number of the form a + bi, where a and b are real numbers and i is the imaginary unit.

Examples: 3 + 2i, (2/5) − i√3, 3i, and 6 are all complex numbers.
Real numbers are complex numbers where b = 0 (e.g., 6 = 6 + 0i).
The complex numbers include all real numbers as a subset.

🔢 Arithmetic with complex numbers

Treat i like any radical expression, but remember i² = −1:

Addition/subtraction: combine like terms, e.g., (1 − 2i) − (3 + 4i) = −2 − 6i.
Multiplication: use distributive property, then simplify using i² = −1.
- Example: (1 − 2i)(3 + 4i) = 3 + 4i − 6i − 8i² = 3 − 2i + 8 = 11 − 2i.
Division: multiply numerator and denominator by the conjugate of the denominator.
Don't confuse: √−3 · √−12 = (i√3)(i√12) = −6, but √(−3)(−12) = √36 = 6 (different results!).

🔄 Complex conjugates

🔄 Definition and notation

The conjugate of a complex number a + bi is a − bi, denoted with a bar: a + bi = a − bi.

Examples: 3 + 2i = 3 − 2i; 6 = 6; 4i = −4i.
Conjugation changes the sign of the imaginary part only.

🔄 Properties of conjugates

Property	Statement	Meaning
Double conjugate	z̄̄ = z	Conjugating twice returns the original
Sum	z + w = z̄ + w̄	Conjugate of a sum = sum of conjugates
Product	zw = z̄ · w̄	Conjugate of a product = product of conjugates
Power	(z)ⁿ = z̄ⁿ	Conjugate works well with powers
Real characterization	z is real ⟺ z = z̄	A number equals its conjugate iff it's real

These properties show conjugation "plays nicely" with arithmetic operations.
Proof strategy: write z = a + bi and w = c + di, then verify both sides are equal.

🎯 The Fundamental Theorem of Algebra

🎯 The main theorem

Fundamental Theorem of Algebra: If f is a polynomial function with complex coefficients of degree n ≥ 1, then f has at least one complex zero.

This is an "existence" theorem—it guarantees a zero exists but doesn't tell you how to find it.
Like the Intermediate Value Theorem, it assures existence without providing an algorithm.
Applies to polynomials with any complex coefficients, not just real ones.

🎯 Complete factorization over complex numbers

Complex Factorization Theorem: A polynomial f of degree n ≥ 1 has exactly n complex zeros (counting multiplicity), and can be factored as f(x) = a(x − z₁)^(m₁)(x − z₂)^(m₂)···(x − z_k)^(m_k), where a is the leading coefficient.

Every polynomial of degree n has exactly n zeros when multiplicities are counted.
The value a is the leading coefficient of f(x).
A polynomial is completely determined by its zeros, their multiplicities, and its leading coefficient.
Example: f(x) = 12x⁵ − 20x⁴ + 19x³ − 6x² − 2x + 1 has zeros at x = 1/2 (multiplicity 2), x = −1/3, and x = (1 ± i√3)/2, giving the complete factorization.

🔗 Conjugate pairs and real polynomials

🔗 Conjugate Pairs Theorem

Conjugate Pairs Theorem: If f is a polynomial with real coefficients and z is a zero of f, then z̄ is also a zero of f.

Nonreal zeros of real-coefficient polynomials come in conjugate pairs.
If a + bi is a zero (with b ≠ 0), then a − bi is also a zero.
Proof: Use the conjugate properties to show f(z̄) = f̄(z) = 0̄ = 0.

🔗 Factoring over the real numbers

Real Factorization Theorem: A polynomial f with real coefficients can be factored into linear factors (for real zeros) and irreducible quadratic factors (for nonreal zeros).

An irreducible quadratic is a quadratic with no real zeros, impossible to factor further using real numbers.
When nonreal zeros a ± bi occur as conjugate pairs, they produce the irreducible quadratic factor (x − [a + bi])(x − [a − bi]) = x² − 2ax + (a² + b²).
Example: f(x) = x⁴ + 64 factors completely over complex numbers but over reals becomes (x² − 4x + 8)(x² + 4x + 8).
Don't confuse: "factored completely over complex numbers" means using all complex zeros; "factored over real numbers" stops at irreducible quadratics.

🛠️ Finding and using complex zeros

🛠️ Techniques for finding zeros

Use synthetic division with complex numbers just as with real numbers.
The Factor Theorem and Remainder Theorem work for complex zeros too.
For quadratics, use the quadratic formula even when the discriminant is negative.
When one nonreal zero is found for a real-coefficient polynomial, its conjugate is automatically also a zero.

🛠️ Building polynomials from zeros

To construct a polynomial with given properties:

Identify all zeros and their multiplicities from the given information.
If a graph "touches but doesn't cross" at x = c, then c has even multiplicity (use 2 for lowest degree).
For real-coefficient polynomials, include conjugate pairs for any nonreal zeros.
Apply the Complex Factorization Theorem: f(x) = a(x − z₁)^(m₁)···(x − z_k)^(m_k).
Determine the leading coefficient a from end behavior or other constraints.

Example: If the graph touches at (1/3, 0), x = 3i is a zero, and both ends go to −∞, then f(x) must include (x − 1/3)² (multiplicity 2), (x − 3i)(x + 3i) as factors, and a < 0 for the correct end behavior.

Introduction to Rational Functions

4.1 Introduction to Rational Functions

🧭 Overview

🧠 One-sentence thesis

Rational functions—ratios of polynomials—exhibit distinctive behaviors such as vertical asymptotes (where the function becomes unbounded), horizontal asymptotes (where the function levels off at infinity), and slant asymptotes (where the function approaches a non-horizontal line), all determined by the degrees and zeros of the numerator and denominator.

📌 Key points (3–5)

What rational functions are: functions formed by dividing one polynomial by another, written as r(x) = p(x)/q(x).
Domain restrictions: any value that makes the denominator zero is excluded from the domain.
Three types of asymptotes: vertical (unbounded behavior at excluded domain values), horizontal (leveling-off behavior as x → ±∞), and slant (approaching a non-horizontal line as x → ±∞).
Common confusion—asymptote vs. hole: both occur at zeros of the denominator, but if a factor cancels between numerator and denominator, you get a hole (a missing point); if it doesn't cancel, you get a vertical asymptote (unbounded behavior).
Why degrees matter: comparing the degrees of numerator and denominator tells you whether horizontal or slant asymptotes exist and how to find them.

📐 Definition and domain

📐 What is a rational function

Rational function: a function that is the ratio of polynomial functions; r(x) = p(x)/q(x), where p and q are polynomials.

All polynomial functions are also rational functions (take q(x) = 1).
The key feature: division of polynomials, which introduces new behaviors not seen in polynomials alone.

🚫 Finding the domain

Rule: exclude any x-value that makes the denominator zero.
Method: set q(x) = 0 and solve; remove those solutions from the real numbers.
Example: for f(x) = (2x − 1)/(x + 1), set x + 1 = 0 → x = −1 is excluded → domain is (−∞, −1) ∪ (−1, ∞).

✏️ Simplifying rational expressions

Write in the form p(x)/q(x) by finding common denominators, combining terms, and factoring.
Cancel common factors only after factoring completely.
Important: even if two expressions simplify to the same formula, they are different functions if their domains differ.
Example: f(x) = (2x − 1)/(x + 1) and h(x) = (2x² − 1)/(x² − 1) − (3x − 2)/(x² − 1) both simplify to (2x − 1)/(x + 1), but h has x = 1 excluded from its domain, so f ≠ h.

📏 Vertical asymptotes and holes

📏 What is a vertical asymptote

Vertical asymptote: the line x = c is a vertical asymptote of y = f(x) if, as x approaches c from the left or right, f(x) → ∞ or f(x) → −∞.

Graphically: the graph "breaks" and shoots up or down near x = c, resembling the vertical line x = c.
Notation: "as x → c⁻, f(x) → ∞" means approaching c from the left, function values grow without bound upward; "as x → c⁺, f(x) → −∞" means approaching from the right, values grow without bound downward.

🕳️ What is a hole

A hole is a single missing point on the graph, marked with an open circle.
Occurs when a factor in the denominator cancels with a factor in the numerator.
The function is undefined at that x-value, but the graph approaches a finite y-value from both sides.
Example: h(x) = (2x² − 1)/(x² − 1) − (3x − 2)/(x² − 1) simplifies to (2x − 1)/(x + 1) after canceling (x − 1); x = 1 is excluded, but as x → 1, h(x) → 0.5, so there is a hole at (1, 0.5).

🔍 How to distinguish asymptotes from holes (Theorem 4.1)

Theorem 4.1 (Location of Vertical Asymptotes and Holes):

Suppose r(x) = p(x)/q(x) is in lowest terms (p and q have no common zeros). Let c be a real number not in the domain of r.

Condition	Result
q(c) ≠ 0	Hole at (c, p(c)/q(c))
q(c) = 0	Vertical asymptote at x = c

In plain language: if c is excluded from the domain but no longer makes the denominator zero after simplification, it's a hole; otherwise, it's a vertical asymptote.
Why it works: canceling a factor removes the "trouble" it caused; if the factor remains in the denominator, the trouble (unbounded behavior) persists.

🧪 Example: identifying asymptotes and holes

f(x) = 2x/(x² − 3): denominator zeros at x = ±√3; already in lowest terms → vertical asymptotes at x = −√3 and x = √3.
g(x) = (x² − x − 6)/(x² − 9): factors to ((x − 3)(x + 2))/((x − 3)(x + 3)); (x − 3) cancels → hole at x = 3, vertical asymptote at x = −3.
h(x) = (x² − x − 6)/(x² + 9): denominator has no real zeros → no vertical asymptotes, no holes; domain is all real numbers.
r(x) = (x² − x − 6)/(x² + 4x + 4): factors to ((x − 3)(x + 2))/(x + 2)²; (x + 2) cancels once but remains in denominator → vertical asymptote at x = −2 (not a hole, because the factor still appears).

🌅 Horizontal asymptotes

🌅 What is a horizontal asymptote

Horizontal asymptote: the line y = c is a horizontal asymptote of y = f(x) if, as x → −∞ or x → ∞, f(x) → c.

Describes end behavior: what the function does as x becomes very large (positive or negative).
Graphically: the graph "levels off" and approaches the horizontal line y = c.
A rational function can have at most one horizontal asymptote.

📊 How to find horizontal asymptotes (Theorem 4.2)

Theorem 4.2 (Location of Horizontal Asymptotes):

Suppose r(x) = p(x)/q(x), where p has degree n with leading coefficient a, and q has degree m with leading coefficient b.

Degree comparison	Horizontal asymptote
n = m (same degree)	y = a/b
n < m (numerator smaller)	y = 0
n > m (numerator larger)	None

Why it works: as x → ±∞, the highest-degree terms dominate; the ratio of leading terms determines the limit.
Example: f(x) = (2x − 1)/(x + 1) has degrees 1 and 1 → y = 2/1 = 2.
Example: f(x) = 5x/(x² + 1) has degrees 1 and 2 → y = 0.
Example: g(x) = (x² − 4)/(x + 1) has degrees 2 and 1 → no horizontal asymptote.

🧪 Example: identifying horizontal asymptotes

f(x) = 5x/(x² + 1): degree 1 over degree 2 → y = 0; as x → −∞, f(x) → 0⁻; as x → ∞, f(x) → 0⁺.
g(x) = (x² − 4)/(x + 1): degree 2 over degree 1 → no horizontal asymptote.
h(x) = (6x³ − 3x + 1)/(5 − 2x³): degree 3 over degree 3 → y = 6/(−2) = −3; as x → −∞, h(x) → −3⁺; as x → ∞, h(x) → −3⁻.

🌍 Real-world interpretation

Example: N(t) = 500 − 450/(1 + 3t) models the number of students who have had the flu t months after the semester begins.

As t → ∞, the term 450/(1 + 3t) → 0, so N(t) → 500.
Interpretation: over time, the total number of students who will have had the flu approaches 500 (but never exceeds it).

🎢 Slant asymptotes

🎢 What is a slant asymptote

Slant asymptote (or oblique asymptote): the line y = mx + b (where m ≠ 0) is a slant asymptote of y = f(x) if, as x → −∞ or x → ∞, f(x) → mx + b.

The graph resembles a non-horizontal line as x becomes unbounded.
Occurs when the function's end behavior is linear but not constant.
Notation: "as x → ±∞, f(x) → mx + b" means the difference f(x) − (mx + b) → 0.

📐 How to find slant asymptotes (Theorem 4.3)

Theorem 4.3 (Determination of Slant Asymptotes):

Suppose r(x) = p(x)/q(x), where the degree of p is exactly one more than the degree of q. Then the graph has a slant asymptote y = L(x), where L(x) is the quotient obtained by dividing p(x) by q(x).

Method: perform polynomial long division; the quotient (ignoring the remainder) is the slant asymptote.
Example: g(x) = (x² − 4)/(x + 1); divide x² − 4 by x + 1 → quotient x − 1 → slant asymptote y = x − 1.
Why it works: as x → ±∞, the remainder term (which has a denominator) → 0, leaving only the quotient.

🧪 Example: identifying slant asymptotes

f(x) = (x² − 4x + 2)/(1 − x): degree 2 over degree 1 → slant asymptote exists; divide to get quotient −x + 3 → y = −x + 3.
g(x) = (x² − 4)/(x − 2): simplifies to x + 2 (with a hole at x = 2) → the slant asymptote y = x + 2 is identical to the graph except at the hole.
h(x) = (x³ + 1)/(x² − 4): degree 3 over degree 2 → slant asymptote exists; divide to get quotient x → y = x.

⚠️ Don't confuse

Horizontal vs. slant: if the degree of the numerator equals the degree of the denominator, you get a horizontal asymptote; if the numerator's degree is exactly one more, you get a slant asymptote; if the numerator's degree is two or more greater, there is no asymptote (the function grows without bound).

🔬 Analyzing behavior near asymptotes

🔬 Numerical approach

Make tables of values approaching the asymptote from the left (x → c⁻) and from the right (x → c⁺).
Observe whether f(x) → ∞, f(x) → −∞, or f(x) → finite value.
Example: for f(x) = (2x − 1)/(x + 1) near x = −1:
- As x → −1⁻ (e.g., x = −1.1, −1.01, −1.001), f(x) → ∞.
- As x → −1⁺ (e.g., x = −0.9, −0.99, −0.999), f(x) → −∞.

📈 Graphical approach

Use a graphing calculator to visualize the function.
Vertical asymptotes appear as "breaks" where the graph shoots up or down.
Horizontal and slant asymptotes appear as lines the graph approaches but may cross finitely many times.
Holes may not be visible on a calculator but can be confirmed numerically.

🌐 Real-world context

Example: P(t) = 100/(5 − t)² models bacteria population (in thousands) for 0 ≤ t < 5.

As t → 5⁻, P(t) → ∞.
Interpretation: the population grows without bound as t approaches 5 days; this is called a "doomsday" scenario because no environment can support infinitely many bacteria—the model predicts collapse shortly before t = 5.

🧮 Summary of asymptote rules

Asymptote type	When it occurs	How to find it
Vertical	Denominator zero after simplification	Set q(x) = 0 in lowest terms; solve for x
Horizontal	Degree of numerator ≤ degree of denominator	If degrees equal: y = (leading coeff. of p)/(leading coeff. of q); if numerator smaller: y = 0
Slant	Degree of numerator exactly one more than denominator	Divide p(x) by q(x); quotient is the asymptote
Hole	Factor cancels between numerator and denominator	Find x where canceled factor = 0; evaluate simplified function at that x for y-coordinate

🔑 Key insight

Vertical asymptotes and holes are determined by the zeros of the denominator and whether they cancel.
Horizontal and slant asymptotes are determined by the degrees of numerator and denominator.
A rational function can have multiple vertical asymptotes and holes, but at most one horizontal or slant asymptote (never both).

Graphs of Rational Functions

4.2 Graphs of Rational Functions

🧭 Overview

🧠 One-sentence thesis

Graphing rational functions requires a systematic six-step procedure that combines domain analysis, asymptote detection, sign diagrams, and end-behavior reasoning to reveal how these functions behave near excluded values and at infinity.

📌 Key points (3–5)

Sign diagrams for rational functions must mark both zeros (with "0") and domain exclusions (with "⚛") because rational functions can change sign without crossing the x-axis at vertical asymptotes or holes.
Vertical asymptotes vs. holes: if a factor cancels during reduction, you get a hole; if it remains in the denominator, you get a vertical asymptote.
End behavior analysis uses "number sense" (imagining very large or very small values) to determine whether the graph approaches asymptotes from above or below.
Common confusion: graphs can cross horizontal or slant asymptotes in the middle of the domain—asymptotes only describe behavior as x → ±∞.
The six-step procedure systematically finds domain, reduces the function, locates intercepts, identifies asymptotes/holes, analyzes end behavior, and constructs a sign diagram.

🔍 Why rational functions need special sign diagrams

🔍 The continuity gap

Rational functions are continuous on their domains, but vertical asymptotes and holes occur at values excluded from the domain.

The Intermediate Value Theorem (which says a continuous function can't change sign without crossing the x-axis) applies only where the function is continuous.
At excluded values (vertical asymptotes or holes), the function is not continuous, so it can "jump" from positive to negative without crossing zero.
Example: A graph might be above the x-axis on one side of a vertical asymptote and below it on the other side, even though it never touches the axis at that point.

🔍 Marking the sign diagram

When constructing a sign diagram for a rational function:

Place "0" above each zero (where the numerator equals zero and the value is in the domain).
Place "⚛" (interrobang) above each excluded value (vertical asymptotes and holes).
The interrobang conveys "surprise, caution, and wonderment"—a reminder that the function behaves unusually at these points.
Choose test values in each interval created by these marked points to determine the sign.

Don't confuse: zeros vs. excluded values—zeros are in the domain and make the function equal zero; excluded values are not in the domain and create asymptotes or holes.

📋 The six-step graphing procedure

📋 Step 1: Find the domain

Set the denominator equal to zero and solve.
The domain excludes these values.
Express the domain as a union of intervals.

📋 Step 2: Reduce to lowest terms

Factor both numerator and denominator completely.
Cancel any common factors.
Important: note which factors cancel—these create holes, not vertical asymptotes.
Write the reduced form with the restriction "x ≠ [canceled value]" to remember the hole location.

📋 Step 3: Find intercepts

x-intercepts: set the reduced numerator equal to zero; only include solutions in the domain.
y-intercept: substitute x = 0 (if 0 is in the domain).

📋 Step 4: Locate vertical asymptotes and holes

Vertical asymptotes occur at values that make the reduced denominator zero.
Holes occur at values that were canceled during reduction; find the y-coordinate by substituting into the reduced formula.
For each vertical asymptote, analyze behavior on both sides (see next section).

📋 Step 5: Analyze end behavior

Use theorems about horizontal/slant asymptotes based on degree comparison.
Perform long division if needed to rewrite the function as quotient + remainder/divisor.
Use "number sense" to determine whether the graph approaches the asymptote from above or below (see next section).

📋 Step 6: Construct sign diagram and sketch

Mark zeros and excluded values on a number line.
Test intervals to find where the function is positive or negative.
Combine all information to sketch the graph.

🧮 "Number sense" for asymptote behavior

🧮 Near vertical asymptotes

To determine how the graph behaves as x approaches a vertical asymptote:

Use the factored form of the function.
Imagine substituting a value very close to the asymptote (e.g., x = -2.000001 if the asymptote is at x = -2).
Estimate each factor:
- Factors far from the asymptote → approximate their actual value
- The factor causing the asymptote → "very small (±)" depending on which side you approach from
Combine to get "very big (±)" which tells you whether f(x) → ∞ or -∞.

Example: For f(x) = 3x/[(x-2)(x+2)] as x → -2⁻:

3x ≈ -6
(x-2) ≈ -4
(x+2) ≈ very small (-)
Result: (-6)/[(-4)·very small (-)] = (positive)/(very small (-)) = very big (-), so f(x) → -∞

🧮 End behavior (as x → ±∞)

To determine behavior at infinity:

Imagine substituting a very large number (e.g., x = 1 billion).
In polynomials, the highest-degree term dominates, so lower-degree terms become insignificant.
For the remainder term after long division, estimate whether it's "very small (±)".
This tells you whether the graph is slightly above or below the asymptote.

Example: If f(x) = 2 - 3/(x+2) and x → ∞:

3/(x+2) ≈ 3/(very large) ≈ very small (+)
So f(x) ≈ 2 - very small (+), meaning the graph is slightly below y = 2

Don't confuse: "very small" means close to zero in absolute value; "very big" means large in absolute value. The sign (±) indicates positive or negative.

🎯 Key insights from examples

🎯 Graphs can cross horizontal asymptotes

Horizontal asymptotes describe behavior only as x → ±∞.
In the middle of the domain, the graph may cross the asymptote.
To find crossing points: set the function equal to the asymptote value and solve.
Example: If f(x) = quotient + remainder/divisor, the graph crosses y = quotient wherever remainder = 0.

🎯 Holes appear as missing points

A hole occurs at (a, b) where x = a was canceled during reduction.
To find b: substitute x = a into the reduced formula.
The graph looks continuous except for one missing point.
Near a hole, the function approaches the hole's y-value from both sides.

🎯 Symmetry can simplify graphing

Check if f(-x) = f(x) (even function, y-axis symmetry) or f(-x) = -f(x) (odd function, origin symmetry).
Symmetry is not part of the standard procedure but can reduce work.
Example: f(x) = 3x/(x²-4) is odd, so the graph has origin symmetry.

🎯 When standard procedures fall short

For functions where the numerator's degree exceeds the denominator's by more than one, neither horizontal nor slant asymptote theorems apply.
The graph may approach a parabola or higher-degree polynomial.
Calculators become necessary to find relative extrema and detailed behavior.
Example: r(x) = (x⁴+1)/(x²+1) approaches y = x²-1 as x → ±∞.

⚠️ Common pitfalls and clarifications

⚠️ Reducing too hastily

Always note which factors cancel—they indicate holes, not just algebraic simplification.
Write the reduced form with explicit restrictions (e.g., "x ≠ -1") to remember excluded values.

⚠️ Forgetting to check both sides of asymptotes

A vertical asymptote has two sides: x → a⁻ (from the left) and x → a⁺ (from the right).
The function may approach +∞ on one side and -∞ on the other.
Always analyze both directions.

⚠️ Misinterpreting "very small" arithmetic

"very small (-)" divided by "very small (-)" is NOT necessarily 1—it depends on which is smaller.
In f(x) = numerator/denominator, if the denominator shrinks faster, the quotient grows ("very big").

⚠️ Assuming graphs never cross asymptotes

Myth: Graphs can't cross horizontal or slant asymptotes.
Reality: Graphs can cross these asymptotes in the middle of the domain; asymptotes only constrain behavior at infinity.

Rational Inequalities and Applications

4.3 Rational Inequalities and Applications

🧭 Overview

🧠 One-sentence thesis

Rational inequalities require sign-diagram techniques rather than clearing denominators, and rational functions model real-world rate problems and variation relationships where quantities depend inversely or jointly on one another.

📌 Key points (3–5)

Equation vs inequality: clearing denominators works for rational equations (then check for extraneous solutions), but rational inequalities require collecting terms on one side and using a sign diagram to avoid sign-reversal errors.
Rate problems: rates are additive (e.g., downstream speed = canoe speed + river speed), and the core relationship is distance = rate × time (or work = rate × time).
Variation language: "varies directly" means y = kx; "varies inversely" means y = k/x; "varies jointly" means z = kxy; the constant k is the constant of proportionality.
Common confusion: when solving inequalities, do NOT multiply both sides by a variable expression (it may be positive or negative); instead, move everything to one side and analyze the sign of the rational function.
Applied domain: in word problems, the mathematical domain must be restricted to physically meaningful values (e.g., positive lengths, whole numbers of items).

🔀 Solving rational equations vs inequalities

🟰 Rational equations: clear denominators

Method: multiply both sides by the least common denominator to eliminate fractions, then solve the resulting polynomial equation.
Why it works: multiplying both sides of an equation by a nonzero quantity preserves equality.
Critical step: check all solutions in the original equation to discard extraneous solutions (values that make any denominator zero).

Example: To solve (x³ − 2x + 1)/(x − 1) = (1/2)x − 1, multiply both sides by 2(x − 1) to get 2x³ − 4x + 2 = x² − 3x + 2, which simplifies to 2x³ − x² − x = 0. Factoring gives x(2x + 1)(x − 1) = 0, so x = −1/2, 0, 1. Since x = 1 makes the original denominator zero, discard it. Solutions: x = −1/2 and x = 0.

🔺 Rational inequalities: use a sign diagram

Method: collect all terms on one side with 0 on the other, then treat the left side as a rational function r(x) and construct a sign diagram.
Why: multiplying both sides by a variable expression is dangerous because the expression may be positive (preserving the inequality) or negative (reversing it).
Steps:
1. Move all terms to one side: r(x) ≥ 0 (or <, >, ≤).
2. Find the domain (exclude values that make the denominator zero).
3. Find zeros of r (numerator = 0, but not excluded from domain).
4. Test intervals between zeros and excluded points.
5. Select intervals where r(x) has the correct sign; include zeros if the inequality is ≥ or ≤.

Example: To solve (x³ − 2x + 1)/(x − 1) ≥ (1/2)x − 1, rewrite as (2x³ − x² − x)/(2x − 2) ≥ 0. The denominator 2x − 2 = 0 at x = 1 (excluded). The numerator 2x³ − x² − x = x(2x + 1)(x − 1) = 0 at x = −1/2, 0, 1. Since x = 1 is excluded, the zeros are x = −1/2 and x = 0. Testing intervals (−∞, −1/2), (−1/2, 0), (0, 1), (1, ∞) gives (+), (−), (+), (+). Solution: (−∞, −1/2] ∪ [0, 1) ∪ (1, ∞).

📊 Graphical check

Interpretation: if f(x) ≥ g(x), find where the graph of y = f(x) is above or touching the graph of y = g(x).
Caution: calculators may not show holes (removable discontinuities) or vertical asymptotes clearly; always verify algebraically.

🚣 Rate problems: distance, work, and time

🚣 Distance-rate-time problems

Core relationship: distance = rate × time

Additive rates: when traveling downstream, the effective speed is canoe speed + river speed; upstream, it is canoe speed − river speed.
Setup strategy:
1. Break the trip into parts (e.g., downstream and upstream).
2. Write distance = rate × time for each part.
3. Use given totals (e.g., total time) to relate the parts.
4. Solve the resulting system of equations.

Example: Carl canoes 5 miles downstream and 5 miles upstream in 3 hours total. His still-water speed is 6 mph; find the river speed R. Downstream: 5 = (6 + R)·t_down, so t_down = 5/(6 + R). Upstream: 5 = (6 − R)·t_up, so t_up = 5/(6 − R). Total time: t_down + t_up = 3, so 5/(6 + R) + 5/(6 − R) = 3. Clearing denominators: 5(6 − R) + 5(6 + R) = 3(6 + R)(6 − R), which simplifies to R² = 16. Since R > 0, R = 4 mph.

🛠️ Work-rate-time problems

Core relationship: work done = rate of work × time working

Rate of work: if Taylor weeds the garden alone in 4 hours, Taylor's rate is 1 garden / 4 hours = 1/4 garden per hour.
Additive rates: when working together, the combined rate is the sum of individual rates.

Example: Taylor weeds the garden alone in 4 hours; together with Carl, they finish in 3 hours. Find Carl's time alone. Taylor's rate: 1/4 garden/hour. Combined rate: 1/3 garden/hour. Carl's rate R = 1/3 − 1/4 = 1/12 garden/hour. Carl's time alone: 1 garden / (1/12 garden/hour) = 12 hours.

📏 Tracking units

Why it matters: units help check if an equation makes sense (e.g., both sides should have the same units).
Don't confuse: rates are not the same as totals; rate × time = total.

📦 Applied optimization with rational functions

📦 Average cost function

Average cost function: C̄(x) = C(x)/x, where C(x) is the total cost to produce x items.

Interpretation: C̄(x) is the cost per item when x items are produced.
Solving inequalities: to find when C̄(x) < 100, solve the rational inequality and interpret in context (e.g., produce more than 7.5 systems → at least 8 systems).

Example: If C(x) = 80x + 150, then C̄(x) = (80x + 150)/x. Solve C̄(x) < 100: (80x + 150)/x < 100 → (150 − 20x)/x < 0. Sign diagram on (0, ∞): zero at x = 7.5, positive on (0, 7.5), negative on (7.5, ∞). Solution: (7.5, ∞), or [8, ∞) in context.

📈 Horizontal asymptote interpretation

Long-run behavior: as x → ∞, C̄(x) = 80 + 150/x → 80⁺. The average cost approaches $80 per item but never reaches it.
Meaning: $80 is the variable cost per item; the fixed cost ($150) is "spread out" over more items as production increases.

📦 Box optimization

Volume constraint: if volume = 1000 cm³ and the base is square (width x, depth x), then height h = 1000/x².
Surface area: for a box with no top, S = x² + 4xh. Substitute h to get S(x) = x² + 4000/x.
Minimize S(x): use a calculator to find the minimum; interpret the result (e.g., width ≈ 12.60 cm, height ≈ 6.30 cm).

⚠️ Applied domain

Physical constraints: x > 0 (positive dimensions), and sometimes x must be a whole number (e.g., number of items produced).
Don't confuse: the mathematical domain of a function may be larger than the applied domain.

🔗 Variation: direct, inverse, and joint

🔗 Direct variation

y varies directly with x (or is directly proportional to x): y = kx for some constant k.

Interpretation: y increases when x increases (if k > 0).
Example: Hooke's Law: force F on a spring varies directly with extension x, so F = kx.

🔄 Inverse variation

y varies inversely with x (or is inversely proportional to x): y = k/x for some constant k.

Interpretation: y decreases when x increases (if k > 0).
Example: Boyle's Law: at constant temperature, pressure P of a gas varies inversely with volume V, so P = k/V.

🔗 Joint variation

z varies jointly with x and y (or is jointly proportional to x and y): z = kxy for some constant k.

Interpretation: z depends on the product of x and y.
Example: Volume V of a cone varies jointly with height h and the square of radius r, so V = khr².

🔗 Combined variation

Direct and inverse together: if I varies directly with V and inversely with R, then I = kV/R.
Example: Ohm's Law: current I = kV/R. Newton's Law of Universal Gravitation: F = kmM/r².

📊 Finding the constant of proportionality

Method: use given data to solve for k. If y = k/x and you know one (x, y) pair, then k = xy.
Verification: check that k is approximately constant across all data points; use regression (e.g., power regression y = ax^b) to confirm the relationship.

Example: Boyle's data suggests P = k/V. Multiplying each P and V pair gives k ≈ 1400. Power regression confirms y = 1400x^(−1), so P = 1400/V.

Function Composition

5.1 Function Composition

🧭 Overview

🧠 One-sentence thesis

Function composition creates a new function by feeding the output of one function into another, enabling us to build complex algebraic functions from simpler building blocks like linear and quadratic functions.

📌 Key points (3–5)

What composition is: A two-step process where the output of function f becomes the input to function g, written as (g ∘ f)(x) = g(f(x)).
Order matters: In general, g ∘ f ≠ f ∘ g; composing functions is not commutative (like putting on socks then shoes vs. shoes then socks).
Domain before simplification: Always determine the domain of a composite function before algebraic simplification, since the simplified form may hide restrictions.
Common confusion: Inside vs. outside—f is the "inside" function (applied first), g is the "outside" function (applied second); don't confuse the notation order with the execution order.
Why it matters: Composition allows us to express complicated functions as combinations of simpler ones and to relate quantities through intermediate variables.

🔗 Definition and basic mechanics

🔗 What composition means

Composite of g with f, denoted g ∘ f: defined by the formula (g ∘ f)(x) = g(f(x)), provided x is in the domain of f and f(x) is in the domain of g.

At the formula level: replace every occurrence of x in g(x) with the entire formula for f(x).
At the process level: take an input x → apply f to get f(x) → apply g to that result to get g(f(x)).
The notation "g ∘ f" is read as "g composed with f" or "g of f."

🎯 Inside vs. outside functions

In the expression g(f(x)), function f is called the inside function (executed first).
Function g is called the outside function (executed second).
Example: If F(x) = |3x - 1|, then f(x) = 3x - 1 is inside the absolute value operation g(x) = |x|.

🔄 Two evaluation approaches

The excerpt demonstrates two methods for evaluating composites:

Method	Description	When to use
Inside out	Substitute the formula for f(x) into g first	When you want to see the inner function explicitly
Outside in	Write out g's formula first, then replace its variable with f(x)	When g's structure is clearer to work with first

Both methods yield the same result; choose based on which makes the algebra clearer.

⚠️ Domain determination

⚠️ Why domain comes first

Critical rule: Find the domain of the composite function before simplifying the formula.
The simplified form may hide restrictions that exist in the unsimplified version.
Example from the excerpt: (f ∘ g)(x) simplifies to x - 1, but its domain is [-3, ∞), not all real numbers, because the original form contained √(x + 3).

🔍 How to find the domain

For (g ∘ f)(x):

Look at the formula before any simplification.
Identify all restrictions:
- Values that make f(x) undefined (e.g., division by zero in f).
- Values where f(x) is defined but falls outside g's domain.
- Square roots requiring non-negative arguments.
- Denominators that cannot be zero.
Solve inequalities as needed using sign diagrams.
Express the domain in interval notation.

Example: For (g ∘ f)(x) = 2 - √(x² - 4x + 3), solve x² - 4x + 3 ≥ 0 to get domain (-∞, 1] ∪ [3, ∞).

🔢 Properties and special cases

🔢 Associativity property

Theorem: h ∘ (g ∘ f) = (h ∘ g) ∘ f, provided the composite functions are defined.

When composing three or more functions, the order matters, but grouping doesn't.
This means we can write h ∘ g ∘ f without parentheses.
Example from the excerpt: (h ∘ (g ∘ f))(x) and ((h ∘ g) ∘ f)(x) produce identical formulas and domains.

🆔 Identity function

Identity function: I(x) = x for all real numbers x.

Property: I ∘ f = f ∘ I = f for any function f.
Composing any function with the identity leaves it unchanged.
Analogous to how multiplying by 1 leaves a number unchanged: 1 · x = x · 1 = x.

🔁 Iteration (self-composition)

Composing a function with itself: (f ∘ f)(x) = f(f(x)).
Can be repeated: (g ∘ g)(6) means evaluate g at 6, then evaluate g again at that result.
Real-world analogy: setting a washing machine to "double rinse" performs the same operation twice.

🧩 Decomposition of functions

🧩 Breaking down complex functions

The excerpt shows how to reverse the composition process: take a complicated function and express it as a composition of simpler functions.

Three approaches demonstrated:

Inside/outside perspective: Identify what's "inside" an operation (like absolute value or square root) and what operation is applied "outside."
- Example: F(x) = |3x - 1| → f(x) = 3x - 1 (inside), g(x) = |x| (outside), so F = g ∘ f.
Operational sequence: List the steps performed on an input in order, assign each step a function.
- Example: G(x) = 2/(x² + 1) → first square (f(x) = x²), then add 1 (g(x) = x + 1), then divide into 2 (h(x) = 2/x), so G = h ∘ g ∘ f.
Recognizing simple pieces: Spot a simpler function embedded in the complex one.
- Example: H(x) = (√x + 1)/(√x - 1) → recognize f(x) = √x appears multiple times, then g(x) = (x + 1)/(x - 1), so H = g ∘ f.

✅ Checking your decomposition

Always verify by performing the composition: if you claim F = g ∘ f, compute (g ∘ f)(x) and confirm it equals F(x).

🌍 Real-world applications

🌍 Relating quantities through intermediate variables

Function composition naturally models situations where quantities are related indirectly through a common variable.

Example from the excerpt:

Surface area of a sphere: S(r) = 4πr² (surface area as a function of radius).
Radius changing over time: r(t) = 3t² (radius as a function of time).
Composite (S ∘ r)(t) = 36πt⁴ gives surface area directly as a function of time, bypassing the "middle man" r.

This allows direct computation without intermediate steps and is fundamental in calculus for related rates problems.

🔗 Don't confuse with arithmetic operations

Composition is different from the four arithmetic operations on functions (addition, subtraction, multiplication, division) covered in earlier sections.
Unlike addition (f + g = g + f), composition is not commutative: generally g ∘ f ≠ f ∘ g.
Think of composition as a fundamentally different way to combine functions—it's about chaining processes, not combining outputs.

Inverse Functions

5.2 Inverse Functions

🧭 Overview

🧠 One-sentence thesis

A function is invertible if and only if it is one-to-one (different inputs produce different outputs), and the inverse function reverses the original process by swapping inputs and outputs.

📌 Key points (3–5)

What makes a function invertible: A function must be one-to-one, meaning different inputs always produce different outputs.
How to test graphically: Use the Horizontal Line Test—if any horizontal line crosses the graph more than once, the function is not one-to-one.
How inverses work: The inverse function reverses the original process; if f(a) = b, then the inverse satisfies f⁻¹(b) = a.
Common confusion: The notation f⁻¹ does NOT mean 1/f(x); it denotes the inverse function that undoes f.
Domain and range swap: The domain of f becomes the range of f⁻¹, and vice versa; graphs of inverse functions reflect across the line y = x.

🔄 What inverse functions do

🔄 The reversing process

Inverse functions: If f and g are inverses, then (g ∘ f)(x) = x for all x in the domain of f, and (f ∘ g)(x) = x for all x in the domain of g.

Think of a function as a two-step process (like putting on socks then shoes).
The inverse reverses each step in reverse order (take off shoes first, then socks).
Example: For f(x) = 3x + 4, the function multiplies by 3 then adds 4; the inverse g(x) = (x - 4)/3 subtracts 4 then divides by 3.

🔗 Input-output relationship

If f takes input x and produces output y, then f⁻¹ takes y and returns x.
Formally: f(a) = b if and only if f⁻¹(b) = a.
On graphs: (a, b) is on the graph of f if and only if (b, a) is on the graph of f⁻¹.

⚠️ Notation warning

Don't confuse f⁻¹(x) with 1/f(x):

f⁻¹(x) is the inverse function that undoes f.
1/f(x) is the reciprocal of f(x).
Example: If f(x) = 3x + 4, then f⁻¹(x) = (x - 4)/3, which is NOT the same as 1/(3x + 4).

🎯 One-to-one functions

🎯 Definition and importance

A function f is one-to-one if f matches different inputs to different outputs. Equivalently, whenever f(c) = f(d), then c = d.

Only one-to-one functions are invertible.
If two different inputs produce the same output, you cannot reverse the process uniquely.
Example: f(x) = x² is not one-to-one because f(-2) = 4 and f(2) = 4; you cannot tell whether 4 came from -2 or 2.

📊 The Horizontal Line Test

A function is one-to-one if and only if no horizontal line intersects its graph more than once.

Test result	Meaning	Consequence
Passes	No horizontal line crosses more than once	Function is one-to-one and invertible
Fails	Some horizontal line crosses multiple times	Function is not one-to-one; not invertible as-is

This test checks whether the graph describes x as a function of y.
Example: The parabola y = x² fails because the line y = 4 crosses at both x = -2 and x = 2.

🔍 Checking analytically

To verify one-to-one algebraically:

Assume f(c) = f(d)
Use algebra to show this forces c = d
If successful, the function is one-to-one

Example: For f(x) = (1 - 2x)/5, if f(c) = f(d), then (1 - 2c)/5 = (1 - 2d)/5, which simplifies to c = d. ✓

🛠️ Finding inverse functions

🛠️ The algorithm

Steps for finding the inverse of a one-to-one function:

Write y = f(x)
Interchange x and y
Solve x = f(y) for y to obtain y = f⁻¹(x)

The interchange reminds us we are swapping inputs and outputs.
The equation y = f⁻¹(x) is equivalent to f(y) = x.

✅ Verifying your answer

Check both compositions:

Verify (f⁻¹ ∘ f)(x) = x for all x in the domain of f
Verify (f ∘ f⁻¹)(x) = x for all x in the domain of f⁻¹ (which equals the range of f)

Graphical check:

Graph both y = f(x) and y = f⁻¹(x) on the same axes
They should be reflections across the line y = x

📐 Domain restrictions matter

When solving for the inverse, domain restrictions from the original function must be tracked:

The domain of f⁻¹ equals the range of f
The range of f⁻¹ equals the domain of f
Example: If g(x) = x², x ≥ 0, then g⁻¹(x) = √x. The restriction x ≥ 0 ensures we get g⁻¹(g(x)) = √(x²) = |x| = x (since x ≥ 0).

🔧 Restricting domains to create inverses

🔧 Making non-invertible functions invertible

A function that is not one-to-one on its full domain may become one-to-one on a restricted domain.
Example: f(x) = x² is not one-to-one on all real numbers, but g(x) = x² for x ≥ 0 IS one-to-one.

✂️ Choosing the restriction

Restrict the domain to a portion where the function passes the Horizontal Line Test.
Common choice: take the "left half" or "right half" of a parabola.
The restriction must be stated explicitly in the inverse formula.

Example from the excerpt:

j(x) = x² - 2x + 4, x ≤ 1 (left half of parabola)
After finding j⁻¹(x) = 1 - √(x - 3), the condition x ≤ 1 ensures the algebra works correctly when checking compositions.

🔄 Asymptotes swap roles

When finding inverses of rational functions:

A vertical asymptote of f becomes a horizontal asymptote of f⁻¹
A horizontal asymptote of f becomes a vertical asymptote of f⁻¹
This happens because x and y are interchanged

📊 Properties and theorems

📊 Key properties

If f and g are inverses:

The range of f is the domain of g
The domain of f is the range of g
(a, b) is on the graph of f if and only if (b, a) is on the graph of g
The graphs are reflections about y = x

🎯 Uniqueness

If a function is invertible, it has exactly one inverse, denoted f⁻¹.

You cannot have two different functions both serving as the inverse of f.
This justifies the special notation f⁻¹.

🔗 Equivalent conditions for invertibility

The following are equivalent (all true together or all false together):

f is invertible
f is one-to-one
The graph of f passes the Horizontal Line Test

🌍 Application example

🌍 Price-demand context

The excerpt gives a price-demand equation p(x) = -1.5x + 250 for 0 ≤ x ≤ 166:

x = number of systems sold weekly
p(x) = price per system in dollars

Finding the inverse p⁻¹(x) = (500 - 2x)/3:

Input: price per system
Output: weekly sales
Interpretation: p⁻¹(220) = 20 means if the price is set at $220, then 20 systems will be sold weekly.

🔗 Composing with other functions

Composing profit P(x) with p⁻¹(x):

P(x) gives weekly profit as a function of weekly sales
p⁻¹(x) gives weekly sales as a function of price
(P ∘ p⁻¹)(x) gives weekly profit as a function of price
This composition uses sales as the "middle man" connecting price to profit.

Practical use: Find the price that maximizes profit by finding the vertex of the composed function.

Other Algebraic Functions

5.3 Other Algebraic Functions

🧭 Overview

🧠 One-sentence thesis

Algebraic functions—combinations of polynomials, rational functions, and radicals—require careful handling of domains, exponents, and radicals, and their graphs can exhibit new features like cusps and unusual steepness that distinguish them from simpler function families.

📌 Key points (3–5)

What algebraic functions are: combinations of polynomial and rational functions with radical operations (roots).
Domain concerns: even-indexed radicals require non-negative radicands; negative exponents create denominators that cannot be zero.
Graphical features: algebraic functions can have cusps, unusual steepness (vertical tangents), and multiple horizontal asymptotes—features not seen in rational functions alone.
Common confusion: when simplifying fractional exponents, the square root of x squared equals the absolute value of x, not simply x; always convert to radicals when in doubt.
Sign diagrams work here too: the Intermediate Value Theorem applies to algebraic functions, so the sign-diagram method from earlier chapters extends to solving inequalities involving radicals.

📐 Radicals and roots

📐 Definition of principal nth root

Principal nth root: For a real number x and natural number n, the principal nth root (denoted n√x) is the unique real number satisfying (n√x)^n = x. When n is odd, this is defined for all real x; when n is even, it requires x ≥ 0 and n√x ≥ 0.

The index is the number n; the radicand is the number x.
For n = 2 (square root), we write √x instead of 2√x.
Why the restriction for even roots: both x = −2 and x = 2 satisfy x⁴ = 16, but the principal fourth root ⁴√16 = 2, not −2.
Example: The cube root ³√(−8) = −2 because (−2)³ = −8, but the fourth root ⁴√16 must be the non-negative solution, which is 2.

🔄 Radicals as inverse functions

The function f(x) = n√x can be defined as the inverse of g(x) = x^n.
When n is even, the domain of g must be restricted to [0, ∞) to make it one-to-one.
Graphically: reflect the graph of y = x^n across the line y = x to obtain y = n√x.
Behavior patterns:
- Even-indexed roots (√x, ⁴√x, ⁶√x): steepen vertically near x = 0 and flatten horizontally as x → ∞.
- Odd-indexed roots (³√x, ⁵√x, ⁷√x): steepen near x = 0 and flatten as x → ±∞.

🧮 Properties of radicals

The excerpt lists these key properties (all assume the roots are real numbers):

Property	Formula	Notes
Product Rule	n√(xy) = n√x · n√y	Multiply radicands or multiply roots
Powers of Radicals	n√(x^m) = (n√x)^m	Take the root first, then raise to power
Quotient Rule	n√(x/y) = n√x / n√y	Provided y ≠ 0
Simplifying powers	n√(x^n) = x if n is odd; = \|x\| if n is even	The absolute value appears for even indices

Why absolute value for even n: ⁴√((−2)⁴) = ⁴√16 = 2 = |−2|, not −2, because the principal root must be non-negative.
Don't confuse: the last property is critical when composing roots and powers.

⚠️ Rational exponents and their pitfalls

⚠️ Definition of rational exponents

Rational exponents: For real number x, integer m, and natural number n:

x^(1/n) = n√x (defined whenever n√x is defined)

x^(m/n) = (n√x)^m = n√(x^m) (defined whenever (n√x)^m is defined)

Rational exponents behave like integer exponents, but with one critical exception.
Example: x^(2/3) means "cube root of x, then square the result" or equivalently "x squared, then take the cube root."

🚨 The cancellation trap

The problem: Applying exponent laws blindly can produce wrong answers.
Example from the excerpt: (x^(2/3))^(3/2) should equal x^(2/3 · 3/2) = x^1 = x, right?
Wrong! Substituting x = −1:
- (−1)^(2/3) = (³√(−1))² = (−1)² = 1
- Then (1)^(3/2) = (√1)³ = 1³ = 1
- So ((−1)^(2/3))^(3/2) = 1 ≠ −1
Why it fails: When we "canceled" the 2's in multiplying (2/3)·(3/2), we were canceling a square with a square root, but √(x²) = |x|, not x.
The correct simplification: (x^(2/3))^(3/2) = |x|, not x.

🛠️ Best practice with fractional exponents

Moral of the story: When simplifying fractional exponents, rewrite them as radicals to avoid errors.
The excerpt notes that (x^(3/2))^(2/3) = x does work (verification left as exercise).
Don't confuse: rational exponents are usually preferred in most contexts, but when compositions get tricky, convert to radical notation.

📊 Domains and sign diagrams

📊 Finding domains of algebraic functions

Domain restrictions come from two sources:

Even-indexed radicals: require radicand ≥ 0
Denominators: cannot equal zero (watch for negative exponents, which indicate denominators)

Procedure:

Identify all even-indexed radicals and set their radicands ≥ 0.
Identify all denominators (including those from negative exponents) and exclude zeros.
Solve inequalities using sign diagrams when needed.
Example: For g(x) = √(2 − ⁴√(x+3)), first require x + 3 ≥ 0 (so x ≥ −3), then require 2 − ⁴√(x+3) ≥ 0, which needs its own sign diagram to solve.

📈 Sign diagrams for algebraic functions

The procedure from earlier chapters extends here:

Mark domain exclusions on the number line with ⊗
Find zeros of the function and mark them with 0
Choose test values in each interval
Determine the sign in each interval

Why this works: Algebraic functions are continuous on their domains (a fact proved in Calculus), so the Intermediate Value Theorem applies.
Example: To solve x^(4/3) − x^(2/3) − 6 > 0, set r(x) = x^(4/3) − x^(2/3) − 6, find its zeros, construct a sign diagram, and read off where r is positive.

🔍 Solving equations with radicals

Techniques demonstrated:

Isolate the radical and raise both sides to the appropriate power.
Check for extraneous solutions: raising both sides to an even power can introduce solutions that don't satisfy the original equation.
Quadratic-in-disguise: if exponents have a 2:1 ratio (like x^(4/3) and x^(2/3)), substitute u for the smaller power.
Example: x^(4/3) − x^(2/3) − 6 = 0 becomes (x^(2/3))² − x^(2/3) − 6 = 0; let u = x^(2/3) to get u² − u − 6 = 0.

🎨 Graphical features unique to algebraic functions

🎨 New behaviors not seen in rational functions

Algebraic functions can exhibit:

Feature	Description	Example from excerpt
Unusual steepness	Graph becomes nearly vertical (vertical tangent in Calculus)	Near x = 2 in f(x) = 3x·³√(2−x)
Cusps	Sharp turn or point where graph is not smooth	At x = 0 in f(x) = x^(2/3)
Multiple horizontal asymptotes	Different limits as x → ∞ and x → −∞	k(x) = 2x/√(x²−1) has y = 2 and y = −2

Unusual steepness: The graph appears to "shoot up" or "dive down" almost vertically at certain points.
Cusps: A sharp corner where the graph changes direction abruptly (seen in absolute value and some radical functions).
Don't confuse: Rational functions can have at most one horizontal asymptote, but algebraic functions can have two (one for each direction).

🔬 Analyzing end behavior

For k(x) = 2x/√(x²−1):
- As x → ∞: the radicand x² − 1 ≈ x², so √(x²−1) ≈ √(x²) = |x| = x (since x > 0), thus k(x) ≈ 2x/x = 2.
- As x → −∞: |x| = −x (since x < 0), so k(x) ≈ 2x/(−x) = −2.
This explains the two different horizontal asymptotes.

🖼️ Comparing similar functions

The excerpt shows f(x) = x^(2/3) and g(x) = x^(4/3) − 6 near x = 0:

Both involve cube roots, but f has a cusp at x = 0 while g does not.
The difference comes from the specific combination of powers and roots.
Calculus is needed to fully understand these distinctions.

🧪 Applications and problem-solving

🧪 Applied domain considerations

In real-world problems, the mathematical domain may be further restricted by context.
Example: Cable-laying problem where x represents miles along a road—domain is [0, 50] because you can't run negative miles or more than the total distance.
Always state both the mathematical domain (from the function itself) and the applied domain (from the problem context).

🔧 Factoring with fractional exponents

When factoring expressions with fractional exponents:

Factor out the term with the smallest exponent.
Example: 3(2−x)^(1/3) − x(2−x)^(−2/3)
- The smaller exponent is −2/3 (since −2/3 < 1/3)
- Factor out (2−x)^(−2/3): (2−x)^(−2/3)[3(2−x)^(1/3−(−2/3)) − x]
- Simplify: (2−x)^(−2/3)[3(2−x)^1 − x] = (2−x)^(−2/3)(6 − 4x)

🧮 Alternative: common denominator approach

Rewrite negative exponents as fractions with positive exponents in the denominator.
Find a common denominator and combine.
Example: 3(2−x)^(1/3) − x/(2−x)^(2/3) = [3(2−x)^(1/3)·(2−x)^(2/3) − x]/(2−x)^(2/3) = [3(2−x) − x]/(2−x)^(2/3)
Both methods yield the same result; choose whichever feels more natural.

Introduction to Exponential and Logarithmic Functions

6.1 Introduction to Exponential and Logarithmic Functions

🧭 Overview

🧠 One-sentence thesis

Logarithmic functions serve as the inverses of exponential functions, allowing us to "undo" exponentiation and solve equations where the variable appears in an exponent.

📌 Key points (3–5)

Primary purpose of logarithms: to undo exponential functions, since logarithms are defined as inverses of exponentials.
How to find inverse formulas: break the original function into procedural steps, then reverse each step in opposite order—use logarithms to undo exponentiation.
Graphing inverses via transformations: track key points and asymptotes through shifts and reflections; domain and range swap between a function and its inverse.
Verification requirement: inverses must satisfy both composition identities—composing f and f-inverse in either order returns the input x.
Common confusion: horizontal asymptotes in exponential functions become vertical asymptotes in their logarithmic inverses; the line y = x is the axis of symmetry between a function and its inverse.

🔄 Inverse relationship between exponentials and logarithms

🔄 Definition and core identity

The fundamental property: b raised to the power a equals c if and only if log base b of c equals a.

This bidirectional relationship is the foundation: exponential form and logarithmic form are two ways to express the same relationship.
The excerpt emphasizes that logarithms were defined specifically to invert exponential functions.
Example: 2 to the power 3 equals 8 can be rewritten as log base 2 of 8 equals 3.

🔄 Composition properties

The excerpt states two key composition identities:

log base b of (b to the power x) equals x for all real numbers x.
b to the power (log base b of x) equals x for all x greater than 0.
These show that applying an exponential function and then its logarithmic inverse (or vice versa) returns the original input.
Don't confuse: the domain restriction (x greater than 0) applies only to the second identity because logarithms require positive inputs.

🔧 Finding inverse formulas procedurally

🔧 The procedural method

The excerpt presents a step-by-step approach to finding inverse functions without solving equations algebraically:

Break the original function into a sequence of operations.
Identify the inverse operation for each step.
Apply the inverse operations in reverse order.

🔧 Example walkthrough

For f(x) = 2 to the power (x minus 1) minus 3:

Forward steps: (a) subtract 1, (b) use as exponent on 2, (c) subtract 3.
Inverse steps in reverse: (a) add 3, (b) take log base 2, (c) add 1.
Result: f-inverse(x) = log base 2 of (x plus 3) plus 1.
The excerpt emphasizes: "How do we undo the second step? The answer is we use the logarithm. By definition, log base 2 of x undoes exponentiation by 2."

🔧 Why logarithms are needed

Undoing addition/subtraction is straightforward (add/subtract the opposite).
Undoing exponentiation requires logarithms because they are defined precisely for this purpose.
Example: if the function raises 2 to some power, the inverse must apply log base 2 to undo that step.

📊 Graphing using transformations

📊 Tracking points and asymptotes

The excerpt demonstrates graphing by:

Starting with a basic function (e.g., g(x) = 2 to the power x or j(x) = log base 2 of x).
Choosing three reference points plus the asymptote.
Applying transformations (horizontal/vertical shifts) to these points and the asymptote.
Connecting the transformed points in the same order and manner as the original graph.

📊 Horizontal vs vertical asymptotes

Function type	Asymptote orientation	What happens under transformation
Exponential (e.g., 2 to the power x)	Horizontal (e.g., y = 0)	Vertical shifts move the asymptote up/down
Logarithmic (e.g., log base 2 of x)	Vertical (e.g., x = 0)	Horizontal shifts move the asymptote left/right

Don't confuse: when you take the inverse, a horizontal asymptote becomes a vertical asymptote and vice versa.
Example: f(x) = 2 to the power (x minus 1) minus 3 has horizontal asymptote y = negative 3; its inverse f-inverse(x) = log base 2 of (x plus 3) plus 1 has vertical asymptote x = negative 3.

📊 Domain and range swap

The domain of f becomes the range of f-inverse.
The range of f becomes the domain of f-inverse.
Example: if f has domain (negative infinity, infinity) and range (negative 3, infinity), then f-inverse has domain (negative 3, infinity) and range (negative infinity, infinity).

✅ Verifying inverses

✅ Composition verification

To confirm that two functions are inverses, the excerpt requires checking both:

(f-inverse composed with f)(x) = x for all x in the domain of f.
(f composed with f-inverse)(x) = x for all x in the domain of f-inverse.

✅ Step-by-step composition

The excerpt shows detailed algebraic verification:

Substitute one function into the other.
Simplify using the inverse properties (e.g., log base 2 of 2 to the power u equals u).
Confirm the result simplifies to x.
Pay attention to domain restrictions: the excerpt notes "Pay attention—can you spot in which step below we need x greater than negative 3?" when verifying the second composition.

✅ Graphical symmetry check

Graph both f and f-inverse on the same axes.
Verify symmetry about the line y = x.
The excerpt states: "we see the symmetry about the line y = x" as the final confirmation.
This visual check reinforces the algebraic verification.

🎯 End behavior and continuity

🎯 Analyzing limits

The excerpt briefly discusses end behavior for a function g(x) = ln of (x divided by (x minus 1)):

As x approaches positive or negative infinity, the graph suggests g(x) approaches 0.
Analytical verification: as x approaches infinity, x divided by (x minus 1) is approximately 1.
Therefore, g(x) = ln of (approximately 1) = ln(1) = 0.

🎯 When calculator graphs are necessary

The excerpt notes that for more complex functions, "barring a more detailed analysis using Calculus, the calculator graph is the best we can do."
This acknowledges the limits of algebraic techniques at this level.
However, end behavior can sometimes be verified by approximating the argument of the logarithm.

Properties of Logarithms

6.2 Properties of Logarithms

🧭 Overview

🧠 One-sentence thesis

Logarithmic properties mirror exponential properties through inverse relationships, enabling us to transform complex logarithmic expressions by converting products into sums, quotients into differences, and powers into multiples.

📌 Key points (3–5)

Three core rules: Product Rule (log of a product = sum of logs), Quotient Rule (log of a quotient = difference of logs), Power Rule (log of a power = multiple of the log).
Inverse correspondence: Each logarithm property corresponds directly to an exponential property because logarithms are inverse functions of exponentials.
Common confusion: Expanding vs. condensing—expanding reads properties left-to-right (breaking apart), condensing reads right-to-left (combining); also, there is NO property for log(x + y) or log(x − y).
Domain changes: Expanding or condensing logarithms can change the domain of the expression, which matters when solving equations.
Change of Base: All logarithmic and exponential functions are scalings of each other; any base can be converted to any other base using change-of-base formulas.

🔗 The three algebraic properties

➕ Product Rule

Product Rule: log_b(uw) = log_b(u) + log_b(w)

What it means: The logarithm of a product equals the sum of the logarithms.
Why it works: Because exponentials turn addition into multiplication (b^(c+d) = b^c · b^d), logarithms (the inverse) must turn multiplication into addition.
Example: log₂(8x) = log₂(8) + log₂(x) = 3 + log₂(x).

➖ Quotient Rule

Quotient Rule: log_b(u/w) = log_b(u) − log_b(w)

What it means: The logarithm of a quotient equals the difference of the logarithms.
Why it works: Exponentials turn subtraction into division (b^a = b^c / b^d when a = c − d), so logarithms turn division into subtraction.
Example: log₃((x−1)/(x+1)) = log₃(x−1) − log₃(x+1).

🔢 Power Rule

Power Rule: log_b(u^w) = w · log_b(u)

What it means: The logarithm of a power equals the exponent times the logarithm of the base.
Why it works: Exponentials turn multiplication into powers (b^(uw) = (b^u)^w), so logarithms turn powers into multiplication.
Example: log(x²) = 2 log(x); ln((3/(ex))²) = 2 ln(3/(ex)).

🧩 Expanding logarithms (left-to-right)

🧩 What "expand" means

Goal: Rewrite a single logarithm with products, quotients, or powers inside into a sum, difference, or multiple of simpler logarithms outside.
Direction: Read the three rules from left to right.
Rule of thumb: Apply properties in reverse order of operations—handle powers first (Power Rule), then products/quotients (Product/Quotient Rules).

📐 Handling radicals and nested operations

Radicals: Rewrite roots as fractional exponents (e.g., cube root of x = x^(1/3)), then apply the Power Rule.
Example: log(∛(100x²yz⁵)) = log((100x²yz⁵)^(1/3)) = (1/3) log(100x²yz⁵), then expand the inside using Product and Quotient Rules.
Nested structures: Work from the outermost operation inward, applying one rule at a time.

⚠️ Common pitfalls when expanding

No property for sums or differences inside the log: log(x² − 4) ≠ log(x²) − log(4). Instead, factor first if possible: log₁₁₇(x² − 4) = log₁₁₇((x+2)(x−2)) = log₁₁₇(x+2) + log₁₁₇(x−2).
Domain restriction: Expanding can restrict the domain. For instance, f(x) = log₁₁₇(x² − 4) has domain (−∞, −2) ∪ (2, ∞), but g(x) = log₁₁₇(x+2) + log₁₁₇(x−2) has domain (2, ∞) only, because both logs inside g require their arguments to be positive.
Don't invent properties: If it's not in the textbook, it's probably not true.

🔄 Condensing logarithms (right-to-left)

🔄 What "condense" means

Goal: Combine sums, differences, or multiples of logarithms into a single logarithm.
Direction: Read the three rules from right to left.
Rule of thumb: Follow the usual order of operations—apply the Power Rule first (to handle coefficients), then Product and Quotient Rules.

🛠️ Handling coefficients and constants

Coefficients: Use the Power Rule to move a coefficient inside as an exponent.
- Example: 2 log(y) = log(y²).
Constants: Rewrite constants as logarithms of the appropriate base.
- Example: 3 = log₂(2³) = log₂(8), so 4 log₂(x) + 3 = log₂(x⁴) + log₂(8) = log₂(8x⁴).
- Example: 1/2 = ln(e^(1/2)) = ln(√e), so −ln(x) − 1/2 = ln(x⁻¹) − ln(√e) = ln(1/(x√e)).

⚠️ Domain expansion warning

Condensing can increase the domain.
Example: f(x) = log₃(x−1) − log₃(x+1) has domain (1, ∞), but g(x) = log₃((x−1)/(x+1)) has domain (−∞, −1) ∪ (1, ∞).
Why it matters: When solving logarithmic equations, you must check for extraneous solutions because condensing may introduce solutions outside the original domain.

🔀 Change of Base formulas

🔀 The two formulas

Exponential form: a^x = b^(x log_b(a))
Logarithmic form: log_a(x) = log_b(x) / log_b(a)

What they say: You can express any exponential or logarithm in terms of any other base.
Inverse relationship: To change the base of an exponential, multiply the input by log_b(a); to change the base of a logarithm, divide the output by log_b(a).

🧮 Practical use: calculator approximations

Calculators typically have only LOG (base 10) and LN (base e) buttons.
To compute log₂(7), use the logarithmic change-of-base formula with b = 10 or e:
- log₂(7) = log(7) / log(2) or ln(7) / ln(2).
Example: log₄(5) = ln(5) / ln(4) ≈ 1.16. Verify: 4^1.16 ≈ 5.

🌐 All bases are equivalent

Key insight: All exponential and logarithmic functions are just scalings of one another.
This explains why their graphs have similar shapes.
In principle, you could do all mathematics with a single base (10, e, 42, or any other).

📊 Summary table of properties

Property	Exponential form (Theorem 6.5)	Logarithmic form (Theorem 6.6)	In words
Product	f(u + w) = f(u) · f(w)	log_b(uw) = log_b(u) + log_b(w)	Adding inputs (exp) ↔ Multiplying arguments (log)
Quotient	f(u − w) = f(u) / f(w)	log_b(u/w) = log_b(u) − log_b(w)	Subtracting inputs (exp) ↔ Dividing arguments (log)
Power	f(uw) = (f(u))^w	log_b(u^w) = w · log_b(u)	Multiplying inputs (exp) ↔ Exponentiating arguments (log)

Why the correspondence: Logarithms are inverse functions of exponentials, so they interchange the roles of inputs and outputs.

Budget: 1000000 Used: 245936 (24.6%) Remaining: 754064

Exponential Equations and Inequalities

6.3 Exponential Equations and Inequalities

🧭 Overview

🧠 One-sentence thesis

Exponential equations can be solved either by expressing both sides with a common base and equating exponents or by taking the natural logarithm of both sides, and exponential inequalities are solved using sign diagrams after isolating the exponential and finding zeros.

📌 Key points (3–5)

Two main solution strategies: rewrite both sides with a common base (when convenient) or take the natural log of both sides and use the Power Rule.
One-to-one property is key: if base b raised to exponent u equals base b raised to exponent w, then u equals w; this allows us to equate exponents directly.
Quadratic-in-disguise patterns: when an equation has three terms where one exponent is exactly twice another (e.g., e to the 2x and e to the x), substitute u for the simpler exponential to get a quadratic equation.
Common confusion: when taking the natural log of both sides, divide by ln(base) as a number, not by "ln" as a symbol—just as you wouldn't divide by the square root symbol.
Inequalities use sign diagrams: isolate the exponential, set the expression equal to zero, find zeros and undefined points, then test intervals to determine where the inequality holds.

🔧 Core solution techniques

🔧 Common-base method

When both sides of an exponential equation can be expressed as powers of the same base, rewrite them and equate the exponents.
Example: 2 to the 3x equals 16 to the (1 minus x). Since 16 equals 2 to the 4th, rewrite as 2 to the 3x equals 2 to the 4(1 minus x). By the one-to-one property, 3x equals 4(1 minus x), so x equals 4/7.
This method is "convenient" only when you can easily express both sides as powers of the same base.

🪵 Natural log method

When a common base is not convenient, take the natural log of both sides and use the Power Rule: ln(b to the u) equals u times ln(b).
Example: 3 to the (negative 0.1t) equals 2. Taking ln of both sides gives (negative 0.1t) times ln(3) equals ln(2). Solve for t: t equals negative ln(2) divided by (0.1 times ln(3)), which simplifies to negative 10 ln(2) divided by ln(3).
Don't confuse: Treat ln(base) as a constant number. Divide both sides by ln(base), not by "ln" itself. Just as you wouldn't divide by the square root symbol when solving x times square root of 2 equals 5, you cannot divide by "ln."

🔄 Verifying solutions

Substitute the solution back into the original equation and use inverse properties and log rules to confirm.
Graphically: plot both sides of the equation as separate functions and check that they intersect at the solution value.
Example: For 2 to the x equals 129, the solution x equals log base 2 of 129 can be rewritten using change of base as ln(129) divided by ln(2).

🧩 Special equation patterns

🧩 Quadratic-in-disguise

When an equation has three terms and one exponent is exactly twice another, substitute u for the simpler exponential to convert to a quadratic.
Example: 25 to the x equals 5 to the x plus 6. Rewrite 25 as 5 squared: (5 squared) to the x equals 5 to the (2x). Let u equal 5 to the x, so u squared equals 5 to the (2x). The equation becomes u squared equals u plus 6, or u squared minus u minus 6 equals 0. Solving gives u equals negative 2 or u equals 3. Since 5 to the x equals negative 2 has no real solution (exponentials are always positive), only 5 to the x equals 3 is valid. Taking ln gives x equals ln(3) divided by ln(5).
Another example: e to the x minus e to the (negative x) equals 10. Rewrite e to the (negative x) as 1 divided by e to the x, clear denominators to get e to the (2x) minus 1 equals 10 times e to the x. Let u equal e to the x, so u squared minus 10u minus 1 equals 0. Use the quadratic formula to get u equals 5 plus or minus square root of 26. Discard the negative root (exponentials are positive), so e to the x equals 5 plus square root of 26, giving x equals ln(5 plus square root of 26).

🧩 Isolating the exponential first

Before applying either method, isolate the exponential function by clearing denominators, moving constants, and factoring.
Example: 75 equals 100 divided by (1 plus 3 times e to the (negative 2t)). Clear denominators: 75 times (1 plus 3 times e to the (negative 2t)) equals 100. Expand: 75 plus 225 times e to the (negative 2t) equals 100. Isolate: 225 times e to the (negative 2t) equals 25, so e to the (negative 2t) equals 1/9. Taking ln: negative 2t equals ln(1/9) equals negative ln(9). So t equals ln(9) divided by 2, which simplifies to ln(3).

📊 Solving exponential inequalities

📊 Sign diagram method

Get zero on one side of the inequality, then set the resulting expression equal to a function r(x).
Find the domain of r (exclude points where denominators are zero or logs are undefined).
Find zeros of r by solving r(x) equals 0.
Construct a sign diagram: mark zeros and undefined points on a number line, choose test values in each interval, and determine the sign of r in each interval.
Read off the solution from the diagram based on whether the inequality is greater than, less than, or equal to zero.

📊 Choosing test values carefully

When zeros or undefined points involve logarithms (e.g., ln(4) or ln(6)), use the fact that ln is an increasing function: if a is less than b, then ln(a) is less than ln(b).
Example: To test intervals around ln(4) and ln(6), note that ln(3) is less than ln(4) is less than ln(5) is less than ln(6) is less than ln(7). Use ln(3), ln(5), and ln(7) as test values.
Evaluate r at these points using inverse properties: e to the ln(a) equals a. For instance, if r(x) equals (12 minus 2 times e to the x) divided by (e to the x minus 4), then r(ln(3)) equals (12 minus 2 times 3) divided by (3 minus 4) equals negative 6.

📊 Graphical confirmation

Plot both sides of the inequality as separate functions and observe where one graph is above or below the other.
Check that the solution intervals from the sign diagram match the regions where the inequality holds graphically.

🔍 Inverse functions with exponentials

🔍 Finding the inverse

Start with y equals f(x), then swap x and y.
Solve for y by isolating the exponential and taking the natural log.
Example: f(x) equals (5 times e to the x) divided by (e to the x plus 1). Swap: x equals (5 times e to the y) divided by (e to the y plus 1). Clear denominators: x times (e to the y plus 1) equals 5 times e to the y. Expand: x times e to the y plus x equals 5 times e to the y. Rearrange: x equals e to the y times (5 minus x). So e to the y equals x divided by (5 minus x). Taking ln: y equals ln(x divided by (5 minus x)). Thus f inverse of x equals ln(x divided by (5 minus x)).

🔍 Verifying the inverse

Analytically: check that (f inverse composed with f)(x) equals x for all x in the domain of f, and (f composed with f inverse)(x) equals x for all x in the domain of f inverse.
Graphically: plot y equals f(x) and y equals f inverse(x) on the same axes and observe symmetry about the line y equals x.
The domain of f equals the range of f inverse, and the range of f equals the domain of f inverse.

🌡️ Applied context example

🌡️ Temperature modeling

Example from the excerpt: Coffee temperature T (in degrees Fahrenheit) t minutes after serving is modeled by T(t) equals 70 plus 90 times e to the (negative 0.1t). When is the coffee warmer than 100 degrees?
Set up the inequality: 70 plus 90 times e to the (negative 0.1t) greater than 100. Simplify: 90 times e to the (negative 0.1t) minus 30 greater than 0.
Let r(t) equal 90 times e to the (negative 0.1t) minus 30. Domain is restricted to [0, infinity) by context.
Find zeros: 90 times e to the (negative 0.1t) equals 30, so e to the (negative 0.1t) equals 1/3. Taking ln: negative 0.1t equals ln(1/3) equals negative ln(3). So t equals 10 ln(3).
Sign diagram shows r(t) greater than 0 on [0, 10 ln(3)). Approximating: 10 ln(3) is approximately 10.986 minutes. Interpretation: the coffee is warmer than 100 degrees for approximately the first 11 minutes.

🌡️ Context constraints

The domain is artificially restricted by the real-world context (time cannot be negative).
Always interpret the mathematical solution in terms of the applied problem: translate the interval back into a meaningful statement about the scenario.

Logarithmic Equations and Inequalities

6.4 Logarithmic Equations and Inequalities

🧭 Overview

🧠 One-sentence thesis

Logarithmic equations and inequalities can be solved by isolating the logarithm and then either equating arguments when bases match or converting to exponential form, but extraneous solutions must always be checked because negative arguments inside logarithms are undefined.

📌 Key points (3–5)

Two main strategies: equate arguments when bases are the same, or rewrite the log equation as an exponential equation.
Isolate first: before applying either strategy, isolate the logarithmic function on one side.
Extraneous solutions are common: algebraic solutions may produce negative arguments inside logarithms, which are not in the domain and must be discarded.
Common confusion: a solution that "looks correct" algebraically (e.g., log of negative equals log of negative) is still invalid if the argument is negative—both sides are undefined.
Inequalities use sign diagrams: because logarithmic functions are continuous on their domains, sign diagrams work for inequalities involving logs.

🔧 Two core strategies for equations

🔧 Strategy A: Equate arguments (same base)

If both sides of the equation are logarithms with the same base, you can equate what is inside the logs.
Example: log base 2 of x equals log base 2 of 5 implies x equals 5 (by Theorem 6.4).
This works only when the bases match; if they don't, use change of base or convert to exponential form.

🔧 Strategy B: Convert to exponential form

If one side is a number (not a log), rewrite the log equation as an exponential equation.
Example: log base 2 of x equals 3 becomes 2 to the power 3 equals x, so x equals 8.
This is often faster when you have a log equal to a constant.

🔧 When to use which

Use Strategy A when both sides are already logs with the same base.
Use Strategy B when one side is a plain number or when combining logs into a single log with a constant on the other side.

⚠️ Domain and extraneous solutions

⚠️ Why extraneous solutions appear

Algebraic manipulation (like squaring or combining logs) can introduce solutions that violate the domain of the original equation.
The domain of log functions requires the argument to be positive; any solution that makes an argument zero or negative is extraneous.

⚠️ How to check

Substitute the candidate solution back into the original equation.
If any logarithm has a negative or zero argument, that solution is not valid.
Example: solving log base 117 of (1 minus 3x) equals log base 117 of (x squared minus 3) gives x equals negative 4 and x equals 1. Substituting x equals 1 produces log base 117 of negative 2 equals log base 117 of negative 2, which is undefined, so x equals 1 is extraneous.

⚠️ Graphical confirmation

Graph both sides of the equation as separate functions and find intersections.
Only the x-values where the graphs actually intersect (and both functions are defined) are valid solutions.
Don't confuse: two expressions that "look identical" (like log of negative 2 equals log of negative 2) are still not real numbers and do not represent a valid solution.

🧮 Solving logarithmic equations step-by-step

🧮 Isolate the logarithm

Move all terms involving logs to one side and constants to the other.
Example: 2 minus natural log of (x minus 3) equals 1 becomes natural log of (x minus 3) equals 1.

🧮 Use log properties to combine

Apply Product Rule: log of A plus log of B equals log of (A times B).
Apply Quotient Rule: log of A minus log of B equals log of (A divided by B).
Apply Power Rule: k times log of A equals log of (A to the power k).
Example: log base 6 of (x plus 4) plus log base 6 of (3 minus x) equals 1 becomes log base 6 of [(x plus 4) times (3 minus x)] equals 1.

🧮 Convert to exponential and solve

Rewrite the log equation in exponential form.
Example: log base 6 of [(x plus 4)(3 minus x)] equals 1 becomes 6 to the power 1 equals (x plus 4)(3 minus x).
Solve the resulting polynomial or rational equation.

🧮 Handle different bases

If logs have different bases, use change of base formula to convert to a common base.
Example: 1 plus 2 log base 4 of (x plus 1) equals 2 log base 2 of x. Convert log base 4 to log base 2: log base 4 of (x plus 1) equals (log base 2 of (x plus 1)) divided by (log base 2 of 4) equals one half times log base 2 of (x plus 1).

📊 Solving logarithmic inequalities

📊 Use sign diagrams

Logarithmic functions are continuous on their domains, so sign diagrams can be used for inequalities.

Define a function r(x) by moving all terms to one side so the inequality is r(x) greater than or equal to 0 (or less than 0).
Find the domain of r(x) (require all log arguments positive).
Find zeros of r(x) by setting r(x) equals 0.
Choose test values in each interval and determine the sign of r(x).
Read off the solution intervals where r(x) has the correct sign.

📊 Example: one over (natural log of x plus 1) less than or equal to 1

Move terms: one over (natural log of x plus 1) minus 1 less than or equal to 0.
Combine: (1 minus (natural log of x plus 1)) over (natural log of x plus 1) less than or equal to 0, which simplifies to (natural log of x) over (natural log of x plus 1) greater than or equal to 0.
Domain: x greater than 0 and natural log of x plus 1 not equal to 0, so x not equal to 1 over e.
Zeros: natural log of x equals 0 gives x equals 1.
Test intervals: (0, 1 over e), (1 over e, 1), (1, infinity).
Solution: (0, 1 over e) union [1, infinity).

📊 Quadratic-in-disguise inequalities

If the inequality involves (log of x) squared or similar, set u equals log of x to get a quadratic in u.
Solve the quadratic for u, then convert back to x.
Example: (log base 2 of x) squared minus 2 log base 2 of x minus 3 less than 0. Let u equals log base 2 of x, solve u squared minus 2u minus 3 less than 0 to get negative 1 less than u less than 3, then convert back: one half less than x less than 8.

📊 Compound inequalities

For inequalities like a less than or equal to log of x less than or equal to b, split into two separate inequalities and take the intersection.
Example: 7.8 less than or equal to negative log of [H plus] less than or equal to 8.5 becomes negative 7.8 greater than or equal to log of [H plus] and log of [H plus] greater than or equal to negative 8.5, which gives 10 to the power negative 8.5 less than or equal to [H plus] less than or equal to 10 to the power negative 7.8.

🔄 Finding inverses of logarithmic functions

🔄 Standard procedure

Write y equals f(x).
Interchange x and y.
Solve for y (this may require exponential form or algebraic manipulation).
The result is f inverse of x.

🔄 Example: f(x) equals (log of x) over (1 minus log of x)

Write y equals (log of x) over (1 minus log of x).
Interchange: x equals (log of y) over (1 minus log of y).
Multiply both sides by (1 minus log of y): x times (1 minus log of y) equals log of y.
Expand: x minus x times log of y equals log of y.
Collect: x equals x times log of y plus log of y equals (x plus 1) times log of y.
Solve: log of y equals x over (x plus 1).
Convert to exponential: y equals 10 to the power (x over (x plus 1)).
So f inverse of x equals 10 to the power (x over (x plus 1)).

🔄 Graphical check

Graph y equals f(x) and y equals f inverse of x on the same axes.
The graphs should be reflections of each other across the line y equals x.
This confirms the inverse relationship visually.

Applications of Exponential and Logarithmic Functions

6.5 Applications of Exponential and Logarithmic Functions

🧭 Overview

🧠 One-sentence thesis

Exponential and logarithmic functions model diverse real-world phenomena—from compound interest and population growth to radioactive decay and temperature change—using the same underlying mathematical principles across different disciplines.

📌 Key points (3–5)

Compound interest models: Interest can be compounded periodically (using discrete compounding) or continuously, with more frequent compounding yielding slightly more money.
Growth and decay share structure: Uninhibited population growth and radioactive decay follow similar exponential equations, differing only in the sign of the rate constant k.
Limited growth models: Newton's Law of Cooling and logistic growth describe situations where a quantity approaches a limiting value rather than growing without bound.
Common confusion: Distinguishing when to use uninhibited exponential growth (early stages, no limits) versus logistic growth (accounts for capacity limits).
Logarithms linearize data: Taking logarithms of exponential data transforms it into linear form, making it easier to fit models and estimate parameters.

💰 Financial applications

💰 Simple versus compound interest

Simple interest: The amount of interest I accrued at annual rate r on principal P after t years is I = Prt, giving total amount A = P(1 + rt).

Simple interest earns a fixed amount each period based only on the original principal.
Compound interest earns "interest on interest"—each compounding period uses the current balance as the new principal.
Example: $100 at 5% annual rate yields $105 with simple interest after one year, but $105.06 if compounded semiannually (interest earned in the first half-year itself earns interest in the second half).

💰 Periodic compounding formula

Compounded interest: If principal P is invested at annual rate r compounded n times per year, the amount after t years is A(t) = P(1 + r/n)^(nt).

The variable n represents compounding frequency: n = 12 for monthly, n = 4 for quarterly, n = 2 for semiannually.
More frequent compounding increases returns, but the effect diminishes as n grows.
To find doubling time, set A(t) = 2P and solve for t; the result depends only on r and n, not on P itself.

💰 Continuous compounding

Continuously compounded interest: If principal P is invested at annual rate r with continuous compounding, the amount after t years is A(t) = Pe^(rt).

As the number of compoundings per year approaches infinity, the formula converges to the exponential function with base e.
The mathematical limit: as n → ∞, (1 + 1/n)^n → e.
Continuous compounding yields only slightly more than very frequent periodic compounding (e.g., monthly versus continuous differs by less than 1% over 35 years in the excerpt's example).
Don't confuse: Continuous compounding is a theoretical limit; real banks use periodic compounding (daily, monthly, etc.).

🌱 Growth models

🌱 Law of Uninhibited Growth

Uninhibited Growth: If a population grows such that its instantaneous rate of change is directly proportional to the current population, then N(t) = N₀e^(kt), where N₀ is the initial population and k > 0 is the growth constant.

The premise: "the more organisms there are, the faster they reproduce."
This model applies to early-stage population growth when resources are abundant and there are no limiting factors.
The growth rate k can be interpreted as a percentage: k ≈ 0.8618 means an 86.18% daily growth rate.
Example: Epithelial cells growing from 12,000 to 5 million in one week follow this model; solving for k gives the daily growth rate.

🌱 Radioactive decay

Radioactive Decay: The amount of a radioactive element at time t is A(t) = A₀e^(kt), where A₀ is the initial amount and k < 0 is the decay constant satisfying (instantaneous rate of change) = kA(t).

The decay model has the same form as uninhibited growth, but k is negative.
The premise: the rate of decay at any time is directly proportional to the amount remaining.
Half-life h is the time for half the substance to decay; it relates to k by k = -ln(2)/h.
Example: Iodine-131 with half-life 8 days and initial amount 5 grams gives A(t) = 5e^(-t·ln(2)/8), representing an 8.664% daily loss.

🌱 Logistic growth

Logistic Growth: If growth rate varies jointly with population size and available room to grow, then N(t) = L/(1 + Ce^(-kLt)), where L is the limiting population, N₀ is initial population, and C = L/N₀ - 1.

Combines features of uninhibited growth (early rapid increase) and limited growth (levels off at capacity L).
The growth rate depends on both N(t) and (L - N(t))—the more room to grow, the faster the growth.
As t → ∞, N(t) → L, meaning the population approaches but never exceeds the limiting value.
The inflection point (point of diminishing returns) occurs at half the limiting population, where growth rate begins to decline even though the function still increases.
Example: A rumor spreading through a community of 8,400 people follows N(t) = 84/(1 + 2799e^(-t)), starting with 3 people and approaching the limit of 8,400 over time.

🌡️ Temperature and limited growth

🌡️ Newton's Law of Cooling (Warming)

Newton's Law of Cooling: The temperature T of an object at time t is T(t) = Tₐ + (T₀ - Tₐ)e^(-kt), where T₀ is initial temperature, Tₐ is ambient (surrounding) temperature, and k > 0.

The rate of temperature change is proportional to the temperature difference between the object and its surroundings, not the object's temperature itself.
The same equation handles both cooling (T₀ > Tₐ) and warming (T₀ < Tₐ); the sign of (T₀ - Tₐ) determines the direction.
As t → ∞, T(t) → Tₐ: the object's temperature approaches ambient temperature asymptotically.
Example: A 40°F roast in a 350°F oven follows T(t) = 350 - 310e^(-kt); after finding k from a measurement at t = 2 hours, we can predict when the roast reaches 165°F.
Don't confuse: The constant k is always positive; whether the function increases or decreases depends on whether T₀ is less than or greater than Tₐ.

🌡️ Limited growth interpretation

Newton's Law of Cooling is sometimes called a limited growth model because T(t) remains bounded as time progresses.
The horizontal asymptote y = Tₐ represents the physical limit (room temperature for cooling coffee, oven temperature for a roasting meat).
The Second Law of Thermodynamics ensures heat flows from hot to cold, preventing an object from cooling below ambient temperature.

📊 Applications of logarithms

📊 Linearizing data with logarithms

Scientists use logarithms to transform exponential or power relationships into linear ones, making parameter estimation easier.
For power functions N = Bt^A: Taking ln of both sides gives ln(N) = A·ln(t) + ln(B), a line with slope A and intercept ln(B).
For exponential functions N = Be^(At): Taking ln gives ln(N) = At + ln(B), a line with slope A and intercept ln(B).
Process: Plot ln(N) versus ln(t) for power models, or ln(N) versus t for exponential models; perform linear regression; back-calculate A and B.
Example: H1N1 flu data can be modeled by plotting ln(N) versus ln(t) to test a power function, or ln(N) versus t to test exponential growth.

📊 Choosing appropriate models

Model type	When to use	Key feature
Quadratic	Data shows parabolic shape	May fit well numerically but lacks scientific justification for many phenomena
Power function	Relationship N = Bt^A suspected	Linearize by plotting ln(N) vs ln(t)
Exponential	Early uninhibited growth	Linearize by plotting ln(N) vs t
Logistic	Growth with capacity limit	S-shaped curve; approaches limiting value L

The "best" model depends on both goodness of fit and scientific plausibility.
Example: For flu spread, logistic model accounts for finite population (only so many people can catch it), whereas quadratic predicts unlimited growth—scientifically unreasonable.
Don't confuse: A model that fits historical data well may still fail to predict future values if it doesn't capture the underlying mechanism.

📊 Other logarithmic applications

Information entropy for password strength: H = L·log₂(N), where L is password length and N is number of possible symbols per character; higher entropy means stronger password.
Buffer solutions and pH: Blood pH modeled by pH = 6.1 + log(800/x), where x is partial pressure of carbon dioxide; this is derived from the Henderson-Hasselbalch Equation.
Logarithmic scales (Richter scale for earthquakes, decibels for sound, pH for acidity) compress wide ranges of values into manageable numbers.

Introduction to Conics

7.1 Introduction to Conics

🧭 Overview

🧠 One-sentence thesis

Conic sections—circles, ellipses, parabolas, and hyperbolas—are curves formed by slicing a double-napped cone with a plane at different angles, and their equations are derived from distance definitions.

📌 Key points (3–5)

What conics are: curves created by intersecting a plane with a double-napped cone.
The four main types: circle (horizontal slice), ellipse (slightly tilted), parabola (parallel to cone), and hyperbola (vertical slice).
Degenerate vs non-degenerate: when the plane contains the vertex, you get degenerate cases (point, line, or two intersecting lines); the focus is on non-degenerate cases.
How equations are built: by using definitions based on distances from fixed points.
Common confusion: the shape depends on the angle of the slicing plane—horizontal gives a circle, tilting slightly gives an ellipse, parallel to the cone gives a parabola, and vertical gives a hyperbola.

🍰 How conics are formed

🍰 The double-napped cone

Imagine a cone with two identical parts (nappes) meeting at a vertex, extending infinitely in both directions.
A plane slices through this cone at various angles to produce different curves.

✂️ Four slicing angles, four curves

Plane orientation	Resulting conic	Description
Horizontal	Circle	The plane cuts perpendicular to the axis of the cone
Slightly tilted	Ellipse	The plane is tilted from horizontal but does not cut parallel to the cone
Parallel to the cone	Parabola	The plane runs parallel to the slant of the cone
Vertical	Hyperbola	The plane cuts through both nappes vertically

Example: if you hold a plane flat and slice through the cone horizontally, you see a circular cross-section; tilt it a bit and the circle stretches into an ellipse.

🔻 Degenerate cases

🔻 When the plane contains the vertex

If the slicing plane passes through the vertex of the cone, the intersection collapses into simpler shapes:
- A single point (the vertex itself).
- A single line (if the plane is tangent to the cone).
- Two intersecting lines (if the plane cuts through the vertex at an angle).
These are called degenerate conics because they are limiting or special cases, not the full curves.

🎯 Focus on non-degenerate conics

The chapter focuses on the four main, non-degenerate cases: circles, parabolas, ellipses, and hyperbolas.
These are the curves that have "full" shapes, not collapsed into points or lines.

📐 Building equations from distance

📐 Distance-based definitions

Each conic section can be defined by a relationship involving distances from fixed points or lines.
The excerpt states: "To determine equations which describe these curves, we will make use of their definitions in terms of distances."
This approach translates geometric properties (like "all points at a fixed distance from a center") into algebraic equations.

🔵 Example: the circle

Circle: A circle with center (h, k) and radius r > 0 is the set of all points (x, y) in the plane whose distance to (h, k) is r.

A point (x, y) is on the circle if and only if its distance to the center (h, k) equals r.
Using the distance formula: r = square root of [(x − h)² + (y − k)²].
Squaring both sides (since r > 0, this is equivalent) gives the standard equation:

Standard equation of a circle: (x − h)² + (y − k)² = r²

Example: if the center is at (2, 3) and the radius is 5, the equation is (x − 2)² + (y − 3)² = 25.
Don't confuse: the equation uses (x − h) and (y − k), not (x + h) and (y + k); the signs flip because of the distance formula structure.

Circles

7.2 Circles

🧭 Overview

🧠 One-sentence thesis

A circle is fully determined by its center point and radius, and its equation can be derived from the distance formula and manipulated through completing the square.

📌 Key points (3–5)

Definition: A circle is the set of all points at a fixed distance (radius) from a center point.
Standard equation: The equation (x − h)² + (y − k)² = r² describes a circle with center (h, k) and radius r.
Converting to standard form: Complete the square on both variables to identify the center and radius from expanded equations.
Common confusion: When completing the square, remember to factor out the leading coefficient first if it's not 1, then divide both sides by it at the end.
Special case: The Unit Circle (center at origin, radius 1) has the simplest equation: x² + y² = 1.

📐 Definition and derivation

📐 What defines a circle

A circle with center (h, k) and radius r > 0 is the set of all points (x, y) in the plane whose distance to (h, k) is r.

A circle requires two pieces of information: where it's centered and how big it is.
Every point on the circle is exactly the same distance from the center.
The radius must be positive (r > 0).

🧮 From distance to equation

The standard equation comes directly from the distance formula:

Start with: distance from (x, y) to (h, k) equals r
Apply distance formula: r = √[(x − h)² + (y − k)²]
Square both sides to eliminate the square root: r² = (x − h)² + (y − k)²

Standard equation of a circle:

(x − h)² + (y − k)² = r²

Example: A circle centered at (−2, 3) with radius 5 has equation (x + 2)² + (y − 3)² = 25.

🔄 Converting equations to standard form

🔄 Why complete the square

When a circle equation is expanded (like x² + 4x + y² − 2y + 1 = 0), the center and radius are hidden. Completing the square reveals them.

📝 Step-by-step process

Step	Action	Purpose
1	Group same variables together, move constant to other side	Prepare for completing the square
2	Factor out leading coefficients from each variable group	Make the squared terms have coefficient 1
3	Complete the square on both x and y	Create perfect square trinomials
4	Divide both sides by the coefficient of the squares	Get standard form

⚠️ Watch out for leading coefficients

Example from the excerpt: 3x² − 6x + 3y² + 4y − 4 = 0

Factor out 3 from x terms: 3(x² − 2x)
Factor out 3 from y terms: 3(y² + 4/3 y)
Complete the square: add 3(1) and 3(4/9) to the right side
Final step: divide everything by 3 to get (x − 1)² + (y + 2/3)² = 25/9
Center: (1, −2/3), radius: 5/3

Don't confuse: When you add a value inside parentheses that have a coefficient, you must multiply that value by the coefficient when adding to the other side.

🎯 Special cases and applications

🎯 The Unit Circle

The Unit Circle is the circle centered at (0, 0) with radius 1. Its equation is x² + y² = 1.

This is the most important circle in mathematics (used extensively in trigonometry).
It's the simplest form: h = 0, k = 0, r = 1.

Example: Finding points on the unit circle with y-coordinate √3/2:

Substitute into x² + y² = 1: x² + 3/4 = 1
Solve: x² = 1/4, so x = ±1/2
Points: (1/2, √3/2) and (−1/2, √3/2)

📏 Using endpoints of a diameter

When given two endpoints of a diameter:

The center is the midpoint of the two points (use midpoint formula).
The radius is half the distance between the two points (use distance formula).

Example: Endpoints (−1, 3) and (2, 4)

Center: ((−1+2)/2, (3+4)/2) = (1/2, 7/2)
Radius: (1/2)√[(2−(−1))² + (4−3)²] = (1/2)√10 = √10/2
Equation: (x − 1/2)² + (y − 7/2)² = 10/4

⚠️ Equations that don't represent circles

Not every equation of the form (x − h)² + (y − k)² = c represents a circle:

If c = 0: only one point (the center itself)
If c < 0: no real points satisfy the equation (no graph)
Only when c > 0 do you get an actual circle with radius r = √c

Parabolas

7.3 Parabolas

🧭 Overview

🧠 One-sentence thesis

A parabola can be defined as the set of all points equidistant from a fixed point (the focus) and a fixed line (the directrix), and this distance-based definition leads directly to the standard algebraic equations that describe parabolas opening in any direction.

📌 Key points (3–5)

Distance definition: A parabola is the set of all points equally distant from a focus point F and a directrix line D.
Two orientations: Parabolas can be vertical (opening up/down, with x squared) or horizontal (opening left/right, with y squared).
Key parameters: The vertex is the closest point to the focus; the directed distance p from vertex to focus determines both direction and width.
Common confusion: Only one variable is squared in a parabola equation (x² or y²), whereas circles have both x² and y² squared—this is the quick way to tell them apart.
Focal diameter and reflective property: The width at the focus (latus rectum) is |4p|, and parabolas reflect parallel rays through the focus, making them useful for satellite dishes and flashlights.

📐 Defining parabolas by distance

📏 The focus-directrix definition

Parabola: the set of all points equidistant from a point F (the focus) and a line D (the directrix) not containing F.

This is a geometric definition based purely on distance, not on algebra.
Every point on the parabola satisfies: (distance to focus) = (distance to directrix).
The vertex is the point on the parabola closest to the focus.
The directed distance from the vertex to the focus is denoted p; the same distance separates the vertex from the directrix.

🔍 Deriving the equation from distance

Starting with vertex at (0, 0) and parabola opening upward:

Focus is at (0, p); directrix is the line y = −p.
For any point (x, y) on the parabola, the distance to (0, p) equals the distance to the line y = −p.
Using the distance formula and squaring both sides:
- √[x² + (y − p)²] = √[(y + p)²]
- x² + (y − p)² = (y + p)²
- x² = 4py
Solving for y gives y = x²/(4p), which matches the quadratic form y = ax² with a = 1/(4p).

Why it matters: This shows that the familiar quadratic function graph is exactly the curve defined by equal distances to focus and directrix.

📊 Standard equations for vertical parabolas

📐 Vertex at (h, k)

Standard equation (vertical parabola): (x − h)² = 4p(y − k)

Vertex: (h, k)
Focal length: |p|
If p > 0, the parabola opens upward (focus above vertex).
If p < 0, the parabola opens downward (focus below vertex).
The focus is located |p| units from the vertex in the direction the parabola opens.
The directrix is a horizontal line |p| units from the vertex in the opposite direction.

🎯 Identifying the parts

Element	Location	How to find
Vertex	(h, k)	Read from (x − h)² and (y − k)
Focus	(h, k ± p)	Move p units vertically from vertex
Directrix	y = k ∓ p	Horizontal line p units opposite the focus
Focal diameter	\|4p\|	Width of parabola at the focus

Example: For (x + 1)² = −8(y − 3):

h = −1, k = 3 → vertex (−1, 3)
4p = −8 → p = −2 (negative, so opens downward)
Focus: 2 units below vertex → (−1, 1)
Directrix: 2 units above vertex → y = 5
Focal diameter: |−8| = 8, so the parabola is 8 units wide at the focus (4 units left and right).

📊 Standard equations for horizontal parabolas

📐 Switching x and y roles

Standard equation (horizontal parabola): (y − k)² = 4p(x − h)

Vertex: (h, k)
If p > 0, the parabola opens to the right (focus right of vertex).
If p < 0, the parabola opens to the left (focus left of vertex).
The focus is located |p| units horizontally from the vertex.
The directrix is a vertical line |p| units from the vertex in the opposite direction.

🔄 How to distinguish vertical vs horizontal

Feature	Vertical parabola	Horizontal parabola
Squared variable	x²	y²
Opens	Up or down	Left or right
Directrix	Horizontal line	Vertical line
Standard form	(x − h)² = 4p(y − k)	(y − k)² = 4p(x − h)

Don't confuse: The variable that is squared tells you the orientation—x² means vertical (up/down), y² means horizontal (left/right).

Example: For (y − 2)² = 12(x + 1):

h = −1, k = 2 → vertex (−1, 2)
4p = 12 → p = 3 (positive, so opens right)
Focus: 3 units right of vertex → (2, 2)
Directrix: 3 units left of vertex → x = −4
Focal diameter: 12, so 6 units above and below the focus.

🔧 Converting to standard form

🛠️ Steps to rewrite equations

When given an equation with exactly one squared variable:

Group the squared variable on one side; move the non-squared variable and constant to the other.
Complete the square for the squared variable if needed.
Factor out the coefficient of the non-squared variable from both it and the constant.

📝 Worked example

Given: y² + 4y + 8x = 4

Group: y² + 4y = −8x + 4
Complete the square in y: y² + 4y + 4 = −8x + 4 + 4
Factor: (y + 2)² = −8x + 8
Factor out −8: (y + 2)² = −8(x − 1)

Now in standard form (y − k)² = 4p(x − h):

Vertex: (1, −2)
4p = −8 → p = −2 (opens left)
Focus: 2 units left of vertex → (−1, −2)
Directrix: 2 units right of vertex → x = 3
Focal diameter: 8 (4 units above and below focus)

📏 The latus rectum and focal diameter

📐 What is the latus rectum?

Latus rectum: the line segment parallel to the directrix that passes through the focus, with endpoints on the parabola.

It measures how "wide" the parabola is at the focus.
The length of the latus rectum is called the focal diameter and equals |4p|.
This is useful for sketching: plot points |4p|/2 units on either side of the focus (perpendicular to the axis of symmetry).

Why it matters: The focal diameter gives you two additional points on the parabola, making it easier to draw an accurate graph.

Example: If 4p = 12, the focal diameter is 12, so the parabola is 6 units wide on each side of the focus.

🔭 Reflective property and applications

🛰️ How parabolas focus light and signals

Reflective property: Rays parallel to the axis of symmetry reflect off a parabola and pass through the focus.
Conversely, rays emanating from the focus reflect off the parabola and travel parallel to the axis.

Applications:

Device	How it uses the parabola
Satellite dish	Incoming signals reflect off the parabolic surface and concentrate at the focus, where the receiver is placed
Flashlight	Bulb at the focus; light reflects off parabolic mirror to produce a parallel beam
Paraboloid of revolution	3D shape formed by rotating a parabola; every cross-section through the vertex is a parabola with the same focus

📐 Practical example: satellite dish design

Problem: A satellite dish is 12 feet wide and the receiver (at the focus) is 2 feet above the vertex. How deep is the dish?

Solution:

Assume vertex at (0, 0), opens upward.
Focus 2 feet above vertex → p = 2.
Equation: x² = 4(2)y = 8y.
Dish is 12 feet wide → edge is 6 feet from center (x = 6).
Substitute x = 6: 36 = 8y → y = 36/8 = 4.5 feet.

Answer: The dish is 4.5 feet deep.

🆚 Distinguishing parabolas from circles

🔍 Quick identification

Feature	Circle	Parabola
Squared variables	Both x² and y²	Only x² or y²
Standard form	(x − h)² + (y − k)² = r²	(x − h)² = 4p(y − k) or (y − k)² = 4p(x − h)
Shape	Closed curve	Open curve

Don't confuse: If you see both x² and y² with the same sign, it's a circle (or related conic). If only one variable is squared, it's a parabola.

Ellipses

7.4 Ellipses

🧭 Overview

🧠 One-sentence thesis

An ellipse is defined by two fixed points (foci) such that the sum of distances from any point on the ellipse to both foci equals a constant, and this geometric property determines the ellipse's equation, shape, and reflective characteristics.

📌 Key points (3–5)

Definition by two foci: An ellipse is the set of all points where the sum of distances to two fixed points (foci) equals a constant distance d.
Standard equation structure: The equation has the form (x−h)²/a² + (y−k)²/b² = 1, where (h,k) is the center and a, b determine horizontal and vertical reach.
Major vs minor axis: The longer axis is the major axis (contains the foci and vertices); the shorter is the minor axis (perpendicular to the major axis through the center).
Common confusion—which axis is major: Compare a and b; whichever denominator is larger determines the major axis direction, and c = √(bigger denominator − smaller denominator).
Eccentricity measures roundness: e = c/a (distance center-to-focus / distance center-to-vertex); closer to 0 means more circular, closer to 1 means more elongated.

📐 Defining an ellipse

📐 The two-foci definition

Ellipse: Given two distinct points F₁ and F₂ (the foci) and a fixed distance d, an ellipse is the set of all points (x,y) such that the sum of distances from F₁ and F₂ to (x,y) equals d.

Think of anchoring a string at two points and tracing with a pencil kept taut—the curve is an ellipse.
The sum d₁ + d₂ = d holds for every point on the ellipse.
This is fundamentally different from a circle, which uses one center point and one fixed radius.

🎯 Key parts of an ellipse

Term	Definition
Center	Midpoint of the line segment connecting the two foci
Major axis	Line segment through the center and foci, connecting two opposite ends (the longer axis)
Minor axis	Line segment through the center perpendicular to the major axis (the shorter axis)
Vertices	The two points where the ellipse intersects the major axis
Foci	The two fixed points F₁ and F₂ used in the definition

The center is also the midpoint of the vertices.
The major axis is always longer than the minor axis.

🧮 Standard equation and how to use it

🧮 The standard form

Standard equation of an ellipse: For positive unequal numbers a and b, the equation with center (h,k) is:
(x − h)²/a² + (y − k)²/b² = 1

The values a and b tell you how far to move horizontally and vertically from the center to reach the ellipse.
The center is at (h,k), found by looking at (x−h) and (y−k) in the equation.
Example: In (x+1)²/9 + (y−2)²/25 = 1, the center is (−1, 2); move 3 units left/right (since a²=9, so a=3) and 5 units up/down (since b²=25, so b=5).

🔄 Determining major vs minor axis

If a > b: Major axis is horizontal (along the line y=k); foci lie left and right of center.
If b > a: Major axis is vertical (along the line x=h); foci lie above and below center.
The larger denominator always corresponds to the major axis direction.
Don't confuse: the letters a and b don't inherently mean "horizontal" or "vertical"—you must compare their values.

📏 Finding the foci

The distance from center to each focus is c, where:
c = √(bigger denominator − smaller denominator)
If a > b: c = √(a² − b²), and foci are at (h±c, k).
If b > a: c = √(b² − a²), and foci are at (h, k±c).
Example: For (x+1)²/9 + (y−2)²/25 = 1, since 25 > 9, we have c = √(25−9) = √16 = 4; foci are 4 units above and below center at (−1, −2) and (−1, 6).

🔧 Converting to standard form

🔧 When the equation is not in standard form

If given an equation like x² + 4y² − 2x + 24y + 33 = 0, follow these steps:

Group variables: Move same variables together, constant to the other side.
Complete the square in both x and y.
Divide so the right side equals 1.

🔧 Worked approach

Starting with x² + 4y² − 2x + 24y + 33 = 0:

Group: x² − 2x + 4y² + 24y = −33
Factor out coefficient of y²: x² − 2x + 4(y² + 6y) = −33
Complete squares: (x² − 2x + 1) + 4(y² + 6y + 9) = −33 + 1 + 36
Simplify: (x−1)² + 4(y+3)² = 4
Divide by 4: (x−1)²/4 + (y+3)² = 1

Now in standard form with center (1, −3), a²=4, b²=1.

🎨 Eccentricity and shape

🎨 What eccentricity measures

Eccentricity e = (distance from center to focus) / (distance from center to vertex) = c/a

For an ellipse, 0 < e < 1 (foci are always closer to center than vertices).
Closer to 0: more circular (foci nearly at center).
Closer to 1: more elongated or "eccentric" (foci nearly at vertices).
Example: An ellipse with e ≈ 0.98 is very stretched; one with e ≈ 0.66 is rounder.

🎨 Using eccentricity to find the equation

If given vertices and eccentricity:

Find a from the vertices.
Use e = c/a to find c.
Use c = √(a² − b²) to solve for b².

Example: Vertices at (±5, 0) and e = 1/4 gives a=5, so c = (1/4)·5 = 5/4, then b² = 25 − (5/4)² = 375/16.

🔊 Reflective property and applications

🔊 The whispering gallery effect

Sound waves emanating from one focus reflect off the ellipse and converge at the other focus.
This creates "whispering galleries" where a whisper at one focus is clearly heard at the other, even across a large room.
Famous examples: St. Paul's Cathedral (London), National Statuary Hall (Washington, D.C.).

🔊 Practical calculation

Example: A whispering gallery is 40 feet tall at center and 100 feet wide at floor. How far from the outer wall are the foci?

Center the ellipse at (0,0); width gives a=50, height gives b=40.
Find c = √(50² − 40²) = √900 = 30.
Foci are 30 units from center, so 50−30 = 20 feet from the outer wall.
Don't confuse: measure from the wall (vertex), not from the center.

🔗 Relationship to circles

🔗 Circles as special ellipses

A circle's equation can be written as (x−h)²/r² + (y−k)²/r² = 1.
Notice: same denominator for both terms (a = b = r).
An ellipse has different denominators (a ≠ b).
You can think of an ellipse as a "stretched circle"—the unit circle x² + y² = 1 stretched differently in x and y directions.

Hyperbolas

7.5 Hyperbolas

🧭 Overview

🧠 One-sentence thesis

A hyperbola is the set of all points where the absolute value of the difference of distances to two fixed foci equals a constant, and this difference property generates two symmetric branches that open either horizontally or vertically with asymptotes guiding their shape.

📌 Key points (3–5)

What a hyperbola is: the locus of points where |distance to focus₁ − distance to focus₂| = constant d; this creates two separate branches.
Key parameters: center, transverse axis (contains foci and vertices), conjugate axis (perpendicular through center), and asymptotes (lines the branches approach).
Two standard forms: horizontal (x² term positive) opens left/right; vertical (y² term positive) opens up/down.
Common confusion: hyperbola uses difference of distances (subtraction), while ellipse uses sum (addition); also, the relationship is c² = a² + b² (not a² = b² + c² as in ellipses).
Real-world use: positioning problems like LORAN navigation and earthquake epicenter location use intersecting hyperbolas.

📐 Definition and structure

📐 What defines a hyperbola

Hyperbola: Given two distinct points F₁ and F₂ (the foci) and a fixed distance d, a hyperbola is the set of all points (x, y) such that the absolute value of the difference of the distances from F₁ and F₂ to (x, y) equals d.

The absolute value is crucial: for some points, distance to F₁ minus distance to F₂ equals d; for others, distance to F₂ minus distance to F₁ equals d.
This creates two branches (separate curves), unlike the single closed loop of an ellipse.
Example: If F₁ is at (−5, 0), F₂ at (5, 0), and d = 2, then any point 2 units closer to one focus than the other lies on the hyperbola.

🧩 Key components

Component	Definition	Location
Center	Midpoint of the segment connecting the two foci	Between the foci
Transverse axis	Line segment connecting the two vertices; contains center and foci	Through both branches
Conjugate axis	Perpendicular to transverse axis through center; same length as the segment through a vertex connecting asymptotes	Perpendicular to transverse axis
Vertices	Points where the hyperbola intersects the transverse axis	On the hyperbola, closest to center
Foci	Two fixed points used in the definition	On the transverse axis, outside the vertices
Asymptotes	Lines the branches approach for large x and y values	Pass through center and corners of guide rectangle

The guide rectangle is constructed using the transverse and conjugate axis lengths; its diagonals contain the asymptotes.
Don't confuse: the conjugate axis does not intersect the hyperbola (unlike the minor axis of an ellipse, which touches at endpoints).

📏 Standard equations and parameters

📏 Horizontal hyperbola (opens left/right)

Standard form: (x − h)²/a² − (y − k)²/b² = 1

Center at (h, k).
Transverse axis is horizontal (parallel to x-axis).
Vertices are a units left and right of center: (h ± a, k).
The x² term is positive (being subtracted from), y² term is negative (being subtracted).

📏 Vertical hyperbola (opens up/down)

Standard form: (y − k)²/b² − (x − h)²/a² = 1

Center at (h, k).
Transverse axis is vertical (parallel to y-axis).
Vertices are b units above and below center: (h, k ± b).
The y² term is positive, x² term is negative.

🔢 Parameter relationships

Fixed distance d = 2a: The constant difference in the definition equals the length of the transverse axis (distance between vertices).
Foci distance: c = √(a² + b²), where c is the distance from center to each focus.
Asymptote slopes: For horizontal hyperbola, slopes are ±b/a; for vertical, ±b/a (but applied to the appropriate orientation).
Asymptote equations (horizontal case centered at (h, k)): y − k = ±(b/a)(x − h).

Common confusion: In ellipses, a² = b² + c² (a is largest); in hyperbolas, c² = a² + b² (c is largest because foci are outside vertices).

Example: For (x − 2)²/4 − y²/25 = 1, we have h = 2, k = 0, a² = 4 (so a = 2), b² = 25 (so b = 5). Center is (2, 0), vertices at (0, 0) and (4, 0), c = √(4 + 25) = √29, foci at (2 ± √29, 0), asymptotes y = ±(5/2)(x − 2).

🔄 Converting to standard form

🔄 Procedure for general equations

When given Ax² + Cy² + Dx + Ey + F = 0:

Group variables: Move same variables together on one side, constant on the other.
Complete the square: For both x and y as needed.
Normalize: Divide both sides so the constant becomes 1.

Example from excerpt: 9y² − x² − 6x = 10

Group: 9y² − (x² + 6x) = 10
Complete square in x: 9y² − (x² + 6x + 9) = 10 − 9
Simplify: 9y² − (x + 3)² = 1
Normalize: y²/(1/9) − (x + 3)²/1 = 1

This is vertical (y² positive), center (−3, 0), a = 1, b = 1/3.

🔍 Identifying conic type from general form

For Ax² + Cy² + Dx + Ey + F = 0 (non-degenerate conic):

Condition	Conic type	Why
Only one variable squared	Parabola	No balance between x² and y²
A = C (same coefficient)	Circle	Perfect symmetry
A ≠ C, same sign	Ellipse	Both squared terms added
A and C opposite signs	Hyperbola	One squared term subtracted

The key for hyperbola: look for a difference (subtraction) of squared terms, or equivalently, A and C have opposite signs.
Don't confuse: both ellipse and hyperbola have two squared terms, but ellipse adds them (AC > 0), hyperbola subtracts them (AC < 0).

🌍 Applications: positioning problems

🌍 Trilateration with hyperbolas

The excerpt demonstrates locating an object (Sasquatch) using time differences in sound detection:

Setup:

Jeff and Carl are 10 miles apart (foci of a hyperbola).
Jeff hears a call 9 seconds before Carl.
Sound travels at 760 mph.

Calculation:

Distance difference = 760 miles/hour × (1 hour/3600 seconds) × 9 seconds = 1.9 miles.
This means the object is 1.9 miles closer to Jeff than to Carl.
By definition, the object lies on a hyperbola with foci at Jeff and Carl, where d = 1.9.

Finding the equation:

Place Jeff at (−5, 0) and Carl at (5, 0), so c = 5.
Since d = 2a, we have 2a = 1.9, so a = 0.95, a² = 0.9025.
Using c² = a² + b²: b² = 25 − 0.9025 = 24.0975.
Equation: x²/0.9025 − y²/24.0975 = 1.
Since the object is closer to Jeff (western focus), it's on the left branch.

🌍 Pinpointing with a third observer

A third observer (Kai) 6 miles north of Jeff hears the call 18 seconds after Jeff.
This creates a second hyperbola with Jeff and Kai as foci.
Distance difference = 760 × (18/3600) = 3.8 miles, so d = 3.8, giving 2b = 3.8, b = 1.9.
The intersection of the two hyperbola branches gives the exact location.
Techniques from systems of non-linear equations are needed for exact coordinates; calculator gives approximately (−0.9629, −0.8113).

Real-world note: This was the basis of LORAN navigation (now largely replaced by GPS). Also used for earthquake epicenter location, though only valid for small distances where Earth can be approximated as flat.

🔭 Other applications mentioned

Cooling towers: Natural draft cooling towers are shaped as hyperboloids of revolution (3D hyperbolas rotated around an axis).
Cassegrain telescope: Uses the reflective property of hyperbolas combined with parabolas.
Eccentricity: Defined as e = (distance center to focus)/(distance center to vertex) = c/a; for hyperbolas, e > 1 always (since c > a).

📊 Graphing and visual features

📊 The guide rectangle method

From the center, move a units along the transverse axis direction (to locate vertices).
From the center, move b units along the conjugate axis direction.
Draw a rectangle with these dimensions centered at the hyperbola's center.
Draw the diagonals of this rectangle—these are the asymptotes.
Sketch the branches opening toward the vertices, approaching but never touching the asymptotes.

Example: For (x − 2)²/4 − y²/25 = 1:

Center (2, 0), a = 2 (horizontal), b = 5 (vertical).
Rectangle corners at (0, 5), (4, 5), (4, −5), (0, −5).
Asymptotes pass through (2, 0) with slopes ±5/2.
Branches open left and right through vertices (0, 0) and (4, 0).

📊 Distinguishing horizontal vs vertical

Horizontal (opens left/right): x² term is positive (first in subtraction), transverse axis horizontal, vertices have same y-coordinate.
Vertical (opens up/down): y² term is positive (first in subtraction), transverse axis vertical, vertices have same x-coordinate.

Don't confuse: The sign in the standard form tells you which way it opens, not the size of a vs b. The positive term's variable determines the opening direction.

Systems of Linear Equations: Gaussian Elimination

8.1 Systems of Linear Equations: Gaussian Elimination

🧭 Overview

🧠 One-sentence thesis

Gaussian Elimination transforms any system of linear equations into triangular form through systematic row operations, enabling us to determine whether the system has one solution, infinitely many solutions, or no solution.

📌 Key points (3–5)

What a system of linear equations is: finding all pairs (or tuples) of values that satisfy multiple linear equations simultaneously; geometrically, finding where lines (or planes) intersect.
Three possible outcomes: consistent independent (exactly one solution, no free variables), consistent dependent (infinitely many solutions, has free variables), or inconsistent (no solution, leads to contradiction).
The algorithm: Gaussian Elimination uses three legal moves—swap equations, multiply an equation by a nonzero constant, or replace an equation with itself plus a multiple of another—to reach triangular form, then back-substitute to find solutions.
Common confusion—consistent dependent vs inconsistent: both can eliminate all variables in a row, but dependent systems yield an identity (0 = 0) while inconsistent systems yield a contradiction (0 = nonzero number).
Parametric solutions: when free variables exist, we assign them parameters (like t or s) and express other variables in terms of these parameters to describe the infinite solution set.

🔍 What is a system of linear equations

🔍 Linear equations in multiple variables

A linear equation in n variables x₁, x₂, ..., xₙ is an equation of the form a₁x₁ + a₂x₂ + ... + aₙxₙ = c where a₁, a₂, ..., aₙ and c are real numbers and at least one coefficient is nonzero.

The key feature: variables appear only to the first power with numerical coefficients.
Examples of non-linear equations: x² + y = 1 (squared term), xy = 5 (variables multiplied together), e^(2x) + ln(y) = 1 (exponential and logarithm functions).
A system couples two or more such equations together, written with a curly bracket notation.

🎯 What the system asks for

Find all ordered pairs (x, y) or tuples (x, y, z, ...) that satisfy all equations in the system simultaneously.
Geometrically in two variables: find where two lines intersect.
Geometrically in three variables: find where three planes intersect (can be a single point, a line, or no intersection at all).
The excerpt notes that ordered triples like (8/3, 7/2, −1) locate points in three-dimensional space, just as ordered pairs locate points in the plane.

🔧 The three legal moves (Theorem 8.1)

🔧 Operations that preserve solutions

The excerpt lists three moves that produce an equivalent system (same solution set):

Move	What it does	Why it's legal
Interchange equations	Swap the position of any two equations	Order doesn't affect which values satisfy all equations
Multiply by nonzero constant	Replace an equation with a nonzero multiple of itself	Multiplying both sides by the same nonzero number preserves equality
Add a multiple of another equation	Replace an equation with itself plus a nonzero multiple of another	Adding equals to equals preserves equality

The first move may seem pointless initially, but becomes useful when working with many equations and variables.
These moves are the foundation of Gaussian Elimination.

📐 Triangular form and the algorithm

📐 What triangular form means

A system is in triangular form when: (1) variable subscripts increase left to right in each equation, (2) the leading variable in each equation has coefficient 1, (3) the leading variable's subscript increases as you go down the equations, and (4) equations without variables (identities or contradictions) come after equations with variables.

Example from the excerpt:

x − (1/3)y + (1/2)z = 1
y − (1/2)z = 4
z = −1

This form makes solving easy: the last equation gives z directly, substitute into the second to get y, then substitute both into the first to get x.

⚙️ Gaussian Elimination process

The algorithm works systematically:

Start with the first variable (leftmost).
Get a coefficient of 1 for that variable in the top equation (swap equations or multiply).
Eliminate that variable from all equations below (add multiples of the top equation).
Move to the next variable and repeat.
Continue until triangular form is reached.

Example: To eliminate x from equation E₂ when E₁ is x + ... = ..., replace E₂ with (−coefficient of x in E₂) × E₁ + E₂.

🔙 Back-substitution

Once in triangular form, solve from bottom to top.
The last equation gives one variable directly.
Substitute that value into the equation above to find the next variable.
Continue upward until all variables are determined.
The excerpt notes this delays substitution until all elimination is done, which can be inefficient but is systematic.

🎭 Three outcomes: independent, dependent, or inconsistent

✅ Consistent independent (exactly one solution)

The system reaches triangular form with no free variables.
Every variable has a leading equation.
Back-substitution yields unique values for all variables.
Example from the excerpt: system 1 in Example 8.1.1 gives solution (−1, 0, 6).
Geometrically in two variables: two lines intersect at exactly one point.

♾️ Consistent dependent (infinitely many solutions)

During elimination, a row reduces to 0 = 0 (an identity).
Some variables have no leading equation; these become free variables.
Assign parameters (like t or s) to free variables.
Express other variables in terms of these parameters.
Example from the excerpt: system 4 in Example 8.1.1 gives {(t, (1/2)t − 3/2) | −∞ < t < ∞}.
Geometrically in two variables: the two equations represent the same line.
Don't confuse: 0 = 0 means the equations are equivalent, not contradictory.

❌ Inconsistent (no solution)

During elimination, a row reduces to 0 = (nonzero number), like 0 = 6 or 0 = −18.
This is a contradiction that can never be satisfied.
The system has no solution.
Example from the excerpt: system 5 in Example 8.1.1 gives 0 = −18 after elimination.
Geometrically in two variables: the lines are parallel and distinct, so never intersect.
Don't confuse with dependent: 0 = 0 is always true (dependent), but 0 = 6 is never true (inconsistent).

🔍 How to distinguish dependent from inconsistent

Both eliminate all variables in a row, but:

Dependent: left side becomes 0, right side also becomes 0 → identity → infinite solutions.
Inconsistent: left side becomes 0, right side is nonzero → contradiction → no solution.

🧪 Parametric solutions in detail

🧪 When and why they appear

Occur in consistent dependent systems.
The excerpt explains: when a variable has no equation with it as the leading variable, it is a free variable.
Free variables can take any value; other variables adjust accordingly.

🧪 How to write parametric solutions

Solve one of the equations for a variable in terms of others.
Let free variables equal parameters (t, s, etc.).
Express all variables in terms of parameters.
Example from the excerpt: solving 2x − 4y = 6 for y gives y = (1/2)x − 3/2; letting x = t yields {(t, (1/2)t − 3/2) | −∞ < t < ∞}.

🧪 Multiple parametrizations possible

The excerpt notes: "There is no one correct way to parameterize the solution set."
You could solve for x instead of y, or choose different free variables.
Example: the same system can be written as {(2t + 3, t) | −∞ < t < ∞} if you solve for x and let y = t.
Always check your parametric answer by substituting back into the original equations.

🧪 Multiple free variables

System 3 in Example 8.1.2 has two free variables: x₃ = s and x₄ = t.
Solution: {(2 − s − t, 3s + 2t, s, t) | −∞ < s, t < ∞}.
Any choice of s and t determines a solution point (in four-dimensional space, though the excerpt admits this is hard to visualize).

🧩 Special cases and terminology

🧩 Overdetermined systems

More equations than variables.
Example 6 in Example 8.1.1 has three equations in two variables.
Can be consistent or inconsistent.
The excerpt's example is inconsistent: the first two equations intersect at (1, 1), but the third line doesn't pass through that point.

🧩 Underdetermined systems

More variables than equations.
Example 3 in Example 8.1.2 has three equations in four variables.
Can be consistent or inconsistent.
The excerpt states: a consistent underdetermined system of linear equations is necessarily dependent (must have free variables).

🧩 Identities and contradictions

If all coefficients are zero in a linear equation, you get 0 = c.
If c = 0: identity (always true).
If c ≠ 0: contradiction (never true).
The excerpt notes these are not considered linear equations by Definition 8.1, but they play a large role in determining system outcomes.

🧪 Application example: mixture problems

🧪 Setting up the system

Example 8.1.3: Create 500 mL of 40% acid solution from 30% stock, 90% stock, and water.

Define variables: x = amount of 30% stock, y = amount of 90% stock, w = amount of water.
Volume constraint: x + y + w = 500 (total volume).
Acid constraint: 0.30x + 0.90y = 200 (total acid content must be 40% of 500 mL = 200 mL).
Water contributes no acid, so it doesn't appear in the acid equation.

🧪 Solving and interpreting

The system has two equations in three variables (underdetermined).
Gaussian Elimination yields w as a free variable: w = t.
Solution: {(−(3/2)t + 1250/3, (1/2)t + 250/3, t) | −∞ < t < ∞}.
But physical constraints require x ≥ 0, y ≥ 0, w ≥ 0.
Applying these gives 0 ≤ t ≤ 2500/9.

🧪 Practical use

The parametric solution describes all possible combinations.
Example: if you want to use exactly 100 mL of 90% solution, set y = 100, solve for t = 100/3, then find x = 1100/3 mL and w = 100/3 mL.
The excerpt emphasizes checking that this combination produces the required 40% acid mix.

Systems of Linear Equations: Augmented Matrices

8.2 Systems of Linear Equations: Augmented Matrices

🧭 Overview

🧠 One-sentence thesis

Augmented matrices streamline solving systems of linear equations by encoding coefficients and constants into a rectangular array, allowing row operations to transform the system into a form where solutions can be read directly.

📌 Key points (3–5)

What an augmented matrix is: a rectangular array that encodes a system of equations by recording coefficients and constants in rows and columns, separated by a vertical bar.
Row operations mirror equation operations: the three row operations (swap rows, multiply a row, add multiples of rows) produce equivalent systems, just like the moves in Gaussian Elimination.
Row echelon form vs reduced row echelon form: row echelon form has leading 1s stepping to the right with zeros below; reduced row echelon form also requires zeros above each leading 1, yielding solutions without back-substitution.
Common confusion—multiple row echelon forms but one reduced form: a matrix can be put into row echelon form in many ways, but it has exactly one reduced row echelon form.
Why it matters: matrices are a bookkeeping device that makes solving systems faster and programmable; reduced row echelon form (via Gauss-Jordan Elimination) gives solutions instantly.

📦 What matrices are and how they encode systems

📦 Definition of a matrix

Matrix: a rectangular array of real numbers enclosed in square brackets.

Matrices are sized by rows times columns (e.g., 2 by 3 means 2 rows and 3 columns).
Individual numbers in a matrix are called entries.
Entries are labeled with double subscripts: the first subscript is the row number (top to bottom), the second is the column number (left to right).
Example: if A equals a 2 by 3 matrix with first row 3, 0, negative 1 and second row 2, negative 5, 10, then entry a sub 1 1 equals 3, entry a sub 1 2 equals 0, entry a sub 2 3 equals 10, and so on.

🔗 Encoding a system into an augmented matrix

Each equation becomes a row; each variable and the constant gets its own column.
A vertical bar separates the variable coefficients on the left from the constants on the right.
If a variable is missing from an equation, record its coefficient as 0.
Example: the system 2x plus 3y minus z equals 1, 10x minus z equals 2, 4x minus 9y plus 2z equals 5 becomes a 3 by 4 augmented matrix with columns for x, y, z, and the constant, and rows corresponding to the three equations.

Augmented matrix: the matrix formed by appending the constant column to the coefficient matrix, separated by a vertical bar.

🔄 Row operations and their purpose

🔄 The three row operations (Theorem 8.2)

The following row operations produce an augmented matrix corresponding to an equivalent system:

Operation	Description	Effect
Interchange rows	Swap any two rows	Reorders equations
Multiply a row	Replace a row with a nonzero multiple of itself	Scales an equation
Add rows	Replace a row with itself plus a nonzero multiple of another row	Eliminates variables

These operations mirror the moves in Gaussian Elimination (Theorem 8.1) but work directly on the matrix entries.
The goal is to transform the matrix into a simpler form where the solution is easier to extract.

🎯 Why row operations work

Row operations preserve the solution set because they correspond exactly to valid equation manipulations.
Example: replacing row 2 with negative 10 times row 1 plus row 2 is the same as replacing equation 2 with negative 10 times equation 1 plus equation 2.
All moves are determined by the coefficients, not the variable names, so the matrix bookkeeping is sufficient.

🪜 Row echelon form and reduced row echelon form

🪜 Row echelon form (Definition 8.4)

Leading entry: the first nonzero entry (if it exists) in a row.

A matrix is in row echelon form if:

The first nonzero entry in each row is 1 (called a leading 1).
The leading 1 of each row is to the right of the leading 1 in the row above it.
Any all-zero rows are below all nonzero rows.

Row echelon form is the matrix analog of triangular form for systems of equations.
Once in row echelon form, you decode the matrix back into equations and use back-substitution to solve.

✨ Reduced row echelon form (Definition 8.5)

Reduced row echelon form: a matrix in row echelon form where each leading 1 is the only nonzero entry in its column.

This means zeros appear both below and above every leading 1.
When a matrix is in reduced row echelon form, the solution can be read directly without back-substitution.
Example: a reduced row echelon matrix with leading 1s in the first three columns and constants 3, 0, negative 1 decodes immediately to x equals 3, y equals 0, z equals negative 1.

🔁 Gauss-Jordan Elimination

Gauss-Jordan Elimination is the process of putting a matrix into reduced row echelon form.
After getting the matrix into row echelon form (Gaussian Elimination), continue using row operations to zero out entries above each leading 1.
Start with the last leading 1 and work upward; this minimizes arithmetic because more zeros appear first.

🆚 Multiple row echelon forms vs one reduced form

A matrix can be put into row echelon form in infinitely many ways (different sequences of row operations yield different row echelon forms).
However, every matrix has exactly one reduced row echelon form.
Don't confuse: row echelon form is not unique; reduced row echelon form is unique.

🛠️ Solving systems with augmented matrices

🛠️ The general procedure

Encode the system into an augmented matrix.
Apply row operations to put the matrix into row echelon form (or reduced row echelon form).
Decode the matrix back into equations.
Solve by back-substitution (row echelon) or read the solution directly (reduced row echelon).

📝 Example walkthrough (from Example 8.2.1)

System: 3x minus y plus z equals 8, x plus 2y minus z equals 4, 2x plus 3y minus 4z equals 10.
Encode into a 3 by 4 augmented matrix.
Swap rows 1 and 2 to get a leading 1 in row 1.
Use row 1 to zero out the first column below it (replace row 2 with negative 3 times row 1 plus row 2; replace row 3 with negative 2 times row 1 plus row 3).
Scale row 2 to get a leading 1 in the second column.
Use row 2 to zero out the second column below it.
Scale row 3 to get a leading 1 in the third column.
Decode to get the triangular system: x plus 2y minus z equals 4, y minus four sevenths z equals four sevenths, z equals negative 1.
Back-substitute: z equals negative 1, then y equals 0, then x equals 3.
Solution: (3, 0, negative 1).

🎓 Example with reduced row echelon form (from Example 8.2.2)

System with four variables: x sub 2 minus 3 x sub 1 plus x sub 4 equals 2, 2 x sub 1 plus 4 x sub 3 equals 5, 4 x sub 2 minus x sub 4 equals 3.
Encode carefully (pay attention to subscripts and missing variables).
Apply Gauss-Jordan Elimination to reach reduced row echelon form.
Decode: x sub 1 minus five twelfths x sub 4 equals negative five twelfths, x sub 2 minus one fourth x sub 4 equals three fourths, x sub 3 plus five twenty-fourths x sub 4 equals thirty-five twenty-fourths.
x sub 4 is free (assign parameter t); express the other variables in terms of t.
Solution: a parametric family of solutions.

🖩 Using calculators

Graphing calculators can perform row operations and compute reduced row echelon form automatically.
This is especially useful for systems with messy fractions or many variables.
Example 8.2.3 uses a calculator to find a quadratic function through three points by solving a 3 by 3 system.

🌟 Applications and special cases

🌟 Fitting curves to data (Example 8.2.3)

Problem: find the quadratic function f(x) equals a x squared plus b x plus c passing through the points (negative 1, 3), (2, 4), (5, negative 2).
Each point gives one equation: f(negative 1) equals 3 becomes a minus b plus c equals 3, and so on.
Encode the three equations into an augmented matrix and solve for a, b, c.
Solution: a equals negative seven eighteenths, b equals thirteen eighteenths, c equals thirty-seven ninths.
The technique generalizes: fitting a polynomial of degree n through n plus 1 points yields a system of n plus 1 linear equations.

🔍 Consistent dependent systems

When a system has infinitely many solutions, the reduced row echelon form will have one or more free variables.
Example: a row of all zeros in the augmented matrix (including the constant column) indicates a dependent system.
Assign parameters to the free variables and express the other variables in terms of those parameters.
Don't confuse: a row of zeros on the left but a nonzero constant on the right indicates an inconsistent system (no solution).

❌ Inconsistent systems

If a row reduces to all zeros on the left but a nonzero constant on the right (e.g., 0 equals 1), the system is inconsistent.
This corresponds to contradictory equations.
Example: in the exercises, a matrix with a row 0, 0, 0, vertical bar, 1 decodes to 0 equals 1, which is impossible.

🧮 Practical tips

Start with the leftmost column and work right to build leading 1s.
Use row swaps to avoid fractions when possible.
When putting a matrix into reduced row echelon form, start with the last leading 1 and work upward to minimize arithmetic.
Always check your solution by substituting back into the original equations (or use a calculator to verify).

Matrix Arithmetic

8.3 Matrix Arithmetic

🧭 Overview

🧠 One-sentence thesis

Matrix arithmetic extends real number operations to rectangular arrays of numbers, enabling algebraic manipulation that ultimately allows systems of linear equations to be expressed and solved as matrix equations.

📌 Key points (3–5)

Matrix equality: Two matrices are equal only if they have the same dimensions and identical entries in corresponding positions.
Addition and scalar multiplication: Matrices of the same size can be added entry-by-entry, and any matrix can be multiplied by a real number (scalar) by multiplying each entry.
Matrix multiplication: The product AB is defined only when the number of columns in A equals the number of rows in B; the ij-entry of AB is the i-th row of A times the j-th column of B.
Common confusion: Matrix multiplication is not commutative—AB may not equal BA, and one product may be defined while the other is not.
Why it matters: Matrix arithmetic allows systems of linear equations to be written as AX = B, setting up methods for solving them systematically.

📐 Matrix equality and notation

📐 What matrix equality means

Matrix Equality: Two matrices are equal if they are the same size and their corresponding entries are equal.

Formally, if A = [aᵢⱼ]ₘₓₙ and B = [bᵢⱼ]ₚₓᵣ, then A = B provided:
1. m = p and n = r (same dimensions)
2. aᵢⱼ = bᵢⱼ for all positions i, j
It is not enough for matrices to "look similar"—they must have the same numbers in the same spots.
Example: The excerpt shows that a matrix with entries 0, -2, 9, 25, 117, -3 equals a matrix with entries ln(1), ³√(-8), e^(2ln(3)), 125^(2/3), (3²)·13, log(0.001) because these expressions evaluate to the same numbers in the same positions.

🔢 Entry notation

When we write A = [aᵢⱼ]ₘₓₙ, we mean A is an m by n matrix (m rows, n columns).
The subscript i counts rows from top to bottom; j counts columns from left to right.
The entry aᵢⱼ is found in the i-th row and j-th column.
This notation allows precise definitions of matrix operations.

➕ Matrix addition and subtraction

➕ How addition works

Matrix Addition: Given two matrices of the same size, the sum is obtained by adding corresponding entries.

Formally: A + B = [aᵢⱼ]ₘₓₙ + [bᵢⱼ]ₘₓₙ = [aᵢⱼ + bᵢⱼ]ₘₓₙ
Both matrices must have the same dimensions; otherwise addition is undefined.
Example from the excerpt: Adding two 2×3 matrices entry-by-entry produces another 2×3 matrix.

🔄 Properties inherited from real numbers

Property	Statement	Why it holds
Commutative	A + B = B + A	Real number addition is commutative
Associative	(A + B) + C = A + (B + C)	Real number addition is associative
Identity	A + 0ₘₓₙ = A	The zero matrix (all entries 0) acts like 0
Inverse	A + (-A) = 0ₘₓₙ	Every matrix has an additive inverse

The additive inverse -A is obtained by taking the additive inverse of each entry: -A = [-aᵢⱼ]ₘₓₙ.
Don't confuse: The zero matrix 0ₘₓₙ is not the number 0; it is a matrix of the appropriate size with all entries equal to 0.

➖ Subtraction as adding the inverse

Subtraction is defined as A - B = A + (-B), just as with real numbers.
At the entry level: A - B = [aᵢⱼ - bᵢⱼ]ₘₓₙ
In practice, subtract corresponding entries.

✖️ Scalar multiplication

✖️ Multiplying a matrix by a real number

Scalar Multiplication: The product of a real number k and a matrix A is the matrix obtained by multiplying each entry of A by k.

Formally: kA = k[aᵢⱼ]ₘₓₙ = [kaᵢⱼ]ₘₓₙ
The word "scalar" refers to real numbers; scalar multiplication means multiplying a matrix by a real number.
Example: 3A means multiply every entry of A by 3.

🔍 Why "scalar"?

The term comes from "scaling" factors.
A point P(x, y) can be represented by a position matrix P = [x; y].
Multiplying P by a scalar k scales the point: 3P takes (-2, 1) to (-6, 3), magnifying the plane by a factor of 3.
This geometric interpretation explains the terminology.

🧮 Properties of scalar multiplication

Property	Statement
Associative	(kr)A = k(rA)
Identity	1A = A
Additive inverse	-A = (-1)A
Distributive over scalars	(k + r)A = kA + rA
Distributive over matrices	k(A + B) = kA + kB
Zero product	kA = 0ₘₓₙ if and only if k = 0 or A = 0ₘₓₙ

These properties allow us to manipulate matrix equations algebraically, similar to solving linear equations with variables.
Example from the excerpt: Solving 3A - ([matrix] + 5A) = [matrix] + (1/3)[matrix] uses these properties step-by-step, just like solving 3a - (2 + 5a) = -4 + (1/3)(9).

🔗 Matrix multiplication

🔗 Product of a row and a column

Product of a Row and a Column: If Rᵢ is the i-th row of A and Cⱼ is the j-th column of B, then Rᵢ · Cⱼ = aᵢ₁b₁ⱼ + aᵢ₂b₂ⱼ + ... + aᵢₙbₙⱼ.

To multiply a row by a column, the number of entries in the row must match the number of entries in the column.
Multiply corresponding entries and add the results.
Example from the excerpt: For row R₁ = [2, 0, -1] and column C₁ = [3; 4; 5], we get R₁·C₁ = (2)(3) + (0)(4) + (-1)(5) = 6 + 0 - 5 = 1.

🔗 Defining matrix multiplication

Matrix Multiplication: If A = [aᵢⱼ]ₘₓₙ and B = [bᵢⱼ]ₙₓᵣ, then AB = [Rᵢ · Cⱼ]ₘₓᵣ.

The ij-entry of AB is the i-th row of A times the j-th column of B.
Dimension requirement: The number of columns in A must equal the number of rows in B.
The product AB has as many rows as A and as many columns as B: (m×n)(n×r) = (m×r).
Example: If A is 2×3 and B is 3×4, then AB is defined and is 2×4, but BA is not defined (4 ≠ 2).

⚠️ Non-commutativity and other surprises

Matrix multiplication is not commutative: AB may not equal BA.
- Even when both products are defined, they may have different dimensions.
- Example from the excerpt: If A is 2×3 and B is 3×2, then AB is 2×2 but BA is 3×3.
Zero product property fails: AB can equal the zero matrix even when neither A nor B is zero.
- Example from the excerpt: A specific 2×3 matrix times a 3×2 matrix gives the 2×2 zero matrix, yet neither factor is zero.
Don't confuse: Unlike real numbers, "AB = 0" does not imply "A = 0 or B = 0."

🧩 Properties that do hold

Property	Statement
Associative	(AB)C = A(BC)
Associative with scalars	k(AB) = (kA)B = A(kB)
Distributive	A(B ± C) = AB ± AC and (A ± B)C = AC ± BC

These properties allow algebraic manipulation of matrix products.
The excerpt emphasizes: Matrix multiplication inherits some—but not all—properties from real number arithmetic.

🆔 The identity matrix

🆔 What the identity matrix is

Identity Matrix: The k×k identity matrix Iₖ = [dᵢⱼ]ₖₓₖ where dᵢⱼ = 1 if i = j, and dᵢⱼ = 0 otherwise.

The identity matrix is a square matrix (same number of rows and columns).
It has 1's along the main diagonal (where i = j) and 0's everywhere else.
Examples from the excerpt: I₁ = [1], I₂ = [[1,0],[0,1]], I₃ = [[1,0,0],[0,1,0],[0,0,1]], etc.

🆔 How it acts as an identity

For any m×n matrix A: IₘA = A and AIₙ = A.
The size of the identity matrix must match the appropriate dimension.
Example from the excerpt: For a 2×3 matrix A, I₂A = A (using the 2×2 identity on the left) and AI₃ = A (using the 3×3 identity on the right).
Don't confuse: The identity matrix is not always the same size; it must be chosen to make the multiplication compatible.

🧪 Applications and examples

🧪 Matrix polynomials

Just as x² means x times itself, C² denotes the matrix C times itself.
Expressions like C² - 5C + 10I₂ are evaluated by computing powers, scalar multiples, and sums.
Example from the excerpt: For C = [[1,-2],[3,4]], the expression C² - 5C + 10I₂ evaluates to the zero matrix.
Matrix algebra can be used to expand products like (M - 2I₄)(M + 3I₄), similar to expanding (x - 2)(x + 3), but respecting the non-commutativity of matrix multiplication.

🔄 Geometric transformations

A point P(x, y) can be represented as a position matrix P = [x; y].
Multiplying by a matrix can rotate, scale, or otherwise transform points.
Example from the excerpt: The matrix R = [[√2/2, -√2/2],[√2/2, √2/2]] rotates points counterclockwise by 45°.
- The point (2, -2) is moved to (2√2, 0).
- The hyperbola x² - y² = 4 is rotated to the curve y = 2/x.
This shows how matrix multiplication encodes geometric operations.

📊 Encoding systems of linear equations

A system of linear equations can be written as a matrix equation AX = B.
- A is the coefficient matrix (entries from the coefficients of the variables).
- X is the unknowns matrix (a column of variables).
- B is the constant matrix (a column of constants).
Example from the excerpt: The system {3x - y + z = 8, x + 2y - z = 4, 2x + 3y - 4z = 10} becomes [[3,-1,1],[1,2,-1],[2,3,-4]][x;y;z] = [8;4;10].
Finding a solution (x, y, z) to the system corresponds to finding a matrix X such that AX = B.
This sets up the question: Can we "divide" both sides by A to solve for X? (This is addressed in Section 8.4.)

🔢 Stochastic processes (from exercises)

Matrices can model transitions in systems over time.
A state matrix X represents quantities at a given time; a transition matrix Q describes how the state changes.
The state at the next time step is QX; after n steps, it is QⁿX.
Example from the excerpt: Newspaper subscriptions in a village can be modeled with a transition matrix, and repeated multiplication shows the system approaching a steady state where QXₛ = Xₛ.
This illustrates how matrix arithmetic models real-world processes.

Systems of Linear Equations: Matrix Inverses

8.4 Systems of Linear Equations: Matrix Inverses

🧭 Overview

🧠 One-sentence thesis

Matrix inverses provide a powerful method to solve entire families of linear systems with the same coefficient matrix by performing row operations once to find A⁻¹, then solving any system AX = B through a single multiplication A⁻¹B.

📌 Key points (3–5)

What a matrix inverse is: For an n × n matrix A, the inverse A⁻¹ satisfies A⁻¹A = AA⁻¹ = Iₙ (the identity matrix).
How to find it: Use the super-sized augmented matrix [A | Iₙ] and apply Gauss-Jordan elimination to transform it into [Iₙ | A⁻¹].
Why it matters for systems: If A is invertible, the system AX = B has exactly one solution X = A⁻¹B, meaning the system is consistent and independent.
Common confusion: Not all matrices are invertible—only square matrices can be invertible, and even then, not all square matrices have inverses.
Efficiency advantage: Finding A⁻¹ once allows you to solve any system with coefficient matrix A using just one matrix multiplication, avoiding repeated row operations.

🔍 What is an invertible matrix?

🔍 Definition and basic properties

Invertible matrix: An n × n matrix A is invertible if there exists a matrix A⁻¹ (read "A-inverse") such that A⁻¹A = AA⁻¹ = Iₙ.

Only square matrices (n × n) can be invertible, because the definition requires both products A⁻¹A and AA⁻¹ to equal the same identity matrix.
The inverse A⁻¹ must have the same dimensions as A (also n × n).
Don't confuse: Being square is necessary but not sufficient—some square matrices are not invertible (see Exercise 3 where matrix C is not invertible).

🎯 Uniqueness of the inverse

Theorem 8.6, property 1: If A is invertible, then A⁻¹ is unique.

The proof uses a "suppose two inverses exist" argument: if both B and C act as inverses for A, then B = IₙB = (CA)B = C(AB) = CIₙ = C, so they must be the same matrix.
This means you can safely refer to "the" inverse of A, not "an" inverse.

🔗 Connection to systems of equations

Theorem 8.6, property 2: A is invertible if and only if AX = B has a unique solution for every n × r matrix B.

If A is invertible: The solution is X = A⁻¹B, which is unique because A⁻¹ is unique.
Conversely, if every such system has a unique solution: Then in particular AX = Iₙ has a unique solution X₀, and this X₀ must be A⁻¹.
Implication for systems: When A is the coefficient matrix of a system and A is invertible, the system is always consistent and independent (exactly one solution).

🛠️ How to find a matrix inverse

🛠️ The super-sized augmented matrix method

Core procedure: To find A⁻¹ for an n × n matrix A, form the augmented matrix [A | Iₙ] and use Gauss-Jordan elimination to transform it into [Iₙ | A⁻¹].

Start with the identity matrix Iₙ on the right side of the augmented matrix.
Apply row operations to transform the left side (A) into the identity matrix.
The right side simultaneously transforms into A⁻¹.
Why this works: The row operations that "undo" A to make it the identity are exactly the operations encoded in A⁻¹.

🧮 Motivation from solving AA⁻¹ = I₂

The excerpt walks through finding A⁻¹ for a 2 × 2 matrix by setting up AA⁻¹ = I₂:

If A⁻¹ = [x₁ x₂; x₃ x₄], then multiplying out AA⁻¹ = I₂ gives two systems of equations.
Both systems have the same coefficient matrix A, differing only in their constant terms.
The constants come from the columns of I₂: [1; 0] for the first system, [0; 1] for the second.
Key insight: Since both systems use the same coefficient matrix, we can solve them simultaneously by putting both constant columns into one super-sized augmented matrix [A | I₂].

📋 Step-by-step example

For A = [2 -3; 3 4]:

Form [A | I₂] = [2 -3 | 1 0; 3 4 | 0 1]
Apply Gauss-Jordan elimination (the excerpt shows detailed steps)
Result: [I₂ | A⁻¹] = [1 0 | 4/17 3/17; 0 1 | -3/17 2/17]
Therefore A⁻¹ = [4/17 3/17; -3/17 2/17]

Verification: Check both AA⁻¹ = I₂ and A⁻¹A = I₂ (the excerpt confirms both work).

🔄 Why the same row operations work for both systems

When solving two systems with the same coefficient matrix but different constants, the row operations needed to reduce the coefficient matrix are identical.
Only what happens to the constants (right side of the augmented matrix) differs.
By keeping the constants in separate columns, we track both solutions simultaneously.

💡 Using matrix inverses to solve systems

💡 The solution formula

For a system written as AX = B where A is invertible:

Solution: X = A⁻¹B

This mimics solving 3x = 5 by multiplying both sides by 3⁻¹: x = 3⁻¹(5).
The derivation uses associativity: A⁻¹(AX) = A⁻¹B → (A⁻¹A)X = A⁻¹B → IₙX = A⁻¹B → X = A⁻¹B.

Verification: Substitute back: A(A⁻¹B) = (AA⁻¹)B = IₙB = B ✓

🎯 Solving multiple systems efficiently

Example 8.4.1 demonstrates the power of this method:

Given A = [3 1 2; 0 -1 5; 2 1 4], first find A⁻¹ once using row operations.
Then solve three different systems (parts 2a, 2b, 2c) by computing A⁻¹B for different B matrices.
Each solution requires only one matrix multiplication instead of full row reduction.

System	B matrix	Solution via X = A⁻¹B
(a)	[26; 39; 117]	(-39, 91, 26)
(b)	[4; 2; 5]	(5/13, 19/13, 9/13)
(c)	[1; 0; 0]	(9/13, -10/13, -2/13)

Key advantage: Once A⁻¹ is found, solving additional systems with the same coefficient matrix is fast—just one multiplication.

⚡ When this method is most useful

When you need to solve many systems with the same coefficient matrix but different constants.
The excerpt's electronics example (Example 8.4.2) shows this: same circuit (same A), different battery voltages (different B values).
Finding A⁻¹ does all the row operations "once and for all."

🔌 Application: Circuit analysis

🔌 The electronics problem setup

Example 8.4.2: A circuit with two batteries (VB₁, VB₂) and six resistors (R₁ through R₆) produces a system relating voltages to mesh currents (i₁, i₂, i₃, i₄).

The system has coefficient matrix A determined by the resistances.
The constant matrix B is determined by the battery voltages.
Changing battery voltages changes B but not A.

🔌 Part 1: Finding currents for different voltages

With all resistances = 1 kΩ, the coefficient matrix A is fixed. The excerpt finds A⁻¹ using a calculator, then solves four systems:

VB₁	VB₂	Interpretation	Currents (i₁, i₂, i₃, i₄)
10V	5V	Both batteries active	(10.625, 6.25, 3.125, 5) mA
10V	0V	Only first battery	(16.25, 12.5, 11.25, 10) mA
0V	10V	Only second battery	(-11.25, -12.5, -16.25, -10) mA
10V	10V	Batteries oppose equally	(5, 0, -5, 0) mA

Observation: In part (d), currents i₂ and i₄ are zero because the batteries push equally in opposite directions, canceling out.

🔌 Part 2: Finding resistances from known currents

Now the unknowns are the resistances, given voltages and currents from part 1(a).

The system is 4 equations with 6 unknowns (4 × 6 coefficient matrix).
This matrix is not square, so not invertible.
The system is underdetermined but consistent (we know R₁ = ... = R₆ = 1 is a solution).
Using row reduction gives a dependent system with free variables R₃ = s and R₆ = t.
Physical constraints (resistances must be positive) restrict s and t to a region: {(s,t) : 0 < s < 16/7, 0 < t < 8/3, 3.5s + 1.5t - 4 > 0}.

Don't confuse: This part doesn't use matrix inverses because the coefficient matrix isn't square; it uses standard row reduction instead.

⚠️ Important limitations and notes

⚠️ When matrices are not invertible

Only square matrices can be invertible (necessary condition).
Not all square matrices are invertible (see Exercise 3: matrix C is not invertible).
The excerpt doesn't explain how to tell if a square matrix is invertible before attempting to find the inverse, but the row reduction process will reveal it: if you cannot transform A into Iₙ, then A is not invertible.

⚠️ Computational considerations

For larger matrices (3 × 3 and above), the excerpt uses a calculator to find inverses and perform multiplications.
The manual row operations are shown in detail for the 3 × 3 example to illustrate the method.
Example: Finding A⁻¹ for the 3 × 3 matrix in Example 8.4.1 requires many row operations (shown step-by-step), but once found, solving systems is quick.

⚠️ Matrix multiplication is not commutative

The definition requires checking both A⁻¹A = Iₙ AND AA⁻¹ = Iₙ because matrix multiplication order matters.
The excerpt verifies both products in examples to confirm the inverse is correct.
Contrast with real numbers: For real numbers, 3⁻¹ · 3 = 3 · 3⁻¹ automatically, but for matrices, you must verify both orders.

Determinants and Cramer's Rule

8.5 Determinants and Cramer’s Rule

🧭 Overview

🧠 One-sentence thesis

The determinant of a square matrix is a single real number that reveals whether the matrix is invertible and enables solving linear systems through Cramer's Rule and finding inverses via the adjoint method.

📌 Key points (3–5)

What the determinant is: a real number assigned to each square matrix, computed recursively by reducing n×n matrices to sums of (n−1)×(n−1) determinants.
Connection to invertibility: a matrix A is invertible if and only if det(A) ≠ 0; when det(A) = 0, you cannot "divide" by that matrix.
Cramer's Rule: solves consistent independent systems by computing determinants—each unknown equals det(A_j)/det(A), where A_j replaces one column with constants.
Common confusion: the determinant is not the same as absolute value (despite the |A| notation); it can be negative, zero, or positive, and it measures invertibility, not size.
Adjoint method for inverses: A⁻¹ = (1/det(A)) × adj(A), where adj(A) is built from cofactors with swapped indices.

🔢 Definition and computation of determinants

🔢 Recursive definition

Determinant of A, denoted det(A): For a 1×1 matrix [a₁₁], det(A) = a₁₁. For n×n matrices with n > 1, det(A) = a₁₁ det(A₁₁) − a₁₂ det(A₁₂) + − ... + (−1)^(1+n) a₁ₙ det(A₁ₙ).

The definition is recursive: you break down an n×n determinant into a sum of (n−1)×(n−1) determinants.
A_ij notation: the matrix formed by deleting row i and column j from A.
Example: a 2×2 determinant reduces to two 1×1 determinants; a 3×3 reduces to three 2×2 determinants, and so on.
The excerpt calls this "expanding along the first row" because you use entries a₁₁, a₁₂, ..., a₁ₙ from row 1.

🧮 The 2×2 formula

For a 2×2 matrix, the determinant simplifies to:

det([a b; c d]) = ad − bc

This is the product of the main diagonal minus the product of the off-diagonal.
Worth memorizing because it appears frequently and is the base case for larger matrices.
Example: det([4 −3; 2 1]) = 4·1 − (−3)·2 = 4 + 6 = 10.

🔄 Expanding along any row

Definition 8.13 expands along the first row, but Theorem 8.7 says you can expand along any row (or column).
Expanding along row k: det(A) = a_k1 C_k1 + a_k2 C_k2 + ... + a_kn C_kn, where C_ij is the cofactor.
Cofactor C_ij: (−1)^(i+j) × det(A_ij). The sign alternates in a checkerboard pattern: [+ − +; − + −; + − +] for 3×3.
Strategic tip: expand along a row (or column) with zeros to reduce computation—each zero term vanishes.
Example: the excerpt expands a 3×3 matrix along the second row (which has a zero) to save work.

Don't confuse: the sign pattern (+ or −) in the cofactor formula is separate from the sign of the matrix entry itself; both contribute to the final term's sign.

🔧 Properties and row operations

🔧 Key properties from Theorem 8.7

Property	Effect on determinant	Why it matters
Swap two rows	det(A′) = − det(A)	Sign flips
Multiply a row by c ≠ 0	det(A′) = c det(A)	Scales determinant
Add a multiple of one row to another	det(A′) = det(A)	No change—useful for simplification
Two identical rows or a zero row	det(A) = 0	Matrix is not invertible
Upper or lower triangular	det(A) = product of main diagonal	Easy computation

These properties mirror the allowable row operations from Gauss-Jordan elimination.
Because replacing a row with itself plus a multiple of another row does not change the determinant, you can use row operations to simplify a matrix to upper triangular form, then multiply the diagonal entries.
Example: the excerpt transforms a 3×3 matrix A → B → C (upper triangular) using row operations; det(A) = det(B) = det(C) = (3)(−1)(13/3) = −13.

🔗 Determinant and invertibility

Core connection: A is invertible ⟺ det(A) ≠ 0.
Why: if det(A) ≠ 0, row operations can transform A to the identity I_n (whose determinant is 1 ≠ 0), so A is invertible. Conversely, if A is invertible, it can be transformed to I_n, so det(A) must be nonzero.
The determinant determines invertibility—hence the name.
If det(A) = 0, the matrix is singular (not invertible); you cannot "divide" by it, just as you cannot divide a real number by zero.
For products: det(AB) = det(A) det(B), and det(A⁻¹) = 1/det(A) when A is invertible.

Don't confuse: det(A) = 0 does not mean A is the zero matrix; it means A has no inverse and its rows (or columns) are linearly dependent.

🎯 Cramer's Rule for solving systems

🎯 What Cramer's Rule says

Cramer's Rule (Theorem 8.8): For a system AX = B with n equations in n unknowns, if det(A) ≠ 0, the system is consistent and independent, and each unknown x_j = det(A_j)/det(A), where A_j is A with column j replaced by the constants in B.

When to use: only when the coefficient matrix is square and det(A) ≠ 0 (i.e., the system has a unique solution).
How it works: you compute n+1 determinants—det(A) once, then det(A_j) for each unknown.
Each A_j is formed by replacing the column of coefficients for x_j with the constant vector B.
Example: to solve {2x₁ − 3x₂ = 4; 5x₁ + x₂ = −2}, form A = [2 −3; 5 1], A₁ = [4 −3; −2 1], A₂ = [2 4; 5 −2]. Then x₁ = det(A₁)/det(A) = −2/17, x₂ = det(A₂)/det(A) = −24/17.

🧪 Practical example

The excerpt solves a 3×3 system for z only:

Identify z as x₃, so A₃ replaces the third column of the coefficient matrix with the constants.
Compute det(A) and det(A₃) by expanding along a row with zeros.
Result: z = det(A₃)/det(A) = −12/(−10) = 6/5.
You can solve for x and y similarly by forming A₁ and A₂.

Don't confuse: Cramer's Rule requires det(A) ≠ 0; if det(A) = 0, the system is either inconsistent or dependent, and Cramer's Rule does not apply.

🧩 Minors, cofactors, and the adjoint

🧩 Minors and cofactors

Minor M_ij: det(A_ij), the determinant of the matrix obtained by deleting row i and column j from A.

Cofactor C_ij: (−1)^(i+j) M_ij = (−1)^(i+j) det(A_ij).

The cofactor adds a sign to the minor based on the checkerboard pattern.
Cofactors appear in the expansion of the determinant: det(A) = a_k1 C_k1 + a_k2 C_k2 + ... + a_kn C_kn when expanding along row k.
Example: for a 3×3 matrix A, C₁₁ = det(A₁₁), C₁₂ = − det(A₁₂), C₁₃ = det(A₁₃), etc.

🔄 The adjoint matrix

Adjoint of A, denoted adj(A): the matrix whose ij-entry is the ji-cofactor C_ji (note the index swap).

In other words, adj(A) is the transpose of the cofactor matrix.
For a 3×3 matrix, adj(A) = [C₁₁ C₂₁ C₃₁; C₁₂ C₂₂ C₃₂; C₁₃ C₂₃ C₃₃].
The excerpt demonstrates this by solving AX = I₃ using Cramer's Rule and discovering that the entries of A⁻¹ are cofactors divided by det(A).

🔑 Inverse via the adjoint (Theorem 8.9)

Theorem 8.9: If A is invertible, then A⁻¹ = (1/det(A)) adj(A).

This gives an alternative to the Gauss-Jordan method for finding inverses.
For 2×2 matrices, this simplifies to: [a b; c d]⁻¹ = (1/(ad−bc)) [d −b; −c a].
Example: the excerpt computes all nine cofactors of a 3×3 matrix A, forms adj(A), and verifies A⁻¹ = (−1/13) adj(A).
Why the 13's appear: det(A) = −13, so every entry of A⁻¹ has denominator 13.

Don't confuse: the adjoint is not the transpose; it is the transpose of the cofactor matrix. The indices are swapped: the ij-entry of adj(A) is C_ji, not C_ij.

Partial Fraction Decomposition

8.6 Partial Fraction Decomposition

🧭 Overview

🧠 One-sentence thesis

Partial fraction decomposition reverses the common-denominator process to rewrite a single rational function as a sum of simpler fractions, which is essential for certain calculus applications.

📌 Key points (3–5)

What it does: transforms a rational function from one fraction with a factored denominator into a sum of simpler fractions with smaller denominators.
Two-step process: first determine the form of the decomposition based on the factored denominator, then solve for the unknown coefficients.
Form depends on factors: linear factors (like x − 1) contribute constant numerators; irreducible quadratics (like x² + 1) contribute linear numerators; repeated factors require multiple terms.
Common confusion: repeated linear factors vs. distinct factors—a factor like (x − 1)² requires two terms: A/(x − 1) + B/(x − 1)², not just one.
Finding coefficients: clear denominators, expand, equate coefficients of like powers of x, then solve the resulting system of linear equations.

🔄 The reverse process

🔄 What partial fraction decomposition reverses

In Intermediate Algebra, you combine fractions by finding a common denominator:
- Example: A/x + B/x² becomes (Ax + B)/x²
Partial fraction decomposition goes the opposite direction:
- Start with (x² − x − 6)/(x⁴ + x²)
- End with −1/x − 6/x² + (x + 7)/(x² + 1)
Why it matters: certain calculus techniques require the separated form rather than the combined form.

📐 Proper rational functions

A rational function R(x) = N(x)/D(x) is proper when the degree of the numerator N(x) is less than the degree of the denominator D(x).

The excerpt assumes the rational function is proper and fully reduced (no common factors between numerator and denominator).
If the function is not proper, use polynomial long division first to separate the polynomial part from the proper fraction part.
Example: 4x³/(x² − 2) is not proper, so divide to get 4x + 8x/(x² − 2), then decompose only the proper part 8x/(x² − 2).

🧱 Determining the form

🧱 Factor the denominator completely

Factor the denominator into linear factors (ax + b) and irreducible quadratic factors (ax² + bx + c).
Linear factors: correspond to real zeros; can be factored as (x − zero).
Irreducible quadratics: have no real zeros; check the discriminant b² − 4ac < 0 to confirm irreducibility.
Multiplicity matters: if a zero appears m times, the factor is repeated m times.
- Example: x²(x² + 1) means x = 0 has multiplicity 2, so the factor x appears twice: x · x.

🧱 Apply the decomposition rules (Theorem 8.10)

Factor type	Multiplicity	Terms in decomposition
Linear (ax + b)	m	A₁/(ax + b) + A₂/(ax + b)² + ... + Aₘ/(ax + b)ᵐ
Irreducible quadratic (ax² + bx + c)	m	(B₁x + C₁)/(ax² + bx + c) + (B₂x + C₂)/(ax² + bx + c)² + ... + (Bₘx + Cₘ)/(ax² + bx + c)ᵐ

Linear factors: numerators are constants (A, B, C, ...).
Irreducible quadratics: numerators are linear polynomials (Bx + C, Dx + E, ...).
Each power of a repeated factor up to its multiplicity gets its own term.

🔍 Handling repeated factors carefully

A factor like (x − 1)² means x = 1 is a zero of multiplicity 2.
You need two terms: A/(x − 1) + B/(x − 1)².
Don't confuse: (x − 1) and (1 − x) are the same factor up to sign; rewrite (1 − x) = −(x − 1) to group them correctly.
- Example: (x + 1)(x − 1)(1 − x)(2 + x) = −(x − 1)²(x + 1)(x + 2).

🔍 Simplifying the form

Sometimes the initial guess includes redundant terms.
Example: if you write A/x + (Bx + C)/x², notice that (Bx + C)/x² = B/x + C/x², so the B/x term duplicates A/x.
Drop the redundant term and relabel: A/x + C/x² is sufficient.

🔢 Finding the unknown coefficients

🔢 Clear denominators

Multiply both sides by the common denominator (the original factored denominator).
This eliminates all fractions and gives a polynomial equation.
Example: starting from (x² − x − 6)/(x²(x² + 1)) = A/x + B/x² + (Cx + D)/(x² + 1), multiply through by x²(x² + 1) to get:
- x² − x − 6 = Ax(x² + 1) + B(x² + 1) + (Cx + D)x²

🔢 Expand and collect like terms

Expand the right-hand side fully.
Group terms by powers of x: x³ terms, x² terms, x terms, and constant terms.
Example: x² − x − 6 = (A + C)x³ + (B + D)x² + Ax + B

🔢 Equate coefficients (Theorem 8.11)

Theorem 8.11: If two polynomials are equal for all x in an interval, then the coefficients of corresponding powers of x are equal.

Match coefficients on both sides:
- Coefficient of x³: A + C = 0
- Coefficient of x²: B + D = 1
- Coefficient of x: A = −1
- Constant term: B = −6
This gives a system of linear equations.

🔢 Solve the system

Use substitution, addition, or any method from earlier sections (8.1–8.5).
The excerpt notes that substitution is often the most efficient for these systems.
Example: from A = −1 and A + C = 0, substitute to get C = 1; from B = −6 and B + D = 1, get D = 7.

🧪 Special cases and techniques

🧪 Quadratics in disguise

Some polynomials look irreducible but can be factored by recognizing a hidden pattern.
Substitution trick: if the exponent on one term is twice another, try substituting u = x².
- Example: x⁴ + 6x² + 9 = (x²)² + 6x² + 9 = (x² + 3)².
Difference of squares: add and subtract a term to create a factorable pattern.
- Example: x⁴ + 16 = (x²)² + 4² = (x² + 4)² − 8x² = (x² − 2x√2 + 4)(x² + 2x√2 + 4).

🧪 Checking irreducibility

For a quadratic ax² + bx + c, compute the discriminant b² − 4ac.
If b² − 4ac < 0, the quadratic has no real zeros and is irreducible over the reals.
Even if two quadratics have the same discriminant, they may have different (non-real) zeros, so treat them as separate factors.

🧪 Non-proper rational functions

If the degree of the numerator ≥ degree of the denominator, the function is not proper.
Use polynomial long division first to write R(x) = (polynomial) + (proper fraction).
Decompose only the proper fraction part.
Example: 4x³/(x² − 2) = 4x + 8x/(x² − 2); decompose 8x/(x² − 2).

⚠️ Common pitfalls

⚠️ Misidentifying multiplicity

Don't confuse: (x − 1)(1 − x) looks like two different factors, but (1 − x) = −(x − 1), so x = 1 has multiplicity 2.
Always rewrite factors with the same zero in the same form before counting multiplicity.

⚠️ Wrong numerator form

Linear factors get constant numerators (A, B, ...).
Irreducible quadratics get linear numerators (Bx + C, Dx + E, ...).
Don't use a linear numerator for a linear factor or a constant numerator for an irreducible quadratic.

⚠️ Forgetting terms for repeated factors

A factor raised to the mth power requires m terms in the decomposition.
Example: (x − 1)³ requires A/(x − 1) + B/(x − 1)² + C/(x − 1)³, not just one term.

⚠️ Algebraic errors when clearing fractions

The excerpt warns against errors like treating 8/(x² − 9) as if it equals 8/x² − 8/9.
Why this is wrong: you cannot split a single denominator across a difference; partial fraction decomposition exists precisely because this shortcut does not work.
Always factor the denominator and apply the formal decomposition rules.

📋 Step-by-step summary

📋 The complete procedure

Check if proper: if degree(numerator) ≥ degree(denominator), use long division first.
Factor the denominator: find all linear and irreducible quadratic factors, noting multiplicities.
Write the form: apply Theorem 8.10 to set up the sum of partial fractions with unknown coefficients.
Clear denominators: multiply both sides by the common denominator.
Expand and collect: gather like powers of x on both sides.
Equate coefficients: match coefficients of x³, x², x, and constants to form a system of equations.
Solve the system: use substitution or another method to find all unknowns.
Write the final answer: substitute the coefficients back into the partial fraction form.
Check: combine the partial fractions to verify you recover the original expression.

Systems of Non-Linear Equations and Inequalities

8.7 Systems of Non-Linear Equations and Inequalities

🧭 Overview

🧠 One-sentence thesis

Unlike linear systems, non-linear systems have no general algorithmic solution method and require a combination of substitution, elimination, graphical insight, and algebraic tenacity to solve, often yielding a finite number of solutions rather than none or infinitely many.

📌 Key points (3–5)

No universal algorithm: Non-linear systems cannot be solved by a single routine method; each problem demands its own approach combining substitution, elimination, and sometimes creative algebraic manipulation.
Finite solution sets are common: A consistent non-linear system can have exactly two, three, or another finite number of solutions—unlike linear systems, which have either none, one, or infinitely many.
Graphical verification is valuable: Sketching the curves helps identify the number of intersection points and can prevent extraneous solutions from slipping through.
Common confusion—when to use elimination vs. substitution: Elimination works when you can remove troublesome terms (like xy or squared terms); substitution is needed when equations have no like terms, but beware of introducing extraneous solutions.
Inequalities define regions: Non-linear inequalities partition the plane into regions separated by curves; test points in each region determine which areas satisfy the inequality.

🔧 Solution techniques for non-linear equations

🔧 Elimination method

Elimination: combining equations to remove one variable, especially useful when both equations contain the same squared terms (e.g., x² and y²).

Works well when you can subtract or add multiples of equations to cancel variables.
Example: If both equations contain x² + y², you can eliminate those terms by subtracting one equation from the other.
When it helps: The excerpt shows that even if you cannot eliminate a variable completely, you can sometimes eliminate troublesome cross-terms like xy to get a simpler relationship (e.g., y² − x² = 0, which factors as y = ±x).
Don't confuse: Elimination doesn't always remove a variable entirely; sometimes it just simplifies the relationship between variables.

🔄 Substitution method

Substitution: solving one equation for a variable and replacing that variable in the other equation.

Necessary when equations have no like terms to eliminate.
Can lead to complicated expressions (e.g., quadratic formulas, exponentials, nested radicals).
Caution: Never divide both sides by a variable (e.g., dividing yz = y by y) because you might lose solutions where that variable equals zero. Instead, factor: yz − y = 0 → y(z − 1) = 0.
Example: From y − 2x = 0, get y = 2x and substitute into x² + y² = 4 to obtain x² + (2x)² = 4.

🎨 Graphical insight

Sketching both equations can reveal the number of solutions and their approximate locations.
Helps catch extraneous solutions: if algebra gives two y-values but the graph shows only positive y-intersections, discard the negative value.
Example: The excerpt shows a circle x² + y² = 4 intersecting a parabola y = x²; the sketch confirms both intersection points have positive y, so only y = (−1 + √17)/2 is valid.

🧩 Special tricks

Factoring after elimination: If you get y² − x² = 0, factor as (y − x)(y + x) = 0 to split into two simpler cases: y = x or y = −x.
Substitution for "linear in form" systems: If equations like 4/x + 3/y = 1 appear, let u = 1/x and v = 1/y to convert to a linear system in u and v, then back-substitute.
Case analysis: When an equation factors (e.g., y(z − 1) = 0), solve each case separately (y = 0 or z = 1) and combine results.

📊 Understanding solution sets

📊 Finite vs. infinite solutions

System type	Typical solution count	Excerpt example
Linear	0, 1, or ∞	Not applicable here
Non-linear	Often 2, 3, 4, or another finite number	Circle and ellipse intersect at exactly 2 points

Why finite is common: Two curves (e.g., a circle and a line) typically cross at a few points, not infinitely many.
Inconsistent systems: When elimination yields an impossible equation (e.g., −13y² = 20 with no real solution), the system has no solution.
Don't use "dependent/independent": The excerpt notes these labels are generally not applied to non-linear systems.

🔍 Checking solutions

Algebraically: Substitute each solution point into both original equations to verify.
Graphically: Plot both equations and confirm that the solution points lie at their intersections.
Example: For solutions {(0, 2), (0, −2)}, check that both points satisfy x² + y² = 4 and 4x² + 9y² = 36, and confirm on the graph that the circle and ellipse intersect at those y-intercepts.

🌐 Systems with three variables

🌐 Organizing three-variable systems

Label equations E₁, E₂, E₃ to keep track during elimination and substitution.
Start with the simplest equation (e.g., yz = y) and factor it to split into cases.
Example from the excerpt:
- E₂: yz = y → y(z − 1) = 0 → Case 1: y = 0; Case 2: z = 1.
- Substitute each case into the other two equations and solve.
- Case 1 yields two solutions; Case 2 yields a contradiction (−2 = 0), so no solutions from that case.

🧮 Handling contradictions

If substituting a case into the remaining equations produces an impossible statement (e.g., x − 2 = x → −2 = 0), that case contributes no solutions.
Don't discard the entire system—other cases may still yield valid solutions.

🗺️ Non-linear inequalities and regions

🗺️ How inequalities partition the plane

Non-linear inequality: an inequality like x² + y² − 4 < 0 that divides the plane into regions separated by a curve (the boundary where equality holds).

Boundary curve: Set the inequality to equality (e.g., x² + y² − 4 = 0 → x² + y² = 4, a circle).
Test points: Choose a point in each region (inside and outside the circle) and substitute into the inequality to see which region satisfies it.
Example: For x² + y² − 4 < 0, test (0, 0) inside the circle (gives −4 < 0, true) and (0, 3) outside (gives 5 < 0, false) → solution is the inside of the circle.

🖊️ Boundary inclusion

Strict inequality (< or >): boundary curve is not included; draw it dashed.
Non-strict inequality (≤ or ≥): boundary curve is included; draw it solid.
Example: x² + y² − 4 < 0 excludes the circle itself (dashed); x² + y² ≥ 4 includes the circle (solid).

🔗 Systems of inequalities

Solve each inequality separately to find its region.
The solution to the system is the intersection of all individual regions.
Example: {x² + y² ≥ 4, x² − 2x + y² − 2y ≤ 0} → solution is points on or outside the first circle that are also on or inside the second circle.
Finding precise boundaries: Solve the system of equations formed by the boundary curves (set each inequality to equality) to locate intersection points for accurate graphing.

📐 Compound inequalities

An inequality like y² − 4 ≤ x < y + 2 means both y² − 4 ≤ x and x < y + 2 must hold.
Solve each part, graph both regions, and take their overlap.
The excerpt shows this requires finding where the parabola y² = x + 4 and the line y = x − 2 intersect (by solving the system) to determine the exact overlap region.

⚠️ Common pitfalls and cautions

⚠️ Extraneous solutions

Algebraic manipulation (especially squaring or substituting complicated expressions) can introduce solutions that don't satisfy the original equations.
Prevention: Always check solutions in the original equations, and use a graph to confirm the number and location of solutions.
Example: Solving y² + y = 4 gives y = (−1 ± √17)/2, but the graph shows only positive y-intersections, so discard y = (−1 − √17)/2.

⚠️ Dividing by a variable

Never divide both sides by a variable (e.g., y) without considering that it might be zero.
Instead, move everything to one side and factor: yz − y = 0 → y(z − 1) = 0 → y = 0 or z = 1.
This ensures you don't lose solutions where the variable equals zero.

⚠️ Misinterpreting the boundary

A common error (mentioned at the start of the excerpt): thinking 8/(x² − 9/6) = 8/x² − 8/(9/6).
This misunderstanding would make partial fractions (and much of algebra) unnecessary—always respect the structure of rational expressions.

Sequences

9.1 Sequences

🧭 Overview

🧠 One-sentence thesis

Sequences are functions whose domain is the natural numbers, and two fundamental types—arithmetic (adding a constant) and geometric (multiplying by a constant)—can be described by explicit formulas that predict any term directly.

📌 Key points (3–5)

What a sequence is: a function from the natural numbers to real numbers, where each natural number n maps to the nth term.
Arithmetic vs geometric: arithmetic sequences add a fixed number (common difference d) each step; geometric sequences multiply by a fixed number (common ratio r) each step.
Explicit formulas exist: arithmetic sequences follow a_n = a + (n − 1)d; geometric sequences follow a_n = a·r^(n−1).
Common confusion: not every sequence is arithmetic or geometric—many are neither, and some can be built from combinations of these patterns.
Recursive vs explicit: sequences can be defined recursively (each term depends on previous terms) or explicitly (a formula directly computes the nth term).

📐 What sequences are

📐 Definition and notation

Sequence: a function a whose domain is the natural numbers {1, 2, 3, ...}. The value a(n) is written as a_n and called the nth term.

Informally: an infinite list of numbers with a first term, second term, third term, and so on.
Notation: the sequence itself is written as {a_n, n ≥ 1} or {a_n}∞_(n=1).
Example: the sequence 1/2, −3/4, 9/8, −27/16, ... assigns a(1) = 1/2, a(2) = −3/4, etc.

📊 Graphing sequences

Since sequences are functions, you can graph them by plotting points (n, a_n) for each natural number n.
Key difference from continuous functions: do not connect the dots—the domain is only whole numbers, not intervals.
Example: the sequence b_k = (−1)^k / (2k + 1) for k ≥ 0 produces isolated points on the graph, not a curve.

🔄 Recursive definitions

Some sequences are defined by giving an initial term and a recursion equation that builds later terms from earlier ones.
Example: a₁ = 7, a_(n+1) = 2 − a_n produces a₁ = 7, a₂ = −5, a₃ = 7, a₄ = −5, ...
Factorial sequence: defined recursively as 0! = 1 and n! = n·(n−1)! for n ≥ 1, which expands to n! = n·(n−1)·(n−2)·...·2·1.

🔢 Arithmetic sequences

🔢 Definition and pattern

Arithmetic sequence: a sequence where there exists a number d (the common difference) such that a_(n+1) = a_n + d for all n.

In plain language: you go from one term to the next by always adding the same fixed number d.
The name "common difference" comes from rewriting the recursion as a_(n+1) − a_n = d.
Example: the sequence 1, 3, 5, 7, ... (the odd numbers) has common difference d = 2.

📝 Explicit formula

Formula: a_n = a + (n − 1)d, where a is the first term and d is the common difference.

Why this works: to reach the nth term, you start at a and add d exactly (n − 1) times.
Example: for the odd numbers starting at a = 1 with d = 2, the formula is a_n = 1 + (n − 1)·2 = 2n − 1.
This matches: a₁ = 1, a₂ = 3, a₃ = 5, etc.

🧪 Identifying arithmetic sequences

Check successive differences: compute a₂ − a₁, a₃ − a₂, a₄ − a₃, ...
If all differences are the same, the sequence is arithmetic.
Example: the sequence 2/5, 2, −2/3, −2/7, ... has differences that vary, so it is not arithmetic (though the denominators 5, 1, −3, −7 form an arithmetic sequence with d = −4).

✖️ Geometric sequences

✖️ Definition and pattern

Geometric sequence: a sequence where there exists a nonzero number r (the common ratio) such that a_(n+1) = r·a_n for all n.

In plain language: you go from one term to the next by always multiplying by the same fixed number r.
The name "common ratio" comes from rewriting as a_(n+1) / a_n = r (when a_n ≠ 0).
Example: the sequence 1/3, 5/9, 25/27, 125/81, ... has common ratio r = 5/3.

📝 Explicit formula

Formula: a_n = a·r^(n−1), where a is the first term and r ≠ 0 is the common ratio.

Why this works: to reach the nth term, you start at a and multiply by r exactly (n − 1) times.
Example: for the sequence 1/2, −3/4, 9/8, −27/16, ... with a = 1/2 and r = −3/2, the formula is a_n = (1/2)·(−3/2)^(n−1) = (−3)^(n−1) / 2^n.
The restriction r ≠ 0 avoids the indeterminate form 0^0.

🧪 Identifying geometric sequences

Check successive ratios: compute a₂/a₁, a₃/a₂, a₄/a₃, ...
If all ratios are the same, the sequence is geometric.
Example: the sequence 1, −1/3, 1/5, −1/7, ... has ratios −1/3 and −3/5, which differ, so it is not geometric.

🔀 Alternating sequences

A sequence with terms that alternate in sign often has a factor like (−1)^n or (−1)^(n−1).
Example: b_k = (−1)^k / (2k + 1) produces 1, −1/3, 1/5, −1/7, ... (alternating positive and negative).

🧩 Neither arithmetic nor geometric

🧩 Combination patterns

Many sequences are neither arithmetic nor geometric but can be decomposed into simpler patterns.
Strategy: separate numerators and denominators, or look for sub-patterns.
Example: the sequence 2/5, 2/1, 2/(−3), 2/(−7), ... has constant numerator 2 and denominators forming an arithmetic sequence (5, 1, −3, −7 with d = −4), so a_n = 2 / (9 − 4n).

🧩 Recognizing number patterns

Some sequences follow recognizable patterns like squares (1, 4, 9, 16, ...) or cubes (1, 8, 27, 64, ...).
Example: the sequence 2, 5, 10, 17, ... can be seen as 1+1, 4+1, 9+1, 16+1, ... so a_n = n² + 1.
Important caveat: given only a finite sample, infinitely many formulas can generate the same initial terms—there is no unique "correct" answer unless more context is provided.

🧩 Mixed patterns

Example: the sequence 1, −2/7, 4/13, −8/19, ... has numerators 1, −2, 4, −8 (geometric with r = −2) and denominators 1, 7, 13, 19 (arithmetic with d = 6), yielding a_n = (−2)^(n−1) / (6n − 5).

💰 Applications in finance

💰 Compound interest as a geometric sequence

When P dollars are invested at annual rate r compounded n times per year, the amount after k compounding periods is A_k = P·(1 + r/n)^k.
This is a geometric sequence with first term P·(1 + r/n) and common ratio (1 + r/n).
Each term represents the account balance after one more compounding period.

💰 Annuities (preview)

An annuity is an investment where additional principal is deposited at regular intervals, not just a single lump sum.
Analyzing annuities requires summing sequences, which will be covered in the next section.

🛠️ Finding explicit formulas

🛠️ General strategy

When in doubt, write it out: list the first several terms to spot patterns.
Check if the sequence is arithmetic (constant differences) or geometric (constant ratios).
If neither, look for sub-patterns in numerators, denominators, signs, or other components.
Use the explicit formulas from Equation 9.1 when applicable.

🛠️ Examples of pattern recognition

Sequence	Pattern	Formula
0.9, 0.09, 0.009, ...	Geometric, r = 1/10	a_n = 9/10^n
1, 4, 9, 16, ...	Perfect squares	a_n = n²
1, 0, 1, 0, ...	Alternating 1 and 0	a_n = (1 + (−1)^(n+1))/2

🛠️ Don't confuse: finite samples and uniqueness

A finite list of terms does not uniquely determine a sequence formula.
Example: both a_n = n² + 1 and b_n = (−1/4)n⁴ + (5/2)n³ − (31/4)n² + (25/2)n − 5 produce the terms 2, 5, 10, 17 for n = 1, 2, 3, 4.
As long as your formula generates the given terms in the correct order, it is a valid answer.

Summation Notation

9.2 Summation Notation

🧭 Overview

🧠 One-sentence thesis

Summation notation provides a compact way to express the sum of sequence terms, enabling efficient calculation of arithmetic and geometric sums and practical applications like annuities and infinite series.

📌 Key points (3–5)

What summation notation is: a shorthand using the Greek letter Σ (sigma) to represent adding up terms of a sequence from a starting index to an ending index.
Index variable flexibility: the index variable (n, k, j, etc.) is a "dummy variable" that can be changed to any letter without affecting the sum's value.
Arithmetic and geometric sum formulas: closed-form expressions exist for summing the first n terms of arithmetic sequences (using the average of first and last terms) and geometric sequences (using the ratio formula).
Common confusion: when r = 1 in a geometric sequence, the standard geometric sum formula does not apply; instead, the sum is simply n times the constant term.
Practical applications: summation notation is essential for defining polynomials, matrix operations, annuity calculations, and understanding infinite series (when the sum converges).

📐 Understanding summation notation

📐 Basic structure and meaning

Summation notation: a shorthand for adding up terms of a sequence {aₙ} from aₘ through aₚ, written with the symbol Σ (capital Greek letter sigma meaning "sum"), with lower and upper limits indicating the starting and ending terms.

The notation includes three parts: the summation symbol Σ, the formula for the general term, and the limits (lower and upper bounds).
Example: The sum from n=3 to n=6 of (2n - 1) means substitute n=3, 4, 5, 6 into the formula and add: (2(3)-1) + (2(4)-1) + (2(5)-1) + (2(6)-1) = 5 + 7 + 9 + 11 = 32.

🔄 The dummy variable property

The index variable is called a "dummy variable" because changing its letter does not change the sum's value.
Example: The sum from n=3 to n=6 of (2n-1) equals the sum from k=3 to k=6 of (2k-1), which equals the sum from j=3 to j=6 of (2j-1).
Don't confuse: the index variable with other variables in the formula—only the index variable gets replaced by the limit values; other variables (like x in a polynomial) remain unchanged.

🧮 Applications in definitions

Summation notation allows compact mathematical definitions.
Polynomials can be defined as f(x) = sum from k=0 to n of (aₖ xᵏ), where aₖ are real coefficients.
Matrix operations use summation: the product of the i-th row of matrix A and j-th column of matrix B is written as the sum from k=1 to n of (aᵢₖ bₖⱼ).

✍️ Working with summation notation

✍️ Evaluating sums

To find a sum written in summation notation:

Substitute each value from the lower limit to the upper limit into the formula.
Calculate each resulting term.
Add all the terms together.

Example: The sum from k=1 to k=4 of (13/100ᵏ) means:

13/100¹ + 13/100² + 13/100³ + 13/100⁴
= 0.13 + 0.0013 + 0.000013 + 0.00000013
= 0.13131313

🔍 Writing sums in summation notation

To express a sum using summation notation:

Identify the pattern in the terms (arithmetic, geometric, or other).
Find a formula for the n-th term using sequence techniques.
Determine the lower limit (usually the first term's index).
Determine the upper limit (which value of the index produces the last term).

Example: For 1 + 3 + 5 + ... + 117:

The terms form an arithmetic sequence with first term a=1 and common difference d=2.
Formula: aₙ = 2n - 1 for n ≥ 1.
Setting 2n - 1 = 117 gives n = 59.
Result: sum from n=1 to n=59 of (2n - 1).

🔀 Properties of summation

The excerpt lists four key properties (though formal proofs require later machinery):

Sum/difference property: The sum of (aₙ ± bₙ) equals the sum of aₙ plus/minus the sum of bₙ.
Constant multiple property: The sum of c·aₙ equals c times the sum of aₙ, for any constant c.
Split property: A sum from m to p can be split at any intermediate point j into two sums: from m to j and from j+1 to p.
Index shift property: The sum from m to p of aₙ equals the sum from m+r to p+r of aₙ₋ᵣ, for any whole number r.

📊 Formulas for arithmetic and geometric sums

📊 Arithmetic sequence sum

For an arithmetic sequence aₖ = a + (k-1)d for k ≥ 1, the sum S of the first n terms is:

Formula version	Expression	Interpretation
Average form	S = n · (a₁ + aₙ)/2	Number of terms × average of first and last terms
Expanded form	S = (n/2) · (2a + (n-1)d)	Uses first term and common difference directly

The "average form" is easier to remember: multiply the number of terms by the average of the endpoints.
Example application: The sum of the first 100 natural numbers (arithmetic with a=d=1) is 100(101)/2 = 5050.

📊 Geometric sequence sum

For a geometric sequence aₖ = arᵏ⁻¹ for k ≥ 1, the sum S of the first n terms depends on the ratio r:

Case	Formula	When to use
r ≠ 1	S = a(1 - rⁿ)/(1 - r) or S = (a₁ - aₙ₊₁)/(1 - r)	When the ratio is not equal to 1
r = 1	S = na	When every term equals a (constant sequence)

Don't confuse: When r=1, the sequence is constant (a, a, a, ...), so the sum is simply n times a; the standard geometric formula would involve division by zero.
The two formulas for r ≠ 1 are equivalent: distributing a through the numerator in the first gives the second.

💰 Application: Ordinary annuities

💰 What an annuity is

An annuity is an investment plan where:

Payments are deposited on an ongoing basis (not a single lump sum).
Interest is compounded at regular intervals.
An ordinary annuity makes deposits at the end of each compounding period.
An annuity-due makes deposits at the beginning of each period.

💰 Future value formula derivation

The excerpt derives the formula by tracking the account balance after each period:

After period 1: A₁ = P (first payment, no interest yet).
After period 2: A₂ = P(1+i) + P (first payment grew with interest, plus second payment).
After period 3: A₃ = P(1+i)² (1 + 1/(1+i) + 1/(1+i)²).
The terms in parentheses form a geometric sequence with first term a=1 and ratio r=1/(1+i).
Applying the geometric sum formula and simplifying yields the final result.

Future value of an ordinary annuity: A = P · ((1+i)ⁿᵗ - 1)/i, where P is the payment per period, i = r/n is the interest rate per period (r = annual rate, n = compounding frequency per year), and t is the number of years.

Example: With monthly payments of $50, annual rate 6% (so i=0.005), compounded monthly for 30 years:

A = 50 · ((1.005)³⁶⁰ - 1)/0.005 ≈ $50,225.75.

💰 Solving for time

To find how long it takes to reach a target amount:

Set A equal to the target and solve for t.
Isolate the exponential term (1+i)ⁿᵗ.
Take the natural logarithm of both sides.
Solve for t using logarithm properties.

Example: To reach $100,000 with the same parameters, solving gives t ≈ 40.06 years—just over 40 years, showing that in 10 additional years beyond the 30-year mark, the annuity nearly doubles due to compound growth.

∞ Infinite series and geometric series

∞ What an infinite series is

Series: an infinite sum, written as the sum from k=1 to infinity of terms.

A repeating decimal like 0.9̄ (0.9999...) can be written as an infinite sum: 0.9 + 0.09 + 0.009 + ...
This is the sum from k=1 to infinity of 9/10ᵏ.

∞ Convergence of geometric series

Geometric series theorem: For a geometric sequence aₖ = arᵏ⁻¹ with |r| < 1, the infinite sum equals a/(1-r). If |r| ≥ 1, the sum is not defined (does not converge).

The condition |r| < 1 means -1 < r < 1, so rⁿ → 0 as n → ∞.
Taking the finite geometric sum formula and letting n → ∞, the term rⁿ vanishes, leaving a/(1-r).
Example: For 0.9̄, we have a geometric series with a = 9/10 and r = 1/10. The sum is (9/10)/(1 - 1/10) = (9/10)/(9/10) = 1, so 0.9̄ = 1.

∞ Repeating decimals as fractions

Any non-terminating repeating decimal can be expressed as a geometric series and converted to a fraction:

Identify the repeating block and write it as a geometric series.
Apply the geometric series formula with |r| < 1.
Example: 0.7̄ = 7/10 + 7/100 + 7/1000 + ... = (7/10)/(1 - 1/10) = 7/9.

Don't confuse: The infinite series formula only applies when |r| < 1; for |r| ≥ 1, the terms do not shrink to zero, so the sum grows without bound or oscillates and does not converge to a finite value.

Mathematical Induction

9.3 Mathematical Induction

🧭 Overview

🧠 One-sentence thesis

Mathematical induction is a proof technique that establishes formulas for all natural numbers by proving a base case and showing that truth for any number k implies truth for k+1, much like climbing stairs where you prove you can get on the first step and can always move from one step to the next.

📌 Key points (3–5)

What PMI proves: formulas or properties that hold for all natural numbers n (or all n beyond some starting value).
Two required steps: (1) base case—prove the formula is true for n=1 (or the starting value); (2) inductive step—assume it's true for k and prove it must then be true for k+1.
The induction hypothesis: assuming P(k) is true is not circular reasoning; it's showing that truth "propagates" from one number to the next.
Common confusion: assuming P(k) feels like assuming what you're trying to prove, but you're only showing that if it's true for k, then it's true for k+1—combined with the base case, this proves it for all n.
Why it matters: PMI allows us to rigorously prove formulas that were previously only motivated by patterns or examples.

🏗️ The structure of an induction proof

🏗️ The Principle of Mathematical Induction (PMI)

Principle of Mathematical Induction (PMI): Suppose P(n) is a sentence involving the natural number n. IF (1) P(1) is true and (2) whenever P(k) is true, it follows that P(k+1) is also true, THEN the sentence P(n) is true for all natural numbers n.

PMI is a principle (or axiom)—a property of natural numbers we choose to accept.
P(n) is notation like function notation: it represents a formula or statement that depends on n.
Example: if P(n) is "n squared + 1 = 3", then P(1) is "1 squared + 1 = 3" (which is false), and P(k+1) is "(k+1) squared + 1 = 3".

🪜 The staircase metaphor

Think of induction like climbing stairs.
Step (1): prove you can get on the stairs (base case).
Step (2): prove you can climb from any step to the next (inductive step).
If both are true, you can climb the entire staircase.
Don't confuse: the inductive step doesn't prove P(k) is true; it proves P(k) → P(k+1), which is different.

🔢 The base case

🔢 Establishing P(1)

The base case verifies the formula for the starting value (usually n=1).
Example: to prove the arithmetic sequence formula a_n = a + (n−1)d, check n=1: does a_1 = a + (1−1)d? Yes, because this simplifies to a_1 = a, which is the definition.
Sometimes the base case starts at a different value (e.g., n=6 if the claim is only for n > 5).

🔢 Why the base case matters

Without the base case, the inductive step alone proves nothing.
The base case is the "anchor" that starts the chain of implications: P(1) is true, so P(2) is true, so P(3) is true, and so on.

🔗 The inductive step

🔗 Assume P(k), prove P(k+1)

Induction hypothesis: assume P(k) is true for some arbitrary natural number k.
Goal: use this assumption to deduce that P(k+1) is true.
This is not circular reasoning—you're proving a conditional statement: "if P(k), then P(k+1)."

🔗 How to connect P(k) to P(k+1)

Look for a way to express P(k+1) in terms of P(k).
Example (arithmetic sequence sum): the sum of the first k+1 terms equals (sum of first k terms) + (the (k+1)st term).
Use the induction hypothesis to replace the "sum of first k terms" with the formula for P(k), then simplify to show it matches the formula for P(k+1).

🔗 Example walkthrough: arithmetic sequence formula

Claim: a_n = a + (n−1)d for all n ≥ 1.
Base case: a_1 = a + (1−1)d = a. True by definition.
Inductive step: assume a_k = a + (k−1)d. Show a_(k+1) = a + ((k+1)−1)d.
- By the recursive definition, a_(k+1) = a_k + d.
- Substitute the induction hypothesis: a_(k+1) = (a + (k−1)d) + d = a + kd = a + ((k+1)−1)d.
- Hence P(k+1) is true.
By PMI, the formula holds for all n ≥ 1.

📐 Variations and special cases

📐 Starting at n ≠ 1

Sometimes the claim is only for n ≥ some value m (e.g., n > 5).
The base case becomes P(m) instead of P(1).
The conclusion is that P(n) is true for all n ≥ m, not all n ≥ 1.
Example: proving 3^n > 100n for n > 5 starts with base case n=6.

📐 Inequalities and growth comparisons

Induction can prove inequalities like 3^n > 100n for n ≥ 6.
Base case: check 3^6 = 729 > 600 = 100(6). True.
Inductive step: assume 3^k > 100k. Show 3^(k+1) > 100(k+1).
- 3^(k+1) = 3 · 3^k > 3(100k) = 300k (using the induction hypothesis).
- Need to show 300k > 100(k+1), i.e., 300k > 100k + 100, i.e., 200k > 100, i.e., k > 1/2.
- Since k ≥ 6, this is true.
- Hence 3^(k+1) > 100(k+1).

📐 Matrix and algebraic properties

Induction can prove properties involving matrices, complex numbers, logarithms, etc.
Example: proving (conjugate of z)^n = conjugate of (z^n) for complex numbers.
Example: proving determinant properties for n×n matrices by induction on n.

⚠️ Common pitfalls

⚠️ "Isn't assuming P(k) circular?"

No. You are not assuming P(k) is true for all k; you are assuming it for one arbitrary k and proving it for k+1.
The base case + inductive step together create a chain: P(1) → P(2) → P(3) → ... → P(n).

⚠️ Forgetting the base case

The inductive step alone does not prove anything without the base case.
Example: if you only prove P(k) → P(k+1), you haven't shown P(1) is true, so the chain never starts.

⚠️ Not using the induction hypothesis

In the inductive step, you must explicitly use the assumption that P(k) is true.
If your proof of P(k+1) doesn't reference P(k), you're probably not doing induction correctly (though some proofs have cases where the hypothesis isn't needed).

🎯 Why induction matters

🎯 Proving formulas rigorously

Many formulas in sequences and series are initially motivated by patterns or examples.
Induction allows us to prove these formulas hold for all natural numbers, not just the cases we checked.
Example: the sum formula for arithmetic sequences, the formula for geometric sequences, etc.

🎯 Foundation for advanced mathematics

PMI appears throughout higher mathematics, sometimes explicitly, sometimes implicitly.
Any property stated "for all natural numbers n" likely requires induction for a formal proof.
Induction is a fundamental proof technique in discrete mathematics, combinatorics, and computer science.

The Binomial Theorem

9.4 The Binomial Theorem

🧭 Overview

🧠 One-sentence thesis

The Binomial Theorem provides a formula for expanding powers of binomials (a + b)^n using factorials and binomial coefficients, which can be proven by mathematical induction and efficiently computed using Pascal's Triangle.

📌 Key points (3–5)

What the theorem does: gives a formula to expand (a + b)^n for any natural number n without multiplying out the binomial repeatedly.
Key ingredients: factorials (n!) and binomial coefficients (n choose j) form the coefficients in the expansion.
Pattern in the expansion: exponents on a decrease from n to 0 while exponents on b increase from 0 to n; the sum of exponents in each term equals n.
Common confusion: binomial coefficients are not arbitrary—they count the number of ways to select items and follow the recursive pattern in Pascal's Triangle.
Practical shortcuts: Pascal's Triangle provides a quick way to find all coefficients for small n; for large n or single terms, use the Binomial Theorem formula directly.

🔢 Factorials: the building blocks

🔢 Definition and computation

Factorial: For a whole number n, n factorial (written n!) is defined recursively as: 0! = 1, and n! = n · (n − 1)! for n ≥ 1.

Informally, n! = n · (n − 1) · (n − 2) · ... · 2 · 1, with 0! = 1 as the base case.
Example: 3! = 3 · 2 · 1 = 6; 4! = 4 · 3! = 4 · 6 = 24.
Why 0! = 1: this is the defined base case that makes the recursive formula work.

⚙️ Simplifying factorial expressions

Key technique: cancel common factorial factors by writing out the larger factorial until you reach the smaller one.
Example: 7! / 5! = (7 · 6 · 5!) / 5! = 7 · 6 = 42.
Example: (k + 2)! / (k − 1)! = (k + 2)(k + 1)(k)(k − 1)! / (k − 1)! = k(k + 1)(k + 2) for k ≥ 1.
Don't confuse: 0! in the denominator is fine because 0! = 1, not undefined.

📈 Growth rate of factorials

Factorials grow faster than exponential functions.
The excerpt proves by induction that n! > 3^n for all n ≥ 7.
More generally, for any positive real number x, the ratio (x^n) / n! approaches 0 as n approaches infinity.
Application context: factorials count arrangements (e.g., 50! different playlists of 50 songs in order).

🎯 Binomial coefficients

🎯 Definition and meaning

Binomial coefficient: Given whole numbers n and j with n ≥ j, the binomial coefficient (n choose j) is defined as: (n choose j) = n! / [j! · (n − j)!].

Physical interpretation: (n choose j) counts the number of ways to select j items from n items where order does not matter.
Example: (5 choose 2) = 5! / (2! · 3!) = (5 · 4) / 2 = 10—there are 10 ways to choose 2 friends out of 5.
Why "binomial coefficient": these numbers appear as coefficients in the expansion of (a + b)^n.

🔗 Recursive property (Theorem 9.3)

Key identity: (n choose j−1) + (n choose j) = (n+1 choose j) for natural numbers n and j with n ≥ j.
This identity is proven purely by algebraic manipulation using the definition of binomial coefficients and factorial properties.
Why it matters: this recursive relationship is the foundation of Pascal's Triangle and is essential in the induction proof of the Binomial Theorem.

📐 The Binomial Theorem

📐 Statement of the theorem

Binomial Theorem: For nonzero real numbers a and b, (a + b)^n = sum from j=0 to n of [(n choose j) · a^(n−j) · b^j] for all natural numbers n.

Pattern in each term: the exponent on a starts at n and decreases by 1 each term; the exponent on b starts at 0 and increases by 1 each term.
The sum of the two exponents in every term equals n.
The binomial coefficient (n choose j) appears with the term a^(n−j) · b^j.

🧮 Understanding the pattern (n = 4 example)

(a + b)^4 = (4 choose 0)a^4 + (4 choose 1)a^3·b + (4 choose 2)a^2·b^2 + (4 choose 3)a·b^3 + (4 choose 4)b^4.
The lower number in each binomial coefficient matches the exponent on b in that term.
Counting interpretation: (4 choose 1) counts the number of ways to pick 1 factor of b out of 4 total factors when multiplying (a + b)(a + b)(a + b)(a + b), forcing the remaining 3 factors to be a.

🔬 Proof by induction

Base case (n = 1): (a + b)^1 = (1 choose 0)a + (1 choose 1)b = a + b. ✓
Inductive step: assume the formula holds for (a + b)^k; show it holds for (a + b)^(k+1).
- Write (a + b)^(k+1) = (a + b) · (a + b)^k and apply the induction hypothesis.
- Distribute and separate into two sums; the first sum contributes terms from a · (a + b)^k, the second from b · (a + b)^k.
- The first and last terms (a^(k+1) and b^(k+1)) are isolated; the middle terms are combined using index shifting and Theorem 9.3.
- After simplification, the result matches the Binomial Theorem formula for n = k + 1.
Don't confuse: the proof requires careful index manipulation and the recursive property of binomial coefficients; it is not just "obvious" from the pattern.

🛠️ Applications and shortcuts

🛠️ Using the theorem for specific terms

When you need only one term: use the Binomial Theorem formula directly without expanding the entire sum.
Example: to find the term containing x^3 in (2x + y)^5, identify that the exponent on x is 3, so the exponent on y must be 2 (since 3 + 2 = 5). The term is (5 choose 2) · (2x)^3 · y^2 = 10 · 8x^3 · y^2 = 80x^3·y^2.
Example: (x − 2)^4 with a = x, b = −2, n = 4 expands to x^4 − 8x^3 + 24x^2 − 32x + 16.
Example: 2.1^3 = (2 + 0.1)^3 can be computed using the Binomial Theorem with a = 2, b = 0.1, n = 3, yielding 9.261.

🔺 Pascal's Triangle

Construction: arrange binomial coefficients in a triangular array where each row corresponds to the coefficients for (a + b)^n.
- Row 0: 1
- Row 1: 1 1
- Row 2: 1 2 1
- Row 3: 1 3 3 1
- Row 4: 1 4 6 4 1
Pattern: each row starts and ends with 1; interior numbers are the sum of the two numbers directly above (using Theorem 9.3).
When to use Pascal's Triangle: best for expanding an entire binomial with small n; quick and visual.
When to use the Binomial Theorem formula: when n is large or you need only one or a few specific terms.

📊 Comparison: Pascal's Triangle vs. Binomial Theorem formula

Method	Best for	Example
Pascal's Triangle	Full expansion, small n	(3x − y)^4 using row 4: 1, 4, 6, 4, 1
Binomial Theorem	Single term, large n, or any n	Term with x^117 in (x + 2)^118

Don't confuse: Pascal's Triangle is a computational shortcut, not a different theorem—it encodes the same binomial coefficients defined by the formula.

Angles and their Measure

10.1 Angles and their Measure

🧭 Overview

🧠 One-sentence thesis

Angles can be measured in two systems—degrees and radians—with radian measure providing a natural way to connect angles with real numbers and enabling applications in circular motion.

📌 Key points (3–5)

Two measurement systems: Angles can be measured in degrees (360° per revolution) or radians (2π per revolution), with radians being dimensionless.
Oriented angles: Direction matters—positive angles rotate counter-clockwise, negative angles rotate clockwise, and angles can exceed one full revolution.
Coterminal angles: Angles that share the same terminal side differ by multiples of 360° (or 2π radians).
Common confusion: Radian measure seems more complicated than degrees, but radians are "pure numbers" that equal arc length divided by radius (s/r), making them essential for calculus and physics.
Circular motion application: For constant angular velocity ω on a circle of radius r, linear velocity v = rω connects rotational and linear motion.

📐 Basic angle concepts

📐 What is an angle

An angle is formed when two rays share a common initial point called the vertex.

A ray is a "half-line" with one endpoint (the initial point) extending infinitely in one direction.
The measure of an angle indicates the amount of rotation separating the two rays.
Multiple angles can be described by the same diagram depending on which rotation you measure.
Angles are labeled with Greek letters: α (alpha), β (beta), γ (gamma), θ (theta).

📏 Special angle types

Straight angle: Two rays pointing in opposite directions (180°).
Right angle: One quarter revolution (90°), marked with a small square symbol.
Acute angle: Measures strictly between 0° and 90°.
Obtuse angle: Measures strictly between 90° and 180°.

🔢 Degree measure system

🔢 How degrees work

Degree measure: One complete revolution equals 360°, and parts of a revolution are measured proportionately.

One revolution = 360°
Half revolution (straight angle) = 180°
Quarter revolution (right angle) = 90°
Any angle's measure = (fraction of revolution) × 360°

Example: An angle representing 2/3 of a revolution measures (2/3)(360°) = 240°.

🕐 Subdividing degrees: DMS system

The Degree-Minute-Second (DMS) system divides degrees into smaller units:

1° = 60′ (60 minutes)
1′ = 60″ (60 seconds)
Therefore: 1° = 3600″

Converting decimal to DMS: Take 42.125°

Separate whole degrees: 42° + 0.125°
Convert decimal degrees to minutes: 0.125° × (60′/1°) = 7.5′ = 7′ + 0.5′
Convert decimal minutes to seconds: 0.5′ × (60″/1′) = 30″
Result: 42°7′30″

Converting DMS to decimal: Take 117°15′45″

Convert minutes: 15′ × (1°/60′) = 1/4°
Convert seconds: 45″ × (1°/3600″) = 1/80°
Add: 117° + 1/4° + 1/80° = 117.2625°

🔄 Complementary and supplementary angles

Complementary angles: Two acute angles whose measures add to 90°.
Supplementary angles: Two angles (either both right, or one acute and one obtuse) whose measures add to 180°.

➕ Oriented angles and standard position

➕ Direction matters: oriented angles

An oriented angle has direction:

Positive angle: Counter-clockwise rotation from initial side to terminal side.
Negative angle: Clockwise rotation.
Angles can exceed 360° by making multiple revolutions.

Example: 450° = one complete revolution (360°) plus an additional 90° counter-clockwise.

📍 Standard position

An angle is in standard position when its vertex is at the origin and its initial side lies on the positive x-axis.

Classification by terminal side location:

Quadrant I, II, III, or IV angle: Terminal side lies in that quadrant.
Quadrantal angle: Terminal side lies on a coordinate axis.

🔁 Coterminal angles

Coterminal angles are angles in standard position that share the same terminal side.

Coterminal angles differ by multiples of 360°.
Formula: If α and β are coterminal, then β = α + 360° · k where k is any integer (0, ±1, ±2, ...).

Example: 120° and -240° are coterminal because -240° = 120° - 360°.

Don't confuse: The same terminal side can be described by infinitely many different angle measures.

⭕ Radian measure

⭕ Defining radians

Radian measure of angle θ is the ratio s/r, where s is the arc length and r is the radius of the circle.

Key insight: Radian measure tells you how many "radius lengths" you need to sweep along the circle to subtend the angle.

One complete revolution = 2πr/r = 2π radians
Half revolution = π radians
Quarter revolution = π/2 radians
Radians are dimensionless (pure numbers) because they equal length/length

🔄 Converting between degrees and radians

To convert	Multiply by
Degrees → Radians	π radians / 180°
Radians → Degrees	180° / π radians

Examples:

60° × (π/180°) = π/3 radians
(-5π/6) × (180°/π) = -150°
1 radian ≈ 57.2958°

Don't confuse: When you see θ = π/2, this means π/2 radians, not degrees.

🎯 Radians and the Unit Circle

On the Unit Circle (radius = 1):

The radian measure θ equals the arc length s directly (since s/r = s/1 = s).
This allows us to identify every real number t with an angle.

Wrapping the number line:

For t > 0: Wrap the interval [0, t] counter-clockwise around the Unit Circle starting at (1, 0).
For t < 0: Wrap the interval [t, 0] clockwise.
For t = 0: Stay at point (1, 0).

Example: t = 3π/4 corresponds to 3/8 of a revolution counter-clockwise, ending in Quadrant II.

🔁 Coterminal angles in radians

For radian measure, coterminal angles differ by multiples of 2π:

β = α + 2πk where k is any integer

Example: π/6 is coterminal with 13π/6 (k=1) and -11π/6 (k=-1).

🎡 Applications: Circular motion

🎡 Velocity concepts

For an object moving on a circular path:

Displacement s:

Positive = counter-clockwise
Negative = clockwise
The formula θ = s/r still holds with signed values

Average velocity: v̄ = s/t (displacement over time)

Units: length/time
Sign indicates direction; magnitude |v̄| is the speed

Angular velocity: ω̄ = θ/t (angle change over time)

Units: radians/time
Measures how fast the angle changes

⚡ Linear velocity formula

For constant angular velocity ω on a circular path of radius r: v = rω

Why this works:

Since θ = s/r, we have s = rθ
Therefore v = s/t = rθ/t = r(θ/t) = rω
The radius r is the "magnification factor" relating angular and linear velocity

Unit note: Although ω has units radians/time, radians are dimensionless, so rω has units length/time, matching v.

Example: Earth rotates once (2π radians) in 24 hours at latitude with radius 2960 miles.

ω = 2π/24 = π/12 radians per hour
v = 2960 × (π/12) ≈ 775 miles per hour

🔄 Frequency and period

Ordinary frequency f: Number of complete cycles per unit time (e.g., revolutions per hour)
Angular frequency ω: ω = 2πf (radians per unit time)
Period T: Time for one complete cycle, T = 1/f

Don't confuse: Points farther from the center (larger r) must travel faster (larger v) to maintain the same angular velocity ω, because they have farther to go in the same time period.

The Unit Circle: Cosine and Sine

10.2 The Unit Circle: Cosine and Sine

🧭 Overview

🧠 One-sentence thesis

The cosine and sine functions assign to every angle a position on the Unit Circle—cosine as the x-coordinate and sine as the y-coordinate—enabling us to describe circular motion, solve triangles, and establish fundamental trigonometric identities.

📌 Key points (3–5)

Core definitions: For an angle θ in standard position, cos(θ) is the x-coordinate and sin(θ) is the y-coordinate of the point where the terminal side intersects the Unit Circle.
The Pythagorean Identity: cos²(θ) + sin²(θ) = 1 for any angle, derived from the Unit Circle equation x² + y² = 1.
Reference angles simplify computation: Any non-quadrantal angle θ has an acute reference angle α; cos(θ) and sin(θ) equal ±cos(α) and ±sin(α), with signs determined by the quadrant.
Common confusion—quadrantal vs non-quadrantal: Quadrantal angles (0°, 90°, 180°, 270°) lie on axes and have simple cosine/sine values (0, ±1); non-quadrantal angles require reference angles or special triangles.
Extension beyond the unit circle: The definitions extend to any circle of radius r and to real-number inputs, enabling equations of circular motion and right-triangle trigonometry.

📍 Defining cosine and sine on the Unit Circle

📍 The Unit Circle setup

Unit Circle: the circle centered at the origin with radius 1, described by x² + y² = 1.

Place an angle θ in standard position: vertex at the origin, initial side along the positive x-axis.
Let P be the point where the terminal side of θ intersects the Unit Circle.
Definition: cos(θ) = x-coordinate of P; sin(θ) = y-coordinate of P.
These rules satisfy the definition of a function: each angle θ has exactly one cos(θ) and one sin(θ).

🔢 Finding cosine and sine for specific angles

Quadrantal angles (terminal side on an axis):

θ = 270° → terminal side on negative y-axis → P = (0, −1) → cos(270°) = 0, sin(270°) = 0.
θ = −π (half clockwise revolution) → terminal side on negative x-axis → P = (−1, 0) → cos(−π) = −1, sin(−π) = 0.

Non-quadrantal angles (terminal side not on an axis):

Use geometry: drop a perpendicular from P to the x-axis to form a right triangle.
Example: θ = 45° → 45°-45°-90° triangle → legs equal → x = y, and x² + y² = 1 → x = y = √2/2.
Example: θ = π/6 (30°) → 30°-60°-90° triangle → y = 1/2, then x² + (1/2)² = 1 → x = √3/2.
Example: θ = 60° → same triangle → x = 1/2, y = √3/2.

Don't confuse: For quadrantal angles, one coordinate is 0 and the other is ±1; for non-quadrantal angles, both coordinates are typically non-zero and require calculation.

🔗 The Pythagorean Identity

🔗 Derivation and statement

Since P(cos(θ), sin(θ)) lies on the Unit Circle, substitute x = cos(θ) and y = sin(θ) into x² + y² = 1.
Result: (cos(θ))² + (sin(θ))² = 1.
Convention: write cos²(θ) for (cos(θ))² and sin²(θ) for (sin(θ))².

Theorem 10.1 (Pythagorean Identity): For any angle θ, cos²(θ) + sin²(θ) = 1.

🔗 Using the identity to find missing values

If one of cos(θ) or sin(θ) is known, the identity determines the other up to a ± sign.
The quadrant where θ's terminal side lies removes the ambiguity.

Example: If θ is Quadrant II and sin(θ) = 3/5, then:

Substitute into identity: cos²(θ) + 9/25 = 1 → cos²(θ) = 16/25 → cos(θ) = ±4/5.
Quadrant II → x-coordinates negative → cos(θ) = −4/5.

Example: If sin(θ) = 1, then cos²(θ) + 1 = 1 → cos(θ) = 0.

🔄 Reference angles and symmetry

🔄 What is a reference angle?

Reference angle α for a non-quadrantal angle θ: the acute angle between the terminal side of θ and the x-axis.

Quadrant I or IV: α is the angle between the terminal side and the positive x-axis.
Quadrant II or III: α is the angle between the terminal side and the negative x-axis.

🔄 The Reference Angle Theorem

Theorem 10.2: If α is the reference angle for θ, then cos(θ) = ±cos(α) and sin(θ) = ±sin(α), where the ± depends on the quadrant of θ's terminal side.

The Unit Circle has symmetry with respect to the x-axis, y-axis, and origin.
For any angle θ, there is a symmetric point on the circle that determines the reference angle α.

Example: θ = 5π/6 (Quadrant II, π/6 short of π) → reference angle α = π/6.

The point for π/6 is (√3/2, 1/2).
Reflect across the y-axis → P = (−√3/2, 1/2).
Hence cos(5π/6) = −√3/2, sin(5π/6) = 1/2.

📋 Essential values to memorize

θ (degrees)	θ (radians)	cos(θ)	sin(θ)
0°	0	1	0
30°	π/6	√3/2	1/2
45°	π/4	√2/2	√2/2
60°	π/3	1/2	√3/2
90°	π/2	0	1

Use the Reference Angle Theorem and these values to find cosine and sine for any common angle.

🔄 Examples using reference angles

θ = 225° (Quadrant III):

Reference angle: α = 225° − 180° = 45°.
Quadrant III → both cos and sin negative.
cos(225°) = −cos(45°) = −√2/2; sin(225°) = −sin(45°) = −√2/2.

θ = 11π/6 (Quadrant IV):

Reference angle: α = 2π − 11π/6 = π/6.
Quadrant IV → cos positive, sin negative.
cos(11π/6) = cos(π/6) = √3/2; sin(11π/6) = −sin(π/6) = −1/2.

θ = −5π/4 (clockwise 5π/4, lands in Quadrant II):

Reference angle: α = 5π/4 − π = π/4.
Quadrant II → cos negative, sin positive.
cos(−5π/4) = −cos(π/4) = −√2/2; sin(−5π/4) = sin(π/4) = √2/2.

θ = 7π/3 (more than one revolution):

Coterminal angle: 7π/3 − 2π = π/3.
cos(7π/3) = cos(π/3) = 1/2; sin(7π/3) = sin(π/3) = √3/2.

Tip: In radian measure, fractions with denominator 6 have reference angle π/6; denominator 4 → π/4; denominator 3 → π/3.

🧮 Solving basic trigonometric equations

🧮 Finding all angles that satisfy an equation

Trigonometric equations typically have infinitely many solutions because angles are coterminal if they differ by integer multiples of 2π.

Example: Solve cos(θ) = 1/2.

Terminal side intersects Unit Circle at x = 1/2 → Quadrant I or IV angle with reference angle π/3.
Quadrant I solution: θ = π/3 → all solutions: θ = π/3 + 2πk for integers k.
Quadrant IV solution: θ = 5π/3 → all solutions: θ = 5π/3 + 2πk for integers k.

Example: Solve sin(θ) = −1/2.

Terminal side intersects at y = −1/2 → Quadrant III or IV with reference angle π/6.
Quadrant III: θ = 7π/6 + 2πk.
Quadrant IV: θ = 11π/6 + 2πk.

Example: Solve cos(θ) = 0.

Quadrantal angles on the y-axis: θ = π/2 or 3π/2.
General solution: θ = π/2 + πk for integers k (captures both by varying k).

Don't confuse: Different-looking families like θ = 11π/6 + 2πk and θ = −π/6 + 2πk represent the same set of angles (verify by writing out several values).

🌐 Beyond the Unit Circle

🌐 Extending to circles of radius r

Consider a circle x² + y² = r² centered at the origin.
Let Q(x, y) be the point on the terminal side of θ (in standard position) that lies on this circle.
Using similar triangles and the Unit Circle point P(cos(θ), sin(θ)):
- x = r cos(θ)
- y = r sin(θ)
Conversely: cos(θ) = x/r = x/√(x² + y²); sin(θ) = y/r = y/√(x² + y²).

Theorem 10.3: If Q(x, y) is on the terminal side of θ and lies on the circle x² + y² = r², then x = r cos(θ), y = r sin(θ), and cos(θ) = x/r, sin(θ) = y/r.

Example: Terminal side of θ contains Q(4, −2).

r = √(16 + 4) = √20 = 2√5.
cos(θ) = 4/(2√5) = 2√5/5; sin(θ) = −2/(2√5) = −√5/5.

🌐 Application: circular motion

An object moving in a circular path of radius r with constant angular velocity ω.
At time t, the angle swept is θ = ωt (assuming starting at (r, 0) when t = 0).
Position at time t: (x, y) = (r cos(ωt), r sin(ωt)).
ω > 0 → counter-clockwise; ω < 0 → clockwise.

Example: Lakeland Community College rotating with Earth at radius r = 2960 miles, ω = π/12 radians per hour.

Equations of motion: x = 2960 cos(πt/12), y = 2960 sin(πt/12), where x, y in miles, t in hours.

📐 Right triangle trigonometry

For an acute angle θ in a right triangle:
- Adjacent side: length a (next to θ).
- Opposite side: length b (across from θ).
- Hypotenuse: length c (opposite the right angle).
By the Pythagorean Theorem: a² + b² = c².
Place the triangle in Quadrant I with θ in standard position → point P(a, b) lies on a circle of radius c.

Theorem 10.4: For an acute angle θ in a right triangle, cos(θ) = (adjacent)/(hypotenuse) = a/c and sin(θ) = (opposite)/(hypotenuse) = b/c.

Example: Right triangle with 30° angle and adjacent side 7.

Missing angle: 180° − 30° − 90° = 60°.
Hypotenuse: c = 7/cos(30°) = 7/(√3/2) = 14√3/3.
Opposite side: b = c sin(30°) = (14√3/3)(1/2) = 7√3/3.

🔢 Cosine and sine as functions of real numbers

🔢 Extending to real-number inputs

Identify a real number t with the angle θ = t radians.
Define cos(t) = cos(θ) and sin(t) = sin(θ).
Equivalently: wrap an oriented arc of length t around the Unit Circle starting at (1, 0) to reach P(cos(t), sin(t)).

🔢 Domain and range

Theorem 10.5:

f(t) = cos(t) has domain (−∞, ∞) and range [−1, 1].

g(t) = sin(t) has domain (−∞, ∞) and range [−1, 1].

Cosine and sine are defined for all real numbers (or all angles).
Since they represent coordinates on the Unit Circle, their values lie between −1 and 1, inclusive.

Solving equations with real-number inputs: The process is identical to solving with angles in radians.

Example: sin(t) = −1/2 → t = 7π/6 + 2πk or t = 11π/6 + 2πk for integers k.

Note: Properties developed for cosine and sine as functions of angles in radian measure apply equally when inputs are real numbers.

The Six Circular Functions and Fundamental Identities

10.3 The Six Circular Functions and Fundamental Identities

🧭 Overview

🧠 One-sentence thesis

The six circular functions—cosine, sine, secant, cosecant, tangent, and cotangent—are all defined using coordinates on the Unit Circle, and they are related through reciprocal, quotient, and Pythagorean identities that allow us to transform and verify trigonometric expressions.

📌 Key points (3–5)

Six functions from the Unit Circle: All six circular functions are defined using the coordinates (x, y) of points on the Unit Circle where an angle's terminal side intersects it.
Reciprocal and quotient relationships: Secant, cosecant, tangent, and cotangent can be expressed in terms of cosine and sine through simple reciprocal and quotient formulas.
Pythagorean identities: Three fundamental identities relate the squares of these functions: the basic identity involving cosine and sine, one involving tangent and secant, and one involving cotangent and cosecant.
Common confusion—solving equations: When tan(θ) equals a ratio like 3, this does NOT mean sin(θ) = 3 and cos(θ) = 1; instead, you must use the relationship sin(θ) = 3cos(θ) along with the Pythagorean identity to find the actual values.
Why it matters: These identities are essential for simplifying expressions, solving equations, and performing calculations in both trigonometry and calculus.

📐 Defining the six circular functions

📐 The Unit Circle definitions

Definition: For an angle θ in standard position with point P(x, y) on the terminal side lying on the Unit Circle:

cosine: cos(θ) = x

sine: sin(θ) = y

secant: sec(θ) = 1/x (provided x ≠ 0)

cosecant: csc(θ) = 1/y (provided y ≠ 0)

tangent: tan(θ) = y/x (provided x ≠ 0)

cotangent: cot(θ) = x/y (provided y ≠ 0)

Only cosine and sine are defined for all angles.
The other four functions are undefined when their denominators equal zero.
Example: tan(θ) is undefined when x = 0, which occurs at θ = π/2 + πk for any integer k.

🔤 Etymology of tangent and secant

Tangent: The word means "to touch." For an acute angle θ, tan(θ) equals the y-coordinate of the point where the terminal side intersects the vertical line x = 1, which is tangent to (touches) the Unit Circle at exactly one point.
Secant: The word means "to cut." For an acute angle θ, sec(θ) is the length of the line segment on the secant line (which cuts through the circle at two points) from the origin to where it intersects the tangent line x = 1.
These geometric interpretations help explain the function names and lead to important inequalities used in calculus.

🔗 Reciprocal and quotient identities

🔗 Expressing four functions in terms of cosine and sine

The excerpt presents these as the Reciprocal and Quotient Identities:

Function	Formula	When undefined
sec(θ)	1/cos(θ)	cos(θ) = 0
csc(θ)	1/sin(θ)	sin(θ) = 0
tan(θ)	sin(θ)/cos(θ)	cos(θ) = 0
cot(θ)	cos(θ)/sin(θ)	sin(θ) = 0

These identities allow you to convert any problem involving the six functions into one involving only cosine and sine.
However, it's not always convenient to do so—when solving tangent or cotangent equations, it's often better to work directly with those functions.

📊 Common angle values for tangent and cotangent

The excerpt provides a table of exact values:

θ (degrees)	θ (radians)	tan(θ)	cot(θ)
0°	0	0	undefined
30°	π/6	√3/3	√3
45°	π/4	1	1
60°	π/3	√3	√3/3
90°	π/2	undefined	0

Memorizing these values is worthwhile for solving problems efficiently.
The pattern shows symmetry: tan and cot are reciprocals when both are defined.

🔢 Pythagorean identities

🔢 Three fundamental Pythagorean identities

The excerpt derives three identities by manipulating the basic Pythagorean identity cos²(θ) + sin²(θ) = 1:

Identity 1: cos²(θ) + sin²(θ) = 1

Alternate forms: 1 - sin²(θ) = cos²(θ); 1 - cos²(θ) = sin²(θ)

Identity 2: 1 + tan²(θ) = sec²(θ) (provided cos(θ) ≠ 0)

Derived by dividing Identity 1 by cos²(θ)
Alternate forms: sec²(θ) - tan²(θ) = 1; sec²(θ) - 1 = tan²(θ)

Identity 3: 1 + cot²(θ) = csc²(θ) (provided sin(θ) ≠ 0)

Derived by dividing Identity 1 by sin²(θ)
Alternate forms: csc²(θ) - cot²(θ) = 1; csc²(θ) - 1 = cot²(θ)

🔄 How the identities are derived

Start with cos²(θ) + sin²(θ) = 1.
To get the tangent-secant identity: divide both sides by cos²(θ), then apply reciprocal and quotient identities.
To get the cotangent-cosecant identity: divide both sides by sin²(θ), then apply reciprocal and quotient identities.
These identities are essential for simplifying expressions and verifying more complex identities.

🧮 Solving equations with circular functions

🧮 Using reference angles

The Generalized Reference Angle Theorem states:

The values of circular functions of any angle θ are the same as those of its reference angle α, up to a sign.
The sign depends on which quadrant the terminal side of θ lies in.

Example: To solve tan(θ) = √3:

The reference angle is π/3 (since tan(π/3) = √3).
Tangent is positive in Quadrants I and III (where x and y have the same sign).
Solutions: θ = π/3 + πk for any integer k (this captures both quadrants).

⚠️ Common pitfall when solving

Don't confuse: If tan(θ) = 3, this does NOT mean sin(θ) = 3 and cos(θ) = 1.

Why not?

tan(θ) = sin(θ)/cos(θ) = 3 means sin(θ) = 3cos(θ).
You must use the Pythagorean identity cos²(θ) + sin²(θ) = 1 to find the actual values.
Substitute sin(θ) = 3cos(θ) into the identity: cos²(θ) + (3cos(θ))² = 1.
Solve: cos²(θ) = 1/10, so cos(θ) = ±√10/10.
Then determine the correct sign based on which quadrant θ lies in.

Example from the excerpt: If tan(θ) = 3 and π < θ < 3π/2 (Quadrant III), then both sin(θ) and cos(θ are negative, giving sin(θ) = -3√10/10.

✅ Verifying trigonometric identities

✅ General strategies

The excerpt provides these approaches:

Work on the more complicated side of the equation.
Use reciprocal and quotient identities to express functions in terms of those on the other side.
Find common denominators when adding rational expressions.
Apply Pythagorean identities to exchange functions or simplify sums/differences of squares.
Multiply by Pythagorean conjugates to create differences of squares that simplify.
If stuck, try working from the other side and bridge the two parts.

🔀 Pythagorean conjugates

These are pairs of expressions that, when multiplied, produce a single term via a Pythagorean identity:

Conjugate pair	Product
(1 - cos(θ)) and (1 + cos(θ))	1 - cos²(θ) = sin²(θ)
(1 - sin(θ)) and (1 + sin(θ))	1 - sin²(θ) = cos²(θ)
(sec(θ) - 1) and (sec(θ) + 1)	sec²(θ) - 1 = tan²(θ)
(sec(θ) - tan(θ)) and (sec(θ) + tan(θ))	sec²(θ) - tan²(θ) = 1

Multiplying numerator and denominator by a Pythagorean conjugate is a powerful technique.
This is analogous to rationalizing denominators with radicals (e.g., multiplying by 1 + √2 when you have 1 - √2).

📝 Example verification

To verify sin(θ)/(1 - cos(θ)) = (1 + cos(θ))/sin(θ):

Multiply the left side's numerator and denominator by (1 + cos(θ)).
Numerator becomes: sin(θ)(1 + cos(θ)).
Denominator becomes: (1 - cos(θ))(1 + cos(θ)) = 1 - cos²(θ) = sin²(θ).
Result: sin(θ)(1 + cos(θ))/sin²(θ) = (1 + cos(θ))/sin(θ). ✓

🌐 Beyond the Unit Circle

🌐 Generalizing to any radius

For a point Q(x, y) on a circle of radius r (where x² + y² = r²) on the terminal side of angle θ:

Function	Formula
sec(θ)	r/x = √(x² + y²)/x
csc(θ)	r/y = √(x² + y²)/y
tan(θ)	y/x
cot(θ)	x/y

This extends the definitions beyond the Unit Circle.
Notice that tan(θ) and cot(θ) remain as simple ratios of coordinates, regardless of radius.

📐 Right triangle definitions

For an acute angle θ in a right triangle with adjacent side a, opposite side b, and hypotenuse c:

Function	Formula
tan(θ)	b/a (opposite/adjacent)
sec(θ)	c/a (hypotenuse/adjacent)
csc(θ)	c/b (hypotenuse/opposite)
cot(θ)	a/b (adjacent/opposite)

These are the traditional "SOH-CAH-TOA" extensions.
Useful for applied problems involving angles of inclination (elevation) or depression.

🏗️ Application: angles of inclination

Angle of inclination (or elevation): The angle whose initial side is a baseline (e.g., the ground) and whose terminal side is the line-of-sight to an object above the baseline.

Example from the excerpt: To find the height h of a tower 30 feet away with an angle of inclination of 60°:

tan(60°) = h/30.
h = 30 tan(60°) = 30√3 ≈ 52 feet.

📏 Domains and ranges

📏 Summary of domains and ranges

The excerpt provides detailed domain and range information:

Cosine and sine:

Domain: all real numbers.
Range: [-1, 1].

Secant and cosecant:

Domain: excludes points where cosine or sine equals zero (expressed using extended interval notation).
Range: {u : |u| ≥ 1} = (-∞, -1] ∪ [1, ∞).

Tangent and cotangent:

Domain: excludes points where cosine or sine equals zero.
Range: all real numbers (-∞, ∞).

🔢 Extended interval notation

The excerpt introduces notation like:

Domain of sec(t): ⋃(k=-∞ to ∞) ((2k+1)π/2, (2k+3)π/2)

This means:

The union of infinitely many intervals.
The index k ranges through all integers.
Similar to summation notation, k never actually equals ∞ or -∞; it's a notational device to indicate "all integers."

Trigonometric Identities

10.4 Trigonometric Identities

🧭 Overview

🧠 One-sentence thesis

Trigonometric identities provide systematic ways to transform and simplify expressions involving circular functions, enabling exact computation of function values and revealing relationships between angles and their multiples or halves.

📌 Key points (3–5)

Even/Odd identities: cosine and secant are even (unchanged by negation of angle), while sine, cosecant, tangent, and cotangent are odd (sign flips).
Sum and difference formulas: express cos(α ± β), sin(α ± β), and tan(α ± β) in terms of individual angle functions.
Double and half angle formulas: relate functions of 2θ or θ/2 to functions of θ, enabling computation from a single piece of information.
Power reduction and product-to-sum: convert powers and products of trig functions into sums, essential for calculus applications.
Common confusion: when using half-angle formulas, the ± sign depends on which quadrant θ/2 lies in, not θ itself.

🔄 Even and Odd Identities

🔄 Which functions are even or odd

Even/Odd Identities: For all applicable angles θ:

cos(−θ) = cos(θ) and sec(−θ) = sec(θ) (even)

sin(−θ) = −sin(θ), csc(−θ) = −csc(θ), tan(−θ) = −tan(θ), cot(−θ) = −cot(θ) (odd)

Even functions (cosine, secant): the function value is unchanged when the angle is negated.
Odd functions (sine, cosecant, tangent, cotangent): the function value changes sign when the angle is negated.
The proof relies on symmetry about the x-axis on the unit circle: points P and Q on terminal sides of θ₀ and −θ₀ are symmetric, so cos(−θ₀) = cos(θ₀) and sin(−θ₀) = −sin(θ₀).

🧩 Why these matter

These identities simplify expressions by removing negative signs from angles.
They are building blocks for proving sum and difference formulas.
Example: sin(3π − 2θ) = −sin(2θ − 3π) uses the odd property of sine.

➕ Sum and Difference Identities

➕ Formulas for cosine

Sum and Difference Identities for Cosine:

cos(α + β) = cos(α)cos(β) − sin(α)sin(β)

cos(α − β) = cos(α)cos(β) + sin(α)sin(β)

The proof for the difference formula uses the distance formula on the unit circle: the distance between points P (on terminal side of α₀) and Q (on terminal side of β₀) equals the distance between A (on terminal side of α₀ − β₀) and B (at (1,0)).
Squaring both sides and applying the Pythagorean identity cos²(θ) + sin²(θ) = 1 yields the result.
The sum formula follows by rewriting α + β as α − (−β) and applying even/odd identities.

➕ Formulas for sine and tangent

Sum and Difference Identities for Sine:

sin(α + β) = sin(α)cos(β) + cos(α)sin(β)

sin(α − β) = sin(α)cos(β) − cos(α)sin(β)

Sum and Difference Identities for Tangent:

tan(α ± β) = [tan(α) ± tan(β)] / [1 ∓ tan(α)tan(β)]

The sine sum formula is derived by converting to cosine using a cofunction identity, then expanding using the cosine difference formula.
The tangent formula is derived by writing tan(α + β) = sin(α + β)/cos(α + β), expanding numerator and denominator, then dividing both by cos(α)cos(β).
Example: cos(15°) = cos(45° − 30°) = cos(45°)cos(30°) + sin(45°)sin(30°) = (√2/2)(√3/2) + (√2/2)(1/2) = (√6 + √2)/4.

🔗 Cofunction identities

Cofunction Identities: For all applicable angles θ:

cos(π/2 − θ) = sin(θ) and sin(π/2 − θ) = cos(θ)

sec(π/2 − θ) = csc(θ) and csc(π/2 − θ) = sec(θ)

tan(π/2 − θ) = cot(θ) and cot(π/2 − θ) = tan(θ)

These say that the "co"sine of an angle is the sine of its "co"mplement.
Derived directly from the sum/difference formulas: cos(π/2 − θ) = cos(π/2)cos(θ) + sin(π/2)sin(θ) = 0·cos(θ) + 1·sin(θ) = sin(θ).
The remaining four follow from the first two using quotient and reciprocal identities.

🔢 Double Angle Identities

🔢 Formulas for 2θ

Double Angle Identities: For all applicable angles θ:

cos(2θ) = cos²(θ) − sin²(θ) = 2cos²(θ) − 1 = 1 − 2sin²(θ)

sin(2θ) = 2sin(θ)cos(θ)

tan(2θ) = 2tan(θ) / [1 − tan²(θ)]

These are special cases of the sum formulas when α = β.
The three forms for cos(2θ) come from substituting sin²(θ) = 1 − cos²(θ) or cos²(θ) = 1 − sin²(θ) using the Pythagorean identity.
Key observation: to find cos(2θ), you need only cos(θ) or sin(θ); to find sin(2θ), it appears you need both, but you can derive one from the other using Pythagorean identities.

🧮 Using double angle formulas

Example: If P(−3, 4) lies on the terminal side of θ in standard position, then r = 5, so cos(θ) = −3/5 and sin(θ) = 4/5. Then cos(2θ) = (−3/5)² − (4/5)² = −7/25 and sin(2θ) = 2(4/5)(−3/5) = −24/25.
Since both cos(2θ) and sin(2θ) are negative, 2θ lies in Quadrant III.
The identity sin(2θ) = 2tan(θ)/[1 + tan²(θ)] can be verified by converting tan and sec to sin and cos.

✂️ Half Angle and Power Reduction Formulas

✂️ Half angle formulas

Half Angle Formulas: For all applicable angles θ:

cos(θ/2) = ±√[(1 + cos(θ))/2]

sin(θ/2) = ±√[(1 − cos(θ))/2]

tan(θ/2) = ±√[(1 − cos(θ))/(1 + cos(θ))]

The choice of ± depends on the quadrant in which the terminal side of θ/2 lies.

Derived from power reduction formulas by substituting θ/2 for θ and taking square roots.
Don't confuse: the sign is determined by θ/2, not θ. Example: if −π ≤ θ ≤ 0, then −π/2 ≤ θ/2 ≤ 0, so sin(θ/2) < 0.
Example: cos(15°) = +√[(1 + cos(30°))/2] = √[(1 + √3/2)/2] = √(2 + √3)/2 (positive because 15° is in Quadrant I).
An alternative formula: tan(θ/2) = sin(θ)/[1 + cos(θ)] can be derived by manipulating the identity sin(2θ) = 2tan(θ)/[1 + tan²(θ)].

📉 Power reduction formulas

Power Reduction Formulas: For all angles θ:

cos²(θ) = [1 + cos(2θ)]/2

sin²(θ) = [1 − cos(2θ)]/2

Derived by solving the double angle formulas cos(2θ) = 2cos²(θ) − 1 and cos(2θ) = 1 − 2sin²(θ) for cos²(θ) and sin²(θ).
Used in calculus to reduce powers of sine and cosine.
Example: sin²(θ)cos²(θ) = [(1 − cos(2θ))/2][(1 + cos(2θ))/2] = [1 − cos²(2θ)]/4 = 1/4 − (1/4)cos²(2θ). Applying power reduction again: = 1/4 − (1/4)[(1 + cos(4θ))/2] = 1/8 − (1/8)cos(4θ).

🔀 Product-to-Sum and Sum-to-Product Formulas

🔀 Product to sum

Product to Sum Formulas: For all angles α and β:

cos(α)cos(β) = (1/2)[cos(α − β) + cos(α + β)]

sin(α)sin(β) = (1/2)[cos(α − β) − cos(α + β)]

sin(α)cos(β) = (1/2)[sin(α − β) + sin(α + β)]

Verified by expanding the right-hand side using sum and difference formulas.
Used in calculus and have historical significance (Prosthaphaeresis Formulas).
Example: cos(2θ)cos(6θ) = (1/2)[cos(2θ − 6θ) + cos(2θ + 6θ)] = (1/2)cos(−4θ) + (1/2)cos(8θ) = (1/2)cos(4θ) + (1/2)cos(8θ) (using even identity for cosine).

🔀 Sum to product

Sum to Product Formulas: For all angles α and β:

cos(α) + cos(β) = 2cos[(α + β)/2]cos[(α − β)/2]

cos(α) − cos(β) = −2sin[(α + β)/2]sin[(α − β)/2]

sin(α) ± sin(β) = 2sin[(α ± β)/2]cos[(α ∓ β)/2]

Verified using the product-to-sum formulas.
Example: sin(θ) − sin(3θ) = 2sin[(θ − 3θ)/2]cos[(θ + 3θ)/2] = 2sin(−θ)cos(2θ) = −2sin(θ)cos(2θ) (using odd identity for sine).

🧮 Expressing cos(nθ) as polynomials

cos(3θ) can be expressed as a polynomial in cos(θ): cos(3θ) = 4cos³(θ) − 3cos(θ).
Derived by writing cos(3θ) = cos(2θ + θ), expanding using sum formulas, substituting double angle formulas, then using Pythagorean identities to eliminate sin²(θ).
This technique extends to higher multiples: cos(4θ) = 8cos⁴(θ) − 8cos²(θ) + 1.

Graphs of the Trigonometric Functions

10.5 Graphs of the Trigonometric Functions

🧭 Overview

🧠 One-sentence thesis

The graphs of the six trigonometric functions—cosine, sine, secant, cosecant, tangent, and cotangent—exhibit wavelike or periodic behavior that can be completely characterized by properties such as period, amplitude, phase shift, and vertical shift, allowing us to graph transformations systematically.

📌 Key points (3–5)

Periodicity is central: All six trigonometric functions repeat their values at regular intervals, with cosine, sine, secant, and cosecant having period 2π, while tangent and cotangent have period π.
Sinusoids (transformed cosine/sine) are characterized by four quantities: period, amplitude, phase shift, and vertical shift, which can be read directly from the formula.
Reciprocal relationships: secant is 1/cosine and cosecant is 1/sine, so their graphs have vertical asymptotes wherever the denominator is zero.
Common confusion—phase shift vs. starting point: the phase shift formula tells you where one particular cycle begins, but because the function is periodic, you can graph any complete cycle; the "quarter marks" method and the Theorem 10.23 method may highlight different cycles.
Even/odd symmetry and smoothness: cosine and secant are even, sine/cosecant/tangent/cotangent are odd, and all six are continuous and smooth on their domains.

📐 Cosine and sine: the foundation

📐 Basic properties

The excerpt reviews that f(t) = cos(t) and g(t) = sin(t) share:

Domain: all real numbers (−∞, ∞).
Range: [−1, 1].
Both are continuous and smooth (no jumps, gaps, asymptotes, or corners).
Even vs. odd: cos(−t) = cos(t) (even function), sin(−t) = −sin(t) (odd function).

Periodic function: A function f is periodic if there exists a real number c so that f(t + c) = f(t) for all t in the domain. The period is the smallest positive p for which f(t + p) = f(t) for all t.

Both cosine and sine have period 2π: cos(t + 2π) = cos(t) and sin(t + 2π) = sin(t).
The excerpt proves that 2π is the smallest such number by showing that if cos(t + p) = cos(t) for all t, then cos(p) = 1, which forces p to be a multiple of 2π.

Why this matters: Because the period is 2π, we can understand the entire graph by studying one interval of length 2π, typically [0, 2π].

📊 The fundamental cycle

The excerpt constructs the graph of y = cos(x) by plotting "quarter marks" at x = 0, π/4, π/2, 3π/4, π, 5π/4, 3π/2, 7π/4, 2π and connecting the dots in a wavelike fashion.

The result is one complete wave from x = 0 to x = 2π, called the fundamental cycle.
The full graph is obtained by "copying and pasting" this cycle infinitely in both directions.
The graph is described as wavelike.

Similarly, y = sin(x) is graphed using the same quarter marks, yielding a similar wave.

Relationship: sin(x) = cos(x − π/2), so the sine graph is the cosine graph shifted right by π/2 units.

Don't confuse: The "fundamental cycle" is just one period; the actual function is defined for all real numbers and repeats this cycle infinitely.

🎢 Sinusoids: transformed cosine and sine

🎢 What is a sinusoid?

A sinusoid is the result of taking the basic graph of f(x) = cos(x) or g(x) = sin(x) and applying transformations (shifts, stretches, reflections).

The excerpt introduces Theorem 10.23: For ω > 0, the functions

C(x) = A cos(ωx + φ) + B
S(x) = A sin(ωx + φ) + B

have:

Period: 2π/ω
Amplitude: |A|
Phase shift: −φ/ω (horizontal shift)
Vertical shift: B

How to use this:

Rewrite the function in the form above (factor out ω from the argument).
Read off A, ω, φ, B.
Calculate period, amplitude, phase shift, vertical shift.
Use these to sketch the graph.

Example from the excerpt: f(x) = 3 cos(πx − π/2) + 1 is rewritten as 3 cos(π/2 · x + (−π/2)) + 1, so A = 3, ω = π/2, φ = −π/2, B = 1. Then period = 2π/(π/2) = 4, amplitude = 3, phase shift = −(−π/2)/(π/2) = 1 (right 1 unit), vertical shift = 1 (up 1 unit).

🔄 The "quarter marks" method

An alternative graphing approach:

Set the argument of the trig function equal to the five quarter marks: 0, π/2, π, 3π/2, 2π.
Solve for x.
Substitute these x-values into the function to find y-values.
Plot and connect the dots.

This method always produces one complete cycle, even if the phase shift from Theorem 10.23 points to a different starting x-value.

Common confusion: The phase shift formula and the quarter marks method may highlight different cycles. For example, if you use the odd property of sine to rewrite g(x) = (1/2) sin(π − 2x) + 3/2 as g(x) = −(1/2) sin(2x − π) + 3/2, the phase shift is π/2 (right), but the quarter marks method may graph a cycle ending at x = π/2 instead of starting there. Both are correct—they just show different portions of the same periodic function.

🧮 Fitting a graph to a formula

Given a graph of one cycle, you can find a formula:

Period: length of the interval for one cycle → solve 2π/ω = period for ω.
Amplitude: half the distance from the maximum to the minimum.
Vertical shift: average of the maximum and minimum y-values.
Phase shift: identify where the cycle starts (for cosine, typically at a maximum; for sine, at the midline crossing upward).

Example from the excerpt: A cycle graphed from x = −1 to x = 5 has period 6, so ω = π/3. The range is [−3/2, 5/2], so amplitude = (5/2 − (−3/2))/2 = 2 and vertical shift = (5/2 + (−3/2))/2 = 1/2. The phase shift is −1 (the cycle starts at x = −1), so −φ/ω = −1 → φ = π/3. Answer: C(x) = 2 cos(π/3 · x + π/3) + 1/2.

🔀 Combining sine and cosine

The excerpt shows that f(x) = a cos(ωx) + b sin(ωx) + B is also a sinusoid. Using sum identities:

C(x) = A cos(ωx + φ) + B expands to A cos(ωx) cos(φ) − A sin(ωx) sin(φ) + B.
By matching coefficients, you can find A and φ from a and b.

Example: f(x) = cos(2x) − √3 sin(2x). Match with A cos(2x) cos(φ) − A sin(2x) sin(φ):

A cos(φ) = 1 and A sin(φ) = √3.
Square and add: A² = 1 + 3 = 4 → A = 2.
Then cos(φ) = 1/2 and sin(φ) = √3/2 → φ = π/3.
Answer: f(x) = 2 cos(2x + π/3).

Important: This technique only works if the arguments of cosine and sine match (e.g., both are 2x). If they differ (e.g., cos(2x) − √3 sin(3x)), the function is not a sinusoid.

🔺 Secant and cosecant: reciprocal functions

🔺 Secant: y = sec(x)

Since sec(x) = 1/cos(x), the graph is constructed by taking reciprocals of cosine values.

Vertical asymptotes occur where cos(x) = 0, i.e., at x = π/2 + πk for any integer k.
Domain: all x except odd multiples of π/2.
Range: (−∞, −1] ∪ [1, ∞) (the reciprocal of values in [−1, 1] excluding 0).
Period: 2π (inherited from cosine).
Even function: sec(−x) = sec(x).
Continuous and smooth on its domain.

The excerpt graphs one fundamental cycle over [0, 2π], showing vertical asymptotes at x = π/2 and x = 3π/2, with the graph approaching ±∞ near these lines.

🔺 Cosecant: y = csc(x)

Since csc(x) = 1/sin(x):

Vertical asymptotes at x = πk (where sin(x) = 0).
Domain: all x except integer multiples of π.
Range: (−∞, −1] ∪ [1, ∞).
Period: 2π.
Odd function: csc(−x) = −csc(x).
Continuous and smooth on its domain.

The fundamental cycle is graphed over [0, 2π] with asymptotes at x = 0, π, 2π.

Graphing strategy: For transformations like f(x) = 1 − 2 sec(2x), use the quarter marks method on the argument (2x) to find key x-values and asymptotes. It helps to sketch the associated cosine or sine curve (dotted) as a reference.

Example from the excerpt: f(x) = 1 − 2 sec(2x). Set 2x = 0, π/2, π, 3π/2, 2π → x = 0, π/4, π/2, 3π/4, π. Asymptotes at x = π/4 and 3π/4 (where cos(2x) = 0). Period = π.

Don't confuse: Secant and cosecant have no amplitude (their ranges are unbounded), but they do have period, phase shift, and vertical shift.

📏 Tangent and cotangent: quotient functions

📏 Tangent: y = tan(x)

Since tan(x) = sin(x)/cos(x):

Vertical asymptotes at x = π/2 + πk (where cos(x) = 0).
Domain: all x except odd multiples of π/2.
Range: (−∞, ∞).
Period: π (not 2π!). The excerpt proves this using the sum formula: tan(x + π) = (tan(x) + tan(π))/(1 − tan(x) tan(π)) = tan(x).
Odd function: tan(−x) = −tan(x).
Continuous and smooth on its domain.

The fundamental cycle is taken as (−π/2, π/2), with quarter marks at x = −π/2, −π/4, 0, π/4, π/2.

📏 Cotangent: y = cot(x)

Since cot(x) = cos(x)/sin(x):

Vertical asymptotes at x = πk (where sin(x) = 0).
Domain: all x except integer multiples of π.
Range: (−∞, ∞).
Period: π.
Odd function: cot(−x) = −cot(x).
Continuous and smooth on its domain.

The fundamental cycle is (0, π), with quarter marks at x = 0, π/4, π/2, 3π/4, π.

Graphing strategy: Use the quarter marks method on the argument. For example, to graph g(x) = 2 cot(π/2 · x + π) + 1, set π/2 · x + π equal to 0, π/4, π/2, 3π/4, π and solve for x. Asymptotes occur at the endpoints (where the argument is 0 or π).

Example from the excerpt: g(x) = 2 cot(π/2 · x + π) + 1. Solving gives x = −2, −3/2, −1, −1/2, 0. Asymptotes at x = −2 and x = 0. Period = 2.

Don't confuse: Tangent and cotangent have period π, not 2π. Also, they have no amplitude (unbounded range).

📋 Summary tables

Function	Period	Domain exclusions	Range	Even/Odd
cos(x), sin(x)	2π	none	[−1, 1]	even / odd
sec(x), csc(x)	2π	odd π/2 / integer π	(−∞,−1]∪[1,∞)	even / odd
tan(x), cot(x)	π	odd π/2 / integer π	(−∞, ∞)	odd / odd

For sinusoids C(x) = A cos(ωx + φ) + B or S(x) = A sin(ωx + φ) + B (ω > 0):

Period = 2π/ω
Amplitude = |A|
Phase shift = −φ/ω
Vertical shift = B

Key insight: The parameter ω is called the (angular) frequency and counts how many cycles occur over a 2π interval. The excerpt notes that ω must be positive; use even/odd identities to ensure this.

The Inverse Trigonometric Functions

10.6 The Inverse Trigonometric Functions

🧭 Overview

🧠 One-sentence thesis

Inverse trigonometric functions reverse the process of the circular trigonometric functions by taking coordinate values and returning arc lengths (or angles in radians), but because the original functions are periodic and not one-to-one, their domains must be carefully restricted to create valid inverses.

📌 Key points (3–5)

Why restriction is necessary: The six circular trigonometric functions are periodic and therefore not one-to-one, so we must restrict their domains to intervals where they are one-to-one before defining inverses.
Notation and naming: The inverses are named arcsine, arccosine, arctangent, arccotangent, arcsecant, and arccosecant (not using the potentially confusing notation like cos⁻¹(x) which can be mistaken for 1/cos(x)).
The "arc" meaning: The prefix "arc" reflects that these functions take coordinate values on the Unit Circle and return oriented arc lengths (real numbers representing radian measures).
Common confusion—domain restrictions matter: Expressions like arccos(cos(11π/6)) equal π/6, not 11π/6, because the input 11π/6 lies outside the restricted domain [0, π] of the inverse; this is analogous to how √((-2)²) = 2, not -2.
Two competing conventions for arcsecant and arccosecant: There are "Trigonometry Friendly" and "Calculus Friendly" approaches that use different restricted ranges, leading to different answers for negative arguments.

🔧 Restricting domains to create inverses

🔧 Why we need restrictions

The circular trigonometric functions are periodic, so none of them is one-to-one.

A function must be one-to-one to have an inverse.
Periodicity means the same output value repeats infinitely many times, violating the one-to-one requirement.
Solution: restrict the domain to a single interval where the function is one-to-one, smooth, and continuous, while preserving the full range.

🔧 Cosine and sine restrictions

For cosine:

Restrict f(x) = cos(x) to the domain [0, π].
This keeps the range as [-1, 1] and ensures the function is one-to-one.
The inverse is F⁻¹(x) = arccos(x).

For sine:

Restrict g(x) = sin(x) to the domain [-π/2, π/2].
This keeps the range as [-1, 1] and ensures the function is one-to-one.
The inverse is G⁻¹(x) = arcsin(x).

🔧 Tangent and cotangent restrictions

For tangent:

Restrict f(x) = tan(x) to the open interval (-π/2, π/2).
The range is all real numbers (-∞, ∞).
The inverse is F⁻¹(x) = arctan(x), with domain all real numbers.
Vertical asymptotes x = ±π/2 of tangent become horizontal asymptotes y = ±π/2 of arctangent.

For cotangent:

Restrict g(x) = cot(x) to the open interval (0, π).
The range is all real numbers (-∞, ∞).
The inverse is G⁻¹(x) = arccot(x), with domain all real numbers.
Vertical asymptotes x = 0 and x = π of cotangent become horizontal asymptotes y = 0 and y = π of arccotangent.

📐 Properties of arcsine and arccosine

📐 Arccosine properties

arccos(x) = t if and only if 0 ≤ t ≤ π and cos(t) = x.

Domain: [-1, 1]
Range: [0, π]
Composition cancellation: cos(arccos(x)) = x provided -1 ≤ x ≤ 1
Reverse composition: arccos(cos(x)) = x provided 0 ≤ x ≤ π
The range [0, π] corresponds to Quadrants I and II angles.

📐 Arcsine properties

arcsin(x) = t if and only if -π/2 ≤ t ≤ π/2 and sin(t) = x.

Domain: [-1, 1]
Range: [-π/2, π/2]
Composition cancellation: sin(arcsin(x)) = x provided -1 ≤ x ≤ 1
Reverse composition: arcsin(sin(x)) = x provided -π/2 ≤ x ≤ π/2
Odd function: arcsine is odd, meaning arcsin(-x) = -arcsin(x)
The range [-π/2, π/2] corresponds to Quadrants IV and I angles.

📐 Don't confuse: composition order matters

arccos(cos(π/6)) = π/6 because π/6 is in [0, π].
arccos(cos(11π/6)) = π/6, not 11π/6, because 11π/6 is outside [0, π]; the cosine value is √3/2, and the unique angle in [0, π] with that cosine is π/6.
This is exactly like √(x²) = |x|, not x; the square root "undoes" the square only when the result stays in the range [0, ∞).

📏 Properties of arctangent and arccotangent

📏 Arctangent properties

arctan(x) = t if and only if -π/2 < t < π/2 and tan(t) = x.

Domain: all real numbers (-∞, ∞)
Range: (-π/2, π/2)
Horizontal asymptotes: as x → -∞, arctan(x) → -π/2⁺; as x → ∞, arctan(x) → π/2⁻
Composition cancellation: tan(arctan(x)) = x for all real x
Reverse composition: arctan(tan(x)) = x provided -π/2 < x < π/2
Odd function: arctangent is odd
Relationship: arctan(x) = arccot(1/x) for x > 0

📏 Arccotangent properties

arccot(x) = t if and only if 0 < t < π and cot(t) = x.

Domain: all real numbers (-∞, ∞)
Range: (0, π)
Horizontal asymptotes: as x → -∞, arccot(x) → π⁻; as x → ∞, arccot(x) → 0⁺
Composition cancellation: cot(arccot(x)) = x for all real x
Reverse composition: arccot(cot(x)) = x provided 0 < x < π
Relationship: arccot(x) = arctan(1/x) for x > 0

🔀 Two approaches to arcsecant and arccosecant

🔀 The problem with secant and cosecant

Unlike the other four functions, we cannot find one single continuous piece of the secant or cosecant graph that covers the entire range ((-∞, -1] ∪ [1, ∞)) while being one-to-one.
We must use a piecewise approach: one piece covers [1, ∞) and another covers (-∞, -1].
Two different conventions exist, leading to different answers for negative arguments.

🔀 Trigonometry Friendly approach

Arcsecant (Trig Friendly):

Restrict sec(x) to [0, π/2) ∪ (π/2, π]
Range of arcsec(x): [0, π/2) ∪ (π/2, π]
This matches the restriction used for arccosine.
Relationship: arcsec(x) = arccos(1/x) for |x| ≥ 1

Arccosecant (Trig Friendly):

Restrict csc(x) to [-π/2, 0) ∪ (0, π/2]
Range of arccsc(x): [-π/2, 0) ∪ (0, π/2]
This matches the restriction used for arcsine.
Relationship: arccsc(x) = arcsin(1/x) for |x| ≥ 1
Odd function: arccosecant is odd

Example: arccsc(-2) = arcsin(-1/2) = -π/6 (Trig Friendly)

🔀 Calculus Friendly approach

Arcsecant (Calc Friendly):

Restrict sec(x) to [0, π/2) ∪ [π, 3π/2)
Range of arcsec(x): [0, π/2) ∪ [π, 3π/2)
Relationship: arcsec(x) = arccos(1/x) only for x ≥ 1 (not for x ≤ -1)

Arccosecant (Calc Friendly):

Restrict csc(x) to (0, π/2] ∪ (π, 3π/2]
Range of arccsc(x): (0, π/2] ∪ (π, 3π/2]
Relationship: arccsc(x) = arcsin(1/x) only for x ≥ 1 (not for x ≤ -1)

Example: arccsc(-2) = 7π/6 (Calc Friendly), which is a Quadrant III angle.

🔀 Don't confuse: which convention is being used?

For positive arguments, both conventions give the same answer.
For negative arguments, the answers differ significantly.
Always check which convention your textbook or calculator uses.
The excerpt presents both approaches and notes that the Calculus Friendly approach "makes the Calculus easier, but the Trigonometry less so."

🧮 Computing with inverse functions

🧮 Evaluating at "common angles"

When the argument corresponds to a common angle value:

arccos(1/2) = π/3 because cos(π/3) = 1/2 and π/3 is in [0, π].
arcsin(-1/2) = -π/6 because sin(-π/6) = -1/2 and -π/6 is in [-π/2, π/2].
arctan(√3) = π/3 because tan(π/3) = √3 and π/3 is in (-π/2, π/2).
arccot(-√3) = 5π/6 because cot(5π/6) = -√3 and 5π/6 is in (0, π).

🧮 Simplifying compositions

Direct cancellation (when domains align):

cos(arccos(-3/5)) = -3/5 because -3/5 is in [-1, 1].
sin(arcsin(3/5)) = 3/5 because 3/5 is in [-1, 1].

Using identities when functions differ: To find sin(arccos(-3/5)):

Let t = arccos(-3/5), so cos(t) = -3/5 where 0 ≤ t ≤ π.
Since cos(t) < 0, we know π/2 < t < π (Quadrant II).
Use the Pythagorean identity: cos²(t) + sin²(t) = 1.
Substitute: (-3/5)² + sin²(t) = 1, so sin²(t) = 16/25.
Since t is in Quadrant II, sin(t) > 0, so sin(t) = 4/5.
Therefore sin(arccos(-3/5)) = 4/5.

🧮 Rewriting as algebraic expressions

To express tan(arccos(x)) in terms of x:

Let t = arccos(x), so cos(t) = x where 0 ≤ t ≤ π.
We need tan(t), but tan(π/2) is undefined, so exclude t = π/2, meaning x ≠ 0.
Use tan(t) = sin(t)/cos(t) and the Pythagorean identity.
From cos²(t) + sin²(t) = 1, we get sin(t) = ±√(1 - x²).
For 0 ≤ t < π/2 (Quadrant I), sin(t) ≥ 0; for π/2 < t ≤ π (Quadrant II), sin(t) ≥ 0.
So sin(t) = √(1 - x²) for both cases.
Therefore tan(arccos(x)) = √(1 - x²)/x, valid for x in [-1, 0) ∪ (0, 1].

Don't confuse: Even if the final algebraic expression (like 1 - 2x²) is defined for all real numbers, the equivalence is valid only where the original inverse function is defined (e.g., [-1, 1] for arcsin).

🖩 Calculator techniques

🖩 Direct evaluation

Most calculators have buttons for arcsin, arccos, and arctan (often labeled sin⁻¹, cos⁻¹, tan⁻¹).

Make sure the calculator is in radian mode for these problems.
Example: arctan(1/2) ≈ 0.4636 radians.

🖩 Arccotangent with positive argument

For arccot(2) where 2 > 0:

Use the relationship arccot(2) = arctan(1/2).
Result: arccot(2) ≈ 0.4636.

🖩 Arccotangent with negative argument

For arccot(-2):

Cannot directly use arctan(-1/2) because that gives a Quadrant IV angle.
Method 1 (reference angle): Let α = arccot(2) = arctan(1/2). The angle θ = arccot(-2) is in Quadrant II, so θ = π - α = π - arctan(1/2) ≈ 2.6779.
Method 2 (shift): Note that arctan(-1/2) gives a Quadrant IV angle β. The Quadrant II angle is θ = π + arctan(-1/2) ≈ 2.6779.

🖩 Arcsecant and arccosecant

For positive arguments:

arcsec(5) = arccos(1/5) ≈ 1.3694 (both conventions agree).

For negative arguments (depends on convention):

Trig Friendly: arccsc(-3/2) = arcsin(-2/3) ≈ -0.7297.
Calc Friendly: arccsc(-3/2) requires finding the Quadrant III angle. Let α = arccsc(3/2) = arcsin(2/3). Then θ = π + α = π + arcsin(2/3) ≈ 3.8713.

🔍 Solving equations with inverse functions

🔍 When the answer isn't a "common angle"

Just as x² = 7 has solutions x = ±√7 (not "friendly" integers), equations like sin(θ) = 1/3 have solutions that must be expressed using inverse functions.

🔍 Finding all solutions to sin(θ) = 1/3

The terminal side of θ intersects the Unit Circle at y = 1/3, which occurs in Quadrants I and II.
Let α = arcsin(1/3) be the acute (Quadrant I) solution.
Quadrant I solutions: θ = α + 2πk = arcsin(1/3) + 2πk for integers k.
Quadrant II solutions: θ = π - α + 2πk = π - arcsin(1/3) + 2πk for integers k.

🔍 Finding all solutions to tan(t) = -2

Tangent is negative in Quadrants II and IV.
Let β = arctan(-2), which gives a Quadrant IV angle (since -π/2 < β < 0).
Quadrant IV solutions are coterminal with β.
Quadrant II solutions differ by exactly π from Quadrant IV solutions.
All solutions: t = β + πk = arctan(-2) + πk for integers k.

🔍 Finding all solutions to sec(x) = -5/3

Convert to cosine: cos(x) = -3/5.
Solutions lie in Quadrants II and III (where cosine is negative).
Let β = arccos(-3/5), which is a Quadrant II angle (since π/2 < β < π).
A Quadrant III solution is -β = -arccos(-3/5).
All solutions: x = arccos(-3/5) + 2πk or x = -arccos(-3/5) + 2πk for integers k.

🔍 Checking your answers

Substitute back into the original equation analytically.
Use a calculator to verify numerically (convert to decimal approximations).
Remember that there are infinitely many solutions due to periodicity; the "+2πk" or "+πk" captures all of them.

📊 Domain and range of transformed inverse functions

📊 Finding the domain

To find the domain of f(x) = (π/2) - arccos(x/5):

The domain of arccos(x) is -1 ≤ x ≤ 1.
Set the argument between -1 and 1: -1 ≤ x/5 ≤ 1.
Solve: -5 ≤ x ≤ 5.
Domain: [-5, 5].

📊 Finding the range using transformations

Track key points and asymptotes through the transformations (following the procedure from earlier sections on function transformations).

Example: For f(x) = 3arctan(4x):

The domain of arctan(x) is all real numbers, and 4x is defined everywhere, so the domain of f is all real numbers.
The range of arctan(x) is (-π/2, π/2), bounded by horizontal asymptotes.
The transformation involves a horizontal compression (factor 4) and vertical stretch (factor 3).
The vertical stretch affects the range: multiply the asymptotes by 3.
Range: (-3π/2, 3π/2).

📊 Graphing arccotangent on a calculator

Since many calculators lack an arccot button, rewrite using arctan:

For x > 0: arccot(x/2) = arctan(2/x).
For x < 0: arccot(x/2) = π + arctan(2/x).
For x = 0: arccot(0) = π/2.
This requires entering a piecewise-defined function into the calculator.

🏗️ Applications

🏗️ Angle of inclination

A roof has a "6/12 pitch," meaning rise = 6 feet over run = 12 feet.

The roof line forms the hypotenuse of a right triangle with legs 6 and 12.
The angle of inclination θ satisfies tan(θ) = 6/12 = 1/2.
Since θ is acute, θ = arctan(1/2) radians ≈ 26.56°.

🏗️ Rewriting sums as sinusoids

To rewrite f(x) = 5sin(3x) + 12cos(3x) as a single sinusoid:

Use identities to express as A sin(ωx + φ).
The phase shift φ involves an inverse trigonometric function.
Example result: f(x) = 13sin(3x + φ) where φ = arctan(12/5) ≈ 1.1760 radians.

🏗️ Don't confuse: degrees vs radians

arcsin(1/2) = π/6 radians, which equals 30°.
But arcsin(1/2) ≠ 30° as a number; it equals approximately 0.5236 as a number.
Always be clear about units when working with inverse trigonometric functions.

Trigonometric Equations and Inequalities

10.7 Trigonometric Equations and Inequalities

🧭 Overview

🧠 One-sentence thesis

Solving trigonometric equations and inequalities requires combining basic solution strategies for standard forms with algebraic techniques, identities, and sign diagrams to handle more complex cases involving multiple functions, different arguments, or inverse trigonometric functions.

📌 Key points (3–5)

Basic strategy: Solve standard forms (sin(u) = c, cos(u) = c, tan(u) = c) by finding solutions in a reference interval and adding integer multiples of the period.
Handling different arguments: When the argument is not just x (e.g., sin(3x) = c), solve for the argument first, then divide through to isolate x, remembering to divide the period term as well.
Multiple functions or arguments: Use identities to convert equations to a single function with matching arguments, then factor and solve—never divide by a trigonometric expression without checking for lost solutions.
Common confusion: When solving inequalities, use sign diagrams just as with polynomial inequalities; the continuity of trigonometric functions on their domains makes this technique valid.
Inverse functions: To solve equations with inverse trig functions, isolate the inverse function, verify the value is in its range, then apply the appropriate inverse property.

🔧 Basic Solution Techniques

🔧 Standard forms and their solutions

Standard equation forms: cos(u) = c or sin(u) = c for −1 ≤ c ≤ 1; sec(u) = c or csc(u) = c for c ≤ −1 or c ≥ 1; tan(u) = c for any real c; cot(u) = c for any real c.

How to solve each type:

Sine and cosine: Find u in [0, 2π) and add integer multiples of 2π (the period).
Secant and cosecant: Convert to cosine or sine using reciprocal identities, then solve as above.
Tangent: Find u in (−π/2, π/2) and add integer multiples of π (the period).
Cotangent: Convert to tangent for c ≠ 0; if c = 0, the solution is u = π/2 + πk.

Important: If c is outside the valid range (e.g., sin(u) = 2), there are no real solutions.

🎯 Solving with modified arguments

When the argument is not simply x (e.g., sin(3x) = 1/2):

Recognize the form sin(u) = 1/2 where u = 3x.
Solve for u: u = π/6 + 2πk or u = 5π/6 + 2πk.
Substitute back: 3x = π/6 + 2πk or 3x = 5π/6 + 2πk.
Divide everything by 3: x = π/18 + (2π/3)k or x = 5π/18 + (2π/3)k.

Don't confuse: The period term (2πk) must also be divided by the coefficient—this is a common error.

✅ Checking and restricting solutions

Analytical check: Substitute the solution family back into the original equation and verify using periodicity.
Finding solutions in [0, 2π): Substitute integer values k = 0, 1, 2, ... and k = −1, −2, ... until solutions fall outside the interval.
Graphical verification: Graph both sides of the equation and confirm intersection points match the decimal approximations of exact answers.

🧮 Advanced Equation Solving

🧮 Equations with powers or products

When you see squares or higher powers:

Extract square roots carefully: sec²(x) = 4 becomes sec(x) = ±2, creating two equations to solve.
Factor, don't divide: For 3sin³(x) = sin²(x), move everything to one side and factor: sin²(x)(3sin(x) − 1) = 0.
Why not divide? Dividing by sin²(x) would lose the solution sin(x) = 0.

Example: From sin²(x)(3sin(x) − 1) = 0, we get sin(x) = 0 or sin(x) = 1/3, yielding multiple solution families.

🔄 Using identities to match functions

When different functions appear (e.g., sec²(x) = tan(x) + 3):

Use a Pythagorean identity: sec²(x) = 1 + tan²(x).
Substitute: 1 + tan²(x) = tan(x) + 3.
Rearrange: tan²(x) − tan(x) − 2 = 0.
Let u = tan(x) and factor: (u + 1)(u − 2) = 0.
Solve: tan(x) = −1 or tan(x) = 2.

When arguments differ (e.g., cos(2x) = 3cos(x) − 2):

Use a double-angle identity: cos(2x) = 2cos²(x) − 1.
This creates a "quadratic in disguise": 2cos²(x) − 3cos(x) + 1 = 0.
Factor and solve as a polynomial in cos(x).

🎭 Sum-to-product and other special techniques

For equations like cos(3x) = cos(5x):

Rewrite as cos(5x) − cos(3x) = 0.
Apply sum-to-product identity: −2sin(4x)sin(x) = 0.
This factors the equation, giving sin(4x) = 0 or sin(x) = 0.
The presence of zero on one side makes product form advantageous.

For linear combinations (e.g., cos(x) − √3·sin(x) = 2):

Recognize this can be rewritten as a single sinusoid: A·sin(ωx + φ) + B.
Use the technique from Section 10.5 to find amplitude and phase shift.
Solve the simpler equation in standard form.

📉 Solving Inequalities

📉 Sign diagram method

Key principle: Trigonometric functions are continuous on their domains, so the sign diagram technique applies.

Steps:

Move all terms to one side: f(x) ≤ 0 or f(x) > 0.
Find where f is undefined (e.g., tan(x) at x = π/2 + πk).
Find where f(x) = 0 by solving the corresponding equation.
Mark these points on a number line for the interval of interest.
Choose test values in each interval and evaluate f.
Select intervals where f has the desired sign.

Example: For 2sin(x) ≤ 1 on [0, 2π):

Rewrite as 2sin(x) − 1 ≤ 0.
Zeros: sin(x) = 1/2 gives x = π/6, 5π/6.
Test intervals: f(0) = −1 (negative), f(π/2) = 1 (positive), f(π) = −1 (negative).
Solution: [0, π/6] ∪ [5π/6, 2π).

📊 Graphical verification

Graph both sides of the inequality as separate functions.
Identify where one graph is above/below the other.
Confirm the intervals match the analytical solution.

Don't confuse: The endpoint x = 2π is not included in [0, 2π), even if it satisfies the inequality.

🎯 Domain Problems and Inverse Functions

🎯 Finding domains using equations

To find the domain of f(x) = csc(2x + π/3):

Rewrite using sine: f(x) = 1/sin(2x + π/3).
Sine is defined everywhere, so only zeros in the denominator matter.
Solve sin(2x + π/3) = 0: get x = −π/6 + (π/2)k.
Express in extended interval notation by identifying the pattern of excluded points.

For periodic functions:

Find excluded points in one period interval.
Express the domain for that interval.
Add integer multiples of the period to capture all intervals.

Example: If the domain on [π/3, 7π/3] is (π/3, 5π/3) ∪ (5π/3, 7π/3), then the full domain is the union of these intervals shifted by 2πk for all integers k.

🔁 Inverse trigonometric equations

Basic approach:

Isolate the inverse function: arcsin(2x) = π/3.
Check the range: Is π/3 in [−π/2, π/2]? Yes, so a solution exists.
Apply the inverse property: sin(arcsin(2x)) = sin(π/3).
Simplify: 2x = √3/2, so x = √3/4.

For equations with squares (e.g., 4arctan²(x) − 3π·arctan(x) − π² = 0):

Substitute u = arctan(x) to get a polynomial in u.
Solve for u, then check which values lie in the range of arctangent.
Only values in the range produce valid solutions.

🔍 Inverse function inequalities

Use sign diagrams just as with regular inequalities:

Let f(x) equal the expression and find its domain (restricted by the inverse function).
Find zeros by solving f(x) = 0.
Test intervals and determine where f has the required sign.

Example: For π²/4 − 4arccos²(x) < 0 on [−1, 1]:

Zero: arccos(x) = π/2 gives x = 0.
Test: f(−1) < 0, f(1) > 0.
Solution: [−1, 0).

Applications of Sinusoids

11.1 Applications of Sinusoids

🧭 Overview

🧠 One-sentence thesis

Sinusoidal functions model a wide range of time-dependent natural phenomena—from circular motion and daylight variation to harmonic oscillation of springs—by capturing periodic behavior through amplitude, frequency, phase, and baseline parameters.

📌 Key points (3–5)

Standard form and parameters: A sinusoid S(t) = A sin(ωt + φ) + B encodes amplitude |A|, angular frequency ω, phase φ, and vertical shift B, each with physical meaning.
Circular motion application: The height or horizontal position of a rotating object (e.g., Ferris wheel, yo-yo) can be modeled by adjusting amplitude (radius), frequency (rotation speed), and phase (starting position).
Fitting sinusoids to data: Real-world periodic data (e.g., daylight hours, temperature) can be approximated by estimating baseline (average of max/min), amplitude (displacement from baseline), period (cycle length), and phase shift (horizontal offset).
Harmonic motion on springs: Free undamped motion of a mass on a spring follows x(t) = A sin(ωt + φ) where ω depends on spring constant k and mass m; initial displacement and velocity determine A and φ.
Common confusion—phase vs phase shift: Phase φ is the angle at t = 0; phase shift is −φ/ω, representing the horizontal time offset from the standard sine wave.

📐 Sinusoid structure and meaning

📐 The standard form

Sinusoid: S(t) = A sin(ωt + φ) + B for ω > 0, where t represents time.

Amplitude |A|: maximum displacement from the baseline.
Angular frequency ω: how many cycles occur over an interval of length 2π.
Ordinary frequency f = ω/(2π): cycles per unit time.
Period T = 1/f = 2π/ω: time to complete one full cycle.
Phase φ: the angle corresponding to t = 0.
Phase shift −φ/ω: horizontal time offset; how much "head start" the sinusoid has.
Vertical shift (baseline) B: the center line around which oscillation occurs.

🔍 Physical interpretation

Amplitude measures how far the wave swings from its center.
Period tells how long one complete oscillation takes.
Frequency counts oscillations per second (or per unit time).
Phase shift indicates whether the wave starts ahead or behind the standard sine curve.
Example: If a Ferris wheel has radius 64 feet and completes one revolution in 127 seconds, amplitude = 64 ft and period T = 127 s, so ω = 2π/127.

🎡 Modeling circular motion

🎡 Ferris wheel height

A passenger on a Ferris wheel traces a circle; height above ground oscillates sinusoidally.
Setup: Diameter 128 ft → radius r = 64 ft; platform 8 ft high → center at 72 ft; period T = 127/2 s (two revolutions in 127 s) → ω = 4π/127.
Baseline formula: y = 64 sin(4π t / 127) describes vertical position on a circle centered at origin.
Adjust for real height: Add 72 to account for center height → h = 64 sin(4π t / 127) + 72.
Adjust starting point: If t = 0 is at the lowest point (not the rightmost), subtract π/2 from the angle → h(t) = 64 sin(4π t / 127 − π/2) + 72.
Don't confuse: The phase shift −(−π/2)/(4π/127) = 127/8 ≈ 15.875 seconds is the time delay introduced by starting at the bottom instead of the side.

🎢 Horizontal displacement

Similarly, horizontal position relative to the center can be modeled using cosine (or sine with different phase).
Example: x(t) = r cos(ωt) if t = 0 corresponds to the rightmost point; adjust phase if starting elsewhere.

📊 Fitting sinusoids to real data

📊 Manual fitting process

Baseline B: average of maximum and minimum data values.
Amplitude A: difference between maximum (or minimum) and baseline.
Period T: length of one complete cycle in the data; then ω = 2π/T.
Phase shift: identify the first t where the function crosses the baseline (if A > 0); set −φ/ω equal to that t, solve for φ.

🌞 Daylight hours example

Data: hours of daylight in Fairbanks, Alaska over 12 months.
Max = 21.8 hours, min = 3.3 hours → B = (21.8 + 3.3)/2 = 12.55.
Amplitude A = 21.8 − 12.55 = 9.25.
Period T = 12 months → ω = 2π/12 = π/6.
Baseline crossing near t = 3 → phase shift = 3 → φ = −3ω = −π/2.
Result: H(t) = 9.25 sin(π t / 6 − π/2) + 12.55.
Don't confuse: Even though data spans [1, 12], the period is 12 because each month represents a sample over a full month interval.

🖩 Calculator regression

Graphing calculators have a "SinReg" feature that fits a sinusoid automatically.
Manual fitting and calculator regression may differ slightly; both should be checked against the data visually.
Unlike linear or polynomial regression, sinusoidal regression does not always provide an R² goodness-of-fit value.

🔧 Harmonic motion on springs

🔧 Physics background

Mass vs weight: Mass (slugs in English, kg in SI) measures resistance to motion; weight (pounds or Newtons) is gravitational force.
Conversion: w = mg, where g = 32 ft/s² (English) or 9.8 m/s² (SI).
Hooke's Law: F = kd relates applied force F to spring stretch d via spring constant k.
Equilibrium: position where spring force balances weight.

🔧 Equation of motion

Free undamped harmonic motion: x(t) = A sin(ωt + φ), where ω = √(k/m), A = √(x₀² + (v₀/ω)²), A sin(φ) = x₀, and Aω cos(φ) = v₀.

x(t) = displacement from equilibrium at time t.
x(t) > 0 means below equilibrium; x(t) < 0 means above equilibrium.
x₀ = initial displacement; v₀ = initial velocity.
v₀ < 0 means upward velocity; v₀ > 0 means downward.

🧪 Solving spring problems

Example: 64 lb object stretches spring 8 ft.

Spring constant: F = kd → 64 = k · 8 → k = 8 lb/ft.
Mass: w = mg → 64 = m · 32 → m = 2 slugs.
Angular frequency: ω = √(8/2) = 2 rad/s.

Case 1: Released 3 ft below equilibrium from rest (x₀ = 3, v₀ = 0).

A = √(9 + 0) = 3.
3 sin(φ) = 3 → sin(φ) = 1 → φ = π/2.
Motion: x(t) = 3 sin(2t + π/2).
First equilibrium crossing: solve 3 sin(2t + π/2) = 0 → t = π/4 ≈ 0.78 s, heading upward.

Case 2: Released 3 ft below with upward velocity 8 ft/s (x₀ = 3, v₀ = −8).

A = √(9 + 16) = 5.
5 sin(φ) = 3 and 10 cos(φ) = −8 → φ is Quadrant II → φ = π − arcsin(3/5).
Maximum height above equilibrium: 5 ft (the amplitude).

🌊 Advanced motion phenomena

🌊 Damped motion

Form: x(t) = A(t) sin(ωt + φ) where A(t) = Ce^(−kt) (exponential decay).
Example: x(t) = 10e^(−t/5) sin(t + π/3).
The amplitude shrinks over time due to friction (air resistance).
Object oscillates forever but with decreasing swing.

🌊 Forced motion and resonance

Form: x(t) = A(t) sin(ωt + φ) where A(t) grows, e.g., A(t) = 2(t + 3).
Occurs when external oscillation frequency matches the system's natural frequency.
Amplitude grows without bound, leading to potentially destructive effects.

🌊 Beats

Sum of two sinusoids with different frequencies: x(t) = 5 sin(6t) − 5 sin(8t).
Period: find ratio of frequencies 6/8 = 3/4 → period is 3 times the period of the first (or 4 times the second) → T = π.
Rewrite using sum-to-product: x(t) = −10 sin(t) cos(7t).
The lower-frequency factor −10 sin(t) creates a "beat envelope" modulating the higher-frequency oscillation.
Don't confuse: Beats occur when frequencies differ; resonance occurs when they match.

🔢 Sign conventions and units

🔢 Displacement and velocity signs

Spring motion: x > 0 is below equilibrium (natural falling direction); x < 0 is above.
Velocity: v > 0 is downward; v < 0 is upward (opposite to gravity).
These conventions align with physics: gravity pulls down (positive direction), spring pulls up (negative).

🔢 Unit consistency

English system: distance in feet, mass in slugs, force in pounds, time in seconds.
SI system: distance in meters, mass in kilograms, force in Newtons, time in seconds.
Always check dimensional analysis: ω has units 1/s, A has units of length.

The Law of Sines

11.2 The Law of Sines

🧭 Overview

🧠 One-sentence thesis

The Law of Sines provides a method to solve triangles by relating the ratios of sides to the sines of their opposite angles, though certain configurations (especially the Angle-Side-Side case) can produce zero, one, or two valid triangles.

📌 Key points (3–5)

What the Law of Sines states: For any triangle, the ratio of each side to the sine of its opposite angle is constant: sin(α)/a = sin(β)/b = sin(γ)/c.
When you can use it: You need at least one angle-side opposite pair to apply the Law of Sines.
The Angle-Side-Side (ASS) ambiguity: When given one angle and two sides (only one adjacent to the angle), you may get no triangle, exactly one triangle, or two distinct triangles depending on the side lengths.
Common confusion: The ASS case vs. other cases—knowing two angles (AAS or ASA) always yields exactly one triangle, but ASS requires checking whether the given side is long enough and whether it creates one or two solutions.
Area formula bonus: The Law of Sines proof leads to area formulas A = (1/2)bc·sin(α) = (1/2)ac·sin(β) = (1/2)ab·sin(γ).

📐 What the Law of Sines says and proves

📐 The theorem statement

Law of Sines: Given a triangle with angle-side opposite pairs (α, a), (β, b), and (γ, c), the following ratios hold: sin(α)/a = sin(β)/b = sin(γ)/c, or equivalently a/sin(α) = b/sin(β) = c/sin(γ).

Angle-side opposite pairs: The convention is that lowercase Greek letters denote angles and the corresponding lowercase English letters represent the side opposite that angle.
Example: α is an angle, and a is the side opposite α.

🔍 How the proof works

The proof divides into three cases based on triangle shape:

Case	Method	Key insight
All acute angles	Drop altitude from one vertex, creating two right triangles	The altitude h satisfies h = c·sin(α) = a·sin(γ), leading to sin(α)/a = sin(γ)/c
One obtuse angle	Drop altitude from vertex, use supplementary angle identity	For obtuse α, use sin(α') = sin(α) where α' = 180° - α
Right triangle	Direct application	Reduces to the definitions from Theorem 10.4

The proof uses altitudes to create right triangles where basic sine definitions apply.
Don't confuse: The proof handles obtuse angles by recognizing that sin(180° - α) = sin(α).

🔢 Using the Law of Sines: straightforward cases

🔢 Angle-Angle-Side (AAS) case

When you know two angles and one side not between them:

First find the third angle using the fact that angles sum to 180°.
Use the Law of Sines with the known angle-side pair to find the remaining sides.
Example from the excerpt: α = 120°, a = 7, β = 45° → find γ = 15°, then use sin(β)/b = sin(α)/a to get b.

🔢 Angle-Side-Angle (ASA) case

When you know two angles and the side between them:

Find the third angle (sum to 180°).
The given side forms an angle-side opposite pair with the angle opposite to it.
Use the Law of Sines to find the other two sides.
Example from the excerpt: α = 85°, β = 30°, c = 5.25 → find γ = 65°, then solve for a and b.

Why these work: Knowing two angles completely determines the triangle's shape; adding one side determines its size, so exactly one triangle exists.

⚠️ The ambiguous Angle-Side-Side (ASS) case

⚠️ The four possible outcomes

Theorem 11.3: Suppose (α, a) and (γ, c) are intended angle-side pairs where α, a, and c are given. Let h = c·sin(α). Then:

If a < h, no triangle exists.

If a = h, exactly one right triangle exists (γ = 90°).

If h < a < c, two distinct triangles exist.

If a ≥ c, exactly one triangle exists (γ is acute).

⚠️ Why the ambiguity happens

When you use the Law of Sines to find sin(γ), you may get a value between 0 and 1 that corresponds to two angles: an acute angle γ₀ and its supplement 180° - γ₀.
Both angles might "fit" into a triangle with the given angle α, creating two valid solutions.
Example from the excerpt: α = 30°, a = 3, c = 4 → sin(γ) = 2/3, which gives γ ≈ 41.81° or γ ≈ 138.19°, both producing valid triangles.

⚠️ How to check each case

Case 1: a < h (too short)

The side a is literally too short to reach and form a triangle.
Algebraically, sin(γ) = c·sin(α)/a > 1, which is impossible.
Example: α = 30°, a = 1, c = 4 → h = 4·sin(30°) = 2, so a < h means no triangle.

Case 2: a = h (just right for a right triangle)

The side a is exactly long enough to form a right angle at γ.
sin(γ) = 1, so γ = 90°.
Example: α = 30°, a = 2, c = 4 → h = 2, so a = h gives one right triangle.

Case 3: h < a < c (two triangles)

The side a is long enough but shorter than c, allowing it to "swing" and touch in two places.
sin(γ) < 1, giving two solutions for γ in the valid range.
Both γ₀ and 180° - γ₀ can coexist with α in a triangle.
Example: α = 30°, a = 3, c = 4 → two distinct triangles result.

Case 4: a ≥ c (one triangle, acute γ)

Since a ≥ c, we have α ≥ γ, forcing γ to be acute.
Only one angle satisfies sin(γ) = c·sin(α)/a in the acute range.
Example: α = 30°, a = 4, c = 4 → exactly one triangle with γ = 30°.

Don't confuse: If you start with an obtuse angle, the two-triangle case is impossible—there's no room for a second configuration.

🧮 Area formulas from the Law of Sines

🧮 The area theorem

Theorem 11.4: For a triangle with angle-side opposite pairs (α, a), (β, b), (γ, c), the area A is: A = (1/2)bc·sin(α) = (1/2)ac·sin(β) = (1/2)ab·sin(γ)

🧮 Why this works

The proof uses the same altitude constructions as the Law of Sines proof.
For an altitude h from a vertex, the area is (1/2)·base·height.
The altitude relates to the sine of an angle: h = (adjacent side)·sin(angle).
Example from the excerpt: For α = 120°, a = 7, c ≈ 2.09, β = 45°, use A = (1/2)ac·sin(β) ≈ 5.18 square units.

Why square units: You multiply (1/2) (dimensionless) × side (length) × side (length) × sin(angle) (dimensionless) = length².

🧮 Minimizing error

Use the formula that incorporates the most given (not derived) information.
Avoid using calculated values when possible to prevent propagated error.
Example: If α, a, and β are given but c is calculated, prefer A = (1/2)ac·sin(β) over formulas using only derived quantities.

🧭 Practical application: navigation and bearings

🧭 Real-world triangle solving

The excerpt includes an application to Sasquatch Island:

Two observation points 5 miles apart on a straight shore.
Angles from shore to island: 30° at first point, 45° at second point.
Solution approach: Use supplementary angles to find all angles in the triangle, then apply Law of Sines to find distances.
Result: Distance from second point to island d ≈ 9.66 miles; perpendicular distance to shore ≈ 6.83 miles.

Key technique: When angles are given relative to a baseline (like a shoreline), use supplementary or complementary angle relationships to find the triangle's interior angles before applying the Law of Sines.

🧭 Minimizing error in multi-step problems

Always use original given data when possible.
Example: To find distance y from island to shore, use the calculated distance d and the given 45° angle: y = d·sin(45°), rather than chaining multiple derived values.
This approach "strives to solve for quantities using the original data given in the problem whenever possible" to minimize propagated error.

The Law of Cosines

11.3 The Law of Cosines

🧭 Overview

🧠 One-sentence thesis

The Law of Cosines extends the Pythagorean Theorem to all triangles and enables solving triangles when given two sides and the included angle (SAS) or all three sides (SSS).

📌 Key points (3–5)

What it solves: The Law of Cosines handles Side-Angle-Side (SAS) and Side-Side-Side (SSS) triangle cases that the Law of Sines cannot address.
The formulas: For any triangle with angle-side opposite pairs (α, a), (β, b), (γ, c), the law states a² = b² + c² − 2bc cos(α) and similar equations for the other sides.
Relationship to Pythagorean Theorem: When γ = 90°, cos(γ) = 0, and the Law of Cosines reduces to c² = a² + b², showing it generalizes the Pythagorean Theorem.
Common confusion: When using derived values to find remaining angles, the Law of Cosines is safer than the Law of Sines because cosine distinguishes acute from obtuse angles (positive vs negative), while sine is positive for both.
Practical outcome: The law enables real-world distance measurements and leads to Heron's Formula for triangle area using only the three side lengths.

📐 The Law of Cosines formulas

📐 Three equivalent forms

Law of Cosines: Given a triangle with angle-side opposite pairs (α, a), (β, b), and (γ, c):

a² = b² + c² − 2bc cos(α)

b² = a² + c² − 2ac cos(β)

c² = a² + b² − 2ab cos(γ)

Each formula relates one side to the other two sides and the angle opposite the first side.
The angle in the formula must be the one opposite the side being solved for, and the other two sides must be adjacent to that angle.

📐 Solving for angles

The formulas can be rearranged to solve for the cosine of an angle:

cos(α) = (b² + c² − a²) / (2bc)
cos(β) = (a² + c² − b²) / (2ac)
cos(γ) = (a² + b² − c²) / (2ab)

Then use arccosine to find the angle measure.

🔗 Connection to the Pythagorean Theorem

🔗 A generalization, not a replacement

The Pythagorean Theorem is a special case of the Law of Cosines.
When γ = 90°, cos(90°) = 0, so the term −2ab cos(γ) vanishes.
This leaves c² = a² + b², the familiar Pythagorean relationship.
Implication: In a larger mathematical sense, the Law of Cosines and the Pythagorean Theorem "amount to pretty much the same thing"—both trace back to the distance formula.

🔗 Why it works for all triangles

The proof places a triangle vertex at the origin with one side along the positive x-axis.
Using circular coordinates, the third vertex is at (c cos(α), c sin(α)).
The distance formula then yields the Law of Cosines after applying the identity cos²(α) + sin²(α) = 1.
The proof holds whether α is acute, right, or obtuse.

🧮 Solving triangles with the Law of Cosines

🧮 Side-Angle-Side (SAS) case

Given: two sides and the included angle (the angle between them).
Strategy: Use the Law of Cosines to find the third side.
Example: Given β = 50°, a = 7, c = 2, compute b² = 7² + 2² − 2(7)(2) cos(50°), yielding b ≈ 5.92 units.

🧮 Side-Side-Side (SSS) case

Given: all three sides, no angles.
Strategy: Use the Law of Cosines to find one angle at a time.
Best practice: Find the largest angle first (opposite the longest side) because if there is an obtuse angle, it will be the largest.
Example: Given a = 4, b = 7, c = 5, find β first using cos(β) = (a² + c² − b²) / (2ac) = −1/5, so β ≈ 101.54°.

🧮 Finding remaining angles

After finding one side or angle, you have two choices:

Continue with the Law of Cosines: Safer because cosine's sign tells you if the angle is acute (positive) or obtuse (negative).
Switch to the Law of Sines: Faster but ambiguous—sine is positive for both acute and obtuse angles, so you must be careful to find the smallest unknown angle first to ensure it is acute.

Don't confuse: The Law of Sines is ambiguous in the ASS case; the Law of Cosines avoids this ambiguity because the cosine function distinguishes angle types.

🌍 Real-world applications

🌍 Measuring inaccessible distances

A researcher measures distances from a point P to two ends of a pond: 950 feet and 1000 feet, with a 60° angle between the lines of sight.
Using the Law of Cosines: w² = 950² + 1000² − 2(950)(1000) cos(60°) = 952500, so w ≈ 976 feet.
This technique works whenever you can measure two distances and the included angle but cannot measure the target distance directly.

🌍 Navigation and bearings

The excerpt includes exercises involving bearings (compass directions) and distances traveled by helicopters, ships, and hikers.
The Law of Cosines helps find the straight-line distance between two points when the path taken involves a change of direction.
Example scenario: A helicopter flies 192 miles on one bearing, then 207 miles on another bearing; the angle between the two paths determines the straight-line distance between endpoints.

📏 Heron's Formula for area

📏 Area from three sides only

Heron's Formula: If a triangle has sides a, b, c and semiperimeter s = (a + b + c)/2, then the area A is: A = √[s(s − a)(s − b)(s − c)]

This formula requires no angle measurements, only the three side lengths.
The semiperimeter s is half the perimeter.

📏 Derivation outline

Start with the area formula A = (1/2) ab sin(γ) from the Law of Sines section.
Square both sides and use the identity sin²(γ) = 1 − cos²(γ).
Substitute cos(γ) from the Law of Cosines: cos(γ) = (a² + b² − c²) / (2ab).
After algebraic manipulation (difference of squares, perfect square trinomials), the expression simplifies to A² = s(s − a)(s − b)(s − c).
Taking the square root yields Heron's Formula.

📏 Example calculation

Given a = 4, b = 7, c = 5:

Semiperimeter: s = (4 + 7 + 5)/2 = 8
Differences: (s − a) = 4, (s − b) = 1, (s − c) = 3
Area: A = √(8 · 4 · 1 · 3) = √96 = 4√6 ≈ 9.80 square units

Don't confuse: Heron's Formula multiplies four lengths but yields square units because the semiperimeter and the three differences combine algebraically to produce an area.

⚠️ Practical considerations

⚠️ Rounding and error propagation

When you use a calculated (approximate) value in a subsequent calculation, rounding errors accumulate.
Best practice: To minimize error propagation, use the Law of Cosines with the original given data whenever possible, rather than relying on previously derived approximate values.
Example: After finding b ≈ 5.92, you could find the remaining angles using only the original values a = 7, c = 2, and β = 50°, plus the exact expression for b.

⚠️ Checking your work

The approximate angle measures in a triangle should sum to 180° (or very close, like 180.01° due to rounding).
If the sum is far from 180°, recheck calculations.
The excerpt notes that "depending on how many decimal places are carried through successive calculations, and depending on which approach is used to solve the problem, the approximate answers you obtain may differ slightly."

⚠️ When no triangle exists

In the SSS case, if the Law of Cosines yields a cosine value outside [−1, 1], the given information does not produce a valid triangle.
In the SAS case, the given information always produces exactly one triangle (assuming positive side lengths and an angle between 0° and 180°).
The excerpt mentions that applying the Law of Cosines to the ambiguous ASS case results in a quadratic equation; the number of positive real solutions (zero, one, or two) tells you how many triangles are formed.

Polar Coordinates

11.4 Polar Coordinates

🧭 Overview

🧠 One-sentence thesis

Polar coordinates provide an alternative to Cartesian coordinates by locating points using a directed distance from a central pole and an angle of rotation from a fixed axis, allowing the same point to be represented by infinitely many coordinate pairs.

📌 Key points (3–5)

What polar coordinates measure: a directed distance r from the pole (origin) and an angle θ of rotation from the polar axis.
Multiple representations: unlike Cartesian coordinates where one point has one unique pair, a single point in polar coordinates can be expressed infinitely many ways by varying r (positive/negative) and θ (adding multiples of 2π or π).
Negative r means opposite direction: when r < 0, you move in the opposite direction along the polar axis before rotating.
Common confusion: the same point can have r > 0 or r < 0 representations; for example, (4, 5π/6) and (−4, 11π/6) describe different coordinate pairs but the same geometric location.
Conversion formulas: x = r cos(θ), y = r sin(θ), r² = x² + y², and tan(θ) = y/x link polar and rectangular systems.

📍 How polar coordinates work

📍 The polar coordinate system setup

Pole: the origin point in polar coordinates.

Polar axis: a ray (starting line) from the pole, analogous to the positive x-axis in Cartesian coordinates.

A point P is located using the pair (r, θ).
r represents a directed distance from the pole (can be positive or negative).
θ represents the angle of rotation from the polar axis (positive = counter-clockwise, negative = clockwise).

📍 Plotting a point (r, θ)

Two equivalent visualization methods:

Method 1: Distance first, then rotate

Start at the pole, move r units along the polar axis (left if r < 0).
Then rotate through angle θ.

Method 2: Rotate first, then move

Rotate θ from the polar axis.
Then move outward r units from the pole (or inward if r < 0).

Example: To plot (4, 5π/6), either move 4 units out then rotate 5π/6 counter-clockwise, or rotate 5π/6 first then move out 4 units—both reach the same point on the terminal side of 5π/6 that is 4 units from the pole.

🔄 When r is negative

r < 0 means you move in the opposite direction on the polar axis before rotating.
Example: (−3.5, π/4) means move 3.5 units to the left of the pole, then rotate π/4. Equivalently, the point lies 3.5 units from the pole on the terminal side of 5π/4 (not π/4).

🔄 When θ is negative

θ < 0 means clockwise rotation instead of counter-clockwise.
Example: (3.5, −3π/4) means rotate 3π/4 clockwise, then move out 3.5 units.

🔁 Multiple representations of the same point

🔁 Why one point has infinitely many polar coordinates

Unlike Cartesian coordinates where (a, b) and (c, d) are the same point only if a = c and b = d, polar coordinates allow infinitely many representations of the same geometric location.

You can add any multiple of 2π to θ (full rotations return to the same direction).
You can switch the sign of r and adjust θ by π (moving in the opposite direction then rotating π radians lands you at the same spot).

🔁 Equivalent representations property

Equivalent Representations: Suppose (r, θ) and (r′, θ′) are polar coordinates where r ≠ 0, r′ ≠ 0, and angles are in radians. Then (r, θ) and (r′, θ′) determine the same point P if and only if one of the following holds:

r′ = r and θ′ = θ + 2πk for some integer k, or

r′ = −r and θ′ = θ + (2k + 1)π for some integer k.

All coordinates of the form (0, θ) represent the pole, regardless of θ.

🔁 Finding alternate representations

Given a point in polar coordinates, you can find other representations by:

Same distance, coterminal angle: keep r the same, add or subtract multiples of 2π to θ.
Opposite distance, shifted angle: change r to −r, and adjust θ by an odd multiple of π.

Example: The point (2, 240°) can also be written as (2, −120°) (coterminal angle) or (−2, 60°) (opposite direction, shifted by 180°).

Don't confuse: (2, 240°) and (−2, 60°) look like different ordered pairs, but they represent the same geometric point in the plane.

🔄 Converting between polar and rectangular coordinates

🔄 Conversion formulas

Theorem 11.7: If point P has rectangular coordinates (x, y) and polar coordinates (r, θ), then:

x = r cos(θ) and y = r sin(θ)

x² + y² = r²

tan(θ) = y/x (provided x ≠ 0)

These formulas follow from the definition of sine and cosine on the unit circle, extended to any radius r.

🔄 Polar to rectangular

Substitute r and θ into x = r cos(θ) and y = r sin(θ).
Example: (4, 5π/3) converts to x = 4 cos(5π/3) = 4·(1/2) = 2 and y = 4 sin(5π/3) = 4·(−√3/2) = −2√3, giving rectangular coordinates (2, −2√3).

🔄 Rectangular to polar

Use r² = x² + y² to find r (choose r ≥ 0 if required).
Use tan(θ) = y/x to find θ, paying attention to the quadrant.
Example: (−3, −3) lies in Quadrant III. r² = 9 + 9 = 18, so r = 3√2. tan(θ) = 1 gives reference angle π/4; in Quadrant III, θ = 5π/4. Answer: (3√2, 5π/4).

Important: Always plot the point first to determine the correct quadrant for θ. The arctangent function alone does not tell you the quadrant.

🔄 Special cases

Point on an axis: For (0, −3) on the negative y-axis, r = 3 and θ = 3π/2 (no calculation needed, use geometry).
Non-standard angles: If tan(θ) is not a common value, use θ = arctan(y/x) adjusted for the correct quadrant. Example: (−3, 4) in Quadrant II gives θ = π − arctan(4/3).

🔁 Converting equations between coordinate systems

🔁 Rectangular to polar equations

Strategy: Replace every x with r cos(θ) and every y with r sin(θ), then simplify using identities.

Example: (x − 3)² + y² = 9 becomes (r cos(θ) − 3)² + (r sin(θ))² = 9. Expanding and using cos²(θ) + sin²(θ) = 1 gives r² − 6r cos(θ) = 0, which factors to r(r − 6 cos(θ)) = 0. Since r = 0 is just the pole (already included in r = 6 cos(θ) when θ = π/2), the answer is r = 6 cos(θ).
Example: y = −x becomes r sin(θ) = −r cos(θ). Factoring gives r(sin(θ) + cos(θ)) = 0. Geometrically, y = −x is a line through the origin, which in polar form is simply θ = −π/4 (the variable r is free, meaning it can be any value, tracing out the entire line).

Don't confuse: r = 0 describes only the pole, but θ = (constant) describes an entire line through the pole.

🔁 Polar to rectangular equations

Strategy: Rearrange the polar equation to make r² = x² + y², r cos(θ) = x, r sin(θ) = y, or tan(θ) = y/x appear, then substitute.

Example: r = −3. Square both sides: r² = 9. Substitute r² = x² + y² to get x² + y² = 9.
- Note: Squaring introduces r = ±3, but any point (3, θ) can be rewritten as (−3, θ + π), so no extra points are added.
Example: θ = 4π/3. Take tangent of both sides: tan(θ) = tan(4π/3) = √3. Substitute tan(θ) = y/x to get y/x = √3, or y = x√3.
Example: r = 1 − cos(θ). Multiply both sides by r: r² = r − r cos(θ). Substitute r² = x² + y² and r cos(θ) = x to get x² + y² = r + x. To eliminate the remaining r, note r = √(x² + y²), so square both sides: x² + y² = (x² + y² + x)². The excerpt verifies that this squaring does not introduce extraneous points (any solution to the squared equation has an equivalent representation satisfying the original).

🔁 Why conversion matters

Some equations are simpler in one system than the other.
Example: y = x² (a parabola) converts to the complicated polar form r = sec(θ) tan(θ), while r = 1 − cos(θ) (a cardioid) converts to the complicated rectangular form x² + y² = (x² + y² + x)².
Choosing the right coordinate system makes graphing and analysis much easier.

Markdown notes complete.

Graphs of Polar Equations

11.5 Graphs of Polar Equations

🧭 Overview

🧠 One-sentence thesis

Graphing polar equations requires understanding how the directed distance r and angle θ work together to trace curves in the xy-plane, and finding intersection points demands careful attention to the infinitely many representations of each point.

📌 Key points (3–5)

Fundamental principle: A point P(r, θ) is on the graph if some representation of P satisfies the equation—not necessarily the representation you first think of.
Graphing strategy: Plot r as a function of θ in the θr-plane first, then use that to visualize how the curve sweeps out in the xy-plane as θ varies.
Negative r values: When r < 0, the curve is plotted in the opposite direction from where θ points, which can cause curves to appear in unexpected quadrants.
Common confusion—intersections: Two curves can intersect at a point even when no single (r, θ) satisfies both equations simultaneously, because the same point has infinitely many polar representations.
Period vs. complete graph: The period of the function f(θ) does not always match the interval of θ needed to trace the entire curve—some curves require longer intervals, others shorter.

📐 Basic polar graphs

📐 Constant r equations

The graph of r = a (where a ≠ 0) is a circle centered at the origin with radius |a|.

The angle θ is free, so every angle at the same distance from the origin is included.
Example: r = 4 traces all points 4 units from the origin, forming a circle of radius 4.
Example: r = −3√2 also gives a circle of radius 3√2, because the absolute value determines the radius.

📐 Constant θ equations

The graph of θ = α is the line containing the terminal side of α in standard position.

The variable r is free, so all directed distances along that angle are included.
Positive r values extend in the direction of θ; negative r values extend in the opposite direction.
Example: θ = 5π/4 traces the line through the origin at angle 5π/4.
Example: θ = −3π/2 (which is equivalent to π/2) traces the y-axis.

🎨 Graphing technique: the θr-plane method

🎨 Why use the θr-plane

Treat θ as the independent variable and r as the dependent variable, just like y = f(x).
Plot one cycle of r = f(θ) on the θr-plane to see how r changes as θ increases.
This intermediate graph helps you visualize where the curve pulls toward or away from the origin in the xy-plane.

🎨 Translating from θr to xy

Divide the θ interval into subintervals (e.g., [0, π/2], [π/2, π], etc.) to track the curve piece by piece.
For each subinterval, observe whether r is increasing, decreasing, positive, or negative.
Positive r: the curve extends in the direction of θ.
Negative r: the curve extends in the opposite direction (rotate θ by π).
Arrows in the θr-plane correspond to directed distances from the origin in the xy-plane.

🎨 Handling negative r

When r crosses zero, the curve passes through the origin (the pole).
The line θ = (value where r = 0) often serves as a tangent to the curve at the origin.
Example: For r = 2 + 4cos(θ), setting r = 0 gives cos(θ) = −1/2, so θ = 2π/3 and θ = 4π/3. The curve hugs these lines as it passes through the origin.

🌹 Common polar curve families

🌹 Circles through the origin

Equations like r = 6cos(θ) or r = 2sin(θ) produce circles.
The curve starts at the origin, sweeps out to a maximum distance, and returns to the origin.
Example: r = 6cos(θ) is a circle centered at (3, 0) in rectangular coordinates with radius 3.

🌹 Roses

Equations of the form r = a sin(nθ) or r = a cos(nθ) produce "petals."
The number of petals depends on n (the excerpt shows examples with n = 2, 3, 4).
Example: r = 5sin(2θ) has petals that sweep out as θ ranges from 0 to π, then additional petals appear as θ continues from π to 2π.

🌹 Cardioids

Equations like r = a ± a sin(θ) or r = a ± a cos(θ) produce heart-shaped curves.
The curve starts at the origin (or a maximum distance) and sweeps around to return.
Example: r = 2 − 2sin(θ) is a cardioid.

🌹 Limaçons

Equations like r = a ± b sin(θ) or r = a ± b cos(θ) (where a ≠ b) produce limaçons.
Depending on the ratio of a to b, the limaçon may have an inner loop, a dimple, or be convex.
Example: r = 2 + 4cos(θ) has an inner loop because the coefficient of cos(θ) is larger than the constant term.

🌹 Lemniscates

Equations like r² = a² cos(2θ) or r² = a² sin(2θ) produce figure-eight shapes.
Solving for r gives r = ±√(a² cos(2θ)), which is only defined when the expression under the square root is non-negative.
Example: r² = 16cos(2θ) is undefined when cos(2θ) < 0, so the curve has gaps.

🔍 Finding intersection points

🔍 The challenge of multiple representations

A single point in the plane has infinitely many polar representations: (r, θ), (r, θ + 2πk), (−r, θ + (2k+1)π) for any integer k.
Two curves can intersect at a point even if no single (r, θ) satisfies both equations.
You must check multiple representations to find all intersections.

🔍 Step 1: Sketch and equate

Graph both equations to estimate the number and location of intersections.
Equate the two expressions for r and solve for θ to find points with a common representation.
Example: For r = 2sin(θ) and r = 2 − 2sin(θ), equating gives 2sin(θ) = 2 − 2sin(θ), so sin(θ) = 1/2, yielding θ = π/6 and θ = 5π/6.

🔍 Step 2: Check the pole separately

The origin (pole) can be represented as (0, θ) for any θ.
Check if each curve passes through the origin, even if at different angles.
Example: r = 2sin(θ) reaches the origin when θ = 0 or θ = π; r = 2 − 2sin(θ) reaches it when θ = π/2. The origin is an intersection point, but it is never reached simultaneously.

🔍 Step 3: Substitute θ + 2πk

Replace θ with θ + 2πk in one equation (but not both) and solve again.
This checks if a point on one curve at angle θ matches a point on the other curve at angle θ + 2πk.
Often this reduces to the same equation as Step 1, yielding no new solutions.

🔍 Step 4: Substitute (−r, θ + (2k+1)π)

Replace r with −r and θ with θ + (2k+1)π in one equation (but not both).
This checks if a point (r, θ) on one curve corresponds to (−r, θ + π) on the other.
Example: For r = 3 and r = 6cos(2θ), substituting gives −r = 6cos(2(θ + π)) = 6cos(2θ), so r = −6cos(2θ). Solving −6cos(2θ) = 3 gives cos(2θ) = −1/2, yielding four more intersection points.

🔍 Don't confuse: same point vs. same representation

(3, π/3) and (−3, π/3) are different points in the plane.
But (3, π/3) and (−3, 4π/3) are the same point, because rotating 4π/3 from the negative x-axis with distance 3 lands at the same location as rotating π/3 from the positive x-axis with distance 3.

🗺️ Describing regions in polar coordinates

🗺️ Set-builder notation

Regions are described as {(r, θ) | conditions on r and θ}.
The conditions specify bounds on the directed distance r and the angle θ.
Example: {(r, θ) | 0 ≤ r ≤ 5sin(2θ), 0 ≤ θ ≤ π/2} is the region between the origin and the rose petal in the first quadrant.

🗺️ Interpreting inequalities

0 ≤ r ≤ f(θ): all points from the origin out to the curve r = f(θ) as θ varies.
a ≤ r ≤ b: all points at directed distances between a and b from the origin.
f(θ) ≤ r ≤ 0: when f(θ) is negative, this describes points between the curve and the origin, plotted in the opposite direction from θ.
Example: {(r, θ) | 2 + 4cos(θ) ≤ r ≤ 0, 2π/3 ≤ θ ≤ 4π/3} describes the inner loop of the limaçon r = 2 + 4cos(θ).

🗺️ Union of regions

The symbol ∪ combines two or more regions.
Example: {(r, θ) | 0 ≤ r ≤ 2sin(θ), 0 ≤ θ ≤ π/6} ∪ {(r, θ) | 0 ≤ r ≤ 2 − 2sin(θ), π/6 ≤ θ ≤ π/2} describes a region that switches from one bounding curve to another at θ = π/6.

⚠️ Important observations

⚠️ Period vs. complete graph

The period of f(θ) is the smallest interval over which f repeats.
The complete graph may require a longer or shorter interval of θ than one period.
Example: r = 6cos(θ) has period 2π, but the complete graph is traced as θ runs from 0 to π only.
Example: r = 5sin(2θ) has period π, but the complete graph requires θ from 0 to 2π.

⚠️ Symmetry

Many polar curves exhibit symmetry about the x-axis, y-axis, or origin.
Symmetry can help reduce the work needed to graph or find intersections.
The excerpt notes that classifying symmetry in polar coordinates is more subtle than in rectangular coordinates.

⚠️ Equivalent equations

Two different polar equations can describe the same curve.
Example: r = 3sin(θ/2) and r = 3cos(θ/2) produce the same set of points in the plane, even though they are different functions of θ.
This happens because every point on one curve has a representation that satisfies the other equation.

Hooked on Conics Again

11.6 Hooked on Conics Again

🧭 Overview

🧠 One-sentence thesis

This section unifies the study of conic sections by introducing axis rotation formulas and polar coordinate representations that reveal how all conics—parabolas, ellipses, and hyperbolas—can be described through a single framework involving eccentricity and directrix.

📌 Key points (3–5)

Rotation of axes: Any equation with an xy term can be transformed into standard form by rotating the coordinate axes through a specific angle θ.
Discriminant test: The value B² − 4AC determines the conic type without rotation—positive means hyperbola, zero means parabola, negative means ellipse.
Polar form unification: All conics can be expressed as r = ed/(1 − e cos(θ − φ)) where e (eccentricity) determines the shape: e = 0 (circle), 0 < e < 1 (ellipse), e = 1 (parabola), e > 1 (hyperbola).
Common confusion: The coefficients of x² and y² alone don't determine the conic type when an xy term is present—you must check the discriminant or eliminate the xy term first.
Focus and directrix: Every conic is the set of points P where (distance to focus)/(distance to directrix) = e, unifying the definitions from Chapter 7.

🔄 Rotation of axes formulas

🔄 Converting between rotated coordinates

Rotation of Axes Theorem: When axes are rotated counter-clockwise through angle θ, coordinates P(x, y) and P(x′, y′) are related by:

x = x′ cos(θ) − y′ sin(θ)

y = x′ sin(θ) + y′ cos(θ)

And inversely:

x′ = x cos(θ) + y sin(θ)

y′ = −x sin(θ) + y cos(θ)

These formulas come from converting both coordinate systems to polar form with the same radius r but different angles.
The forward equations (x, y in terms of x′, y′) make it easy to substitute into an equation and transform it.
The inverse equations let you convert points from the original system to the rotated system.

🧮 How to find the rotation angle

When you have an equation Ax² + Bxy + Cy² + Dx + Ey + F = 0 with B ≠ 0:

The angle θ that eliminates the x′y′ term satisfies: cot(2θ) = (A − C)/B
Why this works: Substituting the rotation formulas and setting the coefficient of x′y′ to zero produces this condition.
You can always find at least one acute angle solution.
Example: If A = 5, B = 26, C = 5, then cot(2θ) = 0, so θ = π/4.

📐 Working through a rotation example

Example: To convert 21x² + 10xy√3 + 31y² = 144 with θ = π/3:

Substitute x = (x′/2) − (y′√3/2) and y = (x′√3/2) + (y′/2)
Expand x², xy, and y² in terms of x′ and y′
Collect like terms—the x′y′ term vanishes
Result: 36(x′)² + 16(y′)² = 144, or (x′)²/4 + (y′)²/9 = 1
This is an ellipse in standard form in the rotated system

Don't confuse: The original equation looked complicated with the xy term, but it's just an ellipse tilted at angle π/3.

🔍 Discriminant classification

🔍 The discriminant B² − 4AC

Discriminant Theorem: For Ax² + Bxy + Cy² + Dx + Ey + F = 0 describing a non-degenerate conic:

B² − 4AC > 0 → hyperbola

B² − 4AC = 0 → parabola

B² − 4AC < 0 → ellipse or circle

This quantity is called the discriminant of the conic section.
It tells you the conic type before you rotate the axes.
The discriminant is preserved under rotation: B² − 4AC = −4A′C′ where A′ and C′ are coefficients after rotation.

🎯 Why the discriminant works

The proof shows that after rotating to eliminate the xy term:

The product A′C′ has the opposite sign of B² − 4AC
In the rotated form A′(x′)² + C′(y′)² + ... = 0, the sign of A′C′ determines the conic type
Positive A′C′ means same signs → ellipse
Zero A′C′ means one coefficient is zero → parabola
Negative A′C′ means opposite signs → hyperbola

Example: For 16x² + 24xy + 9y², we get 24² − 4(16)(9) = 576 − 576 = 0, predicting a parabola.

🎯 Polar form of conics

🎯 Unified definition via eccentricity

Conic Section (unified definition): Given a fixed line L (directrix), a point F (focus) not on L, and a positive number e (eccentricity), a conic section is the set of all points P such that:

(distance from P to F)/(distance from P to L) = e

This single definition encompasses all conic types through the value of e.
The "old" definitions from Chapter 7 (separate for each conic) are special cases of this unified definition.
The directrix is always perpendicular to the major/transverse axis (or axis of symmetry for parabolas).

📊 Eccentricity determines shape

Eccentricity e	Conic type	Geometric meaning
e = 0	Circle	All points equidistant from center
0 < e < 1	Ellipse	Points closer to focus than directrix (on average)
e = 1	Parabola	Points equidistant from focus and directrix
e > 1	Hyperbola	Points farther from focus than directrix (on average)

🧮 Standard polar equations

With focus at origin (0, 0) and directrix at x = −d:

The polar equation is: r = ed/(1 − e cos(θ))
Different directrix positions give similar forms:
- Directrix x = d: r = ed/(1 + e cos(θ))
- Directrix y = −d: r = ed/(1 − e sin(θ))
- Directrix y = d: r = ed/(1 + e sin(θ))

📏 Dimensions from polar form

For r = ed/(1 − e cos(θ)):

Ellipse (0 < e < 1):

Major axis length: 2ed/(1 − e²)
Minor axis length: 2ed/√(1 − e²)

Parabola (e = 1):

Focal diameter: 2d

Hyperbola (e > 1):

Transverse axis length: 2ed/(e² − 1)
Conjugate axis length: 2ed/√(e² − 1)

🔄 Rotated polar form

The most general polar form includes rotation:

r = ℓ/(1 − e cos(θ − φ))

φ represents the rotation angle
The directrix contains the point with polar coordinates (−d, φ) where d = ℓ/e
All length formulas remain the same as the unrotated case
Example: r = 4/(1 − sin(θ − π/4)) is the parabola r = 4/(1 − sin(θ)) rotated by π/4

🎨 Graphing from polar form

Example: r = 12/(3 − cos(θ)) = 4/(1 − (1/3)cos(θ))

Identify e = 1/3 < 1, so it's an ellipse
Since ed = 4 and e = 1/3, we get d = 12
Form shows directrix is x = −12
Major axis lies along x-axis
Find vertices: r(0) = 6 and r(π) = 3 give points (6, 0) and (−3, 0)
Center is midpoint: (3/2, 0)
Minor axis length: 2ed/√(1 − e²) = 6√3
Sketch the ellipse with these parameters

Don't confuse: In polar form, the focus is always at the origin—this is different from the centered forms in Chapter 7.

Polar Form of Complex Numbers

11.7 Polar Form of Complex Numbers

🧭 Overview

🧠 One-sentence thesis

The polar form of complex numbers—expressed as |z| cis(θ) where |z| is the modulus and θ is an argument—transforms multiplication, division, and especially powers and roots into simple operations on magnitudes and angles, revealing the deep geometric structure of the complex plane.

📌 Key points (3–5)

Modulus and argument: Every nonzero complex number z = a + bi corresponds to a point (a, b) in the complex plane with modulus |z| = √(a² + b²) (distance from origin) and argument θ (angle from positive real axis).
Polar form definition: z = |z| cis(θ) = |z|[cos(θ) + i sin(θ)], where cis(θ) abbreviates cos(θ) + i sin(θ); there are infinitely many polar forms because θ can differ by integer multiples of 2π.
Multiplication and division become addition and subtraction: The product rule zw = |z||w| cis(α + β) and quotient rule z/w = (|z|/|w|) cis(α − β) turn complex arithmetic into operations on moduli and arguments.
DeMoivre's Theorem for powers: z^n = |z|^n cis(nθ) makes computing high powers straightforward; finding nth roots reverses this by taking the nth root of the modulus and dividing the argument by n, yielding n equally spaced roots around a circle.
Common confusion—principal argument vs all arguments: arg(z) is the set of all angles θ representing z (differing by 2πk), while Arg(z) is the unique principal argument in (−π, π]; for z = 0, arg(0) = (−∞, ∞) and Arg(0) is undefined.

📐 Modulus and argument definitions

📏 Modulus |z|

Modulus of z: If z = a + bi and (r, θ) is a polar representation of (a, b) with r ≥ 0, then |z| = r.

The modulus is the distance from z to 0 in the complex plane.
Formula: |z| = √(Re(z)² + Im(z)²) = √(a² + b²).
Always |z| ≥ 0, and |z| = 0 if and only if z = 0.
The notation |z| is consistent with absolute value for real numbers (when b = 0, |z| = |a|).
Why r ≥ 0 matters: Even though polar coordinates allow negative r, the definition requires r ≥ 0 to make |z| well-defined (a unique nonnegative number).

Example: For z = √3 − i, we have |z| = √((√3)² + (−1)²) = √4 = 2.

🧭 Argument arg(z) and principal argument Arg(z)

Argument of z: Any angle θ such that (|z|, θ) is a polar representation of z; the set of all such angles is denoted arg(z).

Principal argument Arg(z): If z ≠ 0, the unique angle θ ∈ (−π, π] such that (|z|, θ) represents z.

For z ≠ 0, all arguments are coterminal: arg(z) = {Arg(z) + 2πk | k is an integer}.
If Re(z) ≠ 0, then tan(θ) = Im(z)/Re(z) for any θ ∈ arg(z); use the quadrant of z to determine the correct angle.
Special cases:
- If z lies on the positive imaginary axis (Re(z) = 0, Im(z) > 0): arg(z) = {π/2 + 2πk}.
- If z lies on the negative imaginary axis (Re(z) = 0, Im(z) < 0): arg(z) = {−π/2 + 2πk}.
- If z = 0: arg(0) = (−∞, ∞) and Arg(0) is undefined.
Don't confuse: arg(z) is a set of angles (infinitely many), while Arg(z) is a single number (when z ≠ 0).

Example: For z = √3 − i (Quadrant IV), tan(θ) = −1/√3 = −√3/3, so θ = −π/6 + 2πk. Thus arg(z) = {−π/6 + 2πk | k integer} and Arg(z) = −π/6.

🔄 Converting between rectangular and polar forms

🔄 Polar form definition

Polar form of z: If θ ∈ arg(z), then z = |z| cis(θ) = |z|[cos(θ) + i sin(θ)].

The abbreviation cis(θ) stands for cos(θ) + i sin(θ).
Since a = r cos(θ) and b = r sin(θ) from polar-to-rectangular conversion, z = a + bi = r cos(θ) + r sin(θ) i = r[cos(θ) + i sin(θ)].
Every complex number has infinitely many polar forms (one for each choice of argument).

➡️ Rectangular to polar

Find modulus: |z| = √(a² + b²).
Find an argument:
- If a ≠ 0: compute tan(θ) = b/a and adjust for the correct quadrant.
- If a = 0: θ = π/2 (if b > 0) or θ = −π/2 (if b < 0).
Write z = |z| cis(θ).

Example: For z = −2 + 4i, |z| = √(4 + 16) = 2√5. Since z is in Quadrant II and tan(θ) = 4/(−2) = −2, we have θ = π − arctan(2). Thus z = 2√5 cis(π − arctan(2)).

⬅️ Polar to rectangular

Expand cis(θ) = cos(θ) + i sin(θ).
Multiply by |z|: z = |z| cos(θ) + i |z| sin(θ).
Simplify using known values or identities.

Example: z = 4 cis(2π/3) = 4[cos(2π/3) + i sin(2π/3)] = 4[−1/2 + i√3/2] = −2 + 2i√3.

⚙️ Properties of modulus

⚙️ Key modulus properties

Property	Statement	Meaning
Distance	\|z\| is the distance from z to 0	Geometric interpretation
Nonnegativity	\|z\| ≥ 0; \|z\| = 0 iff z = 0	Modulus is always nonnegative
Formula	\|z\| = √(Re(z)² + Im(z)²)	Pythagorean distance
Product rule	\|zw\| = \|z\|\|w\|	Modulus of product = product of moduli
Power rule	\|z^n\| = \|z\|^n	Modulus of power = power of modulus
Quotient rule	\|z/w\| = \|z\|/\|w\| (w ≠ 0)	Modulus of quotient = quotient of moduli

The product rule is proven by direct computation: if z = a + bi and w = c + di, then zw = (ac − bd) + (ad + bc)i, and expanding |zw|² = (ac − bd)² + (ad + bc)² simplifies to (a² + b²)(c² + d²) = |z|²|w|².
The power rule follows from the product rule by induction.
The quotient rule reduces to showing |1/w| = 1/|w| when w ≠ 0.

Example: If z = 2 cis(π/6) and w = 3 cis(π/3), then |zw| = |z||w| = 2 · 3 = 6, which matches |zw| computed directly.

🔢 Operations in polar form (Theorem 11.16)

✖️ Product rule

Formula: If z = |z| cis(α) and w = |w| cis(β), then zw = |z||w| cis(α + β).

Multiply moduli, add arguments.
Proof uses the sum identities: [cos(α) + i sin(α)][cos(β) + i sin(β)] expands and simplifies to cos(α + β) + i sin(α + β) = cis(α + β).
Geometric interpretation: Multiplying by w magnifies the distance from 0 by |w| and rotates counterclockwise by β radians (if β > 0).

Example: z = 4 cis(π/6) and w = 2 cis(2π/3) give zw = 8 cis(π/6 + 2π/3) = 8 cis(5π/6).

🔺 Power rule (DeMoivre's Theorem)

Formula: z^n = |z|^n cis(nθ) for every natural number n.

Raise modulus to the nth power, multiply argument by n.
Proven by induction using the product rule: z^(k+1) = z^k · z = |z|^k cis(kθ) · |z| cis(θ) = |z|^(k+1) cis((k+1)θ).
Makes computing high powers trivial compared to binomial expansion.

Example: w = 2 cis(2π/3), so w⁵ = 2⁵ cis(5 · 2π/3) = 32 cis(10π/3) = 32 cis(4π/3) (since 10π/3 is coterminal with 4π/3).

➗ Quotient rule

Formula: z/w = (|z|/|w|) cis(α − β), provided w ≠ 0.

Divide moduli, subtract arguments.
Proof multiplies numerator and denominator by the conjugate cos(β) − i sin(β), using difference identities.
Geometric interpretation: Dividing by w shrinks the distance by |w| and rotates clockwise by β radians (if β > 0).

Example: z/w = (4 cis(π/6))/(2 cis(2π/3)) = 2 cis(π/6 − 2π/3) = 2 cis(−π/2).

Don't confuse: These rules apply to polar forms; for rectangular forms, traditional arithmetic (multiply/divide with conjugates) may be simpler unless powers are involved.

🌱 nth roots of complex numbers

🌱 Definition of nth root

nth root of z: A complex number w such that w^n = z for some natural number n.

Unlike real nth roots, complex numbers have exactly n distinct nth roots (when z ≠ 0).
Example: The cube roots of 8 are 2, −1 + i√3, and −1 − i√3 (three distinct values).

🌱 Formula for nth roots (Theorem 11.17)

Formula: If z = r cis(θ) with z ≠ 0, the n distinct nth roots are:

w_k = ⁿ√r cis(θ/n + (2π/n)k) for k = 0, 1, 2, ..., n−1.

Take the nth root of the modulus, divide the argument by n, then add 2π/n for each successive root.
The roots are equally spaced around a circle of radius ⁿ√r, separated by angles of 2π/n radians.
Proof: (w_k)^n = (ⁿ√r)^n cis(n · [θ/n + (2π/n)k]) = r cis(θ + 2πk) = r cis(θ) = z.
The n roots are distinct because the arguments differ by multiples of 2π/n, which is less than 2π.

Example: To find the cube roots of 8 = 8 cis(0):

w₀ = ³√8 cis(0/3 + 0) = 2 cis(0) = 2
w₁ = 2 cis(2π/3) = −1 + i√3
w₂ = 2 cis(4π/3) = −1 − i√3

🎯 Geometric visualization

The nth roots lie on a circle of radius ⁿ√r centered at the origin.
They are evenly distributed, forming a regular n-gon.
Starting from w₀ = ⁿ√r cis(θ/n), each successive root is rotated by 2π/n radians counterclockwise.

Example: The four fourth roots of −16 = 16 cis(π) are w₀ = 2 cis(π/4), w₁ = 2 cis(3π/4), w₂ = 2 cis(5π/4), w₃ = 2 cis(7π/4), forming a square.

🌟 nth roots of unity

nth roots of unity: The n complex nth roots of z = 1.

Since 1 = 1 cis(0), the nth roots of unity are w_k = cis(2πk/n) for k = 0, 1, ..., n−1.
They lie on the unit circle, equally spaced by 2π/n radians.
w₀ = 1 is always one of the roots.
These roots have special algebraic properties: their product is an nth root of unity, and each has a multiplicative inverse that is also an nth root of unity.

Example: The five fifth roots of 1 are cis(0) = 1, cis(2π/5), cis(4π/5), cis(6π/5), cis(8π/5).

🎨 Geometric interpretation of operations

🎨 Multiplication as magnification and rotation

Multiplying z by w = |w| cis(β) has two effects:
1. Magnify the distance from 0 to z by the factor |w|.
2. Rotate z counterclockwise by β radians (if β > 0).
This explains why complex multiplication is not commutative in appearance but is algebraically: the order of magnification and rotation doesn't matter for the final result.

Example: z = 4 cis(π/6) multiplied by w = 2 cis(2π/3):

First magnify: distance becomes 4 · 2 = 8.
Then rotate: angle becomes π/6 + 2π/3 = 5π/6.
Result: zw = 8 cis(5π/6).

🎨 Division as shrinking and rotation

Dividing z by w = |w| cis(β) has two effects:
1. Shrink the distance from 0 to z by the factor |w| (divide by |w|).
2. Rotate z clockwise by β radians (subtract β from the argument).

Example: z/w = (4 cis(π/6))/(2 cis(2π/3)):

Shrink: distance becomes 4/2 = 2.
Rotate clockwise: angle becomes π/6 − 2π/3 = −π/2.
Result: z/w = 2 cis(−π/2).

🎨 Powers as repeated rotation and magnification

z^n = |z|^n cis(nθ) means:
- The distance from 0 is magnified by |z| a total of n times (raised to the nth power).
- The angle is rotated by θ a total of n times (multiplied by n).
If |z| > 1, successive powers spiral outward; if |z| < 1, they spiral inward; if |z| = 1, they stay on the unit circle.

Don't confuse: Rotating by nθ is not the same as rotating n times by θ in separate steps—but the result is the same due to angle addition.

Vectors

11.8 Vectors

🧭 Overview

🧠 One-sentence thesis

Vectors combine magnitude (size) and direction into a single mathematical object, enabling us to model and solve problems involving forces, velocities, and other quantities where both "how much" and "which way" matter.

📌 Key points (3–5)

What vectors represent: A vector captures both a quantitative answer (magnitude) and a direction, unlike a single real number.
Component form vs geometric form: Vectors can be written as ordered pairs ⟨v₁, v₂⟩ (component form) or as directed line segments with initial and terminal points.
Operations preserve structure: Vector addition and scalar multiplication follow algebraic rules similar to real numbers (commutative, associative, distributive properties).
Common confusion—magnitude vs direction: The magnitude ‖v‖ is a scalar (length), while the direction v̂ is a unit vector; together they reconstruct the original vector: v = ‖v‖ v̂.
Why it matters: Vectors model real-world phenomena like airplane velocity in wind, forces on suspended objects, and any situation requiring both size and direction.

🧩 What vectors are and how to represent them

🧩 Definition and geometric picture

A vector is a mathematical object defined by two characteristics: magnitude (length) and direction.

Geometrically, a vector is a directed line segment with an arrow at one endpoint.
The initial point (or tail) is where the vector starts; the terminal point (or head) is where it ends.
Notation: v⃗ denotes "the vector v"; if the initial point is P and terminal point is Q, we write v⃗ = PQ⃗.
Example: A vector from P(1, 2) to Q(4, 6) moves 3 units right and 4 units up.

📐 Component form

The component form of a vector v⃗ with initial point P(x₀, y₀) and terminal point Q(x₁, y₁) is v⃗ = ⟨x₁ − x₀, y₁ − y₀⟩.

The first number (x-component) tells you the horizontal change; the second (y-component) tells you the vertical change.
Two vectors are equal if and only if their corresponding components are equal.
Key insight: Any directed line segment with the same length and direction represents the same vector, regardless of where it starts.
Example: v⃗ = ⟨3, 4⟩ can start at any point; if it starts at P′(−2, 3), its terminal point is Q′(1, 7).

🔄 Standard position

A vector is in standard position when its initial point is at the origin (0, 0).
If v⃗ = ⟨v₁, v₂⟩ is in standard position, its terminal point is (v₁, v₂).
Standard position simplifies converting between rectangular and polar representations.

➕ Vector addition and subtraction

➕ Geometric addition (resultant vector)

To add v⃗ + w⃗ geometrically: plot v⃗, then plot w⃗ starting at the terminal point of v⃗.
The sum v⃗ + w⃗ starts at the initial point of v⃗ and ends at the terminal point of w⃗.
Think of v⃗ + w⃗ as the "net result" of moving along v⃗ then moving along w⃗.
Example: An airplane's velocity plus wind velocity gives the true velocity (resultant).

🧮 Component-wise addition

If v⃗ = ⟨v₁, v₂⟩ and w⃗ = ⟨w₁, w₂⟩, then v⃗ + w⃗ = ⟨v₁ + w₁, v₂ + w₂⟩.

Add corresponding components separately.
This algebraic definition matches the geometric action.
Example: ⟨3, 4⟩ + ⟨1, −2⟩ = ⟨4, 2⟩.

🔁 Properties of vector addition

Property	Statement	Meaning
Commutative	v⃗ + w⃗ = w⃗ + v⃗	Order doesn't matter
Associative	(u⃗ + v⃗) + w⃗ = u⃗ + (v⃗ + w⃗)	Grouping doesn't matter
Identity	v⃗ + 0⃗ = v⃗	Zero vector 0⃗ = ⟨0, 0⟩ acts like zero
Inverse	v⃗ + (−v⃗) = 0⃗	Every vector has an opposite

The zero vector 0⃗ = ⟨0, 0⟩ represents a point (no movement); its direction is undefined.
The additive inverse −v⃗ = ⟨−v₁, −v₂⟩ has the same length as v⃗ but opposite direction.

➖ Vector subtraction

v⃗ − w⃗ = v⃗ + (−w⃗) = ⟨v₁ − w₁, v₂ − w₂⟩.

Subtraction also works component-wise.
Geometrically, v⃗ − w⃗ can be visualized as the vector from the terminal point of w⃗ to the terminal point of v⃗ (when both are drawn from the same initial point).
In a parallelogram formed by v⃗ and w⃗, the diagonals are v⃗ + w⃗ and v⃗ − w⃗.

✖️ Scalar multiplication

✖️ Definition and effect

If k is a real number (scalar) and v⃗ = ⟨v₁, v₂⟩, then kv⃗ = ⟨kv₁, kv₂⟩.

Multiplying by a scalar scales each component.
Geometric effect:
- If k > 0: the vector stretches (or shrinks) but keeps the same direction.
- If k < 0: the vector stretches (or shrinks) and reverses direction.
Example: 2v⃗ doubles the length; −2v⃗ doubles the length and flips direction.

🔧 Properties of scalar multiplication

Property	Statement
Associative	(kr)v⃗ = k(rv⃗)
Identity	1v⃗ = v⃗
Additive inverse	−v⃗ = (−1)v⃗
Distributive (scalar addition)	(k + r)v⃗ = kv⃗ + rv⃗
Distributive (vector addition)	k(v⃗ + w⃗) = kv⃗ + kw⃗
Zero product	kv⃗ = 0⃗ if and only if k = 0 or v⃗ = 0⃗

These properties allow algebraic manipulation of vectors just like variables.
Example: Solving 5v⃗ − 2(v⃗ + ⟨1, −2⟩) = 0⃗ for v⃗ uses distributive and inverse properties step-by-step.

📏 Magnitude and direction

📏 Magnitude (length)

The magnitude of v⃗ = ⟨v₁, v₂⟩ is ‖v⃗‖ = √(v₁² + v₂²).

Magnitude is always non-negative: ‖v⃗‖ ≥ 0.
‖v⃗‖ = 0 if and only if v⃗ = 0⃗.
For scalar k: ‖kv⃗‖ = |k| ‖v⃗‖.
Example: ‖⟨3, 4⟩‖ = √(9 + 16) = 5.

🧭 Direction (unit vector)

If v⃗ ≠ 0⃗, the direction of v⃗ is the unit vector v̂ = ⟨cos(θ), sin(θ)⟩, where θ is the angle the vector makes with the positive x-axis (in standard position).

The direction v̂ is a unit vector (length 1) pointing in the same direction as v⃗.
Key relationship: v⃗ = ‖v⃗‖ v̂ (any vector equals its magnitude times its direction).
Solving for direction: v̂ = (1/‖v⃗‖) v⃗ (multiply the vector by the reciprocal of its magnitude).
Example: For v⃗ = ⟨3, 4⟩, ‖v⃗‖ = 5, so v̂ = (1/5)⟨3, 4⟩ = ⟨3/5, 4/5⟩.

🔄 Converting between forms

Polar to component: If ‖v⃗‖ = r and direction angle is θ, then v⃗ = ⟨r cos(θ), r sin(θ)⟩.
Component to polar: If v⃗ = ⟨v₁, v₂⟩, then ‖v⃗‖ = √(v₁² + v₂²) and tan(θ) = v₂/v₁ (adjust θ for the correct quadrant).
Resolving a vector into components: taking magnitude and direction information and finding the component form.
Example: ‖v⃗‖ = 5, θ = 120° → v⃗ = 5⟨cos(120°), sin(120°)⟩ = ⟨−5/2, 5√3/2⟩.

🎯 Unit vectors

A unit vector is any vector with magnitude 1.

If v⃗ is a unit vector, then v⃗ = v̂ (it is its own direction).
Normalizing a vector: multiplying v⃗ by 1/‖v⃗‖ to produce a unit vector in the same direction.
Geometrically, normalizing shrinks (or stretches) the terminal point back to the unit circle.
Don't confuse: ‖v̂‖ = 1 always (the direction is a unit vector), but ‖v⃗‖ can be any non-negative number.

📐 Principal unit vectors

The principal unit vectors are ı̂ = ⟨1, 0⟩ (positive x-direction) and ȷ̂ = ⟨0, 1⟩ (positive y-direction).

Any vector v⃗ = ⟨v₁, v₂⟩ can be written as v⃗ = v₁ı̂ + v₂ȷ̂ (principal vector decomposition).
This expresses v⃗ as a sum of horizontal and vertical components.
Example: ⟨3, 4⟩ = 3ı̂ + 4ȷ̂.

🌍 Applications: velocity and forces

✈️ Velocity problems (resultant motion)

Airspeed vs true speed: An airplane's velocity relative to the air plus the wind's velocity gives the true velocity (ground speed and heading).
Method: represent each velocity as a vector (using magnitude and direction), add them component-wise, then find the magnitude and direction of the resultant.
Example: Plane at 175 mph bearing N40°E, wind 35 mph bearing S60°E → resolve each into components, add, compute magnitude (true speed ≈ 184 mph) and direction (true bearing ≈ N51°E).

⚖️ Force problems (static equilibrium)

Static equilibrium: An object at rest has forces that sum to the zero vector: F⃗₁ + F⃗₂ + … = 0⃗.
Example: A 50-pound speaker suspended by two cables at 60° and 30° angles with the ceiling.
- Weight w⃗ = ⟨0, −50⟩ (downward).
- Tensions T⃗₁ and T⃗₂ act upward at given angles.
- Resolve each tension into components using magnitude and direction.
- Set w⃗ + T⃗₁ + T⃗₂ = 0⃗ and equate components to get a system of equations.
- Solve for the unknown magnitudes ‖T⃗₁‖ and ‖T⃗₂‖.
Don't confuse: the magnitude of a force (e.g., tension in pounds) is a scalar, but the force itself is a vector (magnitude + direction).

🔍 Common strategy

Draw a diagram showing all vectors.
Resolve each vector into component form using magnitude and direction.
Add (or set equal to zero) the vectors component-wise.
Solve the resulting system of equations for unknowns.
Interpret the result (magnitude = speed or force; direction = bearing or angle).

The Dot Product and Projection

11.9 The Dot Product and Projection

🧭 Overview

🧠 One-sentence thesis

The dot product provides a scalar measure of how much two vectors align, enabling calculations of angles between vectors, detection of perpendicularity, and decomposition of one vector into components parallel and orthogonal to another.

📌 Key points (3–5)

What the dot product produces: takes two vectors and produces a scalar (not another vector), calculated by multiplying corresponding components and summing them.
Geometric meaning: the dot product equals the product of the vectors' magnitudes times the cosine of the angle between them.
Orthogonality detection: two nonzero vectors are perpendicular (orthogonal) if and only if their dot product equals zero.
Common confusion: the dot product is not the same as scalar multiplication—scalar multiplication produces a vector, while the dot product produces a number.
Projection application: the dot product enables finding how much of one vector lies in the direction of another (orthogonal projection), which is essential for calculating work done by forces.

🔢 Defining the dot product

🔢 Component formula

Dot product: If vectors v and w have component forms v = ⟨v₁, v₂⟩ and w = ⟨w₁, w₂⟩, then v · w = v₁w₁ + v₂w₂.

Multiply the first components together, multiply the second components together, then add the results.
Example: If v = ⟨3, 4⟩ and w = ⟨1, −2⟩, then v · w = (3)(1) + (4)(−2) = 3 − 8 = −5.
The result is always a single number (scalar), never a vector.
Also called the scalar product because it produces a scalar.

📐 Algebraic properties

The dot product behaves similarly to ordinary multiplication of real numbers:

Property	Formula	What it means
Commutative	v · w = w · v	Order doesn't matter
Distributive	u · (v + w) = u · v + u · w	Distributes over addition
Scalar property	(kv) · w = k(v · w) = v · (kw)	Scalars can be factored out
Relation to magnitude	v · v = ‖v‖²	Dotting a vector with itself gives its squared magnitude

These properties follow from the definition and basic arithmetic of real numbers.
Example proof of commutativity: v · w = v₁w₁ + v₂w₂ = w₁v₁ + w₂v₂ = w · v (using commutativity of real number multiplication).

📐 Geometric interpretation

📐 The angle formula

Geometric interpretation: If v and w are nonzero vectors, then v · w = ‖v‖‖w‖ cos(θ), where θ is the angle between v and w.

The angle θ is measured between the rays containing the vectors when drawn from the same initial point.
By definition, 0 ≤ θ ≤ π (0° to 180°).
This connects the algebraic definition (component multiplication) to geometry (angles and magnitudes).

🔍 Finding angles between vectors

To find the angle θ between two nonzero vectors:

θ = arccos(v · w / (‖v‖‖w‖)) = arccos(v̂ · ŵ)

First compute the dot product v · w using components.
Compute the magnitudes ‖v‖ and ‖w‖.
Divide the dot product by the product of magnitudes.
Take the arccosine of the result.
Alternative: normalize both vectors first (find unit vectors v̂ and ŵ), then take the arccosine of their dot product.

Example: For v = ⟨3, −3√3⟩ and w = ⟨−√3, 1⟩:

v · w = −3√3 − 3√3 = −6√3
‖v‖ = 6, ‖w‖ = 2
θ = arccos(−6√3/12) = arccos(−√3/2) = 5π/6 (or 150°)

⊥ Detecting perpendicular vectors

Orthogonality theorem: Nonzero vectors v and w are orthogonal (perpendicular) if and only if v · w = 0.

When vectors are orthogonal, we write v ⊥ w.
The angle between orthogonal vectors is π/2 radians (90°).
Since cos(π/2) = 0, the geometric formula v · w = ‖v‖‖w‖ cos(θ) gives v · w = 0.
Example: v = ⟨2, 2⟩ and w = ⟨5, −5⟩ have v · w = 10 − 10 = 0, so they are orthogonal.
Don't confuse: there is no "zero product property" for dot products—neither vector needs to be the zero vector for their dot product to be zero.

📏 Application to perpendicular lines

The dot product provides an alternative proof that perpendicular lines have slopes whose product is −1:

For line L₁: y = m₁x + b₁, construct direction vector v₁ = ⟨1, m₁⟩.
For line L₂: y = m₂x + b₂, construct direction vector v₂ = ⟨1, m₂⟩.
The lines are perpendicular if and only if v₁ ⊥ v₂.
v₁ · v₂ = ⟨1, m₁⟩ · ⟨1, m₂⟩ = 1 + m₁m₂.
Setting this equal to zero: 1 + m₁m₂ = 0, which gives m₁m₂ = −1.

🎯 Orthogonal projection

🎯 What projection means

Orthogonal projection: The orthogonal projection of vector v onto vector w, denoted proj_w(v), is the vector p that lies along w and represents how much of v points in the direction of w.

Geometrically: drop a perpendicular from the terminal point of v to the line containing w.
The projection p points in the same direction as w (or opposite if the angle is obtuse).
The projection has direction ŵ (the unit vector in the direction of w).
The magnitude of the projection is |v · ŵ|, called the scalar projection.

🧮 Computing projections

Three equivalent formulas for computing proj_w(v):

proj_w(v) = (v · ŵ)ŵ (using the unit vector)
proj_w(v) = ((v · w)/‖w‖²)w (avoiding explicit normalization)
proj_w(v) = ((v · w)/(w · w))w (using dot product in denominator)

Formula 2 or 3 is often easier in practice because you don't need to compute ŵ separately.
Example: For v = ⟨1, 8⟩ and w = ⟨−1, 2⟩:
- v · w = −1 + 16 = 15
- w · w = 1 + 4 = 5
- proj_w(v) = (15/5)⟨−1, 2⟩ = 3⟨−1, 2⟩ = ⟨−3, 6⟩

🔀 Vector decomposition

Generalized Decomposition Theorem: Any vector v can be uniquely written as v = p + q, where p is parallel to w (i.e., p = kw for some scalar k) and q is orthogonal to w (i.e., q · w = 0).

The parallel component is p = proj_w(v).
The orthogonal component is q = v − proj_w(v).
These components are sometimes called "the vector component of v parallel to w" and "the vector component of v orthogonal to w."
To verify: compute q · w and confirm it equals zero.
Example continuation: If v = ⟨1, 8⟩ and p = ⟨−3, 6⟩, then q = ⟨1, 8⟩ − ⟨−3, 6⟩ = ⟨4, 2⟩, and q · w = ⟨4, 2⟩ · ⟨−1, 2⟩ = −4 + 4 = 0 ✓

⚙️ Application to work

⚙️ Work as a dot product

Work theorem: If a constant force F is applied to move an object along vector PQ, the work W done is W = F · PQ = ‖F‖‖PQ‖ cos(θ), where θ is the angle between the force and the motion.

In physics, work = force × distance when force is in the direction of motion.
When force is at an angle, only the component of force in the direction of motion does work.
The dot product F · P̂Q gives exactly this component.
Since distance is ‖PQ‖, we get W = (F · P̂Q)‖PQ‖ = F · PQ.

🧰 Practical calculation

Two approaches to computing work:

Vector approach: Find the force vector F and displacement vector PQ, then compute F · PQ directly.
Magnitude-angle approach: Use W = ‖F‖‖PQ‖ cos(θ) with known magnitudes and angle.

Example: Taylor pulls a wagon with 10 pounds of force at a 30° angle for 50 feet:

Vector approach: F = 10⟨cos(30°), sin(30°)⟩ = ⟨5√3, 5⟩ and PQ = ⟨50, 0⟩, so W = ⟨5√3, 5⟩ · ⟨50, 0⟩ = 250√3 foot-pounds.
Magnitude-angle approach: W = (10 pounds)(50 feet) cos(30°) = 500(√3/2) = 250√3 foot-pounds.
Both methods give the same answer; choose based on what information is readily available.

Parametric Equations

11.10 Parametric Equations

🧭 Overview

🧠 One-sentence thesis

Parametric equations allow us to describe curves that fail both the Vertical and Horizontal Line Tests by expressing both x and y as separate functions of a third variable (the parameter t), enabling us to trace paths with orientation and model motion over time.

📌 Key points (3–5)

What parametric equations are: A system {x = f(t), y = g(t)} where both coordinates depend on a parameter t, allowing us to describe curves that aren't functions of x or y alone.
Orientation matters: The parameter t determines the direction of motion along the curve; different parametrizations can give the same curve different orientations.
Eliminating the parameter: We can often convert parametric equations back to a single equation in x and y using substitution or trigonometric identities, though this loses orientation information.
Common confusion: The curve itself (a set of points) has no orientation, but the parametrization imposes one; also, the same curve can have infinitely many different parametrizations.
Standard forms exist: Common curves (lines, circles, ellipses, parabolas) have standard parametric formulas that can be adjusted for specific situations.

📐 Core motivation and definition

🐛 The bug analogy

A bug crawling across a table traces out a curve C. Since the bug can be in only one place P(x, y) at any given time t, we can define the x-coordinate of P as a function of t and the y-coordinate of P as a (usually different) function of t.

The curve C might fail both the Vertical Line Test (so y is not a function of x) and the Horizontal Line Test (so x is not a function of y).
But at each moment in time t, there is exactly one position (x, y).
This leads to the idea of expressing both coordinates as functions of time (or another parameter).

📝 Definition of parametric equations

The system of equations {x = f(t), y = g(t)} is called a system of parametric equations or a parametrization of the curve C.

The independent variable t is called a parameter.
The parametrization endows the curve with an orientation (direction of motion as t increases).
The curve itself is just a set of points and has no inherent orientation; the parametrization determines it.

🔄 Familiar example: circular motion

The Unit Circle is parametrized by {x = cos(t), y = sin(t)} with counter-clockwise orientation.
More generally, {x = r cos(ωt), y = r sin(ωt)} traces a circle of radius r centered at the origin.
If ω > 0, the orientation is counter-clockwise; if ω < 0, it's clockwise.
The angular frequency ω determines how fast the object moves around the circle.

🎨 Sketching parametric curves

📊 Basic procedure

Choose friendly values of t: Start with the bounds on t and pick values that are easy to compute.
Plot points: Calculate (x(t), y(t)) for each chosen t value.
Draw arrows: Indicate the direction of motion as t increases.
Connect smoothly: Join the points in a pleasing fashion following the orientation.

🔍 Analyzing behavior

Before plotting, examine the individual functions x(t) and y(t):

Determine ranges: Find the minimum and maximum values of x and y to identify the region where the curve lives.
Check increasing/decreasing: Determine where x(t) and y(t) are increasing or decreasing to understand the direction of motion.
Examine endpoints and limits: Check behavior at the boundaries of the t-interval, especially if t approaches infinity or a point where functions are undefined.

Example: For {x = t³, y = 2t²} on [-1, 1]:

As t runs from -1 to 0, x = t³ is increasing (moving right) and y = 2t² is decreasing (moving down).
As t runs from 0 to 1, x = t³ is still increasing (moving right) but y = 2t² is now increasing (moving up).
This tells us the curve starts at (-1, 2), moves right and down to (0, 0), then continues right and up to (1, 2).

⚠️ Don't confuse: curve vs. parametrization

The curve is the set of points traced out—it has no orientation.
The parametrization determines which direction we travel along the curve.
Different parametrizations can trace the same curve with different orientations or speeds.

🔄 Eliminating the parameter

🧮 Algebraic method (for algebraic functions)

Use substitution or elimination techniques:

Solve one equation for t and substitute into the other.
Example: From {x = t², y = 2t - 1}, solve y = 2t - 1 for t to get t = (y + 1)/2, then substitute: x = ((y + 1)/2)² = (y + 1)²/4.

🔺 Trigonometric method (for trig functions)

Use trigonometric identities to relate the functions:

Solve for the trig functions and apply an identity like cos²(t) + sin²(t) = 1.
Example: From {x = 1 + 3cos(t), y = 2sin(t)}, get cos(t) = (x - 1)/3 and sin(t) = y/2.
Substitute into the Pythagorean Identity: ((x - 1)/3)² + (y/2)² = 1, which simplifies to (x - 1)²/9 + y²/4 = 1 (an ellipse).

⚠️ What is lost

Orientation: The equation in x and y has no direction information.
Portion of curve: The parametric equations might trace only part of the curve described by the x-y equation.
Timing: We lose information about how fast the curve is traced.

Example: {x = sin(t), y = csc(t)} for 0 < t < π traces part of y = 1/x twice as t varies, but the equation y = 1/x doesn't tell us this.

📋 Standard parametrizations

📏 Lines and parabolas

Curve type	Parametrization	Notes
y = f(x) on interval I	{x = t, y = f(t)}, t in I	Simplest: let x = t
x = g(y) on interval I	{x = g(t), y = t}, t in I	Simplest: let y = t
Line segment from (x₀, y₀) to (x₁, y₁)	{x = x₀ + (x₁ - x₀)t, y = y₀ + (y₁ - y₀)t}, 0 ≤ t ≤ 1	"Starting point + (displacement)t"

⭕ Circles and ellipses

Curve	Parametrization	Notes
Circle: (x - h)²/a² + (y - k)²/b² = 1 (where a = b)	{x = h + a cos(t), y = k + b sin(t)}, 0 ≤ t < 2π	Counter-clockwise orientation
Ellipse: (x - h)²/a² + (y - k)²/b² = 1	{x = h + a cos(t), y = k + b sin(t)}, 0 ≤ t < 2π	Counter-clockwise orientation

These come from the Pythagorean Identity cos²(t) + sin²(t) = 1.
Think: cos(t) = (x - h)/a and sin(t) = (y - k)/b.

🔧 Adjusting parametrizations

🔄 Reversing orientation

Rule: Replace every occurrence of t with -t (including in the bounds).

Example: To reverse {x = t, y = t²} for -1 ≤ t ≤ 2:

Replace t with -t: {x = -t, y = (-t)² = t²} for -1 ≤ -t ≤ 2.
Simplify bounds: -2 ≤ t ≤ 1.
Result: {x = -t, y = t²} for -2 ≤ t ≤ 1 traces the same curve in the opposite direction.

⏰ Shifting the parameter (time delay)

Rule: Replace every occurrence of t with (t - c) to introduce a "time delay" of c units.

This shifts the start of the parameter ahead by c units.
Useful when connecting multiple parametric paths or when you want t to start at 0.

Example: If {x = 3t, y = 4t} for 0 ≤ t ≤ 1 ends at t = 1, and you want the next segment to start at t = 1:

Take the second segment {x = 3 + 2t, y = 4 - 4t} for 0 ≤ t ≤ 1.
Replace t with (t - 1): {x = 3 + 2(t - 1), y = 4 - 4(t - 1)} for 0 ≤ t - 1 ≤ 1.
Simplify: {x = 1 + 2t, y = 8 - 4t} for 1 ≤ t ≤ 2.

⚠️ Don't confuse: shift vs. reverse

Reversing changes the direction of travel along the curve.
Shifting changes when (at what t-value) you reach each point, but not the direction.

🌀 Special curve: the cycloid

🎡 Definition and derivation

A cycloid is the curve traced by a point on the rim of a circle of radius r as it rolls along a straight line (the positive x-axis).

Let θ be the angle (in radians) measuring the clockwise rotation of the circle.
The center of the circle moves a distance rθ along the x-axis (arc length = radius × angle).
The center is always at height r above the x-axis, so it's at position (rθ, r).

Starting from the parametrization of clockwise circular motion {x = -r sin(θ), y = -r cos(θ)}:

Shift the x-coordinate right by rθ: x = -r sin(θ) + rθ.
Shift the y-coordinate up by r: y = -r cos(θ) + r.

Result: {x = r(θ - sin(θ)), y = r(1 - cos(θ))} for θ ≥ 0.

📊 Graphing with technology

Set the calculator to Parametric Mode and radian mode.
For one full revolution (one arch), use 0 ≤ t < 2π.
The x-range should be [0, 2πr] and y-range [0, 2r] for one arch.
Example: For r = 3, one arch uses 0 ≤ t < 2π, x in [0, 6π], y in [0, 6].

🎯 Finding parametrizations

📐 Strategy summary

For y = f(x): Let x = t, y = f(t).
For x = g(y): Let y = t, x = g(t).
For line segments: Use "starting point + (displacement)t" for 0 ≤ t ≤ 1.
For circles/ellipses: Use cos(t) and sin(t) with the Pythagorean Identity.
Adjust orientation: Replace t with -t if needed.
Adjust timing: Replace t with (t - c) to shift the start time.

🔍 Checking your answer

Substitute back: Plug the parametric equations into the original x-y equation to verify all points lie on the curve.
Check bounds: Verify that the parameter range produces the correct portion of the curve.
Check orientation: Plot a few points in order to confirm the direction of travel.

Example: To parametrize the left half of the ellipse x²/4 + y²/9 = 1:

Standard parametrization: {x = 2cos(t), y = 3sin(t)} for 0 ≤ t < 2π traces the whole ellipse.
Left half means x ≤ 0, which corresponds to Quadrant II and III angles: π/2 ≤ t ≤ 3π/2.
Result: {x = 2cos(t), y = 3sin(t)} for π/2 ≤ t ≤ 3π/2.

⚠️ Common pitfall: inverses

When parametrizing y = f⁻¹(x), don't try to solve for the inverse explicitly:

Start with x = f(y) (the inverse relationship).
Parametrize by letting y = t, so x = f(t).
Example: For y = f⁻¹(x) where f(x) = x⁵ + 2x + 1, use {x = t⁵ + 2t + 1, y = t}.