Cultivar Development

1

Global Grain Yield Increases

Global Grain Yield Increases

🧭 Overview

🧠 One-sentence thesis

Plant breeding has dramatically increased crop yields over decades, but current annual improvement rates (0.9–1.6%) fall short of the 2.4% needed to meet projected 2050 food demand without expanding farmland.

📌 Key points (3–5)

  • Historic success: Corn yields in the USA increased more than 6-fold from the 1930s to today (from ~1.8 t/ha to ~11.7 t/ha).
  • Current global rates: Annual grain yield increases are 1.6% (maize), 1.0% (rice), 0.9% (wheat), and 1.3% (soybean) from 1961–2021.
  • The gap: A 2.4% annual yield gain is required across crops to meet 2050 demand without additional cropland.
  • Common confusion: Yield potential vs. realized yield—potential is what an adapted cultivar can achieve under ideal conditions; realized yield is what actually happens in the field with real-world limitations.
  • The Grand Challenge: Nine billion people by 2050, rising meat consumption, degraded land, falling water tables, and climate risks demand twofold yield increases (relative to 2008).

🌾 Historic and current yield trends

🌽 Corn's dramatic transformation

  • When hybrids replaced open-pollinated varieties in the 1930s, average U.S. corn yields were approximately 1.8 tonnes per hectare (26.8 bushels per acre).
  • Today, yields have reached approximately 11.7 t/ha (174.6 bu/A).
  • This represents more than a 6-fold increase over roughly 90 years.
  • Hybrids brought uniformity in addition to higher yields.

📈 Global grain yield growth rates (1961–2021)

The excerpt provides observed annual increase rates for four major crops:

CropAnnual yield increase rate
Maize1.6%
Rice1.0%
Wheat0.9%
Soybean1.3%
  • Steady increases are evident across all crops over the 60-year period.
  • However, these rates are insufficient for future needs.

🎯 Record-breaking potential

  • A U.S. farmer (David Hula) achieved a certified yield of 532 bushels per acre (35.78 t/ha) in 2015.
  • This was on a 10-acre field in Virginia using reduced tillage, irrigation, and a specific hybrid planted at high density.
  • This demonstrates that yield potential has not hit a permanent biological ceiling yet.
  • Example: This record yield is roughly 3 times the current U.S. average, showing room for improvement.

🎯 The 2050 challenge

📊 The required rate vs. current reality

  • To meet anticipated 2050 demand without bringing additional land under cultivation, a 2.4% per year yield gain is needed across crops.
  • Starting from the base year 2021, this 2.4% annual improvement must be sustained.
  • Current rates (0.9–1.6%) are well below this target.
  • Don't confuse: The 2.4% is not a one-time increase but a sustained annual rate over nearly three decades.

🌍 Multi-faceted drivers of demand

The Grand Challenge includes several interconnected factors:

  • Population growth: World population estimated at more than nine billion by 2050.
  • Dietary shifts: Increased meat consumption in emerging economies as living standards rise (meat production requires more grain for animal feed).
  • Land constraints: No appreciable change in available cropland globally, and much existing land is being degraded.
  • Water scarcity: Falling water tables globally limit irrigation potential.
  • Climate risk: Climate change increases uncertainty and risk in crop production.

🎯 The bottom line

Crop yields must increase twofold by 2050 to meet projected global demand for food and feed (relative to the base year 2008).

  • This is a doubling requirement, not just incremental improvement.
  • The challenge is compounded by resource constraints and environmental pressures.

🔧 Pathways to greater production

🌱 Two fundamental approaches

The excerpt identifies two main strategies for achieving greater food production:

  1. Expand cultivation: Bring more land into cultivation (though this is not always feasible).
  2. Intensify productivity: Produce more from each unit of land.

🧬 Improving productivity per unit land

Two complementary methods are highlighted:

  • Improve the genetics of the seed: This is the domain of plant breeding and cultivar development.
  • Better production practices: Provide adequate sunlight, water, and soil nutrients; mitigate stress factors.

Agricultural production can be maximized when the crop's yield potential is manifested.

🔍 Understanding the yield gap

Yield gap: The difference between yield potential and current realized yield.

Yield potential is defined as:

  • The yield productivity potential of an adapted cultivar
  • When grown under favorable conditions
  • Without growth limitations from water, nutrients, pests, disease, and other stress factors

Current realized yield is:

  • The actual yield on a specified spatial and temporal scale
  • What farmers actually harvest under real-world conditions

Example: If a cultivar can produce 12 t/ha under ideal conditions but farmers average 8 t/ha, the yield gap is 4 t/ha. Closing this gap through better practices and genetics is a key opportunity.

🧬 Genetic complexity in plant breeding

🧩 The polygenic nature of key traits

Polygenic: Most key traits of interest (e.g., yield) typically involve many genes in their expression.

Challenges arising from polygenic traits:

  • Small individual effects: Each gene is thought to contribute only a small effect to the overall trait.
  • Environmental noise: Genetic effects can be difficult to measure due to environmental variation masking the signal.
  • Gene-environment interaction: The expression of some genes is influenced by the environment, making effects inconsistent across locations or years.
  • Random recombination: Genes of parents are randomly shuffled when a cross is made, making outcomes unpredictable.

Don't confuse: Polygenic traits are not controlled by a single gene with a large, easily measurable effect; they result from the cumulative action of many genes, each with small contributions.

🔄 The cultivar improvement cycle

Cultivars: The result of plant breeding, the science of applying genetic principles to improve plants for human use.

The general cycle involves:

  1. Creating or assembling useful genetic diversity
  2. Exploiting this variation to achieve targeted breeding goals
  3. Crossing the "best" parents
  4. Producing progeny
  5. Identifying and selecting superior individuals

Plant breeding impacts every individual because it involves economically important traits in plants used for food, animal feed, fiber, fuel, and landscaping.

🍎 Beyond quantity: nutritional quality

🌍 The dual burden of hunger

The excerpt distinguishes two related but distinct problems:

ProblemDefinitionScale
UndernutritionNot having enough foodAffects millions globally
Malnutrition"Hidden hunger"—lacking essential nutrientsAffects over one billion people

Africa is particularly vulnerable, and children are especially hard-hit by both conditions.

💊 Specific nutritional deficiencies

Key facts about malnutrition impacts:

  • Iron deficiency: Affects half of children under age 5 in developing countries; impairs growth, cognitive development, and immune function.
  • Vitamin A deficiency: Affects at least 100 million children; limits growth, weakens immunity, and in acute cases leads to blindness.
  • Stunting: More than one-third of all African children suffer stunting (low height for weight, irreversible after age 2) due to malnutrition and undernutrition.

🔄 Lifelong and intergenerational consequences

  • 3.5 million maternal and child deaths could be prevented annually with improved nutrition.
  • Stunting in early life is associated with lifetime effects: poor cognition and learning, low adult wages, lost productivity, and increased risk of chronic disease.
  • Undernutrition during the critical window from conception to 2 years of age is associated with lower human capital.
  • Intergenerational cycle: A girl who was fed poorly as an infant is likely to have offspring with lower birth weight, perpetuating the problem.

🎯 The dual imperative

Crop improvement must be directed to producing better food as well as more food.

  • Plant breeding must address both yield quantity and nutritional quality.
  • This adds another dimension to the breeding challenge beyond simply increasing tonnes per hectare.
2

Undernutrition and Malnutrition

Undernutrition and Malnutrition

🧭 Overview

🧠 One-sentence thesis

Crop improvement must target both quantity and nutritional quality because over one billion people suffer from malnutrition ("hidden hunger") that causes devastating, multi-generational health and economic consequences, especially in children.

📌 Key points (3–5)

  • Two distinct problems: undernutrition (not enough food) vs. malnutrition (lacking essential nutrients like iron and vitamin A, also called "hidden hunger").
  • Scale and impact: over one billion people affected; 3.5 million maternal and child deaths could be prevented annually with improved nutrition.
  • Critical window: undernutrition from conception to age 2 causes irreversible stunting and lifetime cognitive, economic, and health impairments.
  • Intergenerational cycle: a malnourished girl is likely to have offspring with lower birth weight, perpetuating the problem across generations.
  • Common confusion: malnutrition is not just about total food quantity—it's about missing specific micronutrients even when calories are available.

🌍 The dual challenge: undernutrition and malnutrition

🍽️ Undernutrition vs. malnutrition

Undernutrition: caused by not having enough food.

Malnutrition: lacking essential nutrients; also called "hidden hunger."

  • These are distinct problems that often coexist but require different solutions.
  • Malnutrition affects over one billion people globally, even when total food availability may seem adequate.
  • Africa is particularly vulnerable to both conditions.
  • Don't confuse: someone can consume enough calories but still suffer malnutrition if their diet lacks key vitamins and minerals.

👶 Children are especially hard-hit

  • Among the undernourished, children face the most severe consequences.
  • The excerpt emphasizes that children are disproportionately affected by both undernutrition and malnutrition.

💔 Health and developmental consequences

🩸 Iron deficiency

  • Affects half of children under age 5 in developing countries.
  • Impairs:
    • Growth
    • Cognitive development
    • Immune function
  • Example: A child with iron deficiency may grow more slowly, struggle to learn, and get sick more frequently.

👁️ Vitamin A deficiency

  • Affects at least 100 million children.
  • Consequences:
    • Limits growth
    • Weakens immunity
    • In acute cases, leads to blindness

📏 Stunting

  • More than one-third of all African children suffer from stunting.
  • Definition and characteristics:
    • Low height for weight
    • Irreversible after age 2
    • Caused by malnutrition and undernutrition combined
  • Lifetime effects associated with stunting:
    • Poor cognition and learning
    • Low adult wages
    • Lost productivity
    • Increased risk of chronic disease

⏰ The critical window and long-term impacts

🪟 Conception to 2 years of age

  • This period is described as a "critical window".
  • Undernutrition during this time is associated with:
    • Lower human capital (the productive capacity of individuals)
    • Irreversible developmental damage (e.g., stunting becomes permanent after age 2)
  • Why it matters: interventions must target this narrow timeframe to prevent permanent harm.

🔄 Intergenerational cycle

  • The effects span multiple generations:
    • A girl who was fed poorly as an infant is likely to have offspring with lower birth weight.
    • This creates a cycle where malnutrition perpetuates itself across families.
  • Don't confuse: this is not just about immediate health—it's a self-reinforcing pattern that affects future generations even before they are born.

📊 Scale of preventable harm

ImpactStatistic from excerpt
Preventable deaths3.5 million maternal and child deaths could be prevented annually with improved nutrition
Iron deficiency prevalenceHalf of children under age 5 in developing countries
Vitamin A deficiency prevalenceAt least 100 million children
Stunting in AfricaMore than one-third of all African children

🌾 The breeding imperative

🎯 Better food, not just more food

  • The excerpt concludes: "Crop improvement must be directed to producing better food as well as more food."
  • This statement directly links the malnutrition problem to plant breeding goals.
  • Why both matter:
    • More food addresses undernutrition (total quantity).
    • Better food addresses malnutrition (nutritional quality—vitamins, minerals, etc.).
  • Example: A breeding program might aim to increase iron or vitamin A content in staple crops, not just yield.
3

The Cycle of Cultivar Improvement

The Cycle of Cultivar Improvement

🧭 Overview

🧠 One-sentence thesis

Plant breeders systematically create and exploit genetic variation through repeated cycles of crossing the best parents, producing progeny, and selecting superior offspring to develop improved cultivars that address both yield and nutritional goals.

📌 Key points (3–5)

  • The core cycle: cross the best parents → produce progeny → identify and recover progeny that surpass the parents → use superior progeny as new parents in the next cycle.
  • The breeder's mission: increase the frequency of favorable alleles and decrease unfavorable alleles to maximize genetic gain per unit of time and cost.
  • Key challenges: choosing the best parents, identifying truly superior progeny, environmental noise that reduces heritability, and lack of knowledge about genetic architecture.
  • Common confusion: selection response depends not just on selecting the best individuals, but on heritability (repeatability of the trait) and selection pressure—environmental noise can mask true genetic performance.
  • Modern tools accelerate the cycle: DNA-based technologies, genomics-assisted selection, transformation, gene editing, and automated high-throughput analysis enable faster and more precise cultivar development.

🌱 The fundamental improvement cycle

🔄 How the cycle works

The general philosophy is simple but powerful:

  1. Cross the "best" parents
  2. Produce progeny
  3. Identify and recover progeny that surpass the parents and demonstrate outstanding performance
  4. The superior progeny may become the basis of a new, improved cultivar
  5. Seed volumes (or plant propagules for clonal crops) are produced for distribution
  • This cycle is repeated multiple times to achieve a particular breeding target.
  • Superior progeny may be used as parents in the next cycle, accumulating gains from selection.
  • Example: A breeder crosses two high-yielding parent plants, evaluates hundreds of offspring, selects the top 5% that outperform both parents, and uses those as parents for the next generation.

🎯 Creating and exploiting genetic variation

Cultivar improvement involves the creation or assembly of useful genetic diversity and ways to exploit this variation to achieve targeted breeding goals.

  • The cycle has two phases: creating useful genetic variation (through crossing, transformation, or other methods) and exploiting that variation (through selection and evaluation).
  • With modern DNA-based information, knowledge of genetic architecture and genome function becomes part of the cycle.

🧬 The breeder's mission and genetic principles

🎲 Increasing favorable allele frequency

Plant breeders are on a mission to:

  • Increase the frequency of favorable alleles
  • Decrease the frequency of unfavorable alleles

This mirrors natural selection, but instead of selecting for "fitness" (contributing to the next generation in nature), breeders select for economically important traits and nutritional quality.

📊 Selection and the Breeder's Equation

Top-performing individuals from the base population are selected as parents to produce the next generation—this represents a complete cycle of selection.

Selection response (R) depends on three factors:

  • The total variation in the population (the bell-shaped curves)
  • The heritability (h²) of the trait (its repeatability)
  • The selection pressure (S) imposed

The relationship is represented in the Breeder's Equation:

  • R = h² × S
  • Where S is the difference between the mean of selected parents and the mean of the base population
  • Where R is the difference between the mean of offspring from selected parents and the mean of the base population

Rearranging gives: heritability = selection response / selection pressure

Example: If you select parents that are 10 units above the population average (S = 10), and their offspring average 6 units above the original population (R = 6), then heritability is 0.6 (60% of the difference is heritable).

⚡ Rate of genetic gain

The rate of genetic gain depends on:

  • Heritability of the trait
  • Phenotypic variability in the base population
  • Selection intensity
  • Length of the breeding cycle

To maximize genetic gain, the breeder must rely on a systematic approach to effectively increase favorable allele frequency and ensure improved cultivars are available to farmers on a timely basis.

🚧 Challenges in cultivar improvement

🤔 Major obstacles

ChallengeWhy it matters
Choosing the "best" parentsStarting material determines the genetic potential of all progeny
Identifying truly superior progenyMust distinguish genetic superiority from environmental luck
Environmental noiseReduces heritability and makes it difficult to discern performance differences
Effective screensSome traits lack efficient methods to measure performance
Knowledge gapsLack of understanding about metabolics and genetic architecture underlying traits of interest

🌾 Complexity of economically important traits

The excerpt notes that typically:

  • Many genes are involved in the expression of economically important traits
  • Each gene contributes a small effect
  • Genetic effects can be difficult to measure due to environmental noise
  • The expression of some genes is influenced by the environment
  • Genes of parents are randomly shuffled when a cross is made

Don't confuse: A trait with high heritability is not necessarily controlled by one gene—it may involve many genes but have consistent, repeatable expression across environments.

🛠️ Modern tools and the development process

🧰 Tools that help

Modern tools accelerate and improve the cycle:

Tool categoryExamples from excerptPurpose
PredictionChoosing parentsAid in selecting optimal parent combinations
Creating variationTransformation, gene editingGenerate new useful genetic variation
Speed developmentDoubled haploidySpeeds development of homozygous progeny
DNA-basedMolecular markers, sequence informationEnable genomics-assisted selection
AnalyticsAutomated high-throughput analysisExample: grain composition analysis
PhenomicsSpecialized testing environmentsEvaluate performance, often for stress tolerance

🏭 The cultivar development process

The process of cultivar development is established by crop, enabled through tools, and focuses on practical means to develop improved cultivars.

Key characteristics:

  • Typically involves several steps spanning more than 10 years
  • Includes all steps from crossing parents to evaluating progeny to producing volumes of seed for distribution
  • Given finite resources, the aim is to maximize genetic gain per unit of time and cost

🔧 Core functions

The process involves 4 core functions that utilize different approaches to meet breeding objectives:

  1. Creating breeding populations
  2. Developing progeny with new gene combinations
  3. Evaluating progeny performance
  4. Trait integration (a special case of new line development and evaluation)

🤝 Multidisciplinary engagement

The process pipeline is a multidisciplinary operation requiring expertise from:

Conventional Breeding:

  • Quantitative and population genetics
  • Plant breeding methods
  • Selection theory
  • Statistics & experimental design
  • Knowledge of germplasm
  • Phenotypic evaluation

Molecular Biology:

  • Biochemistry
  • Molecular genetics
  • Genomics
  • Transformation & tissue culture
  • Sequencing
  • Molecular marker technologies
  • Gene cloning

Data Management Analysis Display:

  • Bioinformatics
  • Information technology and management
  • Computer programming
  • Simulation & modeling
  • Statistical and mathematical theory

Engineering:

  • Profiling equipment
  • Analytics (e.g., grain composition)
  • Robotics
  • Nanotechnology

Agronomy/Botany:

  • Plant physiology and biology
  • Soil science
  • Pathology
  • Entomology

✅ Critical success factors

⏰ Timely delivery requirements

To succeed in delivering improved cultivars to the marketplace on a timely basis, it is essential to design the process to:

  • Align completely with stated product targets
  • Fully integrate all aspects

🎯 Critical decision points

The excerpt identifies key decision points:

  • Specifying your product target and your target market region
  • Choosing optimal parents to create breeding populations
  • Developing progeny with new gene combinations
  • Evaluating progeny to identify truly superior individuals
  • Selecting progeny to advance for further evaluation and to release as improved cultivars
  • Deploying tools

Each decision point affects the efficiency of the entire cycle and the ultimate success in delivering improved cultivars.

🌍 The broader context: addressing global nutrition

🍽️ Why better food matters

The excerpt emphasizes that crop improvement must be directed to producing better food as well as more food.

Undernutrition: caused by not having enough food
Malnutrition: "hidden hunger" from nutrient deficiencies

📉 Impact of malnutrition

Facts from the excerpt:

  • 3.5 million maternal and child deaths could be prevented annually with improved nutrition
  • In developing countries, iron deficiency affects half of children under age 5, impairing growth, cognitive development, and immune function
  • Vitamin A deficiency affects at least 100 million children, limiting growth, weakening immunity, and in acute cases, leading to blindness
  • More than one-third of all African children suffer stunting (low height for weight, irreversible after age 2) due to malnutrition and undernutrition
  • Stunting is associated with lifetime debilitating neurological effects: poor cognition and learning, low adult wages, lost productivity, and increased risk of chronic disease
  • Undernutrition during the critical window from conception to 2 years of age is associated with lower human capital
  • The devastating effects are across generations: a girl who was fed poorly as an infant is likely to have offspring with lower birth weight

This context explains why the cultivar improvement cycle must address both quantity (yield) and quality (nutritional content) of crops.

4

The Cultivar Development Process

The Cultivar Development Process

🧭 Overview

🧠 One-sentence thesis

Plant breeders maximize genetic gain per unit of time and cost by designing a systematic, multi-year process that integrates selection theory, multiple disciplines, and critical decision points to deliver improved cultivars aligned with specific product targets.

📌 Key points (3–5)

  • Selection response depends on three factors: total variation in the population, heritability (repeatability) of the trait, and selection pressure imposed by the breeder.
  • Rate of genetic gain formula: combines heritability, phenotypic variability, selection intensity, and breeding cycle length to predict improvement speed.
  • The process is systematic and long: cultivar development is crop-specific, spans more than 10 years, and includes all steps from crossing parents to seed distribution.
  • Four core functions: the process involves distinct approaches (New Line Development, New Line Evaluation, and Trait Integration as a special case) supported by multidisciplinary teams.
  • Common confusion—product target vs. characteristics: a product target describes what and for where, which then implies specific measurable characteristics with threshold levels.

🧬 Selection principles and genetic equations

🧬 Selection response (R)

Selection response: the change in trait mean from one generation to the next as a result of selecting top-performing individuals as parents.

  • How it works: top performers from the base population are chosen as parents → they produce the next generation → this completes one cycle of selection.
  • The response depends on:
    • Total variation in the population (visualized as bell-shaped curves).
    • Heritability (h²): how repeatable the trait is.
    • Selection pressure (S): how strongly the breeder filters individuals.
  • Breeder's Equation: R = h² × S
    • Where S = (mean of selected parents) − (mean of base population).
    • Where R = (mean of offspring from selected parents) − (mean of base population).
  • Rearranging gives heritability as: h² = R / S.

⚡ Rate of genetic gain

  • What it measures: how fast improvement happens over time.
  • Four factors that determine it:
    1. Heritability of the trait.
    2. Phenotypic variability in the base population.
    3. Selection intensity.
    4. Length of the breeding cycle.
  • The excerpt emphasizes that the breeder must rely on a systematic approach to increase favorable alleles and maximize this rate so improved cultivars reach farmers on time.

🔄 The cultivar development process structure

🔄 What the process is

The process of cultivar development: a systematic, crop-specific pipeline that uses tools and practical means to develop improved cultivars, typically spanning more than 10 years.

  • Key characteristics:
    • Established by crop (each crop has its own process).
    • Enabled through tools, etc.
    • Focuses on practical means.
    • Includes all steps: crossing parents → evaluating progeny → producing volumes of seed for distribution.
  • Goal: maximize genetic gain per unit of time and cost, given finite resources.

🧩 Four core functions

The process involves four core functions that use different approaches to meet breeding objectives:

  1. New Line Development: creating new genetic combinations.
  2. New Line Evaluation: assessing progeny to identify superior individuals.
  3. Trait Integration: a special case of New Line Development and Evaluation (the excerpt notes this explicitly).
  4. (The excerpt mentions "4 core functions" but only details three; the fourth is implied to be part of the pipeline.)

🏢 Supporting groups and facilities

  • The process pipeline engages supporting groups and various facilities.
  • It is a multidisciplinary operation (see next section).

🤝 Multidisciplinary engagement

The process requires expertise from five major disciplines:

DisciplineKey areas
1. Conventional BreedingQuantitative & population genetics; plant breeding methods; selection theory; statistics & experimental design; germplasm knowledge; phenotypic evaluation
2. Molecular BiologyBiochemistry; molecular genetics; genomics; transformation & tissue culture; sequencing; molecular marker technologies; gene cloning
3. Data Management Analysis DisplayBioinformatics; information technology & management; computer programming; simulation & modeling; statistical and mathematical theory
4. EngineeringProfiling equipment; analytics (e.g., grain composition); robotics; nanotechnology
5. Agronomy/BotanyPlant physiology & biology; soil science; pathology; entomology
  • Why this matters: the excerpt states it is essential to fully integrate all aspects to deliver improved cultivars on time.

🎯 Critical decision points and up-front design

🎯 Critical decision points

The breeder must make decisions at several stages:

  • Specifying product target and target market region.
  • Choosing optimal parents to create breeding populations.
  • Developing progeny with new gene combinations.
  • Evaluating progeny to identify truly superior individuals.
  • Selecting progeny to advance for further evaluation and to release as improved cultivars.
  • Deploying tools and technologies for greater efficiency and effectiveness.

📋 Up-front decisions before beginning

Before starting any activities, the breeder faces four important decisions:

  1. Specify the product target.
  2. Define the market region for the new cultivars.
  3. Identify base germplasm.
  4. Design the breeding strategy.
  • Why alignment is essential: the process must align completely with stated product targets to succeed in timely delivery.

🎯 Determining the product target

🎯 Organizational mission and team goals

  • A breeder's specific product targets fall out of the organizational mission and research team goals.
  • Whether working for a company or a national program, the high-level mission translates into an overall product goal (which may be a portfolio of products for a large region and may include more than one crop).
  • The excerpt emphasizes: "You are part of a team effort!"

🎯 Individual breeding program goals

  • An individual breeding program has more specific goals, centering on a particular crop.
  • The breeder carefully and specifically describes what is desired as a net result of the breeding process.
  • Note: an individual program may have more than one product target.

🎯 What is a product target?

Product target: describes the "What" (characteristics) and the "For Where" (market region).

Example from the excerpt (soybean):

  • What: Maturity group 5–6 (early to medium); high seed yield; high yield stability (consistent performance across environments); minimal/no lodging (good standability at harvest); minimal/no pod shattering ahead of harvest; RR1 transgenic event (tolerance to glyphosate herbicide).
  • For Where: (implied by maturity group and market region).

📊 Product targets indicate specific characteristics

Once target characteristics are specified, the breeder must also specify:

  • Targeted levels of these characteristics.
  • The way the characteristics will be measured.

Example table from the excerpt:

CharacteristicMeasurement StandardThreshold Level / Range
High seed yieldMachine harvest; seed weight at 13% moisture basis, expressed per unit of land10% greater (than baseline)
  • Don't confuse: the product target (the "what" and "for where") is the starting point; it implies specific measurable characteristics with thresholds, which are then defined in detail.
5

Designing the Process: Up-Front Decisions

Designing the Process: Up-Front Decisions

🧭 Overview

🧠 One-sentence thesis

Before starting breeding activities, the breeder must make critical up-front decisions—specifying the product target, defining the market region, identifying base germplasm, and designing the breeding strategy—so that the entire process aligns with stated product targets and delivers improved cultivars on a timely basis.

📌 Key points (3–5)

  • Four up-front decisions: specify the product target, define the market region, identify base germplasm, and design the breeding strategy.
  • What a product target describes: the "What" (specific characteristics and their threshold levels) and the "For Where" (the market region and population of environments).
  • How characteristics are specified: each target characteristic requires a measurement standard and a threshold level or range that will be used in selection.
  • Common confusion: the market region is not just geography—it includes the population of environments (locations, production management systems, season, maturity zone, altitude, etc.), and testing environments must be representative samples from that population.
  • Why alignment matters: complete alignment with stated product targets and full integration of all aspects are essential to succeed in delivering improved cultivars to the marketplace on a timely basis.

🎯 The four up-front decisions

🎯 What the breeder must decide before starting

Before beginning any activities, the breeder faces four important up-front decisions:

  • Specify the product target: describe what is desired as the net result of the breeding process.
  • Define the market region: identify where the new cultivars will be deployed.
  • Identify base germplasm: choose the starting genetic material.
  • Design the breeding strategy: plan the approach to meet breeding objectives.

🔗 Why these decisions are critical

  • The excerpt emphasizes that to succeed in delivering improved cultivars on a timely basis, it is essential to design the process to:
    • Align completely with stated product targets.
    • Fully integrate all aspects.
  • These up-front decisions ensure that all subsequent activities (choosing parents, developing progeny, evaluating and selecting) are aligned with the end goal.

📋 Understanding the product target

📋 What a product target is

Product target: describes the "What" and the "For Where."

  • The "What": the specific characteristics desired in the new cultivar.
  • The "For Where": the market region where the cultivar will be used.
  • An individual breeding program may have more than one product target.

🌱 How characteristics are specified

Each target characteristic must be defined with:

  1. The characteristic itself (e.g., high seed yield, lodging resistance, maturity group).
  2. A measurement standard (how the characteristic will be measured).
  3. A threshold level or range (the target level that will be used in selection).

Example from the excerpt (soybean for South Africa):

CharacteristicMeasurement StandardThreshold Level / Range
High seed yieldMachine harvest; seed weight at 13% moisture basis, expressed per unit of land10% greater than Variety X
High yield stabilityUse regression analysis or GGE biplot analysisComparable to Variety X
Lodging resistance1-5 scale; 1=plant erect, 5=prostrateScore ≤2
Resistance to pod shatteringOven dry method; 10 point scale measuring percentage affected; 0=none, 1=1-10%, 10=91-100%Score ≤1
Medium maturityMaturity Group; day length and temperature required to initiate floral development; full range includes Group 000 to Group 9MG 5-7
RR1 event (Roundup Ready 1)Integrate 40-3-2Pre-determined level of glyphosate tolerance
  • The target levels for the target characteristics become the thresholds that will be used in selection.
  • For value-added traits like RR1, achieving the desired level of trait expression is typically a function of integrating the particular transgenic event through either backcross or forward breeding.

🗺️ How the market region is specified

The market region is not just a geographic area; it includes the population of environments:

  • Geography (e.g., dryland and irrigated areas of South Africa involving corn production).
  • Production management system (e.g., use of inoculant "Rhizobia strain WB74" to ensure nitrogen-fixing bacteria are present for corn rotation).
  • Season, maturity zone, altitude, soil types, planting dates, farmer practices, etc.

Don't confuse: the market region with just a place name—it is the full set of environmental and management conditions where the cultivar will be grown.

🧪 Testing environments must be representative

  • Locations used as testing environments must be representative in terms of locations, planting dates, farmer practices, soil types, etc.
  • Think of your testing environments as "samples" from the population of environments.
  • This ensures that evaluation results predict performance in the actual market region.

🏢 How product targets emerge from organizational goals

🏢 From organizational mission to individual program goals

The excerpt describes a hierarchy:

  1. Organizational mission (high-level; whether working for a company or a national program).
    • In a seed company, this is translated into an overall product goal, which may be a portfolio of products for a large region.
    • It may include more than one crop.
  2. Individual breeding program goals (more specific; centering on a particular crop).
    • The breeder carefully and specifically describes what is desired as a net result of the breeding process.
    • An individual program may have more than one product target.
  • The breeder's specific product targets will fall out of his/her organizational mission and research team goals.
  • You are part of a team effort!

🎯 Example: soybean product target for South Africa

The excerpt provides a detailed example of a product target for an improved soybean variety for South Africa:

What characteristics does this product target imply?

  • Maturity group 5–6 (i.e., early to medium).
  • High seed yield.
  • High yield stability (i.e., consistent performance across all types of environments).
  • Minimal/no lodging (i.e., good standability at harvest).
  • Minimal/no pod shattering ahead of harvest.
  • RR1 transgenic event (which imparts tolerance to glyphosate herbicide).

What market region is specified?

  • Dryland and irrigated areas of South Africa involving corn production.
  • Because the purpose of soybean in corn rotation is to fix nitrogen in the soil, certain bacteria must be present in the soil to facilitate this activity.
  • Soybean works together with Rhizobia and other bacterial species to convert atmospheric nitrogen to a form readily usable by plants, presumably the corn crop in the following year.
  • To ensure that nitrogen-fixing strains of bacteria are present, farmers may inoculate the soil; in this case, use of the inoculant "Rhizobia strain WB74" is specified as a production management practice.

🔧 Critical decision points in the breeding process

🔧 What decisions are critical

Critical decision points involve:

  • Specifying your product target and your target market region (covered above).
  • Choosing optimal parents to create breeding populations.
  • Developing progeny with new gene combinations.
  • Evaluating progeny to identify truly superior individuals.
  • Selecting progeny to advance for further evaluation and to release as improved cultivars.
  • Deploying tools and technologies for greater efficiency and effectiveness.

🧩 How these decisions fit into the process

  • The excerpt mentions that the process pipeline involves 4 core functions that utilize different approaches to meet breeding objectives (though the details of these functions are not fully provided in the excerpt).

  • The process pipeline is a multidisciplinary operation engaging supporting groups and various facilities.

  • The excerpt lists five multidisciplinary areas involved:

    1. Conventional Breeding (genetics, plant breeding methods, selection theory, statistics & experimental design, knowledge of germplasm, phenotypic evaluation).
    2. Molecular Biology (biochemistry, molecular genetics, genomics, transformation & tissue culture, sequencing, molecular marker technologies, gene cloning).
    3. Data Management Analysis Display (bioinformatics, information technology, information management, computer programming, simulation & modeling, statistical and mathematical theory).
    4. Engineering (profiling equipment, analytics e.g. grain composition, robotics, nanotechnology).
    5. Agronomy/Botany (plant physiology, plant biology, soil science, pathology, entomology).
  • The excerpt also mentions the Breeder's Equation: genetic selection response (R) depends on the total variation in the population, the heritability (repeatability) of the trait, and the selection pressure (S) imposed (though the full equation details are not completely provided in the excerpt).

6

What is a Product Target?

What is a Product Target?

🧭 Overview

🧠 One-sentence thesis

A product target guides the breeding process by specifying both the desired characteristics of a new cultivar and the market region where it will be grown, serving as the foundation for all subsequent breeding decisions.

📌 Key points (3–5)

  • Product target defines "What" and "For Where": it describes the specific characteristics desired in a new cultivar and the target market region (geography, production system, environment).
  • Targets flow from organizational mission: a breeder's product targets derive from the organizational mission, which translates into overall product goals, then into individual breeding program goals.
  • Characteristics must be measurable with thresholds: each target characteristic requires a measurement standard and a threshold level that will be used in selection.
  • Common confusion: product target vs breeding strategy—the target specifies what is desired and where it will be used; the breeding strategy determines how to achieve it (methods, selection intensity, testing schemes).
  • Type of cultivar is part of the target: the product target indicates whether the result will be a pure line, hybrid, open-pollinated variety, synthetic, blend, or clonally propagated cultivar.

🎯 What a product target specifies

🎯 The "What": target characteristics

Product target: describes the "What" and the "For Where."

  • The "What" refers to the specific characteristics the breeder wants in the new cultivar.
  • These characteristics are not vague wishes; they must be concrete and specific.
  • Example from the excerpt: an improved soybean variety for South Africa requires maturity group 5–6, high seed yield, high yield stability, minimal lodging, minimal pod shattering, and the RR1 transgenic event (glyphosate tolerance).

🗺️ The "For Where": target market region

  • The "For Where" specifies the market region in terms of geography, production management system, season, maturity zone, altitude, and other environmental factors.
  • This is also called the population of environments—the full range of conditions where the cultivar will be grown.
  • Example: the soybean product target specifies "dryland and irrigated areas of South Africa involving corn production," including the use of a specific inoculant (Rhizobia strain WB74) as a production management practice.
  • Testing environments must be representative samples from this population of environments (locations, planting dates, farmer practices, soil types).

📏 Measurements and thresholds

Once characteristics are specified, the breeder must define:

  • Measurement standard: the protocol or method used to evaluate each characteristic.
  • Threshold level/range: the minimum acceptable performance that will be used in selection.
CharacteristicMeasurement StandardThreshold Level/Range
High seed yieldMachine harvest; seed weight at 13% moisture basis, expressed per unit of land10% greater than Variety X
High yield stabilityRegression analysis or GGE biplot analysisComparable to Variety X
Lodging resistance1-5 scale; 1=plant erect, 5=prostrateScore ≤2
Resistance to pod shatteringOven dry method; 10 point scale measuring percentage affected; 0=none, 1=1-10%, 10=91-100%Score ≤1
Medium maturityMaturity Group; day length and temperature required to initiate floral development; full range includes Group 000 to Group 9MG 5-7
RR1 event (Roundup Ready 1)Integrate 40-3-2Pre-determined level of glyphosate tolerance
  • The thresholds become the selection criteria used throughout the breeding process.
  • For value-added traits like RR1, achieving the desired trait expression typically requires integrating a particular transgenic event through backcross or forward breeding.

🏢 How product targets are determined

🏢 Organizational mission

  • Whether working for a seed company or a national program, the breeder operates within a high-level organizational mission.
  • The excerpt emphasizes: "You are part of a team effort!"
  • The organizational mission provides the overarching purpose and direction.

🎯 Overall product goal

  • In a seed company, the organizational mission translates into an overall product goal.
  • This may be a portfolio of products for a large region and may include more than one crop.
  • The overall product goal is broader than any single breeding program.

🌱 Individual breeding program goals

  • An individual breeding program has more specific goals, centering on a particular crop.
  • The breeder "carefully and specifically describes what is desired as a net result of the breeding process."
  • Note: an individual program may have more than one product target.

🌾 Types of cultivars in product targets

🌾 Six main types

The product target indicates the type of cultivar to be developed:

TypeDefinitionExample
Pure line varietyHomozygous and homogeneous; can be considered an inbred lineSoybean, pea
HybridResult of crossing two genetically different lines; designed to exploit heterosis (hybrid vigor), which may be expressed as increased yield and more robust plant healthMaize
Open-pollinated variety (OPV)Similar to a random mating population wherein cross-pollination occurs due to wind, insects, birds, and other natural mechanismsCarrots
SyntheticPopulation of cross-pollinated plants, typical of crop species that are self-incompatible for self-pollinationAlfalfa
BlendMixture of genotypes intended for genetic diversity to promote yield stability; may comprise different types of disease resistance, slightly different maturities, or varying levels of winter hardinessWheat
Clonally propagated cultivarGenetically identical to a "mother plant" that is the result of cross-pollination; often these species are polyploidsPotato

🧬 Factors that dictate cultivar type

The reproductive system of the crop, its life cycle, and its ploidy level may dictate the types of cultivars that can be developed:

  • Obligate out-crossing species cannot produce pure line cultivars.
  • Perennial crops include fruit tree species, many of which are produced from highly heterozygous parents; these are often reproduced and distributed as clones.
  • Seedless cultivars may result from hybridizing parents of different ploidy numbers.
    • Example: seedless watermelon results from crossing tetraploid with diploid watermelon to produce a triploid cultivar, which is used in production fields with a diploid pollinizer to produce sterile (seedless) fruit.

🌽 Hybrid cultivars: special considerations

  • Crops displaying heterosis may lend themselves to hybridization as a means to spike yields.
  • Crops that have transitioned from pure lines or OPVs to hybrids, or are in process, include maize, rice, sorghum, pearl millet, wheat, barley, sunflowers, cucumbers, tomatoes, melons, squash, and others.
  • Essential requirement: a dependable, cost-efficient means to control pollination in producing the hybrid seed (e.g., cytoplasmic male sterility, mechanical means to remove male flowers, environmentally- or chemically-induced male sterility, or a dioecious crop species).
  • Farmer requirement: farmers must be willing to purchase fresh seed each year (saved seed will not display the same levels of hybrid vigor).

🔗 Product target and other up-front decisions

🔗 Four key up-front decisions

Before beginning any breeding activities, the breeder faces important up-front decisions:

  1. Specify the product target (the focus of this excerpt).
  2. Define the market region for the new cultivars (overlaps with product target's "For Where").
  3. Identify base germplasm: consider what germplasm to use as parents in creating progeny with useful, new gene combinations; sources representing a high frequency of favorable alleles for the traits to be improved are needed.
  4. Design the breeding strategy: consider breeding strategies to be used to assemble the suite of characteristics in one line.

🧪 Breeding strategy: what it includes

  • Don't confuse product target with breeding strategy: the target specifies the desired outcome; the strategy specifies the methods to achieve it.
  • Breeding strategies incorporate testing schemes and selection criteria, and also consider:
    • Mode of reproduction.
    • Selection intensity (how many individuals will be selected at each evaluation step) and order in which selection for particular traits is implemented.
    • Breeding methods.
    • Systems and facilities available.
    • Experimental designs for performance evaluation.
    • Predicted response to selection.
  • The total package of characteristics is what will constitute the new, improved variety.
7

Types of Cultivars

Types of Cultivars

🧭 Overview

🧠 One-sentence thesis

Different cultivar types—pure lines, hybrids, open-pollinated varieties, synthetics, blends, and clones—are developed based on the crop's reproductive system, life cycle, and ploidy level to meet specific market needs.

📌 Key points (3–5)

  • Six main cultivar types exist: pure line, hybrid, open-pollinated variety (OPV), synthetic, blend, and clonally propagated.
  • Reproductive biology dictates options: obligate out-crossing species cannot produce pure lines; perennial crops often use clones; ploidy manipulation can create seedless varieties.
  • Hybrids exploit heterosis: crossing genetically different lines produces hybrid vigor (increased yield and robust health), but requires cost-efficient pollination control and farmers willing to buy fresh seed annually.
  • Common confusion: not all crops can use all cultivar types—the crop's biology (self vs. cross-pollination, annual vs. perennial, ploidy) constrains which types are feasible.
  • Market and biology together determine type: the product target specifies which cultivar type to develop, but biological constraints may limit choices.

🌱 The six cultivar types

🌱 Pure line variety

A pure line variety is homozygous and homogeneous; it can be considered an inbred line.

  • All genetic material is uniform and consistent across plants.
  • Example crops: soybean, pea.
  • These are self-pollinating species where uniformity is achievable and desirable.

🌱 Hybrid

A hybrid is the result of crossing two genetically different lines, designed to exploit "heterosis" or hybrid vigor.

  • Heterosis may express as increased yield and more robust plant health.
  • Example crop: maize.
  • Requires dependable, cost-efficient pollination control methods (cytoplasmic male sterility, mechanical removal of male flowers, chemically-induced sterility, or dioecious species).
  • Important constraint: farmers must purchase fresh seed each year because saved seed will not display the same hybrid vigor.

🌱 Open-pollinated variety (OPV)

An open-pollinated variety is similar to a random mating population wherein cross-pollination occurs due to wind, insects, birds, and other natural mechanisms.

  • Example crop: carrots.
  • Natural pollination maintains genetic diversity within the variety.

🌱 Synthetic

A synthetic is a population of cross-pollinated plants, typical of crop species that are self-incompatible for self-pollination.

  • Example crop: alfalfa.
  • Used when the crop cannot self-pollinate due to biological incompatibility.

🌱 Blend

A blend is a mixture of genotypes intended for genetic diversity to promote yield stability.

  • May comprise different types of disease resistance, slightly different maturities, or varying levels of winter hardiness.
  • Example crop: wheat.
  • The diversity within the blend buffers against environmental variation.

🌱 Clonally propagated cultivars

Clonally propagated cultivars are genetically identical to a "mother plant" that is the result of cross-pollination.

  • Often these species are polyploids.
  • Example crop: potato.
  • Each plant is a genetic copy of the original superior individual.

🧬 Biological constraints on cultivar type

🧬 Reproductive system limitations

The excerpt identifies three key biological factors that dictate which cultivar types are possible:

FactorConstraintExample
Obligate out-crossingCannot produce pure line cultivarsSelf-incompatible species must use synthetics, OPVs, or hybrids
Perennial life cycleOften use clonal propagationFruit tree species from highly heterozygous parents distributed as clones
Ploidy manipulationCan create seedless cultivarsSeedless watermelon from crossing tetraploid × diploid → triploid
  • Don't confuse: the cultivar type is not a free choice—it must match the crop's biology.
  • Example: seedless watermelon results from crossing tetraploid with diploid watermelon to produce a triploid cultivar, which is used in production fields with a diploid pollinizer to produce sterile (seedless) fruit.

🧬 Perennial crops and heterozygosity

  • Perennial crops include fruit tree species.
  • Many are produced from highly heterozygous parents.
  • These are often reproduced and distributed as clones to maintain desirable traits.

🌽 Hybrid cultivars in detail

🌽 When hybrids make sense

  • Crops displaying heterosis may lend themselves to hybridization as a means to spike yields.
  • Crops that have transitioned from pure lines or OPVs to hybrids, or are in process, include: maize, rice, sorghum, pearl millet, wheat, barley, sunflowers, cucumbers, tomatoes, melons, squash, and others.

🌽 Essential requirements for hybrid programs

Two critical requirements must be met:

  1. Pollination control: A dependable, cost-efficient means to control pollination in producing the hybrid seed.

    • Options include: cytoplasmic male sterility, mechanical means to remove male flowers, environmentally- or chemically-induced male sterility, or a dioecious crop species.
  2. Farmer cooperation: Farmers must be willing to purchase fresh seed each year.

    • Saved seed will not display the same levels of hybrid vigor.
    • This is an economic and adoption consideration, not just a biological one.

🌽 Why saved hybrid seed fails

  • The excerpt emphasizes that saved seed from hybrids "will not display the same levels of hybrid vigor."
  • This is because the genetic segregation in the next generation breaks up the favorable gene combinations that produced the original heterosis.
  • Don't confuse: the hybrid itself performs well, but its offspring do not—this is a fundamental genetic principle, not a seed quality issue.
8

Choosing Parents in Cultivar Development

Choosing parents

🧭 Overview

🧠 One-sentence thesis

Choosing parents for cultivar development requires selecting lines that both possess favorable alleles for target traits and contribute genetically diverse combinations to maximize the frequency of superior progeny.

📌 Key points (3–5)

  • The "good × good" cross ideal: both parents contribute favorable but different alleles, increasing the likelihood that progeny exceed either parent's performance.
  • Two core criteria: parents must have favorable alleles for traits of interest AND be genetically diverse to boost overall favorable allele frequency.
  • Breeding value vs. phenotype: parents pass genes, not genotypes or phenotypes; estimated breeding value (EBV) predicts progeny performance better than simple mean performance.
  • Common confusion: genotypic value (includes all genetic effects) vs. breeding value (only additive effects that predict progeny mean).
  • When ideal crosses aren't feasible: backcrossing to elite parents can modulate the effect of non-elite parents and control "dosage" of less favorable germplasm.

🌱 The cycle and pipeline framework

🔄 Process implements the cycle

The process of cultivar development becomes the mode and mechanism to effectively implement the cycle of cultivar improvement.

  • The process translates the improvement cycle into concrete steps: choosing parents, creating progeny, evaluating and selecting.
  • Goal: increase the frequency of favorable alleles for traits specified in the product target.
  • The way parents are chosen, progeny identified, and cultivars created all reflect this goal.

📈 Maximizing genetic gain

The process aims to produce a pipeline of improved cultivars by maximizing:

  • Selection response (R): how much the trait mean shifts due to selection.
  • Rate of genetic gain (ΔG): R divided by the length of a breeding cycle (L).

Key equation components:

  • R depends on heritability (h²), selection intensity (i), and phenotypic or additive genetic variation (σ).
  • Faster cycles and stronger selection pressure increase gain per unit time.

🎯 Core considerations in choosing parents

🏆 Best performers for traits of interest

  • Select parents that are contributors of favorable alleles.
  • Look for high mean phenotypic performance for the traits you want to improve.
  • In self-pollinated crops like soybean, the mid-parent value (average of the two parents) is the best initial indicator of progeny performance.

🧬 Genetic diversity

  • Parents should contribute favorable alleles at different loci (not the same alleles).
  • Genetic diversity boosts the overall frequency of favorable alleles and the number of loci with favorable alleles present.
  • Methods to assess diversity:
    • Pedigree analysis
    • Geographic inference
    • Cluster analysis based on molecular marker profiles

🗂️ Sources of parental germplasm

Potential sources include:

  • Current cultivars
  • Elite breeding lines
  • Acceptable breeding lines with superiority in one or a few characteristics (genetic stocks)
  • Plant introductions, landraces
  • Related species such as wild relatives

Germplasm can be accessed through germplasm banks (repositories) around the globe, which may contain materials from primary (GP-1), secondary (GP-2), or tertiary (GP-3) gene pools.

🔬 Assessing genetic diversity

🌳 Cluster analysis with molecular markers

  • Lines are genotyped for a set of markers; marker alleles are scored.
  • Each pair of lines is compared to compute an overall estimate of similarity, then converted to dissimilarity.
  • Output is a dendrogram: a visual depiction of relative dissimilarity among lines.

Tips for cluster analysis:

  • Choose appropriate marker type and number for adequate genome coverage.
  • Select a method for estimating dissimilarity and joining clusters.
  • Include familiar lines of known background as "anchors."
  • Remember: output is relative to the lines included, not absolute.

📜 Clustering based on pedigree records

  • Cluster analysis can also use complete and accurate pedigree records.
  • Dissimilarity estimates are computed from the coefficient of parentage.
  • Example: dendrogram of 38 soybean lines prominently used as parents in U.S. commercial germplasm.

Don't confuse: cluster analysis highlights genetic diversity but does NOT indicate whether favorable alleles for traits of interest are present—it only shows relatedness.

🧮 Breeding value and prediction methods

💡 What is breeding value?

Breeding value (BV): the value of an individual judged by the mean value of its progeny; the value of genes (not genotype) passed on to progeny; the sum of the average effects of genes in an individual.

  • Parents pass genes, not their genotype or phenotype, to progeny.
  • BV is the mean deviation from the population mean of individuals who received that allele from one parent, the other allele coming at random from the population.

Single-locus example:

  • If allele A₁ has average effect +10 and A₂ has effect –10:
    • BV of A₁A₁ = 20
    • BV of A₁A₂ = 0
    • BV of A₂A₂ = –20

📊 Estimated breeding value (EBV)

Estimated breeding value (EBV): a statistical prediction of the relative genetic merit of individuals as parents.

  • EBV is a function of narrow-sense heritability (only additive genetic variance).
  • Formula: EBV = (Phenotype deviation from mean) × h²
  • Example: if P = +3.0 units and h² = 0.33, then EBV = +1.0 unit.

Why EBV matters:

  • The predicted phenotype of the progeny is the average of the breeding values of the parents.
  • Used to rank available candidates for developing new breeding populations.

🔧 BLUP (Best Linear Unbiased Prediction)

BLUP = Best Linear Unbiased Prediction:

  • Best: minimum error
  • Linear: estimates are linear functions of the data
  • Unbiased: average value of estimate equals average value of quantity being estimated
  • Prediction: estimates of random effects (vs. "estimators" for fixed effects)

Model:

  • Phenotype = Environmental Effects + Genetic Effects + Residual Effects
  • Partitions additive effects from total genotypic effects.
  • Adjusts for "fixed effects" (e.g., year, location) to make fair comparisons.

Advantages over mid-parent value:

  • BLUP was more effective than mid-parent value for identifying superior parental combinations.
  • Can predict potential of a pair of lines even when no performance data on the lines is available, only data on relatives.

Tips for using EBVs:

  • Use as one piece of information along with all other knowledge about prospective parents.
  • EBVs are strictly a function of the data used; consider adding new phenotypic data and rerunning BLUP.

🧬 Genomic Selection (GS)

Genomic Selection: methods that use genotypic data across the entire genome to predict performance of any quantitative trait with high accuracy.

Process:

  1. A training population is phenotyped and genotyped to "train" a model.
  2. Estimated effects are computed for each marker allele using the training population.
  3. Genomic Estimated Breeding Values (GEBVs) are calculated for prospective parent lines based on the model (essentially a sum of effects across the individual's genome).
  4. Individuals with highest GEBVs are used as parents and may be advanced directly to varietal testing, bypassing preliminary testing.

⚖️ Breeding value vs. genotypic value

Don't confuse:

ConceptDefinitionWhat it includes
Breeding valueExpected phenotype of an individual's progenyOnly additive effects
Genotypic valueExpected phenotype of an individual given its genotypeAdditive AND non-additive effects

🔀 Working with suboptimal crosses

🤔 When "good × good" isn't feasible

Reasons why suboptimal crosses may be the only option:

  • Favorable alleles for all target traits may not exist in elite germplasm alone.
  • May need to access traits from genetic stocks, unadapted cultivars, or wild relatives.

🔁 Backcrossing to modulate non-elite parent effects

With a "good × not-so-good" cross, backcrossing to the elite parent can control the "dosage" of the non-elite parent genome.

Two main approaches:

  1. 1–3 backcrosses to the elite parent:

    • Access potentially favorable genes for quantitative traits from a lesser elite line, genetic stock, unadapted cultivar, or wild relative.
    • Example: yield improvement in tomato from wild relatives.
  2. ≥6 backcrosses to introgress a single gene:

    • Create an isoline that performs equivalently to the elite parent except for the integrated new trait.
    • Example: trait integration with disease resistance (e.g., lettuce mosaic virus resistance).

📉 Expected germplasm recovery

With each repeated crossing to the recurrent parent (elite parent), the amount of germplasm from the non-recurrent parent (non-elite parent) is reduced by half.

Formula: Percentage of recurrent parent germplasm = 100 × [1 – (1/2)^(n+1)], where n is the backcross generation number.

Bottom line: Choice of parents affects the choice of breeding methods!

🔢 Multiple-parent crosses

🌐 Beyond two-parent combinations

Although 2-parent combinations are most common, breeding populations can use multiple parents:

TypeStructureGermplasm contribution
3-parent(P1 × P2) × P3Depends on generation of 2-parent population
4-way cross(P1 × P2) × (P3 × P4)Each parent ~25%
4-way cross (alternative)[(P1 × P2) × P3] × P4P1, P2 ~12.5%; P3 ~25%; P4 ~50%
Complex>4 parents (five to hundreds)Varies by design

The "dosage" of each parent is determined by the way the population is formed.

⚖️ Complex crosses: trade-offs

Advantages:

  • Greater number of possible alleles in the population for each locus.
  • Greater probability that at least one parent has the most favorable allele at each locus.
  • Greater probability that homozygous parents will have different alleles at linked loci.
  • More allelic combinations possible.

Disadvantages:

  • If some parents are not elite, performance is not likely to exceed the best parent for each trait.
  • Greater probability of heterozygosity at multiple loci.
  • More generations required for inter-mating, expanding breeding cycle time.
  • Might not be as efficient as multiple 2-parent populations.

Note: Expected mean performance of progeny is still the mean of the parents for self-pollinated crops.

📋 Defining the product target

🎯 Dissecting the product target

A product target specifies:

  • Desired characteristics
  • Measurement standards
  • Threshold levels or ranges

Example soybean product target:

CharacteristicMeasurement StandardThreshold
High seed yieldMachine harvest; seed weight at 13% moisture; per unit land10% greater than Variety X
High yield stabilityRegression analysis or GGE biplotComparable to Variety X
Lodging resistance1–5 scale (1=erect, 5=prostrate)Score ≤ 2
Pod shattering resistanceOven dry method; 10-point scaleScore ≤ 1
Medium maturityMaturity Group (day length & temperature requirement)MG 5–6
RR1 EventIntegrate 40-3-2Pre-determined glyphosate tolerance

🔍 Describing a "good × good" cross

For the stated product target, describe ideal parents in terms of:

  • Types of germplasm: likely current cultivars or elite breeding lines.
  • Traits exhibited: each parent should have favorable alleles for different subsets of the target traits, ensuring complementary contributions.
  • Levels of each trait: both parents should meet or exceed thresholds for their respective strong traits.
9

Creating Progeny and Materials for Testing

Creating Progeny and Materials for Testing

🧭 Overview

🧠 One-sentence thesis

After choosing parents, breeders must decide on mating designs, progeny types, inbreeding levels, and breeding methods to produce and advance materials that maximize the probability of identifying superior individuals meeting the product target.

📌 Key points (3–5)

  • Mating design determines the selection unit: the type of progeny produced (full sibs, half sibs, doubled haploids) and how parents are organized affects what can be selected and estimated.
  • Number of progeny depends on heritability and diversity: at least 200 progeny per population for low-to-medium heritability traits; more needed when genetic diversity is limited.
  • Breeding methods advance materials through the pipeline: methods like pedigree breeding and single-seed descent transition selected lines toward homozygosity and the genetic state required for release.
  • Common confusion—early vs. late generation testing: early generation testing saves resources and correlates strongly with late generation performance, but seed supplies are limited and segregation adds noise; homozygous lines increase accuracy but require more time.
  • Evaluation centers on the product target: screens, evaluation protocols, and selection criteria must maximize the ability to identify outstanding individuals that meet or exceed the product target.

🧬 Parental crosses and population structure

🧬 Multi-parent breeding populations

  • Although 2-parent crosses are most common, breeders can use 3, 4, or more parents.
  • 3-parent population: (P1 × P2) × P3, where the 2-parent population may be at F₁ or a later inbreeding stage.
  • 4-parent population (4-way cross) can be formed two ways:
    • (P1 × P2) × (P3 × P4): each parent contributes ~25% of alleles.
    • [(P1 × P2) × P3] × P4: P1 and P2 contribute ~12.5%, P3 ~25%, P4 ~50%.
  • Complex breeding population: uses more than 4 parents (five to hundreds).
  • The "dosage" of each parent is determined by how the population is formed.

⚖️ Advantages and disadvantages of complex crosses

AspectAdvantagesDisadvantages
Allelic diversityGreater number of parents → more possible alleles per locus; higher probability a parent has the most favorable allele at each locusIf some parents are not elite, performance may not exceed the best parent for each trait
Linked lociGreater probability homozygous parents have different alleles at linked lociGreater probability of heterozygosity at multiple loci
Allelic combinationsMore allelic combinations possibleMore generations required for inter-mating, expanding breeding cycle time compared to 2-parent populations
EfficiencyExpected mean performance of progeny is still the mean of parents (self-pollinated crops)Might not be as efficient as multiple 2-parent populations

🧪 Mating design and selection units

🧪 What mating design determines

Mating design: the plan for producing and organizing progeny; it determines the "selection unit."

  • After deciding on parental combinations, the breeder must decide what progeny to produce and evaluate.
  • The mating design employed determines what can be selected (the selection unit).

🔍 Factors in choosing a mating design

  • Mode of reproduction: mating flexibility of the crop.
  • Specific objectives:
    • Understanding gene action.
    • Estimating genetic variance components and heritability (necessary for designing the breeding process structure).
    • Choosing among breeding methods.
    • Predicting gain from selection.
    • Making selections and developing an improved cultivar.
  • Reliability of estimates.
  • Rule: use the simplest design that meets your needs.
  • For self-pollinated crops, full sib designs are common.

📐 Specifics to determine

  • Types of progeny: e.g., full sibs, half sibs, doubled haploids.
  • Number of types of progeny and scheme for organizing parents.
  • Total number of progeny.

📊 Estimating genetic variance with mating designs

Four steps are required:

  1. Develop one or more types of progeny.
  2. Evaluate progeny in a set of environments.
  3. Estimate variance components from mean squares in the ANOVA.
  4. Interpret variance components in terms of covariances between relatives.

1-Factor Design:

  • Uses a single type of progeny.
  • ANOVA includes one component of variance for progeny and one covariance between relatives.
  • The Expected Means Square table provides a way to estimate the component of variance, which is then interpreted in terms of the covariance of relatives.
  • Example interpretations depend on progeny type:
    • If progeny are clonally reproduced.
    • If half sib families.
    • If full sib families.

🔢 Common mating designs

Mating designs are classified by number of factors, parents, and modalities.

  • 1-Factor design: single type of progeny.
  • 2-Factor designs (can estimate dominance and additive variances):
    • Design I: hierarchical, nested.
    • Design II: factorial design—cross-classified; parents must be inbreds.
    • Diallel: used to estimate GCA (general combining ability) and SCA (specific combining ability).
  • 3-Factor Designs: have at least one grandparent in common; can estimate additional variance components (e.g., epistatic).
    • Design III: can screen out pseudo-overdominance.

🌱 Level of inbreeding and progeny numbers

🌱 Early vs. late generation testing

  • Decision: will selection be initiated with segregating materials or fully homozygous lines?
  • Evidence for early generation testing:
    • Strong correlation between early generation performance and late generation performance for yield and other quantitative traits, even with lowly heritable traits.
    • Discarding poor-performing progeny early saves testing resources.
  • Limitations of early testing:
    • Seed supplies may be limited in early generations.
    • Segregation adds "noise" to the data.
  • Advantage of homozygous lines:
    • Eliminates noise due to segregation.
    • Increases selection accuracy.
  • Trade-off: producing more accurate estimates of progeny means vs. sampling a larger number of progeny per cross.
  • Don't confuse: early testing is efficient but less accurate; late testing is more accurate but more resource-intensive.

🔢 Determining the number of progeny

Key question: What number of progeny must be tested to assure that at least one truly superior progeny will be identified if it exists in that population?

General guidelines:

  • The number depends on trait heritability, expected proportion of truly outstanding progeny in the population, and selection intensity applied.
  • At least 200 progeny per population for traits of low to medium heritability.
  • More than 200 if genetic diversity is limited: e.g., with soybean, more progeny are needed to find unique gene combinations.
  • Commercial soybean programs suggest population sizes upwards of 500 (200–500 are selected after evaluation for disease screening or morphology).
  • Must consider the number of traits to be improved and the intended selection intensity for each.

🔄 Breeding methods for advancing progeny

🔄 What breeding methods do

Breeding method: the approach for advancing selected materials through the breeding pipeline; also called the "recombination unit."

  • Breeding methods involve the progression of selected materials through the breeding pipeline.
  • Testing occurs in various stages; selections from one stage are advanced to the next stage, often involving the next generation.
  • Advanced stages of testing provide a view of performance of entries as released cultivars.
  • Breeding methods transition selected lines to the genetic state required for release as a new, improved cultivar.

🌾 The pedigree breeding method

  • A method used in the inbreeding of populations of self- or cross-pollinated species with the goal of developing inbred lines.
  • Typically starts with an F₂ population, with self-pollination of each generation until fully inbred lines are generated.
  • Selections can be made in each generation as desired:
    • Advancing best families.
    • Best rows within a family.
    • Best plants within rows.
  • The pedigree breeding method can be used to select among families as well as within families.

🌿 Single-seed descent (a version of pedigree)

  • Involves the advancement of F₂ lines through self-pollination to homozygosity.
  • Process:
    • Harvest a single seed from each plant at each generation.
    • Bulk the individual seeds.
    • Plant out the entire bulk to represent the next generation.
  • Testing is initiated once the desired level of homozygosity is reached.
  • Advantages:
    • Easy way to maintain large populations through the inbreeding process.
    • Can be implemented in a relatively short timeframe using off-season nurseries and greenhouses.

🌾 Modified single-seed descent

  • Referenced in commercial soybean breeding programs.
  • Modification: retain a single pod containing two to three seeds from each F₂ plant (instead of a single seed) and bulk across the population.
  • Purpose: expands the number of F₂:₃ families represented in the breeding population.
  • Example: A commercial soybean program uses modified single-seed descent to advance S₀ plants to S₁.

🎯 Evaluation and selection

🎯 Focus on the product target

  • The cycle of cultivar improvement and the process of cultivar development center on the product target.
  • Genetic variation in breeding populations is exploited to make genetic gain toward the defined product target.

🔑 Key decisions in evaluation

After deciding what materials to evaluate, key decisions involve:

  • The basis for evaluation.
  • How evaluations are conducted.
  • Criteria for selection.
  • Goal: maximize the ability and probability of identifying outstanding individuals that meet or exceed the product target if these are present in the breeding populations.

🧪 Trait screens

Screens: measurements needed to assess traits of interest.

Example with soybean seed yield:

  • Seed yield is adjusted to a standard 13 percent grain moisture and expressed in weight per land unit.
  • Measured using a seed thresher or a combine based on weight of seed per plot.
  • Seed moisture may be measured using a moisture meter.
10

Evaluation and Selection

Evaluation and Selection

🧭 Overview

🧠 One-sentence thesis

Effective evaluation and selection in cultivar development requires careful design of testing regimes, appropriate experimental methods, and strategic use of multiple-trait selection approaches to maximize the probability of identifying outstanding individuals that meet or exceed the product target.

📌 Key points (3–5)

  • Core goal: Focus evaluation basis, methods, and criteria to maximize ability to identify outstanding individuals meeting the product target from breeding populations.
  • Testing progression: Move from large numbers of entries at few locations (preliminary trials) to fewer entries at many locations (advanced trials), with increasing selection intensity at each stage.
  • Multiple locations vs replication: For yield trials, maximizing locations with single replications often produces greater genetic gain than fewer locations with more replications per location.
  • Common confusion: Unreplicated trials may seem inadequate, but when combined with multiple locations they can be more effective than replicated trials at fewer sites.
  • Multiple-trait selection hierarchy: Selection index methods are most efficient, followed by independent culling levels, then tandem selection for achieving gains across multiple traits simultaneously.

🔬 Measurement foundations

🔬 Trait screens and protocols

  • Screens measure traits of interest with standardized methods.
  • Example: soybean seed yield measured by weight per plot, adjusted to standard moisture content (13%).
  • Protocols ensure:
    • Accurate, uniform measurements across entire experiments
    • Conformity with accepted practices
    • Reliable discrimination between resistant and susceptible types
  • Example protocol: oven-dried method for pod shattering uses 30 pods exposed to 60°C for 7 hours, counting opened pods.

📊 Uniform nomenclature

  • Performance measurements must use scales known to the Community of Practice for each crop.
  • Uniform terminology enables information sharing within organizations and across the breeding community.
  • Ontology systems (e.g., Breeding Management System) provide standardized vocabularies for multiple crops.

🌍 Testing site design

🌍 Relationship to market region

Testing locations serve as samples of the market region and must represent:

  • Geographical position
  • Season (if multiple per year)
  • Soil type
  • Planting dates
  • Cultural practices: tillage, fertilizer, harvesting, irrigation, crop rotation, etc.

🎯 Field uniformity requirements

Uniform fields allow variation among test entries to be clearly observed.

Uniformity factors:

  • Topography
  • Soil type
  • Previous crop
  • Planting preparation
  • Access to moisture
  • Protection from disturbance

Design strategy: Confine non-uniform areas (e.g., low spots with standing water) to single blocks that can be discarded if necessary, rather than letting them affect the entire location.

📐 Block size considerations

  • Blocks that are too large lack uniformity and homogeneity within.
  • Incomplete block designs accommodate many entries in advanced yield trials while maintaining homogeneity.
  • These designs effectively partition environmental effects.

🧪 Experimental execution

🧪 Core principles

Appropriate experimental designs partition variation due to genotype, environment, and other sources.

Proper execution includes:

  • Randomization: guards against bias
  • Replication: captures natural variation among experimental units treated alike
  • Blocking: partitions variation due to environmental influences

📝 Data collection best practices

Data integrity must be safeguarded:

  • Prepare data collection forms before fieldwork, reflecting field layout
  • Staff appropriately: one person to evaluate, another to record
  • Have the same person rate entire trial when possible; if not, assign ratings to individual reps
  • Digital tools (e.g., Field Book app) can facilitate collection and limit entry mistakes

📊 Trait selection

Primary traits: All traits specified in the product target must be observed and recorded.

Additional traits worth collecting:

  • Traits that correlate with target traits
  • Traits easier or less expensive to measure
  • Traits measurable earlier in the crop life cycle (shortening breeding cycle)

Example: soybean seed yield positively correlates with number of pods, seeds per pod, fruiting period, and 100-seed weight.

🔄 Indirect selection

🔄 Concept and application

Indirect selection: Evaluation and selection based on a correlated trait (secondary trait) to improve the primary trait of interest.

To improve Character X, select for Character Y (the secondary trait).

📈 When indirect selection is advantageous

Indirect selection produces relatively greater genetic gain for Character X than direct selection when the secondary trait offers advantages in:

  • Measurement ease or cost
  • Heritability (preferably above 0.6)
  • Timing (earlier measurement possible)

🌾 Secondary trait examples

  • Canopy temperature as proxy for drought stress tolerance
  • Root mass under water-limited conditions for drought tolerance
  • Leaf senescence below the ear for low nitrogen tolerance in maize
  • Yield component characteristics (e.g., 100-kernel weight) for grain yield

✅ Choosing secondary traits

Consider traits that:

  • Correlate genetically with the trait of interest in the target environment
  • Have higher heritability than the trait of interest (less non-additive genetic variation)
  • Exhibit genetic variation
  • Are not associated with poor performance in non-stressed environments
  • Are easily, cheaply measurable (need a good screen)
  • Can be measured per-plant or per-plot basis, preferably non-destructively

📋 Ancillary traits

Other traits may be collected for:

  • Calculations (e.g., biomass for harvest index estimation)
  • Categorization and positioning of new cultivars in the marketplace (e.g., pubescence presence, seed coat color, growth type)

Key principle: Consider all data needs before implementing the trial.

🎯 Multiple-trait selection approaches

🔢 Tandem selection

  • Selection for one trait at a time sequentially
  • Selection thresholds and intensities for each trait are independent
  • Common practice for traits affecting adaptation or stress tolerances

Example applications:

  • Tropical maize populations selected for photoperiod sensitivity before yield
  • Elite materials selected for disease resistance before yield

Don't confuse: This is not simultaneous selection; each trait is addressed in separate steps.

✂️ Independent culling levels

  • Selection for more than one trait in a single step
  • Thresholds established for each trait
  • Only individuals meeting all trait thresholds are selected
  • Advantage: individuals can be culled based on a single trait without waiting for other trait observations

📊 Index selection

  • Selection for more than one trait simultaneously based on a single index value
  • Accounts for relative superiority across all traits included in the index
  • Usually a linear function with traits weighted by importance
  • Takes into account genetic correlation between trait pairs
  • Enables progress even with negatively correlated traits

🏆 Smith-Hazel Index

The original index form:

  • Index = sum of (weight × phenotypic value) for each trait
  • Index value calculated for each individual becomes the new selection variable
  • Individuals with highest index values are selected

Weight calculation accounts for:

  • Phenotypic covariances among traits
  • Genetic covariances among traits
  • Economic value of each trait

Note: Phenotypic and genetic covariance matrices may be difficult to estimate.

📈 Efficiency ranking

According to research, the order of effectiveness is: Selection index > Independent culling > Tandem selection

🧬 Testing regime structure

🧬 Progression characteristics

Using the soybean example, testing moves through distinct phases:

Early-generation testing (Summers 2-3):

  • Very large number of entries (70,000 → 5,000)
  • Small number of locations (1-5)
  • Unreplicated trials
  • High selection intensity (only ~7% advanced)

Intermediate testing (Summer 4):

  • Moderate number of entries (200)
  • Increased locations (15-25)
  • Lines meeting benchmarks coded as "experimental"

Advanced testing (Summers 5-6):

  • Small number of entries (experimental/advanced lines)
  • Maximum research sites (20-50 locations)
  • On-farm strip tests (20-500 locations)
  • Comprehensive market area coverage

⏱️ Timeline features

  • At least five years of comprehensive performance testing
  • Five seasons of research yield trials
  • Two seasons of wide-area testing
  • Off-season seed increase to save time
  • Seed retrieved from yield plots for next year's trials (self-pollinated crops)

🎯 Testing focus

Main focus in the example is grain yield:

  • No other trial types mentioned besides yield trials (though individual plant selection noted)
  • High grain yield is the utmost priority trait
  • Large number of breeding populations created
  • Selection intensities are high throughout

📊 Variance partitioning

📊 Phenotypic variation components

Total phenotypic variation (V_P) comprises:

  • V_G: genetic variation
  • V_E: environmental variation
  • V_GxE: genotype × environment interaction
  • V_n: variation among plants in an experimental unit
  • V_r: variation among replications
  • V_L: variation among locations
  • V_Y: variation among years

Goal: Elucidate genotypic variation by accounting for environmental variation and controlling error.

🔍 Addressing variation sources

The testing regime accomplishes this through:

  • Experimental units with multiple plants (especially for low-to-medium heritability traits like grain yield)
  • Multiple locations to sample increasing range of target market as testing progresses
  • Multiple years/seasons to ensure adequate sample of climatic conditions in market region

🌐 Genotype × Environment interaction

Testing regime allows assessment of GxE:

  • Ideally, new improved cultivar performs best or among the best across entire market region
  • Multiple locations and years reveal interaction patterns

🔬 Unreplicated trials strategy

🔬 Rationale for single-rep trials

Research shows that for a given number of observations per entry, maximum gains from selection come from:

  • Maximizing number of locations
  • At the cost of replication within locations

Key finding: Single-rep trials at more locations provide greatest opportunity for gains in grain yield when two or more locations are involved.

Example: Two locations with one rep each facilitate more genetic gain than one location with two replications.

🧮 Determining location numbers

To detect a particular Least Significant Difference (LSD), the breeder can calculate required locations using:

  • Desired probability level (e.g., α = 0.05)
  • Estimates of error variance from previous trials
  • Estimates of genotype × environment interaction variance from previous trials

The formula can be rearranged to solve for number of locations required to detect a specific yield increase (e.g., 0.3 t/ha) at the desired significance level.

📈 Variance estimation

Estimates of error variance and GxE variance are obtained from Expected Mean Square (EMS) terms in ANOVA from previous similar trials (similar size, market area, trait of interest).

🌍 Managing genotype × environment interaction

🌍 GxE patterns

Three main patterns exist:

PatternDescriptionSelection impact
ParallelLines maintain relative performance across environmentsRankings consistent
Non-parallelPerformance differences change in magnitude but not rankRankings mostly consistent
CrossoverRankings flip between environmentsSelection complicated

Crossover interaction is most problematic: the "best" genotype differs by environment, making it difficult to identify a single best genotype for the entire target market.

Note: Statistically significant GxE in ANOVA does not indicate whether rankings are affected; further probing required (e.g., mean comparisons by environment).

🛠️ Management strategies

Ignore it:

  • Test in wide range of environments
  • Select based on mean performance
  • Does not recognize best cultivars for specific environments

Reduce it:

  • Partition environments into smaller, more homogeneous regions
  • Make selections by region

Exploit it:

  • Partition environments into mega-environments
  • Identify cultivars best suited to specific environments or subsets
  • Consider "stability" across environments

📉 Stability concepts

Stability: Performance of a line relative to other lines across a range of environments representing the target market.

Measurement approaches:

  1. Mean performance across environments: Regression analysis calculates linear relationship between performance and environments for each genotype (slope b_i)

    • Slope = 0: constant performance across environments
    • Slope similar to mean: typical response
    • High slope: responds best to favorable environments but yields less under stress
  2. Deviations from regression: Excessive deviations suggest erratic response to environment range

🗺️ Mega-environment partitioning

  • Partition environments into homogeneous subgroups
  • Leads to multiple product targets (one per mega-environment)
  • Testing conducted accordingly
  • Facilitated through cluster analysis based on performance trial data across locations and years
  • Methods like GGE Biplot can determine whether target market comprises multiple mega-environments and assess genotype stability

🏆 Advanced testing and product identification

🏆 Advanced testing characteristics

Focuses on lines meeting or exceeding product target specifications in earlier testing (coded lines).

Features:

  • Maximal numbers of research testing sites
  • Farmer participation with larger plot sizes
  • Real-life field conditions in market region
  • Wider farmer exposure to potential new cultivars
  • More farmers gain experience and familiarization

🎯 Purpose

  • Confirm coded lines meet or exceed all product target specifications
  • Sample even wider range of environments from target market
  • Assemble information for potential product release and varietal registration
  • Ideally identify at least one coded line representing a new "best" for the target market

✅ Identifying potential new products

Candidate cultivars will:

  • Meet or exceed all product target specifications throughout entire testing regime
  • Not exhibit negative characteristics undesirable to farmers or value chain stakeholders
  • Show best performance overall across the market region

Expected outcome: According to the example, zero to five new cultivars may emerge as release candidates.

Success principle: Well-designed, fully integrated, and effectively implemented product pipeline increases probability of producing 1-5 potential new products.

11

Hybrid Vigor = Heterosis

Hybrid Vigor = Heterosis

🧭 Overview

🧠 One-sentence thesis

Heterosis (hybrid vigor) enables hybrids to outperform both parents, and exploiting it requires maintaining genetic diversity between distinct heterotic groups that show strong combining ability.

📌 Key points (3–5)

  • What heterosis is: the superior performance of a hybrid compared to either parent, quantified as a percentage above mid-parent or better-parent mean.
  • Genetic basis: heterosis is the opposite of inbreeding depression; dominant alleles mask deleterious recessives, though the full mechanism remains unclear.
  • Genetic diversity is necessary but not sufficient: two diverse lines do not always show heterosis, but lines that do show heterosis are always genetically diverse.
  • Common confusion: diversity alone does not guarantee heterosis—specific combining ability between particular pairs matters.
  • Heterotic patterns: organized pairs of heterotic groups (e.g., SSS × NSS in U.S. maize) that consistently produce high heterosis and must be preserved through breeding.

🌱 What heterosis is and how to measure it

🌱 Definition and observation

Heterosis (also called hybrid vigor): the superior performance of a hybrid compared to either parent.

  • The hybrid outperforms both Parent A and Parent B (see Fig. 2 in the excerpt).
  • This advantage must be recreated each generation—hybrid vigor is not permanent.
  • Example: if Parent A yields 5.0 t/ha and Parent B yields 4.4 t/ha, the hybrid might yield 11.5 t/ha.

📏 Quantifying heterosis

The excerpt provides two formulas (rewritten in words):

MeasureFormula (in words)Example from excerpt
Mid-parent heterosis(Hybrid performance minus mid-parent mean) divided by mid-parent mean, times 100%MP = (5.0 + 4.4)/2 = 4.7 t/ha; hybrid = 11.5 t/ha → heterosis = (11.5 − 4.7)/4.7 × 100%
Better-parent heterosis(Hybrid performance minus better parent) divided by better parent, times 100%Better parent = max(5.0, 4.4) = 5.0 t/ha; heterosis = (11.5 − 5.0)/5.0 × 100%
  • Mid-parent mean (MP) is the average of the two parents' performance.
  • Better parent (BP) is the higher-performing parent.
  • Both measures express heterosis as a percentage improvement.

🧬 Genetic basis: the opposite of inbreeding depression

🧬 Shull's observations and interpretation

George Harrison Shull laid the foundation for understanding heterosis through work in corn. He observed:

  • Open-pollinated corn varieties are complex mixtures of hybrids, each plant a different genotype.
  • Inbreeding reduced hybrid vigor; crossing restored it.
  • Hybrid vigor must be recreated each generation to get the benefit.
  • Inbred lines differ in the level of hybrid vigor they produce when crossed (combining ability).
  • Inbred progeny show more defects than their parents.

Shull's interpretation:

Heterosis is the opposite of inbreeding depression.

  • Inbreeding causes segregation and eventual homozygosis (becoming uniform for alleles), unmasking deleterious (harmful) alleles.
  • Heterosis results from parent lines compensating for each other's weaknesses.
  • Shull defined heterosis as "increased vigor, size, fruitfulness, speed of development, resistance to disease and insect pests or to climatic rigors" in crossbred organisms compared to inbreds, as the result of "unlikeness in the constitutions of uniting parental gametes."

🔬 Competing hypotheses: dominance vs overdominance

The excerpt describes two main genetic hypotheses:

HypothesisProposed byMechanism
Dominance HypothesisEugene Davenport (1908)Dominant alleles improve fitness by masking deleterious recessive alleles in the hybrid
Overdominance HypothesisEdward M. East (1908) and George H. Shull (1908)Certain allele combinations confer an advantage in the heterozygote due to over-expression of the gene

🧩 Current understanding: dominance is primary, epistasis plays a role

  • Sewall Wright (1922) showed that mean performance vs. heterozygosity should be linear unless linkage or epistasis (gene interactions) is involved.
  • Several studies found epistasis contributes to heterosis but accounts for only a small portion (~10% in one study cited).
  • Dominant gene action is considered the primary factor underpinning heterosis, though the detailed mechanism is still lacking.
  • Additivity (additive gene action) is also implicated.

Don't confuse: Dominance (one allele masks another) vs. overdominance (the heterozygote itself is superior to both homozygotes).

🌍 Heterotic patterns and groups

🌍 What is a heterotic pattern?

Heterotic pattern: a specific pair of heterotic groups whose crosses express high heterosis and high hybrid performance.

  • Genetic diversity between the two groups is necessary but not sufficient for heterosis.
  • Example from the excerpt: U.S. Yellow Dent corn, specifically the Reid Yellow Dent × Lancaster Sure Crop pattern, which became the basis for the predominant U.S. maize heterotic pattern: Stiff Stalk Synthetic (SSS) × non-Stiff Stalk (NSS).

🧑‍🤝‍🧑 What is a heterotic group?

Heterotic group: a group of related or unrelated individuals from the same or different populations, displaying similar combining abilities when crossed with genotypes from other germplasm groups.

  • Within a heterotic group, there may be a heterotic subgroup: individuals more genetically similar to one another than to the larger group.

🔑 Properties of a heterotic pattern

A heterotic pattern has these characteristics:

  1. Genetic diversity between the two heterotic groups

    • Diversity is necessary but not sufficient for heterosis.
    • Two genetically diverse lines do not necessarily display heterosis in hybrid combination.
    • However, two lines that do display heterosis in hybrid combination are always genetically diverse.
  2. General combining ability is expressed between inbreds from opposite heterotic groups.

    • General combining ability: average performance of a line when crossed with multiple lines from the opposite group.
  3. Specific combining ability is expressed between specific pairs of inbreds from different heterotic groups.

    • Specific combining ability: the performance of a particular pair that deviates from what general combining ability would predict.
    • Plant breeders exploit specific combining ability to identify the best hybrid combinations.

Common confusion: Diversity alone does not guarantee heterosis. The excerpt emphasizes that genetic diversity is necessary but not sufficient—specific combining ability between particular pairs is what breeders seek.

🛡️ Preserving genetic diversity

  • The heterotic pattern must be taken into account when choosing parents to create breeding populations.
  • This ensures that the genetic diversity between heterotic groups is maintained, preserving the basis for continued heterosis.

🌾 Historical context: development of heterotic patterns in maize

🌾 Richey's observation (1922)

Frederick D. Richey observed that:

  • Hybrids between maize varieties of different endosperm types resulted in higher yield than varieties of the same endosperm type.
  • This suggested that genetically or geographically distant parents exhibited increased heterosis in hybrid combination.

🌾 The U.S. maize heterotic pattern

  • Richey's observation led to the development of the Reid Yellow Dent × Lancaster Sure Crop pattern.
  • This became the basis for the predominant heterotic pattern in U.S. maize: Stiff Stalk Synthetic (SSS) × non-Stiff Stalk (NSS).
  • This pattern is the oldest and most famous heterotic pattern in U.S. Yellow Dent corn.

Why it matters: The excerpt states "we can learn a great deal from this important development"—establishing and maintaining heterotic patterns is foundational to hybrid cultivar development.

12

What is a Heterotic Pattern?

What is a Heterotic Pattern?

🧭 Overview

🧠 One-sentence thesis

Heterotic patterns—specific pairs of genetically diverse groups that produce high-performing hybrids—must be preserved by breeding within each group rather than between groups to maintain the genetic diversity that drives hybrid vigor.

📌 Key points (3–5)

  • What a heterotic pattern is: a specific pair of heterotic groups whose crosses express high heterosis and hybrid performance; genetic diversity is necessary but not sufficient.
  • How heterotic groups differ from patterns: a heterotic group is a collection of individuals with similar combining ability when crossed with other groups; a heterotic pattern is the pairing of two such groups.
  • Common confusion: genetic diversity alone does not guarantee heterosis—two diverse lines may not display heterosis, but two lines that do display heterosis are always genetically diverse.
  • Why breeding stays within groups: to preserve heterosis between heterotic groups, breeding crosses are made within a heterotic group, not between groups.
  • How to improve hybrids: cross a parent with another line from the same heterotic group, then test progeny in combination with a tester from the opposite group.

🌽 Historical foundation and core definitions

🌽 The U.S. Yellow Dent corn pattern

  • The oldest and most famous heterotic pattern is U.S. Yellow Dent corn.
  • In 1922, Frederick D. Richey observed that hybrids between maize varieties of different endosperm types yielded more than varieties of the same endosperm type.
  • This suggested that genetically or geographically distant parents exhibited increased heterosis in hybrid combination.
  • Led to the Reid Yellow Dent × Lancaster Sure Crop pattern, which became the basis for the predominant U.S. maize heterotic pattern: Stiff Stalk Synthetic (SSS) × non-Stiff Stalk (NSS).

📖 What a heterotic pattern is

Heterotic pattern: a specific pair of heterotic groups whose crosses express high heterosis (and hybrid performance).

  • Genetic diversity is necessary but not sufficient.
  • Example: SSS × NSS in U.S. maize is a heterotic pattern.

📖 What a heterotic group is

Heterotic group: a group of related or unrelated individuals from the same or different populations, displaying similar combining abilities when crossed with genotypes from other germplasm groups.

  • Within a heterotic group, there may be subgroups of individuals more genetically similar to one another than to the group at large.
  • Such a subgroup is called a heterotic subgroup.
  • Example: within the SSS heterotic group, B14 and B73 are heterotic subgroups.

🔑 Properties and principles of heterotic patterns

🔑 Genetic diversity: necessary but not sufficient

  • Genetic diversity between the two heterotic groups is required.
  • Diversity is necessary but not sufficient for expression of heterosis.
  • That is:
    • Two genetically diverse lines do not necessarily display heterosis in hybrid combination.
    • Two lines that do display heterosis in hybrid combination are always genetically diverse.
  • Don't confuse: diversity ≠ automatic heterosis; heterosis → diversity is always true, but diversity → heterosis is not guaranteed.

🔑 General and specific combining ability

  • General combining ability is expressed between inbreds from opposite heterotic groups.
  • Specific combining ability is expressed between specific pairs of inbreds from different heterotic groups.
  • Plant breeders exploit specific combining ability.

🔑 Preserving heterosis by breeding within groups

  • The heterotic pattern must be taken into account when choosing parents to create breeding populations.
  • Principles still hold: choose parents with best performance for key traits and genetically diverse sources of favorable alleles.
  • Critical rule: with hybrid cultivars, preserve heterosis between heterotic groups by making breeding crosses within a heterotic group, not between heterotic groups.
  • Example: to improve a hybrid with one parent from SSS and one from NSS, cross the SSS parent with another SSS line, and cross the NSS parent with another NSS line.

🛠️ Improving a specific single-cross hybrid

🛠️ Improving one parent

  • Consider a specific single-cross corn hybrid with parents from heterotic group 1 and heterotic group 2.
  • To improve one parent (say, from group 1):
    1. Cross it to a line from the same heterotic group that represents a source of new favorable alleles.
    2. Evaluate progeny of this breeding cross for their performance in hybrid combination with a line from the opposite heterotic group (called a tester).
    3. Make selections.
    4. A superior progeny advanced to homozygosity becomes a new inbred, which when combined with the other parent results in an improved hybrid that outperforms the original.

🛠️ Improving both parents

  • In practice, breeding crosses are typically made within each heterotic group to improve both sides of the pedigree.
  • Progeny from each cross are evaluated with respect to performance in hybrid combination with a tester(s) from the opposite heterotic group, especially for traits involving non-additive gene action.
  • The testers can be the parents of the original hybrid or other prominent lines from the opposite heterotic group.

🛠️ Using a prospective source of new favorable alleles

  • Consider a source of favorable alleles for key traits to improve a hybrid.
  • The source could be crossed to one of the parents to create families that can be evaluated for their performance in hybrid combination with the other parent.
  • A new superior inbred resulting from this cross could be used in combination with the other parent to produce an improved hybrid.
  • Alternatively, the source could be crossed to the other parent to create F₂ families that can be evaluated for their performance in hybrid combination.
  • Decision rule: cross the source to the inbred to which it is more related (more genetically similar) to create a breeding population, thus preserving heterosis between the two heterotic groups.
  • Example: if the source is more genetically similar to parent A than to parent B, cross it to A to preserve heterosis with B.

🧬 Assigning and establishing heterotic groups

🧬 Assigning new germplasm to a heterotic group

Assuming a particular heterotic pattern has been recognized and utilized, new potential breeding materials can be assigned to a relevant heterotic group through:

MethodWhat it involves
Pedigree analysisTracing genetic relationships
Geographic inferenceUsing origin/location information
Genetic similarityMolecular marker profile using cluster analysis
Measurement of heterosisDirect testing of hybrid performance
Combining ability analysisTypically conducted using diallel, partial diallel, or NC Design II mating designs

🧬 Establishing a heterotic pattern from scratch

  • Separate heterotic groups are essential for the development of hybrid cultivars.
  • If heterotic groups have not yet been defined, there are approaches that can provide guidance on a starting point:
    1. Use a diallel mating design to measure heterosis between pairs of lines.
    2. Heterotic grouping can be arbitrarily assigned to maximize heterosis between heterotic groups.
    3. Supplement phenotypic data with cluster analysis of a larger set of lines based on molecular marker profile.
    4. Cluster analysis identifies genetically similar materials, with the assumption that genetically similar lines will perform similarly in hybrid combinations.
    5. Lines that showed heterosis in hybrid combination serve to anchor the groupings (clusters) and indicate heterotic group assignments as well as heterotic subgroup membership.

🧬 Reinforcing and strengthening the pattern

  • The newly established heterotic pattern can be reinforced and strengthened through breeding methods such as reciprocal recurrent selection.
  • One of the heterotic groups will be chosen to serve as the pool of female hybrid parents and the other as the male.
  • Selection within a heterotic group will also focus on reproductive features important to its role in hybridization:
    • Lines in the female pool will likely be improved for seed set.
    • Lines in the male pool will be improved for traits associated with pollen shed.

🎯 Choosing parents within a heterotic group

🎯 Maximizing genetic diversity within the breeding population

  • In developing improved hybrid cultivars, New Line Development takes place within the context of each heterotic group.
  • Parents for breeding populations are chosen within, not between, heterotic groups.
  • To maximize genetic diversity within the breeding population, parents may be chosen from different heterotic subgroups within the heterotic group.
  • Example: the choice of parents for a breeding population to improve the female side of the pedigree in U.S. maize may involve a line from the B14 heterotic subgroup and a line from the B73 subgroup (both within SSS).

🎯 Evaluating prospective parents

  • Prospective parents within a heterotic group can be evaluated on the basis of estimated breeding value (EBV) in hybrid combination with members of the opposite heterotic group for traits involving heterosis.
  • Whereas EBVs for pure line varieties consider the merit of lines per se, EBVs for prospective parents of improved inbreds for hybrid cultivars take complementarity into account.
  • Don't confuse: breeding value for pure lines (standalone merit) vs. breeding value for hybrid parents (complementarity with the opposite group).
13

Classes of Loci

Classes of Loci

🧭 Overview

🧠 One-sentence thesis

Dudley's classification of loci into eight classes helps breeders identify which prospective parents will introduce favorable alleles (Class G) while minimizing the risk of losing favorable alleles (Classes D and F) when improving hybrid cultivars.

📌 Key points

  • Eight classes of loci: any prospective parent I_W can be classified at each locus relative to the two current hybrid parents (I_1 and I_2) based on which alleles are present.
  • Class G is imperative: these loci carry new favorable alleles not present in either current parent, making them essential for improvement.
  • Potential loss matters: crossing I_W with I_1 risks losing favorable alleles at Class D loci; crossing with I_2 risks loss at Class F loci.
  • Common confusion: not all classes matter equally—Classes A and H are invariant (no new alleles), and under complete dominance, Classes C, D, E, or F may not need improvement depending on which parent is being improved.
  • Methodology guides decisions: Dudley's method evaluates which prospective parent has the highest probability of producing an improved hybrid and whether to improve I_1 or I_2.

🧬 The eight classes of loci

🧬 How loci are classified

Classes of loci: eight categories that describe the allele status at each locus across three inbred lines—I_1, I_2 (the current hybrid parents), and I_W (a prospective new parent).

  • Because all lines are inbred (homozygous), each locus has only one allele per line.
  • "+" indicates the dominant (favorable) allele; "–" indicates the recessive allele.
  • The classification captures which lines carry favorable alleles and which do not.

📋 The eight classes

ClassI_1I_2I_WMeaning
A++++++All three lines have the favorable allele
B++++––Both current parents have favorable; I_W does not
C++––++I_1 and I_W have favorable; I_2 does not
D++––––Only I_1 has favorable
E––++++I_2 and I_W have favorable; I_1 does not
F––++––Only I_2 has favorable
G––––++Only I_W has favorable (new alleles)
H––––––No line has the favorable allele
  • Classes A and H are invariant: no new favorable alleles are introduced by I_W.
  • Class G is imperative: I_W brings new favorable alleles not present in the current hybrid.

🎯 Which classes matter for improvement

🎯 Classes that don't need improvement

  • Classes A and H: all lines are the same, so I_W adds nothing new.
  • Under complete dominance (a=1):
    • If improving I_2, no need to worry about Class C or D (I_1 already has favorable alleles that will dominate).
    • If improving I_1, no need to worry about Class E or F (I_2 already has favorable alleles that will dominate).

Don't confuse: dominance level affects which classes are important; incomplete dominance changes the picture.

🌟 Class G: the key to improvement

  • Class G loci: I_W has favorable alleles that neither I_1 nor I_2 possess.
  • These are the loci where I_W can introduce genetic gain.
  • The excerpt states Class G is "imperative" for bringing new favorable alleles into the hybrid.

Example: If I_1 and I_2 both lack a favorable allele for drought tolerance, but I_W has it, that locus is Class G and represents a potential improvement.

⚖️ Potential gain vs. potential loss

⚖️ Improving I_1 by crossing with I_W

When I_W is crossed to I_1, the resulting progeny (which will eventually be crossed with I_2 to form the new hybrid H_New) show:

ClassStatus in (I_1 × I_W)Status in I_2Outcome
D+– (heterozygous)––Potential loss: I_1 had favorable allele, now diluted
G+– (heterozygous)––Potential gain: new favorable allele introduced
  • Risk at Class D: I_1 originally had the favorable allele, but crossing with I_W (which lacks it) creates heterozygotes, risking loss of the favorable allele in later generations.
  • Opportunity at Class G: I_W introduces new favorable alleles.

⚖️ Improving I_2 by crossing with I_W

When I_W is crossed to I_2:

ClassStatus in I_1Status in (I_2 × I_W)Outcome
F––+– (heterozygous)Potential loss: I_2 had favorable allele, now diluted
G––+– (heterozygous)Potential gain: new favorable allele introduced
  • Risk at Class F: I_2 originally had the favorable allele, but crossing with I_W (which lacks it) creates heterozygotes, risking loss.
  • Opportunity at Class G: same as above—new favorable alleles.

🧮 The trade-off

"Clearly, I_W needs to be a source of new favorable alleles (Class G), but the potential loss through Class D or F must be considered as well."

  • A good prospective parent has many Class G loci (new favorable alleles) and few Class D or F loci (where crossing would dilute existing favorable alleles).
  • Dudley's methodology evaluates this trade-off to determine:
    • Which prospective parent has the highest probability of producing an improved hybrid.
    • Which current parent (I_1 or I_2) should be improved.
    • Whether to create F_2 or BC_1 (backcross) families as a starting point.

🧩 General principles and context

🧩 Choosing parents for hybrid cultivars

The excerpt emphasizes that general principles remain the same as for pure-line varieties:

  • High frequency of favorable alleles for traits in the product target.
  • Diversity of favorable alleles between parents to create genetic variability in the breeding population.

🔗 Preserving heterosis

  • Breeding crosses are made within heterotic groups (not between them).
  • Progeny are evaluated for testcross performance to build complementarity between heterotic groups.
  • Goal: improved hybrids with strong specific combining ability.

Don't confuse: parents for breeding populations are chosen within a heterotic group, but the final hybrid is formed by crossing lines from different heterotic groups.

🌾 Context: heterotic groups and subgroups

  • To maximize diversity, parents may come from different heterotic subgroups within the same heterotic group.
  • Example from the excerpt: in U.S. maize, a breeding population for the female side might involve a line from the B14 subgroup and a line from the B73 subgroup.
  • Prospective parents are evaluated based on estimated breeding value (EBV) in hybrid combination with the opposite heterotic group, taking complementarity into account (not just line merit alone).
14

General Principles of Choosing Parents

General Principles of Choosing Parents

🧭 Overview

🧠 One-sentence thesis

Choosing effective testers and selection methods is critical to maximizing genetic improvement in hybrid breeding programs, with the ideal tester maximizing differences among genotypes while balancing discrimination power and practical performance.

📌 Key points (3–5)

  • Recurrent selection systematically improves breeding populations by evaluating, selecting, and intermating lines across cycles to increase favorable allele frequencies.
  • Reciprocal recurrent selection simultaneously improves two populations (one from each heterotic group) in a complementary fashion, with selection based on general combining ability (GCA).
  • Tester choice fundamentals: an ideal tester maximizes variance among testcrosses to discriminate among genotypes, with poor-performing testers (fixed for recessive alleles) often providing greater discrimination despite lower mean performance.
  • Common confusion: GCA vs SCA—GCA measures variation among progeny with one common parent (half-sib families), while SCA involves variation from female × male line interaction (specific combinations).
  • Elite vs poor testers trade-off: poor-performing testers maximize testcross variance but have low mean performance; elite testers have high mean performance but may mask differences among progeny.

🔄 Population Improvement Methods

🌱 Recurrent selection basics

Recurrent selection: the systematic development and improvement of a breeding population through cycles of evaluation, selection, and intermating.

  • How it works: Individuals are evaluated for traits of interest → selected lines are intermated → next cycle begins.
  • Purpose: increase the frequency of favorable alleles within a heterotic group when base germplasm is not elite for yield and other non-additive traits.
  • Selection can be based on:
    • Individual plant basis
    • Family structure (half-sib families, full-sib families, S₁ families)
  • Choice depends on the heritability of the trait(s) being improved.

🎯 Using improved populations

  • The improved population serves as a source of new inbreds.
  • At any cycle: top-performing lines can be:
    • Spun out for use as parents of improved hybrid cultivars, and/or
    • Used as parents to develop breeding populations (e.g., implementing pedigree breeding).

📚 Historical example: Iowa Stiff Stalk Synthetic

  • Developed in 1933-34 with 16 lines chosen for stalk strength.
  • Lines were intermated to form a population that was random mated for an unknown number of generations.
  • Important female inbreds spun out:
    • B14 from Cycle 0
    • B37 from Cycle 0
    • B73 from Cycle 5
    • B84 from Cycle 7
  • Became the basis for the female side of the pedigree in U.S. commercial maize germplasm (Stiff Stalk Synthetic, SSS), with prominence still noted today.

🔁 Reciprocal Recurrent Selection

🤝 What it improves

Reciprocal recurrent selection: a method to simultaneously improve two populations (one from each heterotic group), boosting the frequency of favorable alleles in a complementary fashion.

  • Each population is improved with respect to the other.
  • Used specifically for hybrid cultivars.
  • Any selection scheme based on testcross performance with the opposite heterotic group and applied to both sides of the pedigree can be considered a form of reciprocal recurrent selection.

🔄 Three-step cycle process

Season 1 (Crossing):

  • A plant in Population 1 (P1) is selfed (to produce S₁ seed) and crossed to several random plants in Population 2 (P2).
  • Seeds harvested from the P2 plants are bulked to form a P1 half-sib family.
  • The self and cross procedure is also done in P2.

Season 2 (Evaluation):

  • The half-sib families in P1 (crossed to P2) and P2 (crossed to P1) are evaluated in performance tests.

Season 3 (Recombination):

  • The selfed seeds from the best plants in P1 are grown and the plants are intercrossed to form the next cycle.
  • The selfed seeds from the best plants in P2 are likewise used in recombination separately from P1.

🧬 Selection basis: GCA

  • Selection within each population is based on general combining ability (GCA).
  • Variance associated with GCA measures variation among progeny with one common parent (i.e., half-sib families).

🧪 Combining Ability Concepts

🔍 GCA vs SCA distinction

TypeWhat it measuresFamily structure
GCA (General Combining Ability)Variation among progeny with one common parentHalf-sib families
SCA (Specific Combining Ability)Variation associated with female × male line interactionSpecific paired combinations
  • Don't confuse: GCA reflects average performance across many crosses (one parent in common); SCA reflects performance of a specific pairing (both parents matter).
  • Lines representing newly improved inbreds may be selected based on SCA.

📐 Mating designs to measure GCA and SCA

Complete diallel:

Complete diallel: every possible paired combination of lines.

  • Reciprocal crosses and parent lines may or may not be included in phenotypic evaluation.
  • To minimize crosses needed for large numbers of lines, parents may be assigned to sets (essentially mini-diallels).
  • Uses:
    • Provides estimates of GCA and SCA
    • Computes estimates of heterosis
    • Can establish a new heterotic pattern (lines showing greatest heterosis are assigned to opposite heterotic groups)

NC Design II:

  • Another mating design useful for obtaining estimates of GCA and SCA effects.

🎯 Tester Selection Principles

🔑 Why tester choice matters

  • A testing system based on hybrid performance is vital to developing improved hybrid cultivars.
  • The improved cultivar will exhibit its hybrid performance to farmers and other stakeholders.
  • Sufficient, accurate data on hybrid performance is essential to making a correct decision about product launch.
  • Challenge: find a tester that provides discrimination among progeny in a breeding population in keeping with the purposes of selection.

📏 Definition of a desirable tester

A desirable tester (Matzinger, 1953): one that combines the greatest simplicity of use with the maximum information on the performance to be expected from tested lines when used in other hybrid combinations or grown in other environments.

  • Testcross evaluation is used to assess combining ability and estimate breeding values.

⚖️ Requirements of an ideal tester

An ideal tester maximizes differences among the genotypes being tested—that is, maximizes the variance among testcrosses.

Testcross variance is a function of:

  • Allele frequencies in the population (p and q)
  • Allele frequencies in the tester (p_T and q_T)
  • Level of inbreeding in the population (F)
  • Levels of dominance (d)
  • The value a (half the difference between genotypic values of the two homozygotes, i.e., additive effect) across loci affecting trait performance

Key insight: For any level of dominance, the quantity is maximized when q_T = 1 (the tester is fixed for the recessive allele at most underlying loci).

  • The recessive allele from the tester does not mask the effect of a favorable, dominant allele contributed by the progeny.
  • However, this suggests poor performance per se of the tester line.
  • If d = 0 (no dominance), then the tester has no effect.
  • Most loci involved with heterosis expression are expected to have some level of dominant gene action.

📊 Empirical Tester Comparison

🌽 Five tester types evaluated

Study compared five types of testers in maize for assessing lines from Iowa Stiff Stalk Synthetic (BSSS) at early (S₁) and late (S₈) stages of inbreeding, for grain yield (a highly heterotic trait):

TesterDescriptionS₁ variance (t/ha)S₈ variance (t/ha)S₁ mean (t/ha)S₈ mean (t/ha)
BSSSPopulation itself0.180.425.795.69
BC13(S)C1Improved BSSS population0.110.346.956.81
BSSS-222Poor BSSS inbred0.220.396.035.89
B73Elite BSSS inbred0.040.267.297.21
Mo17Elite non-BSSS inbred0.260.307.817.78

🔬 Key conclusions from the comparison

Inbreeding level effect:

  • More differentiation among progeny was possible with S₈ vs S₁ testcrosses (i.e., greater F in the progeny highlighted the allele frequencies).
  • Performance levels remained consistent across inbreeding stages.

Tester performance trade-off:

  • Poor performing testers generally led to higher variance (e.g., BSSS-222 had higher variance than B73).
  • Elite testers (B73, Mo17) had the highest mean performance but often lower variance at early stages.
  • Example: B73 (elite BSSS inbred) had the lowest S₁ variance (0.04) but highest mean performance (7.29 t/ha) among same-heterotic-group testers.
  • Mo17 (elite non-BSSS inbred) provided both high variance (0.26 at S₁) and highest mean performance (7.81 t/ha), illustrating the value of opposite-heterotic-group testers.

Don't confuse: High mean performance of a tester with its ability to discriminate—elite testers may mask differences among progeny despite producing higher-yielding testcrosses overall.

15

GCA and SCA

GCA and SCA

🧭 Overview

🧠 One-sentence thesis

An elite inbred tester from the complementary heterotic group is ideal because it maximizes testcross variance while also delivering high mean performance and revealing potential new hybrid cultivars.

📌 Key points (3–5)

  • Tester choice affects variance and mean: poor-performing testers can increase testcross variance, but elite testers from the opposite heterotic group maximize variance and mean performance.
  • How testers reveal differences: a tester fixed for recessive alleles at most loci does not mask favorable dominant alleles from the progeny, highlighting genetic differences.
  • Common confusion—poor vs elite tester: both can maximize variance, but only the elite tester from the complementary heterotic group contributes favorable alleles at loci where progeny are depleted, raising mean performance and serving as a potential hybrid parent.
  • Why it matters: using an elite tester from the opposite heterotic group optimizes selection response and simultaneously identifies top hybrid combinations for new cultivars.

🧬 How testers work at the genetic level

🧬 Tester allele frequency and dominance

The quantity (testcross variance) is maximized when the tester is fixed for the recessive allele at most underlying loci.

  • A tester with recessive alleles does not mask the effect of a favorable, dominant allele contributed by the progeny.
  • This setup reveals differences among progeny lines at loci with dominant gene action.
  • Trade-off: a tester fixed for recessive alleles suggests poor performance per se of the tester line.
  • If dominance (d) = 0, the tester has no effect; most loci involved in heterosis are expected to show some level of dominant gene action.

🔍 Classes of loci and what each tester reveals

The excerpt presents four classes of loci in S₈ progeny (Table 5):

ClassInbred 1Inbred 2Poor TesterElite TesterWhat it means
i++++++ or —Progeny already fixed for favorable allele; elite tester may contribute but doesn't improve performance
j++Progeny segregating; neither tester contributes
k++Progeny segregating; neither tester contributes
l++Progeny depleted; elite tester contributes favorable allele
  • Poor tester: has recessive alleles at most or all classes of loci.
  • Elite tester from opposite heterotic group: homozygous for the dominant favorable allele at Class 'l' loci (where progeny are depleted).
  • Progeny segregate for Classes 'j' and 'k' only; these loci affect testcross variance.
  • Since neither tester contributes to 'j' and 'k', variance is the same with both testers.
  • Key difference: the elite tester contributes at Class 'l' loci, increasing mean performance (μ_T).

Example: If progeny lack favorable alleles at certain loci, the elite tester supplies them, boosting testcross yield without reducing the ability to detect genetic differences.

📊 Comparison of tester types in practice

📊 Five testers evaluated in maize

Hallauer and Lopez-Perez (1979) compared five testers in combination with Iowa Stiff Stalk Synthetic (BSSS) at early (S₁) and late (S₈) inbreeding stages for grain yield (Table 4):

TesterDescriptionS₁ varianceS₈ varianceS₁ mean (t/ha)S₈ mean (t/ha)
BSSSPopulation itself0.180.425.795.69
BC13(S)C1Improved BSSS population0.110.346.956.81
BSSS-222Poor BSSS inbred0.220.396.035.89
B73Elite BSSS inbred0.040.267.297.21
Mo17Elite non-BSSS inbred0.260.307.817.78

🔑 Key findings from the comparison

  • More differentiation with advanced inbreeding: S₈ testcrosses showed greater variance than S₁ testcrosses, highlighting allele frequency differences despite consistent performance levels.
  • Poor testers increase variance: poor-performing testers generally led to higher variance (e.g., BSSS-222 vs B73; BSSS vs BC13(S)C1).
  • Elite opposite-group tester maximizes variance in early testing: Mo17 (elite line from the opposite heterotic group) maximized variance in early testing, highlighting differences among BSSS lines at loci where Mo17 is depleted of favorable alleles.
  • Elite tester also provides best mean performance: Mo17 delivered the best mean yield in both early and late testing—a tremendous advantage because it can serve as a potential parent of an improved hybrid.

Don't confuse: All testers except Mo17 are related to BSSS; Mo17 is from the opposite heterotic group, which is why it reveals different genetic information.

🎯 Poor vs elite tester strategy

🎯 Two types of testers that maximize variance

  1. Poor-performing tester: presumably due to a low frequency of favorable alleles.
  2. Elite inbred tester from the opposite heterotic group: especially lines that could potentially serve as parents of an improved hybrid.

✅ Why the elite tester is superior

  • With the elite tester option, yield trials evaluate testcross performance and identify the best lines for one side of the pedigree.
  • The trials also capitalize on specific combining ability (SCA) to identify top-performing hybrid combinations as potential new cultivars.
  • A poor tester cannot be considered as a potential parent of an improved hybrid; an elite line from the opposite heterotic group can.

Example: Using Mo17 as a tester not only ranks BSSS lines by performance but also reveals which BSSS × Mo17 hybrids are the best candidates for release.

🏆 Take-home message

Bottom line: an elite inbred(s) from the complementary heterotic group is an ideal tester.

  • All progeny are compared on the same basis using a common tester or a common set of multiple testers.
  • It highlights real genetic differences among progeny under evaluation, optimizing the response to selection.
  • In addition, potential new single-cross hybrid cultivars may be detected in the evaluation process.

🌾 Producing testcross seed

🌾 Topcross mating design

  • Carried out in an isolated field.
  • Families of the breeding population are grown in separate rows with the male flower removed before pollen shed.
  • The tester is grown in rows placed intermittently through the field and serves as the pollen parent.
  • Pollen is transmitted to progeny rows through wind, insects, or other means.
  • Field isolation prevents stray pollen from contaminating the testcross seed.

🌾 Paired-row crosses

  • Each progeny (or progeny family) is represented in one row, with the tester grown in the adjacent row.
  • Silks are protected to prevent inadvertent pollen ahead of hand-pollination.
  • Pollen is collected and distributed manually (hand pollination).
  • Field isolation is not required.
  • Seed produced on plants in either of the paired rows can be bulked to represent the testcross seed associated with that progeny.

🌽 Product target example: maize single-cross hybrid

🌽 Target market and environment

  • Target market region: dryland production regions of western South Africa.
  • Product target environment: environments representing geographic location, season, maturity zone, altitude, soil types, and farmer practices.

🌽 Implied characteristics

The product target implies the following traits:

  • White grain
  • High grain yield
  • Tolerance to moisture stress levels common to non-irrigated production
  • Good standability (minimal/no stalk lodging or root lodging)
  • Disease package for this region:
    • Resistance to grey leaf spot (Cercospora zeae-maydis and Cercospora zeina)
    • Resistance to common rust (Puccinia sorghi)
    • Resistance to Diplodia ear rot (Stenocarpella maydis)
  • Medium maturity
  • Bt event (transgenic event imparting resistance to stalk borers)
  • RR event (transgenic event imparting tolerance to glyphosate herbicide such as Roundup®)

🌱 Secondary trait: anthesis-silking interval (ASI)

ASI is the number of days between the date when 50% of plants are shedding pollen and the date when 50% of plants are showing emerged silks.

  • What it indicates: ASI is an external indicator of the metabolics underlying stress tolerance.
  • Why it matters: large ASI in maize is a sign of reduced partitioning to the ear, resulting in slow spikelet growth which affects kernel number.
  • ASI is highly correlated with barrenness, which (like kernel number) is a contributing factor to grain yield.
  • ASI is a clear indicator of stress at flowering, the most vulnerable time for yield loss due to drought under "occasional drought."
  • ASI signals not only moisture stress but also other abiotic stresses such as nitrogen deficiency.

Example: Yield evaluations will be conducted in the target market region, providing an indication of response to any level of drought encountered; ASI serves as an additional indicator of tolerance to moisture stress.

16

A Word About Producing Testcross Seed

A Word About Producing Testcross Seed

🧭 Overview

🧠 One-sentence thesis

Hybrid seed production requires cost-effective male sterility systems to ensure controlled pollination on a large scale, preventing self-pollination of the female inbred and guaranteeing that the hybrid seed results from the intended cross between elite female and male inbreds.

📌 Key points (3–5)

  • Why male sterility matters: controlled pollination on a large scale is essential to ensure hybrid seed is a product of the intended cross, especially preventing self-pollination by the female inbred in wind-pollinated crops like corn.
  • How production fields are arranged: predetermined ratios of female inbred (seed parent) rows to male inbred (pollen parent) rows (e.g., 4:1, 4:2, 6:2), grown in isolation to avoid foreign pollen.
  • Physical vs genetic control: pollination can be controlled physically (dioecious species, manual/mechanical removal of male flowers like detasseling) or through male sterility systems in the plant.
  • Types of male sterility systems: genic MS (nuclear genes), environmentally-induced MS, cytoplasmic MS, transgenic MS, and chemically-induced MS.
  • Common confusion: genic MS vs cytoplasmic MS—genic MS is controlled by nuclear genes (often recessive alleles), while CMS involves cytoplasmic factors that prevent anther exertion and pollen shed.

🌾 Why male sterility is needed in hybrid production

🎯 The core problem

  • In wind-pollinated crops like corn, controlled pollination on a large scale is essential.
  • The goal: ensure that hybrid seed being produced for the farmer is a product of the intended cross between the elite female inbred (from the female heterotic group) and elite male inbred (from the male heterotic group).
  • Prevention of self-pollination by the female inbred plants maximizes the probability that the pollen parent is the desired male inbred.

🏗️ How production fields are set up

Hybrid production fields: arranged to include a predetermined ratio of female inbred (seed parent) rows to male inbred (pollen parent) rows.

  • Common ratios: 4:1, 4:2, 6:2, depending on the pollen dissemination range.
  • Fields are grown in isolation to avoid the intrusion of foreign pollen.
  • Example: In a 4:2 ratio field, four rows of female inbred plants are planted for every two rows of male inbred plants, ensuring adequate pollen supply while maximizing seed production from female rows.

🛠️ Physical methods of pollination control

🌿 Separate male and female plants

  • The crop species is dioecious (separate female/male plants).
  • This naturally prevents self-pollination but is not applicable to all crop species.

✋ Manual or mechanical removal

Detasseling: physical removal of the male flower of the maize plant.

  • In corn hybrid seed production, detasseling requires a large short-term labor force.
  • Can be done manually (hand detasseling as a quality check) or mechanically.
  • This is labor-intensive and costly, making genetic male sterility systems more attractive.

🧬 Genetic male sterility systems

🧬 Genic male sterility (MS)

Genic MS: male sterility controlled by nuclear genes, often determined by a recessive allele at a single locus.

  • First reported by L.A. Eyster (1921) in maize.
  • Today nearly all identified nuclear MS genes in maize have been mapped.
  • MS can also be affected by mutation.

Examples:

  • The ms45 allele in maize: this recessive allele in a homozygous state results in MS, whereas fertility is restored by a single copy of the dominant allele, Ms45.
  • The suppressor-mutator (Spm) transposable element system.

🌡️ Environmentally-induced male sterility

Environmentally-induced MS: some nuclear MS genes can be induced under certain environmental conditions.

  • Thermo-sensitive and photoperiod-sensitive nuclear genes have been used in rice to facilitate hybrid production.
  • This enables a two-line system (male-sterile female line, restorer line):
    • MS female inbred lines can be maintained through self-pollination in environments not conducive to MS induction.
    • Hybrid production is carried out under conditions that promote MS.

Limitations:

  • Weather conditions can be unpredictable and inconsistent.
  • Temperature fluctuations may cause "leaks," where some viable pollen is produced, or even complete reversal to a fertile state.
  • Don't confuse: this is still nuclear gene-based, but expression depends on environmental triggers, unlike constitutive genic MS.

🔬 Cytoplasmic male sterility (CMS)

Cytoplasmic male sterility (CMS): male sterility involving cytoplasmic factors that prevent anther exertion and pollen shed.

  • Compared to the normal maize inbred, the CMS-version of inbreds does not exert anthers and therefore does not shed pollen.
  • This trait is exploited in hybrid seed production.
  • Example: Normal tassel shows exerted anthers and pollen shed; CMS-version tassel has no anthers exerted or pollen shed.

How to distinguish from genic MS:

FeatureGenic MSCytoplasmic MS
Genetic basisNuclear genes (often recessive alleles)Cytoplasmic factors
InheritanceMendelian (follows nuclear gene segregation)Maternally inherited (cytoplasmic)
Anther behaviorVaries by specific geneNo anther exertion, no pollen shed

🧪 Other male sterility systems

The excerpt mentions but does not detail:

  • Transgenic MS: male sterility achieved through genetic engineering.
  • Chemically-induced MS: male sterility induced by chemical application.

🔄 Comparison of pollination control methods

MethodMechanismAdvantagesLimitations
Physical (detasseling)Manual/mechanical removal of male flowersDirect control, no genetic modification neededLabor-intensive, costly, requires large short-term labor force
Genic MSNuclear genes (often recessive alleles)Genetic control, mapped genes availableRequires breeding to maintain sterile lines
Environmentally-induced MSNuclear genes triggered by environmentTwo-line system, can maintain lines by self-pollinationWeather unpredictability, risk of "leaks" or fertility reversal
Cytoplasmic MSCytoplasmic factorsNo anther exertion, reliable sterilityRequires specific cytoplasm, maternally inherited
17

Example Commercial Maize Hybrid Improvement Program

Example Commercial Maize Hybrid Improvement Program

🧭 Overview

🧠 One-sentence thesis

Male sterility systems enable efficient hybrid seed production by preventing self-pollination in the female parent without requiring manual detasseling, and the choice of system fundamentally shapes the breeding program design.

📌 Key points (3–5)

  • Why male sterility matters: prevents selfed seed in hybrid production, eliminating or reducing the need for costly hand detasseling.
  • Multiple system types exist: genic, environmentally-induced, cytoplasmic (CMS), transgenic, and chemically-induced male sterility, each with different genetic mechanisms and trade-offs.
  • Common confusion—two-line vs three-line systems: two-line systems (e.g., environmentally-induced MS) use only a male-sterile line and a restorer line, while three-line systems (e.g., CMS) require a male-sterile line, a maintainer line, and a restorer line.
  • MS system elements must be bred into lines: although hybrid production happens downstream, breeders must integrate MS system components during new line development, affecting the entire product pipeline.
  • Historical lesson: the 1970 U.S. corn crop disaster from T-CMS susceptibility to Southern corn leaf blight shows that CMS system choice must consider potential negative impacts on hybrid performance.

🧬 Types of male sterility systems

🧬 Genic male sterility

Genic MS: male sterility controlled by nuclear genes, often determined by a recessive allele at a single locus.

  • First reported by L.A. Eyster in 1921 in maize; nearly all identified nuclear MS genes in maize have been mapped.
  • How it works: a recessive allele in homozygous state causes sterility; a single copy of the dominant allele restores fertility.
  • Example: the ms45 allele in maize—homozygous ms45/ms45 results in male sterility, whereas Ms45/ms45 or Ms45/Ms45 restores fertility.
  • Can also be affected by mutation, such as the suppressor-mutator (Spm) transposable element system.

🌡️ Environmentally-induced male sterility

  • Some nuclear MS genes are induced only under certain environmental conditions (temperature or photoperiod).
  • Used in rice to facilitate hybrid production: MS female inbred lines can be maintained through self-pollination in environments that do not induce MS, while hybrid production occurs under conditions that promote MS.
  • Two-line system: uses only a male-sterile female line and a restorer line (no separate maintainer line needed).
  • Major drawback: weather conditions are unpredictable—temperature fluctuations may cause "leaks" (some viable pollen produced) or complete reversal to a fertile state.
  • Don't confuse: this relies on environmental triggers, not cytoplasmic inheritance.

🧪 Cytoplasmic male sterility (CMS)

Cytoplasmic male sterility (CMS): male sterility transmitted exclusively by the female parent as a function of the mitochondrial genome; fertility restoration relies on nuclear genes that suppress cytoplasmic dysfunction.

  • Observed in more than 150 plant species.
  • How it works: CMS-version inbreds do not exert anthers and do not shed pollen; each MS cytoplasm has its own "key" to restore fertility in the form of nuclear gene(s) called Rf (Restorers of fertility).
  • Three-line system: requires a CMS line, a maintainer line, and a restorer line.
    • The maintainer line is genetically identical to the MS inbred except it lacks the MS cytoplasm; both harbor the recessive restorer allele (r).
    • When the MS female inbred is crossed to the male inbred containing the dominant nuclear fertility restorer gene (R), the resulting F₁ seed for farmers is male fertile despite having the MS cytoplasm.
  • Example CMS systems in maize:
    • CMS T-type: requires dominant Rf1 and Rf2 genes, both required.
    • CMS S-type: requires dominant Rf3 gene.
    • CMS C-type: dominant Rf4 involved, possibly others.

🧬 Transgenic male sterility

  • Male sterility created through genetic engineering.
  • DuPont Pioneer (Corteva) SPT system (event DP-32138-1):
    • Uses zm-aa1 gene to render pollen inviable.
    • Includes Ms45 gene to restore fertility in an ms45/ms45 background.
    • Includes red fluorescent protein (DsRed2) gene as a marker for transgenic seeds.
    • Key advantage: the transgenic element does not enter the value chain or food system—transgenic male-fertile maintainer produces non-transgenic seed of the female parent; non-transgenic MS female plants produce non-transgenic hybrid seed; fertility is restored in F₁ through the Ms45/Ms45 male parent.
  • Rice system: uses mutant allele of nuclear gene OsNP1 (Oryza sativa No Pollen 1), which encodes a regulator gene controlling tapetum degeneration and pollen exine formation; the osnp1 mutant exhibits male sterility insensitive to environmental conditions.

💊 Chemically-induced male sterility

Chemical hybridization agents (CHAs): chemicals such as male gametocides that induce male sterility.

  • Used in rice, sugarbeet, wheat, cotton, rapeseed, canola, cucurbits, tomato, and onion.
  • Many CHAs (including sulfonylureas and imidazolinones) inhibit acetohydroxyacid synthase (AHAS), an important enzyme in amino acid biosynthesis.
  • Example in rapeseed: a sulfonylurea-resistant mutant maintains normal male fertility in the pollen parent during hybrid production despite herbicide application to the entire field.
  • Can be coupled with transgenic systems: Monsanto's MON 87427 in maize uses glyphosate herbicide (e.g., Roundup®) to induce MS just prior to tassel development.
    • The seed parent (MON 87427) protects all plant tissues except tapetum cells and pollen grains against glyphosate; herbicide application affects only these tissues.
    • The pollen parent must be protected through incorporation of a glyphosate-tolerant event expressed in all plant tissues.
  • Critical feature: female fertility is not affected.

⚠️ Risks and system selection

⚠️ The 1970 Southern corn leaf blight disaster

  • The U.S. corn crop was nearly wiped out in 1970 due to widespread use of T-CMS, which is susceptible to Southern corn leaf blight caused by pathogen Bipolaris maydis (also known as Cochliobolus heterostrophus).
  • Lesson: when considering a CMS system, it is critical to compare options and explore the possibility of negative impacts of aberrant mitochondrial genes on hybrid performance.
  • Don't confuse: the problem was not CMS itself, but the specific T-CMS cytoplasm's vulnerability to a pathogen.

🔍 Choosing a male sterility system

  • Many options are available to facilitate hybrid production through controlled pollination.
  • MS systems are key to producing genuine F₁ seed of improved inbreds.
  • Design implication: although hybrid production to produce F₁ seed for farmers takes place downstream of New Line Development, New Line Evaluation, and Trait Integration, the elements of the MS system used must often be integrated through breeding efforts.
  • Bottom line: choice of a MS system is a decision affecting the design of the product pipeline; introgression of MS system elements must be planned during breeding.

🌽 Hybrid types and trade-offs

🌽 Single-cross hybrids

  • F₁ seed is produced by crossing two genetically unrelated inbred lines, one from the female heterotic group and one from the male heterotic group.
  • Each seed has a genetic complement from each parent.
  • At every locus where the two inbred parents possess different alleles, the single-cross hybrid is heterozygous.
  • All F₁ seeds are genetically identical and plants are uniform.

🔀 Double-cross and three-way hybrids

  • When used: when the heterotic pattern is young and inbreeding depression in hybrid parents is still an issue.
  • Double-cross hybrids:
    • Involve four distinct inbreds as parent lines.
    • Require two steps: two inbreds from the female heterotic group (P1 and P2) are crossed, and two inbreds from the male heterotic group (P3 and P4) are crossed; then the F₁ seed from the female parents is crossed to the F₁ seed from the male parents.
    • Advantage: vigor of plants serving as parents of hybrid seed production is improved due to within-heterotic-group variability, boosting hybrid seed yield.
    • The hybrid seed for farmers exhibits between-group heterosis.
  • Three-way cross: a variation on double-crossing where the hybrid seed for farmers involves a single-cross as one parent and an inbred line as the other parent (typically the single-cross is the female parent).

⚖️ Disadvantages of multi-cross hybrids

DisadvantageExplanation
Extra step requiredDemands additional time and resources in product development
Not genetically uniformHybrid seed for farmers is not genetically uniform
Lower yield potentialNot expected to yield as well as a single-cross hybrid since specific combining ability (SCA) is not maximized
  • Don't confuse: multi-cross hybrids can overcome inbreeding depression effects to improve seed yields in hybrid production, but they sacrifice uniformity and maximum SCA.

🎯 Bottom line principle

Any new prospective cultivar must be able to be reproduced efficiently and cost-effectively!

  • This principle applies across all breeding programs, whether for hybrids, pure lines, or clonally propagated cultivars.
  • The choice of male sterility system directly impacts the efficiency and cost-effectiveness of hybrid seed production.
18

Cytoplasmic Male Sterility

Cytoplasmic Male Sterility

🧭 Overview

🧠 One-sentence thesis

Cytoplasmic male sterility (CMS) is a maternally inherited trait that prevents pollen production and is exploited in hybrid seed production through a three-line system involving male-sterile, maintainer, and restorer lines.

📌 Key points (3–5)

  • What CMS is: a trait transmitted exclusively by the female parent through the mitochondrial genome that prevents pollen shed, observed in more than 150 plant species.
  • How fertility is restored: nuclear restorer genes (Rf) suppress the cytoplasmic dysfunction, with each CMS type requiring its own specific "key" restorer gene(s).
  • Three-line system: CMS requires a male-sterile line, a maintainer line (genetically identical but with normal cytoplasm), and a restorer line (with Rf genes) to produce fertile hybrid seed.
  • Common confusion: CMS vs other MS types—CMS is mitochondrial and maternally inherited, while genic MS is controlled by nuclear genes; environmentally-induced MS depends on conditions like temperature.
  • Critical risk: choosing the wrong CMS system can cause vulnerability to disease, as happened with T-CMS and Southern corn leaf blight in 1970.

🌱 Male sterility systems overview

🌱 Types of male sterility

The excerpt describes several systems to prevent pollen shed in female inbred parents (which would otherwise produce selfed rather than hybrid seed):

TypeKey characteristic
Genic MSControlled by nuclear genes
Environmentally-induced MSTriggered by environmental conditions
Cytoplasmic MSTransmitted through mitochondrial genome
Transgenic MSCreated through genetic engineering
Chemically-induced MSInduced by chemical hybridization agents

🧬 Genic male sterility

Genic MS: controlled by nuclear genes, often determined by a recessive allele at a single locus.

  • First reported by L.A. Eyster in 1921 in maize.
  • Nearly all identified nuclear MS genes in maize have been mapped.
  • Example: the ms45 allele in maize—homozygous recessive state results in MS, while a single copy of the dominant Ms45 allele restores fertility.
  • Can also be affected by mutation, such as the suppressor-mutator (Spm) transposable element system.

🌡️ Environmentally-induced male sterility

  • Some nuclear MS genes can be induced under certain environmental conditions.
  • Thermo-sensitive and photoperiod-sensitive nuclear genes have been used in rice to facilitate hybrid production.
  • Uses a two-line system (male-sterile female line, restorer line).
  • Major limitation: weather conditions can be unpredictable and inconsistent; temperature fluctuations may cause "leaks" (some viable pollen produced) or complete reversal to a fertile state.
  • Advantage: MS female inbred lines can be maintained through self-pollination in environments not conducive to MS induction, while hybrid production occurs under conditions that promote MS.

🔬 Cytoplasmic male sterility mechanism

🔬 What CMS does

Cytoplasmic male sterility (CMS): transmitted exclusively by the female parent as a function of the mitochondrial genome.

  • Compared to normal maize inbred, the CMS-version does not exert anthers and therefore does not shed pollen.
  • This trait is exploited in hybrid seed production.
  • Has been observed in more than 150 plant species.
  • Similar CMS systems have been detected in many crops.

🔑 Restoration of fertility

  • Restoration relies on nuclear genes that suppress cytoplasmic dysfunction.
  • Each MS cytoplasm has its own "key" to restore fertility in the form of nuclear gene(s) referred to as Rf (Restorers of fertility).
  • Different CMS types require different restorer genes.

🌽 Examples of CMS systems in maize

CMS typeRestorer gene(s) required
CMS T-typeDominant Rf1 and Rf2 genes, both required
CMS S-typeDominant Rf3 gene
CMS C-typeDominant Rf4 involved, maybe others

🔄 The three-line system

🔄 How the system works

Three-line system: CMS line, maintainer line, restorer line.

  • This system relies on certain genetic elements on both sides of the hybrid pedigree.
  • All three lines are necessary for successful hybrid seed production.

🧪 Maintaining the CMS inbred line

  • The need to maintain the MS inbred is accomplished through the use of a maintainer line.
  • The maintainer line is genetically identical to the MS inbred except that it does not have the MS cytoplasm.
  • Both the MS inbred and its maintainer line harbor the recessive MS restorer allele (denoted r).

🌾 Hybrid production process

  • When the MS female inbred is crossed to the male inbred containing the nuclear fertility restorer gene (denoted R), the resulting F₁ seed for farmers is male fertile despite having the MS cytoplasm.
  • The hybrid seed is fertile because the restorer gene from the male parent suppresses the cytoplasmic male sterility.

🔀 Don't confuse: two-line vs three-line systems

  • Two-line system (environmentally-induced MS): male-sterile female line + restorer line; relies on environmental conditions.
  • Three-line system (CMS): CMS line + maintainer line + restorer line; does not depend on environmental conditions but requires maintaining separate lines.

⚠️ Choosing and risks of CMS systems

⚠️ Critical considerations

  • When considering the use of a CMS system, it is critical to compare options and explore the possibility of the negative impact of aberrant mitochondrial genes on hybrid performance.
  • The choice of CMS system can have major consequences for disease susceptibility.

🦠 The 1970 Southern corn leaf blight disaster

  • The U.S. corn crop was nearly wiped out in 1970 due to the widespread use of T-CMS.
  • T-CMS is susceptible to Southern corn leaf blight caused by pathogen Bipolaris maydis (also known as Cochliobolus heterostrophus).
  • This demonstrates the importance of carefully evaluating CMS systems before widespread deployment.

🧬 Alternative MS systems

🧬 Transgenic male sterility

  • Some systems of male sterility have been created through genetic engineering.
  • Example: DuPont Pioneer (now Corteva Agrisciences) Seed Production Technology (SPT) system involving transgenic event DP-32138-1 in maize.

How the SPT system works:

  • Relies on the zm-aa1 gene to render pollen inviable.
  • Includes an Ms45 gene, which restores fertility in an ms45/ms45 background.
  • Includes a red fluorescent protein (DsRed2) gene as a marker for transgenic seeds.
  • The transgenic male-fertile maintainer line produces non-transgenic seed of the female parent (transgenic seed is also produced due to segregation and is used to propagate the maintainer line).
  • Non-transgenic MS female plants are used in hybrid production to create non-transgenic hybrid seed.
  • Fertility is restored in the F₁ seed through its Ms45/Ms45 male parent.
  • Key advantage: the transgenic element does not enter the value chain or the food system.

Rice transgenic MS system:

  • Utilizes a mutant allele of nuclear gene OsNP1 (Oryza sativa No Pollen 1), which encodes a regulator gene controlling tapetum degeneration and pollen exine formation.
  • The osnp1 mutant exhibits male sterility that is insensitive to environmental conditions.

💊 Chemically-induced male sterility

  • MS has been induced in a number of crops including rice, sugarbeet, wheat, cotton, rapeseed, canola, cucurbits, tomato, and onion.
  • Uses chemical hybridization agents (CHAs) such as male gametocides (e.g., gibberellic acid, maleic hydrazide, naphthalene acetic acid, ethereal).
  • Many CHAs, including classes of sulfonylureas and imidazolinones, inhibit acetohydroxyacid synthase (AHAS), an important enzyme in amino acid biosynthesis.
19

What are Clonally Propagated Crops?

What are Clonally Propagated Crops?

🧭 Overview

🧠 One-sentence thesis

Clonally propagated crops exploit all genetic variation through asexual reproduction, allowing breeders to access the final cultivar genotype immediately after crossing, though practical testing and propagation constraints require multi-stage evaluation.

📌 Key points (3–5)

  • What clonally propagated crops are: crops maintained and distributed by asexual reproduction using plant parts like tubers, roots, or stem cuttings.
  • Key genetic advantage: all genetic variation (additive, dominance, epistatic) is captured and passed intact to the next clonal generation, unlike crops requiring sexual reproduction.
  • Common confusion: varieties are both completely uniform (non-segregating, genetically fixed) and highly heterozygous hybrids at the same time.
  • Why testing takes time: despite immediate access to the final genotype, low propagation coefficients, complex product targets, and high genotype-by-environment interaction require multi-year, multi-location evaluation.
  • GxExM systems: genotype by environment by management interactions can be explored in depth to identify best genotypes for specific production conditions.

🌱 Core definition and reproduction

🌱 What clonally propagated crops are

Clonally propagated crops are those maintained and distributed for cultivation by asexual reproduction.

  • Different plant parts serve as clones depending on the crop: tubers, roots, stem cuttings.
  • Potato varieties specifically use tubers for propagation.
  • The excerpt uses Irish potato (Solanum tuberosum subsp. tuberosum) as the primary example.

🔄 How reproduction works

Sexual reproduction (one-time event):

  • New genetic variation is generated by crossing parent lines.
  • Once seed is produced, no genetic changes occur.
  • Each F₁ seed plant becomes the source of all subsequent clones.

Asexual reproduction (all subsequent generations):

  • All clones created from each F₁ seed plant are genetically identical.
  • Used to generate all materials derived from F₁ seed plants.
  • No inbreeding occurs; genotypic frequencies do not change.

Example: A breeder crosses two parent plants → produces F₁ seeds → grows F₁ plants → each F₁ plant can be cloned indefinitely with no genetic change.

🧬 Genetic characteristics

🧬 Heterozygosity and polyploidy

Most clonally propagated crops share these features:

  • Cross-pollinated: many are obligate outcrossers due to self-incompatibility.
  • Highly heterozygous parents: parents used in breeding crosses carry diverse alleles.
  • Heterozygous F₁ individuals: the breeding population consists of heterozygous F₁ plants from crossing.
  • Polyploidy amplifies heterozygosity: many clonally propagated crops are polyploid (potato can be 2x, 3x, 4x, or 5x; Irish potato is tetraploid).

The excerpt notes: "Theoretically, every allele at a locus could be distinctive" in polyploid crops.

🎯 Each F₁ plant is a potential variety

  • The family structure in the breeding population begins with a single plant at the F₁ stage.
  • Each F₁ plant is a potential variety because it can be cloned indefinitely.
  • Don't confuse: even though each variety is completely uniform (all clones are identical), the variety itself may be a highly heterozygous hybrid.

🔒 Uniformity paradox

All released varieties of clonally propagated crops are homogeneous: i.e. varieties are non-segregating, genetically fixed, and completely uniform. At the same time, each variety may be a highly heterozygous hybrid.

This apparent paradox is resolved by understanding:

  • Within a variety: all plants are genetically identical clones (uniform, non-segregating).
  • Within each plant: the genome may contain many different alleles at each locus (heterozygous).
  • The uniformity comes from cloning, not from genetic homozygosity.

💪 Genetic advantages

💪 Exploiting all genetic variation

Clonally propagated crops can exploit all types of genetic variation:

  • Additive variation
  • Dominance variation
  • Epistatic variation

In contrast, crops requiring sexual reproduction for the next generation cannot fully exploit dominance and epistatic variation.

📊 Variance and heritability

AspectClonally propagated cropsCrops requiring sexual reproduction
Covariance between individualsEquals the entire genetic varianceOnly a portion of genetic variance
Variation passed to next generationEntire variation passed intactReduced by segregation
Gene frequency changesOnly with further crossingChanges with each sexual generation
InbreedingDoes not occurOccurs with self-pollination

The excerpt emphasizes: "This represents a major advantage over breeding progeny that requires sexual reproduction to produce the next generation."

⚡ Immediate access to final genotype

  • The genotype finally released as a new cultivar is accessible to the breeder immediately after the initial crossings.
  • All testing is conducted using clones of the F₁ plants.
  • Genotypic frequencies do not change since there is no inbreeding.
  • Gene frequencies change only with further crossing.

This means the breeder theoretically has the winning variety in hand right after making crosses, unlike crops that require multiple generations of selection and inbreeding.

🚧 Practical obstacles to immediate release

🚧 Why further testing steps are needed

Even though the final genotype is accessible immediately, the excerpt identifies four major obstacles:

🌿 Seed-grown vs vegetative material performance

  • Plants grown from seed do not perform comparably with plants generated from vegetative planting material.
  • Plants raised from seed are typically grown in the greenhouse, which is not representative of field conditions.
  • This means the breeder cannot accurately evaluate F₁ seed plants directly; clonal propagation is needed first.

📈 Propagation coefficient limitations

Propagation coefficient: the amount of planting materials available at each stage, determined by how many new plants can be generated from existing plants.

  • Propagation of materials for testing takes time.
  • Potato has a relatively low propagation coefficient of approximately 10.
  • For comparison, sweet potato has a propagation coefficient of 30+.
  • Example: Starting with one potato plant, you can generate only about 10 new plants in the next cycle, limiting how quickly you can scale up for large trials.

🎯 Complex product targets

  • Clonally propagated crops often involve complex product targets.
  • Many traits are required at multiple levels: farmer, processor, and consumer.
  • Evaluations to facilitate selection are numerous and require significant amounts of testing materials.
  • This complexity means extensive testing is needed even after identifying promising clones.

🌍 High genotype by environment interaction

  • Genotype by environment interaction (GxE) can be very high for yield and other low heritable traits in clonally propagated crops like potato.
  • Multiple-location, multiple-year evaluation is needed for:
    • Yield
    • Disease/pest resistances
    • Quality traits
  • These evaluations are necessary to satisfy product demands locally and regionally.

🌐 Genotype by environment by management systems

🌐 Understanding phenotype components

The excerpt defines phenotype as:

Phenotype (P) is the observed expression of a trait in an individual, as determined by genetic makeup and environmental factors.

Components:

  • G = genetic effects
  • E = environmental effects (locations, years, seasons, etc.)
  • GxE = genotype x environment interaction effects

🔍 Exploring GxE in clonally propagated crops

Because all genetic variation (additive, dominance, epistatic) is captured in clonally propagated crops:

  • GxE can be examined in depth to identify "best" genotypes for specific types of environments.
  • With only one genotype considered, GxE is not relevant or discernible.
  • Phenotypic expression between clones of the same genotype grown in two different environments becomes a function of environmental variation.

Don't confuse: Clones of the same genotype are genetically identical, so differences between them in different environments are purely environmental (or GxE), not genetic.

🛠️ Adding management to the equation

Genotype by environment by management (GxExM) system can be exploited to realize the full genetic potential of new cultivars in crop production.

Management aspects that contribute to GxExM:

  • Watering regime
  • Tillage regime
  • Soil fertility regime
  • Pest control regimes
  • Plant density
  • Sowing date
  • Row spacing

All of these aspects have the potential to influence productivity and quality of crop production.

📋 Practical application for potato

  • An important element of cultivar release is guidance to farmers on management aspects of the new cultivar in production.
  • Management practices must be addressed in the testing regime prior to new cultivar release.
  • Example mentioned: ridge-furrow (text cuts off).
  • This means breeders must test not just which genotypes perform best, but also which management practices optimize each new cultivar's performance.
20

Examining Genotype by Environment by Management (GxExM) Systems

Examining Genotype by Environment by Management (GxExM) Systems

🧭 Overview

🧠 One-sentence thesis

Clonally propagated crops capture all genetic variation (additive, dominance, epistatic) in each F₁ plant, enabling breeders to exploit genotype by environment by management (GxExM) interactions to identify the best genotypes for specific production systems.

📌 Key points (3–5)

  • Clonal propagation advantage: all genetic variation is exploited and passed intact to the next generation, unlike sexually reproduced crops; the final cultivar genotype is accessible immediately after initial crossing.
  • Why testing takes time: despite theoretical ability to identify best clones in year one, practical obstacles include seed vs. vegetative performance differences, low propagation coefficients, complex product targets, and high genotype by environment (GxE) interaction.
  • GxExM framework: phenotype equals genetic effects plus environmental effects plus GxE interaction; adding management practices (watering, tillage, fertility, pest control, density, sowing date, spacing) creates the GxExM system to realize full genetic potential.
  • Complex product targets: potato breeding must satisfy multiple stakeholders (farmers, processors, consumers) across many traits—yield, stress tolerance, appearance, disease resistance, flavor, storage, processing quality.
  • Common confusion: in clonally propagated crops, every F₁ plant is a potential variety and may be highly heterozygous yet the variety itself is completely uniform (non-segregating, genetically fixed) because all clones are genetically identical.

🌱 Unique features of clonal propagation

🌱 Every F₁ plant is a potential variety

  • Clonal propagation begins with a single plant at the F₁ stage.
  • Each F₁ plant represents a potential variety due to high heterozygosity.
  • Many clonally propagated crops are polyploid (e.g., potato can be 2x, 3x, 4x, or 5x; Irish potato is tetraploid), amplifying heterozygosity.
  • Theoretically, every allele at a locus could be distinctive.

🔒 Homogeneous yet heterozygous

All released varieties of clonally propagated crops are homogeneous: varieties are non-segregating, genetically fixed, and completely uniform. At the same time, each variety may be a highly heterozygous hybrid.

  • Don't confuse: "homogeneous" refers to uniformity within a variety (all clones are genetically identical), not to homozygosity at the allele level.
  • Sexual reproduction generates new variation by crossing parents; once seed is produced, no genetic changes occur.
  • All clones from each F₁ seed plant are genetically identical.
  • Asexual reproduction generates all subsequent generations from F₁ seed plants.

🧬 All genetic variation is exploited

  • Additive, dominance, and epistatic variation are all captured.
  • The covariance between clones (of different F₁ plants) equals the entire genetic variance.
  • This variation is passed in its entirety to the next clonal generation—a major advantage over breeding progeny requiring sexual reproduction.
  • All testing uses clones of F₁ plants; genotypic frequencies do not change (no inbreeding).
  • Gene frequencies change only with further crossing.
  • Key advantage: the genotype finally released as a new cultivar is accessible immediately after initial crossings.

🚧 Practical obstacles to rapid selection

🚧 Seed vs. vegetative performance

  • Plants grown from seed do not perform comparably with plants generated from vegetative planting material.
  • Seed-raised plants are typically grown in greenhouses, which are not representative of field conditions.
  • Example: even if a clone looks promising from seed in the greenhouse, field performance from vegetative material may differ.

🚧 Propagation coefficient limits

Propagation coefficient: the amount of planting materials available at each stage, determined by the crop's reproductive capacity.

  • Propagation of materials for testing takes time.
  • Potato has a relatively low propagation coefficient of approximately 10 (compared to sweet potato at 30+).
  • This means fewer plants can be generated per cycle, slowing the testing process.

🚧 Complex product targets

  • Clonally propagated crops often involve complex product targets.
  • Many traits are required at the farmer, processor, and consumer levels.
  • Evaluations to facilitate selection are numerous and require significant amounts of testing materials.
  • Example: potato must satisfy demands for yield, appearance, disease resistance, flavor, storage life, and processing qualities—each requiring different screens.

🚧 High genotype by environment interaction

  • Genotype by environment interaction (GxE) can be very high for yield and other low heritable traits in clonally propagated crops like potato.
  • Multiple-location, multiple-year evaluation is needed for yield, disease/pest resistances, and quality traits.
  • This is necessary to satisfy product demands locally and regionally.

🌍 GxExM: Genotype by Environment by Management

🌍 The phenotype equation

Phenotype (P): the observed expression of a trait in an individual, as determined by genetic makeup and environmental factors.

The excerpt presents the relationship:

  • P = G + E + GxE
  • G = genetic effects
  • E = environmental effects (locations, years, seasons, etc.)
  • GxE = genotype × environment interaction effects

Example: phenotypic expression between clones of the same genotype grown in two different environments becomes a function of environmental variation. (Note: with only one genotype considered, GxE is not relevant or discernible.)

🌍 Adding management to the equation

  • To explore environmental effects and their impact on phenotype further, the particular management systems and practices employed in crop production must be detailed.
  • Genotype by environment by management (GxExM) system can be exploited to realize the full genetic potential of new cultivars in crop production.
  • Because all genetic variation (additive, dominance, epistatic) is captured in clonally propagated crops, GxE can be examined in depth to identify "best" genotypes for specific types of environments.

🛠️ Management aspects contributing to GxExM

The excerpt lists relevant management aspects:

  • Watering regime
  • Tillage regime
  • Soil fertility regime
  • Pest control regimes
  • Plant density
  • Sowing date
  • Row spacing

All of these aspects have the potential to influence productivity and quality of crop production.

🛠️ Management guidance for cultivar release

  • For potato, an important element of cultivar release is guidance to farmers on management aspects of the new cultivar in production.
  • Management practices must be addressed in the testing regime prior to new cultivar release.
  • Example: ridge-furrow tillage system with specific row spacing (e.g., 36-inch row spacing).

🎯 Complex product targets in potato

🎯 Stakeholder demands across the value chain

Target traits for potato must meet demands of value chain stakeholders and may include:

CategoryTraits
Yield & stabilityHigh fresh-weight yield, yield stability
Stress toleranceAbiotic stress tolerances to low soil fertility, drought, heat, salinity
Maturity & adaptationMaturity, adaptation to local environment
Tuber appearanceNumber, shape, size, uniformity, skin color, flesh color, eye depth, lack of internal defects (e.g., hollow heart, brown center), lack of external defects (e.g., cracks, greening)
Disease & pestDisease resistance (e.g., late blight, early blight, bacterial wilt, potato leafroll virus), pest tolerances (e.g., potato tuber moth, green peach aphid, leafminer fly, nematodes)
QualityFlavor, aroma, texture, nutrient and anti-nutrient content (e.g., carotenoids, anthocyanins, Vitamin C, micronutrients; low levels of glycoalkaloids), storage life
ProcessingProcessing qualities (e.g., frying for French fries or chips, blackening after cooking), starch type (altered ratio of amylopectin to amylose for industrial purposes)

🎯 Evaluation complexity

  • Product targets are typically complex and require many types of screens to evaluate performance.
  • Because of the fresh produce market for potato, it is critical to have farmers and consumers participate early in the selection process.
  • However, farmer participation is best limited to selection for visual traits, especially those for which a particular threshold for performance is essential.
  • Lowly heritable traits like yield are best selected by the breeder.

🧑‍🌾 Choosing parents for breeding

🧑‍🌾 Breeding population goals

  • The goal of breeding crosses is to create breeding populations with high mean performance for traits of interest as well as wide genetic variability.
  • Considering all traits of interest, the ideal is a good × good cross, resulting in new combinations of favorable alleles and more loci with the favorable genotype.

🧑‍🌾 Combining ability vs. per se performance

  • For autotetraploid, highly heterozygous potato, per se performance of parents is not necessarily a good indicator of breeding value.
  • Combining ability may be more informative.
  • Don't confuse: in highly heterozygous polyploid crops, a parent's own performance does not reliably predict the performance of its offspring; the parent's ability to combine well with other parents (combining ability) is more important.
21

Choosing Parents

Choosing Parents

🧭 Overview

🧠 One-sentence thesis

In autotetraploid potato breeding, combining ability (especially general combining ability) is more informative than per se parent performance for predicting breeding value, and the goal is to create populations with both high mean performance and wide genetic variability for complex trait targets.

📌 Key points (3–5)

  • Goal of parent crosses: create breeding populations with high mean performance and wide genetic variability for traits of interest.
  • Ideal cross strategy: good × good crosses that produce new combinations of favorable alleles and more loci with favorable genotypes.
  • Common confusion: for autotetraploid, highly heterozygous potato, per se performance of parents is not necessarily a good indicator of breeding value; combining ability may be more informative.
  • GCA vs SCA: general combining ability (GCA) captures additive and additive-by-additive epistatic effects; specific combining ability (SCA) may be critical for traits involving heterosis (e.g., yield).
  • Practical approach: parents typically include 100+ clones selected from the previous year's preliminary trials, forming a kind of recurrent selection.

🎯 Breeding goals and cross strategy

🎯 What a good cross should achieve

The excerpt states that the goal of breeding crosses is to create breeding populations with two qualities:

  • High mean performance for traits of interest
  • Wide genetic variability

These two goals work together: high mean ensures the population is shifted toward favorable trait values, while wide variability ensures enough diversity to select from.

🧬 The ideal cross type

The ideal is a good × good cross, resulting in new combinations of favorable alleles and more loci with the favorable genotype.

  • "Good × good" means both parents have desirable trait performance.
  • The cross recombines favorable alleles from both parents, increasing the number of loci that carry favorable genotypes.
  • Example: if Parent A has favorable alleles at some loci and Parent B has favorable alleles at other loci, their progeny may combine both sets.

🧪 Why per se performance is misleading in potato

🧪 The autotetraploid, heterozygous problem

The excerpt emphasizes:

For autotetraploid, highly heterozygous potato, per se performance of parents is not necessarily a good indicator of breeding value.

  • Autotetraploid: potato has four copies of each chromosome (not two like diploid crops).
  • Highly heterozygous: potato parents carry many different alleles at each locus.
  • Together, these traits mean that a parent's own performance (per se) does not reliably predict how well its progeny will perform.

🔍 Combining ability as a better predictor

The excerpt states:

Combining ability may be more informative.

  • Combining ability measures how well a parent's alleles combine with those of other parents in progeny.
  • It is a better predictor of breeding value than simply looking at the parent's own trait performance.
  • Don't confuse: per se performance = how the parent itself performs; combining ability = how the parent's alleles perform in crosses.

🧮 GCA and SCA: two types of combining ability

🧮 General combining ability (GCA)

GCA (general combining ability) for a quantitative trait is composed of additive and additive-by-additive epistatic gene effects.

  • GCA reflects the average performance of a parent across many crosses.
  • It captures:
    • Additive effects: the direct contribution of alleles.
    • Additive-by-additive epistatic effects: interactions between additive effects at different loci.
  • GCA is useful for predicting a parent's general breeding value across multiple crosses.

🧮 Specific combining ability (SCA)

For traits involving heterosis (e.g. yield), SCA (specific combining ability) may be critical to trait performance in progeny.

  • SCA measures the performance of a specific pair of parents in a particular cross.
  • It is especially important for traits that show heterosis (hybrid vigor), such as yield.
  • Example: Parent A may combine exceptionally well with Parent B but poorly with Parent C; SCA captures this pair-specific effect.
  • Don't confuse: GCA = average across all crosses; SCA = performance of a specific pair.

🧮 How to estimate combining ability

The excerpt recommends:

Diallel analysis may be useful in estimating breeding value (see Chapter 3 for guidelines and ALA example) and more reliable than per se performance in choosing parents.

  • Diallel analysis: a mating design where multiple parents are crossed in all possible combinations.
  • It allows estimation of both GCA and SCA.
  • The excerpt states this is "more reliable than per se performance" for choosing parents.

🔄 Practical parent selection in potato breeding

🔄 Using clones from the previous year

The excerpt describes a common practice:

Commonly, the parents include a set of 100+ clones selected in the previous year following preliminary trials for all traits.

  • Parents are not fixed; they are updated each year.
  • The breeder selects 100+ clones that performed well in the previous year's trials.
  • These clones are then used as parents for the next round of crosses.

🔄 Recurrent selection

Thus, a form of recurrent selection may be practiced.

  • Recurrent selection: a cyclical breeding method where the best individuals from one generation become parents for the next.
  • In potato, this means each year's best clones feed into the next year's crosses, gradually improving the breeding population over cycles.
  • Example: Year 1 trials identify 100 top clones → these become Year 2 parents → Year 2 trials identify new top clones → these become Year 3 parents, and so on.
22

What is a 'Value-Added' Trait?

What is a ‘Value-Added’ Trait?

🧭 Overview

🧠 One-sentence thesis

Value-added traits are special, high-demand characteristics that enhance elite cultivars by addressing critical needs of farmers, end-users, or consumers, and they are integrated through a dedicated function in the cultivar development pipeline.

📌 Key points (3–5)

  • What a VAT is: a novel or uncommon trait of value to stakeholders when incorporated into an elite cultivar—think of it as "frosting on the cake."
  • Genetic simplicity: VATs typically involve no more than five genetic factors and are often conferred by single genes; they are generally not common in the gene pool.
  • Trait types: include disease resistance, insect resistance, herbicide tolerance, abiotic stress tolerance, yield enhancement, nutritional improvement, and consumer preferences.
  • Development techniques: VATs may be developed through mutagenesis, QTL mapping, transformation, or gene editing.
  • Common confusion: most mutant alleles are recessive, which affects product development—for example, hybrid cultivars need both parents to carry the allele for trait expression.

🎯 Defining value-added traits

🎯 Core definition

A value-added trait (VAT) is a special trait that represents a novel or uncommon characteristic of value to farmers, end-users or consumers when incorporated into an elite cultivar.

  • VATs are not baseline characteristics; they are enhancements added to already elite (high-performing) cultivars.
  • The excerpt emphasizes they are "must-have" characteristics for one or more stakeholder groups in the crop value chain.
  • Sometimes these traits command a premium price depending on deployment economics.

🍰 The "frosting on the cake" analogy

  • The excerpt uses this metaphor to clarify that VATs are enhancements, not foundational traits.
  • They add special value on top of an already strong cultivar base.
  • Example: An elite lettuce variety becomes even more valuable when resistance to Lettuce Mosaic Virus (LMV) is added—the resistance is the "frosting."

🧬 Genetic characteristics

  • Typically involve no more than five genetic factors.
  • Often conferred by single genes.
  • Generally not common in the gene pool—they represent rare or novel genetic variation.
  • Some VATs have designated tradenames (e.g., Clearfield® rice).

🌾 Types of value-added traits

🛡️ Protection traits

Trait typeWhat it does
Disease resistanceProtects against pathogens (e.g., LMV resistance in lettuce via the mo1 gene)
Insect resistanceReduces damage from insect pests
Herbicide toleranceAllows crop survival when herbicides are applied
Abiotic stress toleranceImproves performance under drought, low fertility, etc.
  • Example: Lettuce Mosaic Virus resistance—a single recessive gene (mo1) confers resistance/tolerance, preventing devastation and severe crop losses from a potyvirus that causes deformed heads and discolored leaves.

📈 Performance and quality traits

  • Yield/productivity enhancement: increases output per unit area.
  • Nutritional enhancement: improves food quality (e.g., more lysine in corn, healthy oil profile in soybean, reduced lignin content in alfalfa).
  • Consumer or end-user preferences: addresses market demands (e.g., non-browning apple, reduced black spot bruising in potato, fruit ripening control in tomato).
  • Other specialized traits: male sterility for hybrid production, flower color for ornamentals (e.g., rose).

🔬 How VATs are developed

🧪 Mutagenesis

  • Generates new alleles through induced mutations.
  • Some mutant alleles have novel utility in crop plants.
  • Example: Imidazolinone-herbicide tolerant rice (Clearfield® rice) developed by BASF—confers tolerance to herbicides that inhibit amino acid synthesis by inhibiting the acetolactate synthase (ALS) enzyme; the ALS mutant allele was created through chemical exposure to ethylmethanesulfonate (EMS), a mutagen.

🧬 Other development techniques

The excerpt lists additional methods:

  • QTL mapping: identifies quantitative trait loci associated with valuable traits.
  • Transformation: introduces genes from other organisms (e.g., virus resistant papaya).
  • Gene editing: precisely modifies genes (e.g., sulfonylurea herbicide tolerant canola).

⚠️ Recessive allele challenge

  • Most mutant alleles are recessive, which has important implications for product development.
  • Don't confuse: a recessive VAT in a hybrid requires both inbred parents to carry the allele for threshold expression.
  • Example: For a hybrid cultivar to express a recessive herbicide tolerance trait, both parents must carry the mutant allele—otherwise the trait won't be expressed at sufficient levels.

🔄 VAT integration in the pipeline

🔄 Position in cultivar development

  • The excerpt places Trait Integration as the third core function in the four-function product pipeline (after New Line Development and New Line Evaluation).
  • This function serves to incorporate high-demand traits to further improve elite cultivars.
  • It is a distinct step that adds value to already-developed lines before they move to the final stage (typically distribution/commercialization).

🎯 Strategic purpose

  • VATs target specific stakeholder needs in the crop value chain.
  • They can be "must-have" characteristics that differentiate products in the market.
  • The integration process ensures these special traits are successfully combined with elite genetic backgrounds.
23

TILLING

TILLING

🧭 Overview

🧠 One-sentence thesis

TILLING is a practical method for discovering single-nucleotide mutations in plants, contributing to efficiency gains in trait integration and cultivar development.

📌 Key points (3–5)

  • What TILLING is: a technique for practical single-nucleotide mutation discovery in plants.
  • Context of use: part of the broader trait integration pipeline, which includes backcrossing, pyramiding, trait fixation, and version testing.
  • Efficiency strategies: seed chipping before planting and rapid cycling through off-season or tropical nurseries enable faster trait fixation and selection.
  • Version testing goal: converted cultivars must achieve performance equivalency with their unconverted counterparts.
  • Common confusion: trait fixation is not just about creating conversions—performance equivalency must be confirmed before hand-off to supply chain.

🧬 What TILLING is

🧬 Definition and purpose

TILLING: Practical single-nucleotide mutation discovery.

  • The excerpt cites Comai and Henikoff (2006) as the source for this definition.
  • TILLING is a tool for identifying mutations at the single-nucleotide level in plant genomes.
  • It is mentioned in the context of trait integration, suggesting it supports the identification and selection of desired genetic changes.

🔗 Role in trait integration

  • The excerpt places TILLING within a chapter on "Value-Added Trait Integration," indicating it is one component of a larger breeding pipeline.
  • Trait integration involves moving genetic factors (traits) into elite cultivar backgrounds through backcrossing, pyramiding, and fixation.
  • TILLING likely aids in detecting and confirming the presence of specific mutations during these processes.

⚙️ Efficiency strategies in trait fixation

🌱 Seed chipping and pre-planting selection

  • Seed chipping: sampling seed material before planting to genotype individuals.
  • Leaf tissue sampling: an alternative method using seedling plant tissue.
  • Key advantage: selection is performed before planting, so resources (space, labor, inputs) are focused only on individuals with the desired genotype.
  • Cycle time reduction: the S₂ (second selfed generation) outcome is determined before planting, shortening the breeding cycle.
  • Example: if testcrossing is needed for version testing, the selected S₂ materials can be planted directly to produce hybrids, avoiding wasted effort on undesired genotypes.

🔄 Rapid cycling of generations

  • What it means: achieving more than one generation per year to accelerate the entire trait integration process.
  • How it is done:
    • Off-season nurseries
    • Continuous nurseries in tropical locations
    • Greenhouse environments
  • Result: with corn, continuous nurseries have enabled four generations per year.
  • Term used: intensive nursery management for rapid generation progression is called speed breeding.
  • Don't confuse: rapid cycling applies to all stages—backcrossing, pyramiding, and trait fixation—not just one step.

🌾 Breeder seed production and hand-off

  • Individuals fixed for the introgressed genetic factors are self-pollinated to produce seed.
  • This seed is bulked to form the converted new elite line.
  • Hand-off to the supply chain occurs only after performance equivalency of the converted line is confirmed.

🧪 Version testing and performance equivalency

🎯 Goal of version testing

The goal is performance equivalency.

  • The converted cultivar (with new traits) is tested against its unconverted counterpart.
  • Success is not automatic: just because conversions are created does not mean performance equivalency is recaptured.

📊 Factors determining success

FactorWhat it means
Residual NRP germplasmAmount of non-recurrent parent genetic material remaining in the converted cultivar
Probability of recoveryLikelihood of finding at least one version with equivalent performance
Number of stacked versionsHow many stacked trait versions of each recurrent parent are available

🔬 Evidence for success

  • Sun and Mumm (2015) showed that, given strict standards for single-event conversions, there is a high likelihood of recovering an equivalent conversion even with 15 stacked events.
  • The excerpt describes this finding as "encouraging," indicating that multi-trait stacking does not necessarily compromise performance equivalency.

⚠️ Common confusion

  • Don't confuse trait fixation with performance confirmation: fixation means the desired genetic factors are homozygous and stable, but performance equivalency must still be tested and verified before the line is released.

🚀 Overall pipeline context

🏭 Trait integration in the product pipeline

  • Trait integration is one component of cultivar development.
  • The chapter emphasizes process design for efficiency, because a higher rate of genetic gain is critical for delivering better cultivars to farmers.
  • The excerpt mentions that the next chapter will consider further optimization of the product pipeline.

📚 Reference context

  • The excerpt is from Chapter 5 of a larger work on cultivar development.
  • It references multiple studies on marker-based procedures, pyramiding, and optimization (e.g., Ishii and Yonezawa 2007a, 2007b; Mumm 2013; Peng et al. 2014a).
  • The chapter also cites a Monsanto-DowAgrosciences collaborative agreement presentation (2014) and an ISAAA brief on biotech crop commercialization (James 2015), indicating the applied, industry-relevant context of the material.
24

VATs Based on Transformation

VATs Based on Transformation

🧭 Overview

🧠 One-sentence thesis

Transformation-based VATs (genetically modified traits) have been rapidly adopted worldwide since 1995, primarily to protect crop yield potential against pests through novel traits not accessible in the host species gene pool or through modified expression of host genes.

📌 Key points (3–5)

  • What transformation produces: a transgenic "event" defined by unique inserted DNA sequence and its precise insertion point in the host genome, originating from a single transformed cell regenerated into a T₀ plant.
  • Primary applications: protection against weeds, insects, and disease pests; traits either unavailable in the host gene pool or modified for higher expression, specific tissues, certain timing, or silenced expression.
  • Rapid global adoption: GM crop area grew from 1.7 million hectares (1995) to 179.7 million hectares (2015), making it the most rapidly adopted technology in modern agriculture.
  • Stacked traits: multiple GM traits combined in single cultivars, with nearly 60 million hectares globally by 2015; pyramiding multiple genes for the same trait helps prevent pest resistance.
  • Common confusion: transformation vs. gene editing—transformation inserts foreign DNA creating transgenic events requiring government approval, while gene editing (covered separately) makes targeted changes using the organism's own repair system.

🧬 What transformation creates

🧬 The transgenic event

Event: defined by the unique DNA sequence inserted in the host genome through transformation and the precise point of insertion.

  • Each event originates as a single plant (T₀ plant) regenerated from a single transformed cell.
  • The event becomes the original source of the VAT.
  • Example: Papaya resistant to Papaya Ringspot Virus uses Event 55-1, which confers resistance through gene silencing in response to a virus coat protein gene fragment.

🔬 Types of novel traits

Transformation enables traits that are:

  • Not accessible in the current host species gene pool, OR
  • Modified host genes expressed at:
    • Higher thresholds
    • Specific tissues
    • Certain timing in the life cycle
    • Silenced expression

Example: YieldGard® corn confers resistance to Lepidopteran insect pests (leaves, stalks, ears)—a trait engineered for protection not readily available through conventional breeding.

📈 Adoption patterns and scale

🌍 Global growth trajectory

  • 1995 (first commercialization): 1.7 million hectares
  • 2015: 179.7 million hectares
  • This represents the most rapidly adopted technology in modern agriculture.

The excerpt notes that industrial and developing countries both contributed to this growth, though specific breakdowns are shown in figures.

🌾 Primary GM crops

The four main GM crops globally (2015):

CropStatus
SoybeanPrincipal GM crop
MaizePrincipal GM crop
CottonPrincipal GM crop
CanolaPrincipal GM crop

🌍 GM adoption in Africa

Countries with approved GM crops (at least one type):

  • Burkina Faso (cotton)
  • Sudan (cotton)
  • South Africa (maize, soybean, cotton)

Countries with approved research field trials:

  • Cameroon, Egypt, Ghana, Kenya, Malawi, Nigeria, Uganda

Don't confuse: approval for cultivation vs. approval for research trials—the latter does not permit commercial growing.

🧩 Stacked traits and molecular stacking

🧩 What stacking means

Stacked traits: crops containing more than one GM trait combined in a single cultivar.

  • By 2015, stacked-trait crops were grown on nearly 60 million hectares globally.
  • The proportion of cultivars with stacked traits escalated quickly after initial GM adoption.

🌽 SmartStax® corn example

SmartStax® demonstrates both multiple VATs and multiple genes per VAT:

8 genes total, conferring 4 VATs:

  1. Above-ground insect resistance (Lepidopteran species): 3 genes
  2. Below-ground insect resistance (Corn Rootworm): 3 genes
  3. Glyphosate herbicide tolerance: 1 gene
  4. Glufosinate herbicide tolerance: 1 gene

🔐 Molecular stacking

Molecular stacking: a condition where each event includes more than one gene.

  • SmartStax® involves four events: MON88017, MON89034, TC6275, and DAS59122-7.
  • Each event delivers multiple genes within a single insertion.

🛡️ Why pyramid multiple genes for one trait

  • Multiple genes for the same characteristic make it difficult for pest species to overcome resistance.
  • Single genes are relatively easily overcome when enough selection pressure is applied.
  • This pyramiding is a "best practice" and key strategy for:
    • Managing development of resistance in pests
    • Prolonging the utility of an engineered solution

Example: SmartStax® uses 3 genes for above-ground insect resistance rather than 1, reducing the likelihood that Lepidopteran pests will develop resistance.

🏛️ Regulatory requirements

🏛️ Government approval process

VATs created through genetic engineering are subject to government regulation on a country-by-country basis.

Before approving cultivation or import, governmental bodies generally review:

  • Problem addressed and effectiveness of the proposed solution
  • Food safety
  • Safety to the environment and biosphere

🚧 Barriers to adoption

  • Adoption of transgenic VATs in some countries is hampered by the lack of appropriate governmental bodies to review and authorize use.
  • Breeding strategies and transport of seed must take account of government regulations and comply with containment policies.

Don't confuse: approval for one country does not mean approval elsewhere—each country conducts its own regulatory review.

🆚 Distinguishing transformation from gene editing

🆚 Key differences

The excerpt introduces gene editing as a separate category from transformation:

AspectTransformation (GM traits)Gene editing
MethodInserts DNA sequencesUses nucleases to make precise cuts in DNA
ResultTransgenic event with foreign DNATargeted changes (substitutions, deletions, insertions)
MechanismDirect insertionHarnesses organism's own DNA repair system
Regulatory statusSubject to government approval(Status not detailed in excerpt)

🧬 Gene editing techniques

Gene editing (or genome editing): a range of molecular techniques that facilitate targeted changes to be made in the genome, involving certain nucleases to make precise cuts in DNA.

  • Techniques are typically named for the type of nuclease used.
  • Example: CRISPR-Cas9 involves Clustered Regularly Interspaced Short Palindromic Repeats coupled with a programmable nuclease derived from bacteria (Cas9).

🌻 Gene editing VAT example

SU Canola™ (developed by Cibus):

  • Provides a non-transgenic option for weed control
  • Confers tolerance to sulfonylurea herbicides
  • Promoted for use with Draft™ herbicide from Rotam

Don't confuse: gene editing produces "non-transgenic" options (no foreign DNA inserted), whereas transformation produces transgenic events requiring different regulatory pathways.

25

VATs Based on Gene Editing

VATs Based on Gene Editing

🧭 Overview

🧠 One-sentence thesis

Gene editing offers a non-transgenic alternative for creating value-added traits (VATs) by making precise, targeted changes to an organism's genome using its own DNA repair system, enabling trait development that may face fewer regulatory hurdles than traditional genetic engineering.

📌 Key points (3–5)

  • What gene editing is: a range of molecular techniques using nucleases to make precise cuts in DNA, enabling substitutions, deletions, or insertions through the organism's own repair mechanisms.
  • How it differs from traditional GM: gene editing can produce changes without introducing foreign DNA, potentially creating "non-transgenic" options.
  • CRISPR-Cas9 example: one type of gene editing system using Clustered Regularly Interspaced Short Palindromic Repeats coupled with a bacterial nuclease.
  • Commercial examples: include herbicide-tolerant canola and non-browning mushrooms with extended shelf life.
  • Common confusion: gene editing vs genetic engineering—gene editing can work through the organism's own repair system without necessarily inserting foreign genes, though both create VATs.

🧬 What gene editing is and how it works

🔬 Core definition and mechanism

Gene editing (or genome editing): a range of molecular techniques that facilitate targeted changes to be made in the genome, involving the use of certain nucleases to make precise cuts in the DNA, which then can become the sites of base pair substitutions, DNA deletions or DNA insertions, harnessing the organism's own DNA repair system.

  • The key difference from random mutation: changes are targeted to specific locations.
  • The process relies on the plant's own DNA repair system to complete the edit.
  • Techniques are typically named for the type of nuclease (DNA-cutting enzyme) used.

🧰 CRISPR-Cas9 system

The excerpt highlights one prominent gene editing system:

  • Clustered
  • Regularly
  • Interspaced
  • Short
  • Palindromic
  • Repeats

Coupled with Cas9, a programmable nuclease derived from bacteria.

This system represents one approach among multiple gene-editing techniques available.

🌾 Commercial applications

🌿 SU Canola™

  • Developed by Cibus to provide a non-transgenic option for weed control.
  • Confers tolerance to sulfonylurea herbicides.
  • Promoted for use with Draft™ herbicide from Rotam.
  • Key point: marketed specifically as a non-transgenic alternative, suggesting regulatory or market advantages.

🍄 Non-browning mushrooms

  • Developed by Professor Yinong Yang at Pennsylvania State University.
  • Uses the CRISPR-Cas9 system.
  • Several genomic deletions (not insertions) achieved through gene editing.
  • Benefits: longer shelf life and less post-harvest loss.
  • Status: commercialization anticipated in the near future.

Example: A mushroom that normally browns quickly after cutting can now maintain appearance longer, reducing waste in the supply chain—all achieved by deleting genes rather than adding foreign DNA.

🔄 Integration with broader VAT strategy

🎯 Maximizing VAT use

The excerpt notes that gene-edited VATs fit into the same commercialization framework as other VATs:

  • VATs are branded (tradename, logo) to drive customer recognition and loyalty.
  • High demand and substantial development investment drive desire to maximize use.
  • From a breeding standpoint: integrating each VAT into a wide array of elite cultivars to maximize market penetration.

🧩 Stacking and pyramiding

  • Although the excerpt illustrates stacking with GM trait examples, any type of VATs may be pyramided in a cultivar.
  • Mixtures of combined VATs are common.
  • The same customer base demanding one type of VAT may demand others for maximum value.

Don't confuse: Gene editing as a creation method vs. trait integration as a deployment strategy—gene editing is one way to create a VAT, but once created, that VAT goes through the same breeding integration process as VATs created by other methods.

📋 Regulatory context

🏛️ Government approval requirements

The excerpt places gene-edited VATs within the regulatory framework:

  • VATs created through genetic engineering are subject to government regulation on a country-by-country basis.
  • Before approving cultivation or import, governmental bodies generally review:
    • Problem addressed and effectiveness of the proposed solution
    • Food safety
    • Safety to the environment and biosphere

🌍 Adoption challenges

  • Adoption of transgenic VATs in some countries is hampered by lack of appropriate governmental bodies to review and authorize use.
  • Breeding strategies and seed transport must account for government regulations and comply with containment policies.

Implication: Gene-edited VATs that avoid transgenic classification may face different (potentially simpler) regulatory pathways, though the excerpt does not explicitly state this—it only notes that non-transgenic options exist (e.g., SU Canola™).

26

Trait Integration

Trait Integration

🧭 Overview

🧠 One-sentence thesis

Trait integration through backcross breeding aims to transfer value-added traits into elite cultivars while recovering the complete, unaltered agronomic package of the elite parent, but linkage drag and residual donor DNA can interfere with performance recovery unless molecular markers are used to accelerate and improve the process.

📌 Key points (3–5)

  • Goal of trait integration: convert elite cultivars to include value-added traits (VATs) through backcross breeding while preserving all original performance attributes of the elite line.
  • Linkage drag problem: residual DNA from the trait donor parent near the desired gene reduces slowly across backcross generations and can carry deleterious alleles, mutations, or heterosis-reducing segments that interfere with performance recovery.
  • Theoretical vs. actual recovery: genetic theory predicts 99.22% recurrent parent (RP) germplasm after six backcrosses, but selection for the desired gene retains more non-recurrent parent (NRP) DNA, especially on the carrier chromosome and flanking regions.
  • Common confusion: the mean percentage of RP germplasm in a backcross generation vs. the distribution among individuals—not all progeny recover the same proportion, and selection for the trait skews the distribution.
  • Molecular markers solve key bottlenecks: marker-assisted backcrossing (MABC) trims ≥3 generations, eliminates linkage drag, accelerates trait fixation, and increases probability of performance equivalency.

🧬 What are value-added traits and how are they created

🧬 Types of VATs

Value-added traits (VATs) are traits integrated into crops to add value. The excerpt describes two main creation methods:

MethodDescriptionExample from excerpt
Genetic engineering (GM)Transgenic events requiring government approval; may involve "molecular stacking" (multiple genes in one event)SmartStax product: 8 genes delivered via 4 events (MON88017, MON89034, TC6275, DAS59122-7)
Gene editingTargeted genome changes using nucleases (e.g., CRISPR-Cas9) to make substitutions, deletions, or insertions via the organism's DNA repair systemSU Canola™ (sulfonylurea herbicide tolerance), non-browning mushrooms (genomic deletions for longer shelf life)

🏷️ Commercialization and stacking

  • VATs are branded (tradename, logo) to drive customer recognition and loyalty because they are in high demand and developed at substantial investment.
  • Stacking/pyramiding: combining multiple VATs (any type—GM, gene-edited, or conventional) in a single cultivar to maximize value for the customer base.
  • From a breeding standpoint, each VAT is integrated into a wide array of elite cultivars to maximize market penetration.

🔐 Regulatory requirements for GM VATs

Before approving cultivation or import, governmental bodies review:

  • Problem addressed and effectiveness of the proposed solution
  • Food safety
  • Safety to the environment and biosphere

Adoption is hampered in some countries by lack of appropriate review bodies. Breeding strategies and seed transport must comply with containment policies.

🔄 The backcross conversion process

🔄 General formula for single-gene integration

Backcross conversion: a breeding method where an elite line (recurrent parent, RP) is crossed with a trait donor (non-recurrent parent, NRP), followed by repeated backcrossing to the RP to recover the RP germplasm while retaining the desired gene.

Workflow: Elite Cultivar → backcross breeding → Elite Value-added Trait Cultivar

Steps (Table 1 in excerpt):

  1. Cross NRP to RP → F₁
  2. Select F₁ individuals with desired gene; backcross to RP → BC₁
  3. Repeat selection and backcrossing through BC₆ (generations 3–7)
  4. Self-pollinate BC₆ → BC₆S₁
  5. Continue selfing to BC₆S₂, BC₆S₃
  6. Identify line phenotypically similar to RP that stably expresses trait
  7. Create seed for performance equivalency testing (testcross if RP is hybrid parent, or self)

Total: 11 or more generations required.

📊 Theoretical recovery of RP germplasm

The general formula for estimated mean percentage of RP germplasm:

%RP = 1 − (1/2)^(n+1) where n = backcross generation number.

After six backcrosses (BC₆), progeny are on average 99.22% genetically similar to the RP (Fig. 12).

Key assumption: with each cross to the RP, the average percentage of NRP germplasm is reduced by half, assuming no selection.

📉 Distribution reality: not all individuals are equal

  • Each backcross generation represents a distribution of %RP germplasm among individuals.
  • Example: BC₁ generation has a mean of 75% RP germplasm with a standard deviation of 4.86% (Fig. 13).
  • Don't confuse: the population mean (e.g., 99.22% at BC₆) does not mean every individual has that percentage—there is variation around the mean.

⚠️ The linkage drag problem

⚠️ What is linkage drag

Linkage drag: residual NRP DNA in the chromosomal regions flanking a value-added trait, which remains under selection pressure because recombination is reduced due to linkage with the desired gene.

  • Selection for the desired gene "pulls along" linked DNA from the donor parent.
  • The effect depends on the genetic contents of these chromosomal segments.

📍 Where linkage drag persists

Computer simulation (Peng et al. 2014) on a 1788 centimorgan (cM) maize genome showed mean residual NRP germplasm (Table 2):

GenerationTotal genome NRP (cM)Non-carrier chromosomes NRP (cM)Carrier chromosome NRP (cM)Flanking region (20 cM around gene) NRP (cM)
BC₁1398.791240.62158.1719.53
BC₂973.90846.89127.0118.88
BC₃681.45578.13103.3218.26
BC₄480.31395.1585.1617.66
BC₅343.20271.4971.7117.12
BC₆248.19187.2360.9616.58

Key observations:

  • NRP germplasm on non-carrier chromosomes decreases incrementally.
  • NRP on the carrier chromosome decreases much less rapidly.
  • NRP in the flanking region (20 cM around the gene) remains somewhat stagnant through 6 backcrosses.

Relationship: 120 cM of NRP germplasm in a 1788 cM genome ≈ 96.6% RP.

🚫 Why linkage drag interferes with performance

Linkage drag can potentially affect performance for key traits like yield and stress tolerance because:

  1. Deleterious genes: NRP germplasm may contain harmful alleles, especially if the trait donor is non-elite.
  2. Somaclonal variation: if the introgressed factor resulted from genetic engineering, NRP DNA may have mutated during tissue culture.
  3. Heterosis reduction: in hybrid cultivars, if the trait donor is from the opposite heterotic group, this DNA may decrease potential heterosis in the converted hybrid.

Impact magnified: the accumulated effect of multiple introgressed genes increases the total linkage drag burden.

Goal threatened: linkage drag interferes with recovering the complete and unaltered agronomic package of the elite line targeted for conversion.

🔧 Stabilizing the trait through selfing

Once backcrossing achieves desired RP recovery, the desired gene must be fixed in homozygous state:

  • Self-pollinate BC₆ → BC₆S₁ (segregating for the desired gene)
  • Continue selfing (BC₆S₂, BC₆S₃) to produce materials for progeny testing and identify non-segregating lines.

🚀 Increasing efficiency with molecular markers

🚀 Technology needs for efficient trait integration

Strategically, technology addressing the following would increase efficiency, effectiveness, and rate of genetic gain:

IssueTechnology goal
Selection accuracyEffective screen for desired gene/trait; identification based on genotype (vs. phenotype)
AccelerationFaster cycling of generations; quicker RP recovery in backcrossing; fewer generations for trait fixation
Quality outcomesEliminating linkage drag; ensuring high degree of %RP recovery

🧬 Marker-assisted backcrossing (MABC) advantages

Molecular markers provide knowledge of genotypes among backcross progeny. Table 3 summarizes advantages:

Molecular markers can be used toAdvantage
Identify individuals that inherited the desired geneMore efficient than phenotypic selection
Select for recovery of RP germplasm in backcross generationTrim ≥3 generations from the TI process, saving 1–2 years in VAT hybrid development
Select against linkage dragIncrease probability of obtaining acceptable (quality) conversion
Identify homozygotes for VAT gene(s) in selfing generationsReduce number of generations to trait fixation

🎯 Three MABC selection schemes

With appropriate marker sets (Fig. 15):

  • Marker(s) for the desired gene
  • Markers providing genome coverage
  • Dense markers (one per cM in the 20 cM flanking region)

Implement three types of selection:

  1. Selection for the target gene/event: identify individuals carrying the desired genetic factor.
  2. Selection against linkage drag: identify individuals with crossovers in the flanking region.
  3. Selection for RP recovery: choose individuals in the upper tail of the backcross population distribution (higher %RP than the mean).

Example: instead of accepting the population mean of 99.22% RP at BC₆, select individuals with even higher RP recovery from the distribution.

🏗️ Multiple trait integration strategy

🏗️ Four steps in multiple trait integration

When integrating multiple VATs or genetic factors, pyramiding is necessary (Fig. 16):

  1. Step 1: (not detailed in excerpt)
  2. Step 2: (not detailed in excerpt)
  3. Step 3: (not detailed in excerpt)
  4. Step 4: (not detailed in excerpt)

The excerpt states the steps exist but does not describe them in detail.

🖥️ Computer simulation for strategic design

  • Multiple trait integration involves a considerable number of options when converting a cultivar for multiple genetic factors.
  • Computer simulation offers the opportunity to identify options that garner efficiency in terms of speed to market, rate of gain, conservation of resources, etc.

🌽 Example: 15-event conversion study

A study (Peng et al. 2014a, 2014b; Sun and Mumm 2015) explored limits of multiple trait integration and strategies for recovering equivalent performance in a corn hybrid converted for 15 events.

Context:

  • Traditionally, a limit of about 5 stacked genes was accepted practice.
  • 15 genetic factors was considered pushing the boundaries of backcross conversion.

Breeding scenario assumptions:

  • Some events were required in the male parent of the hybrid.
  • Decision: balance the 15 events between female and male RPs.
  • 8 events introgressed into female RP; 7 events into male RP.

(The excerpt ends before detailing the results or further strategy.)

27

Designing an Efficient Trait Integration Process

Designing an Efficient Trait Integration Process

🧭 Overview

🧠 One-sentence thesis

Computer simulation reveals that strategic sequencing of selection criteria (event, linkage drag, and recurrent parent recovery) across backcross generations enables efficient integration of multiple genetic factors while minimizing non-elite DNA and resource costs.

📌 Key points (3–5)

  • The core challenge: integrating multiple genetic factors (events) into elite cultivars while recovering target performance and minimizing linkage drag (unwanted non-elite DNA).
  • Three selection criteria: selection for the target event (E), selection against linkage drag (LD), and selection for recurrent parent (RP) genome recovery must be balanced across generations.
  • Common confusion: selecting for E+RP reduces total non-RP DNA but leaves high flanking region (FR) non-RP; selecting for E+LD reduces FR non-RP quickly but total non-RP remains high—optimal strategy requires switching between these approaches.
  • Population size matters: increasing population size during E+LD generations helps break through the ~1 cM FR non-RP equilibrium barrier.
  • Four-step integration process: Single Event Introgression → Event Pyramiding → Trait Fixation → Version Testing, each with distinct goals and resource requirements.

🎯 The marker-assisted backcross (MABC) framework

🎯 Three-part selection strategy

The excerpt describes a marker-assisted approach using dense markers (one per cM in the 20 cM flanking region) to implement three simultaneous selection goals:

  1. Selection for the target gene/event: ensuring individuals carry the desired genetic factor.
  2. Selection against linkage drag: identifying individuals with crossovers in the flanking region to reduce unwanted non-elite DNA.
  3. Selection for RP recovery: choosing individuals in the upper tail of the backcross population distribution to maximize elite germplasm recovery.

Linkage drag: residual non-recurrent parent (NRP) germplasm in the region flanking the introgressed event.

🧬 The 120 cM threshold

  • The excerpt states that 120 cM of NRP germplasm (~6.7% NRP) is the maximal amount consistent with recapturing target hybrid performance.
  • For single event conversions, the goal was set at ≤8 cM Total NRP including ~1 cM of FR NRP (flanking region non-RP).
  • Don't confuse: Total NRP is genome-wide; FR NRP is specifically the 20 cM region around the event—both must be controlled.

🖥️ Computer simulation study design

🖥️ The 15-event scenario

The Mumm Lab study explored converting a corn hybrid for 15 events—pushing beyond the traditional limit of ~5 stacked genes:

  • 8 events introgressed into the female RP.
  • 7 events introgressed into the male RP.
  • All events are new, requiring conversions for each.
  • Key constraint: FR NRP will be unalterable after Step 1 (once pyramiding begins).

📏 Five optimization criteria

To assess efficiency, the study defined criteria in priority order:

PriorityCriterionWhat it measuresWhy it matters
Highest1. Total NRP (cM)Genome-wide non-elite DNADetermines ability to recover equivalent performance
Highest2. FR NRP (cM)Non-elite DNA in flanking regionsDetermines effectiveness of integration outcome
High3. Time (generations)Speed to marketImpacts genetic gain in a major way
Medium4. Marker data points (MDP)Genotyping volumeBudget and labor costs
Medium5. Population size (N)Number of individualsGreenhouse, field space, manpower

The excerpt emphasizes: "Without this [recovering equivalent performance], all resource expenditures and time investment in Trait Integration to further improve the elite hybrid is meaningless."

🔄 Findings: the two-stage selection strategy

🔄 Why single-strategy approaches fail

Preliminary investigation revealed a fundamental trade-off:

  • E+RP selection (event + recurrent parent recovery):

    • Reduces Total NRP to ~12 cM after 10 backcross generations.
    • Problem: most residual NRP resides in the flanking region (FR).
  • E+LD selection (event + linkage drag):

    • Reduces FR NRP to ~1 cM by BC₃ or BC₄ and remains static.
    • Problem: Total NRP remains high.
    • The excerpt notes: "It seemed highly unlikely that FR NRP would be reduced much beyond 1 cM, even with extensive additional backcrossing."

✅ The optimal two-stage approach

The simulation identified a winning strategy:

3 generations of E+LD followed by 2 generations of E+RP

  • Result: Total NRP = 7.86 cM, FR NRP = 1.68 cM (nearly met the goal).
  • Logic: First reduce FR NRP to its equilibrium (~1 cM), then shift focus to reducing Total NRP genome-wide.
  • Still needed refinement because FR NRP was above the ~1 cM target.

🔍 Three-stage and combined strategies

The excerpt presents a comprehensive table comparing:

  • Three-stage: E+LD+RP selection in some or all generations.
  • Two-stage: E+LD for early generations, then E+RP.
  • Combined: E+LD followed by E+LD+RP.

Example: The three-stage strategy with 5 generations of E+LD+RP achieved 17.85 cM Total NRP and 1.42 cM FR NRP, but required 222K marker data points and 2000 total individuals.

Don't confuse: more selection criteria per generation does not always mean better outcomes—the two-stage approach with strategic switching outperformed many three-stage approaches.

📊 Refining with population size

📊 Breaking the FR NRP equilibrium

To fine-tune the strategy and reduce FR NRP below 1.68 cM, the study considered increasing population size during E+LD generations:

  • By increasing population size while maintaining the same selection intensity (selected proportion = 0.01), the probability of recovering individuals with less FR NRP increases.
  • Why: equilibrium for FR NRP is reached when residual NRP in the region is about 1 cM—larger populations provide more chances to find rare recombinants.

🎯 The optimized strategy

Final winning strategy:

  • 3 generations of E+LD with population size N=600.
  • 2 generations of E+RP with population size N=400.

Results:

  • Total NRP = 6.57 cM
  • FR NRP = 1.18 cM
  • Goal achieved!

Resource requirements:

  • 100,600 marker data points
  • 2,600 individual backcross progeny overall

The excerpt notes: "Costs for these could be estimated to provide a comprehensive view of the budget implications versus benefits of implementing this strategy."

📈 Comparison of population size variations

StrategyBC₁ E+LDBC₂ E+LDBC₃ E+LDBC₄ E+RPBC₅ E+RPTotal NRPFR NRPMDP (K)N total
14004004004004007.861.6894.02000
26006006004004006.571.18100.62600
38008008004004006.101.13107.23200
  • Strategy 2 meets the goal with reasonable resource investment.
  • Strategy 3 offers marginal improvement at significantly higher cost.

🧬 Choosing future donor parents

🧬 Using "clean" conversions

Once conversions with ~1 cM of non-elite DNA surrounding the event have been achieved:

"Clean" conversions: conversions with only ~1 cM of non-elite DNA surrounding the event of interest.

  • These represent excellent choices as donor parents for subsequent conversions.
  • Why: minimal linkage drag from the start reduces the backcrossing burden.

🔗 Genetic similarity to the recurrent parent

Another key factor: genetic similarity between donor parent (NRP) and the line targeted for conversion (RP).

  • If NRP is 30% similar to RP: complete conversion can be achieved by BC₄ using markers for E, LD, and RP.
  • If NRP is 86% similar to RP: complete conversion can be achieved by BC₃.

Logic: the goal of backcrossing is to recover the likeness of the RP; if the donor is already related to the RP, fewer backcross generations are needed.

🏗️ The four-step multiple trait integration process

🏗️ Overview of the four steps

The excerpt describes a structured pipeline for integrating multiple genetic factors:

  1. Single Gene/Event Introgression: backcrossing to introduce each event individually into the RP background.
  2. Gene/Event Pyramiding: combining multiple single-event lines through crosses.
  3. Trait Fixation: self-pollination to achieve homozygosity for all events.
  4. Version Testing: evaluating converted hybrids to confirm performance recovery.

🎯 Goals for each step

Single Event Introgression:

  • Outcome: k versions of each RP, each representing a single event introgression created through backcrossing.
  • Example: with 8 events being stacked in the female parent, k = 8 for RP_F.
  • At the close, backcross lines carry the desired event in heterozygous state.

Event Pyramiding:

  • Outcome: all events combined in the RP background in heterozygous state.
  • Method: symmetrical crosses.

Trait Fixation:

  • Outcome: all events in homozygous state.
  • Method: self-pollination.

🔗 Event pyramiding details

🔗 Symmetrical crossing scheme

To stack 8 events in RP_F:

  1. Generation 1: Cross pairs of single-event lines → double-event progeny.
  2. Generation 2: Cross double-event progeny → quad-event progeny.
  3. Generation 3: Cross quad-event progeny → all 8 events combined in heterozygous state.

Total: 3 generations required to combine all 8 events.

⚠️ Special consideration for linked events

The excerpt notes: "Although the simulation considered all events for a particular RP unlinked, in real life, linked events demand special attention in crossing to minimize the occurrence of repulsion linkages."

  • Repulsion linkage: when favorable alleles at two linked loci are on opposite chromosomes, making it harder to combine them.

🌱 Trait fixation and F₂ enrichment

🌱 The probability challenge

Theoretically, one generation of self-pollination (S₁) could recover at least one progeny homozygous for all 8 events (each segregating in Mendelian fashion).

Problem: the probability of recovering such an individual is very small.

  • For 8 events in heterozygous state (Aa), the probability of recovering one individual homozygous for all (AA) is (0.5)⁸ = 0.00390625 (about 1 in 256).

🔄 F₂ enrichment concept

To alleviate bottlenecking due to extremely low frequency of the desired genotype:

F₂ enrichment (Bonnett et al. 2005): introducing an extra generation of self-pollination to allow individuals that have at least one copy of every event to be advanced.

The strategy:

  1. S₁ generation: advance individuals with 8 events in heterozygous/homozygous state (AA/Aa for all events).
    • Probability per event: 0.75 (either AA or Aa).
    • Probability for all 8 events: (0.75)⁸ = 0.100112915 (about 1 in 10).
  2. S₂ generation: from the enriched S₁ population, recover individuals homozygous for all 8 events.
    • Probability: (0.5)⁸ = 0.00390625 (same as before, but now from a pre-selected population).

Benefit: the extra generation increases the pool of favorable individuals, reducing the risk of failing to recover the desired genotype.

📐 Determining minimum population size

📐 Three critical factors

The excerpt emphasizes:

  1. Population size (N): how many individuals to grow.
  2. Expected frequency of the desired genotype (q): a probability based on Mendelian genetics.
  3. Number of individuals to be recovered (x): how many plants with the desired genotype are needed.

🧮 The binomial formula

Based on Sedcole (1977), the minimum population size is determined by:

General formula: N is calculated such that the probability p of recovering at least x individuals with the desired genotype (frequency q) is achieved.

Special case when x=1 (recovering at least one individual):

In words: N equals the natural logarithm of (1 minus the desired probability) divided by the natural logarithm of (1 minus the genotype frequency).

🧪 Worked example

To recover with 99% probability at least one plant with a genotype expected at 0.25 frequency:

  • p = 0.99
  • q = 0.25
  • N = ln(1 - 0.99) / ln(1 - 0.25) = ln(0.01) / ln(0.75) ≈ 17

Interpretation: 99% of the time, 17 plants are needed to recover ≥1 with the desired genotype when that genotype occurs in 1 in 4 individuals.

📊 Sedcole's reference table

The excerpt provides a handy table for quickly estimating population size:

  • p = probability of success (0.95 or 0.99).
  • q = frequency of desired genotype (1/2, 1/3, 1/4, 1/8, 1/16, 1/32, 1/64).
  • r = number of plants to be recovered (1, 2, 3, 4, 5, 6, 8, 10, 15).

Example from the table:

  • To recover 1 plant with 99% probability when genotype frequency is 1/8: need 35 plants.
  • To recover 10 plants with 99% probability when genotype frequency is 1/8: need 146 plants.

🧬 Expected genotype frequencies in the pipeline

🧬 Frequency table for 8-event integration

GenerationGenotype per eventFormulaProbability
Pyramid 2 EventsAa(0.5)²0.25
Pyramid 4 EventsAa(0.5)⁴0.0625
Pyramid 8 EventsAa(0.5)⁸0.00390625
S₁ (8 events heterozygous/homozygous)AA/Aa(0.75)⁸0.100112915
S₂ (8 events homozygous)AA(0.5)⁸0.00390625

Key insight: the probability drops dramatically as more events are combined—this drives the need for larger populations or enrichment strategies.

🎯 Overarching objectives and assumptions

🎯 The performance recovery goal

The breeding scenario assumed:

  • The target hybrid yielded 14.72 tons per hectare (235.6 bushels per acre) on average.
  • Overarching objective: recovery of a converted hybrid that yields within 3% of the target hybrid.

Why this matters: the entire integration process is meaningless if the converted hybrid does not perform equivalently to the original elite hybrid.

🔧 Scenario assumptions recap

  • Hybrid conversions involved stacking 8 events in RP_F and 7 events in RP_M.
  • On each side of the pedigree, events are on different chromosomes (unlinked).
  • Up to 5 versions of each converted hybrid were considered.
  • Residual NRP germplasm in the flanking region (FR NRP) will be unalterable after Step 1 (once pyramiding is initiated).
  • Population size of 400 each generation for baseline simulations, with final selection of the top four plants (selected proportion = 0.01).

Don't confuse: "unalterable after Step 1" means that once single-event conversions are pyramided, you cannot go back and reduce FR NRP—this is why optimizing Step 1 is the highest priority.

28

Practical Considerations in Trait Integration

Practical Considerations

🧭 Overview

🧠 One-sentence thesis

Success in recovering desired genotypes during trait stacking depends critically on calculating the right population size based on the expected frequency of the target genotype and the desired probability of recovery.

📌 Key points (3–5)

  • Two key factors: population size and expected frequency of the desired genotype determine success in recovering specific genotypes.
  • Probability-based planning: expected frequency is calculated using Mendelian genetics probabilities (e.g., 0.5 raised to the power of the number of events).
  • Binomial distribution formula: allows breeders to determine the minimum population size needed to recover a specified number of individuals with a given probability of success.
  • Common confusion: don't confuse the frequency of the genotype in the population (q) with the probability of achieving the breeding goal (p)—they are distinct parameters in the calculation.
  • Efficiency gains: seed chipping and rapid cycling (speed breeding) can accelerate the trait fixation process and reduce resource waste.

🧮 Core calculation framework

🎯 What determines success

The excerpt identifies three critical factors that must be specified for each generation:

  • The population size (how many plants to grow)
  • The expected frequency of the desired genotype in that population
  • The number of individuals to be recovered

All three must be stated clearly before planting.

📊 Expected frequency calculation

Expected frequency of the desired genotype: a probability function based on Mendelian genetics.

The excerpt provides a table showing how frequency changes across generations:

GenerationGenotypeFormulaProbability
Pyramid 2 eventsAa(0.5)²0.25
Pyramid 4 eventsAa(0.5)⁴0.0625
Pyramid 8 eventsAa(0.5)⁸0.00390625
S1 with 8 events (het/hom)AA/Aa(0.75)⁸0.100112915
S2 with 8 events (hom)AA(0.5)⁸0.00390625
  • More events stacked = lower frequency of the desired genotype
  • The probability drops exponentially as the number of events increases

🔢 Determining minimum population size

🧪 The binomial distribution approach

The excerpt references Sedcole (1977) and provides formulas based on binomial distribution to calculate minimum population size.

Key variables:

  • N = minimal population size needed
  • x = number of recovered individuals with the desired genotype
  • p = probability of achieving the breeding goal
  • q = frequency of the desired genotype in the population

💡 Worked example

The excerpt provides a concrete calculation:

  • Goal: recover at least one plant (x=1) with 99% probability (p=0.99)
  • Genotype frequency: 0.25 (q=0.25, meaning 1 in 4 plants)
  • Result: 17 plants needed

Interpretation: 99% of the time, growing 17 plants will yield at least one with the desired genotype when that genotype occurs in 1 out of every 4 individuals.

Don't confuse: the 99% is the confidence level (how sure you want to be), while 0.25 is the biological frequency (how common the genotype is).

📋 Population size reference table

The excerpt includes a table from Sedcole (1977) showing required population sizes for different scenarios:

  • Rows vary by probability (p = 0.95 or 0.99) and genotype frequency (q = 1/2, 1/4, 1/8, etc.)
  • Columns show different numbers of plants to recover (r = 1, 2, 3, etc.)
  • Example: to recover 1 plant with 99% probability when frequency is 1/8, you need 35 plants

Pattern: rarer genotypes (smaller q) require much larger populations.

⚡ Efficiency strategies

🌱 Seed chipping advantage

With seed chipping, selection can be performed before planting so resources are focused only on the individuals with the desired genotype.

How it works:

  • Genotype plant materials before planting (seed chipping) or at seedling stage (leaf tissue sampling)
  • Selection happens before field planting
  • Resources go only to individuals with the desired genotype

Benefits:

  • Reduced waste of field space and inputs
  • Decreased cycle time because S2 outcome is known before planting
  • If testcrossing is needed, S2 materials can be planted appropriately to produce hybrids immediately

🔄 Rapid cycling (speed breeding)

The excerpt mentions achieving more than one generation per year through:

  • Off-season nurseries
  • Continuous nurseries in tropical locations
  • Greenhouse facilities

Example: with corn, continuous nurseries have enabled four generations per year.

This accelerates the entire trait integration process across backcrossing, pyramiding, and trait fixation stages.

🎯 Trait fixation outcomes

🌾 Breeder seed production

Individuals confirmed to be fixed (homozygous) for all introgressed genetic factors are:

  • Self-pollinated to produce seed
  • Bulked to comprise the converted new elite line
  • Handed off to Supply Chain once performance equivalency is confirmed

✅ Version testing

The final step assesses whether the converted cultivar performs equivalently to its unconverted counterpart.

Success factors:

  • Amount of residual non-recurrent parent germplasm in the converted cultivar
  • Probability of recovering at least one version with equivalent performance
  • Number of stacked versions of each recurrent parent

Important note: the excerpt mentions that Sun and Mumm (2015) showed high likelihood of recovering an equivalent conversion even with 15 stacked events, which is encouraging for complex trait stacking projects.

29

Response to Selection, R

Response to Selection, R

🧭 Overview

🧠 One-sentence thesis

Response to selection (R) serves as an efficiency criterion for optimizing cultivar development pipelines by measuring genetic gain and guiding breeding process design choices.

📌 Key points (3–5)

  • What R measures: the shift in trait means from base population to progeny generation after selection; represents efficiency of the breeding process.
  • Key formula components: R depends on selection intensity (i or k), narrow-sense heritability (h²), and genetic/phenotypic variance ratios.
  • Breeder-controlled factors: parental control (c), form of selection unit (f), population size (affecting k), and cycle length all influence R and genetic gain rate (ΔG).
  • Common confusion: selection intensity vs population size—higher k requires larger populations to maintain genetic diversity; selecting 10% from 10 individuals leaves no diversity, but 10% from 250 leaves 25 individuals.
  • Optimization strategy: maximize R by increasing the numerator (selection intensity, heritability, genetic variance) or reducing the denominator (phenotypic variance).

🔄 Core concept of response to selection

🔄 What R represents

Response to selection (R): the shift in trait means from the base population (μ₀) to the progeny generation after selecting top-performing parents.

  • R measures how much genetic improvement occurs in one selection cycle.
  • The process: select top individuals from base population → use as parents → evaluate progeny → best progeny become next cycle's parents.
  • R serves as an efficiency indicator for comparing different breeding process designs.

📐 Relationship to heritability

Narrow-sense heritability (h²): the portion of genetic variance that can be transmitted to the next generation; equals the ratio of R to S (selection differential).

  • Heritability reflects what proportion of observed superiority in parents will appear in offspring.
  • R can be expressed as: (selection intensity) × (narrow-sense heritability) × (additive genetic standard deviation).
  • The square root of h² represents accuracy of selection.

⏱️ Genetic gain over time

The rate of genetic gain (ΔG) incorporates cycle length: ΔG = R divided by cycle length.

  • Shorter breeding cycles increase the rate of improvement even if R per cycle stays constant.
  • Optimization goal: maximize ΔG by maximizing R and minimizing time.

🎯 Breeder-controlled factors affecting R

🎯 Parental control factor (c)

The parental control factor reflects the relationship between the selection unit (what you evaluate) and the recombination unit (what produces the next generation).

Three scenarios:

c valueSelection & recombination relationshipExample
c = ½Same unit; only female parents selectedMass selection without pollen control
c = 1Same unit; both parents selectedStandard selection
c = 2Different units; recombination unit is selfed seed or cloneControlled pollination of selected lines
  • Key insight: Mass selection with pollen control (c = 2) is twice as effective as without pollen control (c = ½), all else equal.
  • Conducting controlled pollinations using only selected lines doubles genetic gain compared to uncontrolled pollination.
  • Don't confuse: the selection unit is what you measure; the recombination unit is what you cross.

🧬 Form of selection unit (f)

The variable f relates to what type of progeny are evaluated and their inbreeding level.

Additive variance captured by different family types:

Family typef valueAdditive variance proportionWith full inbreeding (F=1)
Half-sib¼¼σ²ₐ¼(1+F)σ²ₐ
Full-sib½½σ²ₐ½(1+F)σ²ₐ
Selfedvaries(1+F)/2 × σ²ₐσ²ₐ (for doubled haploids)
Clones1σ²ₐσ²ₐ
  • Example comparison: Full-sib selection is more effective than half-sib selection because full-sibs account for twice the additive variance.
  • F is the inbreeding coefficient: F = 0 in F₂ generation; F = 1 for fully homozygous lines.
  • Only additive variance appears in the numerator because only additive variation transmits to progeny in diploid species; dominance variance contributes only to the denominator.

📊 Selection intensity (k) and population size

k represents the number of phenotypic standard deviations between the selected group's mean and the base population mean.

  • k is equivalent to i (selection intensity) in earlier formulas.
  • Critical relationship: Individuals in distribution extremes are superior but rare; larger populations increase the chance of finding and selecting them.
  • Increasing k without increasing population size risks depleting genetic diversity.

Concrete example from the excerpt:

  • 10% selection from population of 10 → 1 individual selected → no diversity remains
  • 10% selection from population of 250 → 25 individuals selected → sufficient diversity maintained

📈 Relationship between k, proportion selected (p), and threshold

k relates to p (proportion selected) and the truncation point.

Common values:

p (%)k (threshold)i (selection intensity)
0.13.0903.367
12.3262.665
51.6452.063
101.2821.755
5000.798
  • Breeders often set p at certain values; k translates to values from the Z statistical table.
  • Note: k values for 5% and 1% selection (1.645 and 2.326) are familiar statistics from standard normal distribution tables.

🔧 Optimization strategies

🔧 Maximizing the numerator

R can be expressed as a fraction; maximize R by enlarging the numerator:

  • Increase selection intensity (k): test more lines to find superior individuals in distribution extremes.
  • Improve heritability estimation: better measurement reduces environmental noise.
  • Increase additive genetic variance: maintain diverse breeding populations.
  • Optimize parental control: use controlled pollinations (c = 2) rather than open pollination (c = ½).
  • Choose better selection units: full-sib or selfed progeny capture more additive variance than half-sibs.

🔧 Reducing the denominator

The excerpt notes that R can also be optimized by reducing phenotypic variance (σₚ) in the denominator.

  • Lower phenotypic variance means genetic differences are easier to detect.
  • Reducing environmental variation through better experimental design improves selection accuracy.
  • The excerpt mentions "Partitioning Phenotypic Variation" as the next topic but does not elaborate further in the provided text.

🎯 Pipeline optimization goal

Optimization of the product pipeline promotes maximal response to selection in the shortest amount of time at comparative cost to produce improved cultivars that meet the needs of farmers and end-users.

  • Use R as an efficiency criterion to compare various process design options.
  • Balance multiple factors: genetic gain per cycle, cycle length, population size, testing costs, and genetic diversity maintenance.
  • The breeder controls process design choices that directly impact R and ΔG.
30

Partitioning Phenotypic Variation

Partitioning Phenotypic Variation

🧭 Overview

🧠 One-sentence thesis

Breeders can maximize genetic gain (R) by partitioning phenotypic variance into its components—genetic, environmental, error, and genotype-by-environment interaction—and then reducing non-genetic sources through strategic trial design choices such as increasing replications, locations, years, and plants per plot.

📌 Key points

  • What phenotypic variation includes: genotype, environment, genotype-by-environment interaction (GxE), and experimental error.
  • How to maximize R: increase the numerator (selection intensity, heritability) or reduce the denominator (phenotypic standard deviation) by controlling variance components.
  • Trial design levers: increasing replications (r), locations (l), years (y), or plants per plot (n) all reduce phenotypic variance, but locations and years have greater impact because they appear in more denominators.
  • Common confusion: GxE vs. environmental variance—GxE is the interaction that changes genotype rankings across environments, not just environmental noise; it requires multi-location or multi-year testing to measure and reduce.
  • Trade-offs: more locations cost more money; more years cost more time; breeders must balance genetic gain per year against resource constraints.

🧮 Components of phenotypic variance

🧮 The basic partitioning formula

Phenotypic variance (σ²ₚ) is a function of variation attributable to genotype, environment, genotype-by-environment interaction (GxE), and error.

The excerpt expresses the denominator σₚ (phenotypic standard deviation) as:

σₚ² = σ²ₑ / r + σ²(GxE) / (r × l) + σ²g

Where:

  • σ²ₑ = experimental error
  • σ²(GxE) = variance due to genotype-by-environment interaction
  • σ²g = genetic variance
  • r = number of replications
  • l = number of environments (locations, seasons, years, cultural practices, etc.)

🔬 Expanded phenotypic variance formula

The excerpt provides a more detailed breakdown:

σₚ² = (σ²w / n + σ²r) / r + (σ²(GxLxY) + σ²(GxY) / y + σ²(GxL) / l) / (r × l × y) + σ²g

Where:

  • σ²w = within-plot experimental error variance
  • σ²r = variance among replications
  • σ²(GxLxY) = variance due to genotype × location × year interaction
  • σ²(GxY) = variance due to genotype × year interaction
  • σ²(GxL) = variance due to genotype × location interaction
  • n = number of plants per plot
  • y = number of years of testing

🌱 Within-plot error partitioning

The within-plot experimental error variance (σ²w) can be further partitioned:

σ²w = σ²ₑ(micro) + σ²g(within)

  • σ²ₑ(micro): micro-scale environmental effects within the plot that cause genetically identical plants to perform differently (e.g., soil fertility, soil type, soil moisture, shading).
  • σ²g(within): within-plot genetic variation attributable to segregation, such as might occur before lines are fully inbred.

Don't confuse: within-plot genetic variation (σ²g(within)) with the main genetic variance (σ²g); the former is unwanted noise from incomplete inbreeding, while the latter is the signal breeders want to capture.

🎯 Maximizing R through trial design

🎯 Reducing the denominator strategy

The excerpt states that R can be optimized by reducing the denominator (σₚ) in the response-to-selection equation, not just by enlarging the numerator.

Key principle: Every variance component in σₚ² that is not genetic variance (σ²g) dilutes the breeder's ability to identify superior genotypes; reducing these components increases heritability and thus R.

📐 Increasing replications, locations, and years

Design parameterEffect on varianceImpact notes
Replications (r)Reduces σ²ₑ/r and components in (r × l × y) denominatorsAppears in fewer denominators; smaller relative effect
Locations (l)Reduces components in (r × l) and (r × l × y) denominatorsGreater effect than r; appears in more denominators
Years (y)Reduces components in (r × l × y) denominatorsGreater effect than r; appears in more denominators

The excerpt explicitly states:

"Relatively speaking, increasing locations or years in testing has a greater effect than increasing replications (that is, l and y have the opportunity to reduce more components of variation since they are featured in the denominators more often than r)."

Example: If a breeder doubles replications from 2 to 4, σ²ₑ/r is halved. But if the breeder doubles locations from 2 to 4, multiple GxE components are halved simultaneously.

🌾 Increasing plants per plot (n)

The excerpt shows that σ²w is divided by n (number of plants per plot):

σₚ² includes the term (σ²w / n + σ²r) / r

  • When there is only 1 plant per plot, n = 1, so σ²w is not reduced.
  • In family selection with 30 plants per plot, n = 30, so σ²w is divided by 30.

The excerpt provides a table showing the effect of increasing n on a variance component (with constant σ²w):

nVariance component value
122.3
215.8
510.0
107.1
304.1
902.4

Interpretation: Increasing from 1 to 30 plants per plot reduces this component from 22.3 to 4.1—a roughly 5-fold reduction.

🧹 Controlling experimental error

The excerpt notes:

"Furthermore, error variance components, σ²ₑ and σ²r, can be reduced by controlling human error such as mistakes in recording the evaluation data."

  • Uniform fields contribute to less variation within plots (smaller σ²w) and among replications (smaller σ²r).
  • Careful data recording reduces measurement error.

🌍 Genotype-by-environment interaction (GxE)

🌍 What GxE means and why it matters

GxE interferes with the ranking of the progeny and identification of top performers; it has serious ramifications for accuracy in selection.

  • GxE is not just "environmental noise"; it is the interaction that causes genotypes to rank differently across environments.
  • Example: Genotype A performs best in Location 1 but poorly in Location 2, while Genotype B shows the opposite pattern—this is GxE.
  • Why it matters: If a breeder selects based on performance in only one environment, the selected genotypes may not perform well in the target production region.

🌍 Components of GxE variance

The excerpt expands GxE into three components:

  • σ²(GxLxY): genotype × location × year interaction (three-way)
  • σ²(GxY): genotype × year interaction (two-way)
  • σ²(GxL): genotype × location interaction (two-way)

Note: The excerpt states that σ²(GxY) and σ²(GxLxY) "cannot be measured or effectively reduced without testing over multiple years."

🌍 Reducing GxE impact

The excerpt provides two strategies:

  1. Evaluate progeny in multiple locations and over multiple years: This directly measures and reduces the GxE components.
  2. For crops with large GxE: Maximize locations (l) at the expense of replications (r)—i.e., use 1-rep trials at the greatest number of locations possible.

Trade-offs:

  • Cost: With a fixed number of reps, it is more expensive to include more locations than to have multiple reps at fewer locations.
  • Time: Testing over multiple years takes more time; as an alternative, additional locations may be substituted for added years.

Don't confuse: Adding locations vs. adding years—both reduce GxE, but locations cost more money while years cost more time; the excerpt suggests locations can sometimes substitute for years.

🎲 Selection intensity and population size

🎲 The relationship between i, p, and population size

The excerpt introduces selection intensity (i) and its relationship to the proportion selected (p):

  • i (selection intensity): the number of phenotypic standard deviations that the mean of the selected individuals differs from the base population mean.
  • p: the proportion of selected individuals in the base population.
  • The excerpt references Falconer (1989) showing the relationship between p (i.e., the proportion selected) and i and x, where x is the difference between the threshold point of selection (truncation point) and the base population mean, expressed in standard units.

🎲 Common values of i and p

The excerpt provides a table of commonly used values:

p (%)xi
0.13.0903.367
12.3262.665
51.6452.063
101.2821.755
250.6741.271
5000.798

Note: The excerpt points out that x for 5% and 1% are "the familiar statistics from the Z Table, 1.645 and 2.326, respectively."

🎲 Population size and genetic diversity

The excerpt emphasizes:

"Although individuals in the extremes of the distribution may be superior, they are rare. The chance of producing, identifying, and selecting individuals in the extremes of the distribution curve is enhanced as the number of lines tested increases."

Key warning: "It is difficult to increase i without concomitantly increasing the population size. Otherwise, the risk is depleting genetic diversity."

Example from the excerpt:

  • 10% selection from a population size of 10 means 1 individual is selected → no genetic diversity remains for the next round.
  • 10% selection from a population size of 250 means 25 individuals are selected → sufficient diversity remains.

Additional benefit: A sufficient number of selected individuals to be recombined for the next cycle of selection contributes to reducing the potential effects of genetic drift.

🗺️ Choosing testing environments

🗺️ Why site choice is critical

The excerpt states:

"Choice of testing sites is critical to the selection decisions that will be made in the process of cultivar development. Selections will be made based on phenotypic performance at these sites for traits pertinent to the product target."

  • Testing sites are assumed to represent the target production environments.
  • Performance at these sites determines which genotypes are selected and advanced.

Implication: If testing sites do not represent the target region, selected cultivars may fail when deployed commercially.

31

Choosing Testing Environments to Maximize R

Choosing Testing Environments to Maximize R

🧭 Overview

🧠 One-sentence thesis

The usefulness of a testing environment for cultivar development depends on both its ability to differentiate genotypes and its genetic correlation with the target market environment, and breeders must strategically choose test sites that maximize response to selection (R) within budget constraints.

📌 Key points (3–5)

  • What makes a test site useful: ability to differentiate genotypes, accuracy of selection (heritability), and correlation with the target environment.
  • GxE interaction reduces selection accuracy: genotype-by-environment interaction interferes with ranking progeny and identifying top performers; it can be reduced by testing at multiple locations and years.
  • Three patterns of GxE: parallel (no interaction), non-parallel (rankings hold), and crossover (rankings flip); crossover interaction signals distinct mega-environments requiring separate selection strategies.
  • Common confusion: high heritability alone is not enough—a test site must also be representative of (genetically correlated with) the target environment to be truly useful.
  • Tools for choosing sites: heritability-adjusted GGE biplots visualize both the discriminating power (√h²) and representativeness (genetic correlation) of test environments, helping breeders allocate budgets effectively.

🎯 What defines a useful testing environment

🎯 Three criteria for usefulness

The excerpt identifies three characteristics that determine the value of a testing site:

  • Ability to differentiate between genotypes: the site must reveal genetic differences and their magnitude.
  • Accuracy of selection: measured by heritability (h²), the signal-to-noise ratio in estimating breeding value.
  • Correlation with target environment: performance in the test site must predict performance in the market region.

Accuracy of selection (R) is defined as the correlation between breeding value and phenotypic value and can be computed as the square root of narrow-sense heritability.

📏 The usefulness measure

Allen et al. (1978) proposed that the appropriate measure of a test environment's usefulness is:

  • Usefulness depends on both the square root of heritability in the test environment and the genetic correlation (rg) between test and target environments.
  • This means a highly heritable site that does not correlate with the target is less useful than a moderately heritable site that is representative.
  • Example: A test site with very low error variance (high h²) but atypical weather patterns may rank genotypes accurately for that site but fail to predict performance in farmers' fields.

Don't confuse: Heritability alone ≠ usefulness. A site can be discriminating (high h²) but not representative (low rg), making selections misleading.

🌍 The challenge of representing target environments

🌍 The sampling problem

  • The target market region may contain hundreds of thousands of "environments" defined by geography, altitude, season, soil types, topography, tillage, fertilization, water regime, etc.
  • Prospective testing sites are assumed to represent a sample of these environments.
  • Selection aims to identify genotypes with high mean value across the target set of environments.

🔍 Key questions for site selection

The excerpt lists strategic questions breeders must answer:

  • Are there multiple mega-environments (distinct environment groups) within the target market region?
  • Within a specific mega-environment, what are the most representative and discriminating test sites?
  • How many test sites and replications can be implemented each year within budget?

Why it matters: With a fixed budget, understanding mega-environments and choosing the most informative test sites is critical to identifying "best" genotypes.

🔄 Genotype-by-environment (GxE) interaction patterns

🔄 Three patterns of interaction

The excerpt describes three patterns illustrated in Figure 2:

PatternDescriptionImplication for selection
ParallelNo interaction; differences between entries do not change across environmentsRankings are stable; one cultivar can serve the whole region
Non-parallelDifferences change in magnitude but rankings still holdSelection is still effective across the region
CrossoverDifferences change substantially; rankings flipA top entry at one location may be the worst at another; signals distinct mega-environments

🎯 Strategic response to crossover interaction

  • When consistent patterns of GxE occur over years, groups of locations consistently share the best set of genotypes.
  • This repeatable pattern can be exploited in cultivar development.
  • Strategy: Selection will focus on specifically adapted genotypes for each mega-environment.
  • Each mega-environment becomes its own "target environment," and test sites within it must be representative of that target.

Example: If northern and southern regions consistently favor different genotypes, breeders should develop separate cultivars for each region rather than seeking one widely adapted variety.

⚠️ Impact on selection accuracy

  • GxE interferes with ranking progeny and identifying top performers, having serious ramifications for accuracy.
  • Variation due to GxE is captured in variance components for genotype-by-location-by-year, genotype-by-year, and genotype-by-location interactions.
  • Don't confuse: GxE is not random error; it is systematic interaction that can reveal meaningful mega-environments.

📉 Reducing GxE impact

📉 Multiple locations and years

  • The impact of GxE can be reduced by evaluating progeny in multiple locations and over multiple years.
  • For crops with large GxE, the greatest genetic gain per year is realized by maximizing the number of locations at the expense of replications (i.e., 1-rep trials at the greatest number of locations possible).
  • Trade-off: Cost—with a fixed number of reps, it is more expensive to include more locations than to have multiple reps at fewer locations.

⏱️ Time vs. space trade-offs

  • Variance components for genotype-by-year interactions cannot be measured or effectively reduced without testing over multiple years.
  • Trade-off: Time.
  • Alternative option: Additional locations may be substituted for added years.

Example: Instead of testing at 3 locations for 3 years, a breeder might test at 6 locations for 2 years to reduce development time while maintaining information about GxE.

🛠️ Tools for evaluating testing environments

🛠️ Cluster analysis and correlation

  • Cluster analysis has been used to identify similar types of environments and categories of environments.
  • Performance of test entries at one location can be correlated with performance of the same entries at other locations and overall.

📊 AMMI and GGE biplot analysis

AMMI (Additive Main Effects and Multiplicative Interaction) analysis:

  • Dissects GxE via principal components analysis.
  • Characterizes yield stability over environments.
  • Produces a biplot to graphically display relationships of environments with each other and with test entries.

GGE biplot:

  • Involves only genotype and GxE interaction effects in principal components analysis.
  • A biplot displaying environments and entries based on the first two principal components (PCA1 and PCA2) provides insights into:
    • The number of mega-environments represented in the data.
    • Which test environments are most representative and discriminating.

📐 Heritability-adjusted GGE biplot

This specialized analysis (Yan and Holland 2010) provides the most actionable information:

  • When scaled by heritability: The length of the vector from the origin to the environment is proportional to the square root of heritability (√h²) in that environment.
  • Angles between vectors: Approximate the genotypic correlation between environments.
  • Usefulness approximation: Provides approximations of usefulness for each test environment, proportional to the predicted response to selection (R) in the mega-environment based on data from the test environment.

Why it matters: This delivers information breeders can use to choose environments that maximize R within budgetary constraints.

🎯 Interpreting the biplot (Figure 4 example)

The excerpt describes a heritability-adjusted GGE biplot showing five environments in one mega-environment:

  • Usefulness ranking: QC1 > QC2 > NB > PEI > QC3 (based on vector length, reflecting √h²).
  • Representativeness: Environments with shorter distances to the Target Environment Axis (TEA) are more representative.
  • QC3 was least representative despite being part of the mega-environment.

Example interpretation: QC1 has both high discriminating power and good representativeness, making it the most valuable test site; QC3 might differentiate genotypes well but doesn't predict target performance reliably.

Don't confuse: A long vector (high √h²) means good discrimination, but proximity to the TEA means representativeness—both are needed for maximum usefulness.

32

Improving Breeding Efficiency Through the Use of Technology

Improving Breeding Efficiency Through the Use of Technology

🧭 Overview

🧠 One-sentence thesis

Technology such as molecular markers and doubled haploidy can significantly boost breeding efficiency by reducing cycle time, improving selection accuracy, and increasing genetic gain when strategically integrated into the product pipeline.

📌 Key points (3–5)

  • Technology definition: Applied science tools, machines, or methodologies used to solve real-world problems in crop improvement, including DNA-based selection, drones, and doubled haploidy processes.
  • Two key technologies: Molecular markers (for indirect selection) and doubled haploidy (for accelerating homozygous line development) are highlighted as examples with high potential impact.
  • Optimization principle: The value of any technology is maximized by fine-tuning the entire product pipeline to exploit the opportunities the technology creates for increasing genetic gain and reducing development time.
  • Common confusion: Technology adoption alone doesn't guarantee efficiency—the breeding process must be redesigned around the technology's advantages (e.g., combining MAS with estimated breeding value or using F₂ donors instead of F₁ for doubled haploidy).
  • Cost-benefit evaluation: Technologies should be judged by their ability to increase genetic gain and their cost-benefit ratio, not just time savings.

🧬 Marker-Assisted Selection (MAS)

🧬 What MAS does

Marker-Assisted Selection (MAS): Using molecular markers to select individuals with favorable alleles at loci controlling traits of interest, based on markers in close proximity to those loci.

  • Selection is based on genotype at marker loci rather than direct phenotypic observation.
  • Particularly valuable in complex crops like potato (highly heterozygous tetraploid with 40+ target characteristics).
  • Example: In potato, five distinct genotypes can exist at any locus (AAAA, AAAa, AAaa, Aaaa, aaaa), making progeny testing difficult—markers simplify this.

🎯 Ideal marker characteristics

The excerpt identifies five properties of ideal molecular markers:

PropertyDescription
Co-dominantReveals detailed allelic composition
"Perfect"Located within the gene itself to avoid recombination between marker and gene
ReproducibleConsistent assay results across different laboratories
High-throughput (HTP)Amenable to automation for fast, widespread application
Cost effectiveEconomically viable for routine use
  • Single Nucleotide Polymorphisms (SNPs) are currently the most widely used marker type.

✅ Seven advantages of MAS over phenotypic selection

  1. Easier screening: Some traits (e.g., potato cyst nematode resistance) are simpler to screen with markers.
  2. Earlier screening: Selection can occur at earlier plant development stages.
  3. Greater accuracy: Eliminates classification errors caused by environmental effects.
  4. Location/season flexibility: Selection can be applied outside the market region and growing season (off-season nursery or greenhouse).
  5. Larger populations: High-throughput operation enables larger population sizes and higher selection intensity.
  6. Fewer testing years: Reduces the number of years needed for evaluation.
  7. Lower cost: May be less expensive than phenotypic evaluation.
  • Don't confuse: Cost savings can be just as important as time savings in pipeline optimization—sometimes more important depending on the situation.

💰 Real-world cost and time savings

Potato disease resistance example (Slater et al. 2013):

  • MAS screening in second field generation (G₂) cost 37.3% of phenotypic evaluation cost.
  • Additional benefit: Multiplexing marker assays for other single-gene traits or major genes increases cost savings further.
  • Time savings: One year reduction in cultivar development.

Enhanced strategy (Slater et al. 2014):

  • Combined MAS with estimated breeding value (EBV) selection.
  • G₂ clones with high EBV scores but lacking disease resistance are recycled to increase favorable allele frequency for other traits (not discarded).
  • Selected individuals advanced to regional trials, reducing testing by two years.
  • Total time savings: Three years overall.

🌱 Doubled Haploidy (DH)

🌱 What doubled haploidy does

Doubled haploidy: A technology that bypasses the inbreeding process, providing a quick route to homozygosity with high fidelity.

  • Addresses the major time lag between F₂ and fully inbred line development.
  • Manipulates the double fertilization process (normally produces 2N embryo and 3N endosperm).
  • Available for more than 250 plant species.
  • Example in maize: Uses an in vivo maternal haploid system via gynogenesis (first reported by Stadler and Randolph in 1929).

🔬 The in vivo maternal DH process (maize)

Four basic steps:

  1. Pollinate donor plants (F₁ or F₂ progeny from breeding cross) with inducer line.
  2. Identify and recover haploid seed via color marker phenotype.
  3. Germinate haploid seed and apply chromosome doubling agent (e.g., colchicine injection).
  4. Grow DHs to maturity and self-pollinate.

🎨 Haploid selection via color markers

  • Most seed from donor × inducer pollination will be useless F₁s.
  • Small proportion will be haploids (rate determined by inducer line).
  • Color marker system enables sorting: haploid seed shows color only in endosperm, not in scutellum (which harbors the embryo).
  • Don't confuse: Color expression is influenced by genetic background, so visual sorting requires experience.

🌾 Final product characteristics

  • All seed on the harvested ear is genetically identical and completely homozygous.
  • Testing homozygous lines yields more precise performance estimates than segregating lines.
  • Provides more accuracy in selection decisions and QTL effect estimation.

🚀 Maximizing DH technology value

🧪 Choosing the best donor population type

Three options differ in number of meiosis events (cell divisions producing reproductive cells):

Donor typeMeiosis eventsRecombination opportunity
F₁FewerLess recombination
F₂One moreMore recombination
BC₁One moreMore recombination

Key evidence:

  • DH lines from F₁ donors: Mean of 10 recombinations per genome; 37% of lines with ≥4 intact parental chromosomes.
  • RILs: Mean of 15 recombinations per genome; 13% of lines with ≥4 intact parental chromosomes.
  • DH F₂ lines: No more than 3% lower in selection response than RILs; up to 6% higher than DH F₁ lines.

Recommendation: Use F₂ or BC₁ donors for one additional meiosis, enabling more recombination and new favorable allele combinations.

  • Don't confuse: The extra generation doesn't necessarily mean another year—off-season nurseries or greenhouses can facilitate multiple generations per year.

🎯 F₂ enrichment (pre-selection before induction)

Rationale: Focus DH resources on individuals with greater genetic promise to achieve greater genetic gain.

Selection among F₂ families can target:

  • High-heritability traits (plant morphology, disease resistance).
  • Favorable marker genotype for linked traits.
  • Against negative marker genotypes.
  • Increased/decreased frequency of recombinants (breaking repulsion linkages or preserving favorable gene complexes).

More elaborate scheme: Three cycles of genomic selection in recurrent selection to enrich favorable allele frequency for yield and quantitative traits (Bernardo et al. 2010).

📊 Performance testing structure for large DH populations

The new challenge: Doubled haploidy easily produces large numbers of progeny quickly—now the issue is how to test vast numbers of inbred lines efficiently.

Two-stage testing example (Melchinger et al. 2005):

ApproachStage 1Stage 2Result
Traditional250 segregating testcross progeny at 4 locationsBaseline
Two-stage DH739 DH lines at 1 location29 selections at 9 locations (Year 2)Nearly 20% increase in genetic gain
  • Same resource investment (same number of yield plots).
  • Two-stage testing accommodates more lines and focuses higher scrutiny on top performers.
  • More effective in maximizing genetic gain when DH technology is utilized.

🔗 Integration and optimization principles

🔗 Holistic pipeline design

  • Every aspect of breeding and testing process design should relate to factors influencing response to selection and rate of genetic gain.
  • Technologies are useful only when they boost efficiency measurably.
  • Once a technology is adopted, its value is maximized by fine-tuning the entire pipeline to exploit the opportunities it creates.

⚙️ Technology evaluation framework

Two criteria for judging new technology merit:

  1. Ability to increase genetic gain.
  2. Cost-benefit ratio.
  • Don't confuse: Technology deployment is not just about adopting the tool—it requires redesigning the process around the technology's advantages.

🔗 The weakest link principle

"A chain is only as strong as its weakest link!"

  • Integration of all process components is critical to a robust product pipeline.
  • Consistent production of improved cultivars depends on addressing every bottleneck, not just adding new technologies.
  • Example: Combining MAS with EBV selection, or pairing F₂ enrichment with two-stage DH testing, creates synergistic improvements beyond single-technology gains.
33

Chapter 7: Launching Improved Cultivars

Chapter 7: Launching Improved Cultivars Rita H. Mumm

🧭 Overview

🧠 One-sentence thesis

Launching a new cultivar requires a multi-stage supply chain process that ensures genetic integrity, seed quality, and regulatory compliance while protecting the breeder's intellectual property and promoting adoption by farmers.

📌 Key points (3–5)

  • Supply chain as final stage: Once a superior line is identified and can be reliably reproduced, the supply chain manages all steps from variety registration through commercial distribution.
  • Two testing requirements: DUS testing establishes the cultivar's unique identity (distinctiveness, uniformity, stability, varietal identity), while VCU testing demonstrates its merit and superiority to stakeholders.
  • Scale-up priorities differ by propagation method: Clonally-propagated cultivars require clean stock free from pathogens; seed-propagated cultivars require safeguarding seed quality (genetic integrity, purity, vigor, pathogen-free status) through multiple seed classes.
  • Common confusion—seed classes: Breeder's seed → pre-basic/foundation seed → (for hybrids: hybrid seed production) → certified seed; each class has specific standards and purposes, and the breeder maintains the genetic standard at the breeder's seed level.
  • Intellectual property protection mechanisms: PVP (Plant Variety Protection) is the key worldwide mechanism, allowing farmers to save seed and breeders to use the variety in further breeding, but restricting unauthorized commercial multiplication and sale.

🧪 Variety registration and testing

🧪 DUS testing—establishing identity

Variety registration testing (DUS): Testing conducted to establish the identity of a new cultivar based on distinctiveness, uniformity, stability, and varietal identity.

  • The new cultivar must be a new genetic entity, not previously commercialized, and distinguishable from other varieties.
  • Four aspects evaluated:
    • Distinctiveness: different from other existing varieties
    • Uniformity: individual plants show consistency in quality and variation within the population
    • Stability: distinctive characteristics persist through generations of propagation
    • Varietal Identity: defining morphological characteristics
  • The potential new cultivar is compared to a wide range of existing varieties to validate its unique benefits and attributes.
  • Detailed guidelines are available from the International Union for the Protection of New Varieties of Plants (UPOV).

🏆 VCU testing—demonstrating merit

Varietal Performance Testing (VCU): Testing conducted to demonstrate the merit of the new cultivar to stakeholders in the value chain (producers, end-users, consumers).

  • Test results must prove the new cultivar is superior to existing varieties in one or more aspects (e.g., yield, disease resistance, nutritional value).
  • Typically conducted through on-farm trials for multiple years.
  • New cultivars that meet DUS and VCU requirements are officially approved for commercialization and listed in a variety register.
  • Important note: Requirements vary from country to country; DUS and VCU requirements must be accommodated in the design of the testing regime for cultivar development.

🌱 Scale-up for clonally-propagated cultivars

🧼 Creating clean stock

  • To generate "clean" stock, a heat treatment may be applied ahead of isolating meristematic tissue for culturing to kill off any pathogens.
  • Clean plant stock source materials can be stored long-term through cryopreservation.
  • Why this matters: Maintaining genetic integrity and purity while keeping propagation materials free from pests and diseases is critical during scale-up to commercial quantities.

🛡️ Maintaining and increasing clean stock

Best practices include:

  • Physical isolation: planting distance from potential sources of infection
  • Control of insect vectors: use of screen houses or cages, chemical controls, or distance/time isolation
  • Inspection and testing: visual observation, use of indicator plants/grafting, and/or diagnostics (ELISA assays, molecular marker analysis) to monitor for pathogen presence

🌾 Scale-up for seed-propagated cultivars

🌾 Seed quality as key priority

Seed quality: Reflects parameters to ensure genetic integrity and purity, seed vigor, absence of pathogens, and cleanliness.

  • Seed quality is a key priority at every stage in seed production and scale-up.
  • Because producing commercial volumes is a multi-step process, seed quality must be safeguarded through each step to achieve the desired result for distribution to farmers.

📦 Seed classes from breeder's seed to commercialization

Breeder's seed: Seed produced in the initiating breeding program through controlled hand pollination; represents the standard of genetic identity to which all other increases of the line will be compared.

Seed ClassProduced fromComprised of
Pre-basic seedBreeder seedCultivar per se -or- parent line of hybrid cultivar
Foundation seed (or Basic seed)Pre-basic or breeder's seedCultivar per se -or- parent line of hybrid cultivar
Registered seedFoundation seedProgeny of foundation seed or registered seed used to produce certified seed
Hybrid seedCrosses between parent lines increased by Foundation seed groupHybrid cultivar -or- intermediary hybrid in 3- or 4-way hybrid cultivars
Certified seedFoundation or higher seed classesCultivar per se including hybrid cultivar
  • Breeder's responsibility: The development and maintenance of breeder seed is an important obligation of the plant breeder; breeder's seed is provided to Supply Chain through a hand-off from the plant breeder.
  • Foundation group role: Uses breeder's seed to increase the variety or parent line, producing pre-basic and foundation seed.
  • Certified seed production: Governed by national seed regulations; aims to produce seed for commercial distribution meeting standards for genetic purity, seed germination, seed moisture, and presence of premium value-added traits.
  • International standards: OECD provides guidelines for seed certification (though not required by every country for every crop).

🔒 Safeguards for genetic purity

Steps implemented at each stage of seed production:

  • Proper isolation of production fields: Minimize chance of stray pollen; physical isolation distance depends on pollen mobility (e.g., maize: at least 300 meters); fields can also be isolated in time so pollen shed does not occur when production fields are receptive.
  • Field selection: Facilitate control of soil-borne disease (no previous history) and absence of volunteer plants (of the same species).
  • Ideal agronomic management: Fertility, planting, disease control, soil moisture, etc., to facilitate top yields.
  • Careful cleaning of equipment: Planters, combines, threshers, trucks, bins, etc., to ensure no seed carryover and prevent inadvertent contamination through seed mixing.
  • Judicious placement of insect pollinators: As needed depending on the crop to facilitate pollination.
  • Field inspection: Throughout the growing season.
  • Careful record keeping: Maintain chain of custody documentation from seed source through each seed increase to final production.
  • Accurate labeling: Of harvested seed to safeguard genetic authenticity.

🌽 Additional best practices for hybrid seed production

  • Placement and ratio: Female parent plants placed relative to male parent plants to ensure ample pollen movement; ratio designed to facilitate sufficient pollination (e.g., corn: typically 2:1 ratio—four rows of female inbred flanked by two rows of male inbred).
  • Mechanical aids: With some crops (e.g., rice), movement of relatively heavy pollen is aided by mechanical means (e.g., helicopter flown overhead at peak pollen shed).

Nicking: Male and female parent plants flowering simultaneously.

  • To achieve the nick, one parent may be planted later than the other in keeping with known maturity of the lines; with some crops, the earlier female or male parent may be "trimmed" to trigger a second flowering.
  • Male row removal: Generally, male rows are harvested or removed from the field ahead of harvesting female rows to prevent contamination of hybrid seed with selfed seed of the male parent.

🔬 Foundation research role

  • Once breeder seed is handed off to supply chain, the Foundation seed group is responsible for maintaining genetic integrity of the variety or parent seed.
  • Measures taken to prevent genetic drift, contamination through pollen migration, unintended seed mixing, and to ensure accurate seed labeling.
  • Research arm: Often includes research to optimize seed multiplication and hybrid production; each line is characterized in depth; for hybrid cultivars, parent lines are assessed to facilitate best practices for optimal seed quality and output in a cost-effective manner.

🧪 Seed quality tests

🔍 Types of evaluations

Once harvested, each lot of seed undergoes screening and testing:

  • Ratio of seed to other materials: Weed seed and inert matter
  • Genetic authenticity: Ensure proper labeling
  • Seed purity: Percentage of off-types
  • Rate of germination: Measurement of seed vigor
  • Seed moisture content: Indicate whether further drying is required to prolong shelf life
  • Freedom from adventitious presence of GM events
  • Freedom from disease and pests

🧬 Genetic authenticity and seed purity

DNA fingerprinting: Verification of genetic makeup to ensure seed matches the intended cultivar; also represents a genetic profile to support intellectual property protection.

  • Grow-out test: A sample of each seed lot is grown to check for characteristic morphological properties, absence of off-types, and percentage of seed exhibiting premium value-added traits (e.g., herbicide tolerance, oil profile).
  • Sample size requirements: Depend on the predetermined threshold for off-types.
Threshold for off-types (%)Minimum level of genetic puritySample size per seed lot
0.0199.94000
0.2099.82000
0.3099.71350
0.5099.5800
1.0099.0400
  • Example: If no more than 1 percent off-types are allowed, at least 400 seeds (plants) must be tested; for stricter standards, larger sample sizes are required.
  • Grow-out conditions: Must be conducted in the target market region in a field free from potential volunteer plants; managed in keeping with agronomic practices recommended to farmers.
  • Calculation: Percentage of off-types is estimated from the number of seeds (plants) in the sample which do not conform to the prescribed standard.

🌱 Seed germination testing

  • A sample of seed from each seed lot is collected, cleaned, and germinated on a moist substrate under laboratory conditions, with replication.
  • The mean percentage of seeds that germinate to produce normal seedlings can then be estimated for that seed lot.

🦠 Phytosanitary testing

  • Seed lots must be checked for the presence of seed-borne disease and insect pests and disease-carrying organisms.
  • Such tests serve as a backup to field scouting during seed increase and hybrid production.

🧬 Testing for adventitious presence of GM events

  • Seed may be evaluated for the absence of other traits, particularly those resulting from genetically modified (GM) events, that are not intended as part of the genetic package of the cultivar.
  • Why this matters: Traits created through genetic modification are subject to regulation on a country-by-country basis; seed must be tested before product release to ensure only desired GM events, consistent with seed labeling, are present.

Adventitious presence: Unintended presence of GM events; evaluated through DNA analysis using markers for the DNA sequence characteristic of known events.

📦 Packaging and distribution

📦 Preparing seed for distribution

Seed lots that pass strict standards are stored under conditions conducive to long seed life (controlling moisture/temperature levels).

Preparation steps:

  • Cleaning: Remove any weed seed or inert matter
  • Seed treatments: May be applied as desired (e.g., seed-applied fungicides or microbials to stimulate seedling growth)
  • Packaging: Prevent absorption of water from atmosphere; keep seed contained and inaccessible to insects and disease
  • Labeling: Seed bags/containers are labeled

🏷️ Seed container labeling

  • Must be accurately labeled (or "tagged"), listing information related to:
    • Specific variety
    • Seed lot and origin
    • Seed testing results of the seed lot
    • Seed purity
  • Varieties featuring value-added traits typically list the thresholds of seed purity for these traits.
  • Certified seed tags: Reveal these important details and more.

📢 Promoting the new cultivar

📢 Breeder's role in creating awareness

  • The breeder has a role in creating awareness and demand for the new cultivar.
  • May set up demonstration plots to highlight soon-to-be-released varieties or new value-added traits to farmers.
  • Field days: At the research station represent a good opportunity for breeders to exhibit characteristics of new varieties or traits to farmers and growers.
  • Agronomic recommendations: The breeder can make recommendations as to appropriate agronomic practices to maximize and protect yields, critical to farmers' success, especially with new cultivars.
  • Goal: Promote rapid adoption of new, improved cultivars.

🔐 Protecting intellectual property

🔐 Why protection matters

  • The new, improved cultivar represents an important investment of skills, time, energy, and resources.
  • To recoup the investment and provide incentive for further development of improved varieties that meet stakeholder needs, the intellectual property represented by the new cultivar must be protected.

🛡️ Mechanisms available

A number of mechanisms are available to protect the interests of plant breeders and foster the development of a robust national seed sector:

  • Plant Variety Protection (PVP): For seed and tubers
  • Plant patents: For asexually propagated plants, except tubers
  • Utility patents: For any type of plant showing a particular utility or purpose
  • Contracts
  • Trade secrets

🌍 Plant Variety Protection (PVP)

Plant Variety Protection (PVP): A key mechanism recognized worldwide that provides intellectual property protection for unique varieties of a sexually-reproduced plant or tuber-propagated plant for 20 years (25 years for trees and vines).

  • Governed by: International Union for the Protection of New Varieties of Plants (UPOV), established in 1961 as an outcome of the International Convention for the Protection of New Varieties of Plants.
  • UPOV mission: Provide and promote an effective system of plant variety protection, which encourages the development of new, improved varieties of plants for the benefit of society.
  • Application process: The breeder of the new cultivar (or his/her employer) must file an application with the authorities of UPOV; the African Intellectual Property Organization operates a plant breeders' rights system covering 17 member states.

⚖️ PVP provisions—what is allowed and restricted

What PVP allows (does NOT restrict):

  • Farmers saving seed for their own purposes
  • Private use of the variety for non-commercial or experimental purposes
  • Use of the protected variety in breeding to develop other improved cultivars, provided the new cultivar is not essentially derived from the protected cultivar
  • Result: The protected variety can still contribute to the advancement of improved varieties during the 20 years of coverage; PVP does not slow innovation.

What PVP restricts:

  • Marketing and selling of the protected variety, or a variety essentially derived from the protected variety, by other commercial entities
  • Multiplication of the protected variety
  • Import/export without the express authorization of the breeder

Enforcement:

  • Falls to the breeder and his/her organization
  • A DNA fingerprint of the protected variety produced in accord with established standards can be used to expose the genetic identity of suspected offenders

After expiration:

  • Once the term of PVP protection has expired, the variety is considered "public domain."
  • Net result: Encourage more innovation, which ultimately promotes food security and benefits the consumer.

📝 Sharing germplasm

  • Plant breeders are advised to keep good records of germplasm in inventory as well as germplasm shared with others; such records will be important in protecting intellectual property embodied by new cultivars.

Standard Material Transfer Agreement (SMTA): A contract between the provider and the recipient of germplasm.

  • Provider agrees: To share the germplasm and other available non-confidential descriptive information about the plant materials.
  • Recipient agrees: To use the materials for research, breeding, or agricultural training, without claim to intellectual property.
  • Commercialization clause: Should the recipient commercialize a product that incorporates or is developed from the material, the SMTA outlines expectations for compensation, if any, to the provider.

🔄 Starting the cycle anew

🔄 The continuous cycle

  • The launch of a new, improved cultivar that meets the needs of stakeholders embodied in the product target represents the close of one cycle and triggers the start of another.
  • A plant breeder's job never ends!

🔄 Redefining product targets

  • With the start of a new cycle, product targets are redefined or adjusted in keeping with reassessment of stakeholder needs and desires.
  • Armed with information gleaned from past cycles, the breeder is better informed with respect to:
    • Genetic architecture of traits of interest
    • How traits interact with the environment and each other
  • Better able to choose parents to create new breeding populations, identify or create effective testing environments, and employ technologies.
  • Result: Knowledge gleaned can lead to better ways to create useful genetic variation and more effective ways to exploit this variation to achieve stated product targets.

🔄 Information for future cycles

Types of information acquired during the development of an improved cultivar that might serve to better guide choice of parents and enhance evaluation/selection of progeny in future cycles:

  • Genetic architecture of traits of interest
  • Trait interactions with environment
  • Trait interactions with each other
  • Effective testing environments
  • Technology applications
    Cultivar Development | Thetawave AI – Best AI Note Taker for College Students