BSMS205 · Genetics

Recombination, Linkage,
and Haplotype

Chapter 23 · Part IV · Population Genetics

A question every biology student asks

Why do siblings
look different?

Part of the answer · random assortment

Mom has two copies of each chromosome
One from grandma, one from grandpa
At every chromosome, she passes on one of the two
You might get mostly grandma's · your sister mostly grandpa's

The deeper answer · she passes a mosaic

During meiosis, matching chromosomes exchange DNA segments.

The chromosome you inherit from mom is part grandma, part grandpa
You can have grandpa's nose and grandma's eye colour even though both genes are on the same chromosome
This shuffling is called recombination

Roadmap for today

Two flavours · cross-overs and non-cross-overs
What the 2025 Iceland study revealed
How CO and NCO form · the molecular dance
Linkage · alleles that travel together
Regional patterns · hotspots, cold spots, telomeres, centromeres
Haplotypes · blocks of inherited DNA

§ 1

Two Flavours
of Recombination

Cross-over (CO) · the classical event

Large chunks of DNA get swapped between chromosomes
Two ropes cut at the same position · ends swapped
Easy to detect · creates long stretches of one ancestry where you expect the other
This is the event Morgan and Sturtevant studied

Non-cross-over (NCO) · the hidden majority

A small stretch — a few hundred to a few thousand base pairs — gets copied from one chromosome to the other
Also called gene conversion
Overall structure barely changes · local sequence is altered
Hidden until high-resolution sequencing arrived

NCOs outnumber COs · by a lot

5 – 10×

non-cross-overs per cross-over in a typical meiosis

Cross-overs were just the visible tip of the iceberg.

Both start the same way

Every recombination event starts with a double-strand break (DSB) — a deliberate cut in both strands of the DNA helix.

Cells make these breaks on purpose during meiosis
Sounds dangerous · is dangerous if it goes wrong
Repaired using the matching chromosome as a template
How repair proceeds → CO or NCO

§ 2

The 2025 Iceland Study

Palsson et al. · Nature 2025

Whole-genome sequences from over 5,000 Icelandic families
Parents · children · often grandparents
First complete human recombination map including both CO and NCO
Tens of thousands of events classified

Palsson et al. 2025, Nature

Finding 1 · non-cross-overs dominate

Per meiosis · COs

40 – 60

across all 23 chromosome pairs

Per meiosis · NCOs

several 100s

an order of magnitude more

Finding 2 · mothers ≠ fathers

Maternal meiosis: more COs (well known)
Maternal NCOs: longer — typically 1,000 – 2,000 bp
Paternal NCOs: shorter — typically 300 – 500 bp
Why? Unknown · molecular machinery differs in eggs vs sperm

Finding 3 · maternal age raises NCOs

Older mothers produce more NCOs per egg
Effect is strongest outside programmed hotspots
Cross-overs do not change much with age
Possibly a compensatory mechanism as hotspots fail

Finding 4 · centromeres · CO-suppressed

Near centromere: cross-overs are rare
A CO near a centromere can disrupt chromosome segregation
But NCOs happen readily there
NCOs may help homologues recognise and pair without the risk

How they detected it

Palsson et al 2025 Figure 1: DSB mechanism and detection pipeline — DSB → strand invasion → dHJ (gives CO or NCO) or SDSA (gives NCO).
Palsson et al. 2025, *Nature*. CC BY-NC-ND 4.0.

§ 3

The Molecular Dance

Step 1 · double-strand break

Both strands of the DNA helix are cut
The break is made on purpose · catalysed by SPO11
Happens on one of the two paired chromosomes

Step 2 · end processing and strand invasion

Broken ends processed → single-stranded tail (3' overhang)
Tail invades the homologous chromosome
Uses the matching sequence on the other chromosome as a template

Now the fork in the road

How does the cell resolve the invasion structure?

Two paths. Two different outcomes.

Path 1 · SDSA → non-cross-over

Synthesis-Dependent Strand Annealing
Invading strand copies a short stretch from blue
Then pulls back and re-anneals to its original red partner
Small patch converted · large-scale structure unchanged

Path 2 · double Holliday junction → cross-over

Double Holliday Junction (dHJ) forms
Invading strand stays attached · DNA synthesis extends the connection
Two chromosome junctions form
When resolved · large flanking regions swap

Which path does the cell choose?

One of the big mysteries in meiosis biology
Specific proteins promote one outcome over the other
Local chromatin context (DNA packaging) matters
Outcome is biased toward NCO · but COs are crucial

§ 4

Linkage

Linkage · alleles that travel together

The tendency of alleles close together on the same chromosome
to be inherited as a pair.

Recombination is random with respect to position
The closer two variants are, the less likely a recombination lands between them
If nothing separates them, they travel together

Recombination fraction · r

r	Meaning	Genetic distance
0	Complete linkage	0 cM · right next to each other
0.01	Tightly linked	~1 cM
0.10	Moderately linked	~10 cM
0.5	Unlinked · independent assortment	far apart / different chromosomes

Why linkage matters

Linkage analysis for Mendelian diseases
Marker near disease allele → co-segregates with disease
Narrow the search from 3 Gbp to 10 Mbp
Modern GWAS: linkage disequilibrium extends this idea

NCOs break linkage too

Short gene conversions also separate nearby variants
Previous linkage maps missed this contribution
Including NCOs → more accurate linkage estimates
Especially matters for very close variant pairs

§ 5

Regional Patterns ·
Hotspots and Cold Spots

Telomeres · CO-rich

Chromosome ends are rich in cross-overs
COs at the ends don't disrupt chromosome architecture
Functional role: ensure proper segregation
Every chromosome needs at least one CO to segregate correctly

Centromeres · CO-suppressed

Near the centromere · cross-overs are rare
A CO there could disrupt spindle attachment → aneuploidy
But NCOs happen readily here
NCOs may help homologues pair without the segregation risk

Hotspots · short regions of concentrated activity

Typically 1 – 2 kb wide
Recombination is much more frequent than average
Marked by the DNA-binding protein PRDM9
PRDM9 recognises specific sequence motifs · recruits DSB machinery

Hotspots move between populations

PRDM9 itself is highly variable between humans
Different versions recognise different DNA motifs
A hotspot active in Europeans may be silent in East Asians
Over evolutionary time, hotspot locations drift

Regional patterns visualised

Palsson 2025 Figure 2: regional recombination patterns on chromosome 19 — Chromosome 19 · paternal NCOs rise near telomeres · centromere is CO-suppressed · maternal age raises NCO rate by **~20 events per decade**.
Palsson et al. 2025, *Nature*. CC BY-NC-ND 4.0.

§ 6

Haplotypes

Haplotype · a block inherited together

A haplotype is a set of DNA variants inherited together from one parent.

Because recombination is occasional, nearby variants stay linked for many generations
Forms a "block" of co-inherited alleles
The fundamental unit of population genetics

The beaded necklace analogy

Each chromosome = a necklace · each bead = a SNP
Grandma's necklace: red-blue-blue-green-red
Grandpa's necklace: blue-red-green-green-blue
Mom's egg: a mosaic of both · that mosaic = the haplotype she passes on

How haplotypes reveal recombination

Sequence child + both parents + grandparents
Walk along chromosome · track which grandparent each variant matches
A switch point = a recombination event
Large switch (Mb) = cross-over · short switch (kb) = NCO

§ 7

Why This Matters

Four domains

Diversity — recombination creates new allele combinations every generation
Disease genetics — complete maps sharpen fine-mapping
Reproductive health — CO errors cause aneuploidy · miscarriage · Down syndrome
Human evolution — haplotype block lengths record demographic history

Four domains where this matters. First, diversity. Recombination is the engine of genetic variation. Without it, beneficial alleles on separate chromosomes could never be combined into a single individual. Evolution would have much less raw material to work with. Second, disease genetics. Complete recombination maps, including non-cross-overs, improve fine-mapping of disease loci and sharpen the interpretation of linkage disequilibrium patterns. Third, reproductive health. Errors in cross-over placement cause aneuploidy, which underlies many cases of miscarriage and disorders like Down syndrome. Understanding how cross-overs are regulated, and how they fail with age, is a major medical frontier. And fourth, human evolution. The length distribution of haplotype blocks records demographic history — bottlenecks, admixture events, and selective sweeps — all readable by anyone who can reconstruct haplotypes.

§ 8

Summary

What to take away

Recombination has two flavours · COs and NCOs
NCOs outnumber COs by a factor of five to ten
Maternal vs paternal meiosis · different rates, different NCO lengths
Centromeres suppress COs · telomeres enrich them · PRDM9 defines hotspots
Haplotypes are the blocks of co-inherited alleles that record recombination history

Next lecture

How do we encode
alleles, genotypes, and haplotypes
in the data format of modern genomics?

Chapter 24 · Data Types for Alleles and Populations