BSMS205 · Genetics

Polygenic Model
in Traits

Chapter 15 · Part III · Complex Traits

A question to start with

If alleles add up,
what does adding
thousands look like?

Mendel's peas vs your height

Mendel's discrete pea trait vs continuous human height — **Figure.** Mendel's peas: *two* discrete categories (tall / short). Human height: a smooth continuum from short to tall. Same biology — but very different number of contributing genes.

The reconciliation

Mendel · 1866

Discrete · 3:1 ratios
One gene, two alleles
Peas, flowers, mice

Galton · 1889

Continuous · bell curves
Height, intelligence
Looked non-Mendelian

Fisher 1918 · Mendel + many genes = Galton.

The scale of the question

361,194

people · 1,060 traits · 13.7 M variants

The UK Biobank · largest genetic resource on earth
Physical · biomarkers · diseases · behaviors
Tested both additive and dominance effects

Roadmap for today

Why Mendel's ratios disappear in real traits
The polygenic intuition · summing many small effects
Simulation · 1, 5, 50, 1000 loci
Measuring additive effects · the 0-1-2 code
Measuring dominance · the orthogonal trick
UK Biobank results · what is real, what is rare
Polygenic scores & the omnigenic preview

§ 1

Why Most Traits
Don't Follow Mendel

One gene, three genotypes, three values

AA → 170 cm · baseline
AB → 170.5 cm · one B allele
BB → 171 cm · two B alleles

Each B allele adds 0.5 cm.
Stacking blocks · same height per block.

One gene · the histogram

If allele B has frequency p = 0.5, genotypes follow 1 : 2 : 1
The trait histogram has 3 bars
That is what Mendel saw in peas

With one gene, the trait distribution looks discrete.

The leap

What if the trait depends on
two genes? Five? Fifty?

§ 2

From Many Small Effects
to a Bell Curve

Two loci · 9 genotype combinations

Locus 1: AA, AB, BB · adds 0, 0.5, 1 cm
Locus 2: CC, CD, DD · adds 0, 0.5, 1 cm
3 × 3 = 9 joint genotypes
Trait values cluster into 5 bins: 0, 0.5, 1, 1.5, 2 cm

Already a peak in the middle.

Five loci · the shape begins

5 loci, each with 0/1/2 effect alleles
Possible totals: 0 to 10 effect alleles
Number of ways to make each total → binomial
Histogram already peaks in the middle

Most people are average-ish.
Few are extreme.

Why the bell curve · central limit

Trait = effect₁ + effect₂ + effect₃ + ... + effect_N

Each term is a small random contribution
Central limit theorem: sum of many independent variables → Gaussian
The shape is independent of what each gene does individually

1 → 5 → 50 → ∞ loci

Loci	Distinct trait values	Shape
1	3	Three bars
2	5	Slight peak
5	11	Visible bell
50	101	Smooth Gaussian
~10,000	Continuous	Real height

A worked example

Height is the sum of
~10,000 tiny pushes.

§ 3

Measuring
Additive Effects

The 0-1-2 code

AA = 0 · AB = 1 · BB = 2

Just count B alleles
Regress the trait on this count
Slope = additive effect per allele

What additive looks like

Three modes of allelic effects: additive, dominance, overdominance — **Figure.** Three modes of allelic effects. **Panel A · additive**: trait increases linearly with allele count (0 → 1 → 2). **Panel B · dominance**: heterozygote (AB) deviates from the line. **Panel C · overdominance**: heterozygote is more extreme than either homozygote. Blue = additive variance, red = dominance variance — most loci are mostly blue. *Source: Palmer et al. 2021, bioRxiv.*

Why the slope is the answer

If trait really is linear in allele count → slope captures all of it
If not, the residuals tell us about dominance
For ~700 of 1,060 UK Biobank traits → linear fits well

For most traits, the slope is the whole story.

§ 4

Measuring
Dominance — the Tricky Part

What dominance means

Complete dominance: AB looks like BB
Pattern: AA = 0, AB = 1, BB = 1
One copy of B is enough
Examples: FGFR3 (achondroplasia) · FBN1 (Marfan)

The confounding problem

Additive

0 — 1 — 2

Dominant

0 — 1 — 1

At AB alone they look the same.
We need a way to separate them.

The orthogonal encoding

Genotype	Additive code	Dominance code
AA	0	0
AB	1	1
BB	2	0

Dominance code 0–1–0 = "is heterozygote special?"

Orthogonal · the equalizer analogy

Volume slider

= additive effect
Smooth, predictable

Bass boost

= dominance effect
Heterozygote bonus

Set independent sliders → measure each cleanly.

§ 5

What the
UK Biobank Showed

The headline result

700

of 1,060 traits
show additive heritability

183

loci
with detectable dominance

Dominance explains only ~0.5% of additive variance.

The Manhattan picture

Genome-wide additive vs dominance hits across chromosomes — **Figure.** Genome-wide associations. **Bottom (pink):** additive effects — densely packed across all chromosomes. **Top (green):** dominance effects — sparse, only 183 loci. Notable dominance hits labelled (**MC1R**, **APOE**, etc). *Source: Palmer et al. 2021, bioRxiv.*

A few real dominance hits

Gene	Trait	Note
MC1R	Red hair · skin pigment	One copy shifts hair colour
APOE	Alzheimer's risk · LDL	ε4 allele · partial dominance
FGFR3	Achondroplasia (height)	Single mutation → dwarfism
FBN1	Marfan syndrome	Single mutation → tall, thin

Why dominance is rare

Additive effects accumulate — small × thousands = large
Dominance needs a specific architecture at one locus
Selection removes harmful dominant alleles fast
The exceptions: new mutations · late onset

§ 6

Polygenic Scores
· Adding It All Up

Building a polygenic score

PGS = Σ_i ( β_i × dosage_i )

For each variant i: effect size β_i from GWAS
Times the person's dosage (0, 1, or 2)
Sum across millions of variants

What a PGS can and cannot do

Can

Rank relative risk
Identify high-risk tails
Improve disease screening

Cannot

Determine an individual's fate
Transfer cleanly across ancestries
Capture environment

Where polygenicity is extreme

Trait	~Variants	Per-variant effect
Height	~12,000	< 1 mm each
BMI	~3,000	Tiny
Schizophrenia	~270 loci · thousands of variants	OR ~1.05
Educational attainment	~3,900	Days of schooling

Effects are tiny. Counts are huge.

Effect-size distribution

Many variants with very small effects
A handful with moderate effects
Almost none with large effects (for common traits)

It is a long-tailed distribution.
Mostly tail.

The omnigenic preview

If thousands of variants matter,
most genes may matter.

Boyle, Li & Pritchard 2017 · "An Expanded View of Complex Traits"
Core genes + peripheral regulatory network
Almost any expressed gene → tiny push on most traits

§ 7

Summary

What to take away

Most traits are polygenic — sum of thousands of small effects
Many small additive effects → Gaussian (central limit)
UK Biobank: ~700 traits show additive heritability
Dominance is real but rare · ~0.5% of variance · 183 loci
PGS = sum of GWAS effect sizes × dosages

Next lecture

If traits are polygenic —
how much is genes
vs environment?

Chapter 16 · Heritability