BSMS205 · Genetics

Polygenic Model
in Traits

Chapter 15 · Part III · Complex Traits
A question to start with

If alleles add up,
what does adding
thousands look like?

Mendel's peas vs your height

Mendel's discrete pea trait vs continuous human height
Figure. Mendel's peas: two discrete categories (tall / short). Human height: a smooth continuum from short to tall. Same biology — but very different number of contributing genes.

The reconciliation

Mendel · 1866

  • Discrete · 3:1 ratios
  • One gene, two alleles
  • Peas, flowers, mice

Galton · 1889

  • Continuous · bell curves
  • Height, intelligence
  • Looked non-Mendelian
Fisher 1918 · Mendel + many genes = Galton.
The scale of the question
361,194
people · 1,060 traits · 13.7 M variants
  • The UK Biobank · largest genetic resource on earth
  • Physical · biomarkers · diseases · behaviors
  • Tested both additive and dominance effects

Roadmap for today

  1. Why Mendel's ratios disappear in real traits
  2. The polygenic intuition · summing many small effects
  3. Simulation · 1, 5, 50, 1000 loci
  4. Measuring additive effects · the 0-1-2 code
  5. Measuring dominance · the orthogonal trick
  6. UK Biobank results · what is real, what is rare
  7. Polygenic scores & the omnigenic preview
§ 1

Why Most Traits
Don't Follow Mendel

One gene, three genotypes, three values

  • AA → 170 cm · baseline
  • AB → 170.5 cm · one B allele
  • BB → 171 cm · two B alleles
Each B allele adds 0.5 cm.
Stacking blocks · same height per block.

One gene · the histogram

  • If allele B has frequency p = 0.5, genotypes follow 1 : 2 : 1
  • The trait histogram has 3 bars
  • That is what Mendel saw in peas
With one gene, the trait distribution looks discrete.
The leap

What if the trait depends on
two genes? Five? Fifty?

§ 2

From Many Small Effects
to a Bell Curve

Two loci · 9 genotype combinations

  • Locus 1: AA, AB, BB · adds 0, 0.5, 1 cm
  • Locus 2: CC, CD, DD · adds 0, 0.5, 1 cm
  • 3 × 3 = 9 joint genotypes
  • Trait values cluster into 5 bins: 0, 0.5, 1, 1.5, 2 cm
Already a peak in the middle.

Five loci · the shape begins

  • 5 loci, each with 0/1/2 effect alleles
  • Possible totals: 0 to 10 effect alleles
  • Number of ways to make each total → binomial
  • Histogram already peaks in the middle
Most people are average-ish.
Few are extreme.

Why the bell curve · central limit

Trait = effect₁ + effect₂ + effect₃ + ... + effectN
  • Each term is a small random contribution
  • Central limit theorem: sum of many independent variables → Gaussian
  • The shape is independent of what each gene does individually

1 → 5 → 50 → ∞ loci

LociDistinct trait valuesShape
13Three bars
25Slight peak
511Visible bell
50101Smooth Gaussian
~10,000ContinuousReal height
A worked example

Height is the sum of
~10,000 tiny pushes.

§ 3

Measuring
Additive Effects

The 0-1-2 code

AA = 0  ·  AB = 1  ·  BB = 2
  • Just count B alleles
  • Regress the trait on this count
  • Slope = additive effect per allele

What additive looks like

Three modes of allelic effects: additive, dominance, overdominance
Figure. Three modes of allelic effects. Panel A · additive: trait increases linearly with allele count (0 → 1 → 2). Panel B · dominance: heterozygote (AB) deviates from the line. Panel C · overdominance: heterozygote is more extreme than either homozygote. Blue = additive variance, red = dominance variance — most loci are mostly blue. Source: Palmer et al. 2021, bioRxiv.

Why the slope is the answer

  • If trait really is linear in allele count → slope captures all of it
  • If not, the residuals tell us about dominance
  • For ~700 of 1,060 UK Biobank traits → linear fits well
For most traits, the slope is the whole story.
§ 4

Measuring
Dominance — the Tricky Part

What dominance means

  • Complete dominance: AB looks like BB
  • Pattern: AA = 0, AB = 1, BB = 1
  • One copy of B is enough
  • Examples: FGFR3 (achondroplasia) · FBN1 (Marfan)

The confounding problem

Additive

  • 0 — 1 — 2

Dominant

  • 0 — 1 — 1
At AB alone they look the same.
We need a way to separate them.

The orthogonal encoding

GenotypeAdditive codeDominance code
AA00
AB11
BB20

Dominance code 0–1–0 = "is heterozygote special?"

Orthogonal · the equalizer analogy

Volume slider

  • = additive effect
  • Smooth, predictable

Bass boost

  • = dominance effect
  • Heterozygote bonus
Set independent sliders → measure each cleanly.
§ 5

What the
UK Biobank Showed

The headline result

700
of 1,060 traits
show additive heritability
183
loci
with detectable dominance

Dominance explains only ~0.5% of additive variance.

The Manhattan picture

Genome-wide additive vs dominance hits across chromosomes
Figure. Genome-wide associations. Bottom (pink): additive effects — densely packed across all chromosomes. Top (green): dominance effects — sparse, only 183 loci. Notable dominance hits labelled (MC1R, APOE, etc). Source: Palmer et al. 2021, bioRxiv.

A few real dominance hits

GeneTraitNote
MC1RRed hair · skin pigmentOne copy shifts hair colour
APOEAlzheimer's risk · LDLε4 allele · partial dominance
FGFR3Achondroplasia (height)Single mutation → dwarfism
FBN1Marfan syndromeSingle mutation → tall, thin

Why dominance is rare

  • Additive effects accumulate — small × thousands = large
  • Dominance needs a specific architecture at one locus
  • Selection removes harmful dominant alleles fast
  • The exceptions: new mutations · late onset
§ 6

Polygenic Scores
· Adding It All Up

Building a polygenic score

PGS = Σi ( βi × dosagei )
  • For each variant i: effect size βi from GWAS
  • Times the person's dosage (0, 1, or 2)
  • Sum across millions of variants

What a PGS can and cannot do

Can

  • Rank relative risk
  • Identify high-risk tails
  • Improve disease screening

Cannot

  • Determine an individual's fate
  • Transfer cleanly across ancestries
  • Capture environment

Where polygenicity is extreme

Trait~VariantsPer-variant effect
Height~12,000< 1 mm each
BMI~3,000Tiny
Schizophrenia~270 loci · thousands of variantsOR ~1.05
Educational attainment~3,900Days of schooling

Effects are tiny. Counts are huge.

Effect-size distribution

  • Many variants with very small effects
  • A handful with moderate effects
  • Almost none with large effects (for common traits)
It is a long-tailed distribution.
Mostly tail.

The omnigenic preview

If thousands of variants matter,
most genes may matter.
  • Boyle, Li & Pritchard 2017 · "An Expanded View of Complex Traits"
  • Core genes + peripheral regulatory network
  • Almost any expressed gene → tiny push on most traits
§ 7

Summary

What to take away

  • Most traits are polygenic — sum of thousands of small effects
  • Many small additive effects → Gaussian (central limit)
  • UK Biobank: ~700 traits show additive heritability
  • Dominance is real but rare · ~0.5% of variance · 183 loci
  • PGS = sum of GWAS effect sizes × dosages
Next lecture

If traits are polygenic —
how much is genes
vs environment?

Chapter 16 · Heritability