Most people have an intuitive notion of heritability being the genetic component of why close relatives tend to resemble each other more than strangers. More technically, heritability is the fraction of the variance of a trait within a population that is due to genetic factors. This is the pedagogical post on heritability that I promised in a previous post on estimating heritability from genome wide association studies (GWAS).

One of the most important facts about uncertainty and something that everyone should know but often doesn’t is that when you add two imprecise quantities together, while the average of the sum is the sum of the averages of the individual quantities, the total error (i.e. standard deviation) is not the sum of the standard deviations but the square root of the sum of the square of the standard deviations or variances. In other words, when you add two uncorrelated noisy variables, the variance of the sum is the sum of the variances. Hence, the error grows as the square root of the number of quantities you add and not linearly as it had been assumed for centuries. There is a great article in the American Scientist from 2007 called The Most Dangerous Equation giving a history of some calamities that resulted from not knowing about how variances sum. The variance of a trait can thus be expressed as the sum of the genetic variance and environmental variance, where environment just means everything that is not correlated to genetics. The heritability is the ratio of the genetic variance to the trait variance.

Consider a trait that varies like height. If you plot a histogram of the heights of males or females, you will get a normal distribution. Heritability is about what determines the variance of the distribution and not the mean. That is not to say that the mean does not depend on genetics. Obviously, humans are taller than rhesus monkeys and that has everything to do with the genes. However, the mean is mostly determined by the genetic (and environmental) components that everyone shares. The variance is about what is different between people and that is what we can measure. For example, say one person is 178 cm and another is 176 cm and are genetically identical except for a handful of genetic factors. If they were subjected to the same identical environmental conditions then we could attribute the difference in height to those genetic differences. Obviously, there will be many other genetic factors that specify why the height is on average 177 cm and not say 100 cm, namely all the genetic factors that are identical. However, there is no way to figure out which of those identical genes are responsible for height as opposed to say kidney function with this information. The difference between individuals is also what natural selection can work on. That is why population genetics is so focused on variances.

In my previous post on population genetics, I introduced the concept of additive genetic effects. These are genes or more technically alleles whose contributions to the trait are independent of other genes or the environment. What this means is that if you want to know the difference from the mean, you simply add up the contributions of all the additive alleles that influence that trait. The genetic variance can thus be divided into additive genetic variance and non-additive genetic variance. The non-additive parts include everything that has a nonlinear effect such as dominance, where the presence of just one allele contributes as much as two of the same allele, or epistasis where alleles act differently depending on what other alleles are present, or gene-environment effects where the contribution of an allele changes depending on the environment. The fraction of the variance explained by the additive genetic effects is called the narrow-sense heritability as opposed to the broad-sense heritability, which includes all the genetic effects.

The classical way to measure narrow-sense heritability is to take a group of close relatives, say mothers and daughters, and plot the height of daughters versus the height of mothers. The best fit line picks up the additive genetic effects. If we standardize the heights of each generation, i.e. rescale the heights so that the mean is zero and the standard deviation is one, then the slope of the line is given by the correlation between the heights of the daughters and mothers. Note that the magnitude of a correlation is always less than one. Hence, on average daughters will be closer to the mean than their mothers. This is called regression to the mean. Mothers and daughters share exactly half of their genetic material. The heritability is thus twice the slope (i.e. slope divided by coefficient of relatedness). If you plot the height deviations of the daughters against the average of the height deviations of the two parents, then the slope is the heritability. What this means is that you can estimate the average height of your children or any other heritable trait by taking the average height of you and your spouse and multiplying by the narrow sense heritability. The narrow sense heritability of height is about 0.8, so if you and your wife are two standard deviations above the mean, then the average of your children will be 1.6 standard deviations above the mean. If the heritability of the trait is zero, then the average of your children will be the population mean. A recently developed method, as I described in a previous post, can estimate the heritability contained in a set of genetic markers for a population of strangers.

These days, most biologists seem to downplay the importance of additive genetic effects. To me this is a perfect example of discounting the obvious as I blogged about before. Most people seem to believe that the interaction of genes or epistasis must be more important. What I like to say is that epistasis is likely to be important for biology but additive genetic effects are most important for natural selection. The reason is that we inherit genes and not genotypes. Mozart may have been the genius he was because of the specific combination of genes that he possessed but that perfect combination would not be passed on to his children. Thus any allele that confers an advantage will likely only persist in the population if it confers an advantage additively. However, in cases where the population is small and there is some inbreeding, then it could be that combinations of genes that confer a large advantage together but little individually could become fixed in the population. Hence, the way I see evolution proceeding is that it takes small additive steps and then every once in awhile it takes a big nonadditive step.

The genetic variation between people can be divided into common and rare variants. The human genome has about ten million common single nucleotide polymorphisms (SNPs) but each individual will also carry many rare mutations. However, it is possible that the variation in the common variants alone could lead to mind-boggling differences in phenotype. Consider an example due to Steve Hsu. Suppose a trait depends on 400 alleles and there is a 50% chance of getting one of these alleles. Then on average you will have 400*0.5= 200 alleles and the variance around this mean will be 400*0.5*0.5=100. Hence, the standard deviation will be 10 alleles. That means 95% of the population will have between 180 and 220 alleles. This also means on average each allele contributes 0.1 standard deviations. A superoutlier who is four standard deviations away has 240 alleles. That still leaves a lot of room for improvement. If you happened to have all of the alleles, which has a probability of one half to the 400 power, you will be 20 standard deviations above the mean! Now, it could be that nonlinear effects could kick in if you have lots of alleles to saturate the effect. I wouldn’t expect any person could be 20 standard deviations above the mean but some traits could have great room for expansion and selective breeding on additive effects in animals have shown dramatic increases in phenotype.

thats a really good post tho i have to read it more. of course, plomin (uk, behavioral gens). lalandlab. ac.uk point out issues with additive genetic variance, (i’m with lalaland).i can get mean above std deviations; best thing to do is leave me be.

LikeLike

on reading this a little more, i still think its a nice example, and ‘the most dangerous equation’ reference is good. but, i guess i’m on the ‘evan charney’ side of things (Duke U)—he has a recent article on independent science news on ‘missing heritability’. I view the whole additive genetic variance thing to be more or less an assumption similar or equivalent to fisher’s fundamental theorem or hardy-weinberg equilibrium. Alan hastings among others showed how easily those assumptions fail (as has been equivalently shown in general equilibrium theory in econ by scarf, manetl, debreu, sonnenenshein).

In stat mech, the ergodic hypothesis is essentially the same thing (law of large numbers) and KAM, chaos theory, etc. show while it is a good approximation, its wrong. (of course that depends on how one defines ergodicity; in my definition, its a tautology, so always true; also, its possible its true in the standard version—eg Yakovenko’s take on the income distribution as Boltzmann—an ‘eternal return’ universe as predicted by poincare and the hindus.

however, i tend to think pleitropy, epistasis, and gene-environment correlations are very important, though these non-additive effects may be so small and self-canceling they can be neglected in practice. this is the view seen in economics by R Lucas (rational expectations)—people are perfectly rational and knowledgeable, the market always clears, gross subtitutability applies etc so the only reason there are apparently deviations from perfect equilibrium (market crashes, ‘involuntary umployment’ (an oxymoron to Lucas and the chicago school) is due to ‘miracles’ or ‘shocks’.

In ecology you see the same thing in Hubbard and Alan McKane’s approach to biological speciation—the so-called ‘neutral theory’. “We’re just going to assume the world is a Gaussian distribution, and even write down a fokker-planck equation for it and add a few extra plausible terms that cancel out fortunately, so we can solve it. You find the earth is essentially a gaussian distribution,—-only a few really tall people or high mountains, lots near the average, etc. ‘

(They have a Footnote—this theory may not explain why some areas of the earth are like Tibet and quite high, or like death valley in ca or the dead sea, low, nor why the north koreans and south koreans differ in height, nor why the rich and poor seem to have assortive interactions, nor why there are gaps or bifurcations of missing links so you dont simply scale up ants to get bigger and bigger ones instead of elephants or humans. But, one possibility may be the one shown by Darcy Thompson (laws of form) and Sapir-Whorf. Humans and elephants actually are equivalent and large ants (or apes) , as can be seen by the Nash embedding theorem (or Renee thom)—if you put on the correct rose colored glasses, call a spade a spade and a shovel on your iphone, call a cat and a hat a teacup (G H Hardy), then send it to von Neumann who with only 4 parameters can fit your elephant (into an ant) and with 5 make him or her wiggle his or her trunk, and you get another grant, which is the proof).

i see pnas also has a paper on missing heritability. it seems the view there is like ‘dark energy’—the fact that we can’t explain 90% of what is going on, confirms that our theory is correct, because all the other possibilities, if true, would merely work to strengthen our account. Like redistricting—sure, you could use some sort of scientific approach to avoid suggestions of gerrymandering, but even if you did it our candidates would win anyway, which proves we are right.

The nonlinearities are what is interesting i think, and to deal with those one has to deal with useful ‘units of selection’—-single mutations i dont think are very useful (though of course its not wrong—i think russel and whitehead showed you could prove arithmatic in 1000 pages if you worked just with 0’s and 1’s. But sometimes one gets new insights by clumping or coarse graining things—eg you get Godel’s theorems if you move from presberger arithmatic to peano (ie multiplication and prime numbers…) then you can get the reimann hypothesis, etc.

Also if ‘more (really) is different (PW Anderson, possibly via entanglement (or S J Gould’s and Goldman’s web of tangled banks) it may not make any sense to think atoms or nucleotides or genes etc. are anything more than reifications, and idle worship—-though, of course, pragmatically idle worship (like debates in congress) are quite useful things to encourage to keep people from being idle and instead working to support megachurches, etc. (Charney, mentioned above, works at Duke—which of course is a sort of, unsordid, side product of tobacco (which of course was invented by native americans so they could destroy europeans and conquest the world, without even having to leave home and risk drowning; it was all a big plot in which they engineered the fitness landscape so that columbus, etc. were attracted to and finally swalloed by a black hole). Duke also employs N Scafetta in physics (who also works with Bruce West author of ‘Nonlinearity’), who has some really interesting theories aiming to replace the ‘myth of global warming’ with some sort of weird copernican cycles on cycles of things connecting the rise of civilization, climates, etc. to sunspot cycles. (he probably has von neumann on staff). (See Wattsupwiththat blog or realclimate).

ola

LikeLike

@ishi There is no missing heritability. It’s just many variants of small effect. e.g. see papers by Visscher and my recent paper in PLoS Genetics.

LikeLike

just a question—what do you think of the paper in PNAS by lander, zuk, etc. which looks at epistasis. there is another PNAS paper 2008 which seems to have a different view; j crow and slatkin (old school) seemed to have chimed in..

one can also look at a guy at UNC whose name i forget or an old paper in PLOS genetics (2013) by hermani etc.

its why maybe some should think of the old FPU experiments. (4th order harmonic oscillators).

its a higher order markov process (n g van kampen) , despite hsu.

LikeLike

I think epistasis can perturb heritability estimates but it’s not as much as Lander thinks. Common variants seem to be mostly linear.

LikeLike

thanks for your reply. douglas wahlsten at UNC greensboro is the person i was thinking of; but he’s visiting i guess from canada. (he had an old paper in Bull Behav Scis. I’m sortuh doing a hobby and doing an ehrenfest urn model of the various beanbag genetics (using various coins to describe genetics variations–large and small effects). i don’t ‘have a dog in this fight’ so i’m agnsotic. (cahrney’s paper i looked at again (with beckwith of harvard) and its not at all conclusive (like miss measure of man).

even the FPU expermients (ulam, etc.) don’t seem to be conclusive—-it ergodizes acording to some people (maybe christian beck (uk), or mackey (canada) if you wait enough time, or bruce west nc—arxiv (bad news)

).

ornstein and weiss (bull am soc 91 or so) i take my cue from (also cosma shalizi of cmu/sfi, or say slutsky-yule (1930’s, or skolem—-lst theorem) ).

my view is linear approximation will always work (like ergodicity on an attractor, please wise linear, wavelets–gs mackey ) but the nonlinearity is what is interesting. its a higher order markov process. (dont be ashamed from where you’re from).

LikeLike

[…] and diseases like height and type 2 diabetes depend on a large number of linearly acting genes (see here). Their genetic effects are additive. Any nonlinear interactions they have with other genes (i.e. […]

LikeLike

Really nice overview. I liked the point that the reshuffling of recombination probably means that additive variance is more important for natural selection than epistasis.

But at the risk of sounding like a 6-year-old, I wish this had pictures. Some of the explanations would be more digestible with charts, e.g. a dummy plot of mothers’ vs. daughters’ trait values in the narrow-sense heritability example.

-a student

LikeLike

@jebriner I’ll take that into consideration in future posts

LikeLike

[…] is the fraction of the variance of a trait due to genetic variance. Here is a link to a previous post explaining heritability although as my colleague Vipul Periwal points […]

LikeLike

Some comments:

“say one person is 178 cm and another is 176 cm and are genetically identical except for a handful of genetic factors. If they were subjected to the same identical environmental conditions then we could attribute the difference in height to those genetic differences. Obviously, there will be many other genetic factors that specify why the height is on average 177 cm and not say 100 cm, namely all the genetic factors that are identical.”

This is just Storytelling. Proofs:Zero

Missing heritability is a fact. The sum of things irrelevant, does not became relevant. GWAs is just the search of simple causes of a complexity that it is not possible to understand under the paradigm of reductionism and gen-centrism. And in a context assumed that the idea from 1800, the separation of nature and nurture, is recalcitrant. Current knowledge shows that it is just an arbitrary separation.

I think that Heritability does work in breeding, where actually exists a selection. There is a common “environment” , or there are different environments for each organism but it does not affect the character that it is selected. In that population, in that moment. This is the only utility. All the stuff about heritability and behavior and twin experiments is science fiction.

And what about genes? Population genetics is based on the metaphors of mendelian alleles! Genes are what the context ( defined inseparably by development and “environment”) decides. Even same sequences of DNA (or the sum of sequences) can result in differences depending on the individual.

Genetic determinism insist that knowing the sequences ( o many of them) can predict a behavior or a phenotype. Additive effect is a fantasy that it is not according with current knowledge. That it is necessary just to maintain a dogma. And this is the reason why “I like to say is that epistasis is likely to be important for biology but additive genetic effects are most important for natural selection”. Biology is the science that study life empirically but natural selection is just tautology, a semantic ghost.

All this wrong approaches could be corrected and replaced for Biology but there is so much efforts to persist in the dogma of natural selection and gen centrism.

LikeLike