F-statistics
Encyclopedia : F : FS : FST : F-statistics
The concept of F-statistics was developed during the 1920s by the American geneticist Sewall Wright, who was interested in inbreeding in cattle. However, because complete dominance causes the phenotypes of homozygote dominants and heterozygotes to be the same, it was not until the advent of molecular genetics from the 1960s onwards that heterozygosity in populations could be measured.
Definitions and equations
The measures FIS, FST, and FIT are related to the amounts of heterozygosity at various levels of population structure. Together, they are called F-statistics, and are derived from F, the inbreeding coefficient. In a simple two-allele system with inbreeding, the genotypic frequencies are:
- [ p^2 + Fpq] for AA; [2pq(1-F)] for Aa; and [q^2 + Fpq] for aa.
- [ F = 1- \frac(f(\mathbf))} (f(\mathbf))}, \!]
- [ \operatorname(f(\mathbf)) = 2\, p\, q, \!]
For example, consider the data from E.B. Ford (1971) on the scarlet tiger moth:
| Genotype | White-spotted (AA) | Intermediate (Aa) | Little spotting (aa) | Total |
|---|---|---|---|---|
| Number | 1469 | 138 | 5 | 1612 |
From this, the allele frequencies can be calculated, and the expectation of f(AA) derived:
[p = = 0.954]
[q = 1 - p = 0.046]
[F = 1- \frac = 1- = 0.023]
The different F-statistics look at different level of population structure. FIT is the inbreeding coefficient of an individual (I) relative to the total (T) population, as above; FIS is the inbreeding coefficent of an individual (I) relative to the subpopulation (S), using the above for subpopulations and averaging them; and FST is the effect of subpopulations (S) compared to the total population (T), and is calculated by solving the equation:
- [(1-F_)(1-F_) = (1-F_),]
Partition due to population structure
Consider a population that has a population structure of two levels; one from the individual (I) to the subpopulation (S) and one from the subpopulation to the total (T). Then the total F, known here as FIT, can be partitioned into FIS (or θ) and FST (or f):
- [ 1 - F_ = (1 - F_)\,(1 - F_). \!]
- [ 1 - F = \prod_^ (1 - F_) \!]
Fst
A reformulation of the definition of F would be the ratio of the average number of differences between pairs of chromosomes sampled within diploid individuals with the average number obtained when sampling chromosomes randomly from the population (excluding the grouping per individual). One can modify this definition and consider a grouping per sub-population instead of per individual. Population geneticists have used that idea to measure the degree of structure in a population.
Unfortunately, there is a large number of definitions for Fst, causing some confusion in the scientific litterature. A common definition is the following:
[ F_ = \frac(p)} \!]
where the variance of [ p ] is computed across sub-populations.
Effective population size
F can be used to define effective population size.
Path coefficients
External links
- http://darwin.eeb.uconn.edu/eeb348/lecture-notes/wahlund/wahlund.html
References
From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.
