Degrees of freedom (statistics)
Encyclopedia : D : DE : DEG : Degrees of freedom (statistics)
- For other senses of these terms, see degrees of freedom or degree.
Residuals
In fitting statistical models to data, the vectors of residuals are often constrained to lie in a space of smaller dimension than the number of components in the vector. That smaller dimension is the number of degrees of freedom for error.
Perhaps the simplest example is this. Suppose
- [X_1,\dots,X_n\,]
- [\overline_n=]
- [X_i-\overline_n\,]
An only slightly less simple example is that of least squares estimation of a and b in the model
- [Y_i=a+bx_i+\varepsilon_i\ \mathrm\ i=1,\dots,n]
- [e_i=y_i-(\widehat+\widehatx_i)\,]
- [e_1+\cdots+e_n=0,\,]
- [x_1 e_1+\cdots+x_n e_n=0.\,]
The capital Y is used in specifying the model, and lower-case y in the definition of the residuals. That is because the former are hypothesized random variables and the latter are data.
Another simple and frequently seen example arises in multiple comparisons.
Parameters in probability distributions
The probability distributions of residuals are often parametrized by these numbers of degrees of freedom. Thus one speaks of a chi-square distribution with a specified number of degrees of freedom, an F-distribution, a Student's t-distribution, or a Wishart distribution with specified numbers of degrees of freedom in the numerator and the denominator respectively.
In the familiar uses of these distributions, the number of degrees of freedom takes only integer values. The underlying mathematics in most cases allows for fractional degrees of freedom, which can arise in more sophisticated uses.
See also
External links
- http://seamonkey.ed.asu.edu/~alex/computer/sas/df.html
From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.
