Statistical independence
Encyclopedia : S : ST : STA : Statistical independence
In probability theory, to say that two events are independent intuitively means that the occurrence of one event makes it neither more nor less probable that the other occurs. For example, the event of getting a "6" when a die is rolled and the event of getting a "6" the second time are independent. Similarly, two random variables are independent if the conditional probability distribution of either given the observed value of the other is the same as if the other's value had not been obeserved. For example, the number obtained the first time a die is rolled and that appearing the second time are independent.
Independent events
The standard definition says:
- Two events A and B are independent if and only if Pr(A ∩ B) = Pr(A)Pr(B).
More generally, any collection of events -- possibly more than just two of them -- are mutually independent if and only if for any finite subset A1, ..., An of the collection we have
- [\Pr(A_1 \cap \cdots \cap A_n)=\Pr(A_1)\,\cdots\,\Pr(A_n).]
If two events A and B are independent, then the conditional probability of A given B is the same as the "unconditional" (or "marginal") probability of A, that is,
- [\Pr(A\mid B)=\Pr(A).\,]
When one recalls that the conditional probability Pr(A | B) is given by
- [\Pr(A\mid B)=,] (so long as Pr(B) ≠ 0 )
- [\Pr(A \cap B)=\Pr(A)\Pr(B)]
Note that independence does not have the same meaning as it does in the vernacular. For example an event is independent of itself if and only if
- [\Pr(A) = \Pr(A \cap A) = \Pr(A)\Pr(A)\,]
Independent random variables
What is defined above is independence of events. In this section we treat independence of random variables. If X is a real-valued random variable and a is a number, then the event that X ≤ a is an event, so it makes sense to speak of its being, or not being, independent of another event.
Two random variables X and Y are independent if and only if for any numbers a and b the events [X ≤ a] (the event of X being less than or equal to a) and [Y ≤ b] are independent events as defined above. Similarly an arbitrary collection of random variables -- possible more than just two of them -- is independent precisely if for any finite collection X1, ..., Xn and any finite set of numbers a1, ..., an, the events [X1 ≤ a1], ..., [Xn ≤ an] are independent events as defined above.
The measure-theoretically inclined may prefer to substitute events [X ∈ A] for events [X ≤ a] in the above definition, where A is any Borel set. That definition is exactly equivalent to the one above when the values of the random variables are real numbers. It has the advantage of working also for complex-valued random variables or for random variables taking values in any topological space.
If any two of a collection of random variables are independent, they may nonetheless fail to be mutually independent; this is called pairwise independence.
If X and Y are independent, then the expectation operator E has the nice property
- E[X Y] = E[X] E[Y],
- var(X + Y) = var(X) + var(Y),
Furthermore, random variables X and Y with distribution functions FX(x) and FY(y), and probability densities fX(x) and fY(y), are independent if and only if the combined random variable (X,Y) has a joint distribution
- :[F_(x,y) = F_X(x) F_Y(y),]
- :[f_(x,y) = f_X(x) f_Y(y).]
Conditionally independent random variables
Main article: Conditional independence
Intuitively, two random variables X and Y are conditionally independent given Z if, once Z is known, the value of Y does not add any additional information about X. For instance, two measurements X and Y of the same underlying quantity Z are not independent, but they are conditionally independent given Z (unless the errors in the two measurements are somehow connected).
The formal definition of conditional independence is based on the idea of conditional distributions. If X, Y, and Z are discrete random variables, then we define X and Y to be conditionally independent given Z if
- P(X = x, Y = y | Z = z) = P(X = x | Z = z) · P(Y = y | Z = z)
- pXY|Z(x, y | z) = pX|Z(x | z) · pY|Z(y | z)
If X and Y are conditionally independent given Z, then
- P(X = x | Y = y, Z = z) = P(X = x | Z = z)
Independence can be seen as a special kind of conditional independence, since probability can be seen as a kind of conditional probability given no events.
See also
From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.
