Redundancy (information theory)
Encyclopedia : R : RE : RED : Redundancy (information theory)
Redundancy in information theory is the number of bits used to transmit a message minus the number of bits of actual information in the message. Data compression is a way to eliminate unwanted redundancy, while checksums are a way of adding desired redundancy for purposes of error correction when communicating over a noisy channel of limited capacity.
Quantitative definition
Recall that the rate of a source of information is (in the most general case)
- [r=\mathbb E H(M_t|M_,M_,M_, \dots), ]
The absolute rate of a language or source is simply
- [R = \log |M| ,\,]
The absolute redundancy can then be defined as
- [ D = R - r ,\,]
The quantity [\frac D R ] is called the relative redundancy and gives the maximum possible data compression ratio, when expressed as the percentage by which a file size can be decreased. (When expressed as a ratio of original file size to compressed file size, the quantity [R : r] gives the maximum compression ratio that can be achieved.) Complementary to the concept of relative redundancy is efficiency, defined as [\frac r R .] A memoryless source with a uniform distribution has zero redundancy (and thus 100% efficiency), and cannot be compressed.
See also
References
- Fazlollah M. Reza. An Introduction to Information Theory. New York: McGraw-Hill 1961. New York: Dover 1994. ISBN 0486682102
- B. Schneier, Applied Cryptography: Protocols, Algorithms, and Source Code in C. New York: John Wiley & Sons, Inc. 1996. ISBN 0471117090
From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.
