Opentopia Directory Encyclopedia Tools

Normal number

Encyclopedia : N : NO : NOR : Normal number


A different topic is treated in the article titled normal number (computing).
In mathematics, a normal number is, roughly speaking, a real number whose digits show a random distribution, with all digits being equally likely, all pairs of digits equally likely, all triplets of digits equally likely, etc. "Digits" refers to the finitely many digits before the decimal point (the integer part of the real number) and the infinite sequence of digits after the point (the fractional part of the real number).

While a general proof can be given that "almost all" numbers are normal, this proof is not constructive and only very few concrete numbers have been shown to be normal. It is for instance widely believed that the numbers √2, π and e are normal, but a proof remains elusive.

Definitions

Let [\Sigma] be a finite alphabet of b digits. Let [S\in\Sigma^\infty] be an infinite sequence drawn from the alphabet [\Sigma]. Let [w\in\Sigma^*] be a finite string drawn from the alphabet [\Sigma]. Let n be a positive integer. Define NS(w,n) to be the number of times the string w appears as a substring in the first n digits of the sequence S. (For instance, if S = 01010101..., then NS(010, 8) = 3.) S is normal if, for all finite strings [w\in\Sigma^*],

[\lim_ \frac = \frac]
(where [|w|] denotes the length of the string w; see also limit.) In other words, S is normal if all strings of equal length occur with equal asymptotic frequency. For example, in a normal binary sequence (a sequence over the alphabet ), 0 and 1 each occur with frequency 1/2; 00, 01, 10, and 11 each occur with frequency 1/4; 000, 001, 010, 011, 100, 101, 110, and 111 each occur with frequency 1/8, etc. Roughly speaking, the probability of finding the string w in any given position in S is precisely that expected if the sequence had been produced at random.

Suppose now that b is an integer greater than 1 and x is a real number. Consider the infinite digit sequence expansion Sx,b of x in the base b positional number system (we ignore the decimal point). We say x is normal in base b if the sequence Sx,b is normal. The number x is called a normal number (or sometimes an absolutely normal number) if it is normal in base b for every positive integer b.

A given infinite sequence is either normal or not normal, whereas a real number, having a different base-b expansion for each integer [b \geq 2], may be normal in one base but not in another (Cassels 1959 and Schmidt 1960).

A weaker property than normality is simple normality. A number is simply normal in base b if each individual digit appears with frequency 1/b.

Properties and examples

The concept of a normal number was introduced by Émile Borel in 1909. Using the Borel-Cantelli lemma, he proved the normal number theorem: almost all real numbers are normal, in the sense that the set of non-normal numbers has Lebesgue measure zero. (Borel 1909) This theorem established the existence of normal numbers, but Waclaw Sierpinski in 1917 was the first to give an example of one.

The set of non-normal numbers, even though "small" in the sense of being a null set, is "large" in the sense of being uncountable. Indeed, there are uncountably many numbers whose decimal expansion does not contain the digit 5, and none of these is normal.

Champernowne's number

0.1234567891011121314151617...,
obtained by concatenating the decimal representations of the natural numbers in order, is normal in base 10, but it is not normal in some other bases. The Copeland-Erdős constant
0.235711131719232931374143...,
obtained by concatenating the prime numbers in base 10, is also known to be normal in base 10.

No rational number is normal to any base, since the digit sequences of rational numbers are eventually periodic.

An example of a normal number is given by Chaitin's constant [\ \Omega]. Indeed, every algorithmically random sequence over the alphabet [\Sigma = \] is the base-b expansion of a normal number. A computable absolutely normal number was constructed in (Becher 2002).

It is extremely hard to prove the normality of numbers which were not explicitly constructed for the purpose. It is for instance unknown whether √2, π, ln(2) or e is normal (but all of them are strongly conjectured to be normal, because of some empirical evidence). Proofs are out of reach: it is not even known which digits occur infinitely often in the decimal expansions of those constants. David H. Bailey and Richard E. Crandall conjectured in 2001 that every irrational algebraic number is normal; while no counterexamples are known, there also exists no number that has been proven to be normal in some base and algebraic.

A disjunctive sequence is a sequence in which every finite string appears. A normal sequence is disjunctive, but a disjunctive sequence need not be normal.

Additional properties of normal numbers include:

Connection to finite-state machines

Agafonov showed an early connection between finite-state machines and normal sequences: every subsequence selected from a normal sequence by a regular language is also normal. In other words, if one runs a finite-state machine on a normal sequence, where the finite-state machine outputs the digit it just read whenever it enters an accepting state, then the sequence it outputs will be normal. (Agafonov 1968)

A deeper connection exists with finite-state gamblers (FSG's) and information lossless finite-state compressors (ILFSC's).

Schnorr and Stimm showed that no FSG can succeed on any normal sequence, and Bourke, Hitchcock and Vinodchandran showed the converse. Therefore:

A sequence is normal if and only if there is no finite-state gambler that succeeds on it.
Ziv and Lempel showed:
A sequence is normal if and only if it is incompressible by any information lossless finite-state compressor
(they actually showed that the sequence's optimal compression ratio over all ILFSC's is exactly its entropy rate, a quantitative measure of its deviation from normality, which is 1 exactly when the sequence is normal). Since the LZ compression algorithm compresses asymptotically as well as any ILFSC, this means that the LZ compression algorithm can compress any non-normal sequence. (Ziv Lempel 1978)

These characterizations of normal sequences can be interpreted to mean that "normal" = "finite-state random"; i.e., the normal sequences are precisely those that appear random to any finite-state machine. Compare this with the algorithmically random sequences, which are those infinite sequences that appear random to any algorithm (and in fact have similar gambling and compression characterizations with Turing machines replacing finite-state machines).

Connection to equidistributed sequences

A number x is normal in base b if and only if the sequence [_^\infty] is equidistributed modulo 1, or equivalently, using Weyl's criterion, if and only if
[\lim_\frac\sum_^e^=0 \quad\mbox m\geq 1.]

References

 


From Wikipedia, the Free Encyclopedia. Original article here. Support Wikipedia by contributing or donating.
All text is available under the terms of the GNU Free Documentation License See Wikipedia Copyrights for details.

Search Titles
0123456789
ABCDEFGHIJ
KLMNOPQRST
UVWXYZ?

E-mail this article to:

Personal Message: