Linas Vepstas <firstname.lastname@example.org>
Date: 12 October 2004 (revised 15 November 2004)
This paper is part of a set of chapters that explore the relationship between the real numbers, the modular group, and fractals.
Given a real number , a sequence of integers can be found that define its continued-fraction expansion:
Given a rational and an arbitrary complex number w, define new sequences:
There is also an even-odd symmetry for that we should be aware of, as it plays another important role in the theory. As we point out above, for any given sequence, there is a + and a - generalization. But what if the sequence is already and we choose the - expansion? Then we get
If one needs to take differences, e.g. for derivatives, the even-odd expansion is more useful: Define and as follows:
Let use define the gap as
The teeth at first appear to be random: below follows a scatter-plot showing for a thousand randomly generated small-ish rationals.
However, closer examination reveals some surprising details. Here's the same picture, but this time showing only the rationals (all rationals with numerator between 1 and 719 and denominator of 720:
Note that 720 = 6! (six factorial) and that similar but denser images appear for higher factorials: Here's one for that is, :
Notice that a Moire pattern of fringes seems to be forming at . The appearance of various parabolas at various locations seems to indicate that there are some sort of curious algebraic relationships between the rationals that are worth exploring. The factorial in the denominator seems important: the picture for the denominator blurs out the features, and mostly looks much more random, as shown below:
Now that we know what sort of pattern to look for, we can find the cross-bars in this picture, for a denominator of 1024:
The analogous picture for a denominator consisting of powers of 3 is less distinct. Pictures for prime denominators appear considerably more random, although containing hints of a different kind of structure. For example, here is a picture with denominator 1023:
The next picture below is a scatter-plot showing all rationals with denominators less than 200. This now clearly exhibits a fantastic self-similar fractal latticework:
We will recognize in later chapters that these curves correspond to hyperbolic maps of the unit interval back onto itself, generated by elements of the modular group acting on binary trees. It would be interesting to specify the shapes of all the various bifurcating curves that seem to appear in this latticework, and to give the algebraic equations whose solutions are plotted above. Note that because a curve can seemingly bifurcate into another curve in many places, enumerating all of them would be a trick.
There are more interesting things to be seen, but first we should provide a simpler expression for at . Recall that for any fixed rational , that is of the form for some positive integers . For any given, fixed , there exists an such that there are no poles in in the disk of radius around (Homework: prove this), and thus its analytic on this disk. This justifies an expansion in ; we'll give an exact expression in a later section, showing that these manipulations are safe. For the continued fraction we write the 'th partial convergent as which obeys the recurrence relation . Substituting, we get the explicit recurrence relations for each term:
For reference, overdrawn over the image is the parabola. The vertical black stripes occur at the Farey numbers, and appear to be ordered in size according to the Farey tree (i.e. the largest stripe is at , the next largest at , and the next smaller ones at and etc.). The progression of the horizontal bands are clearly related to the Farey tree through the parabola. Since the parabola gives a rational out when fed one in, we see that we've discovered a different tree of rationals that's related to the Farey tree. One can see that any ratio of polynomials will this give a tree of rationals. We discuss the Farey Tree (or Stern-Brocot Tree) in a different chapter.
The filigree pattern exists only for small rationals. If we start sampling larger rationals, the distribution becomes smooth. The picture below shows a random sampling of rationals with denominators less than 2 million. The image is a bit grainy because the sample size is not large enough; the graininess goes away with larger sample sizes.
This picture tells us something that is surprising: this function is space-filling and dense; it has no holes or islands; it has a fractal measure of 2.0. Even in the land of fractals, this is fairly unusual, as this function is, after all, not some self-similar curve, but rather a 'plain-old' map from the unit interval to the unit interval. By 'dense', I mean the traditional Cauchy-sequence notion of density: for any real values and positive , we can find a rational such that and . Of course, a computer generated picture is not a proof, but a proof is straight-forward, and is discussed below. I am not aware of any other map that has this property.
The distribution of the gaps can be easily found to be which is sharply peaked at : this is the thin red stripe at the bottom of the above picture. This distribution is due to the Jacobean of a parabola, which we derive below.
One can get a better idea of the behavior of by graphing it, as a function of , for a number of representative values of . From the above analysis, we should suspect that for , the leading terms are of magnitude which would make it hard to directly compare different values of So instead, we introduce a 'normalized' variant: and similarly . This is done below, for 200 evenly spaced values of . On the horizontal axis is running from 0 to 1, and along the vertical axis, , from 0 at bottom to 1 at top:
Clearly, it can be seen that is a monotonically increasing function of , which follows easily by examining the partial sums: if then
One finds a mirror symmetry between the even and odd forms: , which is also not quite obvious, given the construction. Its is clear from this picture that the leading and terms dominate. We can guess at the form of these terms: we define and graph it below, exposing the randomness of the cubic and higher terms directly.
The seeming parallel-ness of some trajectories is an optical illusion, due to the pixelization of the picture. There are more pictures, with a greater number of strands, at the Gap Room http://www.linas.org/art-gallery/farey/gap-room/gap-room.html.
The size of the gap can be given an exact expression in terms of the convergents of the continued fraction. Performing this exercise will help explain some of the fractal behavior seen in the previous sections.
One gets the convergent of a continued fraction by evaluating the fraction only up to the 'th term, and expressing it as a ratio . The convergents can be expressed recursively:
The distribution density of the gaps on the unit square can now be understood in terms of the above parabolic formula, as a change-of-variable from the underlying distribution of . Numerically, we can confirm that this distribution is, in a certain sense, perfectly uniform on the half-unit square. Now that we understand that the more fundamental quantity with regard to distributions is , lets repeat some of the earlier graphs. These are shown in and . Visually, they don't differ much from thier earlier analoguous, except for an effective rescaling of . Nonetheless, they are presented here for reference.
The figure above shows the distribution of the ratio of the final two convergent denominators for all rationals with denominators less than 200. Note that the scale along the x-axis runs from 0 to 1 but along the y axis, from 0 to 1/2. To create this distribution, we create an empty grid that is MxM=600x600 pixels in size. We then consider a fraction and its denominator ratio . If we have that and for some integers , then we increment the value at pixel by one. When we are done, we visualize the grid by assigning black to empty pixels, blue to pixels with a very small count, green to pixels with a medium-small count, yellow to pixels with a medium count, and red to pixels with a large count.
The figure above shows the distribution of the ratio of the final two convergent denominators for all rationals with denominators less than 1800. Note that the scale along the x-axis runs from 0 to 1 but along the y axis, from 0 to 1/2. Thus, the largest horizontal line is at , with the remaining lines distributed at the Farey Fractions. The uniform blue color indicates a fairly even distribution, with the coloring as before: red indicating an excess, and black a deficit. As one goes to larger denominators, the uniformity domintes, with the distribution tending towards even-ness.
It is of some interest to examine distribution of other convergents. This section reviews some of these.
The figure above shows the distribution of the ratio of the pre-final two convergent denominators for all rationals with denominators less than 1800. Note that the scale along the x and y-axis runs from 0 to 1. Note that this figure is fundamentally different, in a certain sense, than the previous figures. Here, as one goes to the higher denominators, one seems to develop both a uniform background, and a filigree superimposed on top. This behaviour is qualitatively different than what one sees when one just considers the final ratios.
The figure above shows the distribution of the ratio of the pre-final two convergent denominators for all rationals with denominators less than 16200. The uniform distrubtion,with a pattern overlay persists to high orders.
The figure above shows the distribution of the ratio of the two convergent denominators for all rationals with denominators less than 1800, and which have at least two terms (so that ).
The figure above shows the distribution of the ratio of the two convergent denominators for all rationals with denominators less than 1800, and which have at least three terms (so that ).
The figure above shows the distribution of the ratio of the two convergent denominators for all rationals with denominators less than 1800, and which have at least four terms (so that ).
The figure above shows the distribution of the ratio of the two convergent denominators for all rationals with denominators less than 1800, and which have at least five terms (so that ).
The relationship between convergents implies that the matrix
The computer efforts at first seem to paint two contradicting interpretations about the distribution of the gaps. For small denominators, there seems to be a detailed and fine structure; yet, for large denominators, this structure seems to blur away, and one gets the impression of a perfectly uniform distribution. In fact, both conclusions are correct. Each of the pictures above and below were created with a uniform pixel size: 600 pixels dividing up the interval 0 to 1. For small denominators, each pixel may contain only one gap, and maybe none; for large denominators, each pixel may represent the average of dozens or hundreds of gaps. It seems reasonable to conclude that the structure of horizontal and vertical bars is in fact scale-invariant, and persists down to infinitesimal scales. Looking at large-denominator distributions with only 600x600 pixels effectively blurs all the structure away. Thus, on a coarse-grain, the distribution of the gaps appears to be perfectly random; a finer choice of binning, while holding denominators fixed, would lead to pictures of detailed structures at finer levels, and so on.
Thus, in order to talk about the gaps for ``all rationals'', one has two competing limits that give different answers. In one case, we take the size of the pixels smaller and smaller, and find, for any fixed scale of denominators, that there is a highly fractal filigree. In the other case, we hold the size of the pixels fixed, and take the limit of larger and larger denominators, and find a perfectly uniform distribution. It seems impossible to define the distribution ``in and of itself'' without resorting to talking about pixels at some point.
The incredible 'randomness' of the distribution, as well as its Cauchy-sequence-density on the unit square can now be easily understood. To do this, consider the iteration of the map
Lets try to restate the paradox. We would like to be able to say, that, when considering the entire set of reals on the unit interval, that, when examining the 'th binary digit in the expansion, that 'th digit is completely random, and is uniformly distributed (equipartition of probability), with the probability of the digit being 0 being 1/2 and the probability of it being 1 is also 1/2. But of course, this statement is patently false, and therein lies the paradox. We can, of course, trivially and exactly predict the 'th digit, as the Bernoulli process is completely deterministic. The 'th digit is trivially and the graph of is a square-toothed comb. One has this problem even for getting very large, and approaching infinity: the comb remains uniform and entirely ``predictable'' no matter how large gets.
I suppose there are two ways out of this paradox. One way out is to say something like ``Yea, Verily, Chooseth two Irrationals whose Binary Expansions are Completely Uncorrelated'', which sounds like an invocation of the Axiom of Choice. Indeed, the set of computable numbers is countable, and is of measure zero on the real number line. In order to compute the 'th digit of , one must somehow already 'know' . Since the set of of unknowable numbers has measure one on the unit interval of reals, one arguably must 'choose' if one is to get a truly 'random' number. Thus, we can anchor the concept of randomness of bit-sequences of the binary expansion of real numbers in the concept of Turing-uncomputable numbers and the Axiom of Choice for choosing one of these unknowable numbers. This is cleary dangerous territory, and somewhat tangled as a definition of randomness.
The other way out of the paradox is far more mundane, and seems constructive, and that is to apply shop-worn statistical methods. After all, chaos occurs in numerical simulations, on finite and computable sets, and not in set-theoretical limits. Chaos is a computational phenomenon. What we can say is that for any given real and and , one can always find an integer such that the average value of approaches 1/2 on the interval: that is, there exists an integer such that
Now, to move the conversation back to the iterated continued fraction map, and that proof.
The work of Cantor shows that the cardinality of is the same as that of , namely . This implies that the points of can be enumerated by . Space-filling curves such as those of Peano or Hilbert can be used to develop that enumeration. However, these curves have a locality property, in that if two points are close to each other in , then the points that they enumerate in are also close to each other. This is by construction, of course: the curves of peano or Hilbert are inherently continuous. We can, for example, graph the distance of the Hilbert curve from the the x-axis as a function of the parameter that takes us along the curve. By construction, this is a continuous curve, and it is *not* space-filling.
By contrast, the gap function is not a curve; by construction, it seems to be discontinuous everywhere.
The cause of the principal parabola seen in the pictures is now also clear. Blah Blah Blah write this section too.
The main parabola that is visible is at , which we can solve for directly to obtain and . Recalling that by definition, , we find that the rationals on the main parabola are given by and . Blah blah blah solve these equations.
Exploring the shapes of the hyperbolic maps seen in the pictures requires the development of the theory of these maps as the hyperbolic rotations of a binary tree, through the action of the Modular Group. The development of this theory is done in a later section. We only quote the results xxx here. Blah Blah Blah. What rational numbers show up on which curves? The families of the curves. Write this. So this is another to-do; maybe its own chapter. Again, a deep relationship to the modular group.
The seemingly pure randomness of the gap sizes is quite intriguing, and suggests possible relationships to similar phenomena in other areas. The (Big and little) Picard theorems comes to mind. The little theorem states that an entire function will attain all possible values, save one or two. The Big Picard theorem states that a function with an essential singularity will attain all possible values (with one or two exceptions), infinitely often, within a finite domain of the singularity. Although the gap isn't analytic, I still find the Picard theorems suggestive. We will develop the tools to analyze the gap in later chapters; in the meanwhile, we note that , or, if you prefer, , has an essential singularity at zero, and is instrumental in the construction and description of continued fractions.
We also are reminded that cryptographic hashes depend on the ability to distribute points randomly on the unit square, although they do so only on a finite-sized (but large) lattice. We know that the modular group is central to the study of chaotic dynamical systems and fractals; we find it intriguing that the modular group also underpins the study of elliptic curves, which lead to concepts of elliptic-curve cryptography. Perhaps the seemingly inherent orderly randomness seen in each are different manifestations of the very complex and surprising structure of the modular group.
I'd like to go so far as to propose the 'Fundamental Theorem of Cryptography': Since the gaps are Cauchy-dense on the unit interval, one is always free to pick any sequence of points that one may possibly wish, and then use the as their cryptographic encoding. That is, the sequence of is the plain-text, whereas the sequence is the crypt-text. This encoding is provably unbreakable for any plaintext because there exist an infinite number of possible alternate decodings of that come arbitrarily close to the cryptext, but are arbitrarily distant from the plaintext ; indeed, this is what it means for a set of points to be Cauchy-dense on a plane. What is truly remarkable here is that is a function, and not some fractal, space-filling curve.
As a final bit of madness, let us note that the gap contains all possible stochastic processes on the unit interval. That is, given a stochastic process, and the usual , one can find a sequence of strictly (monotonically) increasing rationals that encode the 'information' in that stochastic sequence to some arbitrarily accurate level. But, by construction, this sequence inherits the modular group symmetry of the continued fractions. Thus, in a certain sense, one might say that stochastic processes have a (perfectly) hidden modular group symmetry. The symmetry is revealed only when one tries to construct things out of the the stochastic sequence; taking, for example, Diffusion Limited Aggregation (DLA). The self-similarity and scaling properties become manifest. In the case of DLA, the dendrites are inherently self-similar. By 'decoding' a pair of dendrites to their representative rationals, one then has a map between the two dendrites, given by the modular group, explicitly exhibiting their self-similarity (to within the of the decoding). One could thus hand-wavingly say, the perfectly random distribution of the gaps is the ``reason'' why self-similarity and scaling appears in systems constructed out of random numbers. The self-similarity and scaling are really just a manifestation of the deeper (fractal, modular group) symmetry of the rationals themselves.