A leap of faith with fractal analysis ...

From: John Conover <john@email.johncon.com>
Subject: A leap of faith with fractal analysis ...
Date: Fri, 16 Jan 1998 19:26:12 -0800
    "... all this is civilization, the Sumerian God of Wisdom tells
    Inanna of Uruk, who will take it back to her city, and thence give
    it to the world. And, if you wish its ... [enigmatic]
    ... benefits, he goes on, you must take all its qualities,
    ...[including its disadvantages] ... without argument. All these
    things I will give you, holy Inannna, but once taken, there can be
    no dispute, and they can not be given back."

                                 -Unknown poet; Uruk, Sumeria, circa
                                 3000BC. (From the epic of Gilgamesh,
                                 transcribed by Sin-leqiuninni around
                                 1300BC, interpreted by the
                                 anthropologist, Michael Wood.)

The Gilgamesh is the tale of the invention of civilization by the
ancient Sumerians, somewhat before the fourth millenia,
BC. Interestingly, it contains a theme that will be echoed by poets
through the ages when describing the enigma of civilization-the
formula for a fractal.

As with any application of abstraction, the quality of the abstraction
has to be evaluated. Historically, at least in mathematics, this was
done by determining the scalability of a theory-in short, how far can
its precision be extrapolated. In the more robust theories, for
example the quantum mechanics, scalability is preserved over many
orders of magnitude.

Fractal analysis is usually used to describe the evolution of a
complex system that operates under uncertainty. Equity values over
time, industrial markets, and complex (eg., iterated game-theoretic,)
economic phenomena are examples.

An argument can be made that if equity values are a complex system
that operates under uncertainty, and so are industrial markets, then
it would be reasonable to assume that the GDP, also, is a complex
system that operates under uncertainty. This argument is based on the
fact that fractals, which describe such things, are
self-similar. (What this means is that a fractal is made up of smaller
fractals, which in turn, are made up of even smaller fractals, and so
on, without end.) And, if that is true, then we should be able to
argue that the same is true for all the GDP's in the history of
civilization.

Such an extrapolation should be viewed with scepticism. Although we
have very precise data for the financial markets in this century, (in
point of fact, the US equity market tickers are some of the most
precise data we have all of science,) we don't really have good data
on the ecology/GDP of the cultural history-and the proposal is to make
a "leap of faith" over nine orders of magnitude! (That is, see if we
can present arguments that relate the US equity market ticker, by the
second, to the history of civilization, by the century.)

It should be regarded as sloppy science to do things like this, but
since I have received several questions on the subject, I'll attempt
it-on the qualification that the analysis be viewed with a significant
degree of scepticism.

Here is the proposal. We need to take the data from the US Equity
markets, by the second, and extrapolate that data to by the day, and
then analyze the fractal probabilities of that data. Then, we need to
take the US GDP data, by the year, and extrapolate that data to by the
day, and do likewise. Then we need to look at the historical record of
cultures/civilizations, and scheme up a way of scaling that data, by
the day.

We need a metric to compare how accurate our extrapolated
probabilities are. For this, I propose that we use the Shannon
probability, based on the day as a time unit. (The Shannon probability
is the simply likelihood that the value for the next day in a data set
will increase.) The reason I propose this is because we do not have
accurate data on the GDP 's of cultures in the historical record-but
we do have reasonably good data on how long cultures persisted, (ie.,
the run length of cultures; the ancient Sumerians were pretty
meticulous with their accounting, as were the Egyptians, the Romans,
the Greeks, the Mayans in Central America, the Chinese Dynasties, as
were the European countries since the Renaissance, etc.)  Granted, the
historical perspective is subject to interpretation, but its the only
data we have. We ask only that the interpretations be axiomatically
consistent.

Here is the strategy. We will look at the distribution of run-lengths,
(ie., how long cultures/civilizations persisted,) and see if it fits a
known distribution, within reason. If the distribution assembles
itself into a reciprocal of the square root of time, (time being
measured in centuries,) then this will give some, but not a lot, of
credibility that civilization/culture is a fractal system that
operates under uncertainty. Since the US is in its second century, we
can use the probability that the run-length will continue in the
second century to compute the Shannon probability for the entire
century, extrapolate it to the day, and compare it with the
extrapolate data for the US equity markets and GDP, for this century.

The argument here is that if civilizations/cultures are fractal
systems, then they should start, and for the most part, grow for two
centuries, on average, then decay back to zero for two centuries, on
average. During the first two centuries, since the GDP/ecology is
expanding/growing, the Shannon probability will be greater than one
half. In the last two centuries, on average, the Shannon probability
will decrease, and be less than one half-being exactly 0.5, on
average, over the a average of four centuries.

Do we see a distribution of the run lengths of civilizations that is a
reciprocal of the square root of time? The data set is limited, (about
400 samples, far too small for analytical comfort,) but we do see
something that appears to be similar to what we expected, (The ancient
Egyptians won the duration lottery-about 30 centuries-which would have
a theoretical probability of about 18.3%. Note that we should have
found more, but our data set only extends only over a time interval of
60 centuries-about twice the duration of the ancient Egyptian
civilization.) The Romans were next, at 8 centuries, with most lasting
one century, or less. The average is about 3.6 centuries-which is
close to the 4 we expected. (Note, this type of distribution could not
be made by a system that operated on simpler statistical mechanisms; a
pure Gaussian distribution doesn't fit, since virtually all of the
distribution would be beyond the left three sigma limit of the curve,
instead of at 4 centuries; a random mechanism would have given a
"splatter" that have produced an equal probability of a civilization
lasting one century, or thirty. Not compelling arguments, but
interesting, nonetheless.)

So, it would appear that, although our data set size is too small to
justify any optimism, that, at least there is some, limited,
circumstantial evidence that culture/civilization evolves as a complex
system that operates under uncertainty.

Performing our "leap of faith," and calculating the Shannon
probability for the ecological growth in second century of a "typical"
culture/civilization, by assuming that the average growth in the
second century, will be reflected, on average, to all the constituent
fractals that make up the cultural/civilization fractal:

    G(t) = sqrt (t)

where t is time, measured in centuries-and taking the derivative with
respect to time to get the marginal growth, M:

    dG(t)        1
    ----- = ------------
     dt     2 * sqrt (t)

and calculating this value at 1.5 centuries, (since we will be
comparing this value to the value for equity markets obtained in the
last half of the second century in the US,) and 2 centuries:

    dG(t)        1
    ----- = -------------- = 0.4082483
     dt     2 * sqrt (1.5)

    dG(t)        1
    ----- = ------------ = 0.3535534
     dt     2 * sqrt (2)

and average the two to get M = 0.3819008.

This means that there should have been a 38% growth, on the average,
in the second century of cultures that lasted longer than two
centuries. I now need to solve for the Shannon probability, by the
century, that would give this growth, ie., solve the equation:

    ln (1 + M) = 1 + P ln (P) + (1 - P) ln (1 - P)
      2                  2                2

which gives P = 0.87859, for the second century of an average
culture/civilization. I now need to convert this value from a century
value, to a daily value. Knowing that:


        R + 1
    P = -----
          2

where R is the root mean square of the century fluctuations:

        0.75718 + 1
    P = -----------
             2

and, using since the daily fluctuations add root mean square to get
the century fluctuations, and there are 25,700 business days in a
century, (because I will be comparing it to equity data, taken by the
business day,) to get the root mean square of the daily fluctuations,
r:

    r = 0.75718 / sqrt (25700) = 0.004723159

and the daily Shannon probability, p, would be:

        r + 1
    p = ----- = 0.5023616
          2

For the analysis of the US GDP, the data came from
http://www.doc.gov/BudgetFY97/index.html, and is by year, 1940 to
1995, inclusive. Again, the data set is far too small. There is an
issue since the equity market numbers are not adjusted for inflation,
(and it is not clear what term "duration of civilization, adjusted for
inflation" would mean,) so I will derive the fractal statistics for
the US GDP in both real and non-adjusted dollars.

The average of the marginal increments of the US GDP is 0.082307, and
the root mean square of the marginal increments is 0.095762. From
this, the non-adjusted Shannon probability, by year, can be
calculated:

        0.082307
        -------- + 1
        0.095762
    P = ------------ = 0.9297477
            2

and, since the daily fluctuations add root mean square to get the
annual fluctuations, and there are 257 business days in a year,
(because I will be comparing it to equity data, taken by the business
day) I need to calculate the Shannon probability of the non-inflation
adjusted US GDP by day, p:



        0.082307
        --------
        0.095762   + 1
        ----------
        sqrt (257)       0.05799607 + 1
    p = -------------- = -------------- = 0.5268069
              2                2

Since there are no reliable inflation numbers, (including the CPI,) by
the year, I will use the deflation numbers provided by the CBO, (which
are a running exponential,) and strike a curve through the inflation
adjusted GDP, calculate the growth, to derive the Shannon probability
for the inflation adjusted US GDP. In 1940 the inflation adjusted US
GDP was 831.735 billion. In 1995, it was 5,438.699 billion-a factor of
growth of 6.53898 in 55 years. Solving:

       55
    (x)   = 6.53898

or:

    55 * ln (x) = ln (6.53898)

and:

         ln (6.53898)
         ------------
              55
   x = e              = 1.034731

and solving the equation:

    ln (x) = 1 + P ln (P) + (1 - P) ln (1 - P)
      2              2                2

to get the Shannon probability of the inflation adjusted US GDP, by
the year, which is P = 0.6299.  Again, using the fact that the daily
fluctuations add root mean square to get the annual fluctuations, and
there are 257 business days in a year, (because I will be comparing it
to equity data, taken by the business day) I need to calculate the
Shannon probability of the inflation adjusted US GDP by day, p:

                 R + 1
    P = 0.6299 = -----
                   2

where R is the root mean square of the annual fluctuations, or:

    R = 0.2598

and dividing R by the square root of 257 to get the root mean square
of the daily fluctuations in the inflation adjusted US GDP, r:

    r = 0.01620588

and the daily Shannon probability, p, would be:

        r + 1   0.01620588 + 1
    p = ----- = -------------- = 0.5081029
          2           2

For the analysis of the US equity markets, the data came from the
NYSE, by second, and was combined for all stocks, integrated and
sampled to directly convert the second ticker data to daily ticker
data, for the inclusive annual time interval, 1966 to 1996.  The
average of the marginal increments of the NYSE is 0.000290, and the
        0.00029
        ------- + 1
        0.00851       0.03407756 + 1
    p = ----------- = -------------- = 0.5170388
             2              2

Here is how the Shannon probabilities, extrapolated over nine orders
of magnitude, compare:

    non-inflation           NYSE       inflation adjusted        duration of
    adjusted US GDP                      US GDP                 civilizations

       0.5268069          0.5170388         0.5081029              0.5023616

or about 0.5135776 +/- 2.33%

Which, depending on one's point of view, are reasonably close. We now
have to ask the question, "how reliable are these probabilities?" It
turns out that traditional statistical estimation techniques let us
down in fractal analysis, (we are hunting for the probability of a
probability that controls a system's evolution over a time interval.)

The assumption was that we were investigating the evolution of a
complex system that operates under uncertainty-ie., a fractal. But
there are whole families of fractals, and the paradigm assumption was
that we were dealing with the simplest fractal of all, fractional
Brownian motion. How valid was that assumption? What is the chance we
were misled and it is really some other kind of fractal?  These are
complicated questions, and depend on the magnitude of the
probabilities under analysis.

The analysis of a system that is controlled by larger Shannon
probabilities can get by with smaller data sets. Smaller Shannon
probabilities require larger data set sizes, (for example, the
analysis of fractional Brownian motion with a Shannon probability of
exactly 50% requires an infinite data set size.) It is not trivial to
develop a methodology to evaluate how reliable our analysis is, but it
can be done. Without beleaguering the issue, here is how are
reliability stacks up for the metrics used in this analysis, where S
is the data set size, and P is the measured Shannon probability:

    non-inflation           NYSE       inflation adjusted        duration of
    adjusted US GDP                      US GDP                 civilizations

    S = 55              S = 46,260,00     S = 55                S = 1600
    P = 0.9297477       P = 0.5170388     P = 0.6299            P = 0.87854
    P' = 0.791092       P' = 0.515843     P' = 0.530797         P' = 0.841083
    x = 85.08674%       x = 99.76872%     x = 84.26687%         x = 95.73645%

Here, P' is the Shannon probability, after being adjusted for a finite
data set size, and x is the ratio of P' to P. Not surprisingly, the
mass of data provided by the NYSE provides the most reliable
probability. Also, not surprisingly, the GDP data is to be considered
the most unreliable, with the historical data being somewhere in
between, (don't forget that we still have to qualify the historical
data as being subject to interpretation.)

Our reliability estimate on these probabilities can only be regarded
as poor to fair, (which was a qualification that was made in the
beginning of the analysis.) However, we can state that there is some
evidence, albeit circumstantial, that we can extrapolate, or scale,
fractal analysis over nine orders of magnitude in the study of
ecology. The evidence, however, can not be regarded as compelling.

        John

--

John Conover, john@email.johncon.com, http://www.johncon.com/
Last modified: Fri Mar 26 18:54:34 PST 1999 $Id: 980116192623.28859.html,v 1.0 2001/11/17 23:05:50 conover Exp $