Hyperoperation

From The Right Wiki
Revision as of 13:45, 28 November 2024 by imported>Chris the speller (replaced: tetration → tetration)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

In mathematics, the hyperoperation sequence[nb 1] is an infinite sequence of arithmetic operations (called hyperoperations in this context)[1][11][13] that starts with a unary operation (the successor function with n = 0). The sequence continues with the binary operations of addition (n = 1), multiplication (n = 2), and exponentiation (n = 3). After that, the sequence proceeds with further binary operations extending beyond exponentiation, using right-associativity. For the operations beyond exponentiation, the nth member of this sequence is named by Reuben Goodstein after the Greek prefix of n suffixed with -ation (such as tetration (n = 4), pentation (n = 5), hexation (n = 6), etc.) [5] and can be written as using n − 2 arrows in Knuth's up-arrow notation. Each hyperoperation may be understood recursively in terms of the previous one by:

a[n]b=a[n1](a[n1](a[n1]([n1](a[n1](a[n1]a)))))b copies of a,n2

It may also be defined according to the recursion rule part of the definition, as in Knuth's up-arrow version of the Ackermann function:

a[n]b=a[n1](a[n](b1)),n1

This can be used to easily show numbers much larger than those which scientific notation can, such as Skewes's number and googolplexplex (e.g. 50[50]50 is much larger than Skewes's number and googolplexplex), but there are some numbers which even they cannot easily show, such as Graham's number and TREE(3).[14] This recursion rule is common to many variants of hyperoperations.

Definition

Definition, most common

The hyperoperation sequence Hn(a,b):(0)30 is the sequence of binary operations Hn:(0)20, defined recursively as follows:

Hn(a,b)=a[n]b={b+1if n=0aif n=1 and b=00if n=2 and b=01if n3 and b=0Hn1(a,Hn(a,b1))otherwise

(Note that for n = 0, the binary operation essentially reduces to a unary operation (successor function) by ignoring the first argument.) For n = 0, 1, 2, 3, this definition reproduces the basic arithmetic operations of successor (which is a unary operation), addition, multiplication, and exponentiation, respectively, as

H0(a,b)=1+b,H1(a,b)=a+b,H2(a,b)=a×b,H3(a,b)=ab=ab.

The Hn operations for n ≥ 3 can be written in Knuth's up-arrow notation. So what will be the next operation after exponentiation? We defined multiplication so that H2(a,3)=a[2]3=a×3=a+a+a, and defined exponentiation so that H3(a,3)=a[3]3=a3=a3=a×a×a, so it seems logical to define the next operation, tetration, so that H4(a,3)=a[4]3=a3=tetration(a,3)=aaa, with a tower of three 'a'. Analogously, the pentation of (a, 3) will be tetration(a, tetration(a, a)), with three "a" in it.

H4(a,b)=ab,H5(a,b)=ab,Hn(a,b)=an2b for n3,

Knuth's notation could be extended to negative indices ≥ −2 in such a way as to agree with the entire hyperoperation sequence, except for the lag in the indexing:

Hn(a,b)=an2b for n0.

The hyperoperations can thus be seen as an answer to the question "what's next" in the sequence: successor, addition, multiplication, exponentiation, and so on. Noting that

a+b=1+(a+(b1))ab=a+(a(b1))ab=a(a(b1))a[4]b=aa[4](b1)

the relationship between basic arithmetic operations is illustrated, allowing the higher operations to be defined naturally as above. The parameters of the hyperoperation hierarchy are sometimes referred to by their analogous exponentiation term; [15] so a is the base, b is the exponent (or hyperexponent),[12] and n is the rank (or grade),[6] and moreover, Hn(a,b) is read as "the bth n-ation of a", e.g. H4(7,9) is read as "the 9th tetration of 7", and H123(456,789) is read as "the 789th 123-ation of 456". In common terms, the hyperoperations are ways of compounding numbers that increase in growth based on the iteration of the previous hyperoperation. The concepts of successor, addition, multiplication and exponentiation are all hyperoperations; the successor operation (producing x + 1 from x) is the most primitive, the addition operator specifies the number of times 1 is to be added to itself to produce a final value, multiplication specifies the number of times a number is to be added to itself, and exponentiation refers to the number of times a number is to be multiplied by itself.

Definition, using iteration

Define iteration of a function f of two variables as

fx(a,b)={f(a,b)if x=1f(a,fx1(a,b))if x>1

The hyperoperation sequence can be defined in terms of iteration, as follows. For all integers x,n,a,b0, define

H0(a,b)=b+1H1(a,0)=aH2(a,0)=0Hn+3(a,0)=1Hn+1(a,b+1)=Hnb+1(a,Hn+1(a,0))Hnx+2(a,b)=Hn(a,Hnx+1(a,b))

As iteration is associative, the last line can be replaced by

Hnx+2(a,b)=Hnx+1(a,Hn(a,b))

Computation

The definitions of the hyperoperation sequence can naturally be transposed to term rewriting systems (TRS).

TRS based on definition sub 1.1

The basic definition of the hyperoperation sequence corresponds with the reduction rules

(r1)H(0,a,b)S(b)(r2)H(S(0),a,0)a(r3)H(S(S(0)),a,0)0(r4)H(S(S(S(n))),a,0)S(0)(r5)H(S(n),a,S(b))H(n,a,H(S(n),a,b))

To compute Hn(a,b) one can use a stack, which initially contains the elements n,a,b. Then, repeatedly until no longer possible, three elements are popped and replaced according to the rules[nb 2]

(r1)0,a,b(b+1)(r2)1,a,0a(r3)2,a,00(r4)(n+3),a,01(r5)(n+1),a,(b+1)n,a,(n+1),a,b

Schematically, starting from n,a,b: WHILE stackLength <> 1 { POP 3 elements; PUSH 1 or 5 elements according to the rules r1, r2, r3, r4, r5; } Example Compute H2(2,2)*4.[16] The reduction sequence is[nb 2][17]

H(S(S(0)),S(S(0)),S(S(0)))_
    r5H(S(0),S(S(0)),H(S(S(0)),S(S(0)),S(0))_)
    r5H(S(0),S(S(0)),H(S(0),S(S(0)),H(S(S(0)),S(S(0)),0)_))
    r3H(S(0),S(S(0)),H(S(0),S(S(0)),0)_)
    r2H(S(0),S(S(0)),S(S(0)))_
    r5H(0,S(S(0)),H(S(0),S(S(0)),S(0))_)
    r5H(0,S(S(0)),H(0,S(S(0)),H(S(0),S(S(0)),0)_))
    r2H(0,S(S(0)),H(0,S(S(0)),S(S(0)))_)
    r1H(0,S(S(0)),S(S(S(0))))_
    r1S(S(S(S(0))))

When implemented using a stack, on input 2,2,2

the stack configurations     represent the equations
2,2,2_ H2(2,2)
    r51,2,2,2,1_     =H1(2,H2(2,1))
    r51,2,1,2,2,2,0_     =H1(2,H1(2,H2(2,0)))
    r31,2,1,2,0_     =H1(2,H1(2,0))
    r21,2,2_     =H1(2,2)
    r50,2,1,2,1_     =H0(2,H1(2,1))
    r50,2,0,2,1,2,0_     =H0(2,H0(2,H1(2,0)))
    r20,2,0,2,2_     =H0(2,H0(2,2))
    r10,2,3_     =H0(2,3)
    r14     =4

TRS based on definition sub 1.2

The definition using iteration leads to a different set of reduction rules

(r6)H(S(0),0,a,b)S(b)(r7)H(S(0),S(0),a,0)a(r8)H(S(0),S(S(0)),a,0)0(r9)H(S(0),S(S(S(n))),a,0)S(0)(r10)H(S(0),S(n),a,S(b))H(S(b),n,a,H(S(0),S(n),a,0))(r11)H(S(S(x)),n,a,b)H(S(0),n,a,H(S(x),n,a,b))

As iteration is associative, instead of rule r11 one can define

(r12)H(S(S(x)),n,a,b)H(S(x),n,a,H(S(0),n,a,b))

Like in the previous section the computation of Hn(a,b)=Hn1(a,b) can be implemented using a stack. Initially the stack contains the four elements 1,n,a,b. Then, until termination, four elements are popped and replaced according to the rules[nb 2]

(r6)1,0,a,b(b+1)(r7)1,1,a,0a(r8)1,2,a,00(r9)1,(n+3),a,01(r10)1,(n+1),a,(b+1)(b+1),n,a,1,(n+1),a,0(r11)(x+2),n,a,b1,n,a,(x+1),n,a,b

Schematically, starting from 1,n,a,b: WHILE stackLength <> 1 { POP 4 elements; PUSH 1 or 7 elements according to the rules r6, r7, r8, r9, r10, r11; } Example Compute H3(0,3)*0. On input 1,3,0,3 the successive stack configurations are

1,3,0,3_r103,2,0,1,3,0,0_r93,2,0,1_r111,2,0,2,2,0,1_r111,2,0,1,2,0,1,2,0,1_r101,2,0,1,2,0,1,1,0,1,2,0,0_r81,2,0,1,2,0,1,1,0,0_r71,2,0,1,2,0,0_r81,2,0,0_r80.

The corresponding equalities are

H3(0,3)=H23(0,H3(0,0))=H23(0,1)=H2(0,H22(0,1))=H2(0,H2(0,H2(0,1))=H2(0,H2(0,H1(0,H2(0,0))))=H2(0,H2(0,H1(0,0)))=H2(0,H2(0,0))=H2(0,0)=0.

When reduction rule r11 is replaced by rule r12, the stack is transformed acoording to

(r12)(x+2),n,a,b(x+1),n,a,1,n,a,b

The successive stack configurations will then be

1,3,0,3_r103,2,0,1,3,0,0_r93,2,0,1_r122,2,0,1,2,0,1_r102,2,0,1,1,0,1,2,0,0_r82,2,0,1,1,0,0_r72,2,0,0_r121,2,0,1,2,0,0_r81,2,0,0_r80

The corresponding equalities are

H3(0,3)=H23(0,H3(0,0))=H23(0,1)=H22(0,H2(0,1))=H22(0,H1(0,H2(0,0)))=H22(0,H1(0,0))=H22(0,0)=H2(0,H2(0,0))=H2(0,0)=0

Remarks

  • H3(0,3)=0 is a special case. See below.[nb 3][nb 4]
  • The computation of Hn(a,b) according to the rules {r6 - r10, r11} is heavily recursive. The culprit is the order in which iteration is executed: Hn(a,b)=H(a,Hn1(a,b)). The first H disappears only after the whole sequence is unfolded. For instance, H4(2,4) converges to 65536 in 2863311767 steps, the maximum depth of recursion[18] is 65534.
  • The computation according to the rules {r6 - r10, r12} is more efficient in that respect. The implementation of iteration Hn(a,b) as Hn1(a,H(a,b)) mimics the repeated execution of a procedure H.[19] The depth of recursion, (n+1), matches the loop nesting. Meyer & Ritchie (1967) formalized this correspondence. The computation of H4(2,4) according to the rules {r6-r10, r12} also needs 2863311767 steps to converge on 65536, but the maximum depth of recursion is only 5, as tetration is the 5th operator in the hyperoperation sequence.
  • The considerations above concern the recursion depth only. Either way of iterating leads to the same number of reduction steps, involving the same rules (when the rules r11 and r12 are considered "the same"). As the example shows the reduction of H3(0,3) converges in 9 steps: 1 X r7, 3 X r8, 1 X r9, 2 X r10, 2 X r11/r12. The modus iterandi only affects the order in which the reduction rules are applied.

Examples

Below is a list of the first seven (0th to 6th) hyperoperations (0⁰ is defined as 1).

n Operation,
Hn(a, b)
Definition Names Domain
0 1+b or a[0]b 1+1+1+1++1+1+1b copies of 1 Increment, successor, zeration, hyper0 Arbitrary
1 a+b or a[1]b a+1+1+1++1+1+1b copies of 1 Addition, hyper1
2 a×b or a[2]b a+a+a++a+a+ab copies of a Multiplication, hyper2
3 ab or a[3]b a×a×a××a×a×ab copies of a Exponentiation, hyper3 b real, with some multivalued extensions to complex numbers
4 ba or a[4]b a[3](a[3](a[3]([3](a[3](a[3]a)))))b copies of a Tetration, hyper4 a ≥ 0 or an integer, b an integer ≥ −1 [nb 5] (with some proposed extensions)
5 ba or a[5]b a[4](a[4](a[4]([4](a[4](a[4]a)))))b copies of a Pentation, hyper5 a, b integers ≥ −1 [nb 5]
6 a[6]b a[5](a[5](a[5]([5](a[5](a[5]a)))))b copies of a Hexation, hyper6

Special cases

Hn(0, b) =

b + 1, when n = 0
b, when n = 1
0, when n = 2
1, when n = 3 and b = 0 [nb 3][nb 4]
0, when n = 3 and b > 0 [nb 3][nb 4]
1, when n > 3 and b is even (including 0)
0, when n > 3 and b is odd

Hn(1, b) =

b, when n = 2
1, when n ≥ 3

Hn(a, 0) =

0, when n = 2
1, when n = 0, or n ≥ 3
a, when n = 1

Hn(a, 1) =

a, when n ≥ 2

Hn(a, a) =

Hn+1(a, 2), when n ≥ 1

Hn(a, −1) =[nb 5]

0, when n = 0, or n ≥ 4
a − 1, when n = 1
a, when n = 2
1/a , when n = 3

Hn(2, 2) =

3, when n = 0
4, when n ≥ 1, easily demonstrable recursively.

History

One of the earliest discussions of hyperoperations was that of Albert Bennett in 1914, who developed some of the theory of commutative hyperoperations (see below).[6] About 12 years later, Wilhelm Ackermann defined the function ϕ(a,b,n), which somewhat resembles the hyperoperation sequence.[20] In his 1947 paper,[5] Reuben Goodstein introduced the specific sequence of operations that are now called hyperoperations, and also suggested the Greek names tetration, pentation, etc., for the extended operations beyond exponentiation (because they correspond to the indices 4, 5, etc.). As a three-argument function, e.g., G(n,a,b)=Hn(a,b), the hyperoperation sequence as a whole is seen to be a version of the original Ackermann function ϕ(a,b,n)recursive but not primitive recursive — as modified by Goodstein to incorporate the primitive successor function together with the other three basic operations of arithmetic (addition, multiplication, exponentiation), and to make a more seamless extension of these beyond exponentiation. The original three-argument Ackermann function ϕ uses the same recursion rule as does Goodstein's version of it (i.e., the hyperoperation sequence), but differs from it in two ways. First, ϕ(a,b,n) defines a sequence of operations starting from addition (n = 0) rather than the successor function, then multiplication (n = 1), exponentiation (n = 2), etc. Secondly, the initial conditions for ϕ result in ϕ(a,b,3)=G(4,a,b+1)=a[4](b+1), thus differing from the hyperoperations beyond exponentiation.[7][21][22] The significance of the b + 1 in the previous expression is that ϕ(a,b,3) = aaa, where b counts the number of operators (exponentiations), rather than counting the number of operands ("a"s) as does the b in a[4]b, and so on for the higher-level operations. (See the Ackermann function article for details.)

Notations

This is a list of notations that have been used for hyperoperations.

Name Notation equivalent to Hn(a,b) Comment
Knuth's up-arrow notation an2b Used by Knuth [23] (for n ≥ 3), and found in several reference books.[24][25]
Hilbert's notation ϕn(a,b) Used by David Hilbert.[26]
Goodstein's notation G(n,a,b) Used by Reuben Goodstein.[5]
Original Ackermann function ϕ(a,b,n1) for 1n3ϕ(a,b1,n1) for n4 Used by Wilhelm Ackermann (for n ≥ 1)[20]
Ackermann–Péter function A(n,b3)+3for a=2 This corresponds to hyperoperations for base 2 (a = 2)
Nambiar's notation an1b Used by Nambiar (for n ≥ 1) [27]
Superscript notation a(n)b Used by Robert Munafo.[21]
Subscript notation (for lower hyperoperations) a(n)b Used for lower hyperoperations by Robert Munafo.[21]
Operator notation (for "extended operations") aOn1b Used for lower hyperoperations by John Doner and Alfred Tarski (for n ≥ 1).[28]
Square bracket notation a[n]b Used in many online forums; convenient for ASCII.
Conway chained arrow notation ab(n2) Used by John Horton Conway (for n ≥ 3)

Variant starting from a

In 1928, Wilhelm Ackermann defined a 3-argument function ϕ(a,b,n) which gradually evolved into a 2-argument function known as the Ackermann function. The original Ackermann function ϕ was less similar to modern hyperoperations, because his initial conditions start with ϕ(a,0,n)=a for all n > 2. Also he assigned addition to n = 0, multiplication to n = 1 and exponentiation to n = 2, so the initial conditions produce very different operations for tetration and beyond.

n Operation Comment
0 F0(a,b)=a+b
1 F1(a,b)=ab
2 F2(a,b)=ab
3 F3(a,b)=a[4](b+1) An offset form of tetration. The iteration of this operation is different than the iteration of tetration.
4 F4(a,b)=(xa[4](x+1))b(a) Not to be confused with pentation.

Another initial condition that has been used is A(0,b)=2b+1 (where the base is constant a=2), due to Rózsa Péter, which does not form a hyperoperation hierarchy.

Variant starting from 0

In 1984, C. W. Clenshaw and F. W. J. Olver began the discussion of using hyperoperations to prevent computer floating-point overflows.[29] Since then, many other authors [30][31][32] have renewed interest in the application of hyperoperations to floating-point representation. (Since Hn(a, b) are all defined for b = -1.) While discussing tetration, Clenshaw et al. assumed the initial condition Fn(a,0)=0, which makes yet another hyperoperation hierarchy. Just like in the previous variant, the fourth operation is very similar to tetration, but offset by one.

n Operation Comment
0 F0(a,b)=b+1
1 F1(a,b)=a+b
2 F2(a,b)=ab=eln(a)+ln(b)
3 F3(a,b)=ab
4 F4(a,b)=a[4](b1) An offset form of tetration. The iteration of this operation is much different than the iteration of tetration.
5 F5(a,b)=(xa[4](x1))b(0)=0 if a>0 Not to be confused with pentation.

Lower hyperoperations

An alternative for these hyperoperations is obtained by evaluation from left to right.[9] Since

a+b=(a+(b1))+1ab=(a(b1))+aab=(a(b1))a

define (with ° or subscript)

a(n+1)b=(a(n+1)(b1))(n)a

with

a(1)b=a+ba(2)0=0a(n)1=afor n>2

This was extended to ordinal numbers by Doner and Tarski,[33] by :

αO0β=α+βαOγβ=sup\limits η<β,ξ<γ(αOγη)Oξα

It follows from Definition 1(i), Corollary 2(ii), and Theorem 9, that, for a ≥ 2 and b ≥ 1, that [original research?]

aOnb=a(n+1)b

But this suffers a kind of collapse, failing to form the "power tower" traditionally expected of hyperoperators:[34][nb 6]

α(4)(1+β)=α(αβ).

If α ≥ 2 and γ ≥ 2,[28][Corollary 33(i)][nb 6]

α(1+2γ+1)βα(1+2γ)(1+3αβ).
n Operation Comment
0 F0(a,b)=a+1 Increment, successor, zeration
1 F1(a,b)=a+b
2 F2(a,b)=ab
3 F3(a,b)=ab
4 F4(a,b)=a(a(b1)) Not to be confused with tetration.
5 F5(a,b)=(xxx(a1))b1(a) Not to be confused with pentation.
Similar to tetration.

Commutative hyperoperations

Commutative hyperoperations were considered by Albert Bennett as early as 1914,[6] which is possibly the earliest remark about any hyperoperation sequence. Commutative hyperoperations are defined by the recursion rule

Fn+1(a,b)=exp(Fn(ln(a),ln(b)))

which is symmetric in a and b, meaning all hyperoperations are commutative. This sequence does not contain exponentiation, and so does not form a hyperoperation hierarchy.

n Operation Comment
0 F0(a,b)=ln(ea+eb) Smooth maximum
1 F1(a,b)=a+b
2 F2(a,b)=ab=eln(a)+ln(b) This is due to the properties of the logarithm.
3 F3(a,b)=aln(b)=eln(a)ln(b) In a finite field, this is the Diffie–Hellman key exchange operation.
4 F4(a,b)=eeln(ln(a))ln(ln(b)) Not to be confused with tetration.

Numeration systems based on the hyperoperation sequence

R. L. Goodstein [5] used the sequence of hyperoperators to create systems of numeration for the nonnegative integers. The so-called complete hereditary representation of integer n, at level k and base b, can be expressed as follows using only the first k hyperoperators and using as digits only 0, 1, ..., b − 1, together with the base b itself:

  • For 0 ≤ nb − 1, n is represented simply by the corresponding digit.
  • For n > b − 1, the representation of n is found recursively, first representing n in the form
b [k] xk [k − 1] xk − 1 [k - 2] ... [2] x2 [1] x1
where xk, ..., x1 are the largest integers satisfying (in turn)
b [k] xkn
b [k] xk [k − 1] xk − 1n
...
b [k] xk [k − 1] xk − 1 [k - 2] ... [2] x2 [1] x1n
Any xi exceeding b − 1 is then re-expressed in the same manner, and so on, repeating this procedure until the resulting form contains only the digits 0, 1, ..., b − 1, together with the base b.

Unnecessary parentheses can be avoided by giving higher-level operators higher precedence in the order of evaluation; thus,

level-1 representations have the form b [1] X, with X also of this form;
level-2 representations have the form b [2] X [1] Y, with X,Y also of this form;
level-3 representations have the form b [3] X [2] Y [1] Z, with X,Y,Z also of this form;
level-4 representations have the form b [4] X [3] Y [2] Z [1] W, with X,Y,Z,W also of this form;

and so on. In this type of base-b hereditary representation, the base itself appears in the expressions, as well as "digits" from the set {0, 1, ..., b − 1}. This compares to ordinary base-2 representation when the latter is written out in terms of the base b; e.g., in ordinary base-2 notation, 6 = (110)2 = 2 [3] 2 [2] 1 [1] 2 [3] 1 [2] 1 [1] 2 [3] 0 [2] 0, whereas the level-3 base-2 hereditary representation is 6 = 2 [3] (2 [3] 1 [2] 1 [1] 0) [2] 1 [1] (2 [3] 1 [2] 1 [1] 0). The hereditary representations can be abbreviated by omitting any instances of [1] 0, [2] 1, [3] 1, [4] 1, etc.; for example, the above level-3 base-2 representation of 6 abbreviates to 2 [3] 2 [1] 2. Examples: The unique base-2 representations of the number 266, at levels 1, 2, 3, 4, and 5 are as follows:

Level 1: 266 = 2 [1] 2 [1] 2 [1] ... [1] 2 (with 133 2s)
Level 2: 266 = 2 [2] (2 [2] (2 [2] (2 [2] 2 [2] 2 [2] 2 [2] 2 [1] 1)) [1] 1)
Level 3: 266 = 2 [3] 2 [3] (2 [1] 1) [1] 2 [3] (2 [1] 1) [1] 2
Level 4: 266 = 2 [4] (2 [1] 1) [3] 2 [1] 2 [4] 2 [2] 2 [1] 2
Level 5: 266 = 2 [5] 2 [4] 2 [1] 2 [5] 2 [2] 2 [1] 2

See also

Notes

  1. Sequences similar to the hyperoperation sequence have historically been referred to by many names, including: the Ackermann function [1] (3-argument), the Ackermann hierarchy,[2] the Grzegorczyk hierarchy[3][4] (which is more general), Goodstein's version of the Ackermann function,[5] operation of the nth grade,[6] z-fold iterated exponentiation of x with y,[7] arrow operations,[8] reihenalgebra[9] and hyper-n.[1][9][10][11][12]
  2. 2.0 2.1 2.2 This implements the leftmost-innermost (one-step) strategy.
  3. 3.0 3.1 3.2 For more details, see Powers of zero.
  4. 4.0 4.1 4.2 For more details, see Zero to the power of zero.
  5. 5.0 5.1 5.2 Let x = a[n](−1). By the recursive formula, a[n]0 = a[n − 1](a[n](−1)) ⇒ 1 = a[n − 1]x. One solution is x = 0, because a[n − 1]0 = 1 by definition when n ≥ 4. This solution is unique because a[n − 1]b > 1 for all a > 1, b > 0 (proof by recursion).
  6. 6.0 6.1 Ordinal addition is not commutative; see ordinal arithmetic for more information

References

Bibliography