Recursive Functions > History of the Ackermann and Péter functions (Stanford Encyclopedia of Philosophy)

Supplement to Recursive Functions

History of the Ackermann and Péter functions

Textbook presentations sometimes suggest that the functions which now bear the names of Wilhelm Ackermann and Rózsa Péter were directly presented as examples of intuitively computable functions which are not primitive recursive.^[34] Such an understanding overlooks the context of the papers Ackermann 1928b and Péter 1935 in which these functions were introduced as well as the subsequent work on forms of recursion which they inspired. The goal of this supplement is to provide a brief account of these developments which, while significant in their own right, lie off the main branch of computability theory after the 1930s.

The first published presentation of a function similar to what we called the Péter function in the main text appears to have been given by Hilbert in his well-known address “On the infinite” (1926). This corresponds to the text of a lecture he delivered in Münster in 1925 which in turn draws on a lecture course (1925) of the same name he delivered in Göttingen in 1924–1925. The published address is often cited as an overview of Hilbert’s finitary standpoint and goal of proving the consistency of infinitary mathematics by finite means. A more extensive treatment was later provided in Grundlagen der Mathematik (1934: §2), where Hilbert and Bernay stress the finitary character of primitive recursive definitions.

Recursion is also an important topic in (1926) and Hilbert’s treatment makes clear that he and his collaborators had taken substantial steps towards developing a general theory of recursive definability. But in neither this address nor the lectures on which it is based does he directly suggest that primitive recursion can be used to characterize finitary mathematics. Rather, Hilbert simply assumes that functions defined by primitive recursion will be regarded as finitary in the course of suggesting that such definitions may play a role in addressing questions from transfinite set theory. In particular, Hilbert’s original introduction (under a different description) of what is now called “the Ackermann function” occurs within the context of his attempted proof of the Continuum Hypothesis (CH).^[35]

In this setting Hilbert begins by distinguishing what he calls ordinary recursion from transfinite recursion. Ordinary recursion is characterized informally as the means by which

a function of a number-theoretic variable $n$ is defined when we indicate what value it has for $n = 0$ and how the value for $n + 1$ is obtained from that for $n$ . (1926 [1967: 386])

As the examples he provides make clear, this form subsumes what we now call primitive recursion.^[36] Hilbert then characterizes transfinite recursion as generalizing the principle “that the value of the function for a value of the variable is determined by the preceding values of the function” from recursion on the natural numbers to ordinals in Cantor’s second number class.

Hilbert next discusses recursion at higher types and it is within this context that a description of the Ackermann function first appears. Hilbert begins by introduces a hierarchy of what Ackermann would later call function types: type 0 is taken to be that of natural numbers, type 1 that of functions from natural numbers to natural numbers, type 2 is that of functions from type 1 functions to type 1 functions, etc. Hilbert next reprises the definitions of type 1 functions denoted in the main text by $α_{1} (x, y)$ (addition), $α_{2} (x, y)$ (multiplication), $α_{3} (x, y)$ (exponentiation), $\dots$ and observes that they may all be defined by ordinary recursion. He next notes that the uniformly defined function $α_{n} (x, y)$ —wherein the position $n$ in the sequence is now itself understood as a variable—cannot be defined by in this manner. On the other hand, he also provides a definition similar to ( $1$ ) below which shows how this function can be defined by recursion on a type 1 variable. Part of Hilbert’s approach to proving CH relied on his characterization of the continuum as class of functions of type $N \to N$ obtainable by recursion at higher types. Since $α_{x} (x, x)$ can be regarded as a function of a single numerical variable $x$ , the original role of this example was thus to illustrate that this class is not exhausted by functions definable by ordinary recursion.

Already at this time Hilbert attributed this result to Ackermann.^[37] However a precise statement and proof would only appear three years later in Ackermann’s paper “On Hilbert’s construction of the real numbers” (1928b). Ackermann (1928b [1967: 495]) begins by presenting a notational variant of Hilbert’s example which takes the following form:

\begin{aligned} (1) & φ (x, y, 0) & = x + y \\ φ (x, y, n + 1) & = ρ_{z} (φ (x, z, n), α (x, n), y) \end{aligned}

where

\begin{aligned} (2) & ρ_{z} (f (z), v, 0) & = v \\ ρ_{z} (f (z), v, n + 1) & = f (ρ_{z} (f (z), v, n)) \end{aligned}

and

\begin{matrix} (3) & α (x, n) = {\begin{cases} 0 & if n = 0 \\ 1 & if n = 1 \\ x & if n ⩾ 2 \end{cases} \end{matrix}

The equations ( $1$ ) define $φ (x, y, z)$ by primitive recursion relative to the auxiliary function $ρ_{z}$ . Note, however, that ( $2$ ) defines $ρ_{z}$ not by primitive ordinary recursion but rather by recursion on a type 1 function. In particular, $ρ_{z} (f (z), v, n)$ denotes the n-fold iteration of the function $f$ in which the variable $z$ is replaced with the argument $v$ , i.e.,

ρ_{z} (f (z), v, n) = \underset{n times}{\underset{⏟}{f (\dots f}} (v) \dots)

In Ackermann’s notation, the subscript $z$ in $ρ_{z}$ thus plays the role of an abstraction operator so that its first argument would now be more conventionally denoted by $λ z . f (z)$ . $ρ_{z}$ is thus itself a type 2 function—i.e., it has type $(N \to N) \to (N \to (N \to N))$ .

Ackermann next notes that the functions defined by $φ (0, x, y), φ (1, x, y), φ (2, x, y), \dots$ do indeed give the sequence $α_{1} (x, y), α_{2} (x, y), α_{3} (x, y), \dots$ described by Hilbert—e.g.,

\begin{matrix} (i) & φ (x, y, 0) = x + y \end{matrix}

\begin{matrix} (ii) & \begin{aligned} φ (x, y, 1) & = ρ_{z} (φ (x, z, 0), α (x, 0), y) \\ = ρ_{z} (φ (x, z, 0), 0, y) \\ = ρ_{z} (x + z, 0, y) \\ = x + ρ_{z} (x + z, 0, y - 1) \\ = x + \underset{y - 1 times}{\underset{⏟}{(x + (\dots + (x}} + ρ_{z} (x + z, 0, 0)) \dots)) \\ = \underset{y times}{\underset{⏟}{x + (x + (\dots + (x}} + 0)) \dots)) \\ = \underset{y times}{\underset{⏟}{x + x + \dots + x}} \\ = x \times y \end{aligned} \end{matrix}

\begin{matrix} (iii) & \begin{aligned} φ (x, y, 2) & = ρ_{z} (φ (x, z, 1), α (x, 1), y) \\ = ρ_{z} (φ (x, z, 1), 1, y) \\ = ρ_{z} (x \times z, 1, y) \\ = x \times ρ_{z} (x \times z, 1, y - 1) \\ = x \times \underset{y - 1 times}{\underset{⏟}{(x \times (\dots \times (x}} \times ρ_{z} (x \times z, 1, 0)) \dots)) \\ = \underset{y times}{\underset{⏟}{x \times (x \times (\dots \times (x}} \times 1)) \dots)) \\ = \underset{y times}{\underset{⏟}{x \times x \times \dots \times x}} \\ = x^{y} . \end{aligned} \end{matrix}

The goal of Ackermann’s paper was to demonstrate that the function $φ (x, x, x)$ —i.e., that obtained by taking $x = y = z$ in ( $1$ ) which is itself of type 1—cannot be defined by ordinary recursion. This is accomplished by showing that $φ (x, x, x)$ grows more rapidly than any function defined by ordinary recursion—i.e., that for any such function $ψ (x)$ , there exists $n_{0}$ such that for all $n > n_{0}$ , $φ (n, n, n) > ψ (n)$ .

In order to do so, Ackermann shows that although Hilbert originally defined $φ (x, y, z)$ by higher-order recursion using the functional $ρ_{z}$ , the same function in extension can be defined by a form of recursion directly on the natural numbers which is more complex than ordinary recursion. In particular, the first two lemmas of Ackermann (1928b) jointly entail that this function is also given by the following definition:

\begin{aligned} (4) & φ (x, y, 0) & = x + y \\ φ (x, 0, n + 1) & = α (x, n) \\ φ (x, y + 1, n + 1) & = φ (x, φ (x, y, n + 1), n) \end{aligned}

It is this definition which many subsequent authors identify as giving the Ackermann function.

While the arguments on the righthand size of ( $4$ ) are natural numbers, this definition does not conform to the primitive recursion scheme as the value of $φ (x, y + 1, n + 1)$ is defined relative to its prior values at both $y$ and $n$ . In light of this, Ackermann originally referred to this as an instance of simultaneous recursion. His original argument that $φ (x, y, z)$ is not primitive recursive was somewhat involved. As such, subsequent presentations often demonstrate the existence of a function definable by simultaneous but not primitive recursion via the simpler example of the binary Péter function $π (x, y)$ .

Recall that as presented in the main text this is given by

\begin{aligned} (5) & π (0, y) & = y + 1 \\ π (x + 1, 0) & = π (x, 1) \\ π (x + 1, y + 1) & = π (x, π (x + 1, y)) \end{aligned}

This definition was presented by Péter (1935) (see also 1967: 9) and bears a similar relation to the function $β (i)$ —which was defined in the main text in terms of the functional $Iter$ —as does ( $4$ ) to ( $1$ ). Relative to the terminology she subsequently introduced, the definition ( $5$ ) is an instance of nested double recursion—i.e., it exemplifies two separate generalizations of primitive recursion which are both subsumed under Ackermann’s original term “simultaneous recursion”.

Double recursion is in turn an instance of multiple or k-recursion which was considered in the general case by Péter (1937). For double recursion (i.e., $k = 2$ ), the case for binary functions is given by the scheme

\begin{aligned} (6) & δ (0, y) & = α_{1} (y) \\ δ (x + 1, 0) & = α_{2} (x, δ (x, k))) \\ δ (x + 1, y + 1) & = β (x, y, δ (x, γ (x, y)), δ (x + 1, y)) \end{aligned}

where $α_{1}, α_{2}$ and $γ$ are previously given functions and $k$ is fixed. In order to calculate the value $δ$ for the arguments $x + 1, y + 1$ we must thus first obtain its value for the pairs $⟨ x, γ (x, y) ⟩$ and also $⟨ x + 1, y ⟩$ . Note, however, that both of these pairs come before $⟨ x + 1, y + 1 ⟩$ in the lexicographical ordering $≺$ of $N \times N$ —i.e., $⟨ x_{1}, y_{1} ⟩ ≺ ⟨ x_{2}, y_{2} ⟩$ iff $x_{1} \leq x_{2}$ or $x_{1} = x_{2}$ and $y_{1} \leq y_{2}$ . Thus under the assumption that we can compute the value of $γ (x, y)$ , ( $6$ ) can still be understood as specifying a finite inductive procedure for calculating the values of $δ$ .^[38] And in fact Péter (1937) showed that the generalization of ( $6$ ) to multiple recursion on $k$ variables with parameters does not lead outside the class of primitive recursive functions by showing that this procedure can be reduced to course of values recursion (see also 1967: 6).

In order to understand the generalization involved with nested recursion, ( $5$ ) can be compared directly with ( $6$ ). The former can be considered as a case of the latter in which $β$ is the projection function which returns its third argument $δ (x, γ (x, y))$ . In the case of ( $6$ ) we assume that $γ (x, y)$ is a given function already known to be in the class under consideration. But in the case of ( $5$ ), $γ (x, y)$ corresponds to $π (x + 1, y)$ —i.e., a substitution instance of the function which is itself being defined. It is in this sense which the definition exemplified by ( $5$ ) is said to involve a form of recursion which is “nested” (eingeschachtelte). For although a calculation carried out by applying the clauses of ( $5$ ) also follows the lexicographic order, when the first argument of $π$ decreases, the value of the second argument required to calculate $π (x + 1, y + 1)$ is not determined independently of the specification of function $π (x, y)$ itself. It is in this sense in which the definition of the Ackermann and Péter functions have subsequently been said to exhibit a form impredicativity which makes them inaccessible from Hilbert’s finitary standpoint (see, e.g., Tait 1961, 1968, 1981).^[39]

Nested recursion was introduced by Péter (1935) and is considered in relation to k- and transfinite recursion in Tait (1961), Péter (1967), and Rose (1984). The definitions ( $4$ ) and ( $5$ ) can be understood to exemplify the following general scheme of double nested recursion formulated by Epstein and Carnielli (2008: 117):

\begin{aligned} (7) & f (0, y) & = g (y) \\ f (x + 1, 0) & = j (x, f (x, k)) \\ f (x + 1, y + 1) & = h (x, y, f (x + 1, y), f (x, j (x, y, f (x + 1, y)))) \end{aligned}

for a fixed but arbitrary $k$ . But again this scheme on its own does not necessarily lead out of the class of primitive recursive functions. In fact Péter (1956) shows that if $g$ , $h$ and $j$ are primitive recursive, and $f (x, u) \leq r (x, u)$ where $r$ is also primitive recursive and $u$ is calculated by means of $j (x, y, f (x + 1, y)$ ), then $f$ is also primitive recursive (see also 1967: 10).

Open access to the SEP is made possible by a world-wide funding initiative.
The Encyclopedia Now Needs Your Support
Please Read How You Can Help Keep the Encyclopedia Free

Supplement to Recursive Functions

History of the Ackermann and Péter functions

Browse

About

Support SEP

Mirror Sites