On Recursive Functions

In this article, we’ll explore one of the most fascinating concepts in computer science, namely the Y combinator. It can simulate recursion in a language that doesn’t support it.

We’re going to use the Factorial function as an example. Factorial gives us the product of an integer and all the integers below it. For example \(4! = 4*3*2*1 = 24\). In JavaScript, we can implement it as follows:

const fact = n => {
    if (n === 0) {
        return 1;
    }
    return n * fact(n - 1);
}

So fact(4) will result in the following computation:

fact(4)
4 * fact(3)
4 * (3 * fact(2))
4 * (3 * (2 * fact(1)))
4 * (3 * (2 * (1 * fact(0))))
4 * (3 * (2 * (1 * 1)))
4 * (3 * (2 * 1))
4 * (3 * 2)
4 * 6
24

What if our language does not support recursion. That means we’re not allowed to call fact within itself. Actually, let’s make it even more challenging - our language supports only function definition and function application. No goto or any kind of looping constructs whatsoever. Also, our functions are allowed to take exactly one argument; no more, no less. How would we solve this problem?

The Lambda Calculus

We’re going to construct our solution with the means of the lambda calculus and will implement its equivalent in JavaScript.
Lambda calculus is a minimalistic, Turing-complete language, powerful enough to express any kind of computation that can be performed by a modern-day computer language. A detailed description of lambda calculus is outside the scope of this article. I hope that the examples should be intuitive enough to understand, even if you’re not familiar with it. For those of you who want to dive deeper, I’ve provided some resources in the references section of this article.

In λ-calculus we have two basic operations:

For example, let’s define the Identity function:
\[I \equiv \lambda x.x \]

Which in JavaScript we can define like:

const id = x => x;

It returns the argument that it has been provided. To pass an argument to the function, we can simply write
\[ I \space N \] or
\[ (\lambda x.x)N \] In λ-calculus, we refer to \(I\) as a combinator. The Identity combinator has a simple reduction sequence

\[ (\lambda x.x)N \] \[ \to (\lambda x.x)[x \mapsto N] \] \[ \to N \]

We also say that the aforementioned λ-term β-reduces to \(N\) or that \(N\) is the normal form of the term. A term is in its normal form when no more reductions can be applied. The process is called reduction because it gets rid of an application. On the second line of the reduction using \([x \mapsto N]\) we denote that \(x\) is being substituted with \(N\).

The \(\Omega\)-combinator

Are there λ-terms without a normal form? The answer is - yes and this is a crucial part of constructing our solution. Let’s take a look at the following combinator:

\[\omega := \lambda x.xx\]

It takes a function and applies it to itself. We can express it in JavaScript in the following way:

const w = x => x(x);

We can now define:

\[ \Omega := \omega \omega \equiv (\lambda x.x x)(\lambda x.x x) \]

const W = w(w); // (x => x(x))(x => x(x))

So if we try reducing \(\Omega\) we’ll end up in an infinite reduction sequence because the second \(\omega\) substitutes the \(x\) in the first term so after each reduction we’ll get the same term over and over again.

\[ (\lambda x.x x)(\lambda x.x x) \] \[ \to (\lambda x.x x)[x \mapsto (\lambda x.x x)] \] \[ \to (\lambda x.x x)(\lambda x.x x) \] \[ \to (\lambda x.x x)[x \mapsto (\lambda x.x x)] \] \[ \to (\lambda x.x x)(\lambda x.x x) \to … \]

This construction is useful because it encodes an infinite loop.

The Y Combinator

So to simulate recursion, we are looking for a combinator that, given an argument some function \(F\), would not only reproduce itself but also pass \(F\) on itself. We already saw the self-reproducing term \(\Omega\) so using it as our basis we can define:

\[ \omega_F := \lambda x.F(x x) \]

const wF = x => F(x(x));

The difference here is that we pass \(x\) to itself and the result, we pass to some function \(F\). We can now define \(Y\):

\[ Y_F := \omega_F \omega_F \equiv (\lambda x.F(x x))(\lambda x.F(x x)) \]

or in more general terms:

\[ Y := \lambda f. (\lambda x.f(x x))(\lambda x.f(x x)) \]

Let’s implement the Y combinator in JavaScript:

const Y = f => {
    const g = x => f(x(x));
    return g(g);
};

So if we pass a function \(F\) to the Y combinator, we’re going to end up with the following reduction sequence:

\[ Y F \] \[ \equiv (\lambda f. (\lambda x.f(x x))(\lambda x.f(x x))) \space F \] \[ \to (\lambda f. (\lambda x.f(x x))(\lambda x.f(x x)))[f \mapsto F] \] \[ (\lambda x.F(x x))(\lambda x.F(x x))\] \[ \to (\lambda x.F(x x))[x \mapsto \lambda x.F(x x)]\] \[ \to F((\lambda x.F(x x))(\lambda x.F(x x)))\] \[ \to F((\lambda x.F(x x))[x \mapsto \lambda x.F(x x)])\] \[ \to F(F((\lambda x.F(x x))(\lambda x.F(x x)))) \to … \]

We can see where this is going.

The Fixed-Point Theorem

The fixed point of a function \(F\) is some value \(x\) such that \(F(x) = x\). That is when applied to a function, it returns the same value. Let’s see the following example:

\[ f(x) = x^2 - 3x + 4 \] \[ f(2) = 2^2 - 3*2 + 4 = 2 \]

In this case, \(2\) is a fixed point of \(f\). The fixed-point theorem states that each function has at least one such value.

Y is indeed a fixed-point combinator meaning that when applied to an arbitrary function \(F\) we get the same result as applying \(F\) to the result of Y applied to \(F\). That is, we treat \(Y F\) as an input to \(F\).

\[ Y F = F(Y F) \]

Factorial in λ-calculus

Let’s assume that we have already defined the following functions:

So if λ-calculus supported recursion, we’d write Factorial in the following way:

\[fact := \lambda n.(if \space (iszero \space n) \space 1 \space (mult \space n \space (fact \space (pred \space n)))) \]

This is the same as the JavaScript definition that we made above. Remember though - in λ-calculus we cannot make recursive calls. So the definition above is not valid. That’s where we’re going to apply the Y combinator, therefore, we have to define our \(F\) by slightly tweaking the above definition:

\[ factStep := \lambda f.\lambda n. \space (iszero \space n) \space 1 \space (mult \space n \space (f \space (pred \space n))) \]

So we’ve wrapped the Factorial definition inside a function that takes a function \(f\) as an argument and returns a new function which calls \(f\) in one of its execution branches. We can think of \(factStep\) as a factory - a function that returns another function.

In our case \(factStep\), plays the role of a pseudo-recursive function that is going to be passed and invoked over and over again until we reach the bottom of our pseudo-recursion. If we think in terms of iteration \(factStep\) is the body of the loop.

In JavaScript it’s defined in the following way:

const factStep = nextStep /* f */ => {
    return n => {
        if (n === 0) {
            return 1;
        }
        return n * nextStep(n - 1);
    }
}

We cannot reference a function from itself, that’s why it makes sense to pass it to itself. Now we are ready to define Factorial using the Y combinator. Let’s compute Factorial of 2 by going through its reduction sequence.

The sequence might seem rather tedious but by going through it one line at a time I hope it should be easy to understand. I’d also suggest writing it down, as that helped me a grasp the concept when I was learning about it. I’ve put parentheses when the precedence of the operations is not obvious. The application in λ-calculus is left associative so that the expression: \(M \space N \space P \) is equivalent to \( ((M \space N) \space P) \) meaning that \(N\) will be passed to \(M\) and \(P\) will be passed to the result of \(M \space N\).

\[ Y \space factStep \space 2 \] \[ \to factStep \space (Y \space factStep) \space 2 \] \[ \equiv (\lambda f.\lambda n. \space (iszero \space n) \space 1 \space (mult \space n \space (f \space (pred \space n)))) \space (Y \space factStep) \space 2 \] \[ \to (\lambda f.\lambda n. \space (iszero \space n) \space 1 \space (mult \space n \space (f \space (pred \space n)))) \space [f \mapsto (Y \space factStep)] \space 2 \] \[ \to \lambda n. \space (iszero \space n) \space 1 \space (mult \space n \space ((Y \space factStep) \space (pred \space n))) \space 2 \] \[ \to \lambda n. \space (iszero \space n) \space 1 \space (mult \space n \space ((Y \space factStep) \space (pred \space n))) \space [ n\mapsto 2 ] \] \[ \to (iszero \space 2) \space 1 \space (mult \space 2 \space ((Y \space factStep)\space (pred \space 2))) \] \[ \to mult \space 2 \space ((Y \space factStep)\space 1) \] \[ \to mult \space 2 \space ((factStep \space (Y \space factStep)) \space 1) \] \[ \equiv mult \space 2 \space (((\lambda f.\lambda n. \space (iszero \space n) \space 1 \space (mult \space n \space (f \space (pred \space n)))) \space (Y \space factStep)) \space 1) \] \[ \to mult \space 2 \space (((\lambda f.\lambda n. \space (iszero \space n) \space 1 \space (mult \space n \space (f \space (pred \space n)))) \space [f \mapsto (Y \space factStep)] \space) \space 1) \] \[ \to mult \space 2 \space (\lambda n. \space (iszero \space n) \space 1 \space (mult \space n \space ((Y \space factStep) \space (pred \space n)))) \space 1 \] \[ \to mult \space 2 \space (\lambda n. \space (iszero \space n) \space 1 \space (mult \space n \space ((Y \space factStep) \space (pred \space n)))) [n \mapsto 1] \] \[ \to mult \space 2 \space ((iszero \space 1) \space 1 \space (mult \space 1 \space ((Y \space factStep) \space (pred \space 1)))) \] \[ \to mult \space 2 \space (mult \space 1 \space ((Y \space factStep) \space 0)) \] \[ … \to mult \space 2 \space (mult \space 1 1)\] \[ \to mult \space 2 \space 1\] \[ \to 2 \]

Essentially, by passing \(factStep\) to itself, we allow it to call itself in a recursive manner. Keep in mind that \(factStep\) is only an intermediate value - it is not the recursive function itself, as it still needs a reference to itself to do the recursion.

Evaluation Strategies

Now we can put it all together:

const fact = Y(factStep);

But not so fast! The code above will result in:

RangeError: Maximum call stack size exceeded

This is due to the fact that JavaScript and Lambda Calculus have different models of evaluation. In fact, JavaScript uses applicative order (call by value) evaluation which means that the function’s arguments are evaluated before calling the function. This causes the Y combinator to expand infinitely (I found this out the hard way 🤦‍♂️ ). Let’s recall:

\[ Y \space factStep \space 2 \] \[ \to factStep \space (Y \space factStep) \space 2 \]

So instead of passing \((Y \space factStep)\) to \(factStep\) as it is, JavaScript would try to evaluate it first which would lead to an infinite expansion, therefore, causing stack overflow.

\[ \to factStep \space (factStep \space … \space (factStep \space (Y \space factStep))) \space 2 \]

So the Y combinator is suited for languages with normal order (call by name) evaluation. Normal order evaluates the function first, before evaluating its arguments. In order to adapt it for an applicative order language we have to do a slight modification to our fixed-point combinator:

- λf.(λx.f (x x)) (λx.f (x x))
+ λf.(λx.λy.f (x x) y) (λx.λy.f (x x) y)
function Y1(f) {
    const g = x => {
-       return f(x(x));
+       return y => f(x(x))(y);
    };
    return g(g);
}

We implement “laziness” by wrapping the function call into another function. A function definition is an evaluated term for JavaScript so it won’t be reduced further. Now we are ready to implement Factorial:

const fact = Y1(factStep)
fact(5); // 120

Conclusion

There’re other fixed-point combinators like the Z combinator which is suited for call-by-value languages:

\[ Z := \lambda f.(\lambda x.f \space (\lambda v.((x \space x) \space v))) \space (\lambda x.f \space (\lambda v.((x \space x) \space v))) \]

If you want to play some more with this stuff, I’d recommend implementing the Z combinator and writing down its reduction sequence. You can also try defining recursive functions with multiple arguments, for example, the Ackermann function.

Further Reading and References

comments powered by Disqus