Fast recursive lambda functions in C++.

Or Y combinators matter.

Jan 07, 2023

As soon as C++ got lambdas, developers started using and abusing them to make code more concise and better encapsulated. Since lambdas come from lambda calculus and functional programming languages, where recursion is very common, it’s tempting to use C++ lambdas for recursive functions as well. The natural first attempt at implementing factorial, a lambda that simply calls itself by name, unfortunately results in a compiler error.

After some web searching, developers usually end up with a second attempt that uses std::function.
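The two attempts can be sketched roughly as follows. This is a reconstruction rather than the original snippets: the function name fact_via_std_function is made up for illustration, the failing first attempt is left in a comment so the rest compiles, and the base case uses k <= 1 so that an input of 0 also terminates.

```cpp
#include <functional>

int fact_via_std_function(int n) {
    // First attempt (does not compile): `fact` is declared with `auto`, so its
    // type is still being deduced when the lambda body refers to it.
    //   auto fact = [&](int k) { return k <= 1 ? 1 : k * fact(k - 1); };

    // Second attempt: spelling out the type as std::function breaks the cycle,
    // because the variable's type no longer depends on the lambda itself.
    std::function<int(int)> fact = [&](int k) -> int {
        return k <= 1 ? 1 : k * fact(k - 1);
    };
    return fact(n);
}
```

The lambda captures fact by reference, so it must not outlive the enclosing function.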

It seems OK, but what about the known issues with std::function, some of which were mentioned in the Software Bits Newsletter post “Use std::function not.” by Taras Tsugrii?

Well, the generated assembly looks somewhat suspicious, but before we come to any conclusions it would be nice to have something working to compare it with. Is there a way to have a recursive implementation based on pure lambdas? Yes, and that’s where Y combinators come into the picture. I’ll spare you the details, which are easy to find, and go directly to the code.
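A minimal sketch of the idea, assuming a generic lambda that receives itself as an extra argument. The name fact_self_passing is hypothetical, and the base case is written as k <= 1 so that 0 also terminates.

```cpp
int fact_self_passing(int n) {
    // `self` is the lambda itself, passed in by the caller. The lambda has no
    // captures and never refers to the variable `fact` from inside its body.
    // The explicit `-> int` avoids relying on return type deduction from a
    // body that calls itself.
    auto fact = [](int k, auto self) -> int {
        return k <= 1 ? 1 : k * self(k - 1, self);
    };
    return fact(n, fact);
}
```

Because the call target is a concrete closure type rather than a type-erased wrapper, the compiler can see through every recursive call.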

It looks a little strange since we are passing the lambda as a parameter to itself, but this way the function can call itself recursively. Its assembly looks a little scary, but when SIMD registers show up in the assembly, it’s usually a good sign that the compiler was able to vectorize some of the computation.

At this point we can spend a lot of time analyzing the generated assembly and predicting the potential performance impact, or we can run some benchmarks.


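	#include <functional>

	#include <benchmark/benchmark.h>

	// Note: the product overflows int long before n reaches 1000, so the
	// numeric result is meaningless; the benchmark is only meant to compare
	// the cost of the recursive calls.

	// Recursion through std::function: each call typically goes through a
	// type-erased, indirect call.
	int fact_std_func(int n) {
	    std::function<int(int)> fact = [&](auto n) {
	        if (n == 1) return 1;
	        return n * fact(n - 1);
	    };
	    return fact(n);
	}

	// Recursion through a lambda that receives itself as a parameter.
	int fact_comb(int n) {
	    auto fact = [](auto n, auto self) -> int {
	        if (n == 1) return 1;
	        return n * self(n - 1, self);
	    };
	    return fact(n, fact);
	}

	const int n = 1000;

	static void FactStdFunc(benchmark::State& state) {
	    for (auto _ : state) {
	        benchmark::DoNotOptimize(fact_std_func(n));
	    }
	}
	BENCHMARK(FactStdFunc);

	static void FactComb(benchmark::State& state) {
	    for (auto _ : state) {
	        benchmark::DoNotOptimize(fact_comb(n));
	    }
	}
	BENCHMARK(FactComb);

	BENCHMARK_MAIN();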


The results look unambiguous: a 13x runtime difference between the std::function version and the Y combinator version.

This is yet another example of how useful Y combinators are, and a reason why there is even a proposal to add a Y combinator to the standard library. Until then, you know what to do.
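For illustration, a reusable wrapper in the spirit of such a library facility could look like the sketch below. The y_combinator class here is a hypothetical helper written for this post, not the proposed standard API, and it assumes C++17 for class template argument deduction.

```cpp
#include <utility>

// Hypothetical helper: stores a callable and passes itself as the first
// argument on every call, so the callable can recurse through `self`.
template <class F>
class y_combinator {
public:
    explicit y_combinator(F f) : f_(std::move(f)) {}

    template <class... Args>
    decltype(auto) operator()(Args&&... args) const {
        return f_(*this, std::forward<Args>(args)...);
    }

private:
    F f_;
};

int fact_y(int n) {
    // The lambda no longer has to be passed to itself at each call site;
    // the wrapper supplies `self`.
    auto fact = y_combinator([](const auto& self, int k) -> int {
        return k <= 1 ? 1 : k * self(k - 1);
    });
    return fact(n);
}
```

Compared with the manual version, the only change at the call site is that fact(n) replaces fact(n, fact); the closure type is still fully visible to the compiler.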