This is the middle part of a three-part article describing the approach that Richard Bird and I took to using functional programming for algorithm design in our recent book Algorithm Design with Haskell.
Part 0 contrasted algorithm development in a functional style with the corresponding activity using an imperative language. In this part, we see how the functional approach can be applied to the class of greedy algorithms: these can be used when the characteristics of a problem allow a globally optimal solution to be reached by making a series of locally optimal steps. Greedy algorithms are often very simple, but the proof that they do indeed yield a globally optimal solution can be surprisingly tricky. The final Part 2 presents the main novelty of the book: a step outside pure FP to admit careful and limited use of nondeterministic functions, which turn out to be necessary for the typical situation when the solution ordering being optimized admits ties.
Greedy algorithms
Let’s look at greedy algorithms as a more interesting example. We’ll consider algorithms that construct a single optimal candidate out of a collection of components. Instances include Huffman coding (a shortest encoding constructed from given symbols), line breaking (a least wasteful paragraph constructed from given words), minimum-cost spanning tree again (a lightest spanning tree constructed from given edges), and making change (a smallest handful of coins constructed from given denominations).
The greedy approach assembles an optimal candidate step by step from the given components, maintaining just a single partial candidate throughout: “think globally, act locally”. This generally leads to a simple algorithm, but often one with a tricky proof of correctness.
The problem can be specified as computing a minimum-cost candidate from given components:
Here, selects from a non-empty list the (leftmost) minimal element according to a given cost function:
( is a standard Haskell function to aggregate a non-empty list). We will suppose that
can be written as an instance of
:
constructing a finite non-empty list of candidates, starting with a trivial initial candidate , and using a function
that extends a single candidate by a single component in all possible ways. (The book also considers problems where
uses a different recursion pattern.)
For example, consider sorting a string: the components are characters, and candidates are strings, permutations of those components. The trivial initial candidate is the empty string, and is extended by each component in turn in all possible ways:
So . The cost to be minimized is the inversion count, the number of out-of-order pairs in the permutation:
Calculating the greedy algorithm
We can calculate a greedy algorithm for computing a minimum-cost candidate by fusing selection with generation
, so that we maintain only a single candidate at each step. Since the generation process is an instance of
, we use the fusion law:
Now is
,
is
,
is
,
is
, and we have to come up with a function
, the “greedy step”, to play the role of
—that is, to satisfy
We calculate:
The first step is just to unfold the definition of the function . The second step is a “bookkeeping law”: to aggregate a collection of collections, either combine them into one big collection to aggregate, or aggregate each subcollection and aggregate the intermediate results. The third step fuses two consecutive maps into one. The fourth step simply names the subterm
. The fifth step
is wishful thinking: that taking the best final candidate obtained by making a greedy step from each of a collection of initial candidates gives the same result as taking a greedy step from the single best initial candidate. Call this wishful thought the greedy condition.
The point is that the greedy condition alone suffices to satisfy the premises of the fusion law; every other step in the calculation is universally valid. If the greedy condition is satisfied, then we have the greedy algorithm
for finding the minimum-cost candidate. In words, from the empty candidate , make a single greedy step with each component
in turn. A single greedy step consists of considering each possible way of extending the current candidate, and taking the (locally) best of them.
Greedy sorting
So, does the greedy condition hold for sorting? Sadly, it does not. Consider the two partial candidates “eabc” and “cbae” (in that order), each with inversion count 3, and suppose that the next component is ‘d’. Because these two candidates tie on inversion count, it is an arbitrary choice which is ‘minimal’; our happens to pick the leftmost minimal element, namely “eabc”. But the best extension of “eabc” with ‘d’ is “eabcd”, with inversion count 4, whereas the best extension of “cbae” with ‘d’ is “cbade”, with inversion count 3. The single best initial candidate “eabc” does not lead to the best final candidate.
The essence of the obstacle is that ordering by inversion count is not linear, because not antisymmetric: two different permutations can have the same inversion count. I can think of three possible fixes.
The first fix is that sometimes this kind of tie can’t happen in context. In particular, for sorting, there is always a unique minimal-cost permutation, namely the sorted one (at least, it is unique when the elements themselves are linearly ordered). Although “eabc” and “cbae” are tied, they and all other permutations lose to “abce”. One can refine the calculation of the greedy algorithm to take this context into account, which will overcome the obstacle for sorting. However, this fix does not work for all greedy algorithms.
The second fix is to change the cost function. In the case of sorting, we can switch from to
—no-one said that the ‘cost’ had to be a number. This fixes the issue at a single stroke! The problem is then to find the lexically least permutation (namely, the sorted one), which will also be the one with minimal inversion count (namely, zero). In general, the ordering is refined to a linear order, often by adding dimensions to the cost function. However, I think this is really cheating: prejudicing the specification in light of the intended implementation.
The third fix, and the one taken forward in the book, is to allow nondeterministic functions in a few, carefully controlled places. Specifically, when there is a tie, we consider all optimal partial candidates, not an arbitrary single one of them. This third fix is explained in Part 2 of the article.
Bio: Jeremy Gibbons is Professor of Computing at the University of Oxford, where he leads the Algebra of Programming research group and is former Deputy Head of Department. He served as Vice Chair then Past Vice Chair of ACM SIGPLAN, with a particular focus on Open Access. He is also Editor-in-Chief of the Journal of Functional Programming, SC Chair of ICFP, on the Advisory Board of PACMPL, an editor of Compositionality, and former Chair of IFIP Working Group 2.1 on Algorithmic Languages and Calculi.
Disclaimer: These posts are written by individual contributors to share their thoughts on the SIGPLAN blog for the benefit of the community. Any views or opinions represented in this blog are personal, belong solely to the blog author and do not represent those of ACM SIGPLAN or its parent organization, ACM.