Greedy Algorithms

In the two last examples of dynamic programming, we had to consider many sub-problems to solve a problem. For instance, in the rodding cutting problem we had to consider all the possible ways of making the first cut. Similarly, in the LCS problem there were two potential ways for obtaining the optimal solution when the last symbols in the sequence were not the same.

However, sometimes it is possible to greedily select just one case and thus reduce the problem into one smaller sub-problem. We next consider one such simple greedy algorithm. In the graphs rounds, we’ll see other greedy algorithms when we study the minimum spanning tree algorithms.

Example IV: Activity-selection problem

Assume that we are given a finite set of activities A={a1,...,an}. Each activity ai is associated with

  • ​ a start time si, and

  • ​ a finish time fi with si<fi.

An activity executes in the half-open interval [si,fi). Two activities, ai and aj for ij, are said to be compatible if they do not execute at the same time, meaning that either fisj or fjsi. Our task is find a maximal compatible subset of A, i.e., a subset AA of largest size such that all the activities in the set are mutually compatible.

Example

Consider the set of activities A={a1,...,a11} with

i1234567891011si130535688212fi4567991011121416

A maximal compatible subset is {a2,a4,a9,a11}.

_images/dynprog-activities.png

A dynamic programming formulation

In order to relate dynamic programming and greedy algorithms, let us first see a dynamic programming approach for solving the activity selection problem.

Assume that the set of activities is A={a1,...,an}. For obtaining the recursive definition for optimal solutions, let

Al,u={aiAlsifiu}

be the set of activities that start at l or later and finish at latest at u. If we use L=mini[1..n]si and U=maxi[1..n]fi, then the original problem instance A is AL,U.

Each (sub-)problem on Al,u has the optimal sub-solution property:

  • ​ Take any activity aiAl,u

  • ​ Let Al,si and Afi,u be any maximal compatible subsets for the sets Al,si of activities in Al,u ending before ai and Afi,u of activities in Al,u starting after ai.

  • ​ Then Al,u at i=Al,si{ai}Afi,u is a maximal compatible subset for Al,u that contains ai.

Thus a maximal compatible subset for Al,u is

Al,u=maxajAl,uAl,u at j.

The solution for the original problem is AL,U. We now have a recursive definition that can be turned into a dynamic programming algorithm. Note that the number of sub-problems Al,u one has to consider is at most (n+1)×(n+1) because (i) the start index l of each Al,u is either L or an ending time of an activity, and (ii) the ending index u of each Al,u is either U or a starting time of an activity. Thus the dynamic programming approach can be made to work in polynomial time. On the contrary, the straighforward approach of

  • ​ enumerating all the subsets of A,

  • ​ checking the compatibility of each, and

  • ​ remembering a largest compatible one

would require at least exponential time as there are exponetially many subsets.

In Scala, we can implement the dynamic programming algorithm as follows. Note that the implementation is not very well optimized.

def activitySelectorDynprog(acts: Set[(Int,Int)]) =
  require(acts.forall(a => a._1 >= 0 && a._1 < a._2))
  val mem = collection.mutable.Map[Set[(Int,Int)],Set[(Int,Int)]]()
  def inner(current: Set[(Int,Int)]): Set[(Int,Int)] =
    mem.getOrElseUpdate(current, {
      var best = Set[(Int,Int)]()
      for act <- current do
        val earlier = current.filter(a => a._2 <= act._1)
        val later = current.filter(a => act._2 <= a._1)
        val earlierResult = inner(earlier)
        val laterResult = inner(later)
        if earlierResult.size + 1 + laterResult.size > best.size then
          best = (earlierResult ++ laterResult) + act
      best
    })
  inner(acts)

Running our example with it, we get:

scala> val acts = Set((1,4),(3,5),(0,6),(5,7),(3,9),(5,9),(6,10),
                      (8,11),(8,12),(2,14),(12,16))
acts: Set[(Int, Int)] = HashSet((8,12), (1,4), (8,11), (5,9), (3,9), (3,5), (12,16), (0,6), (5,7), (6,10), (2,14))

scala> activitySelectorDynprog(acts).toVector.sortBy(_._1)
res1: Vector[(Int, Int)] = Vector((1,4), (5,7), (8,12), (12,16))

The greedy approach

The activity selection problem has a property that we can use to obtain an even faster and simpler algorithm. Namely, we do not have to consider all the activities in a subset but can

  • ​ just select one that finishes as early as possible, and

  • ​ then, consider the sub-problem obtained by removing that activity and all the activities that are not compatible with it.

This optimization is based on the observation that if we have a maximal compatible subset that does not contain an activity ai that finishes as early as possible, we can substitute one that is in the set with ai and obtain a compatible subset of the same maximal size. This is formalized in the following theorem.

Theorem

Let AA be a maximal subset of compatible activities, ai an activity with the earliest finish time among those in A, and aj be the activity with the earliest finish time among those in A. Then (A{aj}){ai} is also a maximal subset of compatible activities.

Proof

If i=j, we are done.

Otherwise, it must be that fifj. As aj is the activity with the earliest finish time among those in A, A does not contain any ak, kj, with skfi. Therefore, it is safe to substitute aj with ai in A and obtain a set of compatible activities that has the same maximal size.

Example

Consider again the activities A={a1,...,a11}, where

i1234567891011si130535688212fi4567991011121416

One maximal compatible subset is {a2,a4,a9,a11}. It does not contain the activity a1 with the earliest finish time among those in A. If we substitute the activity a2 having the earliest finish time among those in A with the activity a1, the result is the maximal compatible subset {a1,a4,a9,a11}.

Observe that {a4,a9,a11} is a maximal compatible subset of the “reduced” sub-problem {a4,a6,a7,a8,a9,a11} obtained by removing a1 and all the activities that are not compatible with it.

_images/dynprog-activities-2.png

In Scala, we can implement the greedy algorithm as follows:

def activitySelectorGreedy(acts: Set[(Int,Int)]) =
  require(acts.forall(a => a._1 >= 0 && a._1 < a._2))
  val sorted = acts.toVector.sortBy(_._2)
  val sol = collection.mutable.ArrayBuffer[(Int,Int)]()
  var time = Int.MinValue
  for act <- sorted do
    if act._1 >= time then
      sol += act
      time = act._2
  sol.toSet

Running our example with it, we get:

scala> val acts = Set((1,4),(3,5),(0,6),(5,7),(3,9),(5,9),(6,10),(8,11),(8,12),(2,14),(12,16))
acts: Set[(Int, Int)] = HashSet((8,12), (1,4), (8,11), (5,9), (3,9), (3,5), (12,16), (0,6), (5,7), (6,10), (2,14))
                                                                                
scala> activitySelectorGreedy(acts).toVector.sortBy(_._1)
res0: Vector[(Int, Int)] = Vector((1,4), (5,7), (8,11), (12,16))