Introduction: sets and maps
In this and the next round we are interested in the abstract data types for dynamic sets and maps. Here “dynamic” means that the keys, or (key,value) associations in maps, are inserted, searched for and removed all the time in an online fashion.
Sets
A basic API for a set abstract data type could be
insert(k)for inserting the keykin the set,
search(k)for finding whether the keykis in the set, and
remove(k)for deleting the keykfrom the set.
The figure below shows the set diagrams for two sets of strings, where the latter is obtained by inserting a new string in it.
In Scala, this API corresponds to the collection.mutable.Set trait, which is implemented, among others, in the mutable TreeSet and HashSet classes. The methods corresponding to the ones in the asbtract API are
s.add(k)ors += kfor inserting keykinto the sets,
s.contains(k)ors(k)for checking whether the setscontains the keyk, and
s.remove(k)ors -= kfor removing the keykfrom the sets.
In Java, the counterpart is the java.util.Set interface. In C++, the standard library contains two classes, set and unordered_set, implementing a variant of the API (the difference is explained below).
Maps
For maps, also called dictionaries in some languages, the API also allows associating a value to each key.
In Scala collection.mutable.Map trait, the methods are called
m.update(k,v)or simplym(k) = vfor setting the value of the keyktovin the mapm,
m.apply(k)or simplym(k)for getting the value associated with the keykin the mapm, and
m.remove(k)for removing the keykfrom the mapm.
In the Java java.util.Map interface
the methods are called put, get, and remove.
In the C++ standard library map and unordered_map classes, the methods are called
insert, or simplym[k] = v, for a mapm, keyk, and valuev,
m.at(k), or simplym[k], and
m.erase(k).
Ordered sets and maps
In this round, we will in fact consider ordered sets and ordered maps that
assume an ordering between the keys, and
allow efficient searching for the smallest, next smallest, etc keys.
The abstract API of ordered sets extends the basic API with
minfor getting the smallest key in the set,maxfor getting the largest key in the set,predecessor(k)for getting the largest key that is smaller thank, andsuccessor(k)for getting the smallest key that is larger thank.
Our goal is to have data structures and algorithms allowing the operations to be done in logarithmic time in the size of the set. Furthermore, the keys can be listed in ascending order in linear time. Note the difference to the priority queue API of the previous round (Section Priority queues with binary heaps): removing arbitrary keys as well as finding the minimum and successor are now also supported.
In Scala, the TreeSet class implements a variant of the API allowing ordered iteration over the keys in the set.
In the class,
firstKeygives the smallest key whilelastKeyfinds the largest key. Note:minandmaxare slow, linear time generic operations, do not use them.Note 2: in the current Scala version,
val s = collection.mutable.Set()creates a new “hash set” (covered in the next round) in which the operations above take linear time in the worst case. Useval s = collection.mutable.TreeSet()to create an ordered set. In Java, the java.util.TreeSet API is close to the abstract one:
firstgives the smallest key,
lastgives the largest key,
higher(k)gives the smallest key that is larger thank, and
lower(k)gives the largest key that is smaller thank.
The C++ standard library has the set class.
For maps, the corresponding classes are
TreeMap in Scala,
java.util.TreeMap in Java, and
map in the C++ standard library.