Van Emde Boas tree

A van Emde Boas tree, also known as a vEB tree or van Emde Boas priority queue, is a tree data structure which implements an associative array with -bit integer keys. It performs all operations in time, or equivalently in time, where is the maximum number of elements that can be stored in the tree. The is not to be confused with the actual number of elements stored in the tree, by which the performance of other tree data-structures is often measured. The vEB tree has good space efficiency when it contains many elements, as discussed below. It was invented by a team led by Dutch computer scientist Peter van Emde Boas in 1975.

Supported operations

A vEB supports the operations of an ordered associative array, which includes the usual associative array operations along with two more order operations, FindNext and FindPrevious:

Insert: insert a key/value pair with an -bit key
Delete: remove the key/value pair with a given key
Lookup: find the value associated with a given key
Find: find the key/value pair with the smallest key which is greater than a given
FindPrevious: find the key/value pair with the largest key which is smaller than a given

A vEB tree also supports the operations Minimum and Maximum, which return the minimum and maximum element stored in the tree respectively. These both run in time, since the minimum and maximum element are stored as attributes in each tree.

How it works

For the sake of simplicity, let for some integer k. Define. A vEB tree over the universe has a root node that stores an array of length. is a pointer to a vEB tree that is responsible for the values. Additionally, T stores two values and as well as an auxiliary vEB tree.
Data is stored in a vEB tree as follows: The smallest value currently in the tree is stored in and largest value is stored in. Note that is not stored anywhere else in the vEB tree, while is. If T is empty then we use the convention that and. Any other value x is stored in the subtree where. The auxiliary tree keeps track of which children are non-empty, so contains the value j if and only if is non-empty.

FindNext

The operation that searches for the successor of an element x in a vEB tree proceeds as follows: If then the search is complete, and the answer is. If then the next element does not exist, return M. Otherwise, let. If then the value being searched for is contained in so the search proceeds recursively in. Otherwise, we search for the value i in. This gives us the index j of the first subtree that contains an element larger than x. The algorithm then returns. The element found on the children level needs to be composed with the high bits to form a complete next element.
function FindNext.
if x < T.min then
return T.min
if x ≥ T.max then // no next element
return M
i = floor
lo = x mod

if lo < T.children.max then
return + FindNext
j = FindNext
return + T.children.min
end
Note that, in any case, the algorithm performs work and then possibly recurses on a subtree over a universe of size . This gives a recurrence for the running time of, which resolves to.

Insert

The call that inserts a value into a vEB tree operates as follows:

If T is empty then we set and we are done.
Otherwise, if then we insert into the subtree responsible for and then set. If was previously empty, then we also insert into
Otherwise, if then we insert into the subtree responsible for and then set. If was previously empty, then we also insert into
Otherwise, so we insert into the subtree responsible for. If was previously empty, then we also insert into.

In code:
function Insert
if T.min > T.max then // T is empty
T.min = T.max = x;
return
if x < T.min then
swap
if x > T.max then
T.max = x
i = floor
lo = x mod
Insert
if T.children.min T.children.max then
Insert
end
The key to the efficiency of this procedure is that inserting an element into an empty vEB tree takes time. So, even though the algorithm sometimes makes two recursive calls, this only occurs when the first recursive call was into an empty subtree. This gives the same running time recurrence of as before.

Delete

Deletion from vEB trees is the trickiest of the operations. The call that deletes a value x from a vEB tree T operates as follows:

If then x is the only element stored in the tree and we set and to indicate that the tree is empty.
Otherwise, if then we need to find the second-smallest value y in the vEB tree, delete it from its current location, and set. The second-smallest value y is, so it can be found in time. We delete y from the subtree that contains it.
If and then we delete x from the subtree that contains x.
If then we will need to find the second-largest value y in the vEB tree and set. We start by deleting x as in previous case. Then value y is either or, so it can be found in time.
In any of the above cases, if we delete the last element x or y from any subtree then we also delete i from

In code:
function Delete
if T.min T.max x then
T.min = M
T.max = −1
return
if x T.min then
hi = T.aux.min *
j = T.aux.min
T.min = x = hi + T.children.min
i = floor
lo = x mod
Delete
if T.children is empty then
Delete
if x T.max then
if T.aux is empty then
T.max = T.min
else
hi = T.aux.max *
j = T.aux.max
T.max = hi + T.children.max
end
Again, the efficiency of this procedure hinges on the fact that deleting from a vEB tree that contains only one element takes only constant time. In particular, the last line of code only executes if x was the only element in prior to the deletion.

Discussion

The assumption that is an integer is unnecessary. The operations and can be replaced by taking only higher-order and the lower-order bits of, respectively. On any existing machine, this is more efficient than division or remainder computations.
The implementation described above uses pointers and occupies a total space of. This can be seen as follows. The recurrence is.
Resolving that would lead to.
One can, fortunately, also show that by induction.
In practical implementations, especially on machines with shift-by-k and find first zero instructions, performance can further be improved by switching to a bit array once equal to the word size is reached. Since all operations on a single word are constant time, this does not affect the asymptotic performance, but it does avoid the majority of the pointer storage and several pointer dereferences, achieving a significant practical savings in time and space with this trick.
An obvious optimization of vEB trees is to discard empty subtrees. This makes vEB trees quite compact when they contain many elements, because no subtrees are created until something needs to be added to them. Initially, each element added creates about new trees containing about pointers all together. As the tree grows, more and more subtrees are reused, especially the larger ones. In a full tree of elements, only space is used. Moreover, unlike a binary search tree, most of this space is being used to store data: even for billions of elements, the pointers in a full vEB tree number in the thousands.
However, for small trees the overhead associated with vEB trees is enormous: on the order of. This is one reason why they are not popular in practice. One way of addressing this limitation is to use only a fixed number of bits per level, which results in a trie. Alternatively, each table may be replaced by a hash table, reducing the space to at the expense of making the data structure randomized. Other structures, including y-fast tries and x-fast tries have been proposed that have comparable update and query times and also use randomized hash tables to reduce the space to or.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...