A Visual Introduction to Treap Data Structure (Part I: The Basics) (2024)

Igor Carpanese

Published in

Carpanese's Blog

10 min read

Random Binary Search Trees

Let’s talk about one of the most beautiful interpretations of treaps.

Suppose we have, a priori, all the keys that will be inserted in a BST (a simple one, not a treap). We know that if the keys are sorted, our BST will degenerate into a list. What should we do to minimize the height of the BST if we could change the order of the insertions?

A Visual Introduction to Treap Data Structure (Part I: The Basics) (4)

In fact, it can be prove that a simple shuffle in the list of keys is sufficient to made the tree balanced (see CLRS, 3rd edition, theorem 12.4). The intuition is that the more sorted the input, the higher will be tree. The shuffle breaks the order, making the height of the tree almost minimum. In practice, we can assume that the h = lg(n).

A Visual Introduction to Treap Data Structure (Part I: The Basics) (5)

That’s an interesting fact, but it’s also almost useless because we have to read all insertions first. The treap, however, is a data structure that can help us shuffle the keys in a smart way after each insertion.

The idea is that we can insert all the keys in the worst possible way (which means in a sorted way), but the random priority will make sure to shuffle it “in real time”.

A Visual Introduction to Treap Data Structure (Part I: The Basics) (6)

We don’t know the order of the insertions in that treap (we don’t even know how to insert elements in a treap yet), but we can say for sure that the treap simulates one specific order.

Take a look at the priorities of each node. Let’s sort the vertices based on their priorities.

A Visual Introduction to Treap Data Structure (Part I: The Basics) (7)

The insertion order a treap simulates is given by the priority of its nodes. Note that, in our case, we don’t know the “real” insertion order, but the treap made a real time shuffle to 7, 6, 8, 2, 1, 9, 4, 10, 5 and 3.

In fact, the only difference between figures 2b and 3 is that the first one we shuffle a already known input, and the last one we shuffle the input in real time using random priorities.

Insertion

We already know that there could be more than one BST for a given set of nodes (see figure 2a). In fact, there’s two simple operations that allows us to modify the tree and keep the BST property, the right and left rotations.

A Visual Introduction to Treap Data Structure (Part I: The Basics) (8)

Both operations are common in self-balancing BST, like AVL and red black trees. Each data structure uses different strategies to identify the best moment and the best place to rotate the tree. We’ll, on the other hand, use rotations in a different way.

To insert a node on a treap, we’ll, in a first moment, ignore the priority and insert it like a simple BST.

A Visual Introduction to Treap Data Structure (Part I: The Basics) (9)

Even though BST property is being preserved, the heap property is not. We should move the new node (7, 12) up, but if we do that the BST property will break. How can we fix that?

We’ll move it up using the rotations! The steps are described in figure 7.

A Visual Introduction to Treap Data Structure (Part I: The Basics) (10)

A Visual Introduction to Treap Data Structure (Part I: The Basics) (11)

A Visual Introduction to Treap Data Structure (Part I: The Basics) (12)

A Visual Introduction to Treap Data Structure (Part I: The Basics) (13)

In the first step, we go down from the root to the leaf. In the second step, we go up from a leaf to the root at maximum using O(1) rotations. It means that the insertion time complexity is O(2h) = O(log(n)).

We can understand a treap insertion as a BST insertion (considering only the key) followed by a heap insertion (start as a leaf and go up to the tree until you fix the heap property), which is another way to thing about the O(log(n)) time complexity.

Let’s think a little more about the second step, when we start at a leaf and go up to the tree. The only way to go from the leaf to the root is if the new priority has the highest value. In other words, if you are generating random numbers using an uniform distribution, the new value must be the highest from a set of |T| (number of nodes of the tree), witch will happen with a probability of 1/|T| (try to prove that). It means that the rotations occur often near the leaves.

Offline Building

It’s easy to see that is always possible to insert a node in a treap. It means that to build a treap given a set of n nodes we just have to make n insertions.

If we have all the keys sorted, we can build the treap in O(n) time complexity instead. The idea is to split the array into two parts, from 0 to m-1 and from m+1 from n-1, where n is the size of the array and m is n/2.

A Visual Introduction to Treap Data Structure (Part I: The Basics) (14)

Now we apply the same rule recursively for the left child and the subarray [0, m-1] and the right child and the subarray [m+1, n-1].

A Visual Introduction to Treap Data Structure (Part I: The Basics) (15)

Note that we choose m = n/2 to minimize the height. It means that we don’t have to worry about the priorities. We can assign any priority to any node and just heapify them in linear time.

If it’s the first time you see this construction, it’s not easy to understand why the heapify function is linear. You can check the formal proof here.

Deletion

Given a key, we can’t just find its node and remove it. Pulling out internal nodes splits the tree and we don’t want it. The solution is to move the node down to the tree and remove it only when it becomes a leaf.

The question now is what path do we choose to go down? That node may have two children, which one should we select?

To maintain the heap property, it’s easy to see that we have to rotate the tree from the children with higher priority (see image 10).

A Visual Introduction to Treap Data Structure (Part I: The Basics) (16)

A Visual Introduction to Treap Data Structure (Part I: The Basics) (17)

The idea is simple: After the rotation, one of the children will be the father of the other one, so we have to “promote” the children with highest priority, otherwise we’ll break the heap property.

Split

The power of treaps comes from their unique operations, which are impossible to implement in a efficient way using others BSTs. This article will cover one of those operations, the split.

We want to split the original treap into two parts L and R such that both L and R are treaps as well, all nodes of L are smaller or equal to a given value X and all nodes of R are greater than X.

A Visual Introduction to Treap Data Structure (Part I: The Basics) (18)

A Visual Introduction to Treap Data Structure (Part I: The Basics) (19)

It’s real simple to implement it. Just insert a new node with key X and infinity priority. The heap property will make that artificial node become the root of the treap, and the BST property will guarantee that its left and right child satisfy the conditions we want.

A Visual Introduction to Treap Data Structure (Part I: The Basics) (20)

Note that if X is already a key in the treap, by the definition of BST, the original node will be the left child of the new artificial node.

Search

Nothing really special here. It’s exactly the same algorithm used in any other BST.

If you search for “treap data structures” on Google, you’ll probably find some results related to cartesian trees. What are they? Is the treap a cartesian tree? First things first, let’s define it.

A cartesian tree can be define recursivelly as follows: Each one of its nodes can be written in the form (x, y) for x, y ∊ ℝ such that x_l < x < x_r and y_l, y_r < y for the left child (x_l, y_l) and right child (x_r, y_r). Geometrically, it means that for each node, if we translate the origin to (x, y) the right child will be in the first quadrant and the left child will be in the second quadrant (see figure 13).

A Visual Introduction to Treap Data Structure (Part I: The Basics) (21)

Cartesian trees are really useful in some contexts, for example, to prove the equivalence between the range minimum query and the lowest common ancestor (I will write about that someday). It appears in that proof in a slightly different way. Without going into details, we’ll select the minimum value of the array to become the root of the tree, let’s say it is found at position i. The left child will be the minimum value in the interval [0, i-1] and the right child will be the minimum value in the interval [i+1, n-1]. We’ll continue this process recursively (see figure 14).

A Visual Introduction to Treap Data Structure (Part I: The Basics) (22)

Now that we understood what a cartesian tree is, what’s its relation to treaps?

Note that the x value of cartesian tree’s node maintains the BST property, while the y value maintains the heap property. We just said it with different words, but it’s the same definition. If fact, figure 10 and figure 7d are different visualizations of the same tree (see figure 15)!

A Visual Introduction to Treap Data Structure (Part I: The Basics) (23)

It’s not totally right to say that a cartesian tree is a treap because treaps have random values for y. On the other hand, it’s not totally wrong to say that both data structures are the same thing because randomization is not essential and is only used to guarantee a logarithmic height. I like to think about them as different, but equivalent data structures. Something similar to the difference between points and vectors.

This article introduced a first view of treaps. We saw how to use them as simple binary search trees. This way of using it is far from explore the full power of this data structure.

In the next part will focus on split and merge operations and how to use them to write a simple and efficient implementation of treaps. We’ll also see more interesting problems, like how to reverse an subarray in logarithmic time complexity, and how to add/remove values in any position of the array, also in logarithmic time.

1. If we remove a key from treap and reinsert it immediately afterwards, the treap structure will not change. Prove it or give a counterexample.

2. Given a set of nodes of the form (key, priority), there’s only one possible treap associated with those nodes (excluding all isomorphic variations). Prove it or give a counterexample.

3. Write a function to merge two treaps L, R such that all keys of the left tree are smaller than all keys of the right tree. You can understand the merge operation as the inverse of a split.

This article would not be possible without the effort made by previous contributions.

Randomized Search Trees

The original article written in 1989 by Cecilia R. Aragon and Raimund G. Seidel. You can read it here.

Open Data Structures

The article wrote by Open Data Structures gives good insights about the treap properties and some formal explanations as well. Its implementation is not the cleanest one, however.

Techie Delight

The implementation I used in this article is strongly based on the Techie Delight’s one. It’s really simple and easy to understand. The page has some really cool problems, very similar to the Geeks for Geeks of a few years ago.

E-Maxx (CP Algorithms)

E-Maxx is a wide-known site about competitive programming and it has an rich page about treaps. My next article is strongly based on that material, but I used some of their concepts here as well, like the off-line building in linear time.

A Visual Introduction to Treap Data Structure (Part I: The Basics) (2024)

FAQs

What is a treap in data structure? ›

A treap is a search tree data structure based on binary search trees (BST) and heaps. The basic structure of a treap is that of a binary tree, with each node containing a key and a priority.

View Details ›

How to build a treap? ›

It means that to build a treap given a set of n nodes we just have to make n insertions. If we have all the keys sorted, we can build the treap in O(n) time complexity instead. The idea is to split the array into two parts, from 0 to m-1 and from m+1 from n-1, where n is the size of the array and m is n/2.

Read The Full Story ›

What are the advantages of treaps? ›

Of the many types of balanced trees that have been developed over the years, treaps [26] have the advantage of both being simple and general—in addition to insertion and deletion they easily support efficient finger searching, joining, and splitting.

Read On ›

How do I delete from a treap? ›

Delete in treaps

Search for the node X containing K using the usual BST find algorithm.
If the node X is a leaf, just delete the node (unlink it from its parent)
Otherwise, use AVL rotations to rotate the node down until it becomes a leaf; then delete it.

More items...

What are the use cases of treap? ›

It maintains a dynamic set of ordered keys and allows binary searches among these keys. Treaps can be used for two use cases: a regular treap as a balanced binary search tree, or an implicit cartesian tree as rope data structure. This article covers both.

Learn More Now ›

How to split a treap? ›

Splitting. The split method takes in a pointer to the root of a treap root and a value x, and returns two treaps denoted as left and right. Like the name suggests, it splits the tree such that all nodes in left have keys less than or equal to x and all nodes in right have keys greater than x.

Explore More ›

How do you insert into a treap? ›

Insert

Create new node with key equals to x and value equals to a random value.
Perform standard BST insert.
A newly inserted node gets a random priority, so Max-Heap property may be violated.. Use rotations to make sure that inserted node's priority follows max heap property.

Nov 27, 2023

Learn More Now ›

What are tries in data structure? ›

A trie is a tree-like data structure whose nodes store the letters of an alphabet. By structuring the nodes in a particular way, words and strings can be retrieved from the structure by traversing down a branch path of the tree.

Discover More Details ›

What is 2/3 tree in data structure? ›

In computer science, a 2–3 tree is a tree data structure, where every node with children (internal node) has either two children (2-node) and one data element or three children (3-node) and two data elements. A 2–3 tree is a B-tree of order 3.

What is the uniqueness of the treap? ›

A treap is ordered from left to right by key (A, B, C, F, J, L) and heap-ordered by priorities (e.g., 87 > 59 and 77; 77 > 42 and 10). Uniqueness: It turns out that there is exactly one treap for any given set of (distinct) keys and their (distinct) priorities.

Get More Info Here ›

Is treap used to solve connectivity problems? ›

Treap is a very useful data structure in computer science, it is used to solve various problems like connectivity problems, range query problems and can be used as good alternatives to segment trees/sparse tables and splay trees in some of the scenerios as they are relatively also easier to code.

Discover More Details ›

How to build a cartesian tree? ›

To construct the Cartesian tree, set its root to be the minimum number in the sequence, and recursively construct its left and right subtrees from the subsequences before and after this number. It is uniquely defined as a min-heap whose symmetric (in-order) traversal returns the original sequence.

Get More Info ›

Which button is used to delete? ›

The delete key (often abbreviated del) is a button on most computer keyboards which is typically used to delete either (in text mode) the character ahead of or beneath the cursor, or (in GUI mode) the currently-selected object. The key is sometimes referred to as the "forward delete" key.

Explore More ›

How do I delete a subtree? ›

Removing a subtree

Select the subtree you want to remove.
Expand the Select Action menu, select Delete subtree and click Go.
When asked to confirm the deletion, click OK.

Get More Info Here ›

Is a treap a combination of a tree and a heap? ›

Treap data structure is a hybrid of a binary search tree and a heap. The treap and the randomized binary search tree are two binary search tree data structures that keep a dynamic set of ordered keys and allow binary searches among the keys.

Learn More Now ›

What is a skip list in data structure? ›

In computer science, a skip list (or skiplist) is a probabilistic data structure that allows average complexity for search as well as average complexity for insertion within an ordered sequence of. elements.

Get More Info ›

What is Fibonacci Heap in data structure? ›

A Fibonacci heap is a heap data structure similar to the binomial heap. It uses Fibonacci numbers and also used to implement the priority queue element in Dijkstra's shortest path algorithm which reduces the time complexity from O(m log n) to O(m + n log n), giving the algorithm a huge boost in terms of running time.

Find Out More ›

How do I insert into treap? ›

Insert

Create new node with key equals to x and value equals to a random value.
Perform standard BST insert.
A newly inserted node gets a random priority, so Max-Heap property may be violated.. Use rotations to make sure that inserted node's priority follows max heap property.

Nov 27, 2023

Explore More ›