Quadtree
A quadtree is a tree data structure in which each internal node has exactly four children. Quadtrees are the two-dimensional analog of octrees and are most often used to partition a two-dimensional space by recursively subdividing it into four quadrants or regions. The data associated with a leaf cell varies by application, but the leaf cell represents a "unit of interesting spatial information".
The subdivided regions may be square or rectangular, or may have arbitrary shapes. This data structure was named a quadtree by Raphael Finkel and J.L. Bentley in 1974. A similar partitioning is also known as a Q-tree. All forms of quadtrees share some common features:
- They decompose space into adaptable cells
- Each cell has a maximum capacity. When maximum capacity is reached, the bucket splits
- The tree directory follows the spatial decomposition of the quadtree.
Types
Quadtrees may be classified according to the type of data they represent, including areas, points, lines and curves. Quadtrees may also be classified by whether the shape of the tree is independent of the order in which data is processed. The following are common types of quadtrees.Region quadtree
The [|region quadtree] represents a partition of space in two dimensions by decomposing the region into four equal quadrants, subquadrants, and so on with each leaf node containing data corresponding to a specific subregion. Each node in the tree either has exactly four children, or has no children. The height of quadtrees that follow this decomposition strategy is sensitive to and dependent on the spatial distribution of interesting areas in the space being decomposed. The region quadtree is a type of trie.A region quadtree with a depth of n may be used to represent an image consisting of 2n × 2n pixels, where each pixel value is 0 or 1. The root node represents the entire image region. If the pixels in any region are not entirely 0s or 1s, it is subdivided. In this application, each leaf node represents a block of pixels that are all 0s or all 1s. Note the potential savings in terms of space when these trees are used for storing images; images often have many regions of considerable size that have the same colour value throughout. Rather than store a big 2-D array of every pixel in the image, a quadtree can capture the same information potentially many divisive levels higher than the pixel-resolution sized cells that we would otherwise require. The tree resolution and overall size is bounded by the pixel and image sizes.
A region quadtree may also be used as a variable resolution representation of a data field. For example, the temperatures in an area may be stored as a quadtree, with each leaf node storing the average temperature over the subregion it represents.
If a region quadtree is used to represent a set of point data, regions are subdivided until each leaf contains at most a single point.
Point quadtree
The point quadtree is an adaptation of a binary tree used to represent two-dimensional point data. It shares the features of all quadtrees but is a true tree as the center of a subdivision is always on a point. It is often very efficient in comparing two-dimensional, ordered data points, usually operating in O time. Point quadtrees are worth mentioning for completeness, but they have been surpassed by k-d trees as tools for generalized binary search.Point quadtrees are constructed as follows. Given the next point to insert, we find the cell in which it lies and add it to the tree. The new point is added such that the cell that contains it is divided into quadrants by the vertical and horizontal lines that run through the point. Consequently, cells are rectangular but not necessarily square. In these trees, each node contains one of the input points.
Since the division of the plane is decided by the order of point-insertion, the tree's height is sensitive to and dependent on insertion order. Inserting in a "bad" order can lead to a tree of height linear in the number of input points. If the point-set is static, pre-processing can be done to create a tree of balanced height.
Node structure for a point quadtree
A node of a point quadtree is similar to a node of a binary tree, with the major difference being that it has four pointers instead of two as in an ordinary binary tree. Also a key is usually decomposed into two parts, referring to x and y coordinates. Therefore, a node contains the following information:- four pointers: quad, quad, quad, and quad
- point; which in turn contains:
- * key; usually expressed as x, y coordinates
- * value; for example a name
Point-region (PR) quadtree
Edge quadtree
Edge quadtrees are used to store lines rather than points. Curves are approximated by subdividing cells to a very fine resolution, specifically until there is a single line segment per cell. Near corners/vertices, edge quadtrees will continue dividing until they reach their maximum level of decomposition. This can result in extremely unbalanced trees which may defeat the purpose of indexing.Polygonal map (PM) quadtree
The polygonal map quadtree is a variation of quadtree which is used to store collections of polygons that may be degenerate.A big difference between PM quadtrees and edge quadtrees is that the cell under consideration is not subdivided if the segments meet at a vertex in the cell.
There are three main classes of PM Quadtrees, which vary depending on what information they store within each black node. PM3 quadtrees can store any amount of non-intersecting edges and at most one point. PM2 quadtrees are the same as PM3 quadtrees except that all edges must share the same end point. Finally PM1 quadtrees are similar to PM2, but black nodes can contain a point and its edges or just a set of edges that share a point, but you cannot have a point and a set of edges that do not contain the point.
Compressed quadtrees
This section summarizes a subsection from a book by Sariel Har-Peled.If we were to store every node corresponding to a subdivided cell, we may end up storing a lot of empty nodes. We can cut down on the size of such sparse trees by only storing subtrees whose leaves have interesting data. We can actually cut down on the size even further. When we only keep important subtrees, the pruning process may leave long paths in the tree where the intermediate nodes have degree two. It turns out that we only need to store the node at the beginning of this path and attach the subtree rooted at its end to. It is still possible for these compressed trees to have a linear height when given "bad" input points.
Although we trim a lot of the tree when we perform this compression, it is still possible to achieve logarithmic-time search, insertion, and deletion by taking advantage of Z-order curves. The Z-order curve maps each cell of the full quadtree in time to a one-dimensional line, creating a total order on the elements. Therefore, we can store the quadtree in a data structure for ordered sets. We must state a reasonable assumption before we continue: we assume that given two real numbers expressed as binary, we can compute in time the index of the first bit in which they differ. We also assume that we can compute in time the lowest common ancestor of two points/cells in the quadtree and establish their relative Z-ordering, and we can compute the floor function in time. With these assumptions, point location of a given point , insertion, and deletion operations can all be performed in time.
To perform a point location for :
- Find the existing cell in the compressed tree that comes before in the Z-order. Call this cell.
- If, return.
- Else, find what would have been the lowest common ancestor of the point and the cell in an uncompressed quadtree. Call this ancestor cell.
- Find the existing cell in the compressed tree that comes before in the Z-order and return it.
Some common uses of quadtrees
- Image representation
- [|Image processing]
- [|Mesh generation]
- Spatial indexing, point location queries, and range queries
- Efficient collision detection in two dimensions
- View frustum culling of terrain data
- Storing sparse data, such as a formatting information for a spreadsheet or for some matrix calculations
- Solution of multidimensional fields
- Conway's Game of Life simulation program.
- State estimation
- Quadtrees are also used in the area of fractal image analysis
-
Image processing using quadtrees
Image Union/Intersection
One of the advantages of using quadtrees for image manipulation is that the set operations of union and intersection can be done simply and quickly.Given two binary images, the image union produces an image wherein a pixel is black if either of the input images has a black pixel in the same location. That is, a pixel in the output image is white only when the corresponding pixel in both input images is white, otherwise the output pixel is black. Rather than do the operation pixel by pixel, we can compute the union more efficiently by leveraging the quadtree's ability to represent multiple pixels with a single node. For the purposes of discussion below, if a subtree contains both black and white pixels we will say that the root of that subtree is coloured grey.
The algorithm works by traversing the two input quadtrees while building the output quadtree. Informally, the algorithm is as follows. Consider the nodes and corresponding to the same region in the images.
- If or is black, the corresponding node is created in and is colored black. If only one of them is black and the other is gray, the gray node will contain a subtree underneath. This subtree need not be traversed.
- If is white, and the subtree underneath it is copied to .
- If both and are gray, then the corresponding children of and are considered.
The intersection of two images is almost the same algorithm. One way to think about the intersection of the two images is that we are doing a union with respect to the white pixels. As such, to perform the intersection we swap the mentions of black and white in the union algorithm.
Connected Component Labelling
Consider two neighbouring black pixels in a binary image. They are adjacent if they share a bounding horizontal or vertical edge. In general, two black pixels are connected if one can be reached from the other by moving only to adjacent pixels. Each maximal set of connected black pixels is a connected component. Using the quadtree representation of images, Samet showed how we can find and label these connected components in time proportional to the size of the quadtree. This algorithm can also be used for polygon colouring.The algorithm works in three steps:
- establish the adjacency relationships between black pixels
- process the equivalence relations from the first step to obtain one unique label for each connected component
- label the black pixels with the label associated with their connected component
Step one is accomplished with a post-order traversal of the quadtree. For each black leaf we look at the node or nodes representing cells that are Northern neighbours and Eastern neighbours. Since the tree is organized in Z-order, we have the invariant that the Southern and Western neighbours have already been taken care of and accounted for. Let the Northern or Eastern neighbour currently under consideration be. If represents black pixels:
- If only one of or has a label, assign that label to the other cell
- If neither of them have labels, create one and assign it to both of them
- If and have different labels, record this label equivalence and move on
Step three performs another post-order traversal. This time, for each black node we use the union-find's find operation to find and assign its new label.
Mesh generation using quadtrees
This section summarizes a chapter from a book by Har-Peled and de Berg et al.Mesh generation is essentially the triangulation of a point set for which further processing may be performed. As such, it is desirable for the resulting triangulation to have certain properties to make further processing quicker and less error-prone. Quadtrees built on the point set can be used to create meshes with these desired properties.
Consider a leaf of the quadtree and its corresponding cell. We say is balanced if the cell's sides are intersected by the corner points of neighbouring cells at most once on each side. This means that the quadtree levels of leaves adjacent to differ by at most one from the level of. When this is true for all leaves, we say the whole quadtree is balanced.
Consider the cell and the neighbourhood of same-sized cells centred at. We call this neighbourhood the extended cluster. We say the quadtree is well-balanced if it is balanced, and for every leaf that contains a point of the point set, its extended cluster is also in the quadtree and the extended cluster contains no other point of the point set.
Creating the mesh is done as follows:
- Build a quadtree on the input points.
- Ensure the quadtree is balanced. For every leaf, if there is a neighbour that is too large, subdivide the neighbour. This is repeated until the tree is balanced. We also make sure that for a leaf with a point in it, the nodes for each leaf's extended cluster are in the tree.
- For every leaf node that contains a point, if the extended cluster contains another point, we further subdivide the tree and rebalance as necessary. If we needed to subdivide, for each child of we ensure the nodes of 's extended cluster are in the tree.
- Repeat the previous step until the tree is well-balanced.
- Transform the quadtree into a triangulation.
The remaining squares are triangulated according to some simple rules. For each regular square, introduce the diagonal. Note that due to the way in which we separated points with the well-balancing property, no square with a corner intersecting a side is one that was warped. As such, we can triangulate squares with intersecting corners as follows. If there is one intersected side, the square becomes three triangles by adding the long diagonals connecting the intersection with opposite corners. If there are four intersected sides, we split the square in half by adding an edge between two of the four intersections, and then connect these two endpoints to the remaining two intersection points. For the other squares, we introduce a point in the middle and connect it to all four corners of the square as well as each intersection point.
At the end of it all, we have a nice triangulated mesh of our point set built from a quadtree.
Pseudocode
The following pseudo code shows one means of implementing a quadtree which handles only points. There are other approaches available.Prerequisites
It is assumed these structures are used.// Simple coordinate object to represent points and vectors
struct XY
// Axis-aligned bounding box with half dimension and center
struct AABB
QuadTree class
This class represents both one quad tree and the node where it is rooted.class QuadTree
Insertion
The following method inserts a point into the appropriate quad of a quadtree, splitting if necessary.class QuadTree
Query range
The following method finds all points contained within a range.class QuadTree
General references
- Chapter 14: Quadtrees: pp. 291–306.