Sedna (database)


Sedna is an open-source database management system that provides native storage for XML data.
The distinctive design decisions employed in Sedna are schema-based clustering storage strategy for XML data and memory management based on layered address space.

Data organization

organization in Sedna is designed with the goal of providing a balance in performance between XML queries and updates execution.
The two primary design decisions in data organization in Sedna are:
  1. Direct pointers are used to represent XML node relationships such as parent, child, and sibling ones. Unlike relational-based approaches that require performing joins for traversing an XML document, traversing in Sedna is performed by simply following a direct pointer.
  2. A descriptive schema-driven storage strategy is developed which consists of clustering nodes of an XML document according to their positions in the descriptive schema of the document. In contrast to a prescriptive schema that is known in advance and is usually specified in DTD or XML Schema, the descriptive schema is generated from data dynamically and represents a concise and an accurate structure summary for data. Using the descriptive schema instead of the prescriptive one makes the storage strategy applicable to any XML document, even a one that comes with no prescriptive schema.
The following figure illustrates the overall principles of data organization in Sedna.
The descriptive schema represented as a tree of schema nodes is the central component in the data organization.
Each schema node is labeled with an XML node kind and has a pointer to data blocks that store XML nodes corresponding to the given schema node.
Depending on their node kind, some schema nodes are also labeled with names.
Data blocks related to a common schema node are linked via pointers into a bidirectional list. Node descriptors in a list of blocks are partly ordered according to document order.

Citations