Shadow paging


In computer science, shadow paging is a technique for providing atomicity and durability in database systems. A page in this context refers to a unit of physical storage, typically of the order of 1 to 64 KiB.
Shadow paging is a copy-on-write technique for avoiding in-place updates of pages. Instead, when a page is to be modified, a shadow page is allocated. Since the shadow page has no references, it can be modified liberally, without concern for consistency constraints, etc. When the page is ready to become durable, all pages that referred to the original are updated to refer to the new replacement page instead. Because the page is "activated" only when it is ready, it is atomic.
If the referring pages must also be updated via shadow paging, this procedure may recurse many times, becoming quite costly. One solution, employed by the Write Anywhere File Layout file system is to be lazy about making pages durable. This increases performance significantly by avoiding many writes on hotspots high up in the referential hierarchy at the cost of high commit latency.
Write-ahead logging is a more popular solution that uses in-place updates.
Shadow paging is similar to the old master-new master batch processing technique used in mainframe database systems. In these systems, the output of each batch run was written to two separate disks or other form of storage medium. One was kept for backup, and the other was used as the starting point for the next day's work.
Shadow paging is also similar to purely functional data structures, in that in-place updates are avoided.