Behavior tree (artificial intelligence, robotics and control)


A behavior tree is a mathematical model of plan execution used in computer science, robotics, control systems and video games. They describe switchings between a finite set of tasks in a modular fashion. Their strength comes from their ability to create very complex tasks composed of simple tasks, without worrying how the simple tasks are implemented. Behavior trees present some similarities to hierarchical state machines with the key difference that the main building block of a behavior is a task rather than a state. Its ease of human understanding make behavior trees less error prone and very popular in the game developer community. Behavior trees have been shown to generalize several other control architectures. Mathematically, they are directed acyclic graphs.

Background

Behavior trees originate from the computer game industry as a powerful tool to model the behavior of non-player characters.
They have been extensively used in high-profile video games such as Halo, Bioshock, and Spore. Recent works propose behavior trees as a multi-mission control framework for UAV, complex robots, robotic manipulation, and multi-robot systems.
Behavior trees have now reached the maturity to be treated in Game AI textbooks, as well as generic game environments such as Pygame and Unreal Engine.
Behavior trees became popular for their development paradigm: being able to create a complex behavior by only programming the NPC's actions and then designing a tree structure whose leaf nodes are actions and whose inner nodes determine the NPC's decision making. Behavior trees are visually intuitive and easy to design, test, and debug, and provide more modularity, scalability, and reusability than other behavior creation methods.
Over the years, the diverse implementations of behavior trees kept improving both in efficiency and capabilities to satisfy the demands of the industry, until they evolved into event-driven behavior trees. Event-driven behavior trees solved some scalability issues of classical behavior trees by changing how the tree internally handles its execution, and by introducing a new type of node that can react to events and abort running nodes. Nowadays, the concept of event-driven behavior tree is a standard and used in most of the implementations, even though they are still called "behavior trees" for simplicity.

Key concepts

A behavior tree is graphically represented as a directed tree in which the nodes are classified as root, control flow nodes, or execution nodes. For each pair of connected nodes the outgoing node is called parent and the incoming node is called child. The root has no parents and exactly one child, the control flow nodes have one parent and at least one child, and the execution nodes have one parent and no children. Graphically, the children of a control flow node are placed below it, ordered from left to right.
The execution of a behavior tree starts from the root which sends ticks with a certain frequency to its child. A tick is an enabling signal that allows the execution of a child. When the execution of a node in the behavior tree is allowed, it returns to the parent a status running if its execution has not finished yet, success if it has achieved its goal, or failure otherwise.

Control flow node

A control flow node is used to control the subtasks of which it is composed. A control flow node may be either a selector node or a sequence node. They run each of their subtasks in turn. When a subtask is completed and returns its status, the control flow node decides whether to execute the next subtask or not.

Selector (fallback) node

Fallback nodes are used to find and execute the first child that does not fail. A fallback node will return immediately with a status code of success or running when one of its children returns success or running. The children are ticked in order of importance, from left to right.
In pseudocode, the algorithm for a fallback composition is:
1 for i from 1 to n do
2 childstatus ← Tick
3 if childstatus = running
4 return running
5 else if childstatus = success
6 return success
7 end
8 return failure

Sequence node

Sequence nodes are used to find and execute the first child that has not yet succeeded. A sequence node will return immediately with a status code of failure or running when one of its children returns failure or running. The children are ticked in order, from left to right.
In pseudocode, the algorithm for a sequence composition is:
1 for i from 1 to n do
2 childstatus ← Tick
3 if childstatus = running
4 return running
5 else if childstatus = failure
6 return failure
7 end
8 return success

Mathematical state space definition

In order to apply control theory tools to the analysis of behavior trees, they can be defined as three-tuple.
where is the index of the tree, is a vector field representing the right hand side of an ordinary difference equation, is a time step and
is the return status, that can be equal to either
Running,
Success, or
Failure.
Note: A task is a degenerate behavior tree with no parent and no child.

Behavior tree execution

The execution of a behavior tree is described by the following standard ordinary difference equations:
where represent the discrete time, and is the state space of the system modelled by the behavior tree.

Sequence composition

Two behavior trees and can be composed into a more complex behavior tree using a Sequence operator.
Then return status and the vector field associated with are defined as follows: