Biogeography-based optimization
Biogeography-based optimization is an evolutionary algorithm that optimizes a function by stochastically and iteratively improving candidate solutions with regard to a given measure of quality, or fitness function. BBO belongs to the class of metaheuristics since it includes many variations, and since it does not make any assumptions about the problem and can therefore be applied to a wide class of problems.
BBO is typically used to optimize multidimensional real-valued functions, but it does not use the gradient of the function, which means that it does not require the function to be differentiable as required by classic optimization methods such as gradient descent and quasi-newton methods. BBO can therefore be used on discontinuous functions.
BBO optimizes a problem by maintaining a population of candidate solutions, and creating new candidate solutions by combining existing ones according to a simple formula. In this way the objective function is treated as a black box that merely provides a measure of quality given a candidate solution, and the function's gradient is not needed.
Like many EAs, BBO was motivated by a natural process; in particular, BBO was motivated by biogeography, which is the study of the distribution of biological species through time and space. BBO was originally introduced by in 2008.
Underlying principles
Mathematical models of biogeography describe speciation, the migration of species between islands, and the extinction of species. Islands that are friendly to life are said to have a high habitat suitability index. Features that correlate with HSI include rainfall, vegetative diversity, topographic diversity, land area, temperature, and others. The features that determine are called suitability index variables. In terms of habitability, SIVs are the independent variables and HSI is the dependent variable.Islands with a high HSI can support many species, and islands with a low HSI can support only a few species. Islands with a high HSI have many species that emigrate to nearby habitats because of the large populations and the large numbers of species that they host. Note that emigration from an island with a high HSI does not occur because species want to leave their home; after all, their home island is an attractive place to live. Emigration occurs because of the accumulation of random effects on a large number of species with large populations. Emigration occurs as animals ride flotsam, swim, fly, or ride the wind to neighboring islands. When a species emigrates from an island, it does not mean that the species completely disappears from its original island; only a few representatives emigrate, so an emigrating species remains present on its original island while at the same time migrating to a neighboring island. However, in BBO it is assumed that emigration from an island results in extinction from that island. This assumption is necessary in BBO because species represent the independent variables of a function, and each island represents a candidate solution to a function optimization problem.
Islands with a high HSI not only have a high emigration rate, but they also have a low immigration rate because they already support many species. Species that migrate to such islands will tend to die in spite of the island's high HSI, because there is too much competition for resources from other species.
Islands with a low HSI have a high immigration rate because of their low populations. Again, this is not because species want to immigrate to such islands; after all, these islands are undesirable places to live. The reason that immigration occurs to these islands is because there is a lot of room for additional species. Whether or not the immigrating species can survive in its new home, and for how long, is another question. However, species diversity is correlated with HSI, so when more species arrive at a low HSI island, the island's HSI will tend to increase.
The figure on the right illustrates an island migration model. The immigration rate and the emigration rate are functions of the number of species on the island. The maximum possible immigration rate occurs when there are zero species on the island. As the number of species increases, the island becomes more crowded, fewer species are able to survive immigration, and the immigration rate decreases. The largest possible number of species that the habitat can support is, at which point the immigration rate is zero. If there are no species on the island, then the emigration rate is zero. As the number of species on the island increases, it becomes more crowded, more species representatives are able to leave the island, and the emigration rate increases. When the island contains the largest number of possible species, the emigration rate reaches its maximum possible value.
In BBO, is the probability that a given independent variable in the -th candidate solution will be replaced; that is, is the immigration probability of. If an independent variable is to be replaced, then the emigrating candidate solution is chosen with a probability that is proportional to the emigration probability. This is usually performed using roulette wheel selection.
for, where is the number of candidate solutions in the population.
Algorithm
Like most other EAs, BBO includes mutation. A basic BBO algorithm with a population size of for optimizing an -dimensional function can be described as follows.Initialize a population of candidate solutions
While not
For each, set emigration probability fitness of, do
with
For each, set immigration probability do
For each individual do
For each independent variable index do
Use to probabilistically decide whether to immigrate to
If immigrating then
Use to probabilistically select the emigrating individual
End if
Next independent variable index:
Probabilistically mutate
Next individual:
Next generation
Discussion of the BBO algorithm
- The population size is a tuning parameter. If is too small or too large, then the optimization performance of BBO will suffer. Typical implementations of BBO use a value of somewhere between 20 and 200.
- The initial population of candidate solutions is usually generated randomly. However, it could be generated in a problem-dependent way based on some reasonable guesses or previously-known good solutions to the optimization problem.
- The termination criterion is problem-dependent, like in any other EA. In most applications the termination criterion is a generation count limit or a function evaluation limit.
- is a temporary population so that all emigrating variables can originate from the population that is in place at the beginning of the generation, which is.
Algorithmic variations
- Elitism is implemented in most EAs to make sure that the best candidate solution is not lost from one generation to the next. This can be implemented in a variety of ways, but one common way is to save the best candidate solutions at the beginning of each generation in a set ; then replace the worst candidate solutions with at the end of the generation, after migration and mutation have completed. The size of is a tuning parameter, but typically includes the best two individuals. Elitism was originally proposed for genetic algorithms by DeJong. Elitism can make a significant difference in the performance of BBO, and is highly recommended.
- Duplicate replacement is often implemented in BBO. This is a procedure at the end of each generation that replaces duplicate individuals in the population. Scanning for duplicates can be computationally intensive because it is an operation, so it is often performed only every few generations, rather than every generation.
- Blending can be implemented in BBO. With blending, instead of replacing in an immigrating candidate solution with from the emigrating candidate solution, is set equal to a linear combination of its original value and :
- The BBO algorithm presented above is called partial immigration-based BBO because the immigrating candidate solution is selected before the emigrating candidate solution is selected, and migration for each independent variable in the immigrating candidate solution is performed independently of all other independent variables. Other approaches for selecting the immigrating and emigrating candidate solutions have also been proposed.
- The migration curves in the above figure are linear, but nonlinear migration curves often give better performance.
Hybridization
- BBO has been hybridized with several other EAs, including particle swarm optimization, differential evolution, evolution strategy, , case-based reasoning, artificial bee colony algorithm, bacterial foraging optimization, harmony search, and the simplex algorithm.
- BBO can be combined with local search to create a memetic algorithm that performs much better than BBO alone.
Software
MATLAB
- The following MATLAB code gives a BBO implementation for minimizing the 20-dimensional Rosenbrock function. Note that the following code is very basic, although it does include elitism. A serious BBO implementation should include some of the variations discussed above, such as duplicate replacement, blending, nonlinear migration, and local optimization.
function BBO
% Biogeography-based optimization to minimize a continuous function
% This program was tested with MATLAB R2012b
GenerationLimit = 50; % generation count limit
PopulationSize = 50; % population size
ProblemDimension = 20; % number of variables in each solution
MutationProbability = 0.04; % mutation probability per solution per independent variable
NumberOfElites = 2; % how many of the best solutions to keep from one generation to the next
MinDomain = -2.048; % lower bound of each element of the function domain
MaxDomain = +2.048; % upper bound of each element of the function domain
% Initialize the population
rng; % initialize the random number generator
x = zeros; % allocate memory for the population
for index = 1 : PopulationSize % randomly initialize the population
x = MinDomain + * rand;
end
Cost = RosenbrockCost; % compute the cost of each individual
= PopulationSort; % sort the population from best to worst
MinimumCost = zeros; % allocate memory
MinimumCost = Cost; % save the best cost at each generation in the MinimumCost array
disp;
z = zeros; % allocate memory for the temporary population
% Compute migration rates, assuming the population is sorted from most fit to least fit
mu = / ; % emigration rate
lambda = 1 - mu; % immigration rate
for Generation = 1 : GenerationLimit
% Save the best solutions and costs in the elite arrays
EliteSolutions = x;
EliteCosts = Cost;
% Use migration rates to decide how much information to share between solutions
for k = 1 : PopulationSize
% Probabilistic migration to the k-th solution
for j = 1 : ProblemDimension
if rand < lambda % Should we immigrate?
% Yes - Pick a solution from which to emigrate
RandomNum = rand * sum;
Select = mu;
SelectIndex = 1;
while &&
SelectIndex = SelectIndex + 1;
Select = Select + mu;
end
z = x; % this is the migration step
else
z = x; % no migration for this independent variable
end
end
end
% Mutation
for k = 1 : PopulationSize
for ParameterIndex = 1 : ProblemDimension
if rand < MutationProbability
z = MinDomain + * rand;
end
end
end
x = z; % replace the solutions with their new migrated and mutated versions
Cost = RosenbrockCost; % calculate cost
= PopulationSort; % sort the population and costs from best to worst
for k = 1 : NumberOfElites % replace the worst individuals with the previous generation's elites
x = EliteSolutions;
Cost = EliteCosts;
end
= PopulationSort; % sort the population and costs from best to worst
MinimumCost = Cost;
disp
end
% Wrap it up by displaying the best solution and by plotting the results
disp
close all
plot;
xlabel
ylabel
return
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
function = PopulationSort
% Sort the population and costs from best to worst
= sort;
x = x;
return
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
function = RosenbrockCost
% Compute the Rosenbrock function value of each element in x
NumberOfDimensions = size;
Cost = zeros; % allocate memory for the Cost array
for PopulationIndex = 1 : length
Cost = 0;
for i = 1 : NumberOfDimensions-1
Temp1 = x;
Temp2 = x;
Cost = Cost + 100 * ^2 + ^2;
end
end
return
R
- "bbo: Biogeography-Based Optimization" is an R package for continuous BBO.
Extensions
Moreover, a micro biogeography-inspired multi-objective optimization algorithm was implemented: it is suitable for solving multi-objective optimisations in the field of industrial design because it is based on a small number of islands, i.e. few objective function calls are required.
Mathematical analyses
BBO has been mathematically analyzed using Markov models and dynamic system models.Applications
Scholars have applied BBO into various academic and industrial applications. They found BBO performed better than state-of-the-art global optimization methods.For example, Wang et al. proved BBO performed equal performance with FSCABC but with simpler codes.
Yang et al. showed BBO was superior to GA, PSO, and ABC.