Reverse Monte Carlo


The Reverse Monte Carlo modelling method is a variation of the standard Metropolis-Hastings algorithm to solve an inverse problem whereby a model is adjusted until its parameters have the greatest consistency with experimental data. Inverse problems are found in many branches of science and mathematics, but this approach is probably best known for its applications in condensed matter physics and solid state chemistry.

Applications in condensed matter sciences

Basic method

This method is often used in condensed matter sciences to produce atom-based structural models that are consistent with experimental data and subject to a set of constraints.
An initial configuration is constructed by placing atoms in a periodic boundary cell, and one or more measurable quantities are calculated based on the current configuration. Commonly used data include the pair distribution function and its Fourier transform, the latter of which is derived directly from neutron or x-ray scattering data. Other data that are used included Bragg diffraction data for crystalline materials, and EXAFS data. The comparison with experiment is quantified using a function of the form
where and are the observed and calculated quantities respectively, and is a measure of the accuracy of the measurement. The sum is over all independent measurements, which will include the sum over all points in a function such as the pair distribution function.
An iterative procedure is run where one randomly chosen atom is moved a random amount, followed by a new calculation of the measurable quantities. Such a process will cause to either increase or decrease in value by an amount. The move is accepted with the probability according to the normal Metropolis-Hastings algorithm, ensuring that moves that give better agreement with experimental data are accepted, and moves that worsen agreement with experimental data can be accepted to a greater or lesser extent corresponding to how much the agreement has worsened. Moreover, the move may also be rejected if it breaks certain constraints, even if the agreement with data is improved. An example would be to reject a move which brings two atoms closer than a preset limit, to prevent overlap or collision between the two atoms.
Following the acceptance/rejection test, the procedure is repeated. As the number of accepted atom moves increases, the calculated quantities will become closer to the experimental values until they reach an equilibrium state. From then onward the RMC algorithm will simply generate a small oscillation in the value of. The resulting atomic configuration should be a structure that is consistent with the experimental data within its errors.

Applications

The RMC method for condensed matter problems was initially developed by McGreevy and Pusztai in 1988, with application to liquid argon. For several years the primary application was for liquids and amorphous materials, particularly because this provides the only means to obtain structural models from data, whereas crystallography has analysis methods for both single crystal and powder diffraction data. More recently, it has become clear that RMC can provide important information for disordered crystalline materials also.

Issues with the RMC method

The RMC method suffers from a number of potential problems. The most notable problem is that often more than one qualitatively different model will give similar agreement with experimental data. For example, in the case of amorphous silicon, the integral of the first peak in the pair distribution function may imply an average atomic coordination number of 4. This might reflect the fact that all atoms have coordination number of 4, but similarly having half the atoms with coordination number of 3 and half with 5 will also be consistent with this data. Unless a constraint on the coordination number is employed, the RMC method will have no means of generating a unique coordination number and most likely a spread of coordination numbers will result. Using amorphous silicon as an example, Biswas, Atta-Fynn and Drabold were the first to elucidate the importance of including constraints in RMC modeling. Since the RMC method follows the normal rules of statistical mechanics, its final solution will be the one with the highest degree of disorder possible. A second problem comes from the fact that without constraints the RMC method will typically have more variables than observables. One result from this will be that the final atomic configuration may have artifacts that arise from the method attempting to fit noise in the data.
One should remark, however, that most applications of the RMC approach today take account of these problems by appropriate use of implicit or explicit constraints.

Implementations of the RMC method

There are four publicly available implementations of the RMC method.

fullrmc

FUndamental Library Language for Reverse Monte Carlo or fullrmc is a multicore RMC modeling package. fullrmc is a fully object-oriented python interfaced package where every definition can be overloaded allowing easy development, implementation and maintenance of the code. fullrmc's computation blocks and modules are optimized written in cython/C. fullrmc is not a standard RMC package but it is rather unique in its approach to solving an atomic or molecular structure. fullrmc supports atomic and molecular systems, all types of periodic boundary conditions systems as well as the so-called infinite boundary conditions to model nanoparticles or isolated systems. fullrmc's Engine is defined and used to launch a RMC calculation. By definition, Engine reads only Protein Data Bank atomic configuration files and handles other definitions and attributes. In fullrmc atoms can be grouped into rigid bodies or semi-rigid bodies called groups so the system can evolve atomically, clusterly, molecularly or any combination of those. Every group can be assigned a different and customizable move generator. Groups selection by the fitting engine can also be customizable. Also fullrmc uses Artificial intelligence and Reinforcement learning algorithms to improve the ratio of accepted moves.

RMCProfile

RMCProfile is a significantly developed version of the original RMC code written by McGreevy and Puszta. It is written in Fortran 95 with some Fortran 2003 features. It has maintained the ability to model liquids and amorphous materials using the pair distribution function, total scattering and EXAFS data, but also includes the capability of modelling crystalline materials by explicitly using the information contained within the Bragg diffraction data. RMCProfile gives users a range of constraints, including the inclusion of molecular potentials and distance windows, which exploit possibilities afforded by the lack of significant diffusion in crystalline materials. RMCProfile allows simulation of magnetic materials, using the magnetic component of total scattering data, and also allows simulation of materials where atoms are allowed to swap positions.

RMC++

RMC++ a rewritten, C++ version of the original RMC code developed by McGreevy and Pusztain. RMC++ is designed specifically for the study of liquids and amorphous materials, using pair distribution function, total scattering and EXAFS data.

HRMC

Hybrid Reverse Monte Carlo is a code capable of fitting both the pair correlation function and structure factor along with bond angle and coordination distributions. Unique to this code is the implementation of a number of empirical interatomic potentials for carbon, silicon and germanium. This allows the code to fit experimental data along with minimizing the total system energy.