Process mining is a family of techniques in the field of process management that support the analysis of business processes based on event logs. During process mining, specialized data mining algorithms are applied to event log data in order to identify trends, patterns and details contained in event logs recorded by an information system. Process mining aims to improve process efficiency and understanding of processes. It is critical to understand processes because as-is processes usually differ from the desired ones, and companies need to have data-driven evidence to fix for improving their processes. Process mining is also known as Automated Business Process Discovery. However, in academic literature the term Automated Business Process Discovery is used in a narrower sense to refer specifically to techniques that take as input an event log and produce as output a business process model. The term Process Mining is used in a broader setting to refer not only to techniques for discovering process models, but also techniques for business process conformance and performance analysis based on event logs.
Overview
Process mining techniques are often used when no formal description of the process can be obtained by other approaches, or when the quality of existing documentation is questionable. Per Gartner, Process Mining is a subset of hyperautomation. For example, application of process mining methodology to the audit trails of a workflow management system, the transaction logs of an enterprise resource planning system, or the electronic patient records in a hospital can result in models describing processes, organizations, and products. Event log analysis can also be used to compare event logs with prior model to understand whether the observations conform to a prescriptive or descriptive model. It is required that the event logs data be linked to a case ID, activities, and timestamps. Contemporary management trends such as BAM, BOM, and BPI illustrate the interest in supporting diagnosis functionality in the context of Business Process Management technology.
Application
Process mining follows the options established in business process engineering, then goes beyond those options by providing feedback for business process modeling:
process analysis filters, orders and compresses logfiles for further insight into the connex of process operations.
process design may be supported by feedback from process monitoring
process enactment uses results from process mining based on logging for triggering further process operations
A list all the major process mining initiatives.
Classification
There are three classes of process mining techniques. This classification is based on whether there is a prior model and, if so, how the prior model is used during process mining.
Discovery: Previous models do not exist. Based on an event log, a new model is constructed or discovered based on low-level events. For example, using the alpha algorithm. Many established techniques exist for automatically constructing process models based on an event log. Recently, process mining research has started targeting the other perspectives. One example is the technique described in, which can be used to construct a social network.
Conformance checking: Used when there is an a priori model. The existing model is compared with the process event log; discrepancies between the log and the model are analyzed. For example, there may be a process model indicating that purchase orders of more than 1 million Euro require two checks. Another example is the checking of the so-called "four-eyes" principle. Conformance checking may be used to detect deviations to enrich the model. An example is the extension of a process model with performance data, i.e., some a priori process model is used to project the potential bottlenecks. Another example is the decision miner described in which takes an a priori process model and analyzes every choice in the process model. For each choice the event log is consulted to see which information is typically available the moment the choice is made. Then classical data mining techniques are used to see which data elements influence the choice. As a result, a decision tree is generated for each choice in the process.
Performance Mining: Used when there is an a priori model. The model is extended with a new performance information such as processing times, cycle times, waiting times, costs, etc., so that the goal is not to check conformance, but rather to improve the performance of the existing model with respect to certain process performance measures. An example is the extension of a process model with performance data, i.e., some prior process model dynamically annotated with performance data.