Computer-aided diagnosis
Computer-aided detection, also called computer-aided diagnosis, are systems that assist doctors in the interpretation of medical images. Imaging techniques in X-ray, MRI, and ultrasound diagnostics yield a great deal of information that the radiologist or other medical professional has to analyze and evaluate comprehensively in a short time. CAD systems process digital images for typical appearances and to highlight conspicuous sections, such as possible diseases, in order to offer input to support a decision taken by the professional.
CAD also has potential future applications in digital pathology with the advent of whole-slide imaging and machine learning algorithms. So far its application has been limited to quantifying immunostaining but is also being investigated for the standard H&E stain.
CAD is an interdisciplinary technology combining elements of artificial intelligence and computer vision with radiological and pathology image processing. A typical application is the detection of a tumor. For instance, some hospitals use CAD to support preventive medical check-ups in mammography, the detection of polyps in the colon, and lung cancer.
Computer-aided detection systems are usually confined to marking conspicuous structures and sections. Computer-aided diagnosis systems evaluate the conspicuous structures. For example, in mammography CAD highlights microcalcification clusters and hyperdense structures in the soft tissue. This allows the radiologist to draw conclusions about the condition of the pathology. Another application is CADq, which quantifies, e.g., the size of a tumor or the tumor's behavior in contrast medium uptake. Computer-aided simple triage is another type of CAD, which performs a fully automatic initial interpretation and triage of studies into some meaningful categories. CAST is particularly applicable in emergency diagnostic imaging, where a prompt diagnosis of critical, life-threatening condition is required.
Although CAD has been used in clinical environments for over 40 years, CAD usually does not substitute the doctor or other professional, but rather plays a supporting role. The professional is generally responsible for the final interpretation of a medical image. However, the goal of some CAD systems is to detect earliest signs of abnormality in patients that human professionals cannot, as in [|diabetic retinopathy], architectural distortion in mammograms, ground-glass nodules in thoracic CT, and non-polypoid lesions in CT colonography.
Topics
A Brief History
In the late 1950s, with the dawn of modern computers researchers in various fields started exploring the possibility of building computer-aided medical diagnostic systems. These first CAD systems used flow-charts, statistical pattern-matching, probability theory or knowledge bases to drive their decision making process.Since the early 1970s, some of the very early CAD systems in medicine, which were often referred as “expert systems” in medicine, were developed and used mainly for educational purposes. The MYCIN expert system, the Internist-I expert system and the CADUCEUS are some of such examples.
During the beginning of the early developments, the researchers were aiming at building entirely automated CAD / expert systems. The expectation of what computers can do was unrealistically optimistic among these scientists. However, after the breakthrough paper, “Reducibility among Combinatorial Problems” by Richard M. Karp, it became clear that there were limitations but also potential opportunities when one develops algorithms to solve groups of important computational problems.
As result of the new understanding of the various algorithmic limitations that Karp discovered in the early 1970s, researchers started realizing the serious limitations that CAD and expert systems in medicine have. The recognition of these limitations brought the investigators to develop new kinds of CAD systems by using advanced approaches. Thus, by the late 1980s and early 1990s the focus sifted in the use of data mining approaches for the purpose of using more advanced and flexible CAD systems.
In 1998, the first commercial CAD system for mammography, the ImageChecker system, was approved by the US Food and Drug Administration. In the following years several commercial CAD systems for analyzing mammography, breast MRI, medical imagining of lung, colon, and heart also received FAD approvals. Currently, CAD systems are used as a diagnostic aid to provide physicians for better medical decision-making.
Methodology
CAD is fundamentally based on highly complex pattern recognition. X-ray or other types of images are scanned for suspicious structures. Normally a few thousand images are required to optimize the algorithm. Digital image data are copied to a CAD server in a DICOM-format and are prepared and analyzed in several steps.1. Preprocessing for
- Reduction of artifacts
- Image noise reduction
- Leveling of image quality for clearing the image's different basic conditions e.g. different exposure parameter.
- Filtering
- Differentiation of different structures in the image, e.g. heart, lung, ribcage, blood vessels, possible round lesions
- Matching with anatomic databank
- Sample gray-values in volume of interest
Every detected region is analyzed individually for special characteristics:
- Compactness
- Form, size and location
- Reference to close by structures / ROIs
- Average greylevel value analyze within a ROI
- Proportion of greylevels to border of the structure inside the ROI
After the structure is analyzed, every ROI is evaluated individually for the probability of a TP. The following procedures are examples of classification algorithms.
- Nearest-Neighbor Rule
- Minimum distance classifier
- Cascade classifier
- Naive Bayesian Classifier
- Artificial Neural Network
- Radial basis function network
- Support Vector Machine
- Principle Component Analysis
Sensitivity and specificity
CAD systems seek to highlight suspicious structures. Today's CAD systems cannot detect 100% of pathological changes. The hit rate can be up to 90% depending on system and application.A correct hit is termed a True Positive, while the incorrect marking of healthy sections constitutes a False Positive. The less FPs indicated, the higher the specificity is. A low specificity reduces the acceptance of the CAD system because the user has to identify all of these wrong hits. The FP-rate in lung overview examinations could be reduced to 2 per examination. In other segments the FP-rate could be 25 or more. In CAST systems the FP rate must be extremely low to allow a meaningful study triage.
Absolute detection rate
The absolute detection rate of the radiologist is an alternative metric to sensitivity and specificity. Overall, results of clinical trials about sensitivity, specificity, and the absolute detection rate can vary markedly. Each study result depends on its basic conditions and has to be evaluated on those terms. The following facts have a strong influence:- Retrospective or prospective design
- Quality of the used images
- Condition of the x-ray examination
- Radiologist's experience and education
- Type of lesion
- Size of the considered lesion
Challenges that CAD in Medicine Faces Today
Some challenges are related to various algorithmic limitations in the procedures of a CAD system including input data collection, preprocessing, processing and system assessments. Algorithms are generally designed to select a single likely diagnosis, thus providing suboptimal results for patients with multiple, concurrent disorders. Today input data for CAD mostly come from electronic health records. Effective designing, implementing and analyzing for EHR is a major necessity on any CAD systems.
Due to the massive availability of data and the need to analyze such data, big data is also one of the biggest challenges that CAD systems face today. The increasingly vast amount of patient data is a serious problem. Often the patient data are complex and can be semi-structured or unstructured data. It requires highly developed approaches to store, retrieve and analyze them in reasonable time.
During the preprocessing stage, input data requires to be normalized. The normalization of input data includes noise reduction, and filtering. Processing may contain a few sub-steps depending on applications. Basic three sub-steps on medical imaging are segmentation, feature extraction / selection and classification. These sub-steps require advanced techniques to analyze input data with less computational time. Although much effort has been devoted on creating innovative techniques for these procedures of CAD systems, there is still not the single best algorithm for each step. Ongoing studies in building innovative algorithms for all the aspects of CAD systems is essential.
There is also a lack of standardized assessment measures for CAD Systems. This fact may cause the difficulty for obtaining FDA approval for commercial use. Moreover, while many positive developments of CAD systems have been proven, studies for validating their algorithms for clinical practice has hardly been confirmed.
Other challenges are related to the problem for healthcare providers to adopt new CAD systems in clinical practice. Some negative studies may discourage the use of CAD. In addition, the lack of training of health professionals on the use of CAD sometimes brings the incorrect interpretation of the system outcomes. These challenges are described in more detail in.
Applications
CAD is used in the diagnosis of breast cancer, lung cancer, colon cancer, prostate cancer, bone metastases, coronary artery disease, congenital heart defect, pathological brain detection, Alzheimer's disease, and diabetic retinopathy.Breast cancer
CAD is used in screening mammography. Screening mammography is used for the early detection of breast cancer. CAD systems are often utilized to help classify a tumor as malignant or benign. CAD is especially established in US and the Netherlands and is used in addition to human evaluation, usually by a radiologist. The first CAD system for mammography was developed in a research project at the University of Chicago. Today it is commercially offered by iCAD and Hologic. However, while achieving high sensitivities, CAD systems tend to have very low specificity and the benefits of using CAD remain uncertain. A 2008 systematic review on computer-aided detection in screening mammography concluded that CAD does not have a significant effect on cancer detection rate, but does undesirably increase recall rate. However, it noted considerable heterogeneity in the impact on recall rate across studies.Procedures to evaluate mammography based on magnetic resonance imaging exist too.
Lung cancer (bronchial carcinoma)
In the diagnosis of lung cancer, computed tomography with special three-dimensional CAD systems are established and considered as appropriate second opinions. At this a volumetric dataset with up to 3,000 single images is prepared and analyzed. Round lesions from 1 mm are detectable. Today all well-known vendors of medical systems offer corresponding solutions.Early detection of lung cancer is valuable. However, the random detection of lung cancer in the early stage in the X-ray image is difficult. Round lesions that vary from 5–10 mm are easily overlooked.
The routine application of CAD Chest Systems may help to detect small changes without initial suspicion. A number of researchers developed CAD systems for detection of lung nodules in chest radiography and CT, and CAD systems for diagnosis of lung nodules in CT. Virtual dual-energy imaging improved the performance of CAD systems in chest radiography.
Colon cancer
CAD is available for detection of colorectal polyps in the colon in CT colonography. Polyps are small growths that arise from the inner lining of the colon. CAD detects the polyps by identifying their characteristic "bump-like" shape. To avoid excessive false positives, CAD ignores the normal colon wall, including the haustral folds.Coronary artery disease
CAD is available for the automatic detection of significant coronary artery disease in coronary CT angiography studies.Congenital heart defect
Early detection of pathology can be the difference between life and death. CADe can be done by auscultation with a digital stethoscope and specialized software, also known as Computer-aided auscultation. Murmurs, irregular heart sounds, caused by blood flowing through a defective heart, can be detected with high sensitivity and specificity. Computer-aided auscultation is sensitive to external noise and bodily sounds and requires an almost silent environment to function accurately.Pathological brain detection (PBD)
Chaplot et al. was the first to use Discrete Wavelet Transform coefficients to detect pathological brains. Maitra and Chatterjee employed the Slantlet transform, which is an improved version of DWT. Their feature vector of each image is created by considering the magnitudes of Slantlet transform outputs corresponding to six spatial positions chosen according to a specific logic.In 2010, Wang and Wu presented a forward neural network based method to classify a given MR brain image as normal or abnormal. The parameters of FNN were optimized via adaptive chaotic particle swarm optimization. Results over 160 images showed that the classification accuracy was 98.75%.
In 2011, Wu and Wang proposed using DWT for feature extraction, PCA for feature reduction, and FNN with scaled chaotic artificial bee colony as classifier.
In 2013, Saritha et al. were the first to apply wavelet entropy to detect pathological brains. Saritha also suggested to use spider-web plots. Later, Zhang et al. proved removing spider-web plots did not influence the performance. Genetic pattern search method was applied to identify abnormal brain from normal controls. Its classification accuracy was reported as 95.188%. Das et al. proposed to use Ripplet transform. Zhang et al. proposed to use particle swarm optimization. Kalbkhani et al. suggested to use GARCH model.
In 2014, El-Dahshan et al. suggested to use pulse coupled neural network.
In 2015, Zhou et al. suggested to apply naive Bayes classifier to detect pathological brains.
Alzheimer's disease
CADs can be used to identify subjects with Alzheimer's and mild cognitive impairment from normal elder controls.In 2014, Padma et al. used combined wavelet statistical texture features to segment and classify AD benign and malignant tumor slices. Zhang et al. found kernel support vector machine decision tree had 80% classification accuracy, with an average computation time of 0.022s for each image classification.
In 2019, Signaevsky et al. have first reported a trained Fully Convolutional Network for detection and quantification of neurofibrillary tangles in Alzheimer's disease and an array of other tauopathies. The trained FCN achieved high precision and recall in naive digital whole slide image semantic segmentation, correctly identifying NFT objects using a SegNet model trained for 200 epochs. The FCN reached near-practical efficiency with average processing time of 45 min per WSI per Graphic Processing Unit, enabling reliable and reproducible large-scale detection of NFTs. The measured performance on test data of eight naive WSI across various tauopathies resulted in the recall, precision, and an F1 score of 0.92, 0.72, and 0.81, respectively.
Eigenbrain is a novel brain feature that can help to detect AD, based on Principal Component Analysis or Independent Component Analysis decomposition. Polynomial kernel SVM has been shown to achieve good accuracy. The polynomial KSVM performs better than linear SVM and RBF kernel SVM. Other approaches with decent results involve the use of texture analysis, morphological features, or high-order statistical features
Nuclear medicine
CADx is available for nuclear medicine images. Commercial CADx systems for the diagnosis of bone metastases in whole-body bone scans and coronary artery disease in myocardial perfusion images exist.With a high sensitivity and an acceptable false lesions detection rate, computer-aided automatic lesion detection system is demonstrated as useful and will probably in the future be able to help nuclear medicine physicians to identify possible bone lesions.
Diabetic retinopathy
Diabetic retinopathy is a disease of the retina that is diagnosed predominantly by fundoscopic images. Diabetic patients in industrialised countries generally undergo regular screening for the condition. Imaging is used to recognize early signs of abnormal retinal blood vessels. Manual analysis of these images can be time-consuming and unreliable. CAD has been employed to enhance the accuracy, sensitivity, and specificity of automated detection method. The use of some CAD systems to replace human graders can be safe and cost effective.Image pre-processing, and feature extraction and classification are two main stages of these CAD algorithms.
Pre-processing methods
Image normalization is minimizing the variation across the entire image. Intensity variations in areas between periphery and central macular region of the eye have been reported to cause inaccuracy of vessel segmentation. Based on the 2014 review, this technique was the most frequently used and appeared in 11 out of 40 recently published primary research.Histogram equalization is useful in enhancing contrast within an image. This technique is used to increase local contrast. At the end of the processing, areas that were dark in the input image would be brightened, greatly enhancing the contrast among the features present in the area. On the other hand, brighter areas in the input image would remain bright or be reduced in brightness to equalize with the other areas in the image. Besides vessel segmentation, other features related to diabetic retinopathy can be further separated by using this pre-processing technique. Microaneurysm and hemorrhages are red lesions, whereas exudates are yellow spots. Increasing contrast between these two groups allow better visualization of lesions on images. With this technique, 2014 review found that 10 out of the 14 recently published primary research.
Green channel filtering is another technique that is useful in differentiating lesions rather than vessels. This method is important because it provides the maximal contrast between diabetic retinopathy-related lesions. Microaneurysms and hemorrhages are red lesions that appear dark after application of green channel filtering. In contrast, exudates, which appear yellow in normal image, are transformed into bright white spots after green filtering. This technique is mostly used according to the 2014 review, with appearance in 27 out of 40 published articles in the past three years. In addition, green channel filtering can be used to detect center of optic disc in conjunction with double-windowing system.
Non-uniform illumination correction is a technique that adjusts for non-uniform illumination in fundoscopic image. Non-uniform illumination can be a potential error in automated detection of diabetic retinopathy because of changes in statistical characteristics of image. These changes can affect latter processing such as feature extraction and are not observable by humans. Correction of non-uniform illumination can be achieved by modifying the pixel intensity using known original pixel intensity, and average intensities of local and desired pixels . Walter-Klein transformation is then applied to achieve the uniform illumination. This technique is the least used pre-processing method in the review from 2014.
Morphological operations is the second least used pre-processing method in 2014 review. The main objective of this method is to provide contrast enhancement, especially darker regions compared to background.
Feature extractions and classifications
After pre-processing of funduscopic image, the image will be further analyzed using different computational methods. However, the current literature agreed that some methods are used more often than others during vessel segmentation analyses. These methods are SVM, multi-scale, vessel-tracking, region growing approach, and model-based approaches.Support vector machine is by far the most frequently used classifier in vessel segmentation, up to 90% of cases. SVM is a supervised learning model that belongs to the broader category of pattern recognition technique. The algorithm works by creating a largest gap between distinct samples in the data. The goal is to create the largest gap between these components that minimize the potential error in classification. In order to successfully segregate blood vessel information from the rest of the eye image, SVM algorithm creates support vectors that separate the blood vessel pixel from the rest of the image through a supervised environment. Detecting blood vessel from new images can be done through similar manner using support vectors. Combination with other pre-processing technique, such as green channel filtering, greatly improves the accuracy of detection of blood vessel abnormalities. Some beneficial properties of SVM include
- Flexibility – Highly flexible in terms of function
- Simplicity – Simple, especially with large datasets
Vessel tracking is the ability of the algorithm to detect "centerline" of vessels. These centerlines are maximal peak of vessel curvature. Centers of vessels can be found using directional information that is provided by Gaussian filter. Similar approaches that utilize the concept of centerline are the skeleton-based and differential geometry-based.
Region growing approach is a method of detecting neighboring pixels with similarities. A seed point is required for such method to start. Two elements are needed for this technique to work: similarity and spatial proximity. A neighboring pixel to the seed pixel with similar intensity is likely to be the same type and will be added to the growing region. One disadvantage of this technique is that it requires manual selection of seed point, which introduces bias and inconsistency in the algorithm. This technique is also being used in optic disc identification.
Model-based approaches employ representation to extract vessels from images. Three broad categories of model-based are known: deformable, parametric, and template matching. Deformable methods uses objects that will be deformed to fit the contours of the objects on the image. Parametric uses geometric parameters such as tubular, cylinder, or ellipsoid representation of blood vessels. Classical snake contour in combination with blood vessel topological information can also be used as a model-based approach. Lastly, template matching is the usage of a template, fitted by stochastic deformation process using Hidden Markov Mode 1.