Alzheimer’s Disease (AD), the most common form of dementia, is an incurable neurological condition that results in a progressive mental deterioration. Although definitive diagnosis of AD is difficult, in practice, AD diagnosis is largely based on clinical history and neuropsychological data including magnetic resource imaging (MRI). Increasing research has been reported on applying machine learning to AD recognition in recent years. This paper presents our latest contribution to the advance. It describes an automatic AD recognition algorithm that is based on deep learning on 3D brain MRI. The algorithm uses a convolutional neural network (CNN) to fulfil AD recognition. It is unique in that the three dimensional topology of brain is considered as a whole in AD recognition, resulting in an accurate recognition. The CNN used in this study consists of three consecutive groups of processing layers, two fully connected layers and a classification layer. In the structure, every one of the three groups is made up of three layers, including a convolutional layer, a pooling layer and a normalization layer. The algorithm was trained and tested using the MRI data from Alzheimer’s Disease Neuroimaging Initiative. The data used include the MRI scanning of about 47 AD patients and 34 normal controls. The experiment had shown that the proposed algorithm delivered a high AD recognition accuracy with a sensitivity of 1 and a specificity of 0.93.
Dementia describes a collection of symptoms that are caused by disorders affecting the brain. There are more than 46.8 million people worldwide with dementia today and 131.5 million predicted by 2050 [
Deep learning (DL) simulates the hierarchical structure of human brain, processes data from lower levels to higher levels, and gradually composes more and more semantic concepts. As a new machine learning paradigm, deep learning has been increasingly explored in the development of technology for big data and artificial intelligence [
This paper introduces an automatic AD recognition algorithm that is based on deep learning on 3D brain MRI. In Section 2, we describe the algorithm in detail, including its major steps. In Section 3, we describe the results of the experiments of applying the proposed algorithm on real MRI data, and the related performance discussion. Finally, in Section 4, we conclude by summarising the approach and pointing out possible future pursuits in the area.
The proposed AD recognition system can be completed in two processes including system training and practical testing. The training mainly builds the system based on designing and training data. The test will give recognition result based on current input data. In particular, the recognition algorithm starts from using MRI data as input, pre-processes the data to increase recognition accuracy, then completes the recognition using convolutional neural networks. The following subsections describe these steps.
The proposed system is built and tested with the MRI data acquired from Alzheimer’s Disease Neuroimaging Initiative (ADNI) database [
The data used include the MRI scans of about 47 AD patients and 34 normal controls (NC) from ADNI1. They are all from different subjects, making the recognition performance test more reliable. The scans are T1-weighted, acquired according to the ADNI acquisition protocol. They are 3D magnetization-pre- pared rapid gradient echo (MP-RAGE) sequence. The sequence was undergone a series of pre-processing including Gradwarp, B1 non-uniformity, N3, and scale.
Gradwarp is a system specific correction of image geometry distortion due to gradient non-linearity. B1 non-uniformity correction employs B1 calibration scans to correct the image intensity non-uniformity that results when RF transmission is performed with a more uniform body coil while reception is performed with a less uniform head coil. N3 is a histogram peak sharpening algorithm which is applied to all images. It is applied after grad warp and after B1 correction. N3 will reduce intensity non-uniformity due to the wave or the dielectric effect. In scaling, the image is scaled based on the spatial scale factors that are collected from a local phantom.
The algorithm uses a convolutional neural network (CNN) to fulfil AD recognition. The CNN architecture used in this study consists of three convolutional layers, of each is followed by normalization and spatial max-pooling [
pooling reduces the number of free parameters and introduces small spatial invariance in the network. The fully connected layers followed by a sigmoid logistic regression which outputs a score ranging between 0 and 1, indicating the probability of the slice to be AD. All network parameters are randomly initialized according to a normal distribution with variance 0.02. The network was trained using stochastic gradient descent with learning rate0.001. The cost function was defined as follows:
C ( l , s ) = − ∑ i = 1 B l i log log s i + ( 1 − l i ) log log ( 1 − s i ) (1)
where s is the assigned pixel probability score, l is the reference pixel label and is the size of mini-batch.
The original MRI data had different sizes: 256 × 256 × 166 and 192 × 192 × 160 . To standardized the data, we reform all data into 56 × 56 × 56 . The MRI data consists of many slices. The starting and ending slices do not contain brain tissues. Therefore, we only extracted middle cross-section for AD detection [
The MRI data do not have rotational invariance. Therefore, randomly rotation and mirror cannot be used for data augmentation. We generate additional training data using random zooming in and out and cropping. This is especially important as there are too few examples in the training set, and unbalanced number of AD and NC cases.
We randomly selected 66% data for training and 33% for testing. Details were given in
When testing the proposed CNN, we also extracted seven middle cross-sections as patches. Since not all patches are abnormal in AD cases, only when all seven patches were classified as NC, we considered the data as NC. If one or more than one of the patches was/were classified as AD, we considered the data as AD.
Two performance metrics were measured: sensitivity and specificity. Let TP, TN,
Data | Number of data | Number of Patch | ||
---|---|---|---|---|
Training | Validation | Training | Validation | |
AD | 33 | 16 | 2300 | 1000 |
NC | 20 | 10 | 2300 | 1000 |
FP and FN represents true positive, true negative, false positive and false negative respectively. Sensitivity is defined as
s e n s i t i v i t y = T P T P + F N
Specificity is defined as
s p e c i f i c i t y = T N T N + F P
According to the definition, the sensitivity reflects how many AD cases were detected accurately. Higher the sensitivity, less AD cases were missing. The specificity reflects how many NC cases were detected accurately. Higher the specificity, less normal cases were misrecognized as AD. Using these two metrics together, the performance of the proposed method can be evaluated comprehensively.
The AD recognition performance of the proposed method was tested on ADNI1 data. The testing data consist of 17 AD cases and 16 NC cases. 119 and 112 patches were extracted for AD and NC cases, respectively. 37 AD patches were classified as NC and 2 NC patches were classified as AD. The sensitivity at patch level is 0.69 and specificity is 0.98. When using the AD detection rule which were mentioned in testing section, the case level sensitivity is 1 and specificity is 0.93. The details were given in
This paper describes an automatic AD recognition algorithm that is based on deep learning on 3D brain MRI. The algorithm uses a convolutional neural network (CNN) to fulfil AD recognition. It is unique in that the 3D topology of brain is considered as a whole in AD recognition, resulting in an accurate recognition. The CNN used in this study consists of three consecutive groups of processing layers, two fully connected layers and a classification layer. In the structure, every one of the three groups is made up of three layers, including a convolutional layer, a pooling layer and a normalization layer. The algorithm was trained and tested using the MRI data from Alzheimer’s Disease Neuroimaging Initiative. The data used include the MRI scanning of about 47 AD patients and 34 normal controls. The experiment had shown that the proposed algorithm
Number of patches | Number of patches | Sensitivity | Specificity | |
---|---|---|---|---|
AD (GT) | 82 | 37 | 0.69 | 0.98 |
NC (GT) | 6 | 96 | ||
AD (CNN) | NC (CNN) |
Number of patches | Number of patches | Sensitivity | Specificity | |
---|---|---|---|---|
AD (GT) | 17 | 0 | 1 | 0.93 |
NC (GT) | 3 | 13 | ||
AD (CNN) | NC (CNN) |
delivered a high AD recognition accuracy with a sensitivity of 1and a specificity of 0.93. Future study will include searching for more efficient data processing and CNN structure, and testing on more cases.
Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Ho mann-La Roche Ltd and it’s a liated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer's Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.
Luo, S.H., Li, X.C. and Li, J.M. (2017) Automatic Alzheimer’s Disease Recognition from MRI Data Using Deep Learning Method. Journal of Applied Mathematics and Physics, 5, 1892-1898. https://doi.org/10.4236/jamp.2017.59159