Hire a web Developer and Designer to upgrade and boost your online presence with cutting edge Technologies

Thursday 20 April 2023

A convolutional neural network method based on Adam optimizer with power-exponential learning rate for bearing fault diagnosis

 A CONVOLUTIONAL NEURAL NETWORK METHOD BASED ON A DAM OPTIMIZER WITH POWER- EXPONENTIAL LEARNING RATE FOR BEARING

FAULT DIAGNOSIS . Y OUMING WANG, Z HAO X IAO, G ONGQING CAO
ISSN PRINT 1392-8716, ISSN O NLINE 2538-8460, K AUNAS, L ITHUANIA 669
where 𝜃, 𝜃௧ାଵ is objective function, 𝑡 is time parameter, 𝛽, 𝛽 [0,1) represents the decay rate
of the moving mean index, 𝜂 is learning rate, 𝜀 is a constant parameter, 𝜀 = 0.9999, 𝑚 and 𝑣 is
the first-order and second-order moment estimation after the gradient modification respectively.
If 𝑚 and 𝑣 are initialized to zero vector, they will be offset to zero. It is necessary to correct the
deviation [42], 𝑚 and 𝑣 will be corrected as:
𝑚 = 𝑚
1 − 𝛽
, 𝑣 = 𝑣
1 − 𝛽
. (4)
In the original Adam optimizer algorithm, the first-order moment to non central second-order
moment estimation is modified, and the offset is reduced. However, in the process of rolling
bearing fault diagnosis and classification, the algorithm has poor effect in fitting the convergence
state of the model. A correction factor was added to the learning rate to address the shortcomings
of the original Adam optimizer algorithm. The power-exponential learning rate of the downward
trend is used as the basis, and the gradient value of the previous stage is used to adjust it to meet
the requirements of adaptive adjustment, so as to change the convergence performance of the
network model. The model for power-exponential learning rate is:
𝜂 = 𝜂 𝑚ି , (5)
where 𝜂 is initial learning rate, 𝜂 = 0.1, 𝑘 is a hyperparameter, 𝑘 = 0.8, 𝑚 is the iterative
intermediate, 𝑚 is determined by the number of iterations and the maximum number of iterations
is defined as follows:
𝑚 = 1 + 𝑡
𝑅, (6)
where 𝑡 is iteration number, 𝑅 is the maximum number of iterations. When Eq. (6) is combined
with Eq. (5), the form of learning rate update is:
𝜂ሺ𝑡) = 𝜂 1 + 𝑡
𝑅൨ି

. (7)
The pseudo code of the improved Adam optimization algorithm is shown in Table 1.
Table 1. Improved Adam optimization algorithm
Algorithm: Adam with power-exponential learning rate
Require: 𝜂 = 0.1, 𝛽 = 0.9, 𝛽 = 0.999, 𝜀 = 0.9999, 𝑞 = 0.8
Require: Initialize time step 𝑡, parameter 𝜃, first/second moment estimation 𝑚 , 𝑣
while stopping criterion is not met do
Update first/second moment
𝑚 ← 𝛽 𝑚 ௧ିଵ + (1 − 𝛽)𝑔, 𝑣 ← 𝛽 𝑣௧ିଵ + (1 − 𝛽)𝑔

Moment correction:
𝑚 ← 𝑚 /1 − 𝛽
, 𝑣 ← 𝑣/1 − 𝛽

Power-exponential learning rate
𝜂 ← 𝜂 𝑚ି , 𝑚 ← 1 +

𝜂(𝑡) ← 𝜂 1 +
ቃି
Update parameters: 𝜃 ← 𝜃௧ିଵ − 𝜂 1 +
ቃି
ඥ௩ ାఌ
end while
Return optimized parameters 𝜃
A CONVOLUTIONAL NEURAL NETWORK METHOD BASED ON A DAM OPTIMIZER WITH POWER- EXPONENTIAL LEARNING RATE FOR BEARING
FAULT DIAGNOSIS . Y OUMING WANG, Z HAO X IAO, G ONGQING CAO
670 J OURNAL OF V IBROENGINEERING. J UNE 2022, V OLUME 24, I SSUE 4
2.2. Convolutional neural network with improved Adam algorithm and identity mapping
To solve the problems of degradation and gradient disappearance of traditional CNN, the
identity mapping is embedded between the layers [31]. As shown in Fig. 1, the module is
composed of several convolution layers and a shortcut connection channel. Identity mapping 𝑥 is
added through shortcut connection channel. ReLU is used as the activation function to alleviate
the problem of gradient disappearance caused by network deepening. Shortcut connection channel
is the difference between identity mapping and ordinary CNN, which enables the data calculated
from the shallow convolution layer to reach the deep convolution layer directly. This module
alleviates the vanishing gradient problem caused by increasing the number of convolutional layers
and improves the training accuracy of multi-convolutional layer CNNs.
Convolutional
layer
Convolutional
layer
ReLU

ReLU
Inputx
F(x)
OutputH(x)
x
identity
Fig. 1. Identity mapping module
The output of 𝐻(𝑥) is converted to 𝐹(𝑥) + 𝑥, that is:
𝐻(𝑥) = 𝐹(𝑥) + 𝑥, (8)
where 𝑥 is the input, 𝐻(𝑥) is the desired output of the underlying mapping, 𝐹(𝑥) is the residual
mapping. The optimal learning of the network aims to make 𝐹(𝑥) tend to 0. Arbitrary L-layer
features of deep neural network can be obtained by recursion:
𝐻(𝑥) = 𝐹(𝑥) +
௅ିଵ
௜ୀଵ
𝑥, (9)
where 𝐻(𝑥) is the output of the lth layer. Eq. (9) is substituted in back propagation:
𝜕𝐿𝑜𝑠𝑠
𝜕𝑥
= 𝜕𝐿𝑜𝑠𝑠
𝜕𝑥
1 + 𝜕
𝜕𝑥
𝐹(𝑥)
௅ିଵ
௜ୀଵ
. (10)
It should be noted from Eq. (10) that the problem of gradient vanishing will not occur even if
the weight of the intermediate layer matrix is small.
The CNN model based on identity mapping built in this paper is shown in Fig. 2. The model
consists of 10 convolution layers, 1 maximum pooling layer and 1 full connection layer. After the
full connection layer, the improved Adam optimizer is used to update and calculate the network
parameters which affect the model training and output to make them approximate or reach the
optimal value. Finally, the data is passed through the softmax classifier and the corresponding
classification results are output.
A CONVOLUTIONAL NEURAL NETWORK METHOD BASED ON A DAM OPTIMIZER WITH POWER- EXPONENTIAL LEARNING RATE FOR BEARING
FAULT DIAGNOSIS . Y OUMING WANG, Z HAO X IAO, G ONGQING CAO
ISSN PRINT 1392-8716, ISSN O NLINE 2538-8460, K AUNAS, L ITHUANIA 671
32*32*64 16*16*256
(3,3,1,64)
Conv1 Conv2
0
1

9
(3,3,2,64)(3,3,1,128)(3,3,1,256)
Pooling
layer
Full connection
layer
Softmax
Fig. 2. Structure of CNN model based on identity mapping
Eight convolution layers with identity mapping are embedded in the network. In the involved
convolution layer, different step lengths and output channels are used. The size of the convolution
kernel is all 3×3. In the fourth convolution layer, the step length is 2, the size of the convolution
kernel remains unchanged, which can reduce the translation times of the convolution kernel and
the amount of calculation in the network model. The structural parameter of convolution layers
with identity mapping is shown in Table 2. In addition, BN layer can not only reduce the number
of training steps and accelerate the convergence on the premise of reaching the same accuracy,
but also reduce the disappearance of gradient and improve the generalization ability. When the
input and output of the shortcut connection have different numbers of channels, zero filling is used
to match the number of channels. In order to extract significant bearing fault features and improve
the network training efficiency, the maximum pool is selected. The pooled window size is 2×2.
Table 2. Structural parameters of eight convolution layers with identity mapping
Identity mapping convolution layer Convolution layer parameters
Conv1_1 Conv(3,3,1,64)
Conv1_2 Conv(3,3,1,64)
Conv2_1 Conv(3,3,1,64)
Conv2_2 Conv(3,3,2,64)
Conv3_1 Conv(3,3,1,128)
Conv3_2 Conv(3,3,1,128)
Conv4_1 Conv(3,3,1,256)
Conv4_2 Conv(3,3,1,256)
3. Experimental results
3.1. Data acquisition
The datasets of the experiments conducted in MaFaulDa [43] and Case Western Reserve
University (CWRU) [44] are used to verify the effectiveness of the proposed method. MaFaulDa
bearing test bench is monitored by two different sets of equipment, including three industrial
sensors, 601A01 accelerometer, a tachometer and a microphone. Three defective bearings,
including outer ring failure, inner ring failure, and rolling element failure were used in the
experiments. The experiment parameters of MaFaulDa bearing monitoring are shown in Table 3.
The rolling bearing test platform of CWRU is shown in Fig. 3. An acceleration sensor with a
frequency of 12 khz is used to collect CWRU bearing fault data at the driving end. The
experimental platform includes three fault types of inner ring, outer ring and rolling element faults.
3.2. Data preprocessing
For MaFaulDa bearing dataset, 10 types of fault diagnosis signals are selected including no
fault, the outer ring fault, inner ring fault, and rolling element fault under loads of 6 g, 20 g and
35 g. The fault categories are labeled 0-9. For the collected sample vibration signals, of which
80 % are used as the training set and 20 % as the test set. Each sample contains 1024 data points,
A CONVOLUTIONAL NEURAL NETWORK METHOD BASED ON A DAM OPTIMIZER WITH POWER- EXPONENTIAL LEARNING RATE FOR BEARING
FAULT DIAGNOSIS . Y OUMING WANG, Z HAO X IAO, G ONGQING CAO
ISSN PRINT 1392-8716, ISSN O NLINE 2538-8460, K AUNAS, L ITHUANIA 673
by the improved Adam and identity mapping is improved by 3.05 % on the basis of the CNN
method with identity mapping.
Table 4. 5-fold cross-validation diagnosis results of rolling bearing
fault monitoring with different models (%)
Experiments times
Fault diagnosis model 1 2 3 4 5
Proposed method
CNN with identity mapping
One LeNet-5
LSTM
99.71 98.69 99.37 99.14 98.25
96.44 95.31 96.25 95.94 96.07
86.92 89.44 86.83 86.11 84.27
76.92 79.65 82.63 80.81 74.36
Proposed method
CNN with identity mapping
Two LeNet-5
LSTM
97.68 99.87 98.34 98.36 98.70
94.17 94.62 96.92 95.09 96.11
86.92 89.44 86.83 86.11 84.27
77.42 76.35 81.03 77.68 74.09
Proposed method
CNN with identity mapping
Three LeNet-5
LSTM
98.25 99.43 99.62 99.46 98.02
96.39 95.28 96.37 95.90 96.15
87.97 86.54 88.21 86.91 84.25
79.85 80.31 76.55 73.19 72.06
The error matrix is an index to evaluate the classification accuracy of the algorithm. Each
column of the error matrix represents the prediction category, the value of each column represents
the accuracy of the data predicted for that category. Each row represents the real belonging
category of the data, and the value of each row represents the accuracy of the data classification
diagnosis of the category. In order to study the performance of the CNN optimized by the
improved Adam, the TensorFlow framework is used to import the sklearn and seaborn function
libraries under the combination of test targets and actual bearing combination fault classification,
the error matrix is drawn by the heatmap. Set the heatmap visualization through the Cmap
parameter to the greater the probability value, the darker the color. Fig. 4 shows the classification
error matrix of 10 selected bearing fault diagnosis signals, including outer ring fault, inner ring
fault and rolling element fault under normal operation state and 6 g, 20 g and 35 g load.
From Fig. 4(a) and (b), the average fault diagnosis recognition rate can reach 99.3 % and
94.5 % respectively by embedding the identity mapping. The method which using improved Adam
optimizer has better diagnostic effect. From Fig. 4(c), LeNet-5 with two groups of convolution
layers has poor diagnosis effect for outer ring fault under 6 g and 35 g loads and rolling element
fault under 20 g load. The average diagnosis accuracy is 86.4 %, which is higher than 74 % of
LSTM network. Compared with other models, on the one hand, the proposed model reduces the
number of model parameters while extracting high-dimensional features after embedding identity
mapping. On the other hand, Adam optimizer with power-exponential learning rate changes the
convergence performance of the network, so that the model has stronger recognition and diagnosis
ability for MaFaulDa bearing dataset.
3.4. CWRU bearing dataset fault diagnosis result analysis
To more intuitively illustrate the adaptive feature learning capability of the CNN optimized by
the improved Adam and identity mapping, t-SNE algorithm [46] is used to visualize the effect
characteristics of fault classification. t-SNE is a nonlinear dimensionality reduction algorithm for
studying high-dimensional data, which maps high-dimensional data to two-dimensional or
multi-dimensional suitable for observation. It constructs a probability distribution among
high-dimensional objects, so that similar objects have a higher probability of being selected, and
different objects have a lower probability of being selected. The test samples are input into the
CNN trained by the improved Adam and identity mapping, the distribution of fault classification
features is shown in Fig. 5.
A CONVOLUTIONAL NEURAL NETWORK METHOD BASED ON A DAM OPTIMIZER WITH POWER- EXPONENTIAL LEARNING RATE FOR BEARING
FAULT DIAGNOSIS . Y OUMING WANG, Z HAO X IAO, G ONGQING CAO
674 J OURNAL OF V IBROENGINEERING. J UNE 2022, V OLUME 24, I SSUE 4
a) Proposed method b) CNN with identity mapping
c) LeNet-5 d) LSTM
Fig. 4. Error matrix of fault diagnosis results
Each color in Fig. 5 represents a fault type, including three different specifications of inner
ring, outer ring, rolling element fault and normal state. The features learned by the model for each
fault are highly separated. Deep learning is characterized by learning features through multiple
layers of convolution, which can lead to a loss of detail. Therefore, classification accuracy can be
improved by adding shortcut connection that combine local information from the first
convolutional layer with global information from the final convolutional layer. The visualization
result shows that the CNN optimized by the improved Adam and identity mapping can learn
different fault features from bearing vibration data with good fault classification ability.
The common evaluation indexes of bearing fault diagnosis include F1 score, accuracy rate,
precision rate and recall rate. The level of evaluation index directly affects the diagnostic ability
and comprehensive performance of the model. In the case study of rolling bearing fault diagnosis,
the same dataset is selected. The traditional SVM and BPNN methods [47], lenet-5 and LSTM
methods use the same experimental environment as the proposed method, the experimental results
are obtained by constructing the corresponding network structure. The experimental results of
bearing fault monitoring and diagnosis are shown in Table 5. The diagnosis accuracy of CNN
method optimized by the improved Adam and identity mapping is 98.53 %, which has higher
diagnostic recognition rate. The average accuracy of SVM and BPNN are 83.77 % and 77.46 %
respectively, both of which limit the data processing ability of the network and have poor fault
diagnosis results. Traditional deep learning methods such as LeNet-5 suffer from the gradient
vanishing problem. The identity mapping in our proposed method allows the deep layer to learn
data directly from the shallow layer, which alleviates the gradient disappearance and overfitting
problems associated with increasing network depth, extracts high-level abstract features and
significantly improves the accuracy of bearing faults.
A CONVOLUTIONAL NEURAL NETWORK METHOD BASED ON A DAM OPTIMIZER WITH POWER- EXPONENTIAL LEARNING RATE FOR BEARING
FAULT DIAGNOSIS . Y OUMING WANG, Z HAO X IAO, G ONGQING CAO
ISSN PRINT 1392-8716, ISSN O NLINE 2538-8460, K AUNAS, L ITHUANIA 675
Fig. 5. Distribution of fault classification features
Table 5. Fault monitoring and diagnosis results of rolling bearing with different models (%)
Fault diagnosis model Accuracy
rate
Precision
rate
Recall
rate
F1
score
Deep learning
method
Proposed method 98.53 98.38 98.02 98.19
LeNet-5 94.41 93.26 93.14 93.19
LSTM 95.55 94.61 94.95 94.77
Traditional
method
SVM + EMD + Hilbert 83.77
BPNN + EMD + Hilbert 77.46
4. Conclusions
In this paper, a new CNN model with 8 convolution layers based on identity mapping and
Adam optimizer is proposed to solve the problems of rolling bearing fault diagnosis. By
embedding identity mapping, the data calculated from the shallow convolution layer can directly
reach the deep convolution layer without adding additional parameters and increasing the
computational complexity. The problem of degradation and gradient disappearance caused by
increasing depth of neural network model is solved, and the training accuracy and speed of the
CNN is improved. The proposed Adam optimizer implements adaptive changes to the learning
rate of the optimizer by adding a power-exponential correction factor. The decay mechanism of
the adaptive power-exponential learning rate guides the parameters to converge towards the global
minima, which improves the convergence performance of the CNN network model. The
performance of the proposed network, general network and traditional method are compared and
verified by using MaFaulDa and CWRU bearing datasets. Compared with LeNet-5, LSTM and
traditional SVM and BPNN fault diagnosis methods, the proposed method can diagnose the fault
type of bearing fault data more accurately.
Acknowledgements
This work was supported by the Key Research and Development Program of Shaanxi Province
of China (2022SF-259). It was also supported by the graduate student innovation fund of Xi’an
University of Post and Telecommunications (CXJJLZ202016).
References
[1] T. Tran, B.-S. Yang, F. Gu, and A. Ball, “Thermal image enhancement using bi-dimensional empirical
mode decomposition in combination with relevance vector machine for rotating machinery fault
A CONVOLUTIONAL NEURAL NETWORK METHOD BASED ON A DAM OPTIMIZER WITH POWER- EXPONENTIAL LEARNING RATE FOR BEARING
FAULT DIAGNOSIS . Y OUMING WANG, Z HAO X IAO, G ONGQING CAO
676 J OURNAL OF V IBROENGINEERING. J UNE 2022, V OLUME 24, I SSUE 4
diagnosis,” Mechanical Systems and Signal Processing, Vol. 38, No. 2, pp. 601–614, Jul. 2013,
https://doi.org/10.1016/j.ymssp.2013.02.001
[2] B.-S. Yang, X. Di, and T. Han, “Random forests classifier for machine fault diagnosis,” Journal of
Mechanical Science and Technology, Vol. 22, No. 9, pp. 1716–1725, Sep. 2008,
https://doi.org/10.1007/s12206-008-0603-6
[3] R. Jegadeeshwaran and V. Sugumaran, “Fault diagnosis of automobile hydraulic brake system using
statistical features and support vector machines,” Mechanical Systems and Signal Processing,
Vol. 52-53, pp. 436–446, Feb. 2015, https://doi.org/10.1016/j.ymssp.2014.08.007
[4] Z. Li, H. Fang, and M. Huang, “Diversified learning for continuous hidden Markov models with
application to fault diagnosis,” Expert Systems with Applications, Vol. 42, No. 23, pp. 9165–9173, Dec.
2015, https://doi.org/10.1016/j.eswa.2015.08.027
[5] A. Youssef, C. Delpha, and D. Diallo, “An optimal fault detection threshold for early detection using
Kullback-Leibler Divergence for unknown distribution data,” Signal Processing, Vol. 120,
pp. 266–279, Mar. 2016, https://doi.org/10.1016/j.sigpro.2015.09.008
[6] M. M. Gupta and D. H. Rao, “On the principles of fuzzy neural networks,” Fuzzy Sets and Systems,
Vol. 61, No. 1, pp. 1–18, Jan. 1994, https://doi.org/10.1016/0165-0114(94)90279-8
[7] Y. Zhu, X. Cheng, and L. Wang, “A novel fault detection method for an integrated navigation system
using Gaussian process regression,” Journal of Navigation, Vol. 69, No. 4, pp. 905–919, Jul. 2016,
https://doi.org/10.1017/s0373463315001034
[8] P. C. Jena, “Fault assessment of FRC cracked beam by using neuro-fuzzy hybrid technique,” Materials
Today: Proceedings, Vol. 5, No. 9, pp. 19216–19223, 2018,
https://doi.org/10.1016/j.matpr.2018.06.277
[9] N. Koteleva, N. Korolev, Y. Zhukovskiy, and G. Baranov, “A soft sensor for measuring the wear of an
induction motor bearing by the park’s vector components of current and voltage,” Sensors, Vol. 21,
No. 23, p. 7900, Nov. 2021, https://doi.org/10.3390/s21237900
[10] G. Harutyunyan, S. Martirosyan, S. Shoukourian, and Y. Zorian, “Memory physical aware multi-level
fault diagnosis flow,” IEEE Transactions on Emerging Topics in Computing, Vol. 8, No. 3,
pp. 700–711, Jul. 2020, https://doi.org/10.1109/tetc.2018.2789818
[11] H. Li, G. Hu, J. Li, and M. Zhou, “Intelligent fault diagnosis for large-scale rotating machines using
binarized deep neural networks and random forests,” IEEE Transactions on Automation Science and
Engineering, Vol. 99, pp. 1–11, 2021, https://doi.org/10.1109/tase.2020.3048056
[12] Y. Lu, R. Xie, and S. Y. Liang, “CEEMD-assisted kernel support vector machines for bearing
diagnosis,” The International Journal of Advanced Manufacturing Technology, Vol. 106, No. 7-8,
pp. 3063–3070, Feb. 2020, https://doi.org/10.1007/s00170-019-04858-w
[13] Y. Nikitin, P. Božek, and J. Peterka, “Logical-linguistic model of diagnostics of electric drives with
sensors support,” Sensors, Vol. 20, No. 16, p. 4429, Aug. 2020, https://doi.org/10.3390/s20164429
[14] P. S. Kumar, L. A. Kumaraswamidhas, and S. K. Laha, “Selection of efficient degradation features for
rolling element bearing prognosis using Gaussian process regression method,” ISA Transactions,
Vol. 112, pp. 386–401, Jun. 2021, https://doi.org/10.1016/j.isatra.2020.12.020
[15] B. Zhao, X. Zhang, Z. Zhan, and S. Pang, “Deep multi-scale convolutional transfer learning network:
A novel method for intelligent fault diagnosis of rolling bearings under variable working conditions
and domains,” Neurocomputing, Vol. 407, No. 24, pp. 24–38, Sep. 2020,
https://doi.org/10.1016/j.neucom.2020.04.073
[16] Wei et al., “Rolling bearing fault diagnosis based on the deep neural networks,” (in Chinese), Modular
Machine Tool and Automatic Manufacturing Technique, Vol. 11, pp. 88–91, 2017,
https://doi.org/10.13462/j.cnki.mmtamt.2017.11.023
[17] F. Lv, C. Wen, M. Liu, and Z. Bao, “Weighted time series fault diagnosis based on a stacked sparse
autoencoder,” Journal of Chemometrics, Vol. 31, No. 9, p. e2912, Sep. 2017,
https://doi.org/10.1002/cem.2912
[18] H. Shao, H. Jiang, X. Zhang, and M. Niu, “Rolling bearing fault diagnosis using an optimization deep
belief network,” Measurement Science and Technology, Vol. 26, No. 11, p. 115002, Nov. 2015,
https://doi.org/10.1088/0957-0233/26/11/115002
[19] H. Liu, J. Zhou, Y. Zheng, W. Jiang, and Y. Zhang, “Fault diagnosis of rolling bearings with recurrent
neural network-based autoencoders,” ISA Transactions, Vol. 77, pp. 167–178, Jun. 2018,
https://doi.org/10.1016/j.isatra.2018.04.005
A CONVOLUTIONAL NEURAL NETWORK METHOD BASED ON A DAM OPTIMIZER WITH POWER- EXPONENTIAL LEARNING RATE FOR BEARING
FAULT DIAGNOSIS . Y OUMING WANG, Z HAO X IAO, G ONGQING CAO
ISSN PRINT 1392-8716, ISSN O NLINE 2538-8460, K AUNAS, L ITHUANIA 677
[20] W. Fuan, J. Hongkai, S. Haidong, D. Wenjing, and W. Shuaipeng, “An adaptive deep convolutional
neural network for rolling bearing fault diagnosis,” Measurement Science and Technology, Vol. 28,
No. 9, p. 095005, Sep. 2017, https://doi.org/10.1088/1361-6501/aa6e22
[21] S. Mandal, B. Santhi, S. Sridhar, K. Vinolia, and P. Swaminathan, “Nuclear power plant thermocouple
sensor fault detection and classification using deep learning and generalized likelihood ratio test,”
IEEE Transactions on Nuclear Science, Vol. 64, No. 6, pp. 1–1, 2017,
https://doi.org/10.1109/tns.2017.2697919
[22] V. Veerasamy et al., “LSTM recurrent neural network classifier for high impedance fault detection in
solar PV integrated power system,” IEEE Access, Vol. 9, pp. 32672–32687, 2021,
https://doi.org/10.1109/access.2021.3060800
[23] M. Liang, Y. Wang, X. Wu, L. Qian, and L. Chen, “Fault recognition of rolling bearing with small-
scale dataset based on transfer learning,” Journal of Vibroengineering, Vol. 23, No. 5, pp. 1160–1170,
Aug. 2021, https://doi.org/10.21595/jve.2021.21784
[24] J. Grezmak, J. Zhang, P. Wang, and R. X. Gao, “Multi-stream convolutional neural network-based
fault diagnosis for variable frequency drives in sustainable manufacturing systems,” Procedia
Manufacturing, Vol. 43, pp. 511–518, 2020, https://doi.org/10.1016/j.promfg.2020.02.181
[25] I. Mukherjee and S. Tallur, “Light-weight CNN enabled edge-based framework for machine health
diagnosis,” IEEE Access, Vol. 9, pp. 84375–84386, 2021,
https://doi.org/10.1109/access.2021.3088237
[26] P. Kumar and A. Shankar Hati, “Convolutional neural network with batch normalisation for fault
detection in squirrel cage induction motor,” IET Electric Power Applications, Vol. 15, No. 1,
pp. 39–50, Jan. 2021, https://doi.org/10.1049/elp2.12005
[27] I. Lomov, M. Lyubimov, I. Makarov, and L. E. Zhukov, “Fault detection in Tennessee Eastman process
with temporal deep learning models,” Journal of Industrial Information Integration, Vol. 23,
p. 100216, Sep. 2021, https://doi.org/10.1016/j.jii.2021.100216
[28] Zhu et al., “Fault diagnosis for rolling element bearings based on multi-sensor signals and CNN,” (in
Chinese), Vibration and Shock, Vol. 39, pp. 172–178, 2020,
https://doi.org/10.13465/j.cnki.jvs.2020.04.022
[29] Liu et al., “Real-time anti-noise fault diagnosis algorithm of one-dimensional convolutional neural
network,” (in Chinese), Journal of Harbin Institute of Technology, Vol. 51, No. 7, pp. 89–95, 2019,
https://doi.org/10.11918/j.issn.0367-6234.201809020
[30] Q. Niu, “Discussion on fault diagnosis of and solution seeking for rolling bearing based on deep
learning,” Academic Journal of Manufacturing Engineering, Vol. 16, No. 1, pp. 58–64, 2018.
[31] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in 2016 IEEE
Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770–778, Jun. 2016,
https://doi.org/10.1109/cvpr.2016.90
[32] K. He, X. Zhang, S. Ren, and J. Sun, “Identity mappings in deep residual networks,” in Computer
Vision – ECCV 2016, pp. 630–645, 2016, https://doi.org/10.1007/978-3-319-46493-0_38
[33] H. Zhao, F. Liu, H. Zhang, and Z. Liang, “Research on a learning rate with energy index in deep
learning,” Neural Networks, Vol. 110, pp. 225–231, Feb. 2019,
https://doi.org/10.1016/j.neunet.2018.12.009
[34] Yang et al., “Improved CNN algorithm based on Dropout and ADAM optimizer,” (in Chinese), Journal
of Huazhong University of Science and Technology, Vol. 46, No. 7, pp. 122–127, 2018,
https://doi.org/10.13245/j.hust.180723
[35] P. Christoffersen and K. Jacobs, “The importance of the loss function in option valuation,” Journal of
Financial Economics, Vol. 72, No. 2, pp. 291–318, May 2004,
https://doi.org/10.1016/j.jfineco.2003.02.001
[36] H. Shao, D. Xu, and G. Zheng, “Convergence of a batch gradient algorithm with adaptive momentum
for neural networks,” Neural Processing Letters, Vol. 34, No. 3, pp. 221–228, Dec. 2011,
https://doi.org/10.1007/s11063-011-9193-x
[37] Duchi John C., E. Hazan, and Y. Singer, “Adaptive subgradient methods for online learning and
stochastic optimization,” Journal of Machine Learning Research, Vol. 12, pp. 2121–2159, 2011.
[38] S. Postalcıoğlu, “Performance analysis of different optimizers for deep learning-based image
recognition,” International Journal of Pattern Recognition and Artificial Intelligence, Vol. 34, No. 2,
p. 2051003, Feb. 2020, https://doi.org/10.1142/s0218001420510039
A CONVOLUTIONAL NEURAL NETWORK METHOD BASED ON A DAM OPTIMIZER WITH POWER- EXPONENTIAL LEARNING RATE FOR BEARING
FAULT DIAGNOSIS . Y OUMING WANG, Z HAO X IAO, G ONGQING CAO
678 J OURNAL OF V IBROENGINEERING. J UNE 2022, V OLUME 24, I SSUE 4
[39] Z. Chang, Y. Zhang, and W. Chen, “Electricity price prediction based on hybrid model of Adam
optimized LSTM neural network and wavelet transform,” Energy, Vol. 187, p. 115804, Nov. 2019,
https://doi.org/10.1016/j.energy.2019.07.134
[40] R. K. Yadav and Anubhav, “PSO-GA based hybrid with Adam optimization for ANN training with
application in Medical Diagnosis,” Cognitive Systems Research, Vol. 64, pp. 191–199, Dec. 2020,
https://doi.org/10.1016/j.cogsys.2020.08.011
[41] A. Rehman and T. Saba, “Neural networks for document image preprocessing: state of the art,”
Artificial Intelligence Review, Vol. 42, No. 2, pp. 253–273, Aug. 2014, https://doi.org/10.1007/s10462-
012-9337-z
[42] D. Zachariah and P. Stoica, “Online hyperparameter-free sparse estimation method,” IEEE
Transactions on Signal Processing, Vol. 63, No. 13, pp. 3348–3359, Jul. 2015,
https://doi.org/10.1109/tsp.2015.2421472
[43] M. A. Marins, F. M. L. Ribeiro, S. L. Netto, and E. A. B. Da Silva, “Improved similarity-based
modeling for the classification of rotating-machine failures,” Journal of the Franklin Institute,
Vol. 355, No. 4, pp. 1913–1930, Mar. 2018, https://doi.org/10.1016/j.jfranklin.2017.07.038
[44] W. A. Smith and R. B. Randall, “Rolling element bearing diagnostics using the Case Western Reserve
University data: A benchmark study,” Mechanical Systems and Signal Processing, Vol. 64-65,
pp. 100–131, Dec. 2015, https://doi.org/10.1016/j.ymssp.2015.04.021
[45] Ron Kohavi, “A study of cross-validation and bootstrap for accuracy estimation and model selection,”
in IJCAI’95: Proceedings of the 14th international joint conference on Artificial intelligence,
pp. 1137–1145, Jan. 1995.
[46] Laurens van der Maaten and Geoffrey Hinton, “Visualizing data using t-SNE,” Journal of Machine
Learning Research, Vol. 9, No. 86, pp. 2579–2605, 2008.
[47] D. Li, F. Yang, and X. Wang, “Study on ensemble crop information extraction of remote sensing images
based on SVM and BPNN,” Journal of the Indian Society of Remote Sensing, Vol. 45, No. 2,
pp. 229–237, Apr. 2017, https://doi.org/10.1007/s12524-016-0597-y

No comments:

Post a Comment

Connect broadband

How to Evaluate the Skill of Deep Learning Models

 I often see practitioners expressing confusion about how to evaluate a deep learning model. This is often obvious from questions like: W...