Published: 04 November 2023

A comprehensive review of mechanical fault diagnosis methods based on convolutional neural network

Junjian Hou, Xikang Lu, Yudong Zhong, Wenbin He, Dengfeng Zhao, Fang Zhou
Henan Provincial Key Laboratory of Intelligent Manufacturing of Mechanical Equipment, Mechanical and Electrical Engineering Institute, Zhengzhou University of Light Industry, Zhengzhou, China
Corresponding authors: Xikang Lu, Wenbin He

Abstract

Mechanical fault diagnosis can keep equipment failures from worsening and is important for the stable operation of mechanical equipment. This paper first reviews the three basic approaches to fault diagnosis and the common data-driven methods, focusing on the characteristics and advantages of deep learning and convolutional neural networks (CNN). It then introduces the basic structure and working principle of CNN, together with some basic practices for obtaining better training results. Next, CNN-based fault diagnosis methods are reviewed from the perspectives of data processing, data fusion, and sample set construction, along with their application scenarios, advantages, and disadvantages. The concepts of transfer learning (TL) are then introduced, and the application scenarios, advantages, and disadvantages of mechanical fault diagnosis techniques that combine TL with CNN are reviewed. Finally, the current difficulties and challenges of CNN are discussed, and research directions for CNN in fault diagnosis are outlined. Although similar reviews exist, this one aims to introduce the basic methods of fault diagnosis and, from there, the basic applications of data-driven fault diagnosis, the use of CNN in fault diagnosis, and the application scenarios, advantages, and disadvantages of combining TL and CNN, as well as some open problems and prospects, so that researchers can gain a basic understanding of the field.


Highlights

  • The fault diagnosis methods for mechanical equipment are introduced in detail, with an emphasis on mechanical fault diagnosis based on convolutional neural networks.
  • The advantages of deep learning models, represented by convolutional neural networks, over traditional machine learning are summarized.
  • The concept of transfer learning is introduced, along with the application of transfer-learning-based convolutional neural networks to mechanical equipment fault diagnosis.
  • Some current difficulties in mechanical equipment fault diagnosis are listed, and some promising research directions are suggested.

1. Introduction

Fault diagnosis of mechanical equipment involves monitoring, diagnosing, and predicting the state of the equipment to ensure stable operation [1-3]. As industrialization deepens and broadens, the safe and stable operation of mechanical equipment and its component systems is becoming increasingly important in industrial production. In particular, the failure of large mechanical equipment can cause huge economic losses and heavy casualties. As mechanical equipment becomes more sophisticated and its control systems more complex, fault diagnosis is increasingly necessary to keep it operating stably [4-6].

At present, mechanical equipment fault diagnosis methods fall mainly into three groups: methods based on physical models, methods based on signal processing, and data-driven methods [7-10], as shown in Fig. 1. Physical-model-based methods obtain the running data of the equipment and analyze them against a physical model of the equipment to determine its running state. This approach requires an in-depth understanding of how the equipment works, and the more complex the equipment, the harder it is to establish a complete physical model. Signal-processing-based methods mainly use a variety of filtering techniques to remove noise and highlight the fault signal. This approach requires knowledge of fault characterization theory and the related mathematics, so it is difficult to popularize widely. Data-driven methods use data mining to uncover the deep fault feature representations hidden in the data and establish a mapping between the data and the faults, which is then used to detect and identify the fault source [11]. The data-driven fault diagnosis process is shown in Fig. 2. Because they do not require deep professional knowledge and offer a degree of intelligence, data-driven methods have good development prospects.

Fig. 1. Mechanical equipment fault diagnosis

Fig. 2. Data-driven fault diagnosis

As an early data-driven approach to mechanical fault diagnosis, machine learning has a wide range of applications and mature algorithms, such as the support vector machine (SVM), decision tree, random forest, logistic regression, naive Bayes, and neural network. Kumar A. et al. [12] proposed a multi-scale kernel support vector machine for rolling bearing fault diagnosis, which has higher accuracy and generalization ability than the traditional SVM. Wan et al. [13] proposed a random-forest-based fault identification method for rolling bearings, which achieves higher recognition accuracy than the BP neural network and the K-nearest neighbor algorithm. Zhou et al. [14] used simulation software to build an SVM-based diagnosis scheme for a diesel-electric hybrid power system; the fault recognition rate reached 98 %, indicating that the SVM-based diagnosis algorithm can identify faults effectively. C Tutivén et al. [15] proposed an MDP-SVM fault discrimination method, which can identify multiple types of faults without any need. Compared with the traditional algorithm, their accuracy rate is up to 95.397 %. Wang et al. [16] proposed an AdaBoost algorithm based on decision trees. Compared with the conventional algorithm, it generalizes better and needs fewer iterations to reach the same accuracy.

However, traditional machine learning methods require the collected data to be processed mathematically to extract high-quality features before they can be applied to fault identification and diagnosis, which demands data processing and signal analysis skills from researchers. Moreover, the resulting model generalizes weakly and can only solve specific problems. In general, traditional machine learning has the following disadvantages: (1) designing and extracting features requires professional knowledge and a mathematical basis, and is strongly influenced by manual work; (2) the extracted features are shallow and generalize weakly; (3) the model training stage and the feature extraction stage are separate, so the two cannot be optimized jointly; (4) its data processing ability is weak, making it hard to adapt to big data.

Deep learning does not require manual feature extraction. Data are fed directly into the model for training, and the trained model then identifies faults in mechanical equipment. Depending on the network structure, stacked auto-encoders, recurrent neural networks, deep belief networks, convolutional neural networks, and others are widely used in mechanical fault diagnosis. For example, Gu et al. [17] proposed a fault diagnosis method based on multi-task deep learning, which achieves higher fault recognition accuracy and better anti-noise performance than a single-task deep learning model. Tran V. T. et al. [18] proposed a fault diagnosis method for industrial robots based on a deep belief network, reporting a fault diagnosis accuracy as high as 99.4 %. Park P. et al. [19] proposed a fault diagnosis method for AC motor systems based on a long short-term memory (LSTM) network, and the results show a high fault recognition rate. Yan et al. [20] proposed an intelligent bearing fault diagnosis method based on an improved stacked auto-encoder; the fault recognition rate reached 98.93 %, better than that of the comparison method.

As a typical deep learning algorithm, CNN is widely used in machine vision, speech processing, and other fields [21]. Its wide application stems mainly from the following advantages: the multi-layer convolutional structure has powerful feature extraction ability; the pooling layer helps prevent overfitting; and feature extraction, feature selection, and classifier training are optimized jointly. This feature extraction ability allows CNN to uncover feature correspondences deep in the data. Training on massive data would otherwise lead to an excessive number of parameters and slow iteration, but CNN greatly reduces the number of model parameters through local receptive fields, weight sharing, and pooling, which both speeds up training and helps prevent overfitting. These advantages make CNN well suited to fault diagnosis.

Although CNN has achieved good results in mechanical fault diagnosis, introducing TL can further improve the fault recognition rate and the generalization ability of the model when only small samples are available. At present, transfer learning combined with CNN is considered to have great development potential in the field of mechanical fault diagnosis [22].

Existing reviews usually survey deep learning or transfer learning methods, theoretical architectures, and related books, or run classification experiments to verify the effectiveness of the described methods on problems of specific application objects, and then point out the problems and future prospects of those methods [23-25]. For example, Qian et al. [26] take the rotating machinery of nuclear power plants as their object, introduce the deep learning algorithms and theoretical architectures in detail, and carry out comparative experiments to verify that the described models are more effective and robust. This paper instead summarizes the basic methods and development direction of fault diagnosis, which leads to the current research hotspots of machine learning and deep learning. Taking the convolutional neural network as a representative of deep learning, it introduces the network's basic structure, principle, function, and application scenarios with reference to the literature. Because fault samples are often scarce, the concept and basic methods of transfer learning are briefly introduced, and the combination of transfer learning and convolutional neural networks is emphasized in terms of its functions, effects, and application scenarios in mechanical fault diagnosis. To support researchers, this review summarizes the recent development of CNN-based mechanical fault diagnosis in terms of basic concepts, basic methods, and application scenarios, rather than theoretical frameworks and mathematical derivations.

In this paper, the development of mechanical fault diagnosis technology is reviewed, with an emphasis on data-driven methods. Within the data-driven methods, the advantages and disadvantages of diagnosis based on traditional machine learning and diagnosis based on deep learning are analyzed in turn. Within deep learning, the basic structure and working principle of convolutional neural networks are introduced, and their characteristics and advantages are analyzed. First, the CNN-based fault diagnosis process is introduced, and CNN-based fault diagnosis methods are summarized together with fault research on specific mechanical systems. Second, the principle of transfer learning is introduced, and the application of convolutional neural networks combined with transfer learning to mechanical system fault diagnosis is summarized. Finally, some conclusions are given, the current difficulties and challenges of convolutional neural networks are discussed, and research directions for CNN in fault diagnosis are outlined.

Fig. 3. The structure of this paper

As shown in Fig. 3, this paper is organized as follows: the second part introduces the basic structure and working principle of CNN; the third part covers mechanical fault diagnosis based on the convolutional neural network; the fourth part introduces TL (Transfer Learning) and details the combination of CNN and TL for mechanical fault diagnosis. Finally, some conclusions and prospects are given.

2. Basic principles of convolutional neural networks

The convolutional neural network is a kind of multi-layer feedforward neural network. Its local receptive fields, weight sharing, pooling layers, and related structures greatly reduce the number of model parameters and speed up training without sacrificing expressive power. A typical convolutional neural network includes an input layer, convolutional layers, pooling layers, fully connected layers, and an output layer. Stacking multiple convolution and pooling layers amounts to extracting features from the original data repeatedly, with each layer acting like a filter. After several rounds of extraction, deep features that can be used for fault classification or recognition are gradually obtained. These features are largely insensitive to geometric transformations of the data, such as small translations. Because the network produces sparse representations of the data, it is well suited to deep learning on big data. Its basic structure is shown in Fig. 4.

Fig. 4. The framework of CNN

The input layer imports the collected information, after data processing, into the network in a form the network can accept. The quality of data processing usually directly affects the fault recognition rate. Convolutional layers, the core component of the convolutional neural network, perform the convolution operations, and different combinations of convolutional layers can extract deep features from the input. The pooling layer downsamples the data, which reduces the amount of computation, lowers the risk of overfitting, and speeds up convergence. The two main pooling methods are maximum pooling and average pooling. After multiple convolutional and pooling layers, CNN usually adopts one or more fully connected layers as a high-level structure for higher-level inference and classification. A Softmax classifier is usually used for the output.
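The layer sequence described above can be illustrated with a minimal forward pass written in plain Python. This is only a sketch under stated assumptions: the input is a one-dimensional signal, and the kernel, the dense weights, and the two-class output are hypothetical values chosen for illustration rather than parameters from any cited work.

```python
import math

def conv1d(signal, kernel, bias=0.0):
    """Valid 1D convolution with stride 1 (cross-correlation, as is usual in CNNs)."""
    k = len(kernel)
    return [sum(signal[i + j] * kernel[j] for j in range(k)) + bias
            for i in range(len(signal) - k + 1)]

def relu(values):
    """Element-wise ReLU activation."""
    return [max(0.0, v) for v in values]

def max_pool1d(values, size=2):
    """Non-overlapping maximum pooling; a trailing partial window is dropped."""
    return [max(values[i:i + size])
            for i in range(0, len(values) - size + 1, size)]

def dense(values, weights, biases):
    """Fully connected layer with one weight row per output class."""
    return [sum(w * v for w, v in zip(row, values)) + b
            for row, b in zip(weights, biases)]

def softmax(logits):
    """Numerically stable softmax over the class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Toy input signal and a hypothetical difference-like kernel.
signal = [0.0, 1.0, 0.0, -1.0, 0.0, 1.0, 0.0, -1.0]
kernel = [1.0, 0.0, -1.0]

# Convolution -> ReLU -> pooling gives the extracted features.
features = max_pool1d(relu(conv1d(signal, kernel)), size=2)

# Hypothetical fully connected weights for two classes, then softmax output.
weights = [[0.5, 0.0, 0.5], [-0.5, 0.0, -0.5]]
biases = [0.0, 0.0]
probs = softmax(dense(features, weights, biases))
```

In a real network the kernel and dense weights would be learned by backpropagation, and several convolution and pooling stages would be stacked before the fully connected layer.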

The above is the basic structure of the convolutional neural network. Once the network is built, data are fed into it for training. To obtain good training results, the following four aspects deserve special attention. (1) Use of dropout: when the network output performs well on the training set but fits the validation set poorly, the model is overfitting. Dropout randomly drops some neurons in a layer with a certain probability, so that the network structure differs in each iteration update, which mitigates overfitting. Dropout is usually placed at the input layer or the fully connected layer, both of which can achieve good performance. (2) Selection of the learning rate: the learning rate is equivalent to the step length of the learning process. In gradient descent, for example, if the step is too small, the network may not reach the minimum even after many iterations; if the step is too large, the model may swing back and forth around the minimum. Only with a properly chosen step can the network find the minimum quickly and reliably. At present, the learning rate is chosen mainly from accumulated experience, without a fixed algorithm. (3) Size of the convolution kernel: only a kernel whose dimension is greater than 1 can enlarge the receptive field, and the larger the kernel, the more parameters it has. For the same receptive field, smaller kernels require fewer parameters and less computation. (4) Selection of the activation function: a nonlinear activation function enhances the fitting ability of the network. The most commonly used activation function is ReLU, which alleviates the vanishing gradient problem during training and shortens training time.
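Two of the points above, the learning rate and the kernel size, can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the toy objective f(w) = w², stride-1 convolutions without padding effects, and a hypothetical channel count of 16. The function names are not from any library.

```python
def gradient_descent(lr, w0=1.0, steps=20):
    """Minimise f(w) = w**2 (gradient 2*w) and return the trajectory of w."""
    w = w0
    path = [w]
    for _ in range(steps):
        w = w - lr * 2.0 * w
        path.append(w)
    return path

def receptive_field(kernel_size, layers):
    """Receptive field of `layers` stacked stride-1 convolutions."""
    return 1 + layers * (kernel_size - 1)

def conv_weights(kernel_size, layers, channels):
    """Weight count (biases ignored) of stacked 2D convolutions
    with `channels` input and output channels in every layer."""
    return layers * kernel_size * kernel_size * channels * channels

# Learning rate: too small barely moves, too large oscillates, a suitable one converges.
too_small = gradient_descent(lr=0.001)
too_large = gradient_descent(lr=1.0)
suitable = gradient_descent(lr=0.4)

# Kernel size: two 3x3 layers see the same 5x5 region as one 5x5 layer.
rf_small, rf_large = receptive_field(3, 2), receptive_field(5, 1)
weights_small = conv_weights(3, 2, channels=16)
weights_large = conv_weights(5, 1, channels=16)
```

With a step of 1.0 the iterate flips between 1 and -1 on this objective, which is the back-and-forth swinging described in point (2). The two stacked 3x3 layers cover the same receptive field as the single 5x5 layer with fewer weights, which is the trade-off described in point (3).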

3. Mechanical fault diagnosis based on convolutional neural network

3.1. The basic process of fault diagnosis based on a convolutional neural network

Because of the excellent performance of the convolutional neural network in many fields, researchers have applied it to fault diagnosis. The general steps are: first, define the fault modes; second, construct the data sets; third, preprocess the data; fourth, build the convolutional neural network model; fifth, train the model; sixth, input the test samples into the model for evaluation and optimization; finally, obtain the fault diagnosis results. The process is shown in Fig. 5.

Fig. 5. The process of fault diagnosis

To construct a CNN-based fault diagnosis model, the corresponding fault data set must be established first. The quality of the data fed into the CNN model directly affects the diagnosis result, and different diagnosis objects require different fault signals. The fault data can be collected by a sensor signal acquisition system, either through experiments or in real fault scenarios, and the data set is completed after preprocessing. However, a single signal acquisition system often cannot meet the requirements of complex mechanical fault diagnosis, so multiple sensors are used to collect several kinds of signals. Data fusion is then carried out to complete the sample data set, which improves the diagnostic accuracy for complex mechanical systems and supports the construction of the CNN fault diagnosis model.
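One simple way to build such a sample set is to cut each synchronized sensor recording into overlapping windows and stack the windows from different sensors as channels of one sample. The sketch below assumes this data-level fusion scheme, equal-length signals, and a hypothetical label name. It is not the procedure of any particular cited work.

```python
def sliding_windows(signal, length, stride):
    """Cut a 1D signal into fixed-length windows; overlap occurs when stride < length."""
    return [signal[start:start + length]
            for start in range(0, len(signal) - length + 1, stride)]

def build_samples(sensor_signals, label, length, stride):
    """Fuse synchronized sensor signals into labeled multi-channel samples.

    sensor_signals: list of equal-length 1D signals, one per sensor.
    Returns a list of (channels, label) pairs, where channels holds
    one window per sensor taken from the same time span.
    """
    per_sensor = [sliding_windows(sig, length, stride) for sig in sensor_signals]
    return [(list(channels), label) for channels in zip(*per_sensor)]

# Toy recordings from two hypothetical sensors (e.g. vibration and current).
vibration = [float(i) for i in range(10)]
current = [10.0 + i for i in range(10)]

samples = build_samples([vibration, current], label="inner_race_fault",
                        length=4, stride=2)
```

Overlapping windows also act as a basic form of data expansion, since one recording yields several training samples.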

3.2. Fault diagnosis method based on convolutional neural network

Although the generalization ability of the convolutional neural network has greatly improved, a dedicated network still needs to be established for each fault diagnosis problem. A shallow network cannot extract deep features, so its fault diagnosis accuracy is limited. If the network is too deep, however, performance can get worse, which is known as the network degradation problem. So far, no universal convolutional neural network structure applies to all fault diagnosis tasks, so the construction of the network model is particularly important.

The convolutional neural network is usually used to process image-type data, so some researchers transform the collected signals into two-dimensional images through data processing for mechanical fault diagnosis. For example, Che et al. [27] proposed a CNN-based fault diagnosis algorithm for rolling bearing cages to address unstable vibration signals and the difficulty of extracting features. The fault recognition rate exceeded 99 %, with good generalization and robustness. Azamfar M. et al. [28] proposed a CNN-based fault diagnosis method for rolling bearings, whose signals are non-stationary, nonlinear, and easily disturbed; the method changes the dimension of the signal to suit the network input. The results show that the method can effectively identify fault types and is highly stable. Piedad E. J. et al. [29] proposed a fault diagnosis method based on a two-dimensional convolutional neural network for rolling bearings under variable working conditions. The results show that, compared with a traditional convolutional neural network, this method significantly improves fault diagnosis accuracy and efficiency.
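As an illustration of turning a signal into an image, the sketch below uses one simple conversion: a segment of size × size points is min-max scaled to gray levels 0 to 255 and laid out row by row. This is an assumption made for illustration. The cited works may use other conversions, such as time-frequency transforms, and the function name is hypothetical.

```python
def signal_to_image(segment, size):
    """Map a 1D segment of size*size points to a size x size grayscale image."""
    if len(segment) != size * size:
        raise ValueError("segment must contain exactly size * size points")
    lo, hi = min(segment), max(segment)
    span = (hi - lo) or 1.0  # avoid division by zero for a constant segment
    # Min-max scale every point to an integer gray level in [0, 255].
    pixels = [round((v - lo) / span * 255) for v in segment]
    # Fill the image row by row.
    return [pixels[r * size:(r + 1) * size] for r in range(size)]

image = signal_to_image([0.0, 1.0, 2.0, 3.0], size=2)
```

The resulting two-dimensional array can then be fed to a two-dimensional convolutional network in the same way as an ordinary image.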

Owing to the translation invariance of the convolutional neural network, the data can be processed into two-dimensional images for fault diagnosis. However, the collected one-dimensional temporal signals retain the fault information of the machinery more completely, so some researchers have begun to feed the one-dimensional signals directly into the network. For example, Du et al. [30] proposed a fault diagnosis method for analog circuits based on a one-dimensional convolutional neural network, which achieves end-to-end fault diagnosis, effectively extracts deep fault features, and offers higher classification accuracy and stability. Jin et al. [31] proposed a drill pipe fault diagnosis model based on a one-dimensional convolutional neural network, which effectively identifies drill pipe fault types with an average accuracy of 98.7 % and also performs well under different working conditions and in noisy environments. Wu et al. [32] proposed a gearbox bearing fault diagnosis method based on adaptive noise cancellation and a one-dimensional convolutional neural network to address strong vibration interference. It separates the periodic and random signals, intelligently extracts fault features from the random signal, and achieves high diagnosis accuracy under strong vibration interference. Niyongabo J. et al. [33] proposed a fault diagnosis method based on one-dimensional convolution and orthogonal regularization to address the poor fault diagnosis performance for industrial robots. The results show that, compared with existing methods, this method has a higher fault recognition rate and can effectively diagnose faults in industrial robots.

However, mechanical systems tend to be more complicated. Data augmentation and expansion, multi-scale data processing, and multi-sensor fault signal acquisition allow the fault types of mechanical systems to be identified more accurately. For example, An et al. [34] proposed a fault diagnosis method based on a multi-scale convolutional neural network for rolling bearings under heavy noise, variable load, and complex working conditions, which effectively improves diagnosis accuracy and enhances the robustness of the network. Li et al. [35] proposed a fault diagnosis method based on multi-scale one-dimensional deep convolution to address the difficulty of extracting and recognizing fault features in electromechanical equipment. The results show that this method has a high fault recognition rate, high diagnosis accuracy, and strong robustness. Hasan M. J. et al. [36] proposed a bearing fault diagnosis method based on multi-sensor information fusion to address the difficulty of diagnosing mechanical faults of aero-engine bearings under multiple internal and external excitations. Compared with traditional fault diagnosis methods such as SVM and ANN, the accuracy of the method is improved by 36.92 % and 18.9 %, respectively, and it identifies fault types effectively. Other relevant literature is summarized in Table 1.
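The multi-scale data processing mentioned above can take several forms. One common form, assumed here purely for illustration, is coarse-graining: the signal is averaged over non-overlapping groups of points at several scales, and each scaled view is passed to its own convolutional branch. The function names and the choice of scales are hypothetical.

```python
def coarse_grain(signal, scale):
    """Average non-overlapping groups of `scale` points (scale 1 returns the signal)."""
    return [sum(signal[i:i + scale]) / scale
            for i in range(0, len(signal) - scale + 1, scale)]

def multi_scale_views(signal, scales=(1, 2, 4)):
    """Return one coarse-grained view of the signal per scale."""
    return {scale: coarse_grain(signal, scale) for scale in scales}

views = multi_scale_views([1.0, 3.0, 5.0, 7.0, 9.0, 11.0, 13.0, 15.0])
```

Fine scales keep short impulsive components of the signal, while coarse scales emphasize slower trends, so the branches see complementary information.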

As Table 1 shows, convolutional neural networks can be divided into one-dimensional and two-dimensional convolution according to the dimension of the data being processed. Researchers have achieved good results in the fault diagnosis of complex mechanical systems with both.

Table 1. Convolutional neural network classification table

Classification: Two-dimensional convolution
  • Multiscale network; network convergence. Application situation: Pumping unit fault diagnosis [37, 38], mine hoisting mechanism dynamic system fault diagnosis [39], diesel engine misfire fault real-time diagnosis [40], fan blade icing fault detection [41, 42], AC-DC transmission system fault diagnosis [43]
  • Preprocessing data; model merging. Application situation: Bearing fault diagnosis [44-57], planetary gearbox fault diagnosis [58-60], circuit breaker fault diagnosis [61]
  Advantages: It has small storage space, high diagnostic accuracy, good anti-interference ability and generalization ability, and good stability. By converting one-dimensional signals into two-dimensional time-frequency information, deeper hidden fault information can be mined.
  Disadvantages: A small convolution kernel is used to enhance the nonlinear representation, but a large number of fault samples are needed. The original signal cannot be used directly, and the loss in the process of information conversion is incalculable.

Classification: One-dimensional convolution
  • Simple one-dimensional convolution. Application situation: Intelligent fault diagnosis of rotating machinery [62-65], mechanical fault diagnosis application [66-71]
  • Variable convolution kernel. Application situation: Bearing fault identification [72-75], numerical control machine tool ball screw pair fault diagnosis [76], fault identification of data imbalance [77]
  Advantages: End-to-end fault diagnosis is realized by using a one-dimensional vibration signal directly. The integrity of signal transmission is maintained.
  Disadvantages: The original signal collected directly contains a lot of noise, so extracting hidden fault features from it places higher requirements on the network structure and specific hyper-parameters.

For part of the literature in Table 1, the accuracy of the proposed model and of the comparison models on the test set was collected and plotted, as shown in Fig. 6.

In Fig. 6, the horizontal axis lists the specific references and the vertical axis shows accuracy. "CNN variants" denotes a variant of the CNN framework, "CNN" denotes a simple convolutional neural network, and "ML" stands for machine learning, such as SVM. The figure shows that deep learning frameworks represented by CNN generally perform better than machine learning in fault diagnosis. In addition, machine learning usually requires manual extraction of fault features and therefore generalizes worse than end-to-end deep learning frameworks. For complex fault diagnosis, a plain convolutional neural network alone generally cannot achieve good results, so a specific CNN-based fault diagnosis framework has to be developed. However, developing a new network structure for each fault problem costs both time and resources. Moreover, most of this research assumes that training and test samples are of high quality, sufficient in number, and identically distributed, whereas data collected in practice usually cannot meet these conditions. Transfer learning relaxes these restrictions: it helps the network accumulate prior knowledge and classify faults well even with few fault samples, and fine-tuning the parameters of a trained model can achieve high diagnostic accuracy on small samples while greatly reducing training time. The application of transfer learning combined with deep learning to fault diagnosis has therefore become one of the most promising research directions.

Fig. 6. Reference method bar chart

4. Fault diagnosis method based on convolutional neural network and transfer learning

As a classic and mature deep learning model, CNN is still limited by the requirements of identically distributed data and a sufficient number of training samples. Although convolutional neural networks have achieved good results in fault diagnosis, introducing transfer learning can improve their performance further. Research on the theory of transfer learning began in the 1990s, and it offers a way to address the shortage of labeled sample data for fault diagnosis objects. Transfer learning is now widely used in fault diagnosis, and Stanford Professor Andrew Ng has said that transfer learning will be the driving force of machine learning in the future.

4.1. The concept and classification of transfer learning

There are two basic concepts in transfer learning: domain and task. The existing knowledge is called the source domain, and its corresponding task is called the source task. The knowledge to be learned and its corresponding task are called the target domain and the target task, respectively. A domain consists of two parts: a sample set and its distribution in the feature space. A task likewise consists of two parts: the labels and the decision function that maps feature samples to labels.
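The verbal definitions above are often written compactly as follows. The symbols are an assumption, following a notation widely used in the transfer learning literature rather than one given in this paper: $\mathcal{X}$ is the feature space, $P(X)$ the distribution of the samples, $\mathcal{Y}$ the label space, and $f(\cdot)$ the decision function. The subscripts $S$ and $T$ mark source and target.

```latex
\mathcal{D} = \{\mathcal{X},\, P(X)\}, \qquad
\mathcal{T} = \{\mathcal{Y},\, f(\cdot)\}

\text{Given } (\mathcal{D}_S, \mathcal{T}_S) \text{ and } (\mathcal{D}_T, \mathcal{T}_T)
\text{ with } \mathcal{D}_S \neq \mathcal{D}_T \text{ or } \mathcal{T}_S \neq \mathcal{T}_T,
\text{ improve } f_T(\cdot) \text{ using knowledge from } \mathcal{D}_S \text{ and } \mathcal{T}_S .
```

In fault diagnosis, a typical case is the same label set (the fault types) with different sample distributions, for example data recorded under different working conditions.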

Transfer learning extracts transferable structures or parameters from a known source domain (Ds) and source task (Ts) and applies them to a target task (Tt) in a different target domain (Dt) to solve problems (see Fig. 7). As shown in Fig. 8, transfer learning can be divided into several types according to different classification criteria.

Fig. 7. The concept of transfer learning

Fig. 8. The classification of transfer learning

4.2. Fault diagnosis based on CNN and transfer learning

Fault diagnosis based on transfer learning aims to accumulate the knowledge of a learned model and apply it to the target domain to improve learning efficiency. Traditional machine learning methods, including the support vector machine and the artificial neural network, usually assume that the training set and the test set have the same feature distribution. They start from scratch whenever they face a new task or a new field, and new fields usually lack large numbers of data samples. In transfer learning, knowledge from the source domain (Ds) is transferred to help solve the new task in the target domain (Dt), which improves learning efficiency while adapting to small-sample data.

The process of fault diagnosis combining CNN and transfer learning is shown in Fig. 9. Usually, the collected source domain signals are input into the established convolutional neural network model, which is trained until a satisfactory result is obtained. The parameters or network structure of this CNN are then transferred to a newly built fault diagnosis network. The fault signals collected from the target domain are input into the new network, which helps complete fault diagnosis in the target domain and reduces training time when target domain fault data are scarce.
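The transfer step described above can be sketched in a few lines of plain Python. The sketch makes several assumptions for illustration: the source model is represented only by two hypothetical convolution kernels that stand in for parameters learned on the source domain, those kernels are copied and kept frozen, and only a new logistic classification head is trained on a handful of made-up target samples. It is a toy instance of parameter transfer with fine-tuning of the head, not the procedure of any cited work.

```python
import math

def conv_relu_maxpool(signal, kernel):
    """One feature: 1D convolution, ReLU, then global maximum pooling."""
    k = len(kernel)
    responses = [sum(signal[i + j] * kernel[j] for j in range(k))
                 for i in range(len(signal) - k + 1)]
    return max(max(0.0, r) for r in responses)

def features(signal, kernels):
    """Feature vector with one entry per transferred kernel."""
    return [conv_relu_maxpool(signal, k) for k in kernels]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def transfer_and_fine_tune(source_model, target_data, lr=0.5, epochs=200):
    """Copy the source kernels (frozen) and train only a new head on target data."""
    kernels = [list(k) for k in source_model["kernels"]]  # transferred parameters
    w, b = [0.0] * len(kernels), 0.0                       # new classifier head
    cached = [(features(x, kernels), y) for x, y in target_data]
    for _ in range(epochs):
        for f, y in cached:
            p = sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b)
            grad = p - y  # gradient of the logistic loss w.r.t. the logit
            w = [wi - lr * grad * fi for wi, fi in zip(w, f)]
            b -= lr * grad
    return {"kernels": kernels, "head": (w, b)}

def predict(model, signal):
    """Probability that the signal belongs to the faulty class (label 1)."""
    w, b = model["head"]
    f = features(signal, model["kernels"])
    return sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b)

# Hypothetical kernels standing in for a CNN trained on the source domain.
source_model = {"kernels": [[1.0, -1.0], [1.0, 1.0]]}

# A few made-up target-domain samples: label 0 = healthy, 1 = faulty.
target_data = [
    ([0.0, 0.1, 0.0, 0.1, 0.0, 0.1], 0),
    ([0.1, 0.0, 0.1, 0.1, 0.0, 0.0], 0),
    ([0.0, 1.0, 0.0, 0.1, 0.0, 0.9], 1),
    ([0.1, 0.0, 0.9, 0.0, 0.1, 0.0], 1),
]

target_model = transfer_and_fine_tune(source_model, target_data)
```

Because only the small head is trained, very few target samples and iterations are needed, which reflects the reduced training time mentioned above. In practice some of the transferred layers are often unfrozen and fine-tuned with a small learning rate as well.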

According to the classification of transfer learning methods, feature-based transfer learning extracts features from the datasets of the source domain and the target domain and then maps them into the same space, reducing the feature differences between the two domains and finding similar features. For instance, Liu et al. [78] proposed a rolling bearing fault diagnosis method using deep transfer learning and adaptive weighting to address the loss of diagnosis accuracy caused by additional fault state samples. The results show that this method can overcome the influence of additional fault state samples: its diagnostic accuracy is above 89 %, while that of the comparison method is below 80 %. Zhong S-s et al. [79] proposed a subdomain adaptive deep transfer learning fault diagnosis method to address the inconsistent feature distribution of bearing data under variable working conditions. The results show that the average accuracy of this method is as high as 99 %, outperforming the other methods. Zhao et al. [80] proposed a UATL (Unsupervised Adversarial Transfer Learning) bearing fault diagnosis method to address the difficulty of obtaining labels for bearing fault data and the weak generalization ability of the model. The results show that the method has high diagnostic accuracy and good generalization ability.

Fig. 9. The fault diagnosis of CNN combined with transfer learning

Model-based transfer learning uses the source-domain dataset to train the network and then transfers the network structure or parameters to the target domain, realizing parameter sharing between DS and TS. Kumaresan et al. [81], aiming at the problem of insufficient training data for fault diagnosis, proposed an intelligent fault diagnosis method for rolling bearings based on the combination of long short-term memory and transfer learning. The results show that this method outperforms traditional fault diagnosis methods in identifying various fault categories and has better accuracy and generalization ability. Xia et al. [82] proposed a spacecraft fault diagnosis method based on deep transfer learning to solve the problems of few spacecraft telemetry data samples, high noise, and difficult fault identification in traditional fault diagnosis. The results show that this method can quickly and accurately identify spacecraft fault types. Qin et al. [83] proposed a fast fault diagnosis algorithm based on transfer learning and a deep residual network to address the heavy training workload and long training time of existing deep learning methods for rolling bearing fault diagnosis. In experiments on the Case Western Reserve University and Paderborn University datasets, the algorithm achieved higher diagnostic accuracy and shorter training time than the traditional fault diagnosis algorithm, so it can be used for rapid diagnosis of bearing faults in practical environments. Udmale et al. [84], aiming at the problem of insufficient training samples in real environments, proposed a bearing diagnosis method based on small-sample transfer learning. The results show that this method achieves high fault diagnosis accuracy using only a small number of samples, which is of value for applying transfer learning to bearing fault diagnosis. Hasan et al. [85] proposed a rolling bearing fault diagnosis method based on AlexNet and transfer learning to solve the problems that traditional bearing fault diagnosis methods require complex signal processing and expert knowledge and that fault data are scarce. The results show that the proposed method achieves 100 % diagnostic accuracy on the Case Western Reserve University Bearing Data Center dataset. The method also retains high diagnostic accuracy when fault data are scarce and outperforms existing advanced methods. Wang et al. [86], aiming at the problem that bearing fault diagnosis relies on expert experience to extract features manually, proposed a fault diagnosis method based on time-frequency analysis and transfer learning with the VGG19 network. The results show that the diagnostic accuracy of this method is 5.42 % higher than that of the comparison method. The validity of the method in signal processing applications was also verified, and it can handle fault diagnosis with small samples. Hakim et al. [87] proposed a rolling bearing fault diagnosis method based on a one-dimensional CNN with multi-source domain transfer learning to address the dependence of mechanical equipment fault diagnosis on complete data and the scarcity of actual fault data. The results show that, when fault data are sparse, the classification accuracy of the proposed method is significantly higher than that of traditional fault diagnosis methods, and that it converges faster and is more stable.
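The parameter-transfer idea described above (train on the source domain, copy the convolutional parameters, fine-tune on a few target samples) can be sketched in plain Python. This is a toy illustration under stated assumptions, not the implementation of any cited work: the one-filter network, the synthetic impulse signals, the function names, and all hyperparameters are hypothetical and untuned.

```python
import math
import random


def make_dataset(rng, n, noise_std, impulse_amp, length=64, period=8):
    """Hypothetical vibration snippets: label 1 adds periodic impulses to noise."""
    data = []
    for i in range(n):
        label = i % 2
        sig = [rng.gauss(0.0, noise_std) for _ in range(length)]
        if label:
            for t in range(0, length, period):
                sig[t] += impulse_amp
        data.append((sig, label))
    return data


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0.0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def forward(params, sig):
    """One conv filter -> ReLU -> global average pooling -> logistic head."""
    w = params["conv_w"]
    pre = [sum(wj * sig[t + j] for j, wj in enumerate(w)) + params["conv_b"]
           for t in range(len(sig) - len(w) + 1)]
    feat = sum(max(z, 0.0) for z in pre) / len(pre)
    return sigmoid(params["fc_w"] * feat + params["fc_b"]), feat, pre


def mean_loss(params, data):
    """Average binary cross-entropy."""
    total = 0.0
    for sig, y in data:
        p = forward(params, sig)[0]
        total -= math.log(p + 1e-12) if y else math.log(1.0 - p + 1e-12)
    return total / len(data)


def accuracy(params, data):
    return sum((forward(params, s)[0] > 0.5) == bool(y) for s, y in data) / len(data)


def train(params, data, epochs, lr, freeze_conv=False):
    """Full-batch gradient descent; freeze_conv=True updates only the head."""
    taps = len(params["conv_w"])
    for _ in range(epochs):
        g_w, g_cb, g_fw, g_fb = [0.0] * taps, 0.0, 0.0, 0.0
        for sig, y in data:
            p, feat, pre = forward(params, sig)
            d_logit = (p - y) / len(data)
            g_fw += d_logit * feat
            g_fb += d_logit
            if freeze_conv:
                continue
            d_feat = d_logit * params["fc_w"] / len(pre)
            for t, z in enumerate(pre):
                if z > 0.0:  # ReLU passes gradient only where it is active
                    g_cb += d_feat
                    for j in range(taps):
                        g_w[j] += d_feat * sig[t + j]
        params["fc_w"] -= lr * g_fw
        params["fc_b"] -= lr * g_fb
        if not freeze_conv:
            params["conv_b"] -= lr * g_cb
            params["conv_w"] = [w - lr * g for w, g in zip(params["conv_w"], g_w)]
    return params


rng = random.Random(0)
source_train = make_dataset(rng, 40, noise_std=0.2, impulse_amp=3.0)  # plentiful
target_train = make_dataset(rng, 8, noise_std=0.4, impulse_amp=2.0)   # scarce
target_test = make_dataset(rng, 40, noise_std=0.4, impulse_amp=2.0)

# 1) Train the whole network on the source domain.
source_model = {"conv_w": [0.2] * 5, "conv_b": 0.0, "fc_w": 0.0, "fc_b": 0.0}
train(source_model, source_train, epochs=100, lr=0.5)

# 2) Transfer the convolutional parameters; start a fresh classifier head.
target_model = {"conv_w": list(source_model["conv_w"]),
                "conv_b": source_model["conv_b"], "fc_w": 0.0, "fc_b": 0.0}

# 3) Fine-tune only the head on the few labelled target samples.
train(target_model, target_train, epochs=100, lr=0.5, freeze_conv=True)
print("target test accuracy:", accuracy(target_model, target_test))
```

Freezing the transferred filter keeps the number of trainable parameters small, which is one reason fine-tuning needs fewer target samples and less training time than training from scratch. In practice some or all convolutional layers are often unfrozen with a small learning rate; which layers to freeze is a design choice that depends on how similar the two domains are.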

Thanks to the advantages of transfer learning, the application of CNN to mechanical fault diagnosis has attracted increasing attention from researchers. A convolutional neural network model is first trained with a large amount of labeled data, and its network structure or parameters are then reused in another situation where fault samples are usually scarce and a high-quality fault diagnosis model is therefore difficult to train from scratch. Migrating the trained source-domain model to the new target domain for fault identification or diagnosis not only reduces training time but also improves recognition accuracy. The combination of CNN and transfer learning thus provides technical support for fault diagnosis in new fields and under new working conditions. Other relevant literature is summarized in Table 2.

As Table 2 shows, researchers mainly study different situations and categories of fault diagnosis using feature-based and model-based transfer learning. In most cases, convolutional neural networks are used to extract features and train models, and migrating the network models to the target domain helps identify fault types there.

For a subset of the literature in Table 2, the test-set accuracy of each proposed model and of its comparison model was collected and plotted as a bar chart, as shown in Fig. 10.

Fig. 10. Reference method bar chart

In Fig. 10, the horizontal axis represents specific references and the vertical axis represents accuracy. DL denotes the combination of transfer learning with a deep learning framework represented by CNN; ML denotes the combination of machine learning and transfer learning. The figure shows that the DL results are generally more accurate than the ML results. This indicates that end-to-end fault diagnosis with a deep learning framework represented by CNN has an advantage over machine learning models that apply transfer learning after manual feature extraction, mainly because the end-to-end approach optimizes feature extraction and fault classification as a whole. In machine learning, by contrast, feature extraction and fault classification are carried out separately, so the two cannot be optimized jointly during transfer learning, which results in lower accuracy. Research and practice show that a deep learning framework can be used to extract fault features and migrate the network structure to a new working condition, after which fine-tuning the network yields better fault diagnosis results. In particular, fault diagnosis methods combining CNN and TL can effectively identify fault types in mechanical and other fields. Although transfer learning theory has been successfully applied in practice, it still requires further study.

Table 2. The classification of transfer learning

| Classification | Application situation | Advantages | Disadvantages |
| --- | --- | --- | --- |
| Feature-based | Model merging; multitask combination: rolling bearing fault diagnosis [88-91], mechanical intelligent fault diagnosis [92-98]. Small sample; migration optimization: gear fault diagnosis [99, 100], axle box bearing fault diagnosis [101] | High diagnostic accuracy; good generalization ability; high robustness; reduced distribution difference and improved recognition accuracy; adaptive adjustment of marginal and conditional distributions; improved transfer learning effect | It is difficult to select high-quality fault features that capture complex information. Adapting the source domain to the target domain is difficult; the most common method is the maximum mean discrepancy |
| Model-based | Multiple conditions; multiscale convolution: pump fault diagnosis [102, 103], partial discharge type diagnosis [104], bearing fault diagnosis [105-109], crane type recognition strategy [110]. Data fusion; hybrid input: power grid fault diagnosis [111], texture image recognition [112, 113], distracted driving behavior recognition [114] | High diagnostic accuracy and good generalization performance; tolerance of different sample requirements and distributions; less training time and smaller model size; automatic learning of deep features; small sample demand | It is difficult to train high-quality generic models. Model parameters do not converge easily |

5. Conclusion and prospect

5.1. Conclusions

As a mature deep learning model, the convolutional neural network has been widely used in the field of mechanical fault diagnosis. This paper introduces the basic structure and principle of CNN and summarizes the characteristics of its application in fault diagnosis in recent years. It also introduces the concepts of transfer learning and the application of transfer learning combined with convolutional neural networks in fault diagnosis. Finally, the remaining difficulties are described and future development directions are outlined. Although convolutional neural networks and transfer learning have achieved good results in mechanical fault diagnosis, some problems still require further study.

5.2. Difficulties

As a classic deep learning model, the convolutional neural network was introduced into mechanical equipment fault diagnosis long ago. Although some achievements have been made, the characteristics of mechanical fault diagnosis and several difficulties still limit its further development: (1) Mechanical systems are large and complex, and faults form at different levels. (2) The relationship between faults and features is not a simple linear correspondence but a complex nonlinear mapping. (3) Fault signals contain many interference factors, fault data are scarce, and the fault modes covered are incomplete. (4) The amount of data is very large, while hardware limits the information processing capacity of the computer.

5.3. Prospects

5.3.1. CNN fault diagnosis based on data imbalance

A large amount of data makes it possible to train network models effectively. At present, however, the construction of datasets faces a problem: simulated fault data are plentiful, whereas actual fault data are scarce. When a fault occurs, the equipment cannot be kept running because a major accident might follow, so the data actually monitored are mostly normal. This imbalance between fault data and normal data makes network training difficult. A generative adversarial network can be used to synthesize fault data and alleviate the training difficulties caused by data imbalance. Transfer learning can also be used to transfer a similar network structure to the target task and fine-tune the network with a small number of samples to achieve fault identification for the target task.
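Why the imbalance is a problem can be seen from a small worked example. The sketch below uses hypothetical counts (950 normal records and 50 fault records) and a hypothetical helper `accuracy_and_fault_recall`; it shows that a model which has collapsed to always predicting "normal" still reports high accuracy while detecting no faults.

```python
def accuracy_and_fault_recall(y_true, y_pred):
    """Return overall accuracy and recall on the fault class (label 1)."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    fault_preds = [p for t, p in zip(y_true, y_pred) if t == 1]
    recall = sum(p == 1 for p in fault_preds) / len(fault_preds)
    return correct / len(y_true), recall


# Hypothetical monitoring set: 950 normal records (0) and 50 fault records (1).
y_true = [0] * 950 + [1] * 50

# A degenerate classifier that always answers "normal".
always_normal = [0] * len(y_true)

acc, fault_recall = accuracy_and_fault_recall(y_true, always_normal)
print(acc, fault_recall)  # 0.95 0.0

# Number of extra fault samples (e.g. synthesized ones) needed to balance the set.
synthetic_needed = y_true.count(0) - y_true.count(1)
print(synthetic_needed)  # 900
```

For this reason, results on imbalanced fault datasets are usually judged with per-class metrics such as recall rather than with overall accuracy alone.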

5.3.2. Migration application of CNN diagnostic system in the same type of devices

The successful application of a CNN fault diagnosis system requires a large amount of training data, and it is unrealistic to carry out a large number of fault tests on every device. Therefore, if data can be mapped between devices of the same type, so that fault patterns recognized on one device can be reconstructed and migrated to another, CNN fault diagnosis systems can be applied much more widely in the field of fault diagnosis.

5.3.3. Application of CNN fault diagnosis method based on multi-sensor information fusion

With the increasing complexity of mechanical equipment, a single sensor can no longer fully characterize a fault. Collecting and integrating fault information from multiple sensors can improve the accuracy of fault identification. However, data fusion for multiple sensor types still has open problems. At present, there is no optimal method for multi-sensor information fusion, each fusion level has its own limitations, and the appropriate fusion method differs from one fault task to another. Therefore, the application of CNN fault diagnosis based on multi-sensor information fusion remains a major challenge.
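The fusion levels mentioned above are commonly distinguished as data-level, feature-level, and decision-level fusion. The following sketch contrasts the three for two sensors. All function names are hypothetical, and the hand-picked RMS and peak features merely stand in for features that, in a CNN-based system, would come from per-sensor convolutional branches.

```python
import math


def rms(sig):
    return math.sqrt(sum(x * x for x in sig) / len(sig))


def peak(sig):
    return max(abs(x) for x in sig)


def data_level_fusion(signals):
    """Stack raw, equally long signals as channels of one network input."""
    length = len(signals[0])
    if any(len(s) != length for s in signals):
        raise ValueError("data-level fusion needs aligned signals of equal length")
    return [list(s) for s in signals]


def feature_level_fusion(signals):
    """Extract features per sensor, then concatenate them into one vector."""
    return [f(s) for s in signals for f in (rms, peak)]


def decision_level_fusion(prob_vectors):
    """Average the class probabilities produced by per-sensor classifiers."""
    n = len(prob_vectors)
    return [sum(p[c] for p in prob_vectors) / n
            for c in range(len(prob_vectors[0]))]


# Hypothetical readings from a vibration sensor and a current sensor.
vibration = [0.0, 3.0, 0.0, -4.0]
current = [1.0, 1.0, 1.0, 1.0]

print(data_level_fusion([vibration, current]))      # 2 channels x 4 samples
print(feature_level_fusion([vibration, current]))   # [2.5, 4.0, 1.0, 1.0]
print(decision_level_fusion([[0.875, 0.125], [0.625, 0.375]]))  # [0.75, 0.25]
```

The sketch also hints at the trade-offs: data-level fusion requires synchronized signals of the same length, feature-level fusion depends on the quality of the per-sensor features, and decision-level fusion discards the raw correlations between sensors.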

References

  • I. Attoui, B. Oudjani, N. Boutasseta, N. Fergani, M.-S. Bouakkaz, and A. Bouraiou, “Novel predictive features using a wrapper model for rolling bearing fault diagnosis based on vibration signal analysis,” The International Journal of Advanced Manufacturing Technology, Vol. 106, No. 7-8, pp. 3409–3435, Feb. 2020, https://doi.org/10.1007/s00170-019-04729-4
  • W. Oñate, Y. Gallardo, R. Pérez, and G. Caiza, “Comparative analysis of high frequencies for the broken bar fault diagnosis using MCSA and Park’s vector demodulation,” Smart Innovation, Systems and Technologies, pp. 119–130, 2022, https://doi.org/10.1007/978-981-16-4884-7_10
  • S. Yin, S. X. Ding, X. Xie, and H. Luo, “A review on basic data-driven approaches for industrial process monitoring,” IEEE Transactions on Industrial Electronics, Vol. 61, No. 11, pp. 6418–6428, Nov. 2014, https://doi.org/10.1109/tie.2014.2301773
  • J. Chen, C. Lin, D. Peng, and H. Ge, “Fault diagnosis of rotating machinery: a review and bibliometric analysis,” IEEE Access, Vol. 8, pp. 224985–225003, 2020, https://doi.org/10.1109/access.2020.3043743
  • S. Khan and T. Yairi, “A review on the application of deep learning in system health management,” Mechanical Systems and Signal Processing, Vol. 107, pp. 241–265, Jul. 2018, https://doi.org/10.1016/j.ymssp.2017.11.024
  • G. Toh and J. Park, “Review of vibration-based structural health monitoring using deep learning,” Applied Sciences, Vol. 10, No. 5, p. 1680, Mar. 2020, https://doi.org/10.3390/app10051680
  • L. C. Brito, G. A. Susto, J. N. Brito, and M. A. V. Duarte, “An explainable artificial intelligence approach for unsupervised fault detection and diagnosis in rotating machinery,” Mechanical Systems and Signal Processing, Vol. 163, p. 108105, Jan. 2022, https://doi.org/10.1016/j.ymssp.2021.108105
  • Y. Li, G. Cheng, Y. Pang, and M. Kuai, “Planetary gear fault diagnosis via feature image extraction based on multi central frequencies and vibration signal frequency spectrum,” Sensors, Vol. 18, No. 6, p. 1735, May 2018, https://doi.org/10.3390/s18061735
  • M. M. R. Taha, A. Noureldin, J. L. Lucero, and T. J. Baca, “Wavelet transform for structural health monitoring: A compendium of uses and features,” Structural Health Monitoring, Vol. 5, No. 3, pp. 267–295, Sep. 2006, https://doi.org/10.1177/1475921706067741
  • S. Wang, J. Xiang, Y. Zhong, and Y. Zhou, “Convolutional neural network-based hidden Markov models for rolling element bearing fault identification,” Knowledge-Based Systems, Vol. 144, pp. 65–76, Mar. 2018, https://doi.org/10.1016/j.knosys.2017.12.027
  • L. Wen, X. Li, L. Gao, and Y. Zhang, “A new convolutional neural network-based data-driven fault diagnosis method,” IEEE Transactions on Industrial Electronics, Vol. 65, No. 7, pp. 5990–5998, Jul. 2018, https://doi.org/10.1109/tie.2017.2774777
  • A. Kumar and R. Kumar, “Time-frequency analysis and support vector machine in automatic detection of defect from vibration signal of centrifugal pump,” Measurement, Vol. 108, pp. 119–133, Oct. 2017, https://doi.org/10.1016/j.measurement.2017.04.041
  • L. Wan, K. Gong, G. Zhang, X. Yuan, C. Li, and X. Deng, “An efficient rolling bearing fault diagnosis method based on spark and improved random forest algorithm,” IEEE Access, Vol. 9, pp. 37866–37882, 2021, https://doi.org/10.1109/access.2021.3063929
  • S. Zhou, S. Qian, W. Chang, Y. Xiao, and Y. Cheng, “A novel bearing multi-fault diagnosis approach based on weighted permutation entropy and an improved SVM ensemble classifier,” Sensors, Vol. 18, No. 6, p. 1934, Jun. 2018, https://doi.org/10.3390/s18061934
  • C. Tutivén, Y. Vidal, A. Insuasty, L. Campoverde-Vilela, and W. Achicanoy, “Early fault diagnosis strategy for WT main bearings based on SCADA data and one-class SVM,” Energies, Vol. 15, No. 12, p. 4381, Jun. 2022, https://doi.org/10.3390/en15124381
  • X. Wang, H. Gu, T. Wang, W. Zhang, A. Li, and F. Chu, “Deep convolutional tree-inspired network: a decision-tree-structured neural network for hierarchical fault diagnosis of bearings,” Frontiers of Mechanical Engineering, Vol. 16, No. 4, pp. 814–828, Dec. 2021, https://doi.org/10.1007/s11465-021-0650-6
  • J. Gu and M. Huang, “Fault diagnosis method for bearing of high-speed train based on multitask deep learning,” Shock and Vibration, Vol. 2020, pp. 1–8, Dec. 2020, https://doi.org/10.1155/2020/8873504
  • T. Tran, F. Althobiani, T. Tinga, A. Ball, and G. Niu, “Single and combined fault diagnosis of reciprocating compressor valves using a hybrid deep belief network,” Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, Vol. 232, No. 20, pp. 3767–3780, Oct. 2018, https://doi.org/10.1177/0954406217740929
  • P. Park, P. D. Marco, H. Shin, and J. Bang, “Fault detection and diagnosis using combined autoencoder and long short-term memory network,” Sensors, Vol. 19, No. 21, p. 4612, Oct. 2019, https://doi.org/10.3390/s19214612
  • X. Yan, Y. Xu, D. She, and W. Zhang, “Reliable fault diagnosis of bearings using an optimized stacked variational denoising auto-encoder,” Entropy, Vol. 24, No. 1, p. 36, Dec. 2021, https://doi.org/10.3390/e24010036
  • A. Krizhevsky, I. Sutskever, and G. E. Hinton, “ImageNet classification with deep convolutional neural networks,” Communications of the ACM, Vol. 60, No. 6, pp. 84–90, May 2017, https://doi.org/10.1145/3065386
  • K. Weiss, T. M. Khoshgoftaar, and D. Wang, “A survey of transfer learning,” Journal of Big Data, Vol. 3, No. 1, pp. 1–40, Dec. 2016, https://doi.org/10.1186/s40537-016-0043-6
  • J. Jiao, M. Zhao, J. Lin, and K. Liang, “A comprehensive review on convolutional neural network in machine fault diagnosis,” Neurocomputing, Vol. 417, pp. 36–63, Dec. 2020, https://doi.org/10.1016/j.neucom.2020.07.088
  • C. Li, S. Zhang, Y. Qin, and E. Estupinan, “A systematic review of deep transfer learning for machinery fault diagnosis,” Neurocomputing, Vol. 407, pp. 121–135, Sep. 2020, https://doi.org/10.1016/j.neucom.2020.04.045
  • W. Li et al., “A perspective survey on deep transfer learning for fault diagnosis in industrial scenarios: Theories, applications and challenges,” Mechanical Systems and Signal Processing, Vol. 167, p. 108487, Mar. 2022, https://doi.org/10.1016/j.ymssp.2021.108487
  • G. Qian and J. Liu, “A comparative study of deep learning-based fault diagnosis methods for rotating machines in nuclear power plants,” Annals of Nuclear Energy, Vol. 178, p. 109334, Dec. 2022, https://doi.org/10.1016/j.anucene.2022.109334
  • C. Che, H. Wang, X. Ni, and Q. Fu, “Intelligent fault diagnosis method of rolling bearing based on stacked denoising autoencoder and convolutional neural network,” Industrial Lubrication and Tribology, Vol. 72, No. 7, pp. 947–953, Apr. 2020, https://doi.org/10.1108/ilt-11-2019-0496
  • M. Azamfar, J. Singh, I. Bravo-Imaz, and J. Lee, “Multisensor data fusion for gearbox fault diagnosis using 2-D convolutional neural network and motor current signature analysis,” Mechanical Systems and Signal Processing, Vol. 144, p. 106861, Oct. 2020, https://doi.org/10.1016/j.ymssp.2020.106861
  • E. J. Piedad, Y.-T. Chen, H.-C. Chang, and C.-C. Kuo, “Frequency occurrence plot-based convolutional neural network for motor fault diagnosis,” Electronics, Vol. 9, No. 10, p. 1711, Oct. 2020, https://doi.org/10.3390/electronics9101711
  • T. Du, H. Zhang, and L. Wang, “Analogue circuit fault diagnosis based on convolution neural network,” Electronics Letters, Vol. 55, No. 24, pp. 1277–1279, Nov. 2019, https://doi.org/10.1049/el.2019.2892
  • L.-J. Jin, J.-M. Zhan, J.-H. Chen, and T. Wang, “Drill pipe fault diagnosis method based on one-dimensional convolutional neural network,” Zhejiang University Journals, Vol. 54, No. 3, pp. 467–474, 2020.
  • Q. Wu, Y. Guo, H. Chen, X. Qiang, and W. Wang, “Establishment of a deep learning network based on feature extraction and its application in gearbox fault diagnosis,” Artificial Intelligence Review, Vol. 52, No. 1, pp. 125–149, Jun. 2019, https://doi.org/10.1007/s10462-019-09710-x
  • J. Niyongabo, Y. Zhang, and J. Ndikumagenge, “Bearing fault detection and diagnosis based on densely connected convolutional networks,” Acta Mechanica et Automatica, Vol. 16, No. 2, pp. 130–135, Jun. 2022, https://doi.org/10.2478/ama-2022-0017
  • F. An and J. Wang, “Rolling bearing fault diagnosis algorithm using overlapping group sparse-deep complex convolutional neural network,” Nonlinear Dynamics, Vol. 108, No. 3, pp. 2353–2368, May 2022, https://doi.org/10.1007/s11071-022-07314-9
  • X. Li, W. Zhang, and Q. Ding, “Deep learning-based remaining useful life estimation of bearings using multi-scale feature extraction,” Reliability Engineering and System Safety, Vol. 182, pp. 208–218, Feb. 2019, https://doi.org/10.1016/j.ress.2018.11.011
  • M. J. Hasan, M. M. M. Islam, and J.-M. Kim, “Bearing fault diagnosis using multidomain fusion-based vibration imaging and multitask learning,” Sensors, Vol. 22, No. 1, p. 56, Dec. 2021, https://doi.org/10.3390/s22010056
  • D. Yu and H. Zhang, “Fault diagnosis method for submersible reciprocating pumping unit based on deep belief network,” IEEE Access, Vol. 8, pp. 109940–109948, 2020, https://doi.org/10.1109/access.2020.3002376
  • M. T. Nguyen and J. H. Huang, “Fault detection in water pumps based on sound analysis using a deep learning technique,” Proceedings of the Institution of Mechanical Engineers, Part E: Journal of Process Mechanical Engineering, Vol. 236, No. 2, p. 095440892110393, Aug. 2021, https://doi.org/10.1177/09544089211039304
  • X. Zhang, M. Zhang, Z. Xiang, and J. Mo, “Research on diagnosis algorithm of mechanical equipment brake friction fault based on MCNN-SVM,” Measurement, Vol. 186, p. 110065, Dec. 2021, https://doi.org/10.1016/j.measurement.2021.110065
  • N. Kuzhagaliyeva, A. Thabet, E. Singh, B. Ghanem, and S. M. Sarathy, “Using deep neural networks to diagnose engine pre-ignition,” Proceedings of the Combustion Institute, Vol. 38, No. 4, pp. 5915–5922, 2021, https://doi.org/10.1016/j.proci.2020.10.001
  • R. Liu, G. Meng, B. Yang, C. Sun, and X. Chen, “Dislocated time series convolutional neural architecture: An intelligent fault diagnosis approach for electric machine,” IEEE Transactions on Industrial Informatics, Vol. 13, No. 3, pp. 1310–1320, Jun. 2017, https://doi.org/10.1109/tii.2016.2645238
  • Z. Meng, W. Cao, D. Sun, Q. Li, W. Ma, and F. Fan, “Research on fault diagnosis method of MS-CNN rolling bearing based on local central moment discrepancy,” Advanced Engineering Informatics, Vol. 54, p. 101797, Oct. 2022, https://doi.org/10.1016/j.aei.2022.101797
  • Y. Wang, C. Bai, X. Qian, W. Liu, C. Zhu, and L. Ge, “A DC series arc fault detection method based on a lightweight convolutional neural network used in photovoltaic system,” Energies, Vol. 15, No. 8, p. 2877, Apr. 2022, https://doi.org/10.3390/en15082877
  • B. Wang, G. Feng, D. Huo, and Y. Kang, “A bearing fault diagnosis method based on spectrum map information fusion and convolutional neural network,” Processes, Vol. 10, No. 7, p. 1426, Jul. 2022, https://doi.org/10.3390/pr10071426
  • L. Wan, Y. Chen, H. Li, and C. Li, “Rolling-element bearing fault diagnosis using improved lenet-5 network,” Sensors, Vol. 20, No. 6, p. 1693, Mar. 2020, https://doi.org/10.3390/s20061693
  • M. T. Pham, J.-M. Kim, and C. H. Kim, “Accurate bearing fault diagnosis under variable shaft speed using convolutional neural networks and vibration spectrogram,” Applied Sciences, Vol. 10, No. 18, p. 6385, Sep. 2020, https://doi.org/10.3390/app10186385
  • S. Xiong et al., “A novel end-to-end fault diagnosis approach for rolling bearings by integrating wavelet packet transform into convolutional neural network structures,” Sensors, Vol. 20, No. 17, p. 4965, Sep. 2020, https://doi.org/10.3390/s20174965
  • V. Sinitsin, O. Ibryaeva, V. Sakovskaya, and V. Eremeeva, “Intelligent bearing fault diagnosis method combining mixed input and hybrid CNN-MLP model,” Mechanical Systems and Signal Processing, Vol. 180, p. 109454, Nov. 2022, https://doi.org/10.1016/j.ymssp.2022.109454
  • X. Yu, F. Dong, E. Ding, S. Wu, and C. Fan, “Rolling bearing fault diagnosis using modified LFDA and EMD with sensitive feature selection,” IEEE Access, Vol. 6, pp. 3715–3730, 2018, https://doi.org/10.1109/access.2017.2773460
  • L. Eren, T. Ince, and S. Kiranyaz, “A generic intelligent bearing fault diagnosis system using compact adaptive 1D CNN classifier,” Journal of Signal Processing Systems, Vol. 91, No. 2, pp. 179–189, Feb. 2019, https://doi.org/10.1007/s11265-018-1378-3
  • J. Zheng, H. Pan, and J. Cheng, “Rolling bearing fault detection and diagnosis based on composite multiscale fuzzy entropy and ensemble support vector machines,” Mechanical Systems and Signal Processing, Vol. 85, pp. 746–759, Feb. 2017, https://doi.org/10.1016/j.ymssp.2016.09.010
  • C. Lu, Z. Wang, and B. Zhou, “Intelligent fault diagnosis of rolling bearing using hierarchical convolutional network based health state classification,” Advanced Engineering Informatics, Vol. 32, pp. 139–151, Apr. 2017, https://doi.org/10.1016/j.aei.2017.02.005
  • M. M. M. Islam and J.-M. Kim, “Automated bearing fault diagnosis scheme using 2D representation of wavelet packet transform and deep convolutional neural network,” Computers in Industry, Vol. 106, pp. 142–153, Apr. 2019, https://doi.org/10.1016/j.compind.2019.01.008
  • R. N. Toma, F. Piltan, and J.-M. Kim, “A deep autoencoder-based convolution neural network framework for bearing fault classification in induction motors,” Sensors, Vol. 21, No. 24, p. 8453, Dec. 2021, https://doi.org/10.3390/s21248453
  • M.-T. Pham, J.-M. Kim, and C.-H. Kim, “2D CNN-based multi-output diagnosis for compound bearing faults under variable rotational speeds,” Machines, Vol. 9, No. 9, p. 199, Sep. 2021, https://doi.org/10.3390/machines9090199
  • R. Nishat Toma, C.-H. Kim, and J.-M. Kim, “Bearing fault classification using ensemble empirical mode decomposition and convolutional neural network,” Electronics, Vol. 10, No. 11, p. 1248, May 2021, https://doi.org/10.3390/electronics10111248
  • R. N. Toma et al., “A bearing fault classification framework based on image encoding techniques and a convolutional neural network under different operating conditions,” Sensors, Vol. 22, No. 13, p. 4881, Jun. 2022, https://doi.org/10.3390/s22134881
  • J. Lingli, L. Shuhui, L. Xuejun, L. Jiale, and Y. Dalian, “Fault diagnosis of a planetary gearbox based on a local bi-spectrum and a convolutional neural network,” Measurement Science and Technology, Vol. 33, No. 4, p. 045008, Apr. 2022, https://doi.org/10.1088/1361-6501/ac471a
  • C. D. Nguyen, Z. Ahmad, and J.-M. Kim, “Gearbox fault identification framework based on novel localized adaptive denoising technique, wavelet-based vibration imaging, and deep convolutional neural network,” Applied Sciences, Vol. 11, No. 16, p. 7575, Aug. 2021, https://doi.org/10.3390/app11167575
  • W. Sun et al., “An intelligent gear fault diagnosis methodology using a complex wavelet enhanced convolutional neural network,” Materials, Vol. 10, No. 7, p. 790, Jul. 2017, https://doi.org/10.3390/ma10070790
  • S. Sun et al., “Fault diagnosis of conventional circuit breaker contact system based on time-frequency analysis and improved AlexNet,” IEEE Transactions on Instrumentation and Measurement, Vol. 70, pp. 1–12, 2021, https://doi.org/10.1109/tim.2020.3045798
  • L. Ciabattoni, F. Ferracuti, A. Freddi, and A. Monteriu, “Statistical spectral analysis for fault diagnosis of rotating machines,” IEEE Transactions on Industrial Electronics, Vol. 65, No. 5, pp. 4301–4310, May 2018, https://doi.org/10.1109/tie.2017.2762623
  • R. Liu, B. Yang, E. Zio, and X. Chen, “Artificial intelligence for fault diagnosis of rotating machinery: A review,” Mechanical Systems and Signal Processing, Vol. 108, pp. 33–47, Aug. 2018, https://doi.org/10.1016/j.ymssp.2018.02.016
  • X. Wu, Z. Peng, J. Ren, C. Cheng, W. Zhang, and D. Wang, “Rub-impact fault diagnosis of rotating machinery based on 1-D convolutional neural networks,” IEEE Sensors Journal, Vol. 20, No. 15, pp. 8349–8363, Aug. 2020, https://doi.org/10.1109/jsen.2019.2944157
  • J. Yang, S. Yin, Y. Chang, and T. Gao, “A fault diagnosis method of rotating machinery based on one-dimensional, self-normalizing convolutional neural networks,” Sensors, Vol. 20, No. 14, p. 3837, Jul. 2020, https://doi.org/10.3390/s20143837
  • H. Oh, J. H. Jung, B. C. Jeon, and B. D. Youn, “Scalable and unsupervised feature engineering using vibration-imaging and deep learning for rotor system diagnosis,” IEEE Transactions on Industrial Electronics, Vol. 65, No. 4, pp. 3539–3549, Apr. 2018, https://doi.org/10.1109/tie.2017.2752151
  • M. Hakim, A. A. B. Omran, A. N. Ahmed, M. Al-Waily, and A. Abdellatif, “A systematic review of rolling bearing fault diagnoses based on deep learning and transfer learning: Taxonomy, overview, application, open challenges, weaknesses and recommendations,” Ain Shams Engineering Journal, Vol. 14, No. 4, p. 101945, Apr. 2023, https://doi.org/10.1016/j.asej.2022.101945
  • O. Abdeljaber, O. Avci, S. Kiranyaz, M. Gabbouj, and D. J. Inman, “Real-time vibration-based structural damage detection using one-dimensional convolutional neural networks,” Journal of Sound and Vibration, Vol. 388, pp. 154–170, Feb. 2017, https://doi.org/10.1016/j.jsv.2016.10.043
  • C. Wu, P. Jiang, C. Ding, F. Feng, and T. Chen, “Intelligent fault diagnosis of rotating machinery based on one-dimensional convolutional neural network,” Computers in Industry, Vol. 108, pp. 53–61, Jun. 2019, https://doi.org/10.1016/j.compind.2018.12.001
  • T. Ince, “Real-time broken rotor bar fault detection and classification by shallow 1D convolutional neural networks,” Electrical Engineering, Vol. 101, No. 2, pp. 599–608, Jun. 2019, https://doi.org/10.1007/s00202-019-00808-7
  • Z. Mo, Z. Zhang, and K.-L. Tsui, “The variational kernel-based 1-D convolutional neural network for machinery fault diagnosis,” IEEE Transactions on Instrumentation and Measurement, Vol. 70, pp. 1–10, 2021, https://doi.org/10.1109/tim.2021.3105252
  • L. Eren, “Bearing fault detection by one-dimensional convolutional neural networks,” Mathematical Problems in Engineering, Vol. 2017, pp. 1–9, 2017, https://doi.org/10.1155/2017/8617315
  • J. He, P. Wu, Y. Tong, X. Zhang, M. Lei, and J. Gao, “Bearing fault diagnosis via improved one-dimensional multi-scale dilated CNN,” Sensors, Vol. 21, No. 21, p. 7319, Nov. 2021, https://doi.org/10.3390/s21217319
  • Q. Wang, C. Yang, H. Wan, D. Deng, and A. K. Nandi, “Bearing fault diagnosis based on optimized variational mode decomposition and 1D convolutional neural networks,” Measurement Science and Technology, Vol. 32, No. 10, p. 104007, Oct. 2021, https://doi.org/10.1088/1361-6501/ac0034
  • X. Wang, D. Mao, and X. Li, “Bearing fault diagnosis based on vibro-acoustic data fusion and 1D-CNN network,” Measurement, Vol. 173, p. 108518, Mar. 2021, https://doi.org/10.1016/j.measurement.2020.108518
  • C. Yin, Y. Wang, Y. He, L. Liu, Y. Wang, and G. Yue, “Early fault diagnosis of ball screws based on 1-D convolution neural network and orthogonal design,” Proceedings of the Institution of Mechanical Engineers, Part O: Journal of Risk and Reliability, Vol. 235, No. 5, pp. 783–797, Oct. 2021, https://doi.org/10.1177/1748006x21992886
  • Z. Xing, R. Zhao, Y. Wu, and T. He, “Intelligent fault diagnosis of rolling bearing based on novel CNN model considering data imbalance,” Applied Intelligence, Vol. 52, No. 14, pp. 16281–16293, Nov. 2022, https://doi.org/10.1007/s10489-022-03196-x
  • S. Liu and L. Fan, “An adaptive prediction approach for rolling bearing remaining useful life based on multistage model with three-source variability,” Reliability Engineering and System Safety, Vol. 218, p. 108182, Feb. 2022, https://doi.org/10.1016/j.ress.2021.108182
  • S.-S. Zhong, S. Fu, and L. Lin, “A novel gas turbine fault diagnosis method based on transfer learning with CNN,” Measurement, Vol. 137, pp. 435–453, Apr. 2019, https://doi.org/10.1016/j.measurement.2019.01.022
  • K. Zhao, H. Jiang, K. Wang, and Z. Pei, “Joint distribution adaptation network with adversarial learning for rolling bearing fault diagnosis,” Knowledge-Based Systems, Vol. 222, p. 106974, Jun. 2021, https://doi.org/10.1016/j.knosys.2021.106974
  • S. Kumaresan, K. S. J. Aultrin, S. S. Kumar, and M. D. Anand, “Transfer learning with CNN for classification of weld defect,” IEEE Access, Vol. 9, pp. 95097–95108, 2021, https://doi.org/10.1109/access.2021.3093487
  • M. Xia, H. Shao, D. Williams, S. Lu, L. Shu, and C. W. de Silva, “Intelligent fault diagnosis of machinery using digital twin-assisted deep transfer learning,” Reliability Engineering and System Safety, Vol. 215, p. 107938, Nov. 2021, https://doi.org/10.1016/j.ress.2021.107938
  • Y. Qin, “A new family of model-based impulsive wavelets and their sparse representation for rolling bearing fault diagnosis,” IEEE Transactions on Industrial Electronics, Vol. 65, No. 3, pp. 2716–2726, Mar. 2018, https://doi.org/10.1109/tie.2017.2736510
  • S. S. Udmale, S. K. Singh, R. Singh, and A. K. Sangaiah, “Multi-fault bearing classification using sensors and ConvNet-based transfer learning approach,” IEEE Sensors Journal, Vol. 20, No. 3, pp. 1433–1444, Feb. 2020, https://doi.org/10.1109/jsen.2019.2947026
  • M. J. Hasan, M. Sohaib, and J.-M. Kim, “1D CNN-based transfer learning model for bearing fault diagnosis under variable working conditions,” in Advances in Intelligent Systems and Computing, pp. 13–23, 2019, https://doi.org/10.1007/978-3-030-03302-6_2
  • T. Wang, Z. Liu, G. Lu, and J. Liu, “Temporal-spatio graph based spectrum analysis for bearing fault detection and diagnosis,” IEEE Transactions on Industrial Electronics, Vol. 68, No. 3, pp. 2598–2607, Mar. 2021, https://doi.org/10.1109/tie.2020.2975499
  • M. Hakim et al., “Bearing fault diagnosis using lightweight and robust one-dimensional convolution neural network in the frequency domain,” Sensors, Vol. 22, No. 15, p. 5793, Aug. 2022, https://doi.org/10.3390/s22155793
  • M. Cerrada et al., “A review on data-driven fault severity assessment in rolling bearings,” Mechanical Systems and Signal Processing, Vol. 99, pp. 169–196, Jan. 2018, https://doi.org/10.1016/j.ymssp.2017.06.012
  • Y. An, K. Zhang, Q. Liu, Y. Chai, and X. Huang, “Deep transfer learning network for fault diagnosis under variable working conditions,” 2021 CAA Symposium on Fault Detection, Supervision, and Safety for Technical Processes (SAFEPROCESS), Vol. 11, No. 12, p. 16878, Dec. 2021, https://doi.org/10.1109/safeprocess52771.2021.9693606
  • M. J. Hasan, M. Sohaib, and J.-M. Kim, “A multitask-aided transfer learning-based diagnostic framework for bearings under inconsistent working conditions,” Sensors, Vol. 20, No. 24, p. 7205, Dec. 2020, https://doi.org/10.3390/s20247205
  • R. Zhang and Y. Gu, “A transfer learning framework with a one-dimensional deep subdomain adaptation network for bearing fault diagnosis under different working conditions,” Sensors, Vol. 22, No. 4, p. 1624, Feb. 2022, https://doi.org/10.3390/s22041624
  • L. Jiang, C. Zheng, and Y. Li, “Rotating machinery fault diagnosis based on transfer learning and an improved convolutional neural network,” Measurement Science and Technology, Vol. 33, No. 10, p. 105012, Oct. 2022, https://doi.org/10.1088/1361-6501/ac7d3d
  • C. Öztürk, M. Taşyürek, and M. U. Türkdamar, “Transfer learning and fine‐tuned transfer learning methods’ effectiveness analyse in the CNN‐based deep learning models,” Concurrency and Computation: Practice and Experience, Vol. 35, No. 4, Feb. 2023, https://doi.org/10.1002/cpe.7542
  • A. Khorram, M. Khalooei, and M. Rezghi, “End-to-end CNN+LSTM deep learning approach for bearing fault diagnosis,” arXiv, Vol. 51, pp. 736–751, 2019, https://doi.org/10.48550/arxiv.1909.07801
  • J. Shuai, C. Shen, and Z. Zhu, “Adaptive morphological feature extraction and support vector regressive classification for bearing fault diagnosis,” International Journal of Rotating Machinery, Vol. 2017, pp. 1–10, 2017, https://doi.org/10.1155/2017/2384184
  • J. Sun, C. Yan, and J. Wen, “Intelligent bearing fault diagnosis method combining compressed data acquisition and deep learning,” IEEE Transactions on Instrumentation and Measurement, Vol. 67, No. 1, pp. 185–195, Jan. 2018, https://doi.org/10.1109/tim.2017.2759418
  • J. Zhu, N. Chen, and C. Shen, “A new deep transfer learning method for bearing fault diagnosis under different working conditions,” IEEE Sensors Journal, Vol. 20, No. 15, pp. 8394–8402, Aug. 2020, https://doi.org/10.1109/jsen.2019.2936932
  • R. Zhao, R. Yan, Z. Chen, K. Mao, P. Wang, and R. X. Gao, “Deep learning and its applications to machine health monitoring,” Mechanical Systems and Signal Processing, Vol. 115, pp. 213–237, Jan. 2019, https://doi.org/10.1016/j.ymssp.2018.05.050
  • P. Cao, S. Zhang, and J. Tang, “Preprocessing-free gear fault diagnosis using small datasets with deep convolutional neural network-based transfer learning,” IEEE Access, Vol. 6, pp. 26241–26253, 2018, https://doi.org/10.1109/access.2018.2837621
  • A. Ibrahim, F. Anayi, and M. Packianather, “New transfer learning approach based on a CNN for fault diagnosis,” IECMA 2022, Vol. 24, No. 1, p. 16, Sep. 2022, https://doi.org/10.3390/iecma2022-12905
  • M. J. Afridi, A. Ross, and E. M. Shapiro, “On automated source selection for transfer learning in convolutional neural networks,” Pattern Recognition, Vol. 73, pp. 65–75, Jan. 2018, https://doi.org/10.1016/j.patcog.2017.07.019
  • H. Cheng, H. Yu, P. Zeng, E. Osipov, S. Li, and V. Vyatkin, “Automatic recognition of sucker-rod pumping system working conditions using dynamometer cards with transfer learning and SVM,” Sensors, Vol. 20, No. 19, p. 5659, Oct. 2020, https://doi.org/10.3390/s20195659
  • Y. He, H. Tang, Y. Ren, and A. Kumar, “A deep multi-signal fusion adversarial model based transfer learning and residual network for axial piston pump fault diagnosis,” Measurement, Vol. 192, p. 110889, Mar. 2022, https://doi.org/10.1016/j.measurement.2022.110889
  • D. Xiao, Y. Huang, C. Qin, Z. Liu, Y. Li, and C. Liu, “Transfer learning with convolutional neural networks for small sample size problem in machinery fault diagnosis,” Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, Vol. 233, No. 14, pp. 5131–5143, Jul. 2019, https://doi.org/10.1177/0954406219840381
  • M. J. Hasan, M. M. M. Islam, and J.-M. Kim, “Acoustic spectral imaging and transfer learning for reliable bearing fault diagnosis under variable speed conditions,” Measurement, Vol. 138, pp. 620–631, May 2019, https://doi.org/10.1016/j.measurement.2019.02.075
  • B. Zhao, X. Zhang, Z. Zhan, and S. Pang, “Deep multi-scale convolutional transfer learning network: A novel method for intelligent fault diagnosis of rolling bearings under variable working conditions and domains,” Neurocomputing, Vol. 407, pp. 24–38, Sep. 2020, https://doi.org/10.1016/j.neucom.2020.04.073
  • M. Liao, C. Liu, C. Wang, and J. Yang, “Research on a rolling bearing fault detection method with wavelet convolution deep transfer learning,” IEEE Access, Vol. 9, pp. 45175–45188, 2021, https://doi.org/10.1109/access.2021.3067152
  • B. Yang, Y. Lei, F. Jia, and S. Xing, “An intelligent fault diagnosis approach based on transfer learning from laboratory bearings to locomotive bearings,” Mechanical Systems and Signal Processing, Vol. 122, pp. 692–706, May 2019, https://doi.org/10.1016/j.ymssp.2018.12.051
  • Y. Zou, Y. Liu, J. Deng, Y. Jiang, and W. Zhang, “A novel transfer learning method for bearing fault diagnosis under different working conditions,” Measurement, Vol. 171, p. 108767, Feb. 2021, https://doi.org/10.1016/j.measurement.2020.108767
  • D. Li, Y. Wang, W.-J. Yan, and W.-X. Ren, “Acoustic emission wave classification for rail crack monitoring based on synchrosqueezed wavelet transform and multi-branch convolutional neural network,” Structural Health Monitoring, Vol. 20, No. 4, pp. 1563–1582, Jul. 2021, https://doi.org/10.1177/1475921720922797
  • J. Kim and K.-I. Kim, “Partial discharge online detection for long-term operational sustainability of on-site low voltage distribution network using CNN transfer learning,” Sustainability, Vol. 13, No. 9, p. 4692, Apr. 2021, https://doi.org/10.3390/su13094692
  • M. Ahishali, S. Kiranyaz, T. Ince, and M. Gabbouj, “Dual and single polarized SAR image classification using compact convolutional neural networks,” Remote Sensing, Vol. 11, No. 11, p. 1340, Jun. 2019, https://doi.org/10.3390/rs11111340
  • M. J. Hasan and J.-M. Kim, “Bearing fault diagnosis under variable rotational speeds using Stockwell transform-based vibration imaging and transfer learning,” Applied Sciences, Vol. 8, No. 12, p. 2357, Nov. 2018, https://doi.org/10.3390/app8122357
  • F. Omerustaoglu, C. O. Sakar, and G. Kar, “Distracted driver detection by combining in-vehicle and image data using deep learning,” Applied Soft Computing, Vol. 96, p. 106657, Nov. 2020, https://doi.org/10.1016/j.asoc.2020.106657

About this article

Received
11 May 2023
Accepted
11 September 2023
Published
04 November 2023
Subjects
Fault diagnosis based on vibration signal analysis
Keywords
convolutional neural network (CNN)
fault diagnosis
transfer learning (TL)
data-driven
Acknowledgements

This work was supported by the Key Scientific and Technological Project of Henan Province (222102240077 and 232102221040), and partly supported by the Key Research and Development Projects of Henan Province in 2022 (221111240200) and the Major Science and Technology Projects of Henan Province in 2022 (221100220200).

Data Availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

Author Contributions

Junjian Hou is responsible for the idea and methodology development; Xikang Lu is responsible for validation and manuscript writing; Yudong Zhong is responsible for idea development and supervision; Wenbin He is responsible for methodology discussion; Dengfeng Zhao is responsible for manuscript refinement; Fang Zhou is responsible for refinement and revision. Apart from the above contributions, Junjian Hou and Yudong Zhong are also responsible for funding support and manuscript finalization.

Conflict of Interest

The authors declare that they have no conflict of interest.