Abstract
Gearbox is the key component of mechanical transmission system. Accurate fault diagnosis of gearbox is of great significance to ensure the operation of rotating machinery. Based on the comprehensive simulation testbed in the laboratory, a gearbox fault diagnosis method based on QPSOKELM is proposed. Firstly, the fault pre planting experiments of gear fault, bearing fault and gear bearing mixed fault are carried out on the comprehensive simulation testbed. Then, the vibration signals collected are preprocessed by TSA to eliminate noise. The time domain, frequency domain and NASA feature parameters of the preprocessed signals are taken as training samples and test samples of QPSOKELM. The experimental results show that the proposed method can effectively solve the problem of gearbox fault pattern recognition, and the fault diagnosis accuracy is higher than traditional methods, so the research has certain reference significance and engineering application value.
1. Introduction
Gearbox is widely used in fans, machine tools, vehicles and other equipments. As a key component of mechanical transmission system, gearbox will cause huge economic losses and even casualties in case of failure during equipment operation. Therefore, the earlier the fault is found in the gearbox and the earlier the maintenance is carried out, the more loss can be reduced. The main components of gearbox are shaft, bearing and gear, among which gear failure and bearing failure are quite common.
At present, the mechanical transmission system fault diagnosis research is extensive, and the method is diverse. As the most mainstream classification method, intelligent classification algorithm has achieved good results in the field of equipment fault diagnosis. Common intelligent classification algorithms include neural network [1, 2], support vector machine (SVM) [35], kernel extreme learning machine (KELM) [6, 7], deep learning [810] and other methods. Wang [2] proposed a fault diagnosis method based on RDGWPRMSE and PNN, which is used to realize the automatic fault identification of electric submersible pump. Wang and Yan [4] used the energy of IMF component after SVD decomposition as feature parameters to input and train SVM model, so as to realize bearing fault diagnosis. Saari et al. [5] proposed a fault diagnosis method for wind turbine bearing based on one class SVM. Iosifidis A et al. [7] studied the classification method of KELM, and achieved good results. Yang Yu et al. [9] proposed a structure adaptive DBN algorithm to solve the problem of difficult to determine the structure of DBN, which was successfully applied to the fault classification of rolling bearings. Shao et al. [10] successfully diagnosed the bearing fault by combining the dual tree complex wavelet packet transform with the adaptive DBN algorithm.
Compared with these methods, KELM has a stronger comprehensive advantage in sample size, network generalization, calculation speed and accuracy, so KELM is more suitable for solving gearbox fault diagnosis problems that require higher calculation speed and accuracy. Since the classification capability of KELM network is greatly affected by the value of kernel parameters and penalty coefficient, it is necessary to optimize the above structure parameters. Currently, genetic algorithm [11], fish swarm algorithm [12], whale algorithm[13], ant colony algorithm [14], wolf algorithm [15], particle swarm algorithm [16] and other intelligent algorithms have been widely applied in the field of parameter optimization, such as Xu [13] used whale algorithm to optimize wavelet kernel extreme learning machine (WKELM) and Wang [17] used the modified WKELM wolves algorithm in diagnosing rolling bearing fault; Wu [18] used ant colony algorithm for fault diagnosis of rotating mechanical equipment. Pei [19] used particle swarm optimization algorithm to optimize KELM for transformer fault diagnosis. Liu [20] proposed the method of combining the variational mode decomposition (VMD) with the improved KELM for engine fault diagnosis. However, most of the above parameter optimization methods have the problem of long calculation time and easy to fall into the local optimum. Therefore, the QPSOKELM (KELMbased on Quantum Particle Swarm Optimization) [2123] was put forward. With strong global search ability, the QPSO can find the optimal parameters of KELM, thereby, the learning speed and classification accuracy of KELM can be improved and the accuracy of gearbox fault diagnosis will also be improved.
2. Fault diagnosis strategy of gearbox based on QPSOKELM
The fault diagnosis process of gearbox based on QPSOKELM is described as follows:
(1) Signal acquisition. In general, the closer the sensor is to the source of the vibration, the better the signal will be.
(2) Extract the fault characteristic parameters. The timedomain characteristic parameters, frequencydomain characteristic parameters and the characteristic parameters in the technical report of NASA were extracted from the collected signals. Then, the corresponding training set and test set were selected from all samples in a ratio of 3:1.
(3) The training of KELM. Aiming at maximizing the training accuracy, the training set is input to KELM, and the QPSO algorithm is used to search the optimal parameters. After that, the trained KELM can be obtained.
(4) Calculate the test accuracy. Input the test samples to the trained KELM, the correct classification times of test samples will be calculated and the test accuracy will be calculated.
The above process is shown in Fig. 1.
Fig. 1The process of fault diagnosis of gearbox based on QPSOKELM
In the Fig. 1, the gearbox fault diagnosis process also involves timedomain synchronous average algorithm [24], timedomain synchronous resampling algorithm [25, 26], prewhitening algorithm [27], relevant fault characteristic parameters, KELM, QPSO and other relevant contents.
2.1. Signal preprocessing
During the signal acquisition process, the signal are easy interfered by other vibration signals, such as the vibration of the bearing of the equipment itself, the vibration of the motor, the meshing vibration of the gear independent of the synchronous shaft and the vibration from other machines and equipment. As a signal preprocessing technology, time synchronous averaging (TSA) technology can eliminate the signal components irrelevant to the synchronous shaft and enhance the fault signals of the synchronous shaft gear and its meshing gear, making the gear fault diagnosis easier. Where, the reference axis is the synchronous axis and the nonreference axis is the nonsynchronous axis. The theoretical equation of TSA is as follows:
where, ${y}_{n}$ is the TSA signal, $x$ is the original signal, and $M$ is the average times.
In the actual process, the implementation steps of TSA are as follows:
(1) Determine the zero crossing point of the synchronous axis according to the speed pulse signal, that is, the starting or ending position of each rotation of the synchronous axis.
(2) According to the position of zero crossing, divide the vibration acceleration signal into $M1$ segments, and each segment represents the sampling signal of the integer rotation of the synchronous axis.
(3) Perform cubic curve interpolation for each signal segment, so that the sampling points of each signal segment are the same. The newly interpolated sampling points are $n={2}^{ceiling\left(log2\left(g\right)\right)}$, where $g$ is the mean of sampling points of each signal segment.
(4) Stack and average the resampled signals to obtain the timedomain synchronous average signal.
Based on the speed signal sampled at the same angle, TSA resamples the timedomain vibration acceleration signal to ensure that each revolution has the same number of sampling points, that is, equal angle sampling. The speed signal is usually obtained by the photoelectric sensor, and a rotation of the axis produces several pulses. The more pulses generated by a rotation, the more accurate the speed signal will be. The position of zero crossing can be obtained by rotating speed signal, and it is the key of the TSA algorithm. The gear fault data is mainly the fault characteristic parameters extracted from the timedomain synchronous average signal.
2.2. Extract feature parameters [28]
After preprocessing, the feature extraction of the preprocessed signal is carried out. The feature parameters are the basis for feature classification and fault recognition. The feature parameters adopted in this paper include timedomain feature parameters, frequencydomain feature parameters and some feature parameters mentioned in the report of NASA.
Timedomain characteristic parameters include: maximum (${X}_{1}$), minimum (${X}_{2}$), peaktopeak (${X}_{3}$), mean value (${X}_{4}$), mean square value (${X}_{5}$), root mean square (${X}_{6}$), variance (${X}_{7}$), standard deviation (${X}_{8}$), energy (${X}_{9}$), root square amplitude (${X}_{10}$), mean square amplitude (${X}_{11}$), mean square amplitude (${X}_{12}$), kurtosis (${X}_{13}$), skewness (${X}_{14}$), waveform index (${X}_{15}$), peak index (${X}_{16}$), pulse index (X_{17}), margin index (${X}_{18}$), clearance coefficient (${X}_{19}$), etc:
${X}_{9}=\sqrt{\frac{1}{N}\sum _{i=1}^{N}{\left({x}_{i}{X}_{4}\right)}^{2}},{X}_{10}=\sum _{i=1}^{N}{x}_{i}^{2},{X}_{11}=\frac{\frac{1}{N}\sum _{i=1}^{N}{\left({x}_{i}{X}_{4}\right)}^{3}}{{X}_{6}^{3}},$
${X}_{12}=\frac{\frac{1}{N}\sum _{i=1}^{N}{\left({x}_{i}{X}_{4}\right)}^{4}}{{X}_{6}^{4}},{X}_{13}={\left(\frac{1}{N}\sum _{i=1}^{N}{\left{x}_{i}\right}^{1/2}\right)}^{2},{X}_{14}=\frac{1}{N}\sum _{i=1}^{N}\left{x}_{i}\right,$
${X}_{15}=\frac{{X}_{6}}{\left{X}_{14}\right},{X}_{16}=\frac{{X}_{7}}{{X}_{6}},{X}_{17}=\frac{{X}_{7}}{{X}_{14}},{X}_{18}=\frac{{X}_{7}}{{X}_{13}},{X}_{19}=\frac{{X}_{1}}{{X}_{3}}.$
Some common statistical features in the frequency domain are shown below. Feature ${p}_{1}$ namely mean frequency. Features ${p}_{2}$${p}_{4}$, ${p}_{6}$, and ${p}_{11}$${p}_{13}$ describe the convergence of the spectrum power, reflecting the energy of frequency spectrum. Features ${p}_{5}$ and ${p}_{7}$${p}_{9}$ show the change of main frequencies which are dominant in the frequency spectrum. The features in frequencydomain are calculated from the spectrum of the original signal, which contains more effective information than timedomain features:
${p}_{4}=\frac{\sum _{k=1}^{K}\left(\right(s\left(k\right){p}_{1}){)}^{4}}{K{{p}_{2}}^{2}},{p}_{5}={X}_{fc}=\frac{\sum _{k=1}^{K}{f}_{k}s\left(k\right)}{\sum _{k=1}^{K}s\left(k\right)},{p}_{6}=\sqrt{\frac{\sum _{k=1}^{K}({f}_{k}{p}_{5}{)}^{2}s(k)}{K}},$
${p}_{7}={X}_{rmsf}=\sqrt{\frac{\sum _{k=1}^{K}{{f}_{k}}^{2}s\left(k\right)}{\sum _{k=1}^{K}s\left(k\right)}},{p}_{8}=\sqrt{\frac{\sum _{k=1}^{K}{{f}_{k}}^{4}s\left(k\right)}{\sum _{k=1}^{K}{{f}_{k}}^{2}s\left(k\right)}},{p}_{9}=\frac{\sum _{k=1}^{K}{{f}_{k}}^{2}s\left(k\right)}{\sqrt{\sum _{k=1}^{K}s\left(k\right)\sum _{k=1}^{K}{{f}_{k}}^{4}s\left(k\right)}},$
${p}_{10}=\frac{{p}_{6}}{{p}_{5}},{p}_{11}=\frac{\sum _{k=1}^{K}({f}_{k}{p}_{5}{)}^{3}s(k)}{K{{p}_{6}}^{3}},{p}_{12}=\frac{\sum _{k=1}^{K}({f}_{k}{p}_{5}{)}^{4}s(k)}{K{{p}_{6}}^{4}},$
${p}_{13}=\frac{\sum _{k=1}^{K}({f}_{k}{p}_{5}{)}^{1/2}s(k)}{K\sqrt{{p}_{6}}},{p}_{14}=\sqrt{\frac{\sum _{k=1}^{K}({f}_{k}{p}_{5}{)}^{2}s(k)}{\sum _{k=1}^{K}s\left(k\right)}}.$
In addition to the common timedomain and frequencydomain features, NASA also proposed some features, such as FM0, ER, FM4, FM4*, M6A, M6A*, M8A, M8A*, NA4, NA4*, NB4, NB4*, ER, EOP, etc. These features are mainly used to assess the gear failure:
$M6A=\frac{{N}^{2}\sum _{i=1}^{N}{\left({d}_{i}\overline{d}\right)}^{6}}{{\left[\sum _{i=1}^{N}{\left({d}_{i}\overline{d}\right)}^{2}\right]}^{3}},M6{A}^{\mathrm{*}}=\frac{1/N\sum _{i=1}^{N}{\left({d}_{i}\stackrel{}{d}\right)}^{6}}{{\left[1/{M}^{\mathrm{\text{'}}}\sum _{j=1}^{{M}^{\mathrm{\text{'}}}}1/N\sum _{k=1}^{N}{\left(d{\mathrm{\text{'}}}_{jk}\stackrel{}{d}{\mathrm{\text{'}}}_{j}\right)}^{2}\right]}^{3}},$
3. Research on classification method based on QPSOKELM
3.1. KELM
In KELM algorithm, the kernel function is introduced into ELM, and the input weights and offsets in ELM are replaced by kernel function mapping, which makes the output of the ELM more stable and solves the over learning problems. Therefore, the KELM is widely used in classification identify areas.
ELM is a single hidden layer feedforward neural network, and its hidden layer weight does not need to be adjusted by feedback regulation. The structure of the ELM is shown in the Fig. 2, including an input layer, a hidden layer and an output layer.
Suppose $\mathbf{Z}=\{{\mathbf{x}}_{i},{\mathbf{t}}_{i}{\}}_{i=1}^{N}$ are $N$ training sample sets, where ${x}_{i}=[{x}_{i1},{x}_{i1},\cdots ,{x}_{im}{]}^{T}\in {\mathbf{R}}^{m}$ is the input characteristic parameter and ${t}_{i}=[{t}_{i1},{t}_{i1},\cdots ,{t}_{in}{]}^{T}\in {\mathbf{R}}^{n}$ is the sample label, then the output function expression of ELM training model is:
where, ${w}_{i}=[{w}_{1i},{w}_{2i},\cdots ,{w}_{mi}]$ and ${\beta}_{i}=[{\beta}_{i1},{\beta}_{i2},\cdots ,{\beta}_{in}{]}^{T}$ represent the input and output weight matrix of the $i$th hidden layer node respectively; ${o}_{j}=[{o}_{i1},{o}_{i2},\cdots ,{o}_{in}{]}^{T}$ represents the output layer vector of the ELM.
Fig. 2ELM model
The goal of ELM training is to make the error between the actual output of the training sample and the sample label close to 0, the equation is:
So the existence of ${\beta}_{i}$, ${w}_{i}$ and ${b}_{i}$ can be used to obtain the Eq. (4), as below:
The Eq. (4) can be expressed as a matrix:
where, $H$ is the output matrix of the hidden layer, which can be expressed in the following equation:
For $\beta $, the equation is as follows:
where, ${H}^{+}$ represents the inverse matrix of $H$, ${\beta}^{\mathrm{\text{'}}}$ is the optimal quadratic solution to $\beta $.
According to the structural optimization and ERM criteria, the output weight $\beta $ of ELM can be determined, that is:
where, ${\xi}_{i}={\left[{\xi}_{i,1},\cdots ,{\xi}_{i,m}\right]}^{T}$ represents network calculation error, $\frac{1}{2}{\Vert \beta \Vert}^{2}$ is the structural error, $\frac{1}{2}{\sum _{p=1}^{N}\Vert {\xi}_{p}\Vert}^{2}$ is the empirical error, and $C$ is the penalty factor.
Through the optimization of the solution, the optimal solution of $\beta $ can be obtained as follows:
Therefore, the output model function of ELM is as follows:
By using kernel operation to replace matrix operation $H$ in ELM, the KELM classification model can be established. The kernel matrix is defined as:
where $\mathrm{\Omega}$ is the symmetric matrix of $N\times N$ and $K(x,y)$ is the kernel function.
Eq. (11) can be expressed by kernel function as follows:
where, $\lambda $ represents the output weight matrix of KELM.
Different support vector machines can be constructed by selecting different kernel functions. Among the commonly used kernel functions, compared with the polynomial kernel function and Sigmoid kernel function, the radial basis kernel function has the advantages of simple parameters and strong adaptability to randomly distributed samples. Therefore, a SVM based on radial basis kernel function is established in this paper, and its expression is shown as follows:
According to Eq. (9) and (14), it can be seen that KELM contains kernel parameter $\sigma $ and penalty coefficient $C$, and it is necessary to find the optimal parameter to make the classification effect of KELM the best.
3.2. QPSOKELM
By quantizing the iterative updating process of particles of PSO algorithm, the QPSO can reduce the algorithm complexity, improve algorithm convergence speed and global search ability. The basic principles of QPSO are as follows:
Assume that $\mathrm{\Omega}$ is the $d$dimensional search space, and the population number of particles in the space is $b$, then the position of the $i$th particle can be expressed as:
Suppose the individual optimal position of the particle is ${p}_{ibest}$, and the global optimal position of the particle is ${p}_{gbest}$, the ${p}_{ibest}$ and ${p}_{gbest}$ are as follows:
The particle can find and update its individual optimal position ${p}_{i}$ and population optimal position ${p}_{g}$ through iterative operation, and the average optimal position $mbest$ can be introduced as the population optimal center. Then the particle optimization process can be expressed as:
where, $\alpha $ is the contraction expansion factor, which is dynamically adjusted in the iterative operation according to the Eq. (21):
where, $N$ represents the maximum number of iterations. ${\alpha}_{1}$ and ${\alpha}_{2}$ are the initial and final values of $\alpha $ respectively, usually the two parameters are set as: ${\alpha}_{1}=\text{0.5}$, ${\alpha}_{2}=\text{1}$.
According to the basic principle of KELM and QPSO, the network training process which is used QPSO to optimize the network structure parameters of KELM is as follows:
(1) Initialize the position of population particles, and set parameters such as particle swarm size, iteration step size, termination conditions and so on.
(2) Initialize the current position of each particle as ${p}_{ibest}$, define the classification accuracy of KELM as fitness function, and calculate the fitness value of each particle. The position of the particle with the maximum fitness value was initialized as ${p}_{gbest}$.
(3) Update the particle position according to Eq. (1820).
(4) The fitness value of each particle is calculated, and the individual optimal position ${p}_{best}$, the group optimal position ${p}_{gbest}$ and the group optimal center $mbest$ are updated based on the optimal fitness value.
(5) Judge whether the termination conditions are met. If so, stop the calculation and output the result, if not, return to step (3).
According to the above steps, the modeling flow chart of QPSOKELM can be drawn as shown in Fig. 3.
Fig. 3Modeling flow chart of QPSOKELM
4. Experimental verification and discussion
To verify whether the QPSOKELM is available, the pre planting fault tests are carried out for gear fault, bearing fault and gearbearing mixed fault. The fault simulation test rig is shown in Fig. 4. The test rig consists of four parts: the power and control part, the bearing fault simulation part (not used), the gear fault simulation part and the data acquisition part (not shown). This section mainly uses the gearbox fault simulation part, which is mainly composed of a twostage reduction spur gearbox, a magnetic powder brake (supply the load) and a magnetic powder brake controller (control the load change).
The perspective and internal structure of the twostage reduction spur gearbox are shown in Fig. 5. The teeth number in the gearbox from high speed shaft to low speed shaft is 41, 79, 36 and 90. The fault gear is the intermediate shaft pinion gear (gear 3 with 36 teeth). The fault types include wear, break and miss teeth. The fault bearing is located in the deep groove ball bearing at the end cover of the gear 3 side of the mediumspeed shaft (Fig. 5(b)). The deep groove ball bearing type is ER16k, and its size parameters are shown in Table 1. The fault types include inner race fault, outer race fault and single roller fault.
Fig. 4The mechanical fault simulation test bench
In the test, we set two vibration acceleration sensors in vertical and horizontal directions respectively. Both sensors are installed outside the end cover of the fault bearing (Fig. 5(c)), which are used to collect the vibration acceleration data in vertical and horizontal directions. The data acquisition parameters are set as: sampling frequency $fs=$ 20.48 kHz, sampling time $t=$ 48 s, motor speed 30 r/s (the actual motor speed is about 29.602 r/s). According to the speed of the motor, the number of gear teeth of the gearbox and the main dimension parameters of the bearing ER16K (Table 1), the main relevant frequency of the gearbox can be calculated by the calculation formula of bearing fault characteristic frequency (Table 2). The calculation results are shown in Table 3.
Fig. 5The structure of gearbox
a) Perspective view
b) The inner structure
c) The outer structure
4.1. Bearing fault diagnosis
Firstly, the single fault of bearing is analyzed. In the experiment, the motor speed is set to 30 r/s, and the signal sampling frequency is 20.48 kHz. The analysis data is mainly collected by sensor1 (vertical direction). The test data types include bearing normal data, rolling element fault data, inner ring fault data and outer ring fault data, each type contains 240 groups of data, and 960 groups of data is in total. The sampling time of each group of data is 2 seconds.
Table 1Main dimensions of bearing ER16K
Number of rolling elements ${N}_{b}$  Ball diameter $d$ / inch  Pitch diameter $D$ / inch  Contact angle $\theta $ (°) 
9  0.3125  1.516  0 
Table 2The formulas of fault characteristic frequencies to bearings
Name  Formulas 
Fault characteristic frequencies of outer race  ${f}_{BPFO}=\frac{{N}_{b}}{2}{f}_{r}\left[1\frac{d\mathrm{c}\mathrm{o}\mathrm{s}\left(\theta \right)}{D}\right]$ 
Fault characteristic frequencies of inner race  ${f}_{BPFI}=\frac{{N}_{b}}{2}{f}_{r}\left[1+\frac{d\mathrm{c}\mathrm{o}\mathrm{s}\left(\theta \right)}{D}\right]$ 
Fault characteristic frequencies of ball spin  ${f}_{BSF}=\frac{D}{d}{f}_{r}\left[1\frac{{d}^{2}\mathrm{c}\mathrm{o}{\mathrm{s}}^{2}\left(\theta \right)}{{D}^{2}}\right]$ 
${N}_{b}$ is the number of rolling elements, $d$ means the Ball diameter, $D$ means the Pitch diameter, $\theta $ means the Contact angle, ${f}_{r}$ is the bearing rotation frequency 
Table 3Relevant frequencies of the gearbox
Rotating frequency / r∙s^{1}  Meshing frequency / Hz  Fault feature frequency of bearing/ Hz  
${f}_{r1}$  ${f}_{r2}$  ${f}_{r3}$  ${f}_{m1}$  ${f}_{m2}$  ${f}_{BPFO}$  ${f}_{BPFI}$  ${f}_{BSF}$ 
29.602  15.363  6.145  1213.662  553.061  54.882  82.448  35.681 
${f}_{r1}$ is the rotating frequencies of high speed axis, ${f}_{r2}$ is the rotating frequencies of medium speed axis, ${f}_{r3}$ is the rotating frequencies of low speed axis; ${f}_{m1}$ is the meshing frequencies of gear 1 and gear 2, ${f}_{m2}$ is the meshing frequencies of gear 3 and gear 4; ${f}_{BPFO}$ is the fault characteristic frequencies of the bearing outer race, ${f}_{BPFI}$ is the fault characteristic frequencies of the bearing inner race, ${f}_{BSF}$ is the fault characteristic frequencies of the bearing ball spin 
When diagnosing the bearing fault, follow the procedure shown in Fig. 1. Firstly, the timedomain, frequencydomain and NASA characteristic parameters of the bearing fault were extracted from the bearing data of each group, and 960 samples will be obtained. And each group of the samples is marked with the following rules: normal1, roller2, inner3, outer4. Then, the training set and test set are selected from all samples in a ratio of 3:1, and the training set is input to KELM for training. In order to maximize the test accuracy, QPSO is used to optimize the parameters of KELM. Finally, in order to verify the superiority and effectiveness of the method, the ELM and PSOKELM are also applied to bearing fault diagnosis, and the experiment results of different methods are used to compare with each other.
Since the standard elm algorithm uses a single layer feedforward neural network structure, it is unnecessary to consider the parameter optimization of ELM. So the parameter optimization process of PSOKELM and QPSOKELM are analyzed, the results is shown in Fig. 6. Therefore, it is only necessary to analyze the parameter optimization process of PSOKELM and QPSOKELM.
Fig. 6a) Iterative process of the two methods, b) the local amplify of a)
a)
b)
Fig. 6(a) is iterative process of the two methods, Fig. 6(b) is the local amplify of Fig. 6(a). It can be seen from the figure that PSOKELM and QPSOKELM algorithm have the highest test accuracy from the beginning, which shows that the fault characteristics of bearing single fault are obvious, and it can achieve high classification accuracy without parameter iteration optimization. The initial parameters setting of QPSOKELM is $C\in \left[\mathrm{0.01,1000}\right]$, $\sigma \in \left[\mathrm{0.01,100}\right]$, and the optomized parameters are $C=$1000, $\sigma =$18.
The final training accuracy and test accuracy of the methods are shown in Table 4. It can be seen from Table 4 that both of the training accuracy and the test accuracy of QPSO are the highest and reach 99.58 %. The QPSOKELM classification results of the test dataset are shown in Fig. 7, in which there is only one misjudgment point, so the accuracy of the trained QPSOKELM model is reliable.
Table 4Bearing fault classification accuracy
SVM  CNN  ELM  PSOKELM  QPSOKELM  
Training accuracy  90.43 %  98.89 %  91.74 %  98.47 %  99.58 % 
Testing accuracy  89.97 %  98.74 %  91.74 %  97.92 %  99.58 % 
Fig. 7Classification results of bearing test dataset based on QPSOKELM
4.2. Gear fault diagnosis
Secondly, the single fault of gear is analyzed. The fault gear is the mediumspeed shaft pinion in the gearbox, which is gear 3 in Fig. 5(b). In the experiment, the motor speed and the signal sampling frequency are the same as in section 4.1. The analysis data is mainly collected by sensor 1 (vertical direction). The test data types include gear normal data, worn teeth data, broken tooth data and missing tooth data, each type contains 240 groups of data, and 960 groups of data is in total. The sampling time of each group of data is 2 seconds.
When diagnosing the gear fault, follow the procedure shown in Fig. 1. Firstly, the characteristic parameters of gear fault in timedomain, frequencydomain and NASA are extracted from each group of gear data, and 960 samples will be gotten. And each group of the samples is marked with the following rules: normal1, wear2, miss3, break4. Then, the training set and test set are selected from all samples in a ratio of 3:1, and the training dataset is input to KELM for training. In order to maximize the test accuracy, QPSO is used to optimize the parameters of KELM. Finally, in order to verify the effectiveness of the method and highlight the advantages of the proposed method. The ELM and PSOKELM are also used for gear fault diagnosis to compare with QPSOKELM.
The parameters optimization of PSOKELM and QPSOKELM are analyzed, the iteration process is shown in Fig. 8. It can be seen from the figure that the QPSOKELM algorithm has the highest test accuracy from the beginning, which indicates that the QPSOKELM has stronger learning ability, while PSOKELM reaches the maximum value at the 11th iteration, which indicates that its learning ability is slightly inferior to the method proposed in this paper. The initial setting of parameters is the same as that in Section 4.1, and the settings after parameter optimization are as follows: $C=$1000, $\sigma =$19.
The final training accuracy and test accuracy of the methods are shown in Table 5. It can be seen from Table 5 that both of the training accuracy and the test accuracy of QPSO are the highest and the test accuracy reaches 98.75 %. The QPSOKELM classification results of the test dataset are shown in Fig. 9, in which there are only three error points, so the accuracy of the trained QPSOKELM model is available. From the iteration process and fault diagnosis accuracy, it can be seen that the fault features of bearing are more obvious than that of gear. Therefore, gear fault diagnosis needs less iteration times, but higher test accuracy can be achieved.
Fig. 8Iterative process of the two methods
Fig. 9Classification results of gear test dataset based on QPSOKELM
Table 5Gear fault classification accuracy
SVM  CNN  ELM  PSOKELM  QPSOKELM  
Training accuracy  83.41 %  98.43 %  80.88 %  98.75 %  99.31 % 
Testing accuracy  81.57 %  97.69 %  80.88 %  97.92 %  98.75 % 
4.3. Gearbearing fault diagnosis
At last, the composite faults of gearbearing are analyzed. The fault parts are the gear and bearing mentioned in the previous section. There are 10 types of gearbearing data: normal gearnormal bearing data, worn gear – faulty bearing rolling element data, worn gearfaulty bearing inner race data, worn gearfaulty bearing outer race data, gear tooth missingfaulty bearing rolling element data, missing gear toothfaulty bearing inner race data, missing gear toothfaulty bearing outer race data, broken gear toothfaulty bearing rolling element data, broken gear toothfaulty bearing inner race data, broken gear toothfaulty bearing inner race data. The mixed faults of gearbearing is shown in Table 6.
In the experiment, the motor speed is set to 30 r/s, and the signal sampling frequency is 20.48 kHz. The sampling time of each group of data is 2 seconds. The total data is 2472, except the data of the normal state (sample size is 288) and the broken tooth faulty inner race (sample size is 264), the sample sizes of the other states are all 240 of each group.
Table 6The mixed faults of gearbearing
Combinations  Gear fault  Bearing fault  
Normal  Wear  Miss  Break  Normal  Roller  Inner  Outer  
1  +  +  
2  +  +  
3  +  +  
4  +  +  
5  +  +  
6  +  +  
7  +  +  
8  +  +  
9  +  +  
10  +  + 
When diagnosing the gearbearing fault, follow the procedure shown in Fig. 1. Firstly, the feature parameters of gearbearing fault in timedomain, frequencydomain and NASA are extracted from each group of gearbearing data, and 2742 samples will be gotten. Each group of samples is labeled with the following rules: normal1, wearroller2, wearinner3, wearouter4, missroller5, missinner6, missouter7, breakroller8, breakinner9, breakouter10. Then, the training set and test set are randomly selected from all samples in a ratio of 3:1, and the training dataset is input to KELM for training. In order to maximize the test accuracy, QPSO is used to optimize the parameters of KELM. Finally, in order to verify the effectiveness of the method and the superiority of the method, the ELM and PSOKELM are also used for gearbearing fault diagnosis to compare with QPSOKELM.
The optimization process of PSOKELM and QPSOKELM are shown in Fig. 10. It can be seen from the figure that the test progress of QPSOKELM algorithm reach the highest after six iterations, while the PSOKELM algorithm has no transformation from the beginning, and the accuracy is low, indicating that the classification ability of PSOKELM is weaker than that of QPSOKELM. The initial setting of parameters is the same as that in Section 4.1, and the settings after parameter optimization are as follows: $C=$1000, $\sigma =$20.
The final classification accuracy of the three methods for mixed faults of gearbearing is shown in Table 7. It can be seen from Table 7 that the training accuracy of QPSO is not the highest in the training process, but when comparing the final test accuracy, the classification accuracy of QPSO is far higher than that of the other two calculations, and the test accuracy is 90.13 %. However, the test accuracy of the other two methods are no higher than 60 %. The QPSOKELM classification results of the test dataset are shown in Fig. 11. As can be seen from Fig. 11, the error points are mainly concentrated in three states: the wornrolling element fault state, wornouter race fault state and brokenouter race fault state. Therefore, more sensitive feature parameters should be extracted for these three states.
Fig. 10Iterative process of the two methods
Table 7Classification accuracy of mixed faults of gearbearing
SVM  CNN  ELM  PSOKELM  QPSOKELM  
Training accuracy  58.43 %  90.67 %  53.34 %  92.27 %  92.45 % 
Testing accuracy  55.52 %  89.49 %  53.34 %  89.64 %  90.13 % 
Fig. 11Classification results of gearbearing test dataset based on QPSOKELM
5. Conclusions
A novel QPSOKELM method is proposed in this paper and applied to gearbox fault diagnosis. The method is implemented by using QPSO algorithm to select the proper KELM parameters, kernel parameter $\sigma $ and penalty coefficient $C$. In order to verify the effectiveness and the superiority of the method, the timedomain, frequencydomain and NASA features are collected, the results of the bearing single fault, gear single fault and gearbearing fault are analyzed. After comparing with two other methods (ELM and PSOKELM), and analyzing the test results, the following conclusions can be given: (1) Experimental results show that the proposed method is effective, it can identify the types of faults mentioned above; (2) Compared with SVM, ELM, CNN and PSOKELM, OPSOKELM has more advantages. The accuracy of QPSOKELM is more than 90 % in both training data and test data, which is higher than other methods.
The study focuses on optimizing the structural parameters of KELM and the application in gearbox fault diagnosis. From the analysis of experimental results, however, it can be found that there is still a space to improve the identification accuracy of gearbearing hybrid fault. Since the accuracy of fault judgment is closely related to fault feature extraction, in order to further improve the fault identification accuracy of composite faults, the next step of the study is to extract more sensitive and effective fault feature parameters.
References

Qu Jianling, Yu Lu, Yuan Tao Adaptive fault diagnosis algorithm for rolling bearing based on onedimensional convolutional neural network. Chinese Journal of Scientific Instrument, Vol. 39, Issue 7, 2018, p. 134143.

Wang Yakun A diagnosis method for electrical submersible pump based on RSGWPTMSE and PNN. Mechanical Engineer, Vol. 1, 2018, p. 167170.

Yu Yang, Bai Rui, Yang Ping Application of support vector machine based on genetic algorithm optimization on acoustic emission detection of gear fault. Journal of Mechanical transmission, Vol. 29, Issue 9, 2017, p. 21892197.

Wang X., Yan W. Fault diagnosis of roller bearings based on the variational mode decomposition and SVM. Journal of Vibration and Shock, Vol. 36, Issue 18, 2017, p. 252256.

Saari J., Strombergsson D., Lundberg J., et al. Detection and identification of windmill bearing faults using a oneclass support vector machine (SVM). Measurement, Vol. 137, 2019, p. 287301.

Huang G. B., Zhou H., Ding X. Extreme learning machine for regression and multiclass classification. IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics), Vol. 42, Issue 2, 2012, p. 513529.

Iosifidis A., Tefas A., Pitas I. On the kernel extreme learning machine classifier. Pattern Recognition Letters, Vol. 54, Issue 1, 2015, p. 1117.

Wen Jiangtao, Yan Changhong, Sun Jiedi Bearing fault diagnosis method based on compression acquisition and deep learning. Chinese Journal of Scientific Instrument, Vol. 39, Issue 1, 2018, p. 171179.

Yang Yu, Lu Peng, Gan Lei, et al. SADBN and its application in rolling bearing fault identification and classification. Journal of Vibration and Shock, Vol. 38, Issue 15, 2019, p. 1116.

Shao H., Jiang H., Wang F., et al. Rolling bearing fault diagnosis using adaptive deep belief network with dualtree complex wavelet packet. ISA Transactions, Vol. 3, Issue 69, 2017, p. 187201.

He Guangyu, Liu Xinxue, Wei Xuenkai Applications of genetic algorithm to aeroengine fault diagnosis. Fire Control and Command Control, Vol. 31, Issue 11, 2006, p. 8284.

Zhang Ning, Wei Xiuye, Guo Xiaoyong, et al. Study on fault diagnosis for bearings based on improved fish swarm algorithm to optimize neural network. Bearing, Vol. 474, Issue 5, 2019, p. 4953.

Xu J., Wang Y., Ji Z. Fault diagnosis method of rolling bearing based on WKELM optimized by whale optimization algorithm. Journal of System Simulation, Vol. 29, Issue 9, 2017, p. 21892197.

Duan Haibin, Wang Daobo, Yu Xiufen Ant colony algorithm: survey and prospect. Engineering Science, Vol. 9, Issue 2, 2007, p. 98102.

Wu Husheng, Zhang Fengming, Wu Lushan New swarm intelligence algorithm – wolf pack algorithm. Systems Engineering and Electronics, Vol. 35, Issue 11, 2013, p. 24302438.

Agrawal S., Panigrahi B. K., Tiwari M. K. Multiobjective particle swarm algorithm with fuzzy clustering for electrical power dispatch. IEEE Transactions on Evolutionary Computation, Vol. 12, Issue 5, 2008, p. 529541.

Wang Tiantian, Wang Yan, Ji Zhicheng Fault diagnosis of rolling bearing based on improved extreme learning machine. Journal of System Simulation, Vol. 30, Issue 11, 2018, p. 390397.

Wu Wei, Gui Boxiang Research on intelligent diagnosis method of machinery fault based on ant colony algorithm. Mechanical Research and Application, Vol. 6, 2014, p. 1517.

Pei Fei, Chen Xuechen, Zhu Yongli, et al. Transformer fault diagnosis based on particle swarm optimization and kernelbased extreme learning machine. Computer Engineering and Design, Vol. 5, 2015, p. 13271331.

Liu Min, Zhang Yingtang, Li Zhining, et al. Engine fault diagnosis based on IVMD and modified KELM. Journal of Vibration, Measurement and Diagnosis, Vol. 39, Issue 4, 2019, p. 875883.

Coello C. A. C., Pulido G. T., Lechuga M. S. Handling multiple objectives with particle swarm optimization. IEEE Transactions on Evolutionary Computation, Vol. 8, Issue 3, 2004, p. 256279.

Pang S., Yang X., Lin X. Evolutionary extreme learning machine optimized by quantumbehaved particle swarm optimization. Journal of System Simulation, 2017.

Rehman O. U., Yang S., Khan S. A quantum particle swarm optimizer with enhanced strategy for global optimization of electromagnetic devices. IEEE Transactions on Magnetics, Vol. 55, Issue 8, 2019, p. 7000804.

Luo Deyang Principle and application of timedomain synchronous averaging. Journal of Vibration, Measurement & Diagnosis, Vol. 19, Issue 3, 1999, p. 202207.

Li Jie, Zhao Jianmin Chaotic characteristic verification of gearbox vibration signal based on the time synchronous averaging and separation technology. Journal of Mechanical Transmission, Vol. 7, 2018, p. 33.

Del Moral P., Doucet A., Jasra A. On adaptive resampling strategies for sequential Monte Carlo methods. Bernoulli, Vol. 18, Issue 1, 2012, p. 252278.

Sawalhi N., Randall R. B. Signal prewhitening using cepstrum editing (liftering) to enhance fault detection in rolling element bearings. Proceedings of the 24th International Congress on Condition Monitoring and Diagnostic Engineering Management, 2011.

Li Haiping, Zhao Jianmin, Song Wenyuan Method of planetary gearbox feature extraction and condition recognition based on EMD and EDT. Journal of Vibration and Shock, Vol. 35, Issue 3, 2016, p. 4854.