Bearing fault diagnosis based on intrinsic time-scale decomposition and improved Support vector machine model

In order to achieve the bearing fault diagnosis so as to ensure the steadiness of rotating machinery. This article proposed a model based on intrinsic time-scale decomposition (ITD) and improved support vector machine method (ISVM), so as to deal with the non-stationary and nonlinear characteristics of bearing vibration signals. Firstly, the feature extraction method intrinsic time-scale decomposition (ITD) is used and the energy entropy are extracted so as to process the vibration signal in this paper. Then, the local tangent space alignment (LTSA) method is introduced to extract the characteristic features and reduce the dimension of the selected entropy features. Finally, the features are used to train the ISVM model as to classify bearings defects. Cases of actual were analyzed. The results validate the effectiveness of the proposed algorithm.


Introduction
As vital components in many rotating machinery, the failure of rolling element bearings may cause great economic loss and threaten people's life.This makes bearing diagnostics a prime research field to improve efficiency and durability of the industrial mechanical systems.Ideal diagnostics should also provide a real-time condition of the components in order to monitor them until the appearance of the first signals of malfunctioning.This could ensure a longer lifetime of the parts than other maintenance techniques, e.g. the preventive maintenance, where components are substituted at given time intervals independently of their actual conditions, with a relevant economic advantage.However, it is not an easy work to achieve the bearing fault diagnosis.The last two decades, the interest for efficient and robust diagnostics of roller element bearings via condition monitoring approaches as well as advanced signal processing has seen an extremely high increase.Of paramount importance in the diagnostic task are two items: 1) the feature extraction of damage sensitive features from the recorded condition monitoring signals and 2) the pattern recognition approach that leads to fault identification [1,2].
To extract the fault features, the signal processing tools that usually been applied, such as: time domain analysis, frequency domain analysis and time-frequency domain analysis.Time domain analysis is the simplest, its diagnoses faults mainly by analyzing statistical indicators, such as kurtosis, crest factor and root-mean square, directly extracted from the time series of the vibration signals.Frequency domain methods usually diagnose mechanical faults by observing the fault characteristic frequencies hidden in the vibration signals.Although FFT is the simplest frequency domain analysis method, its frequency resolution is low when there is a small number of sampling points.The time-frequency method is used more commonly.Wavelet transform is the most commonly used time-frequency analysis method and is widely used in feature extraction [3].
However, the selection of the wavelet mother coefficient is difficult, and for different researcher the selected coefficient not unique.So it is necessary to find an effective method to extract the fault-related features hidden in the complex and non-linear bearing vibration signals.The EMD [4] is other tool usually used in time-frequency domain analysis which is suitable for analyzing non-stationary signals.However, the EMD method has the problems of endpoint leak, and the modal aliasing.In this article, the intrinsic time-scale decomposition (ITD) is used to process the collected vibration signal [5,6] and the ITD Shannon entropy is used to extract the original features from the signal.
However, the ITD energy entropy features remain to be high-dimensional, and excessive redundant information still existed in them.In order to reduce the dimension of the features, the non-linear embedding method local tangent space alignment (LTSA) is used to extract the most useful features as inputs of the bearing running state recognition model [7].LTSA can achieve the transformation from input space to feature space through the nonlinear mapping method, so it has strong nonlinear dimension reduction ability.
Recognizing condition, in order to identify the work condition of roller bearing further, the SVM is served as a classifier [8,9].However, the traditional SVM is not sensitive to the nonlinear feature classification, in recent years, the combination of wavelet theories and SVM has drawn considerable attention owing to its high classification ability for a wide range of applications and better performance than other traditional leaning machines, in this paper, the Morlet kernel is use to construct the new SVM model [10].The paper is organized as follows: Section 2 the concept of ITD energy entropy is proposed and the ITD energy entropies of different vibration signals are calculated.In Section 3, the non-linear embedding method LTSA is described.The Morlet wavelet kernel SVM model is presented in Section 4. In Section 5, the degradation state method is applied to roller bearing diagnosis.The conclusion of this paper is given in Section 6.
The flowchart of the proposed method is showed in Fig. 1.

Signal processing
This section presents a brief discussion on bearing vibration signal processing from ITD. ITD is specifically formulated for application to nonlinear or non-stationary signals of arbitrary origin and obtained from complex systems with underlying dynamics that change on multiple time-scales simultaneously.The ITD overcomes the limitations of EMD listed earlier, as well as those previously mentioned and associated with more classical approaches such as Fourier and wavelets [5].
Given a signal , we define an operator ℓ, which extracts a baseline signal from in a manner that causes the residual to be a proper rotation.More specifically, can be decomposed as: where = ℓ is the baseline signal and = (1 − ℓ) is a proper rotation.Suppose { , ≥ 0} is a real-valued signal, and let { , = 1, 2, 3, …} denote the local extrema of , and for convenience define = 0.In the case of intervals on which is constant, but which contain extrema due to neighbouring signal fluctuations, is chosen as the right endpoint of the interval.To simplify notation, let and denote and , respectively.Suppose that and have been defined on [0, ] and that is available for ∈ [0, ].We can then define a (piece-wise linear) baseline-extracting operator ℓ, on the interval ( , ] between successive extrema as follows: where: and 0 < < 1 is typically fixed with = 1/2.We construct the baseline signal , in this manner in order to maintain the monotonicity of between extrema, while at the same time remaining inside an envelope generated by some wave riding atop this baseline.The extrema are interpreted as evidence of some proper rotation, riding wave to be extracted.The baseline is constructed as a linearly transformed contraction of the original signal in order to make the residual function monotonic between extrema, a necessity for proper rotations.The approach also enables information 'intrinsic' to the original signal to be passed down to the baseline and residual components.We found that other attempts at baseline construction that were not based on use of the input signal, inevitably failed to produce proper rotation residuals.After defining the baseline signal according to Eqs. ( 2) and (3), we are able to define the residual, proper-rotation-extracting operator ℏ, as: Through the decomposition of the original signal, the intrinsic scale component (ISC) can be

Feature extraction
Once the ISCs and a residue ( ) are obtained, where the energy of the ISCs is , , …, can be calculated respectively; then, due to the orthogonality of the ITD decomposition, the sum of the energy of the ISCs should be equal to the total energy of the original signal when the residue ( ) is ignored.As the ISCs ( ), ( ) , …, ( ) include different frequency components, = { , , …, } forms an energy distribution in the frequency domain of roller bearing vibration signal, and then the corresponding ITD energy entropy is designated as: where = / is the percent of the energy of ( ) in the whole signal energy ( = ∑ ).

Basic concepts of Local tangent space alignment
The basic idea of LTSA is to use the tangent space of sample points to represent the geometry of the local character.Then these local manifold structures of space are lined up to construct the global coordinates.Given a data set =[ , , …, ], ∈ , a mainstream shape of -dimension ( > ) is extracted.The LTSA feature extraction algorithm is as follows [11]: 1) Extract local information: for each , =1, 2, …, , used the Euclidean distance to determine a set =[ , , , , …, , ] of its neighborhood adjacent points ( nearest neighbors, for example).
2) Local linear fitting: In the neighborhood of data points , a set of orthogonal basis can be selected to construct the -dimension neighborhood space of and the orthogonal projection of each point , ( = 1, 2, …, ) can be calculated to the tangent space of ( ) = ( , − ̅ ).
̅ is the mean data for the neighborhood.The orthogonal projection in the tangent space of neighborhood data of is composed of local coordinate Θ = [ ( ), , ( ), , …, ( ), ] that describes the most important information of the geometry of the .
3) Global order of the Local coordinates: supposing the global coordinates of converted by the Θ is = [ , , …, ], then the error is: where the is the identity matrix; the is the unit vector; the is the points number of the neighborhood; The is the transformation matrix.In order to minimize the error, the and should be found, then: where: the Θ * is the Moor-Penrose generalized inverse of Θ .Supposing the: = , is a selected matrix from 0-1, the are global coordinates, their weight matrix: The constraints is: 4) Extract of the low-dimensional manifolds feature: Since the is the eigenvalue of matrix , so the corresponding minimum eigenvectors matrix is composed of eigenvalue.Section 2 to section + 1 of matrix make of the .is the global coordinate mapping in the Mainstream form of low-dimensional transformed from the non-linear high-dimensional data set of .

The Morlet wavelet kernel SVM model
The support vector's kernel function can be described as not only the product of point, such as ( , ′) = (〈 ⋅ ′〉), but also the horizontal floating function, such as ( , ′) = (〈 ⋅ ′〉), In fact, if a function satisfies the condition of Mercer's Theorem, it is the allowable support vector's kernel function.A specific Mercer's Theorem description can be found in literature [10].
According to Mercer's Theorem, the number of wavelet kernel functions which can be shown by existent functions is few.Now, an existent wavelet kernel is given, the Morlet wavelet kernel.It can prove that this function can satisfy the condition of allowable support vector's kernel function.The Morlet wavelet function is defined as follows: The Morlet wavelet kernel function is defined as follows: Then, the Morlet wavelet kernel function is being used as the support vector's kernel function, the SVM is defined as: Through the Eqs.( 15) and ( 16), the Morlet wavelet kernel SVM is constructed, and the new constructed SVM which is effective in classification is used the achieve bearing fault diagnosis.
The method consists of 3 procedures sequentially: data processing and features extracting, merge of the original features, constructing-training ISVM model for fault diagnosis.The role of each procedure is explained as follows: Step 1. Data processing and features extraction.The ITD signal processing methods are used to extract the original features from the collected mass vibration data.
Step 2. Merge of the original features.The LTSA method is used to extract the typical features and reduce the dimension of the features.The extracted features are used for training the ISVM model.
Step 3. Construct the ISVM model.The ISVM model is constructed.The rotating machine fault diagnosis is achieved.

Case 1
In order to verify the effectiveness of the proposed method, the bearing running state data sets of the normal state and several fault states were analyzed.The proposed method was applied to bearing fault signals obtained from the Case Western Reserve University [12].The bearing type in the experiments is SKF 6205-2RS JEM.Experiments were conducted by using a 2 hp reliance electric motor.Bearings were seeded with faults by using electro-discharge machining.The test is to simulate the bearing normal running state and fault running states, with fault depth of 0.18 mm, 0.36 mm, 0.53 mm and 0.71 mm at the inner raceway, outer raceway and the ball to reflect the deteriorating state of the bearing; the inner raceway fault signals were chosen in this case.Data was collected at the rate of 12,000 samples per second.4096 data points were selected to analyze.50 groups of test data of each fault states were selected, with 20 groups for training, the other 30 groups for testing.The collected vibration signals of normal state and inner-race four different fault depths are shown in Fig. 2.Then, normalize the 20 groups of entropy values, and input them into the LTSA to reduce the dimension.In order to compare the dimension reduction and redundant treatment effect of LTSA, the principle component analysis (PCA) [13], and the Kernel based principle component analysis (KPCA) [14] method is used to reduce the dimension.The results are shown in Figs. 4, 5, and 6.To be comparable, the dimensions of PCA, KPCA and LTSA, are set to 5, so the input dimension of ISVM is 5 and the neighborhood number is set to 10.
By comparing Figs. 4, 5, and 6, the results show that the PCA-based data dimension reduction method can't effectively separate the high dimension features, and there is still serious aliasing, which will affect the accuracy of the ISVM state recognition effect.The KPCA-based data dimension reduction method works better than the PCA methods, however, there still have some data mixed together.The LTSA-based data dimension reduction method can effectively separate the features of different running states with high calculation accuracy and a higher computational efficiency than the PCA and KPCA methods, which conform more to the actual project requirement.Thus, in the study, the LTSA method is selected.After dimension reduction with the LTSA, the extracted features are input into the ISVM (the wavelet of Morlet with set to 5, equal to set to 0.3) to train the model so as to recognize the states.In order to compare the identifying effect with and without manifold learning method, the following comparisons are done: Use ITD Shannon entropy to extract the features and directly input the extract features into the ISV, without the KPCA dimension reduction process.
1) Use ITD Shannon entropy to extract the features and process the extracted features by PCA to reduce the dimension, then input the features into the ISVM.
2) Use ITD Shannon entropy to extract the features and process the extracted features by KPCA to reduce the dimension, then input the features into the ISVM.
3) The method proposed in this article.
The comparison results are shown in Table 1.Table 1 showing that after the LTSA-based dimension reduction method and features extraction, the accuracy of states recognition improved significantly, much higher than the other algorithms.Therefore, the use of LTSA for dimension reduction in this research is necessary and valuable.
In order to further verify the identification accuracy of the proposed method, the features extracted by LTSA are input into the BP neural network (with the learning rate of the neural network is 0.01; The iteration number is 2000; the training error is 0.001; The hidden number = 15).the RBF SVM (with penalty factor set to 100, nuclear parameterγset to 0.1) and the proposed method.The comparison results are shown in Table 2. Table 2 shows that the ISVM can better identify and approach the sensitive features than the SVM and BPNN because of the Morlet kernel wavelet is used.Thus the choice of ISVM to determine the bearing running states can effectively improve recognition accuracy.
Next, a comparison about the training and test time loss of different methods is implemented: 1) The vibration data processed by ITD Shannon entropy and the extract features are direct input into the ISVM, without the LTSA dimension reduction.
2) The vibration data processed by ITD Shannon entropy and the features are processed by LTSA to reduce the dimension; then the extracted features are input into the SVM.
3) The proposed method in this research.
The comparison results are shown in Table 3.
In Table 3, after the dimension reduction, the recognition speed of ISVM improved significantly.
The time loss of the proposed method is the shortest.The reason is that the LTSA can extract the typical features and the Morlet kernel can improve the SVM model so as to quickly get the cluster centers.The result validates the proposed method and can effectively recognize the bearing fault.

Case 2
After validating the efficacy of the proposed method, the method is used on another case.The test rig is shown in Fig. 7.The bearings are hosted on the shaft; the shaft is driven by AC motor.The rotation speed is kept at 1000 rpm; a radial load of 3 kg is added to the bearing.The data sampling rate is 25600 Hz and the data length is 102400 collected points, as shown in Fig. 8. Every 2 hours, the vibration data is collected once.The bearing is run for one year.Then a set of data from each of the 2 months is selected; the data sets are used to test whether or not the proposed method can identify the bearing running state.4096 data points are selected to analyze, and 60 groups of collected data of different faults are obtained, with 30 groups for training and the other 30 groups for testing.4 (not normalized beforehand).
Then, the 30 groups' entropy values are normalized and input into the LTSA in order to reduce the dimension and extract the typical features; the extracted features are input into the improved SVM.The recognize results are shown in Table 5.
Table 5 shows that, although the actual bearing running state is very complex, the proposed method yields a high recognize accuracy.The results confirm that the proposed method can recognize the bearing running states effectively.

Conclusion
Firstly, this research used the ITD Shannon entropy method to extract the original features from the vibration signals.The LTSA was used to reduce the dimension and data redundancy of the entropy features.Through those methods, the typical features could be extracted effectively.
Then, in order to more accurately identify the bearing running state, the Morlet kernel wavelet model is used to improve and construct the SVM, so as to improve the recognition accuracy of SVM effectively.
Thirdly, through different comparisons we can see that the proposed method makes good use of the advantage of all parts and together to obtain better recognition accuracy and efficiency.
Finally, through the tested signals in the research, the results show the significant efficacy of the proposed method in identifying the bearing faults.

Fig. 1 .
Fig. 1.The flowchart of the proposed method

Fig. 2 .
Fig. 2. The collected vibration signals of normal state and inner-race four different fault depths Next, the ITD decomposition was used to decompose each group of signals into ISCs, and Shannon entropy was used to extract the features.A group of inner-race entropy of Fig. 2 is obtained, the 0.71 mm inner raceway fault ISCs (decomposed in 5 ISCs) decomposed by ITD as shown in Fig. 3.Then, normalize the 20 groups of entropy values, and input them into the LTSA to reduce the dimension.In order to compare the dimension reduction and redundant treatment effect of LTSA, the principle component analysis (PCA)[13], and the Kernel based principle component analysis (KPCA)[14] method is used to reduce the dimension.The results are shown in Figs.4, 5, and 6.To be comparable, the dimensions of PCA, KPCA and LTSA, are set to 5, so the input dimension of ISVM is 5 and the neighborhood number is set to 10.By comparing Figs. 4, 5, and 6, the results show that the PCA-based data dimension reduction

Fig. 8 .
Fig. 8.The collected vibration data Next, the ITD decomposition was used to decompose each group of signals into ISCs, and Shannon entropy was used to extract the features.A group of features of different fault conditions are obtained, as shown in Table4(not normalized beforehand).Then, the 30 groups' entropy values are normalized and input into the LTSA in order to reduce the dimension and extract the typical features; the extracted features are input into the improved SVM.The recognize results are shown in Table5.
1934.BEARING FAULT DIAGNOSIS BASED ON INTRINSIC TIME-SCALE DECOMPOSITION AND IMPROVED SUPPORT VECTOR MACHINE MODEL.
JINLU SHENG, SHAOJIANG DONG, ZHU LIU acquired, which represents different frequency components of original signal.Which can be represents as:

Table 1 .
The states recognition rate of three different methods (recognition rate %)

Table 2 .
The recognition rate of BPNN, SVM and the ISVM

Table 3 .
The time loss of three different methods Methods Without LTSA dimension reduction SVM method The proposed method

Table 4 .
A group of ITD energy entropy of different running states of the actual signal

Table 5 .
The states recognition rate of different states based on the proposed method (recognition rate %)