Neighborhood preserving discrimination for rotor fault feature data set dimensionally reduction

Shi, Kunju; Wu, Peng; Liu, Mingshuai; Dai, Yuanjun

doi:10.21595/vp.2020.21691

Vibroengineering Procedia

Published: 19 October 2020

Neighborhood preserving discrimination for rotor fault feature data set dimensionally reduction

Kunju Shi¹

Peng Wu²

Mingshuai Liu³

Yuanjun Dai⁴

¹,³,⁴School of Mechanical Engineering, Shanghai Dianji University, Shanghai, China

²School of Mechanical Engineering and Rail Transit, Changzhou University, Changzhou, 213164, China

Corresponding author: Peng Wu

Abstract

Neighborhood Preserving Projections (NPP) is an incremental subspace learning method that preserves the local neighborhood geometry of the data. To improve the discriminative power of NPP, the Neighborhood Preserving Discrimination (NPD) algorithm is proposed for dimensionality reduction of rotor system fault feature data sets. The Floyd algorithm based on graph theory and the Maximum Margin Criterion (MMC) are introduced into NPP. This allows NPD to avoid the short-circuit problem that occurs in high-curvature, high-dimensional data sets while enhancing the discriminative information retained during dimensionality reduction. In addition, NPD preserves the manifold structure of the data set. Finally, a rotor-bearing experiment is carried out to verify the effectiveness of the NPD method.

1. Introduction

There are significant differences between dimensionality reduction techniques used to describe data characteristics and those used for discriminative classification. The former aim to minimize the information lost during dimensionality reduction. For example, PCA tries to find the linear projection directions that best represent the original data structure [1]. Its direction matrix is composed of m orthogonal directions that capture as much of the data variance as possible, so some important discriminative information may be ignored [2, 3]. Because dimensionality reduction methods that describe the characteristics of the data do not reveal the category information needed for classification, discriminant analysis addresses the following problem [4]: given a two-category data set, find the optimal feature or feature set that distinguishes the two types of data. LDA and its improved version, the Maximum Margin Criterion (MMC), belong to this type of method [5]. They try to find a projection direction that both reduces the number of variables and extracts discriminant information, but this projection direction may not fully express the internal structure of the sample data. Different data types, or the same data type under different circumstances, highlight different information: some contain relatively more global structure information, and some contain more discriminative classification information [6, 7]. Therefore, a new method is needed that makes the discriminant information prominent while retaining as much data information as possible during dimensionality reduction, so as to facilitate failure mode identification.

Neighborhood Preserving Projection (NPP) is an approximate incremental formulation of the LLE algorithm. It is an incremental subspace learning method that describes the characteristics of the data and reduces its dimension, and it keeps the local neighborhood geometry of the data unchanged. To improve the discrimination ability of NPP, this paper proposes the Neighborhood Preserving Discrimination (NPD) algorithm, which is used to reduce the dimension of the fault feature data set of a rotor system. The Floyd algorithm based on graph theory and the Maximum Margin Criterion (MMC) are introduced into NPP, so that NPD avoids the short-circuit problem of high-dimensional, high-curvature data sets and enhances the discriminative information of the reduced-dimensional data while maintaining manifold information. Finally, the method is validated with data from a two-span rotor test bench.

2. Method of NPP

The NPP method solves the out-of-sample problem of the LLE (Locally Linear Embedding) method by approximating LLE through a linear transformation. The data set $X$ ($X \in R^{D}$) is mapped by the projection matrix $A$ to a low-dimensional feature data set $Y$ ($Y \in R^{d}$). The key step is to solve for the projection matrix $A$ that best preserves the characteristics of the original data set under the premise of minimum reconstruction error before and after dimensionality reduction. The calculation formula is as follows:

$$\min J_{2}(Y) = \sum_{i=1}^{N} \left\| y_{i} - \sum_{j=1}^{k} w_{ij} y_{j} \right\|^{2} = \mathrm{trace}(Y M Y^{T}), \tag{1}$$

$$\mathrm{s.t.} \quad \frac{1}{N-1} Y Y^{T} = I, \tag{2}$$

where $w_{ij}$ are the elements of the reconstruction weight matrix $W$, and

$$M = (I - W)^{T} (I - W). \tag{3}$$

Because $Y = A^{T} X$, Eq. (1) can be rewritten as:

$$\min J_{2}(Y) = \mathrm{trace}\left((A^{T} X) M (A^{T} X)^{T}\right) = \mathrm{trace}\left(A^{T} (X M X^{T}) A\right). \tag{4}$$

Applying the Lagrange multiplier method to Eqs. (1)-(4) and simplifying gives Eq. (5):

$$\left((X X^{T})^{-1} (X M X^{T})\right) A = \lambda A. \tag{5}$$

According to Eq. (5), the columns of the projection matrix $A$ are eigenvectors of $(X X^{T})^{-1} (X M X^{T})$.
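The step from the trace objective to the eigenvalue problem can be written out explicitly. The following is a sketch of the standard Lagrange multiplier argument for a single column $a$ of $A$, assuming the constraint is expressed in terms of $a$ and using the symmetry of $M$ (which follows from $M = (I - W)^{T}(I - W)$). The Lagrangian $L(a, \lambda)$ is notation introduced here for illustration.

```latex
\begin{aligned}
L(a, \lambda) &= a^{T} X M X^{T} a
  - \lambda \left( \frac{1}{N-1}\, a^{T} X X^{T} a - 1 \right), \\
\frac{\partial L}{\partial a} &= 2\, X M X^{T} a
  - \frac{2\lambda}{N-1}\, X X^{T} a = 0, \\
\left( X X^{T} \right)^{-1} \left( X M X^{T} \right) a &= \frac{\lambda}{N-1}\, a .
\end{aligned}
```

Absorbing the constant $1/(N-1)$ into $\lambda$ gives the form of Eq. (5). Because the objective is minimized, the columns of $A$ would normally be taken as the eigenvectors belonging to the $d$ smallest eigenvalues; the text does not state this selection rule explicitly.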

3. Derivation of NPD data dimension reduction formula

The basic idea of NPD is that if the linear transformation obtained by NPP also satisfies the MMC criterion, the discriminability of the data will be greatly improved. This can be expressed as a multi-objective optimization problem:

$$\min_{A} \; \mathrm{tr}\{A^{T} (X M X^{T}) A\}, \tag{6}$$

$$\max_{A} \; \mathrm{tr}\{A^{T} (S_{b} - S_{w}) A\}, \tag{7}$$

$$\mathrm{s.t.} \quad \frac{1}{N-1} Y Y^{T} = I. \tag{8}$$

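Here $S_{b}$ and $S_{w}$ denote the between-class and within-class scatter matrices used by the MMC. The multi-objective problem is then converted into a single constrained optimization problem:

$$\min_{A} \; \mathrm{tr}\{A^{T} [(X M X^{T}) - (S_{b} - S_{w})] A\}, \tag{9}$$

$$\mathrm{s.t.} \quad \frac{1}{N-1} A^{T} (X X^{T}) A = I. \tag{10}$$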

Using the Lagrange multiplier method to solve this problem, we have:

$$\frac{\partial}{\partial A} \left\{ A^{T} [(X M X^{T}) - (S_{b} - S_{w})] A - \frac{\lambda}{N-1} A^{T} (X X^{T}) A + \lambda I \right\} = 0. \tag{11}$$

Further derivation yields:

$$[(X M X^{T}) - (S_{b} - S_{w})] A = \frac{\lambda}{N-1} (X X^{T}) A, \tag{12}$$

$$(N-1) (X X^{T})^{-1} [(X M X^{T}) - (S_{b} - S_{w})] A = \lambda A. \tag{13}$$

The columns of the projection matrix $A$ are therefore eigenvectors of $(N-1) (X X^{T})^{-1} [(X M X^{T}) - (S_{b} - S_{w})]$.
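The eigenvalue problem for $A$ can be sketched numerically with NumPy. This is an illustration under stated assumptions, not the authors' implementation: the text does not define $S_{b}$ and $S_{w}$, so the class-prior-weighted scatter matrices commonly used with MMC are assumed; the eigenvectors with the $d$ smallest eigenvalues are selected because the objective is a minimization; and the function names and the small `ridge` regularizer are hypothetical.

```python
import numpy as np


def scatter_matrices(X, labels):
    """Return (S_b, S_w) for a D x N data matrix X with one label per column.

    Assumed definitions (not given in the text): class-prior-weighted
    between-class and within-class scatter, as commonly used with MMC.
    """
    labels = np.asarray(labels)
    D, N = X.shape
    mean_all = X.mean(axis=1, keepdims=True)
    S_b = np.zeros((D, D))
    S_w = np.zeros((D, D))
    for c in np.unique(labels):
        Xc = X[:, labels == c]
        mean_c = Xc.mean(axis=1, keepdims=True)
        diff = mean_c - mean_all
        S_b += (Xc.shape[1] / N) * (diff @ diff.T)
        centered = Xc - mean_c
        S_w += (centered @ centered.T) / N
    return S_b, S_w


def npd_projection(X, M, labels, d, ridge=1e-8):
    """Return (A, eigenvalues) with A of shape D x d.

    Solves S A = lambda * B A with S = X M X^T - (S_b - S_w) and
    B = X X^T / (N - 1), which is the same eigenproblem as
    (N - 1) (X X^T)^{-1} S A = lambda A. A Cholesky factor of B is used
    so that a symmetric solver can be applied and eigenvalues stay real.
    `ridge` is a hypothetical regularizer that keeps B invertible.
    """
    D, N = X.shape
    S_b, S_w = scatter_matrices(X, labels)
    S = X @ M @ X.T - (S_b - S_w)
    S = (S + S.T) / 2.0                      # remove round-off asymmetry
    B = X @ X.T / (N - 1) + ridge * np.eye(D)
    L = np.linalg.cholesky(B)                # B = L L^T
    L_inv = np.linalg.inv(L)
    C = L_inv @ S @ L_inv.T
    eigvals, V = np.linalg.eigh((C + C.T) / 2.0)   # ascending eigenvalues
    A = L_inv.T @ V[:, :d]                   # d smallest eigenvalues
    return A, eigvals[:d]
```

With a training matrix `X_train`, its matrix `M` and class labels, `A, _ = npd_projection(X_train, M, labels, d=3)` would give the projection, and `A.T @ X_test` would embed new samples. The returned `A` satisfies the normalization constraint $\frac{1}{N-1} A^{T} (X X^{T}) A = I$ up to the ridge term.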

The calculation steps of this method are as follows:

Input: $D \times N$ matrix $X$ , number of neighbor points $K$ , connection distance $c$ , low-dimensional embedding dimension $d$ ( $d < D$ );

Output: $d \times N$ low-dimensional matrix $Y$ .

Step 1. Calculate the Euclidean distance $d_{x}(i, j)$ ($i = 1, 2, \ldots, N$, $j = 1, 2, \ldots, K$) between the points in the $X$ matrix;

Step 2. Set the connection threshold $c$, determine the weight of each point, calculate the distance from $x_{i}$ to the remaining points, and construct a weighted graph;

Step 3. Using the Floyd search algorithm based on graph theory, select the $K$ smallest distances to define the $K$-neighborhood of each point $x_{i}$;

Step 4. Calculate the reconstruction weight matrix $W$;

Step 5. Calculate $M$ according to Eq. (3);

Step 6. Calculate the projection matrix $A$ according to Eq. (13) and obtain $Y = A^{T} X$.
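Steps 1 to 3 can be sketched in plain Python. This is a minimal illustration, not the authors' implementation: the function names, the rule that two points are connected when their Euclidean distance does not exceed $c$, and the toy U-shaped point set are all assumptions made for the example.

```python
import math


def euclidean_distances(points):
    """Step 1: pairwise Euclidean distances between all points (tuples)."""
    n = len(points)
    return [[math.dist(points[i], points[j]) for j in range(n)] for i in range(n)]


def floyd_geodesic(dist, c):
    """Steps 2-3: keep edges no longer than c, then run Floyd-Warshall.

    Returns the matrix of shortest path lengths along the graph;
    unreachable pairs stay at infinity.
    """
    n = len(dist)
    geo = [[dist[i][j] if dist[i][j] <= c else math.inf for j in range(n)]
           for i in range(n)]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                via_k = geo[i][k] + geo[k][j]
                if via_k < geo[i][j]:
                    geo[i][j] = via_k
    return geo


def k_neighbors(dist, K):
    """Indices of the K closest other points for every point."""
    n = len(dist)
    return [sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])[:K]
            for i in range(n)]


# Toy U-shaped curve: points 0 and 6 are close in space but far along the curve.
points = [(0, 0), (0, 1), (0, 2), (0.75, 2.5), (1.5, 2), (1.5, 1), (1.5, 0)]
euclid = euclidean_distances(points)
geodesic = floyd_geodesic(euclid, c=1.1)
```

In this toy example the Euclidean 2-neighborhood of point 0 contains point 6 on the opposite arm of the curve, which is the short-circuit effect, whereas the graph-based neighborhood from `k_neighbors(geodesic, 2)` contains only points 1 and 2, which lie next to point 0 along the curve.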

4. Design of dimension reduction method for NPD fault feature dataset

Based on the above analysis, the NPD dimension reduction process of the rotor system fault data set is shown in Fig. 1.

The implementation steps of this process are as follows:

Step 1. Collect vibration signals of typical faults of the rotor system, and divide them into training and test sets;

Step 2. Calculate the time and frequency domain characteristics of each data acquisition channel;

Step 3. Use the Fisher feature selection algorithm for the features of each data collection channel, arrange the obtained Fisher values in descending order and select the statistical features with larger values to form the original training feature data set;

Step 4. Use the NPD algorithm to reduce the dimensions of the original training feature data set to obtain the projection matrix $A$ ;

Step 5. Quantitatively extract the features of the test set according to the recorded best features of each channel, and use the formula $Y = A^{T} X$ for dimension reduction.
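The Fisher-based feature selection in Step 3 can be sketched as follows. The text does not give the Fisher formula, so a common form is assumed here: for each feature, the between-class scatter of the class means divided by the pooled within-class scatter. The function names and the features-by-samples layout are hypothetical.

```python
def fisher_scores(X, labels):
    """Fisher value of each feature row of X (layout: features x samples).

    Assumed form: sum over classes of n_c * (class mean - overall mean)^2,
    divided by the summed squared deviations from the class means.
    """
    classes = sorted(set(labels))
    scores = []
    for row in X:
        overall_mean = sum(row) / len(row)
        between = 0.0
        within = 0.0
        for c in classes:
            values = [v for v, lab in zip(row, labels) if lab == c]
            mean_c = sum(values) / len(values)
            between += len(values) * (mean_c - overall_mean) ** 2
            within += sum((v - mean_c) ** 2 for v in values)
        if within > 0:
            scores.append(between / within)
        else:
            # Degenerate feature: constant inside every class.
            scores.append(float("inf") if between > 0 else 0.0)
    return scores


def select_top_features(X, labels, m):
    """Indices of the m features with the largest Fisher values, descending."""
    scores = fisher_scores(X, labels)
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:m]
```

The indices returned for the training set would be recorded and reused to pick the same rows from the test set before applying $Y = A^{T} X$.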

Fig. 1. Dimension reduction flowchart of rotor system fault data set

5. Dimension reduction and classification of NPD fault feature dataset

5.1. Rotor system’s original fault characteristic data set construction

To verify the effectiveness of the NPD method, experiments were performed on the test bench for four bearing states: normal, inner fault, rolling fault and race fault. The sampling frequency is 5000 Hz, and the vibration signals of these four operating states are collected at 3000 r/min. For each channel, 13 frequency-domain features and 17 time-domain features (mean, square root amplitude, standard deviation, variance, absolute mean, skewness, kurtosis, peak-to-peak, peak index, pulse index, margin index, waveform index, C factor, root mean square value, L factor, S factor, I factor) are selected, giving a total of 30 statistical features. The time-domain and frequency-domain feature values of each channel are calculated separately (Table 1 shows the calculation results).
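A few of the listed time-domain features can be computed as in the sketch below. The text does not give formulas, so commonly used definitions are assumed (population variance, and dimensionless indices formed as ratios of peak, RMS, absolute mean and square root amplitude). The C, L, S and I factors are left out because the text does not define them, and the dictionary keys are hypothetical names.

```python
import math


def time_domain_features(signal):
    """Return a dict of basic time-domain statistics of a vibration signal.

    Assumes a non-empty signal that is not identically zero, since the
    dimensionless indices divide by RMS, absolute mean and square root
    amplitude.
    """
    n = len(signal)
    mean = sum(signal) / n
    abs_mean = sum(abs(x) for x in signal) / n
    variance = sum((x - mean) ** 2 for x in signal) / n   # population variance
    rms = math.sqrt(sum(x * x for x in signal) / n)
    sra = (sum(math.sqrt(abs(x)) for x in signal) / n) ** 2
    peak = max(abs(x) for x in signal)
    return {
        "mean": mean,
        "absolute_mean": abs_mean,
        "variance": variance,
        "standard_deviation": math.sqrt(variance),
        "root_mean_square": rms,
        "square_root_amplitude": sra,
        "peak_to_peak": max(signal) - min(signal),
        "waveform_index": rms / abs_mean,
        "peak_index": peak / rms,
        "pulse_index": peak / abs_mean,
        "margin_index": peak / sra,
    }
```

Applying such a function to every channel of every sample, together with the frequency-domain features, would fill one column of the feature matrix.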

For each state, 40 sample points are taken to form a 360×160 matrix $X_{1}$ (12 channels × 30 features by 4 states × 40 samples). The Fisher value of each feature of $X_{1}$ is calculated, the features are arranged in descending order of Fisher value, and the first 100 features are taken as the original feature training set $X_{12}$. The signals are then collected again, and the original feature space is constructed from the selected channels and statistical features. Each state again takes 40 sample points to form a 100×160 matrix $X_{2}$ as the test set.

Table 1. Original characteristic data set of rotor fault (unit: mm)

				Inner fault					Outer fault			
Channel	Domain	Feature	No.	X₁	X₂	…	X₄₀	…	X₁₂₁	X₁₂₂	…	X₁₆₀
Ch1	Time domain	T₁	1	0.5529	0.6559	…	0.3382	…	0.8020	0.8743	…	1
		…										
		T₁₇	17	0.2851	0.2708	…	0.2555	…	0.2675	0.2667	…	0.2655
	Frequency domain	F₁	18	0.0021	0.0000	…	0.0065	…	0.8620	0.7797	…	0.6921
		…										
		F₁₃	30	0.0089	0.0000	…	0.0183	…	0.8415	0.8341	…	0.7463
…												
Ch12	Time domain	T₁	331	0.8528	0.8562	…	0.6843	…	0.1540	0.0497	…	0.2138
		…										
	Frequency domain	…										
		F₁₃	360	0.4805	0.3331	…	0.8458	…	0.1804	0.1237	…	0.1181

Fig. 2. Fisher value for each feature

5.2. NPD dimension reduction of rotor system fault data set

The NPD algorithm is implemented on the above matrix, and the calculation results are shown in Fig. 3.

Fig. 3(a) is a three-dimensional scatter plot obtained by implementing the NPD algorithm. Record the channel numbers and statistical characteristics of the first 100 features, and calculate the test set according to the recorded channel numbers and statistical characteristics to form the original spatial high-dimensional data set of the test set. The low-dimensional embedding matrix can be obtained for this test set according to Step 5 in Section 4. Fig. 3(b) is a scatter plot of this matrix.

Fig. 3. NPD dimensionality reduction scatter plot: a) training, b) test

Fig. 4. KPCA dimensionality reduction scatter plot: a) training, b) test

Fig. 5. NPP dimensionality reduction scatter plot: a) training, b) test

Fig. 6. LLD dimensionality reduction scatter plot: a) training, b) test

To further verify the dimensionality reduction effect of NPD, the classic KPCA, NPP and LLD algorithms were used to reduce the dimensionality of the training and test data sets; for KPCA, a polynomial kernel function with $d = 5$ was selected. Figs. 4-6 are the three-dimensional scatter plots obtained by KPCA, NPP and LLD.

Table 2 lists the within-class (inner) distances of the low-dimensional feature components extracted by the NPD algorithm and by the KPCA, NPP and LLD methods. It can be seen from the table that the features extracted by the NPD algorithm have small within-class distances, most clearly on the test set, where NPD gives the smallest value in all four states.

Table 2. Low-dimensional space inner distance (unit: mm)

Method	Normal	Inner fault	Rolling fault	Race fault
NPD training	1.1×10⁻²	1.3×10⁻²	1.1×10⁻²	1.0×10⁻²
KPCA training	8.0	12.5	18.9	18.0
NPP training	5.14×10⁻²	5.03×10⁻⁴	1.42×10⁻³	1.68×10⁻³
LLD training	1.84×10⁻³	5.98×10⁻⁴	1.81×10⁻³	3.28×10⁻³
NPD test	1.3×10⁻²	8.1×10⁻⁴	2.9×10⁻⁴	4.7×10⁻⁴
KPCA test	1.5×10⁷	8.1×10⁶	1.2×10⁸	1.2×10⁷
NPP test	7.01×10⁻²	2.64×10⁻²	2.08×10⁻²	1.53×10⁻²
LLD test	2.91×10⁻²	1.61×10⁻²	9.31×10⁻²	7.58×10⁻²
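A within-class distance of this kind can be computed as in the following sketch. The text does not define the inner distance, so one plausible definition is assumed here: the mean Euclidean distance between the samples of a class and the class centroid in the low-dimensional space. The function name and the data layout (one tuple per sample) are hypothetical, and the values in Table 2 were not necessarily obtained this way.

```python
import math


def within_class_distance(Y, labels):
    """Mean distance from each sample to its class centroid, per class.

    Y is a list of low-dimensional samples (one tuple per sample) and
    labels holds the class of each sample.
    """
    result = {}
    for c in sorted(set(labels)):
        members = [tuple(y) for y, lab in zip(Y, labels) if lab == c]
        dim = len(members[0])
        centroid = tuple(sum(p[k] for p in members) / len(members)
                         for k in range(dim))
        result[c] = sum(math.dist(p, centroid) for p in members) / len(members)
    return result
```

A smaller value means that the samples of a state cluster more tightly after dimensionality reduction, which is the property compared across methods in Table 2.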

6. Conclusions

For both the training and test sample sets of the four bearing operating states, the within-class distance obtained with the NPD algorithm is much smaller than that obtained with the KPCA algorithm. Compared with the KPCA, NPP and LLD algorithms, the results calculated by the NPD algorithm show better clustering. The analysis shows that the NPD algorithm, which adds the discriminative MMC criterion to the NPP algorithm, ensures that discriminant information is extracted during dimensionality reduction, which is more conducive to the classifier's fault pattern recognition.

References

[1] Deepak Ranjan Nayak, Ratnakar Dash, Banshidhar Majhi. An improved pathological brain detection system based on two-dimensional PCA and evolutionary extreme learning machine. Journal of Medical Systems, Vol. 42, Issue 19, 2018, p. 19-28.

[2] Zhao Yue, You Xinge, Yu Shujian, Xu Chang, Tao Dacheng. Multi-view manifold learning with locality alignment. Pattern Recognition, Vol. 78, Issue 2018, 2018, p. 154-166.

[3] Zhang Yongli, Christoph Eick F. Tracking events in twitter by combining an lda-based approach and a density–contour clustering approach. International Journal of Semantic Computing, Vol. 13, Issue 1, 2019, p. 87-110.

[4] Liu Jin, Pengren A., Ge Qianqian, Zhao Hang. Gabor tensor based face recognition using the boosted nonparametric maximum margin criterion. Multimedia Tools and Applications, Vol. 7, Issue 17, 2018, p. 9055-9069.

[5] Yu Xuelian, Wang Xuegang, Liu Benyong. Supervised kernel neighborhood preserving projections for radar target recognition. Signal Processing, Vol. 88, Issue 19, 2018, p. 2335-2339.

[6] Zou X., Liu Y., Zou L., Zheng Z. Improved discriminant sparseity preserving projecting face recognition algorithm. Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), Vol. 46, Issue 11, 2018, p. 53-57.

[7] Ma Jiayi, Jiang Junjun, Zhou Huabing, Zhao Ji, Guo Xiaojie. Guided locality preserving feature matching for remote sensing image registration. IEEE Transactions on Geoscience and Remote Sensing, Vol. 56, Issue 8, 2018, p. 4435-4447.

About this article

Received: 13 September 2020

Accepted: 05 October 2020

Subjects: Fault diagnosis based on vibration signal analysis

Keywords: neighborhood preserving projections, neighborhood preserving discrimination, maximum margin criterion, Floyd algorithm

Acknowledgements

This work is partially supported by the Plateau Discipline Foundation Project of the School of Mechanical Engineering, Shanghai Dianji University. The authors also gratefully acknowledge the helpful comments and suggestions of the reviewers, which have improved the presentation.

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

K. Shi, P. Wu, M. Liu, and Y. Dai, “Neighborhood preserving discrimination for rotor fault feature data set dimensionally reduction,” Vibroengineering PROCEDIA, Vol. 33, pp. 11–16, Oct. 2020, https://doi.org/10.21595/vp.2020.21691