Application of improved support vector machine regression analysis for medium- and long-term vibration trend prediction

Jin, Xiangyang; Sun, Zhihui; Wang, Heteng; Wang, Feipeng; Yan, Qingwu

Journal of Vibroengineering

Browse Journal

Submit article

Published: June 30, 2013

Check for updates

Application of improved support vector machine regression analysis for medium- and long-term vibration trend prediction

Xiangyang Jin¹

Zhihui Sun²

Heteng Wang³

Feipeng Wang⁴

Qingwu Yan⁵

^{1, 2, 3}School of Light Industry, Harbin University of Commerce, Harbin 150028, China

^{4, 5}Harbin Gewu Technology Development Limited Liability Company, Harbin 150025, China

Cite the article Download PDF

Downloads 2111

Abstract

Aircraft engine fault diagnosis plays a crucial role in cost-effective operations of aircraft engines. However, successful detection of signals due to vibrations in multiple transmission channels is not always easy to accomplish, and traditional tests for nonlinearity are not always capable of capturing the dynamics. Here we applied a new method of smooth support vector machine regression (SSVMR) to better fit complicated dynamic systems. Since quadratic loss functions are less sensitive, the constrained quadratic optimization could be transferred to the unconstrained optimization so that the number of constraint conditions could be reduced. Meanwhile, the problem of slow operation speed and large memory space requirement associated with quadratic programming could be solved. Based on observed input and output data, the equivalent dynamic model of aircraft engineers was established, and model verification was done using historical vibration data. The results showed that SSVMR had fast operation speed and high predictive precision, and thus could be applied to provide early warning if engine vibration exceeds the required standard.

1. Introduction

The performance of aircraft engine decreased through time, and it is important to effectively evaluate and predict this performance. To do so, the change rate of vibration parameters need to be analyzed to help better understand the vibration trend, discover potential faults, estimate the remaining useful life of critical components, and reveal the recession trend of the performance of aircraft engine [1]. Accurate trend forecast of the engine performance is essential, which could help determine whether the engine would be operable in the near future. Engine performance checking and potential fault diagnosis need to be done regularly, and methods for predicting performance trends of relevant applications need to be updated to increase the diagnostic accuracy, reduce the occurrence of false alarms and improve flight safety [2]. Furthermore, recorded data could be further processed using more complicated algorithms and models so that early detection of abnormal conditions, including unbalance, wear, tear and rubbing fault can be realized, and corresponding actions can be taken in a timely manner.

The existing prediction methods can be divided into qualitative prediction and quantitative prediction. Compared to the latter, the former often fails to provide accurate predictions. The latter can be further divided into timing sequence relationship prediction and causality prediction. Also, it can be specifically classified into regression analysis prediction, timing sequence prediction, Markov chain prediction, grey prediction, Bayes network prediction and neural network prediction [3], among which timing sequence prediction, Markov chain prediction, grey prediction and neural network prediction belong to timing sequence relationship prediction category, whereas Bayes network prediction and regression analysis prediction belong to causality prediction category. In general, timing sequence relationship prediction is widely applied because timing sequence data are easy to collect. In some cases, however, further analysis with causality prediction applied is required [4], and the selection of time lags for time series forecasting by fuzzy inference systems, for which optimal non-uniform attractors are embedded in the multidimensional delay phase space, is especially needed [5]. Recently, a new model was proposed to predict rain rate time series in tropical rainforests [6], and time series prediction techniques have been applied to significantly reduce the radio use and energy consumption of wireless sensor nodes performing periodic data collection tasks [7]. A computational framework was used to generate daily temperature maps using time-series method based on temperature measurements from the national network of meteorological stations [8]. Because each prediction method has its own advantages and disadvantages, the best predictive method needs to be selected in accordance with different modes of faults and features.

Based on the predictive operation conditions and fault variation properties of aircraft engine, this study tested the applicability of nonlinear predictive method in predicting the operational vibrating faults of aircraft engine in a long-term scale. Support vector regression (SVR) was applied to evaluate whether the predicted vibrating values were in accordance with the technical requirements of the predictive technology used for evaluating aircraft engine operational conditions. This study proposed that improved support vector regression algorithms with smooth operators included could increase the predictive power and accuracy of both medium- and long-term trend. Moreover, the constrained quadratic optimization problem could be converted to unconstrained optimization problem, and model verification was done using historical vibration data. The results showed that even vibrating values in a longer interval could be predicted with high accuracy.

2. Improvement of the support vector regression algorithm

Support vector machine was initially used to solve classification problems. Afterwards, its application was extended to the regression field [9]:

1

D = \{(x_{1}, y_{1}), (x_{2}, y_{2}), \dots, (x_{m}, y_{m})\} .

In the above set, $x_{i}$ is a $n$ -dimensional coordinate and $y_{i}$ is the corresponding value of $x_{i}$ . In any regression analysis, a function needs to be defined to predict the value of any given point $ω$ . Meanwhile, the error of regression model needs to be minimized [10]. In a linear regression model, regression function is defined as follows:

2

f (x) = ω \cdot x + b .

Nuclear technology is of great importance in support vector machine analysis, and, nonlinear problems should be converted into linear problems by adopting methods commonly used in nuclear technology to solve regression problems [11]. A nonlinear mapping $Φ$ is adopted to map the data in a high-dimensional feature space $G$ . A linear approximation is then applied to minimize the expected risk function as follows:

3

f (x) = (ω \cdot Φ (x)) + b,

Φ : R^{n} \to F, ω \in G .

In the above formula, $ω$ is the generalized parameter in the high-dimensional feature space $G$ , $(\cdot)$ is the inner product and $b$ is the constant term.

Function approximation is equivalent to minimize the structure risk function $R_{r e g} [f]$ :

4

R_{r e g} [f] = R_{e m p} [f] + λ {‖ω‖}^{2} = \sum_{i = 1}^{s} l (y_{i} - f (x_{i})) + \frac{1}{2} {‖ω‖}^{2} .

In the above formula, $R_{e m p}$ is the empirical risk and $l (\cdot)$ is the loss function, representing the deviation between $y$ and $C$ [12]. $ε$ is used to represent the error of regression model:

5

{| y - f (x) |}_{ε} = m a x \{0, |y - f (x)| - ε\} .

Then empirical risk is:

6

R_{e m p}^{ε} [f] = \frac{1}{s} \sum_{i = 1}^{s} {|y_{i} - f (x_{i})|}_{ε} .

In the above formula, $C$ is a positive parameter defined in advance. When the difference between the predictive value and the observed value is less than $ε$ , it is considered that no loss is produced. $R_{e m p}^{ε} [f]$ is used to minimize the formula 2, with both the training error and model complexity controlled. Similar to the mentality of classification problem, this problem could be converted into an optimization problem [13]:

7

m i n J = \frac{1}{2} {‖ω‖}^{2} + C \sum_{i = 1}^{s} (ξ i + ξ_{i}^{*}),

s . t . \{\begin{matrix} y_{i} - ω \cdot Φ (x_{i}) - b \leq ε + ξ_{i}^{*}, \\ ω \cdot Φ (x_{i}) + b - y_{i} \leq ε + ξ_{i}, \\ ξ_{i}^{*}, ξ i \geq 0 . \end{matrix}

In the above formula, $ξ_{i}^{*}$ , $ξ_{i}$ is the slack variable when the difference between the predictive value and observed value is larger than $ε$ , thus allows some points functioning as outliers far away from predictive functions [14]. $C$ is the punishment variable, which is applied when the difference between the predictive value and observed value is larger than $ε$ . Similar to classification problems, Lagrangian function could be established to calculate the extreme points, get dual problem and finally calculate weight vector and excursion.

Formula (7) can be converted into the following formula by adopting kernel function $K (x_{i}, x_{j}) = Φ (x_{i}) \cdot Φ (x_{j})$ :

8

m a x J = - \frac{1}{2} \sum_{i, j = 1}^{s} (α_{i}^{*} - α_{i}) (α_{j}^{*} - α_{j}) K (x_{i}, x_{j}) + \sum_{i = 1}^{s} α_{i}^{*} (y_{i} - ε) - \sum_{i = 1}^{s} α_{i}^{*} (y_{i} + ε),

s . t . \{\begin{matrix} \sum_{i = 1}^{s} α_{i} = \sum_{i = 1}^{s} α_{i}^{*}, \\ 0 \leq α_{i} \leq C, 0 \leq α_{i}^{*} \leq C . \end{matrix}

Nonlinear mapping with convex quadratic programming is represented as:

9

f (x) = \sum_{i = 1}^{s} (α_{i} - α_{i}^{*}) K (x_{i}, x) + b .

Clearly, how to select design parameters $C$ and $ε$ is an essential question for support vector regression model analysis. The value of $C$ determines the penalty to samples when training errors exceed $ε$ . The selection of $C$ exerts great influence on the generalization ability of the system, and the value of $ε$ directly influences the predictive accuracy. If a small value of $ε$ is selected, the requirement for regression predictive accuracy will be high and the quantity of support vector machines will increase [15]; by contrast, if a large value of $ε$ is selected, regression predictive accuracy will decrease. Therefore, the selection process of $C$ and $ε$ is extremely complicated, although how to select these parameters in order to attain a good regression result remains unsolved in practice.

As support vector regression method deals with a constrained quadratic programming problem, the high computational complexity could lead to a low computing speed. Also, this method requires large computing memory and thus its use would be limited, especially for massive data. In order to improve the computing peed of support vector regression algorithm and decrease its training complexity, a methodical improvement of support vector regression is necessary. By adopting the smoothing method, the constrained quadratic optimization problem in traditional support vector regression could be converted to unconstrained convex quadratic optimization problem, with the loss function converted to an $ε$ insensitive quadratic function. For convenience, this conversion process in support vector machine regression is represented as follows:

10

m i n J = \frac{1}{2} | | ω | |^{2} + C \sum_{i = 1}^{s} ξ i + C \sum_{i = 1}^{s} ξ_{i}^{*},

s . t . \{\begin{matrix} y_{i} - ω \cdot Φ (x_{i}) - b \leq ε + ξ_{i}^{*}, \\ ω \cdot Φ (x_{i}) + b - y_{i} \leq ε + ξ_{i}, \\ ξ_{i}^{*}, ξ i \geq 0 . \end{matrix}

Institute the regular factor $C$ with $\frac{C}{2}$ and add another term $\frac{b^{2}}{2}$ , formula (10) becomes:

11

m i n J = \frac{1}{2} ({‖ω‖}^{2} + b) + \frac{C}{2} \sum_{i = 1}^{s} ξ i + \frac{C}{2} \sum_{i = 1}^{s} ξ_{i}^{*},

s . t . \{\begin{matrix} y_{i} - ω \cdot Φ (x_{i}) - b \leq ε + ξ_{i}^{*}, \\ ω \cdot Φ (x_{i}) + b - y_{i} \leq ε + ξ_{i}, \\ ξ_{i}^{*}, ξ i \geq 0 . \end{matrix}

Apply the following conversion:

12

z_{i} = ω \cdot Φ (x_{i}) + b - y_{i} - ε,

z_{i}^{*} = y_{i} - ω \cdot Φ (x_{i}) - b - ε .

Define the following function ${(u)}_{+} = m a x {u, 0}$ and make $ξ_{i} = {(z_{i})}_{+}$ , $ξ_{i}^{*} = {(z_{i}^{*})}_{+}$ . Also, formula (11) can be converted into the following unconstrained quadratic optimization problem:

13

m i n J = \frac{1}{2} ({‖ω‖}^{2} + b) + \frac{C}{2} \sum_{i = 1}^{S} (Z_{i})_{+}^{2} + \frac{C}{2} \sum_{i = 1}^{S} (Z_{i}^{*})_{+}^{2} .

Formula (13) represents an unconstrained convex quadratic optimization problem. Because the target function is not twice differentiable, we define a strictly convex and infinitely differentiable smooth function $p (u, a) = \frac{1}{α} l n (1 + e^{α u}), α > 0$ and institute ${(u)}_{+}$ with $p (u, a)$ . Therefore, a smooth support vector regression objective function is represented as follows:

14

m i n J = \frac{1}{2} ({‖ω‖}^{2} + b^{2}) + \frac{C}{2} \sum_{i = 1}^{s} p {(u, a)}^{2} + \frac{C}{2} \sum_{i = 1}^{s} p {(z_{i}^{*}, a)}^{2} .

3. Trend prediction simulation analysis based on SVR

To compare the effectiveness of artificial neural network and improved support vector regression, both RBF neural network and support vector regression algorithms are programmed under MatlabR2009a environment [16]. A continuous function model is given as following:

$y = c o s (e^{x}) + s i n (x)$ , $x \in [2, 4]$ .

The sampling interval is 0.1, with 21 sample data produced. In RBF neural network model, the network training is set as 1000 times, and the minimum error is set as 0.002. In support vector regression model, RBF kernel function is adopted, with kernel parameter $σ =$ 0.07 and model parameters $C =$ 600, $ε =$ 0.02. The regression result of the continuous function model is shown in Fig. 1.

Fig. 1Regression results derived from both SVR and RBF

It can be seen form Fig. 1 that the predictive curves and actual curves of the two models coincide with each other with small predictive error and high regression accuracy.

The main problem of neural network, however, lies in that the regression curve is not smooth enough. As can be seen from Fig. 1, the regression curve (the dotted line) is just a direct connection among sample points and thus yields unsatisfactory results [17]. By contrast, the regression curve of SVR (the solid line) coincides well with the realistic curve of the continuous function model with better regression accuracy.

Although neural network has a strong capability of nonlinear approximation, it focuses exclusively on error minimization but loses the generalization property and thus is limited for time series analysis. By contrast, support vector regression algorithm has higher generalization ability, and thus provides a better fit of the data.

4. Predictions of aircraft engine vibrations using improved SVR models

Vibration data were measured for eight flights in 155 days so that the dataset could be represented in a matrix of 8×155. The time sequence of the dataset was shown in Fig. 2.

Fig. 2Tendency chart of all vibration data

Further analysis and denoising process were applied to the collected vibration data. The maximum value of vibration intensity was obtained to form an unvaried time sequence. The sequence included 155 data points, as shown in Fig. 3. These data points were further divided into two parts, with 130 data points used for training models and related optimizing model parameters, and the rest 25 data points used for model evaluation and validation.

Fig. 3Single variable sequence of the vibration intensity

In the first step, linear function was applied and corresponding predictive trend was shown in Fig. 4. It could be seen that the increasing trend of total vibration was obvious.

Obviously, when linear function was applied, the displayed trend was simple and clear, and related fault problems were easy to identify. However, one obvious disadvantage of linear function was its high predictive error. In Fig. 4, we could see that the difference between the predictive value and practical value was large with an average error of 7.32 %.

BP algorithm is commonly adopted for artificial neural network. The predictive aim error for setting BP neural network is 0.001 and network training is operated 2000 times. However, BP network fails to achieve the target value set after network training is operated more than 2000 times. At that stage, the owe training problem occurs. After network training is operated for 3573 times, the network stops training with the final training error as 0.049.

Fig. 4Linear prediction trend chart and error analysis

The prediction trend chart is shown in Fig. 5. Obviously, the predictive curve of BP algorithm matches the real variation trend perfectly, but it brings other problems, such as longer calculation time, larger memory space requirement and lower prediction accuracy.

Radial kernel function was adopted for SVR prediction and the parameter settings were as follows: $C =$ 1200, $σ =$ 1.5, $ε =$ 0.005. All predictive values were shown in Fig. 6.

SVR was applied to the same time series dataset (i.e., 130 data points), with extremely small error gained from the training model, and the remaining 25 data points were used to test prediction accuracy of SVR model (the predictive average relative error was 1.327 %). The results showed that SVR was well applied to nonlinear time series for accurate prediction in a long-term scale. Also, it demonstrated the effectiveness of the predictive method proposed in this study when used for predicting the performance of aircraft engine.

Fig. 5Neural network prediction trend chart and error analysis

Fig. 6Prediction trend chart and error analysis of SVR model

Fig. 7 showed the comparison chart of errors in the engine predictive results using both SVR and BP. It could be seen from the chart that the prediction error using SVR was smaller then that using BP, with the average predictive relative error 3.573 % smaller for the former compared to that of the latter.

Fig. 7Comparison between predictive errors of neural network and SVR

Compared with neural network, SVR prediction model has smaller error and stronger generalization ability. As neural network unilaterally minimizes errors when it is used to deal with practical dataset with noises, over fitting or over learning could easily occur, which would decrease the prediction accuracy of neural network. By contrast, SVR reflects the obvious features that support vector machine networks and help compromise structural risks. Since the time series variation rule of the predictive values and the real values are approximately the same, time series analysis could be widely applied to simulate the real process.

5. Conclusions

Support vector algorithm could be used to solve problems associated with quadratic programming, such as slow running speed and large storage space requirement. Smoothing operators are introduced in this study to improve model fitness and decrease the complexity of training. Simulation tests show that smoothing support vector machine has fast operation speed and high predictive accuracy, and is very fit for eary detection of the performance of aircraft engines with complicated vibration patterns.

The effectiveness of SVR method depends on insensitive factor $ε$ , penalty factor $C$ , the type of kernel function and its parameter $S$ . The interplay between these parameters is complicated, and a proper matching could help improve prediction accuracy.

However, trend analysis method has its disadvantages. For example, it has high requirements for data collections. Also, it can only detect engine faults rather than reveal the causes of these faults. Furthermore, it is insensitive to detect some types of engine faults. Therefore, further studies should address above mentioned questions.

References

Kim Kyoung-Jae Financial time series forecasting using support vector machines. Neurocomputing, Vol. 55, 2003, p. 307-319.

Search CrossRef
Samui P., Kothari D. P. Utilization of a least square support vector machine for slope stability analysis. Scientia Iranica, Vol. 18, Issue 1A, 2011, p. 70-78.

Search CrossRef
Srinil Narakorn Analysis and prediction of vortex-induced vibrations of variable-tension vertical risers in linearly sheared currents. Applied Ocean Research, Vol. 33, Issue 1, 2011, p. 41-53.

Search CrossRef
Ghosh Abhijit, K. Majumdar Sujit Modeling blast furnace productivity using support vector machines. International Journal of Advanced Manufacturing Technology, Vol. 52, Issue 9-12, 2011, p. 989-1003.

Search CrossRef
Kristina Lukoseviciute, Minvydas Ragulskis Evolutionary algorithms for the selection of time lags for time series forecasting by fuzzy inference systems. Neurocomputing, Vol. 73, Issues 10-12, 2010, p. 2077-2088.

Search CrossRef
D. Das, A. Maitra Time series prediction of rain rate during rain events at a tropical location. IET Microwaves, Antennas & Propagation, Vol. 6, Issue 15, 2012, p. 1710-1716.

Search CrossRef
Yann-Ael Le Borgne, Gianluca Bontempi Time series prediction for energy-efficient wireless sensors: applications to environmental monitoring and video games. Social Informatics and Telecommunications Engineering, Vol. 102, 2012, p. 63-72.

Search CrossRef
Tomislav Hengl, Gerard B. M. Spatio-temporal prediction of daily temperatures using time-series of MODIS LST images. Theoretical and Applied Climatology, Vol. 107, Issue 1-2, 2012, p. 265-277.

Search CrossRef
Mansfield Shawn D., Kang Kyu-Young, Iliadis Lazaros, et al. Predicting the strength of populus spp. clones using artificial neural networks and Ε-regression support vector machines. Holzforschung, Vol. 65, Issue 6, 2011, p. 855-863.

Search CrossRef
Guven Aytac A predictive model for pressure fluctuations on sloping channels using support vector machine. International Journal for Numerical Methods in Fluids, Vol. 66, Issue 11, 2011, p. 1371-1382.

Search CrossRef
Kisi Ozgur, Cimen Mesut Precipitation forecasting by using wavelet-support vector machine conjunction model. Engineering Applications of Artificial Intelligence, Vol. 25, Issue 4, 2012, p. 783-792.

Search CrossRef
Monguzzi Angelo, Milani Alberto, Mech Agniezka, et al. Predictive modeling of the vibrational quenching in emitting lanthanides complexes. Synthetic Metals, Vol. 161, Issue 23-24, 2012, p. 2693-2699.

Search CrossRef
Vladimir N. Vapnik An overview of statistical learning theory. IEEE Transactions on Neural Networks, Vol. 10, Issue 5, 1999, p. 988-999.

Search CrossRef
Kamali M., Ataei M. Prediction of blast induced vibrations in the structures of Karoun III power plant and dam. Journal of Vibration and Control, Vol. 17, Issue 4, 2011, p. 541-548.

Search CrossRef
Bossi L., Rottenbacher C., Mimmi G., Magni L. Multivariable predictive control for vibrating structures: an application. Control Engineering Practice, Vol. 19, Issue 10, 2011, p. 1087-1098.

Search CrossRef
Tseng F. M., Yu H. C., Tzeng G. H. Applied hybrid grey model to forecast seasonal time series. Technological Forecasting and Social Change, Vol. 67, Issue 2, 2001, p. 291-302.

Search CrossRef
Moschioni G., Saggin B., Tarabini M. Prediction of data variability in hand-arm vibration measurements. Measurement: Journal of the International Measurement Confederation, Vol. 44, Issue 9, 2011, p. 1679-1690.

Search CrossRef

About this article

Received

April 8, 2013

Accepted

June 3, 2013

Published

June 30, 2013

Keywords

aircraft engine

trend forecasting

smooth support vector regression

quadratic functions

Acknowledgements

This work was supported by Scientific Research Fund of Heilongjiang Provincial Education Department (No. 1252CGZH18), Twelfth Five-Year Plan Issues for Heilongjiang High Education Scientific Research (No. HGJXHB2110792), and the Youth Science and Technology Innovative Talents Project of Harbin Science and Technology Bureau (No. 2012RFQXG076).

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.