Machine learning for rotating machines: simulation, diagnosis and control

Kornaev, Alexey; Savin, Leonid; Kornaev, Nickolay; Zaretsky, Roman; Kornaeva, Elena; Babin, Alexander; Stebakov, Ivan

doi:10.21595/vp.2020.21549

Vibroengineering Procedia

Browse Procedia

Published: 29 June 2020

Check for updates

Machine learning for rotating machines: simulation, diagnosis and control

Alexey Kornaev¹

Leonid Savin²

Nickolay Kornaev³

Roman Zaretsky⁴

Elena Kornaeva⁵

Alexander Babin⁶

Ivan Stebakov⁷

^{1, 2, 3, 4, 5, 6, 7}Department of Mechatronics, Mechanics and Robotics, Orel State University named after I.S. Turgenev, Orel, 302026, Russian Federation

Corresponding Author:

Alexey Kornaev

Cite the article Download PDF

Downloads 1315

CrossRef Citations 1

Abstract

The goal of this work is association of several machine learning methods in a study of rotating machines with fluid-film bearings. A fitting method is applied to fit a non-linear reaction force in a bearing and solve a rotor dynamics problem. The solution in the form of a simulation model of a rotor machine has become a part of a control system based on reinforcement learning and the policy gradient method. Experimental part of the paper deals with a pattern recognition and fault diagnosis problem. All the methods are effective and accurate enough.

Machine learning for rotating machines: simulation, diagnosis and control

Highlights

Fault diagnosis in fluid-film bearings.
Reinforcement learning for rotation machine control.
Rotor dynamics simulation.

1. Introduction

The main tool in modern machine learning is an artificial neural network (ANN) [1]. This work deals with applications of machine learning to rotating machines with fluid-film bearings. A rotor rotation is usually accompanied by lateral vibrations [2, 3]. Rotor trajectory contains information about the condition of the bearings and the rotor machine at a whole. Rotor dynamics modeling is a difficult task especially when the rotor has fluid-film bearings [2]. Hydrodynamic calculations are computationally expensive. Therefore, this part of the rotor dynamics problem can be implemented using ANNs [4]. Modern rotating machines can be equipped with a number of sensors. Analysis of their measurements can be automated using specialized ANNs. These ANNs implement logistic regression [1]. Both shallow learning and deep learning are used in pattern recognition. Deep convolutional neural networks are widely used in fault diagnosis [5, 6]. Deep learning is emerging in reinforcement learning and continuous control systems [7, 8].

This paper unites theoretical and experimental results achieved by the authors in applications of machine learning to simulation, diagnosis and control of rotating machines with fluid-film bearings.

2. Shallow learning for rotor dynamics simulation and fault diagnosis

The goal of supervised learning is to determine relationship between two sets: an input set $X$ and a target set $Y$ . The difference between predictions $H$ and targets $Y$ is minimized in training process. This error function is called a target function, a cost function, or a loss function.

The scheme of a simple feed forward ANN called multilayer perceptron is represented in Fig. 1. The $l$ -layer ANN has an input layer, $l - 2$ hidden layers and an output layer. A feature of the network architecture is that each neuron of the previous layer transmits its signal to each neuron of the next layer [4].

Input data in the form of $n_{1}$ numbers can be represented as a one-dimensional matrix. The input layer receives this matrix $X = ((x_{i}^{(1)})) (i = 1 \dots n_{1}$ ) and simply transmits it to the second layer with an additional unit element. The matrix $A^{(1)} = ((\begin{matrix} 1 & X \end{matrix}))$ is the output of the first layer. In the second (hidden) layer data from the first layer is multiplied by the weights matrix $Θ^{(1)}$ and added $Z^{(2)} = A^{(1)} Θ^{(1)}$ . An activation function is applied to the result $Z^{(2)}$ , and resulting matrix with additional unit element is the output of the second layer $A^{(2)} = ((\begin{matrix} 1 & {a c t i v a t i o n}^{(2)} (Z^{(2)}) \end{matrix}))$ . The same actions take place on an arbitrary $k - 1$ hidden layer:

1

Z^{(k - 1)} = A^{(k - 2)} Θ^{(k - 2)}, A^{(k - 1)} = ((\begin{matrix} 1 & {a c t i v a t i o n}^{(k - 1)} (Z^{(k - 1)}) \end{matrix})),

where $Z^{(k - 1)}$ is matrix with results of multiplication by weights and adding in the current layer, $A^{(k - 2)}$ , $A^{(k - 1)}$ are matrices of outputs in the previous and the current layers respectively, $Θ^{(k - 2)}$ is the previous layer weights matrix, ${a c t i v a t i o n}^{(k - 1)}$ is an activation function.

Similar calculations occur in the output layer. The result of the calculation in the output layer is the matrix of predictions $H = (({a c t i v a t i o n}^{(l)} (Z^{(l)})))$ .

The unknown weights matrices $Θ^{(k)}$ are determined by minimizing the objective function $J (Θ^{(k)}) \Rightarrow m i n$ . The type of objective function depends on the type of a given problem.

Fig. 1An l-layer feed forward neural network with n1 inputs and nl outputs

2.1. Multi-dimensional mapping for rotor dynamics simulation

A rigid unbalanced rotor with a gear coupling and a fluid-film bearing at the opposite tips is considered (see Fig. 2). The equations of the rotor’s motion can be represented as follows [3]:

2

\{\begin{array}{l} m d^{2} X / d t^{2} = C + B + F, \\ J_{d} d^{2} Ψ / d t^{2} + J_{p} d Ψ / d t = C ∙ G C + B ∙ G B, \end{array}

where $m$ is rotor mass, $X$ , $Ψ$ are coordinates of the center of mass and angles of the rotor rotation respectively, $t$ is time, $C$ , $B$ are reaction forces of the gear coupling and the bearing respectively, $F = m Δ ω^{2} [[\begin{matrix} c o s (ω t + φ) \\ s i n (ω t + φ) \end{matrix}]]$ is inertia force, $m Δ$ is the rotor unbalance, $φ$ is phase of unbalance, $J_{p}$ , $J_{d}$ are polar and diametral moments of inertia respectively, $G C$ , $G B$ are radius-vectors to the coupling and to the bearing respectively.

It is assumed that reaction forces are equivalent to springs and dampers: $C = C (X, d X / d t)$ , $B = B (X, d X / d t)$ . Also, it is assumed that reaction in a coupling is linear with given linear coefficients and reaction in a bearing is non-linear. Calculation of the bearing reaction is connected with solution of the Reynolds equation [2, 4]:

3

\frac{\partial}{\partial x_{1}} [\frac{h^{3}}{μ} \frac{\partial p}{\partial x_{1}}] + \frac{\partial}{\partial x_{3}} [\frac{h^{3}}{μ} \frac{\partial p}{\partial x_{3}}] = 6 \frac{\partial}{\partial x_{1}} (u_{1} h) - 12 u_{2},

where $x_{i}$ are coordinates connected with the thin oil film, $p = p (x_{1}, x_{3})$ is the unknown pressure function, $h = h (x_{1})$ is the oil film thickness, $μ$ is viscosity, $u_{j} = u_{j} (x_{1})$ are the tangential and normal components of the journal surface velocity, where $i = 1, 2, 3$ , $j = 1,2$ .

Fig. 2Calculation schematic of a rotor-bearing test rig with a fluid-film bearing

Given the pressure function, the components of reaction can be calculated by integration:

4

\begin{matrix} B_{1} = - \int_{0}^{L} \int_{0}^{π D} p (x_{1}, x_{3}) \cos (α) d x_{1} d x_{3}, & B_{2} = - \int_{0}^{L} \int_{0}^{π D} p (x_{1}, x_{3}) \sin (α) d x_{1} d x_{3}, \end{matrix}

where $L$ , $D$ are the bearing length and diameter respectively, $α = 2 x_{1} / D$ .

Approximation of Eq. (4) with the function $B = B (X, V = d X / d t)$ of four arguments can be implemented by the ANN represented in Fig. 1. The input layer receives a matrix with components of rotors position and its velocity of lateral vibrations $((\begin{matrix} \begin{matrix} X_{1} & X_{2} \end{matrix} & \begin{matrix} V_{1} & V_{2} \end{matrix} \end{matrix}))$ . The output of the ANN is the fluid-film reaction force $((\begin{matrix} B_{1} & B_{2} \end{matrix}))$ . A 3-layer feed-forward ANN with sigmoid activation function $a_{j}^{(2)} = 1 / 1 + e^{{- z}_{j}^{(2)}}$ ( $j = 1, \dots n_{2}$ ) in the hidden layer and linear function in the output layer $B_{j} = h_{j} = a_{j}^{(3)} = z_{j}^{(3)}$ is used to solve multi-dimentional mapping problem. Forward propagation in the ANN includes following calculations:

5

\begin{matrix} A^{(1)} = ((\begin{matrix} \begin{matrix} 1 & X_{1} \end{matrix} & \begin{matrix} X_{2} & V_{1} & V_{2} \end{matrix} \end{matrix})), & Z^{(2)} = A^{(1)} Θ^{(1)}, A^{(2)} = ((\begin{matrix} 1 & s i g m o i d (Z^{(2)}) \end{matrix})), \end{matrix}

Z^{(3)} = A^{(2)} Θ^{(2)}, B = H = A^{(3)} = Z^{(3)} .

The number of hidden neurons $n_{2}$ is arbitrary. Network training takes place on a large number of samples, i.e. pairs of input $((\begin{matrix} \begin{matrix} X_{1} & X_{2} \end{matrix} & \begin{matrix} V_{1} & V_{2} \end{matrix} \end{matrix}))$ and output $((\begin{matrix} B_{1} & B_{2} \end{matrix}))$ matrices. In mapping problems, the cost function has the following form [1]:

6

J (Θ^{(k)}) = \frac{1}{2 m} \sum_{i = 1}^{m} \sum_{j = 1}^{n_{l}} {(h_{j}^{(i)} - y_{j}^{(i)})}^{2} + \frac{λ}{2 m} \sum_{k = 1}^{l - 1} \sum_{i = 1}^{n_{k}} \sum_{j = 1}^{n_{k + 1}} {(θ_{i j}^{(k)})}^{2} \Rightarrow m i n,

here $m$ is number of samples in a dataset, $h_{j}^{(i)}$ , $y_{j}^{(i)}$ are predicted and target values of $j$ -th output value calculated for the $i$ -th sample, $n_{k}$ , $n_{l}$ are the numbers of neurons in the $k$ -th layer and in the output layer respectively, $λ$ is a regularization parameter.

In the training process the values of weights $Θ^{(k)}$ and the regularization parameter $λ$ are calculated. The network is trained with Levenberg-Marquardt backpropagation algorithm. The training process is implemented in one of the specialized programming environments [9, 10].

2.2. Classification and pattern recognition tools for rotating machine diagnosis

The main idea is the same: to determine relationship between inputs $X$ and targets $Y$ . The main difference is that the targets values are discrete and equal to 0 or 1, and predictions $H$ approximated by logistic function has continuous values in the interval (0 1) [1].

Sensor measurements are recorded during the tests under various conditions of a rotating machine. Given conditions are needed to be recognized by ANNs. The data from different types of sensors can be normalized [1] and merged into an input matrix $X$ . The number of classes in a target matrix $Y$ is equal to the number of observed conditions of a rotating machine. A 3-layer feed-forward ANN with a sigmoid activation function in the hidden layer (see section 2.1) and a softmax function in the output layer $h_{j} = a_{j}^{(3)} = e^{z_{j}^{(3)}} / \sum_{i = 1}^{{(n}_{3} + 1)} e^{z_{i}^{(3)}}$ ( $j = 1, \dots n_{3}$ ) is used to solve pattern recognition problem. Forward propagation includes following calculations:

7

\begin{matrix} A^{(1)} = ((\begin{matrix} 1 & X \end{matrix})), & Z^{(2)} = A^{(1)} Θ^{(1)}, A^{(2)} = ((\begin{matrix} 1 & s i g m o i d (Z^{(2)}) \end{matrix})), \end{matrix}

Z^{(3)} = A^{(2)} Θ^{(2)}, A^{(3)} = H = ((\begin{matrix} 1 & s o f t m a x (Z^{(3)}) \end{matrix})) .

As for the previous ANNs architecture (see subsection 2.1) the number of hidden neurons $n_{2}$ is arbitrary and training process needs a number of training samples. In pattern recognition problems, the cost function has the following form [1]:

8

J (Θ^{(k)}) = \frac{- 1}{m} \sum_{i = 1}^{m} \sum_{j = 1}^{n_{l}} y_{j}^{(i)} \ln (h_{j}^{(i)}) + \frac{λ}{2 m} \sum_{k = 1}^{l - 1} \sum_{i = 1}^{n_{k}} \sum_{j = 1}^{n_{k + 1}} θ_{i j}^{(k)} \Rightarrow m i n .

The network is trained with scaled conjugate gradient using functions of specialized programming environments [9, 10].

3. Deep reinforcement learning for rotating machine control

The main objective of the control system under study is minimization of energy consumption. The fluid-film reaction and the friction torque in a bearing are non-linear functions depending on pressure distribution (see Eq. (3)). It is assumed that the rotor vibrations described with Eq. (2), the coupling reaction $C$ is linear and the bearing reaction $B$ is simulated by the ANN described in Subsection 2.1 (see Fig. 2). In terms of reinforcement learning the rotor-bearing simulation model is an envinonment and the control system is an agent. At each time step $t$ the agent receives feedback from the environment $S_{t}$ in form of a matrix of simulation results and takes an action $a_{t}$ in response in form of pressure supply or any other parameter of a fluid-film bearing. The main idea is to train agent after the event giving him higher reward $r_{t}$ for better actions.

The deep deterministic policy gradient (DDPG) is used. The algorithm of DDPG agents is represented in [8]. At each time step the value function $v_{t}$ is calculated as follows [8]:

9

v_{t} = r_{t} + γ q' (S_{t + 1}, μ' (S_{t + 1} | Θ^{μ}) {, | Θ}^{q}),

where $γ$ is a discount factor, $q'$ is a $q$ -function calculated by a target critic, $μ'$ is a policy function of the action by a target agent, $Θ^{μ}$ , $Θ^{q}$ are unknown parameters of the actor and the critic ANNs respectively.

The architectures of the networks will be represented in the next section of the paper. The unknown parameters are calculated by minimizing the loss function [8]:

10

J (Θ^{μ}, Θ^{q}) = \frac{1}{m} \sum_{j = 1}^{m} {(v_{i}^{} - q (S_{i}, a_{i} | Θ^{q}))}^{2} \Rightarrow m i n .

The networks are trained with stochastic gradient descent method using functions of specialized programming environments [9, 10].

4. Simulation and experimental results

The first series of simulation tests was performed with a model of the test rig based on Eqs. (2-4). A set of rotor trajectories was calculated. Then the ANN described by Eqs. (5-6) was trained and tested in comparison with a known linear model characterized by the spring and damper matrices [4]. The results demonstrated that the rotor dynamics simulation program with the ANN module allows calculation rotor trajectory two times faster than a real time process. It was demonstrated also that the ANN allows simulation of non-linear transient processes with variable rotor speed and high vibrations [4].

The second series of tests was performed using the test rig with a multi-sensor measurement system. Two displacement sensors measured the rotor vibrations, three vibroaccelerometers measured the rotor and the electromotor housings vibroaccelerations, a microphone measured the operating rotor machine noise. Inside the bearing the pressure sensor measured pressure supply and the contact resistance sensor measured the fluid-film thickness. Six conditions of the test rig were studied, including the normal condition, the conditions with loosened bolts and the rotor unbalance condition. The general classification problem for six classes recognition and the simplified classification problem for two classes recognition (normal or abnormal) were solved. Several random samples of two classes dataset with 800 data points each are shown in Fig. 3

Fig. 3Random samples of two classes dataset with normalized measurements results from the microphone and from one of the vibroaccelerometers

Then the ANN described by Eqs. (7-8) was trained and tested to solve the classification problems. The accuracy of the two classes and the six classes recognition were up to 80 % and 90 % respectively. It should be noted, that the accuracy of the two classes classification by experts was up to 70 %. The value of accuracy means that classification process was close to random.

The third series of the tests was simulation. The rotor dynamics simulation model became an environment observed and controlled by an agent. The agent model was based on the DDPG algorithm (see section 2.3). The ANNs architectures are shown in Fig. 4.

Fig. 4The DDPG networks architectures: a) the actor network and b) the critic network [2]

a)

b)

The agent controlled the bearing clearance size $h$ (see Eq. (3)) directly, and indirectly the pressure distribution and the reaction forces in the bearing. The goal of the control system was minimization of power loss due to friction and vibration in the bearing. The ANNs were trained and tested. The results demonstrated decreasing the power loss up to 17 % by the DDPG agent.

5. Conclusions

Suggested tools of rotor dynamics simulation, condition classification and control based on artificial neural networks allow development of predictive modeling systems, fault diagnosis and control of energy efficient operation to design intellectual rotating machines. All the developed systems can be combined in one device. The following study is connected with design of the device which will be able to combine multiple functions of predictive modeling, fault diagnosis and control in interaction with a rotating machine.

References

Goodfellow Y., Bengio Y, Courville A. Deep Learning. MIT Press, 2016.

Search CrossRef
Hori Y. Hydrodynamic Lubrication. Yokendo Ltd, Tokyo, 2006.

Search CrossRef
Friswell M. I. Dynamics of Rotating Machines. Cambridge University Press, 2010.

Publisher
Kornaev A. V., Kornaev N. V., Kornaeva E. P., Savin L. A. Application of artificial neural networks to calculation of oil film reaction forces and dynamics of rotors on journal bearings. International Journal of Rotating Machinery, Vol. 2017, 2017, p. 9196701.

Publisher
Lei Y., Yang B., Jiang X., Jia F., Li N., Nandi A. K. Applications of machine learning to machine fault diagnosis: a review and roadmap. Mechanical Systems and Signal Processing, Vol. 138, 2020, p. 106587.

Publisher
Liu R., Yang B., Zio E., Chen X. Artificial intelligence for fault diagnosis of rotating machinery: A review. Mechanical Systems and Signal Processing, Vol. 108, 2018, p. 33-47.

Publisher
Busoniu L, Bruin T., Tolic D., Kober J., Palunko I. Reinforcement learning for control: Performance, stability, and deep approximators. Annual Review in Control, Vol. 46, 2018, p. 8-28.

Publisher
Lillicrap T. P., Hunt J. J., Pritzel A., Heess N., Erez T., Tassa Y., Silver D., Wierstra D. Continuous control with deep reinforcement learning. International Conference on Learning Representations, 2016.

Search CrossRef
Mathworks: help center. Nnstart tool, https://www.mathworks.com.

Search CrossRef
Keras API, https://keras.io/api/.

Search CrossRef

Cited by

Theoretical and experimental study of motion suppression and friction reduction of rotor systems with active hybrid fluid-film bearings

(2023)

About this article

Received

11 June 2020

Accepted

18 June 2020

Published

29 June 2020

SUBJECTS

Mathematical models in engineering

DOI

https://doi.org/10.21595/vp.2020.21549

Keywords

supervised learning

reinforcement learning

rotor dynamics

fault diagnosis

control

Acknowledgements

This work was supported by the Russian Science Foundation under the Project No. 16-19-00186. The authors gratefully acknowledge this support. Authors would also like to thank A. Rodichev, A. Fetisov, Y. Kazakov and S. Popov for the multi-sensory test rig development.

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Previous article in issue Previous Next article in issue Next

Research article

2022 06 13

Equivalent health assessment of rotating machinery with imbalance rotor based on metric learning

Haifei Liu, Laifa Tao, Xuyang Pu, Kaixin Jin, Tong Zhang

Review article

2021 11 26

Fault diagnosis and health management of bearings in rotating equipment based on vibration analysis – a review

Adnan Althubaiti, Faris Elasha, Joao Amaral Teixeira

Research article

2020 03 31

Fault diagnosis of rotating machinery under time-varying speed based on order tracking and deep learning

Taiyong Wang, Lan Zhang, Huihui Qiao, Peng Wang

Research article

2017 12 31

A novel intelligent fault diagnosis method of rotating machinery based on deep learning and PSO-SVM

Peiming Shi, Kai Liang, Dongying Han, Ying Zhang

A. Kornaev et al., “Machine learning for rotating machines: simulation, diagnosis and control,” Vibroengineering PROCEDIA, Vol. 32, pp. 223–228, Jun. 2020, https://doi.org/10.21595/vp.2020.21549

Copy Extrica

Copied to clipboard!

TY  - JOUR
DO  - 10.21595/vp.2020.21549
UR  - https://doi.org/10.21595/vp.2020.21549
TI  - Machine learning for rotating machines: simulation, diagnosis and control
T2  - Vibroengineering PROCEDIA
AU  - Kornaev, Alexey
AU  - Savin, Leonid
AU  - Kornaev, Nickolay
AU  - Zaretsky, Roman
AU  - Kornaeva, Elena
AU  - Babin, Alexander
AU  - Stebakov, Ivan
PY  - 2020
DA  - 2020/06/29
PB  - JVE International Ltd.
SP  - 223-228
VL  - 32
SN  - 2345-0533
SN  - 2538-8479
ER  - 

Copy Ris

Copied to clipboard!

@article{Kornaev_2020,
	doi = {10.21595/vp.2020.21549},
	url = {https://doi.org/10.21595/vp.2020.21549},
	year = 2020,
	month = {jun},
	publisher = {{JVE} International Ltd.},
	volume = {32},
	pages = {223--228},
	author = {Alexey Kornaev and Leonid Savin and Nickolay Kornaev and Roman Zaretsky and Elena Kornaeva and Alexander Babin and Ivan Stebakov},
	title = {Machine learning for rotating machines: simulation, diagnosis and control},
	journal = {Vibroengineering {PROCEDIA}}
}

Copy Bibtex

Copied to clipboard!

[1]A. Kornaev et al., “Machine learning for rotating machines: simulation, diagnosis and control,” Vibroengineering PROCEDIA, vol. 32, pp. 223–228, Jun. 2020, doi: 10.21595/vp.2020.21549.

Copy IEEE

Copied to clipboard!

Kornaev, Alexey, Leonid Savin, Nickolay Kornaev, Roman Zaretsky, Elena Kornaeva, Alexander Babin, and Ivan Stebakov. “Machine Learning for Rotating Machines: Simulation, Diagnosis and Control.” Vibroengineering PROCEDIA 32 (June 29, 2020): 223–28. https://doi.org/10.21595/vp.2020.21549.

Copy Chicago

Copied to clipboard!

Machine learning for rotating machines: simulation, diagnosis and control

Abstract

Highlights

1. Introduction

2. Shallow learning for rotor dynamics simulation and fault diagnosis

2.1. Multi-dimensional mapping for rotor dynamics simulation

2.2. Classification and pattern recognition tools for rotating machine diagnosis

3. Deep reinforcement learning for rotating machine control

4. Simulation and experimental results

5. Conclusions

References

Cited by

About this article

Related Articles