Maintenance policy for two-stage deteriorating mode system based on cumulative damage model

Ni, Xianglong; Zhao, Jianmin; Wang, Guangyan; Teng, Hongzhi

Journal of Vibroengineering

Browse Journal

Submit article

Published: May 15, 2015

Check for updates

Maintenance policy for two-stage deteriorating mode system based on cumulative damage model

Xianglong Ni¹

Jianmin Zhao²

Guangyan Wang³

Hongzhi Teng⁴

^{1, 2, 3, 4}Mechanical Engineering College, Shijiazhuang, 050003, China

Corresponding Author:

Xianglong Ni

Cite the article Download PDF

Downloads 1447

Abstract

For the system degradation process undergoing a sudden change, optimal maintenance policies were developed using the cumulative damage model and two-stage degradation modeling. Single shock damage value and the number of shock times are assumed to be normal distribution and homogeneous Poisson process, respectively. On this basis, average long-run cost rate of a renewal cycle was modeled with considering the probabilities of corrective, preventive and continuous monitoring, respectively. In order to develop an optimal policy, four types of maintenance policies (i.e., global, time-depended, adaptive and simplified adaptive policies) were analyzed with different alarm thresholds and inter-inspection time. Influence analysis of different parameters for maintenance policy was given, where different maintenance policies were compared in terms of average long-run cost rate. In addition, the impacts of degradation model parameters (i.e., change-point distribution, shock strength, shock frequency) on the average long-run cost rate were analyzed. Finally, maintenance policy for gearbox degradation experiment was analyzed in case study.

1. Introduction

Performance degradation is a common phenomenon in many systems, especially in mechanical and structural systems. Deterioration modeling plays a more and more important role in maintenance decision-making. Many researchers have mainly studied intensively systems degradation with stationary processes to optimize maintenance problems [1-4]. However, the degradation processes for many systems are non-stationary due to internal mechanism or external environment influences etc. [5]. For example, some systems are deteriorating in a process of two stages [5-11], where the degradation rate is usually small in the first stage and large in the second stage.

In order to study system degradation, it is necessary to establish corresponding degradation model. The deterioration process model with independent random increment is divided into two types, continuous time model and cumulative damage model. The continuous time model [12, 13, 14] presents system degradation in terms of continuous time stochastic process. The most representative models for continuous time model are Brownian motion model and Gamma process model. Whereas, in the cumulative damage model [15, 16, 17], it is assumed that system degradation process is discrete. The model describes the degradation process by cumulating a number of random increments caused by damage in the system operation. Existing studies for two-stage degraded system mainly focused on continuous time model. Application of cumulative damage model to two-stage degradation process was seldom investigated.

Due to the discrepancy of different deterioration modes for a two-stage degraded system, maintenance decision-making methods for single-stage degradation process system cannot be applied to two-stage deteriorating mode system. In order to solve the problem, In order to solve the problem, studies have been done and a number of maintenance strategies were developed. An activation zone method for maintenance decision-making is presented by Saassouh [6] with considering the random change of degradation rate, but the degradation rate change time is assumed to be continuous and perfectly monitored. Ponchet [12] developed two maintenance decision-making methods with and without considering the deterioration mode change in system degradation processes, respectively. The results of numerical example show that it can bring considerable benefits if a policy with changeable thresholds was used. A predictive maintenance policy for a system with two deterioration mode based on process data was proposed by Zhao [18], and the maintenance actions were implemented based on different reliability thresholds. Some maintenance policies for two-stage deteriorating mode systems have been presented, but no much study has been done to investigate the performance of different maintenance polices with different thresholds and inspection intervals.

In this paper, degradation modeling and maintenance decision-making methods for two-stage deteriorating mode system based on cumulative damage model will be investigated. The main contributions of this study are: (a) Cumulative damage model is used for two-stage degradation modeling, and it shows system degradation rate change through different shock strengths and shock frequencies. (b) An optimal policy of a two-stage degraded system is developed by analyzing and comparing four types of maintenance policies (global, time-depended, adaptive, simplified adaptive).

The remainder of this paper is organized as follows. Section 2 is devoted to two-stage deteriorating modeling based on cumulative damage model. Section 3 studies four kinds of maintenance policies and analyzes maintenance policy evaluation method. Numerical examples are used to analyze the influence of different factors on maintenance policies in Section 4. Conclusions are made in Section 5.

2. Two-stage deteriorating modeling

2.1. System description

The considered system is assumed to be an observable system which degrading stochastically. The degradation level at time $t$ is supposed to be presented by a random variable $Y (t)$ [19, 20]. The system degradation process is an increasing stochastic process with initial state $Y (t) = 0$ . System will be declared as failed when deterioration level $Y (t)$ exceeds a failure threshold $Y_{f}$ (namely $Y (t) \geq Y_{f}$ ). $T_{f}$ is defined as system failure time. System failed does not mean that the system cannot work, but implies that it’s economic and safety impacts will be unacceptable if it still in operation.

Fig. 1Two-stage degradation process

System degradation rate changes at time $t_{c}$ during a life cycle (as shown in Fig. 1). It means that the parameters of system degradation process undergo transitions at change-point $t_{c}$ . The system is supposed to be in a nominal degradation mode and denoted by $M_{1}$ before $t_{c}$ . After $t_{c}$ , the system degradation mode evolves to an accelerated mode $M_{2}$ . Degradation rate is usually small in the first stage $M_{1}$ and large in the second stage $M_{2}$ . Therefore, system degradation process can be modeled by two stochastic processes under the same law but with different parameters [6].

Most physical degradation observation and the property of the Levy processes have shown that system deterioration can be thought as the accumulation of large numbers of small shocks [21], and deterioration level can be defined as the sum of damage values due to shock. When the system is in mode $M_{k}$ , damage values $x_{k i}$ ( $k =$ 1, 2; $i =$ 0, 1, 2, …) are assumed to be normally distribution, $x_{k i} ~ N (u_{k}, σ_{k}^{2})$ . $x_{k i}$ is a constant in mode $M_{k}$ (see Fig. 1). Random variable $N_{S}$ represents the number of shock times during time [0, $t_{s}$ ] and it obeys to homogeneous Poisson distribution with strength $λ_{k}$ in mode $M_{k}$ . Therefore, the probability of shock times $N_{S}$ just as $n$ in mode $M_{k}$ is:

1

P (N_{s} = n) = \frac{{(λ_{k} t_{s})}^{n}}{n!} e^{- λ_{k} t_{s}} .

For the convenience of expression, in this paper $x_{k i}$ is denoted as shock strength and $1 / λ_{k}$ is denoted as shock frequency. It is not difficult to find that shock strength and shock frequency decide the size of degradation rate.

2.2. Degradation level modeling

In the two-stage degradation process, the value of damage time $t_{s}$ may be in the first stage ( $0 \leq t_{s} \leq t_{c}$ ) or in the second stage ( $t_{s} > t_{c}$ ). The calculation methods of degradation level are not alike in different degradation stage. According to the above notation, degradation level at time $t_{s}$ can be written as:

2

Y = Y_{t_{s}}^{1} Ι_{\{t_{s} \leq t_{c}\}} + (Y_{t_{c}}^{1} + Y_{t_{s} - t_{c}}^{2}) Ι_{\{t_{s} > t_{c}\}},

where $Y_{t_{s}}^{k}$ stands for degradation level in mode $M_{k}$ , $I_{\{E\}} = 1$ if $E$ is true and $I_{\{E\}} = 0$ otherwise. When $0 \leq t_{s} \leq t_{c}$ , the degradation level is:

3

Y = Y_{t_{s}}^{1} = \sum_{i = 0}^{N_{1}} x_{1 i} .

In this case, the probability of shock times equal to $n$ within time [0, $t_{s}$ ] is the same to Eq. (1). Due to every shock damage is independently and unrelated, it can be known from the characters of normal distribution (the sum of normal distribution parameters still in line with normal distribution) that $Y$ is obey to normal distribution, namely:

4

Y = Y_{t_{s}}^{1} ~ N (N_{1} μ_{1}, N_{1} σ_{1}^{2}) .

When $t_{s} > t_{c}$ , degradation level consists of the damage in first stage $M_{1}$ and the damage in second stage $M_{2}$ . In this case, degradation time length in the first stage is $t_{c}$ , $t_{s} - t_{c}$ to the second stage. Hence, the system degradation level is:

5

Y = Y_{t_{c}}^{1} + Y_{t_{s} - t_{c}}^{2} = \sum_{i = 0}^{N_{1}} x_{1 i} + \sum_{j = 0}^{N_{2}} x_{2 j} .

Similar to the Eq. (4), system degradation level is in line with normal distribution, there is:

6

Y = Y_{t_{c}}^{1} + Y_{t_{s} - t_{c}}^{2} ~ N (N_{1} μ_{1} + N_{2} μ_{2}, N_{1} σ_{1}^{2} + N_{2} σ_{2}^{2}) .

Because shocks between the two stages are independently, the probability of the number of shock times just as $m$ in mode $M_{1}$ and equal to $n$ in mode $M_{2}$ is:

7

P (N_{1} = m, N_{2} = n) = P (N_{1} = m) \cdot P (N_{2} = n) = \frac{{(λ_{1} t_{c})}^{m}}{m!} \frac{{(λ_{2} (t_{s} - t_{c}))}^{n}}{n!} \cdot e^{- λ_{1} t_{c} - λ_{2} (t_{s} - t_{c})} .

2.3. Reliability modeling

In engineering practice, the change-point $t_{c}$ for degradation rate is not fixed, but is distributed in a certain time interval. Shown as Fig. 1, the time distribution interval of change-point $t_{c}$ for degradation rate is [ $t_{A}$ , $t_{B}$ ], in other words, the deteriorating mode may change at any time from $M_{1}$ to $M_{2}$ when system works during time [ $t_{A}$ , $t_{B}$ ].

System reliability is the probability for degradation level $Y$ less than failure threshold $Y_{f}$ when damage time is $t_{s}$ . Shock strength, shock frequency and change-point should be considered in reliability modeling, which are main factors in cumulative damage model. Reliability modeling for two-stage degraded system is specific expressed as following.

When $0 \leq t_{s} \leq t_{c}$ , system reliability is:

8

R (t_{s}) = P (Y \leq Y_{f}) = P (\sum_{i = 0}^{N_{s}} x_{1 i} \leq Y_{f}) = \sum_{m = 0}^{\infty} Φ (\frac{Y_{f} - m μ_{1}}{\sqrt{m} σ_{1}}) \cdot \frac{{(λ_{1} t_{s})}^{m}}{m!} \cdot e^{- λ_{1} t_{s}}

= \sum_{m = 0}^{\infty} Φ (\frac{Y_{f} - m μ_{1}}{\sqrt{m} σ_{1}}) \cdot \frac{{(λ_{1} t_{s})}^{m}}{m!} \cdot e^{- λ_{1} t_{s}} .

When $t_{s} > t_{c}$ , system reliability is:

9

R (t_{s}) = P (Y < Y_{f}) = P (\sum_{i = 0}^{N_{1}} x_{1 i} + \sum_{j = 0}^{N_{2}} x_{2 j} < Y_{f})

= \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} Φ (\frac{Y_{f} - (m μ_{1} + n μ_{2})}{\sqrt{m σ_{1}^{2} + n σ_{2}^{2}}}) \cdot P (N_{1} = m, N_{2} = m)

= \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} Φ (\frac{Y_{f} - (m μ_{1} + n μ_{2})}{\sqrt{m σ_{1}^{2} + n σ_{2}^{2}}}) \cdot \frac{λ_{1}^{m} \cdot λ_{2}^{n}}{m! \cdot n!} \cdot \int_{t_{A}}^{t_{B}} (e^{- λ_{1} τ - λ_{2} (t_{s} - τ)} τ^{m} {(t - τ)}^{n} g (τ)) d τ,

where $g (t)$ is the probability density distribution function for change-point during time [ $t_{A}$ , $t_{B}$ ].

3. Maintenance policies

Research of maintenance decision-making is one of the focuses for system degradation modeling. Condition-based maintenance policy widely uses in various systems, which is structured according to the information available through on-line monitoring [12]. In order to reduce maintenance costs, preventive maintenance actions take place before system failure by monitoring. In other words, suitable monitoring method and maintenance policy can help to improve the efficiency and profitability of a system.

The selections of alarm threshold and inter-inspection time are the keys to maintenance policy. According to different alarm thresholds and inter-inspection times, this paper considers four kinds of maintenance decision-making methods. The first kind of method is global maintenance policy. There are just one alarm threshold and one inter-inspection time in this method, which are constants and never change. The second kind of method is time-depended maintenance policy. There are also one alarm threshold and one inter-inspection time in this method, but the inter-inspection time is change with system working time. The next kind of method is adaptive maintenance policy. Different alarm thresholds and inter-inspection times corresponding to different degradation rates, that is say, there are two alarm thresholds and two inter-inspection times in this method. The finally kind of method is simplified adaptive maintenance policy. This method is similar to adaptive maintenance policy, but it just has one inter-inspection time.

In the framework of this study, there are three possible maintenance actions, inspection, preventive maintenance and corrective maintenance, respectively. System is perfectly monitored through periodic monitor, and system state restores to be as good as new after preventive maintenance or corrective maintenance with negligible time.

3.1. Global maintenance policy

In order to show the importance of considering the changes of system degradation rate, traditional maintenance decision-making method is presented in the first place, which called global maintenance policy. The method just defines a single alarm threshold $Y_{A}$ and a single inter-inspection time $∆ T$ , as done in Dieulle et al. [22]. It is not difficult to find that the method only pay attention to system degradation level and ignore the degradation rate.

The possible maintenance actions which can put into practice after inspection time $T_{i}$ are defined as follows:

• If $Y (T_{i}) < Y_{A}$ , do nothing and the system is left as it is until next inspection time $T_{i + 1} = T_{i} + ∆ T$ .

• If $Y_{A} \leq Y (T_{i}) < Y_{f}$ , the system is too badly deteriorated so it is necessary to perform preventive maintenance.

• If $Y (T_{i}) \geq Y_{f}$ , the system is considered as failed and it has to be performed corrective maintenance.

The rule of global maintenance policy is illustrated in Fig. 2.

Fig. 2Global maintenance policy

3.2. Time-depended maintenance policy

As the degradation rate in mode $M_{2}$ larger than in mode $M_{1}$ , the inter-inspection time should be shorter and shorter in term of work time. This kind of maintenance decision-making method called time-depended maintenance policy. For example, the inter-inspection time of $i$ th monitor is $∆ T_{i}$ , the next inter-inspection time is $∆ T_{i + 1}$ , $∆ T_{i + 1} = q \cdot ∆ T_{i}$ and $q < 1$ .

The possible maintenance actions which can put into practice after inspection time $T_{i}$ are defined as follows:

• If $Y (T_{i}) < Y_{A}$ , do nothing and the system is left as it is until next inspection time $T_{i + 1} = T_{i} + ∆ T_{i + 1}$ .

• If $Y_{A} \leq Y (T_{i}) < Y_{f}$ , the system is too badly deteriorated so it is necessary to perform preventive maintenance.

• If $Y (T_{i}) \geq Y_{f}$ , the system is considered as failed and it has to be performed corrective maintenance.

The rule of time-depended maintenance policy is illustrated in Fig. 3.

Fig. 3Time-depended maintenance policy

3.3. Adaptive maintenance policy

According to the characteristics of system degradation rate suddenly changes from nominal mode $M_{1}$ to accelerated mode $M_{2}$ , Saassouh et al. [6, 12] put forward adaptive maintenance policy. The method is different with global maintenance policy, it considers system degradation level and degradation rate. As a result, this maintenance decision-making method is more responsive to systems with two-stage deteriorating mode.

The alarm threshold and inter-inspection time for adaptive maintenance policy are defined as follows:

10

Y = Y_{n o m} Ι_{\{t_{s} \leq t_{c}\}} + Y_{a c c} Ι_{\{t_{s} > t_{c}\}},

11

Δ T = Δ T_{n o m} Ι_{\{t_{s} \leq t_{c}\}} + Δ T_{a c c} Ι_{\{t_{s} > t_{c}\}} .

Set $Y_{n o m}$ as the alarm threshold and $∆ T_{n o m}$ is the inter-inspection time for nominal degradation mode $M_{1}$ . When the inspection time $T_{i}$ is less than change-point $t_{c}$ ( $T_{i} \leq t_{c}$ ), the possible maintenance actions which can put into practice are defined as follows:

• If $Y (T_{i}) < Y_{n o m}$ , do nothing and the system is left as it is until next inspection time $T_{i + 1} = T_{i} + ∆ T_{n o m}$ .

• If $Y_{n o m} \leq Y (T_{i}) < Y_{f}$ , the system is too badly deteriorated so it is necessary to perform preventive maintenance.

• If $Y (T_{i}) \geq Y_{f}$ , the system is considered as failed and it has to be performed corrective maintenance.

Set $Y_{a c c}$ as the alarm threshold and $∆ T_{a c c}$ is the inter-inspection time for accelerated degradation mode $M_{2}$ . When the inspection time $T_{j}$ is greater than change-point $t_{c}$ ( $T_{j} > t_{c}$ ), the possible maintenance actions which can put into practice are defined as follows:

• If $Y (T_{j}) < Y_{a c c}$ , do nothing and the system is left as it is until next inspection time $T_{j + 1} = T_{j} + ∆ T_{a c c}$ .

• If $Y_{a c c} \leq Y (T_{j}) < Y_{f}$ , the system is too badly deteriorated so it is necessary to perform preventive maintenance.

• If $Y (T_{j}) \geq Y_{f}$ , the system is considered as failed and it has to be performed corrective maintenance.

As the degradation rate for mode $M_{2}$ is greater than mode $M_{1}$ , so the maintenance policy parameters $Y_{a c c} < Y_{n o m}$ and $∆ T_{a c c} < ∆ T_{n o m}$ . The rule of adaptive maintenance policy is illustrated in Fig. 4.

Fig. 4Adaptive maintenance policy

3.4. Simplified adaptive maintenance policy

Adaptive maintenance policy so complex that not suitable for engineering application. Therefore, adaptive maintenance policy is simplified in application by some researchers [12]. Inter-inspection time $∆ T$ is a constant value and never changes in simplified adaptive maintenance policy.

The maintenance policy rule (alarm threshold, possible maintenance action) is similar to adaptive maintenance policy, the only difference is that: no matter $Y (T_{i}) < Y_{n o m}$ or $Y (T_{i}) < Y_{a c c}$ , the next inspection time always $T_{i + 1} = T_{i} + ∆ T$ (namely $∆ T_{n o m} = ∆ T_{a c c} = ∆ T$ ).

3.5. Maintenance policy evaluation

3.5.1. Evaluation method

Maintenance cost occurs when a maintenance action is performed. In this study the average long-run cost rate over an infinite time span is used to evaluate maintenance policy. As it has assumed that system state restores to as good as new if a preventive/corrective maintenance action performed, renewal reward theory [23] can be used to calculate the average long-run cost rate as follows:

12

E (C_{\infty}) = \underset{t \to \infty}{l i m} \frac{E [C (t)]}{t} = \frac{E [C (T)]}{E [T]},

where $C (t)$ is the total maintenance cost at time $t$ , $T$ is the average time length of a renewal cycle.

The total maintenance cost in a renewal cycle $T$ can be expressed as follows:

13

E [C (T)] = C_{I} E [N_{I} (T)] + C_{P} P_{P} + C_{C} P_{C} .

The expected time length of a renewal cycle $T$ is written as:

14

E [T] = P_{P} T_{P} + P_{C} T_{f} .

Adaptive maintenance policy is the most complex method relative to other three maintenance policies, which parameters obtained more difficult. In this paper, parameters obtained method of adaptive maintenance policy are mainly analyzed, parameters for other three maintenance policy can also be obtained as this method.

3.5.2. Probability of corrective maintenance

If any one event of the following events ( $A_{C 1}$ , $A_{C 2}$ , $A_{C 3}$ ) occurs, system is considered as failure. That is to say, system needs corrective replacement and it will cause corrective maintenance cost $C_{C}$ . Take the event $A_{C 1}$ as a example, system degradation process is in stage $M_{1}$ ( $T_{z - 1} < T_{z} \leq t_{c}$ ), if the degradation level $Y (T_{z - 1}) < Y_{n o m}$ for $(z - 1)$ th inspection and $Y (T_{z}) > Y_{f}$ for $z$ th inspection, corrective maintenance action will be performed.

A_{C 1} = \{Y (T_{z - 1}) < Y_{n o m} ⋂ Y (T_{z}) \geq Y_{f} ⋂ T_{z - 1} < T_{z} \leq t_{c}\},

A_{C 2} = \{Y (T_{z - 1}) < Y_{a c c} ⋂ Y (T_{z}) \geq Y_{f} ⋂ t_{c} < T_{z - 1} \leq T_{z}\},

A_{C 3} = \{Y (T_{z - 1}) < Y_{n o m} ⋂ Y (T_{z}) \geq Y_{f} ⋂ T_{z - 1} < t_{c} \leq T_{z}\} .

The probability for a corrective maintenance in a renewal cycle is expressed as:

15

P_{C} = P (A_{C 1}) + P (A_{C 2}) + P (A_{C 3}) .

System cumulative damage distribution is the probability for system degradation level $Y$ less than a certain value $y$ when shock time is $t_{s}$ . The formula of cumulative damage distribution in stage $M_{1}$ can be denoted as:

16

P (Y \leq y) = \sum_{n = 0}^{\infty} Φ (\frac{y - n μ_{1}}{\sqrt{n} σ_{1}}) \cdot P (N_{s} = n) .

Therefore, the density function of cumulative damage distribution in stage $M_{1}$ is:

17

f (y) = \frac{1}{\sqrt{2 π}} \cdot \sum_{n = 0}^{\infty} e^{- \frac{{(y - n μ_{1})}^{2}}{2 n σ_{_{1}}^{2}}} \cdot P (N_{s} = n) .

The specific analytic formula of the probability for corrective maintenance event $A_{C 1}$ is expressed as follows:

18

P (A_{C 1}) = \sum_{z = 1}^{\infty} P (Y (T_{z - 1}) < Y_{n o m}, Y (T_{z}) \geq Y_{f}) \cdot P (T_{z} \leq t_{c})

= \sum_{z = 1}^{\infty} P (Y (T_{z - 1}) < Y_{n o m}, Y (T_{z} - T_{z - 1}) \geq Y_{f} - Y (T_{z - 1})) \cdot P (T_{z} \leq t_{c})

= \sum_{z = 1}^{\infty} P (\sum_{i = 0}^{N (T_{z - 1})} x_{1 i} < Y_{n o m}, \sum_{j = 0}^{N (T_{z} - T_{z - 1})} x_{1 j} \geq Y_{f} - \sum_{i = 0}^{N (T_{z - 1})} x_{1 i}) \cdot P (T_{z} \leq t_{c})

= \sum_{z = 1}^{\infty} \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} P (\sum_{i = 0}^{m} x_{1 i} < Y_{n o m}, \sum_{j = 0}^{n} x_{1 j} \geq Y_{f} - \sum_{i = 0}^{m} x_{1 i}) \cdot P (N (T_{z - 1}) = m) \cdot P (N (T_{z} - T_{z - 1}) = n) \cdot P (T_{z} \leq t_{c})

= \frac{1}{2 π} \sum_{z = 1}^{\infty} \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} \int_{0}^{Y_{n o m}} \int_{Y_{f} - u}^{Y_{f}} {e^{- \frac{{(u - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}}}}^{- \frac{{(τ - n μ_{1})}^{2}}{2 n σ_{_{1}}^{2}}} d τ d u \cdot P (N (T_{z - 1}) = m) \cdot P (N (T_{z} - T_{z - 1}) = n) \cdot P (T_{z} \leq t_{c}),

where, $P (N (T_{z - 1}) = m)$ is the probability of the number of shock times just as $m$ during time $[0, T_{z - 1}]$ , $P (N (T_{z} - T_{z - 1}) = n)$ is the probability of the number of shock times equal to $n$ within time $[T_{z - 1}, T_{z}]$ , $P (T_{z} \leq t_{c})$ is the probability for system degradation in the first stage $M_{1}$ .

The probability for corrective maintenance events $A_{C 2}$ , $A_{C 3}$ can be expresses as follows:

19

P (A_{C 2}) = \sum_{z = 1}^{\infty} P (Y (T_{z - 1}) < Y_{a c c}, Y (T_{z}) \geq Y_{f}) \cdot P (T_{z - 1} > t_{c})

= \frac{1}{\sqrt{8 π^{3}}} \sum_{z = 1}^{\infty} \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} \sum_{l = 0}^{\infty} (\begin{matrix} \int_{0}^{Y_{a c c}} \int_{0}^{Y_{a c c} - u} \int_{Y_{f} - u - ω}^{\infty} {e^{- \frac{{(u - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}}}}^{- \frac{{(ω - n μ_{2})}^{2}}{2 n σ_{_{2}}^{2}} - \frac{{(τ - n μ_{2})}^{2}}{2 n σ_{_{2}}^{2}}} d τ d ω d u \cdot \\ P (N (t_{c}) = m) \cdot P (N (T_{z - 1} - t_{c}) = n) \cdot P (N (T_{z} - T_{z - 1}) = l) \cdot P (T_{z - 1} > t_{c}) \end{matrix}),

20

P (A_{C 3}) = \sum_{z = 1}^{\infty} P (Y (T_{z - 1}) < Y_{n o m}, Y (T_{z}) \geq Y_{f}) \cdot P (T_{z - 1} < t_{c} \leq T_{z})

= \frac{1}{\sqrt{8 π^{3}}} \sum_{z = 1}^{\infty} \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} \sum_{l = 0}^{\infty} (\begin{matrix} \int_{0}^{Y_{n o m}} \int_{Y_{f} - u}^{\infty} \int_{Y_{f} - u - ω}^{\infty} {e^{- \frac{{(u - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}}}}^{- \frac{{(ω - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}}}^{- \frac{{(τ - n μ_{2})}^{2}}{2 n σ_{_{2}}^{2}}} d τ d ω d u \cdot \\ P (N (T_{z - 1}) = m) \cdot P (N (t_{c} - T_{z - 1}) = n) \cdot P (N (T_{z} - t_{c}) = l) \cdot P (T_{z - 1} < t_{c} \leq T_{z}) \end{matrix}) .

3.5.3. Probability of preventive maintenance

If any one event of the following events ( $A_{P 1}$ , $A_{P 2}$ , $A_{P 3}$ ) occurs, it is considered that preventive replacement needs to be performed and it will cause preventive maintenance cost $C_{P}$ . When a twice continuous monitoring just happened before and after the change-point for degradation rate ( $T_{z - 1} < t_{c} \leq T_{z}$ ), if the degradation level $Y (T_{z - 1}) < Y_{n o m}$ for ( $z - 1$ )th inspection and $Y (T_{z}) \geq Y_{f}$ for $z$ th inspection, preventive maintenance action will be performed.

A_{P 1} = \{Y (T_{z - 1}) < Y_{n o m} ⋂ Y_{n o m} \leq Y (T_{z}) < Y_{f} ⋂ T_{z - 1} < T_{z} \leq t_{c}\},

A_{P 2} = \{Y (T_{z - 1}) < Y_{a c c} ⋂ Y_{a c c} \leq Y (T_{z}) < Y_{f} ⋂ t_{c} < T_{z - 1} \leq T_{z}\},

A_{P 3} = \{Y (T_{z - 1}) < Y_{n o m} ⋂ Y_{a c c} \leq Y (T_{z}) < Y_{f} ⋂ T_{z - 1} < t_{c} \leq T_{z}\} .

The probability for a preventive maintenance in a renewal cycle is expressed as:

21

P_{C} = P (A_{C 1}) + P (A_{C 2}) + P (A_{C 3}) .

The probability for preventive maintenance events $A_{P 1}$ , $A_{P 2}$ , $A_{P 3}$ can be written as follows:

22

P (A_{P 1}) = \sum_{z = 1}^{\infty} P (Y (T_{z - 1}) < Y_{n o m}, Y_{n o m} < Y (T_{z}) < Y_{f}) \cdot P (T_{z} \leq t_{c})

= \frac{1}{2 π} \sum_{z = 1}^{\infty} \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} \int_{0}^{Y_{n o m}} \int_{Y_{n o m} - u}^{Y_{f} - u} {e^{- \frac{{(u - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}}}}^{- \frac{{(τ - n μ_{1})}^{2}}{2 n σ_{_{1}}^{2}}} d τ d u \cdot P (N (T_{z - 1}) = m) P (N (T_{z} - T_{z - 1}) = n) P (T_{z} \leq t_{c}),

23

P (A_{P 2}) = \sum_{z = 1}^{\infty} P (Y (T_{z - 1}) < Y_{a c c}, Y_{a c c} \leq Y (T_{z}) < Y_{f}) \cdot P (T_{z - 1} > t_{c})

= \frac{1}{\sqrt{8 π^{3}}} \sum_{z = 1}^{\infty} \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} \sum_{l = 0}^{\infty} (\begin{matrix} \int_{0}^{Y_{a c c}} \int_{0}^{Y_{a c c} - u} \int_{Y_{a c c} - u - ω}^{Y_{f} - u - ω} {e^{- \frac{{(u - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}}}}^{- \frac{{(ω - n μ_{2})}^{2}}{2 n σ_{_{2}}^{2}} - \frac{{(τ - n μ_{2})}^{2}}{2 n σ_{_{2}}^{2}}} d τ d ω d u \cdot \\ P (N (t_{c}) = m) \cdot P (N (T_{z - 1} - t_{c}) = n) \cdot P (N (T_{z} - T_{z - 1}) = l) \cdot P (T_{z - 1} > t_{c}) \end{matrix}),

24

P (A_{P 3}) = \sum_{z = 1}^{\infty} P (Y (T_{z - 1}) < Y_{n o m}, Y_{a c c} \leq Y (T_{z}) < Y_{f}) \cdot P (T_{z - 1} < t_{c} \leq T_{z})

= \frac{1}{\sqrt{8 π^{3}}} \sum_{z = 1}^{\infty} \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} \sum_{l = 0}^{\infty} (\begin{matrix} \int_{0}^{Y_{n o m}} \int_{Y_{a c c} - u}^{Y_{f} - u} \int_{Y_{a c c} - u - ω}^{Y_{f} - u - ω} {e^{- \frac{{(u - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}}}}^{- \frac{{(ω - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}}}^{- \frac{{(τ - n μ_{2})}^{2}}{2 n σ_{_{2}}^{2}}} d τ d ω d u \cdot \\ P (N (T_{z - 1}) = m) \cdot P (N (t_{c} - T_{z - 1}) = n) \cdot P (N (T_{z} - t_{c}) = l) \cdot P (T_{z - 1} < t_{c} \leq T_{z}) \end{matrix}) .

3.5.4. Probability of continuous monitoring

If any one event of the following events ( $A_{I 1}$ , $A_{I 2}$ ) occurs, the system is left as it is until next inspection time and it will cause monitoring cost $C_{I}$ .

A_{I 1} = \{Y (T_{z}) < Y_{n o m} ⋂ T_{z} \leq t_{c}\},

A_{I 2} = \{Y (T_{z}) < Y_{a c c} ⋂ T_{z} > t_{c}\} .

The probability for system left until next inspection in a renewal cycle can be expressed as:

25

P_{I} = P (A_{I 1}) + P (A_{I 2}) .

The probability for continuous monitoring events $A_{I 1}$ , $A_{I 2}$ can be written as follows:

26

P (A_{I 1}) = \sum_{z = 0}^{\infty} P (Y (T_{z}) < Y_{n o m}) \cdot P (T_{z} \leq t_{c})

= \frac{1}{\sqrt{2 π}} \sum_{z = 0}^{\infty} \sum_{m = 0}^{\infty} \int_{0}^{Y_{n o m}} e^{- \frac{{(u - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}}} d u \cdot P (N (T_{z}) = m) \cdot P (T_{z} \leq t_{c}),

27

P (A_{I 2}) = \sum_{z = 0}^{\infty} P (Y (T_{z}) < Y_{a c c}) \cdot P (T_{z} > t_{c})

= \frac{1}{2 π} \sum_{z = 0}^{\infty} \sum_{m = 0}^{\infty} \int_{0}^{Y_{a c c}} \int_{0}^{Y_{a c c} - u} e^{- \frac{{(u - m μ_{1})}^{2}}{2 m σ_{_{1}}^{2}} - \frac{{(τ - m μ_{2})}^{2}}{2 m σ_{_{2}}^{2}}} d τ d u \cdot P (N (t_{c}) = m) \cdot P (N (T_{z} - t_{c}) = n) \cdot P (T_{z} > t_{c}) .

Average number of monitoring actions in a renewal cycle $T$ is:

28

E [N_{I} (T)] = \sum_{z = 0}^{\infty} z P_{I} .

3.5.5. Expected time length of renewal cycle

As Eq. (14) shown, expected time length of renewal cycle $E [T]$ is affected by system life $T_{f}$ and average work time length $T_{P}$ when system ends with preventive replacement. If system faults, it is considered that system will not work any time. Therefore, the expected time length $T_{f}$ when system ends with corrective maintenance is the time interval for degradation level from initial value 0 to failure threshold $Y_{f}$ . However, the expected time length $T_{P}$ when system ends with preventive maintenance is different. System will no longer work if monitoring shows preventive replacement should be performed, so system life when system ends with preventive maintenance is times of inter-inspection time $∆ T$ .

System fault occurs in the second stage when system degradation with two-stage mode. System life $T_{f}$ is affected by shock strength $x_{1 i}$ , $x_{2 j}$ and change-point $t_{c}$ . The system mean time to failure is:

29

E [T_{f}] = \int_{t_{c}}^{\infty} R (t) d t

= \sum_{m = 0}^{\infty} \sum_{n = 0}^{\infty} Φ (\frac{Y_{f} - (m μ_{1} + n μ_{2})}{\sqrt{m σ_{1}^{2} + n σ_{2}^{2}}}) \cdot \frac{λ_{1}^{m} \cdot λ_{2}^{n}}{m! \cdot n!} \cdot \int_{t_{A}}^{t_{B}} \int_{τ}^{\infty} (e^{- λ_{1} τ - λ_{2} (t - τ)} \cdot τ^{m} \cdot {(t - τ)}^{n} \cdot g (τ)) d t d τ .

The average system life when system ends with preventive maintenance is:

30

E [P_{P} T_{P}] = P (A_{P 1}) z_{P 1} Δ T_{n o m} + P (A_{P 2}) (t_{c} + \int_{0}^{Δ T_{n o m}} ω f_{ω} d ω + z_{P 2} Δ T_{a c c})

+ P (A_{P 3}) (t_{c} + \int_{0}^{Δ T_{n o m}} ω f_{ω} d ω) .

Fig. 5The definition of ω

Where $z_{p 1}$ is the number of monitoring in mode $M_{1}$ for event $A_{p 1}$ , $z_{p 2}$ is the number of monitoring in mode $M_{2}$ for event $A_{p 2}$ , the time length from $t_{c}$ to next inspection time is $w$ (as shown in Fig. 5), $f_{w}$ is the distribution density function of w during [ $T_{z - 1}$ , $T_{z}$ ].

4. Influence analysis of different parameters for maintenance policy

This section aims to find some characteristics for two-stage deteriorating mode system: (a) Making a comparison of the average long-run cost rate for different maintenance policies in the same situation in order to raise the awareness of monitoring method. (b) Studying the influence of parameters in the degradation model and average long-run cost model, for the purpose of improving the understanding in two-stage deteriorating modeling and developing an optimal maintenance policy.

4.1. Choice of parameters values

In this study, the time distribution of change-point $t_{c}$ is assumed to follow uniform distribution. In order to emphasize the influence of distribution of change-point $t_{c}$ , different uniform distribution parameters are considered as follows:

• Change-point of two-stage degradation mode: $t_{c} ~ U (1, 200)$ .

• Early change-point of two-stage degradation mode: $t_{c} ~ U (1, 100)$ .

• Middle change-point of two-stage degradation mode: $t_{c} ~ U (50, 150)$ .

• Late change-point of two-stage degradation mode: $t_{c} ~ U (100, 200)$ .

The upper limit of change-point distribution 200 is considered that a majority of system failures occur in degradation mode $M_{2}$ and seldom in degradation mode $M_{1}$ . Early and late time distributions present the first and second half of full change-point distribution, respectively.

In order to make the influence analysis of different parameters more close to the actual situation of gearbox degradation process, the selection of failure threshold $Y_{f}$ is based on the actual value of gearbox life-cycle experiment in Section 5. Therefore, the failure threshold is evaluated as $Y_{f} =$ 10000 g² in this study. Meanwhile, for the purpose of ensuring the credibility of optimization results for global maintenance policy (the optimization results will be regarded as a basis of comparison), the unit maintenance costs are evaluated as other literatures [1, 12, 24], so $C_{I}$ = 5, $C_{P}$ = 50, $C_{C}$ = 100.

4.2. Influence of maintenance policy

Fig. 6EC∞ is affected by adaptive maintenance policy parameters

a) $E (C_{\infty})$ is affected by $Y_{n o m}$ and $T_{a c c}$

b) $E (C_{\infty})$ is affected by $Y_{a c c}$ and $T_{a c c}$

c) $E (C_{\infty})$ is affected by $T_{n o m}$ and $T_{a c c}$

For the purpose of studying on the influence of different monitoring methods, the four kinds of condition-based maintenance policies (global, time-depended, simplified adaptive and adaptive) presented in Section 3 are assessed. Because adaptive maintenance policy is the most complex in the four methods, an example focusing on adaptive policy analyzing is presented to show the approach of obtaining minimal average long-run maintenance cost rate $E (C_{\infty})$ . When $x_{1 i} ~ N$ (10, 20²), $x_{2 j} ~ N$ (40, 80²) and $λ_{1} = λ_{2} = 1$ , $E (C_{\infty})$ is affected by adaptive maintenance policy parameters $Y_{n o m}$ , $Y_{a c c}$ , $T_{n o m}$ , $T_{a c c}$ , as shown in Fig. 6. It illustrates that $Y_{n o m}$ , $Y_{a c c}$ , $T_{n o m}$ , $T_{a c c}$ should be considered at the same time when optimizing $E (C_{\infty})$ . Contour map (Fig. 7) shows $E (C_{\infty})$ for different values of $Y_{a c c}$ and $T_{a c c}$ under adaptive maintenance decision when $Y_{n o m}$ = 8000, $∆ T_{n o m}$ = 70. $E (C_{\infty})$ in the same contour are equal. It can be seen that optimal parameter values which minimize the cost rate ( $E_{4} (C_{\infty})$ = 0.3547) are $Y_{a c c}$ = 7000 and $∆ T_{a c c}$ = 37 when $Y_{n o m}$ = 8000, $∆ T_{n o m}$ = 70.

Fig. 7EC∞ under adaptive maintenance policy parameters (Ynom = 8000, ∆Tnom = 70)

As shown in Table 1, for the global maintenance policy, the optimal inter-inspection time $Δ T^{1}$ falls in between $Δ T_{n o m}^{4}$ and $Δ T_{a c c}^{4}$ for adaptive maintenance policy. The situation of $Δ T^{3}$ for simplified adaptive maintenance policy is in between $Δ T_{n o m}^{4}$ and $Δ T_{a c c}^{4}$ , too. $Δ T^{2}$ , the first inter-inspection time for time-depended maintenance policy, is the largest optimal inter-inspection time for the four kinds of condition-based maintenance policies.

The expected costs for the four kinds of maintenance policies are different because they are impacted by alarm thresholds and inter-inspection times. As the monitoring method for global maintenance policy does not consider the influence of degradation rate changes, taking the expected cost $E_{1 (C_{\infty})}$ of global maintenance policy as a basis of comparison, the expected cost $E_{2 (C_{\infty})}$ for time-depended maintenance policy have a decrease of 0.0286, the optimal value is 7.69 % of $E_{1 (C_{\infty})}$ . As shown in Table 1, it is obviously that the expected costs for other three maintenance policies are optimized compare with the average cost for global maintenance policy. Adaptive maintenance policy is better than simplified adaptive maintenance policy. Time-depended maintenance policy is the best replacement strategy for the system with two-stage degradation process.

4.3. Influence of change-point distribution

$E (C_{\infty})$ for different change-point distributions are computed, see results in Table 1. It can be noticed that the decreases of expected costs for different maintenance policies (time-depended, simplified adaptive, adaptive) are 3.94 %, 1.59 %, 3.14 %, respectively, when the change-point is in early distribution $U$ (1, 100). The impact of average long-run maintenance costs rate when the change-point is in middle distribution $U$ (50, 150) and late distribution $U$ (100, 200) are also given in Table 1. These results show that more the time of change-point occurs late, more the maintenance policies have a decrease on $E (C_{\infty})$ .

As seen previously, decreases of 7.69 %, 2.71 %, 4.68 % can be obtained respectively for different maintenance policies when the change-point distribution is $U$ (1, 200), and they are 6.53 %, 2.61 %, 4.34 %, respectively for the case that the change-point distribution is $U$ (50, 150). It is not difficult to find that the decreases for former are larger than latter. Meanwhile, in the two situations, the mean value of change-point distribution is the same, both equal to 100. Therefore, it is shown that more profits can be obtained in using different maintenance policies for a larger interval of time distribution.

The results show that the distribution of change-point impact on expected costs for different maintenance policies. More late for change-point occurs and more a large time interval of change-point distribution, more benefits can be obtained.

4.4. Influence of shock strength

In degradation modeling based on cumulative damage model, degradation rate is decided by shock strength and shock frequency. The degradation rate is in direct proportion to shock strength and shock frequency. Hence, the degradation rate can be expressed by shock strengths if the shock frequencies are the same.

The influence of different maintenance policies and change-point distribution have been analyzed for a two-stage degraded system in Table 1 when $x_{1 i} ~ N$ (10, 20²), $x_{2 j} ~ N$ (40, 80²). The same computed results with $x_{1 i} ~ N$ (10, 20²) and $x_{2 j} ~ N$ (20, 40²) are shown in Table 2. The degradation rate size of mode $M_{2}$ is four times superior than the mode $M_{1}$ in Table 1, while it is twice in Table 2. From Table 1 and Table 2, it can be known that the decreases are more considerable in Table 1. That is say, the profits is more considerable when degradation rate changes more significantly between mode $M_{2}$ and mode $M_{1}$ .

Table 1Influence of different maintenance policies and change-point time distribution (x1i~N(10, 202), x2j~N(40, 802), λ1=λ2= 1)

Time distribution	Policy structure	Optimal parameters	Expected cost	Impact
$t_{c} ~U (1, 200)$	Global	$Y_{A}^{1} = 4700$ , $Δ T^{1} = 66$	$E_{1} (C_{\infty}) = 0.3721$
	Time-depended	$Y_{A}^{2} = 5300$ , $Δ T^{2} = 111$ , $q = 0.66$	$E_{2} (C_{\infty}) = 0.3435$	0.0286 (7.69 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 7750$ , $Y_{a c c}^{3} = 5250$ , $Δ T^{3} = 60$	$E_{3} (C_{\infty}) = 0.3620$	0.0101 (2.71 %)
	Adaptive	$Y_{n o m}^{4} = 8000$ , $Δ T_{n o m}^{4} = 70$ , $Y_{a c c}^{4} = 7000$ , $Δ T_{a c c}^{4} = 37$	$E_{4} (C_{\infty}) = 0.3547$	0.0174 (4.68 %)
$t_{c} ~ U (1, 100)$	Global	$Y_{A}^{1} = 5300$ , $Δ T^{1} = 62$	$E_{1} (C_{\infty}) = 0.4268$
	Time-depended	$Y_{A}^{2} = 7100$ , $Δ T^{2} = 84$ , $q = 0.66$	$E_{2} (C_{\infty}) = 0.4100$	0.0168 (3.94 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 7750$ , $Y_{a c c}^{3} = 5000$ , $Δ T = 66$	$E_{3} (C_{\infty}) = 0.4200$	0.0068 (1.59 %)
	Adaptive	$Y_{n o m}^{4} = 8400$ , $Δ T_{n o m}^{4} = 68$ , $Y_{a c c}^{4} = 6800$ , $Δ T_{a c c}^{4} = 38$	$E_{4} (C_{\infty}) = 0.4134$	0.0134 (3.14 %)
$t_{c} ~ U (50, 150)$	Global	$Y_{A}^{1} = 4700$ , $Δ T^{1} = 74$	$E_{1} (C_{\infty}) = 0.3598$
	Time-depended	$Y_{A}^{2} = 6100$ , $Δ T^{2} = 95$ , $q = 0.72$	$E_{2} (C_{\infty}) = 0.3363$	0.0235 (6.53 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 7750$ , $Y_{a c c}^{3} = 5000$ , $Δ T^{3} = 69$	$E_{3} (C_{\infty}) = 0.3504$	0.0094 (2.61 %)
	Adaptive	$Y_{n o m}^{4} = 7800$ , $Δ T_{n o m}^{4} = 76$ , $Y_{a c c}^{4} = 7000$ , $Δ T_{a c c}^{4} = 37$	$E_{4} (C_{\infty}) = 0.3442$	0.0156 (4.34 %)
$t_{c} ~ U (100, 200)$	Global	$Y_{A}^{1} = 4300$ , $Δ T^{1} = 76$	$E_{1} (C_{\infty}) = 0.3104$
	Time-depended	$Y_{A}^{2} = 5000$ , $Δ T^{2} = 125$ , $q = 0.69$	$E_{2} (C_{\infty}) = 0.2810$	0.0294 (9.47 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 7500$ , $Y_{a c c}^{3} = 5000$ , $Δ T^{3} = 69$	$E_{3} (C_{\infty}) = 0.3017$	0.0087 (2.80 %)
	Adaptive	$Y_{n o m}^{4} = 8400$ , $Δ T_{n o m}^{4} = 84$ , $Y_{a c c}^{4} = 6600$ , $Δ T_{a c c}^{4} = 43$	$E_{4} (C_{\infty}) = 0.2942$	0.0162 (5.22 %)

Table 2Influence of different maintenance policies and change-point time distribution (x1i~N(10, 202), x2j~N(20, 402), λ1=λ2= 1)

Time distribution	Policy structure	Optimal parameters	Expected cost	Impact
$t_{c} ~ U (1, 200)$	Global	$Y_{A}^{1} = 6600$ , $Δ T^{1} = 89$	$E_{1} (C_{\infty}) = 0.2395$
	Time-depended	$Y_{A}^{2} = 6500$ , $Δ T^{2} = 105$ , $q = 0.96$	$E_{2} (C_{\infty}) = 0.2297$	0.0098 (4.09 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 8000$ , $Y_{a c c}^{3} = 7000$ , $Δ T^{3} = 84$	$E_{3} (C_{\infty}) = 0.2348$	0.0047 (1.96 %)
	Adaptive	$Y_{n o m}^{4} = 8000$ , $Δ T_{n o m}^{4} = 95$ , $Y_{a c c}^{4} = 7000$ , $Δ T_{a c c}^{4} = 75$	$E_{4} (C_{\infty}) = 0.2304$	0.0091 (3.80 %)
$t_{c} ~ U (1, 100)$	Global	$Y_{A}^{1} = 6600$ , $Δ T^{1} = 92$	$E_{1} (C_{\infty}) = 0.2513$
	Time-depended	$Y_{A}^{2} = 6100$ , $Δ T^{2} = 126$ , $q = 0.96$	$E_{2} (C_{\infty}) = 0.2427$	0.0086 (3.42 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 8000$ , $Y_{a c c}^{3} = 6750$ , $Δ T^{3} = 91$	$E_{3} (C_{\infty}) = 0.2484$	0.0029 (1.15 %)
	Adaptive	$Y_{n o m}^{4} = 8000$ , $Δ T_{n o m}^{4} = 85$ , $Y_{a c c}^{4} = 7500$ , $Δ T_{a c c}^{4} = 65$	$E_{4} (C_{\infty}) = 0.2440$	0.0073 (2.99 %)
$t_{c} ~ U (50, 150)$	Global	$Y_{A}^{1} = 6900$ , $Δ T^{1} = 95$	$E_{1} (C_{\infty}) = 0.2314$
	Time-depended	$Y_{A}^{2} = 6700$ , $Δ T^{2} = 114$ , $q = 0.90$	$E_{2} (C_{\infty}) = 0.2217$	0.0097 (4.19 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 7750$ , $Y_{a c c}^{3} = 6500$ , $Δ T^{3} = 99$	$E_{3} (C_{\infty}) = 0.2272$	0.0042 (1.82 %)
	Adaptive	$Y_{n o m}^{4} = 8000$ , $Δ T_{n o m}^{4} = 90$ , $Y_{a c c}^{4} = 7500$ , $Δ T_{a c c}^{n o m} = 65$	$E_{4} (C_{\infty}) = 0.2228$	0.0086 (3.72 %)
$t_{c} ~ U (100, 200)$	Global	$Y_{A}^{1} = 6000$ , $Δ T^{1} = 110$	$E_{1} (C_{\infty}) = 0.2125$
	Time-depended	$Y_{A}^{2} = 7300$ , $Δ T^{2} = 128$ , $q = 0.81$	$E_{2} (C_{\infty}) = 0.2012$	0.0113 (5.32 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 8500$ , $Y_{a c c}^{3} = 6250$ , $Δ T^{3} = 106$	$E_{3} (C_{\infty}) = 0.2080$	0.0045 (2.12 %)
	Adaptive	$Y_{n o m}^{4} = 8000$ , $Δ T_{n o m}^{4} = 95$ , $Y_{a c c}^{4} = 7000$ , $Δ T_{a c c}^{4} = 70$	$E_{4} (C_{\infty}) = 0.2042$	0.0083 (3.91 %)

4.5. Influence of shock frequency

As the previously analysis, $E (C_{\infty})$ for different shock frequencies can be obtained when shock strengths are the same. When $x_{1 i} ~ N$ (10, 20²), $x_{2 j} ~ N$ (40, 80²), $t_{c} \in$ [50, 150], shock frequency parameters are $λ_{1} = λ_{2} =$ 0.5, $λ_{1} = λ_{2} =$ 1, $λ_{1} = λ_{2} =$ 2, respectively, $E (C_{\infty})$ obtained in Table 3. As results obtained in Section 4.2, it can be known that adaptive maintenance policy is better than simplified adaptive maintenance policy, time-depended maintenance policy is the best replacement strategy for the system with two-stage degradation process.

Further analysis, $E (C_{\infty})$ for different maintenance policies are respectively 2.64 %, 4.78 %, 8.28 % when degradation model parameters $λ_{1} =$ 1, $λ_{2} =$ 0.5, $x_{1 i} ~ N$ (10, 20²), $x_{2 j} ~ N$ (40, 80²) and respectively 5.38 %, 1.38 %, 2.54 % when degradation model parameters $λ_{1} =$ 1, $λ_{2} =$ 2, $x_{1 i} ~ N$ (10, 20²), $x_{2 j} ~ N$ (40, 80²) as shown in Table 3. The degradation rate size of mode $M_{2}$ is eight times superior than the mode $M_{1}$ in the former, while it is twice in the latter. It can be known from the results that adaptive replacement policy is always better than simplified adaptive maintenance policy, especially under the situation that degradation rate undergoes change hugely. But the time-depended monitor method is no suitable for a system which the degradation rate in mode $M_{2}$ is significantly larger than the degradation rate in mode $M_{1}$ .

Table 3Influence of different maintenance policies and shock frequencies (x1i~N(10, 202), x2j~N(40, 802), tc∈ [50, 150])

Time distribution	Policy structure	Optimal parameters	Expected cost	Impact
$λ_{1} = λ_{2} = 0.5$	Global	$Y_{A}^{1} = 4200$ , $Δ T^{1} = 54$	$E_{1} (C_{\infty}) = 0.5305$
	Time-depended	$Y_{A}^{2} = 6300$ , $Δ T^{2} = 67$ , $q = 0.65$	$E_{2} (C_{\infty}) = 0.5019$	0.0286 (5.39 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 7000$ , $Y_{a c c}^{3} = 4500$ , $Δ T^{3} = 50$	$E_{3} (C_{\infty}) = 0.5152$	0.0153 (2.88 %)
	Adaptive	$Y_{n o m}^{4} = 7400$ , $Δ T_{n o m}^{4} = 52$ , $Y_{a c c}^{4} = 6400$ , $Δ T_{a c c}^{4} = 26$	$E_{4} (C_{\infty}) = 0.5046$	0.0259 (4.88 %)
$λ_{1} = λ_{2} = 1$	Global	$Y_{A}^{1} = 4700$ , $Δ T^{1} = 74$	$E_{1} (C_{\infty}) = 0.3598$
	Time-depended	$Y_{A}^{2} = 6100$ , $Δ T^{2} = 95$ , $q = 0.72$	$E_{2} (C_{\infty}) = 0.3363$	0.0235 (6.53 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 7750$ , $Y_{a c c}^{3} = 5000$ , $Δ T^{3} = 69$	$E_{3} (C_{\infty}) = 0.3504$	0.0094 (2.61 %)
	Adaptive	$Y_{n o m}^{4} = 7800$ , $Δ T_{n o m}^{4} = 76$ , $Y_{a c c}^{4} = 7000$ , $Δ T_{a c c}^{4} = 37$	$E_{4} (C_{\infty}) = 0.3442$	0.0156 (4.34 %)
$λ_{1} = λ_{2} = 2$	Global	$Y_{A}^{1} = 6000$ , $Δ T^{1} = 107$	$E_{1} (C_{\infty}) = 0.2168$
	Time-depended	$Y_{A}^{2} = 6900$ , $Δ T^{2} = 160$ , $q = 0.69$	$E_{2} (C_{\infty}) = 0.2051$	0.0117 (5.40 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 7500$ , $Y_{a c c}^{3} = 6500$ , $Δ T^{3} = 103$	$E_{3} (C_{\infty}) = 0.2135$	0.0033 (1.52 %)
	Adaptive	$Y_{n o m}^{4} = 8000$ , $Δ T_{n o m}^{4} = 110$ , $Y_{a c c}^{4} = 7000$ , $Δ T_{a c c}^{4} = 80$	$E_{4} (C_{\infty}) = 0.2097$	0.0071 (3.27 %)
$\begin{array}{l} λ_{1} = 1 \\ λ_{2} = 0.5 \end{array}$	Global	$Y_{A}^{1} = 4500$ , $Δ T^{1} = 53$	$E_{1} (C_{\infty}) = 0.4841$
	Time-depended	$Y_{A}^{2} = 5800$ , $Δ T^{2} = 71$ , $q = 0.69$	$E_{2} (C_{\infty}) = 0.4713$	0.0128 (2.64 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 7000$ , $Y_{a c c}^{3} = 4500$ , $Δ T^{3} = 45$	$E_{3} (C_{\infty}) = 0.4611$	0.0230 (4.78 %)
	Adaptive	$Y_{n o m}^{4} = 8200$ , $Δ T_{n o m}^{4} = 59$ , $Y_{a c c}^{4} = 5800$ , $Δ T_{a c c}^{4} = 26$	$E_{4} (C_{\infty}) = 0.4440$	0.0401 (8.28 %)
$\begin{array}{l} λ_{1} = 1 \\ λ_{2} = 2 \end{array}$	Global	$Y_{A}^{1} = 6800$ , $Δ T^{1} = 100$	$E_{1} (C_{\infty}) = 0.2323$
	Time-depended	$Y_{A}^{2} = 6400$ , $Δ T^{2} = 155$ , $q = 0.75$	$E_{2} (C_{\infty}) = 0.2198$	0.0125 (5.38 %)
	Simplified Adaptive	$Y_{n o m}^{3} = 8250$ , $Y_{a c c}^{3} = 6750$ , $Δ T^{3} = 98$	$E_{3} (C_{\infty}) = 0.2291$	0.0032 (1.38 %)
	Adaptive	$Y_{n o m}^{4} = 8300$ , $Δ T_{n o m}^{4} = 104$ , $Y_{a c c}^{4} = 6500$ , $Δ T_{a c c}^{4} = 94$	$E_{4} (C_{\infty}) = 0.2264$	0.0059 (2.54 %)

5. Case study

A case study is carried out for a gearbox deterioration modeling and decision-making on maintenance using experiment data. In the case study, a gearbox life-cycle experiment has done to obtain the degradation data that a gearbox ran from new to failure. The experiment rig is shown in Fig. 8, where four accelerometers are fitted onto the casing of gearbox to record vibration data. In the experiment, the sampling frequency is 20 kHz. Lots of equal-spaced vibration monitoring performed in the test process. Each vibration monitoring provides a date file collected in 2 seconds at every 5 minutes, twelve groups of date files are collected in every hour. The magnetic brake provide about 2-2.5 times of the rated torque of gearbox in order to accelerate the test and reduce the lifetime of gearbox.

Fig. 8Experiment rig (1 – load, 2 – accelerometers, 3 – sensor of speed and torque, 4 – electromotor, 5 – test bed, 6 – gearbox system)

Fig. 9Gear after experiment

Fig. 10Special frequency band energy of vibration signal

The total experimental time is 450 hours, the gear after experiment is shown in Fig. 9. As shown in Fig. 10, the special frequency band energy of vibration signal presents that degradation process of gearbox is obviously two-stage process. Using linear fitting analysis, degradation parameters of gearbox are obtained as follows: $x_{1 i} ~ N$ (12, 25²), $x_{2 i} ~ N$ (76, 130²), $t_{c} \in$ [0, 450], $λ_{1} = λ_{2} =$ 1. Meanwhile, the failure threshold is evaluated as $Y_{f}$ = 10000g².

Based on the proposed model and maintenance policies, optimal results for different maintenance policies of gearbox are shown in Table 4. Because the degradation rate of second stage for two-stage deteriorating mode system is faster than the first stage, the alarm threshold and inter-inspection time for the second stage should be smaller than the first stage. The initial inter-inspection time of time-depended maintenance policy $∆ T^{2}$ is larger than the inter-inspection time of global maintenance policy $∆ T^{1}$ , but inter-inspection time of time-depended maintenance policy is smaller and smaller with working time, as a result the expected cost $E_{2} (C_{\infty})$ make a decrease of 0.0108 from $E_{1} (C_{\infty})$ . The alarm thresholds of the first stage for adaptive and simplified adaptive maintenance policy $Y_{n o m}^{4}$ , $Y_{n o m}^{3}$ are both larger than alarm threshold of global maintenance policy $Y_{A}^{1}$ , but the alarm thresholds of the second stage $Y_{a c c}^{4}$ , $Y_{a c c}^{3}$ are both smaller than $Y_{A}^{1}$ . Meanwhile, the inter-inspection times $∆ T^{1}$ , $∆ T^{2}$ both between $Δ T_{n o m}^{4}$ and $Δ T_{a c c}^{4}$ . These phenomena conform to the conjecture in modeling. If use adaptive or simplified adaptive maintenance policy, the average long-run cost can reduce 9.29 %, 6.43 %, respectively. It can be seen that adaptive maintenance policy is the best method for gearbox.

Table 4Optimal results for different maintenance policies

Policy structure	Optimal parameters	Expected cost	Impact
Global	$Y_{A}^{1} = 6400$ , $Δ T^{1} = 130$	$E_{1} (C_{\infty}) = 0.2971$
Time-depended	$Y_{A}^{2} = 7500$ , $Δ T^{2} = 142$ , $q = 0.72$	$E_{2} (C_{\infty}) = 0.2863$	0.0108 (3.64 %)
Simplified Adaptive	$Y_{n o m}^{3} = 7200$ , $Y_{a c c}^{3} = 6100$ , $Δ T^{3} = 115$	$E_{3} (C_{\infty}) = 0.2780$	0.0191 (6.43 %)
Adaptive	$Y_{n o m}^{4} = 8000$ , $Δ T_{n o m}^{4} = 138$ , $Y_{a c c}^{4} = 6000$ , $Δ T_{a c c}^{4} = 103$	$E_{4} (C_{\infty}) = 0.2695$	0.0276 (9.29 %)

6. Conclusions

This paper is meant to investigate degradation modeling and maintenance decision-making methods for two-stage deteriorating mode system, where the degradation rate is usually small in the first stage and large in the second stage. To this purpose, degradation level modeling and reliability modeling based on cumulative damage model are studied at first place, then four kinds of maintenance policies (global, time-depended, adaptive, simplified adaptive) are studied and evaluated through their average long-run cost rate. The four kinds of maintenance policies are differentiated from alarm threshold and inter-inspection time.

Moreover, influence analysis of different parameters for maintenance policy is studied and proves that: (a) It is necessary to consider degradation process undergoing a sudden change in maintenance policy, suitable maintenance policy can help to improve system efficiency. (b) It is obvious that the average long-run cost rate is impacted by change-point distribution, shock strength and shock frequency.

The case study of degradation data analysis for gearbox life-cycle experiment shows that degradation process of gearbox presents obviously two-stage feature. In addition, it is helpful to reduce the average maintenance cost by choosing appropriate maintenance policy.

References

Grall A., Dieulle L., Brenguer C., Roussignol M. Continuous-time preventive maintenance scheduling for a deteriorating system. IEEE Transactions on Reliability, Vol. 51, Issue 2, 2002, p. 141-150.

Search CrossRef
Wang H. Z. A survey of maintenance policies of deteriorating systems. Europian Journal of Operational Research, Vol. 139, Issue 3, 2002, p. 469-489.

Search CrossRef
Noortwijk J. M. V., Kallen M. J. Optimal periodic inspection of a deterioration process with sequential condition states. International Journal of Pressure Vessels and Piping, Vol. 83, Issue 4, 2006, p. 249-255.

Search CrossRef
Noortwijk J. M. V., Frangopol D. M. Two probabilistic life-cycle maintenance models for deteriorating civil infrastructures. Probabilistic Engineering Mechanics, Vol. 19, Issue 4, 2004, p. 345-359.

Search CrossRef
Mitra F., Antoine G., Laurence D. On the use of on-line detection for maintenance of gradually deteriorating systems. Reliability Engineering & System Safety, Vol. 93, Issue 12, 2008, p. 1814-1820.

Search CrossRef
Saassouh B., Dieulle L., Grall A. Online maintenance policy for a deterioration system with random change of mode. Reliability Engineering and System Safety, Vol. 92, Issue 12, 2007, p. 1677-1685.

Search CrossRef
Deloux E., Castanier B., Berenguer C. Maintenance policy for a non-stationary deteriorating system. Annual Reliability and Maintainability Symposium, Las Vegas, 2008.

Search CrossRef
Wang W. B. A two-stage prognosis model in condition based maintenance. European Journal of Operational Research, Vol. 182, Issue 3, 2007, p. 1177-1187.

Search CrossRef
Wang Z., Huang H. Z., Li Y., Xiao N. C. An approach to reliability assessment under degradation and shock process. IEEE Transactions on Reliability, Vol. 60, Issue 4, 2011, p. 852-863.

Search CrossRef
Liu Y., Huang H. Z. Optimal replacement policy for multi-state system under imperfect maintenance. IEEE Transactions on Reliability, Vol. 59, Issue 3, 2010, p. 483-495.

Search CrossRef
Liu Y., Huang H. Z. Optimal selective maintenance strategy for multi-state systems under imperfect maintenance. IEEE Transactions on Reliability, Vol. 59, Issue 2, 2010, p. 356-367.

Search CrossRef
Ponchet A., Fouladirad M., Grall A. Assessment of a maintenance model of a multi-deteriorating mode system. Reliability Engineering & System Safety, Vol. 95, Issue 11, 2010, p. 1244-1254.

Search CrossRef
Si X. S., Wang W. B., Hu C. H., Chen M. Y., Zhou D. H. A Wiener-process-based degradation model with a recursive filter algorithm for remaining useful life estimation. Mechanical Systems and Signal Processing, Vol. 35, Issues 1-2, 2013, p. 219-237.

Search CrossRef
Minh D. L., Cher M. T. Optimal maintenance strategy of deteriorating system under imperfect maintenance and inspection using mixed inspection scheduling. Reliability Engineering and System Safety, Vol. 113, 2013, p. 21-29.

Search CrossRef
Qian C. H., Nakamura S., Nakagawa T. Cumulative damage model with two kinds of shocks and its application to the backup policy. Journal of the Operations Research, Vol. 42, Issue 4, 1999, p. 501-511.

Search CrossRef
Song S. L., Coit D. W., Feng Q. M., Peng H. Reliability analysis for multi-component systems subject to multiple dependent competing failure process. IEEE Transactions on Reliability, Vol. 63, Issue 1, 2014, p. 331-345.

Search CrossRef
Wang X. L., Jiang P., Guo B., Cheng Z. J. Real-time reliability evaluation based on damaged measurement degradation data. Journal of Central South University, Vol. 19, Issue 11, 2012, p. 3162-3169.

Search CrossRef
Zhao Z., Wang F., Jia M., Wang S. Preventive maintenance policy based on process data. Chemometrics and Intelligent Laboratory Systems, Vol. 103, Issue 2, 2010, p. 137-143.

Search CrossRef
Grall A., Berenguer C., Dieulle L. A condition-based maintenance policy for stochastically deteriorating systems. Reliability Engineering & System Safety, Vol. 76, Issue 2, 2002, p. 167-180.

Search CrossRef
Grall A., Dieulle L., Berenguer C., Roussignol M. Asymptotic failure rate of a continuously monitored system. Reliability Engineering & System Safety, Vol. 91, Issue 2, 2006, p. 126-130.

Search CrossRef
Dagg R. A. Optimal Inspection and Maintenance for Stochastically Deteriorating Systems. Ph.D. Thesis, the City University, London, 1999.

Search CrossRef
Dieulle L., Berenguer C., Grall A., Roussignol M. Sequential condition-based maintenance scheduling for a deteriorating system. European Journal of Operational Research, Vol. 150, Issue 2, 2003, p. 451-461.

Search CrossRef
Sheldon M. R. Stochastic Processes for Insurance and Finance. Wiley Series in Probability and Statistics, Johon Wiley & Sons, New York, 1996, p. 639.

Search CrossRef
Mitra F., Grall A. Condition-based maintenance for a system subject to a non-homogeneous wear process with a wear rate transition. Reliability Engineering & System Safety, Vol. 96, Issue 6, 2011, p. 611-618.

Search CrossRef

About this article

Received

December 24, 2014

Accepted

April 10, 2015

Published

May 15, 2015

SUBJECTS

Fault diagnosis based on vibration signal analysis

Keywords

cumulative damage model

two-stage degradation

degradation level

maintenance policy

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.