Journals & Magazines >IEEE Access >Volume: 12

Transfer Diagnosis Model of Internal Combustion Engine With Embedded Vibration Signal Impact Decomposition

This paper proposed a transfer diagnosis model based on the deep subdomain adaptive network framework. In the framework, adaptive decomposition layer is designed to decom...

Abstract:

Traditional transfer diagnosis models for internal combustion engines show a decrease in generalization ability due to the multisource features aliasing in vibration sign...Show More

Metadata

Abstract:

Traditional transfer diagnosis models for internal combustion engines show a decrease in generalization ability due to the multisource features aliasing in vibration signals and the effect of variable operating conditions. To address this problem, this paper proposes a transfer diagnosis model based on the deep subdomain adaptive network framework. To address feature aliasing, based on minimizing amplitude moment and reconstruction loss, a new adaptive decomposition layer is designed and embedded into the framework to decompose complex signals into single-impact components in time domain. To alleviate the effect of operating conditions, a new constraint for minimizing signal feature variance loss is designed and introduced into the framework’s loss function. This constraint calculates the variance of the sample features of the same fault label under variable operating conditions, aiming to excavate invariant features of operating conditions and complete feature mapping of domain adaptation. Validation with experimental data yields an accuracy of 94.81%.

This paper proposed a transfer diagnosis model based on the deep subdomain adaptive network framework. In the framework, adaptive decomposition layer is designed to decom...

Published in: IEEE Access ( Volume: 12)

Page(s): 62779 - 62792

Date of Publication: 23 April 2024

Electronic ISSN: 2169-3536

DOI: 10.1109/ACCESS.2024.3392768

Funding Agency:

Contents

SECTION I.

Introduction

Reciprocating internal combustion engines are the key power source in fields such as marine vessels, vehicles, and electricity generation, characterized by their compact structure, numerous components, and high failure rates. During normal operation, internal combustion engines’ shell vibration signals exhibit characteristics of multi-source impact signal coupling. For instance, impacts from the opening and closing of the intake and exhaust valves in each operating cycle, combustion impacts within the cylinder, and impacts from piston reversal. The opening and closing impacts of the valves, combustion impacts within the cylinder, and piston reversal impacts directly affect the cylinder head shell. The piston reversal impact is also transmitted to the cylinder head shell through certain signal paths. Therefore, monitoring the vibration signals of the cylinder head shell of the internal combustion engine can obtain quasi-periodic multi-impact coupling signals in the time domain, thus reflecting the working status of different components. There are many studies extracting fault characteristics from the vibration signals of the cylinder head shell of internal combustion engines and constructing diagnostic models [1], [2].

Mechanical faults, such as wear on connecting rod bearings and valves, produce additional impact signals. These signals overlap with the normal impact signals mentioned above in the time and frequency domains, posing challenges for fault feature extraction. Meanwhile, variations in engine speed and load conditions are common. Under variable operating conditions, changes occur in the dynamic behavior of engine moving parts, resulting in corresponding changes in the time–frequency features of vibration signals [3], [4]. The features of vibration signals from the engine casing generally exhibit nonlinear distributions during variable operating conditions. Therefore, the feature aliasing and the effects of variable operating conditions are two problems in internal combustion engine fault diagnosis.

To address the problem of feature aliasing, signal modal decomposition methods (i.e., empirical mode decomposition, variational mode decomposition, and wavelet decomposition) are commonly conducted to obtain multiscale single-modal components containing mechanical operational state information [5], [6]. Then the components are applied to extract dynamics and thermodynamic features for diagnosis [7]. For example, Bi et al. [8] used variational mode decomposition and expectation maximization method to analyze multi-channel vibration signals and extract knowledge features for internal combustion engine state recognition. Further, deep learning techniques, including convolutional neural networks [9], [10], [11], graph attention networks [12] and autoencoders [13] are employed to explore the deep features of internal combustion engine vibration signals. For instance, Liang et al. [10] and Xie et al. [11] constructed internal combustion engine vibration signal feature extraction models based on autoencoders and graph attention networks, respectively, using vibration signals directly as input. However, internal combustion engine faults often manifest as abnormal impacts in time domain signals, with a particularly severe feature aliasing problem [15]. The abovementioned research on feature extraction of internal combustion engines did not consider feature aliasing problem and directly used deep networks for feature extraction, resulting in insufficient correlation between feature extraction results and faults, decreased generalization, and limited application.

To alleviate the effect of variable operating conditions, the transfer learning (TL) method has been proposed to further improve the generalization of diagnostic models [16]. TL aims to adjust existing model parameters to construct models that adapt to the diagnostic needs of new engine units. However, the nonlinear variation of signal features under different operating conditions of internal combustion engines makes it challenging to extract domain-invariant features for operating conditions, decreasing the generalization ability of fault transfer diagnostic models between units. Currently, limited research exists on transfer diagnostic models between different internal combustion engine units under variable operating conditions. But in other mechanical domains, such as bearings, gears, and turbines, domain adaptation (DA) methods have been proposed to reduce the feature distribution difference between the source and target domains [17], [18], [19], [20], [21]. DA reduces feature distribution discrepancy by introducing a distance function into the model’s objective function to drive multidomain feature alignment [22]. Additionally, incorporating a loss constraint related to label prediction results, combined with the aforementioned feature distribution distance function, DA can reduce the feature distribution difference between the source and target domains while enhancing the diversity of distribution results among multiple classes of source domain features, thereby improving model generalization [23]. Lu et al. [24] used a model with maximum mean discrepancy (MMD) to minimize differences between various machine data instances and achieved an accurate transfer diagnosis of typical gearbox-bearing faults. Shen et al. [25] proposed a deep subdomain adaptation network (DSAN) to extract features of multiscale vibration signals using local maximum mean discrepancy (LMMD) loss to reduce the distance between source and target domains and achieve transfer diagnosis of wind turbine system faults. Li et al. [26] introduced an adversarial DA method based on conditional adversarial DA (CDAN), which enforces better intraclass compactness and interclass separability of label-related prediction results to improve model generalization, enabling transfer diagnosis of typical bearing faults under variable conditions. The above studies have shown that constructing feature distribution constraint functions effectively obtains domain-invariant features under variable operating conditions.

In summary, when constructing internal combustion engine fault transfer diagnostic models, challenges arise such as feature aliasing of vibration multisource impact signals, difficulty in feature extraction, and nonlinear distributions in multidomain feature distribution under variable operating conditions. These challenges lead to difficulty extracting domain-invariant features for operating conditions and a decrease in the generalization ability of fault transfer diagnostic models between engine units.

In this study, first, to address the difficulty in feature extraction caused by feature aliasing, a new decomposition layer of multi-impact vibration signals in internal combustion engines is proposed. This layer is based on the periodic multi-impact interval distribution and rapid decay characteristics of impacts in vibration signals. This proposed layer has a signal moment minimization decomposition objective, enabling the decomposition of multisource impact vibration signals into single-impact modalities. Second, to address the problem of signal feature shift under variable operating conditions, which makes it challenging to mine domain-invariant features, a new feature variance loss (FVL) constraint is established. By introducing source domain labels to group source domain samples and calculating the variance of each feature under all operating conditions, this constraint drives the minimization of the variance of feature distribution for different operating condition subsignals with the same fault label, reducing the sensitivity of sample features to operating conditions. Furthermore, the decomposition layer of the impact vibration signals and the FVL constraint are integrated into the signal processing and DA modules of the DSAN framework, respectively. This integration leads to the construction of a DSAN framework for internal combustion engine transfer diagnosis, enabling fault transfer diagnosis between internal combustion engine units.

Our contributions are as follows:

Proposed a new decomposition layer based on signal amplitude moment and reconstruction loss minimization of internal combustion engine vibration signals, and the signal moment minimization decomposition objective. This layer is embedded in DSANs and decomposes multiple impact signals into single-impact modes. It also adopts a unified optimization algorithm, parameter update strategy, and learning rate adjustment method with other modules in the model to improve the model efficiency.
Proposed a new FVL constraint to mine invariant features in the operating condition domain. This constraint introduces source domain labels to group source domain samples and calculates the variance of each feature under all operating conditions, driving the minimization of the variance of feature distribution for different operating condition subsignals with the same fault label to reduce the sensitivity of sample features to operating conditions.
Constructed a new internal combustion engine transfer diagnostic model based on a DSAN framework. This framework incorporates the proposed multi-impact vibration signal decomposition layer and FVL constraint and establishes an algorithm for multiobjective optimization of the model.
Based on experiments conducted on two internal combustion engine test benches, simulating three typical faults under different operating conditions, the performance of the proposed model is validated.

The remaining content of this study is as follows: Chapter 2 presents the proposed decomposition layer, the FVL, and the constructed transfer diagnostic model. Chapter 3 describes the experiments and the data set. Chapter 4 presents the comparative validation results of the proposed method, and Chapter 5 provides the conclusions of our study.

SECTION II.

Proposed Method

This study constructed a DSAN framework for the internal combustion engine transfer diagnosis model with embedded vibration signal impact decomposition, and Figure 1 shows the model’s overall structure. This structure included a preprocessing module for the signal decomposition layer, a feature extraction module, a DA module with the proposed embedded FVL, and a fault diagnosis module.

FIGURE 1.

Framework of the proposed method is based on signal decomposition, feature extraction, domain adaptation, and diagnostic modules.

Show All

A. Decomposition Layer of Multi-Impact Vibration Signals

The decomposition layer aims to separate the impulse waveforms within the vibration signal, forming multiple subsignals containing only individual impulse waveforms. Its decomposition objective can be summarized as minimizing the information loss in reconstructing the source signal, with the subsignals conforming to the morphology of single impulse waveforms [1]. Additionally, for ease of calculation, the signal amplitude and time range need to be transformed to [0,1], and after decomposition, inverse transformation can restore the original feature range.

The design of the decomposition window and the calculation process of window parameters are as follows.

To construct the decomposition window $W$ , we defined the center position coefficient ($w_{c}$ ) and half-width coefficient ($w_{l}$ ). Window $W$ comprises variants of Sigmoid and ReLU functions. Multiplying the signal $S$ by $W$ intercepts the signal, as shown in Eqs. (1)–(2).\begin{align*} & F_{g}\left ({{ w_{c},w_{l};x }}\right ) \\ & =\frac {1}{1+\exp \left ({{ w_{c}-w_{l}-x }}\right )} \tag {1}\\ & W\left ({{ w_{c},w_{l};x }}\right ) \\ & = \mathrm {ReLU}\left ({{ F_{g}\left ({{ w_{c},w_{l};x }}\right )-F_{g}\left ({{ w_{c},-w_{l};x }}\right )-\varepsilon }}\right ), \tag {2}\end{align*} View Source where $F_{g}$ is the sigmoid activation function of the variant. ReLU is the standard activation function. The $x$ represents the time point of the sequence. $\varepsilon $ represents a minimum value, usually taken as $10^{-4}$ .

The formula for calculating ($w_{c}$ ) and ($w_{l}$ ) are shown in Eqs. (3)-(4).\begin{align*} w_{c}& =\text {sigmoid}\left ({{ w_{w_{c}}^{\prime }S }}\right ) \tag {3}\\ w_{l}& =\text {sigmoid}\left ({{ w_{w_{l}}^{\prime }S }}\right ), \tag {4}\end{align*} View Source where $w_{c}$ and $w_{l}$ are generated by deep learning weights $w_{w_{c}}^{\prime }$ and $w_{w_{l}}^{\prime }$ , respectively.

The calculation process of subsignals is shown in Eq. (5) \begin{equation*} s_{k}\left ({{ x }}\right )=W_{k}\left ({{ w_{c_{k}},w_{l_{k}};x }}\right )S\left ({{ x }}\right ), \tag {5}\end{equation*} View Source where $s_{k}\left ({{ x }}\right )$ represents the separately decomposed subsignals. $k$ represents the index of the decomposed subsignals, and $S\left ({{ x }}\right )$ represents the source signal.

The decomposition objectives are as follows.

Considering that the decomposition target needs to minimize the information loss of the reconstructed source signal, the decomposition and reconstruction loss $\delta $ is proposed as an indicator to evaluate the decomposition performance. As shown in in Eq. (6).\begin{equation*} \delta =\frac {1}{K}\left ({{ \sum \limits _{k=1}^{K} {s_{k}\left ({{ x }}\right )-S\left ({{ x }}\right )} }}\right )^{2}, \tag {6}\end{equation*} View Source where $K$ represents the number of decompositions

Considering the morphological characteristics of a single impact waveform, the decomposition layer introduces the concepts of impact amplitude moment and impact time domain center.

The $p$ -th amplitude moment of signal $s\left ({{ x }}\right )$ for a certain moment ($x_{k}$ ) is shown in Eq. (7).\begin{equation*} M_{t}\left ({{ x\vert x_{k},s\left ({{ x }}\right ),p }}\right )=\int _{0}^{+\infty } {\left ({{ x-x_{k} }}\right )^{p}s\left ({{ x }}\right )dx}, \tag {7}\end{equation*} View Source where the $M_{t}$ represents the amplitude moment. When ($x_{k}$ ) is the time center of gravity of the signal $s\left ({{ x }}\right )$ , the amplitude moment reaches its minimum value at point ($x_{k}$ ). In this study, under the condition of $p=1$ , ($x_{k}$ ) represents the time domain center of the signal s in the time domain.

The smaller the reconstruction loss, the less information lost during decomposition. The smaller the amplitude moment $M$ , the closer $x_{k}$ is to the true time domain center of the impulse waveform. The decomposition target is shown in Eq. (8).\begin{equation*} \min _{\left \{{{s_{k}}}\right \},\left ({{t_{k}}}\right \}} \boldsymbol {\mathcal {L}}_{D}=\delta +\beta \sum _{k=1}^{K} M_{t}\left ({{x \mid x_{k}, s_{k}(x), p=1}}\right )_{2}^{2} \tag {8}\end{equation*} View Source where $\boldsymbol {\mathcal {L}}_{D}$ represents the decomposition target, and the $\beta $ represents the adjustment coefficient.

The optimization calculation process for the decomposition number $K$ is as follows. This study sets the decomposition number $K$ to start from two and iteratively calculates upward. When the decomposition target $\boldsymbol {\mathcal {L}}_{D}$ is the smallest, the value of $K$ is the optimum.

The proposed signal decomposition layer is shown in Figure 2.

FIGURE 2.

Schematic diagram of decomposition layer for multi-impact vibration signals.

Show All

Embed the above signal decomposition layer into the DSAN framework. Compared to conventional decomposition methods that are independent of diagnostic networks for signal decomposition, the signal decomposition layer in this study utilizes the common Adam optimization algorithm in deep learning. This maintains the same parameter update strategy and learning rate adjustment method as other modules in the framework, thereby improving the efficiency of the transfer diagnosis model.

B. Feature Variance Loss

In the current study, computing feature variance has been utilized in model regularization loss to drive the model to generate feature distributions closer to both the existing training set and unobserved real-world data, thereby enhancing model generalization [27]. The proposed FVL in this paper further incorporates information on operating conditions and fault labels for sample grouping, aiming to reduce the sensitivity of the model’s feature extraction results to operating conditions. It should be pointed out that FVL loss is applied to the DA module and works together with the distance function of the DA module itself.

The rationale for introducing labels is as follows. Although minimizing the variance of feature dimensions calculated from samples under different operating conditions can decrease the feature variation caused by operating conditions, the prerequisite for minimizing the variance of feature dimensions is sampled with the same fault label, given that the samples required during model training are randomly selected. Otherwise, the reduction in feature variance for samples with different fault labels will result in similar feature values, making fault diagnosis difficult for the model.

Therefore, FVL groups the source domain samples by introducing label information. Furthermore, based on the pre-grouped samples and the fault label information from the source domain, FVL calculates the variance of each feature under all operating conditions, driving the minimization of the variance of feature distribution for different operating condition sub-signals with the same fault label, (i.e., minimizing the sensitivity of features to operating conditions).

Consider a batch of samples with n samples containing T classes of faults. The number of samples for each class of fault is $n_{t}$ , and each sample has M deep learning features, as shown in Eq. (9).\begin{equation*} F=\left \{{{f_{i,j} }}\right \},F\in \mathbb {R}^{\left ({{ \sum \nolimits _{t=1}^{T} n_{t} }}\right )\ast m}, \tag {9}\end{equation*} View Source where F represents the total feature matrix for a batch of samples, the $f_{i,j}$ represents the $j$ -th feature of the $i$ -th sample, $n$ is the total number of samples in a batch, and $n_{t}$ represents the number of samples for the $t$ -th class of fault.

For features belonging to the same fault class, the goal is to minimize feature variance, which drives the model to learn noise features as zero features. Therefore, the variance-based penalty constraint is as shown in Eq. (10).\begin{equation*} \boldsymbol {\mathcal {L}}_{F V L}=\min _{W_{E}} \frac {1}{M T} \sum _{m=1}^{M} \sum _{t=1}^{T} \mathrm {var}\left ({{f_{n_{t-1} \sim n_{t}, m}}}\right ), \tag {10}\end{equation*} View Source where $var$ represents variance calculation, $T$ and $M$ represent the number of fault and feature categories, respectively, $f_{n_{t-1}\sim n_{t},m}$ is the $m$ -th feature of samples from $n_{t-1}\sim n_{t}$ , and $W_{E}$ represents the weights generated in the feature process.

After feature extraction, further evaluation of each feature dimension in the feature map is required. If any feature value in a dimension is zero, that feature dimension will be removed.

The process of DA with FVL is shown in Figure 3.

FIGURE 3.

Schematic diagram of using FVL to reduce the impact of variable operating conditions in the DA process.

Show All

C. Model Framework for Transfer Learning

This study adopts a DSAN framework to achieve transfer fault diagnosis, embedding the proposed signal decomposition layer and FVL constraint. The feature extraction module of the framework consists of the classical VGG structure [28], while the DA module employs LMMD as the distance loss function. For the diagnostic module, a classical softmax classifier is applied.

LMMD aims to drive the alignment of sample features in each subdomain. Usually, the source domain data are divided into subdomains based on class labels, whereas the target domain data is divided into subdomains using the probability distribution predicted by the network. The definition of LMMD is shown in Eqs. (11)-(12). \begin{align*} w_{i}^{c}\mathbf {}& =\frac {y_{ic}}{\sum \nolimits _{\left ({{ x_{j},y_{j} }}\right )\in D} y_{ic}} \tag {11}\\ \boldsymbol {\mathcal {L}}_{LMMD}\mathbf {}& =\frac {1}{T} \sum \nolimits _{c=1}^{T} \\ & \quad \times {\left \|{{ \sum \nolimits _{x_{i}^{s}\in D^{S}} {w_{i}^{sc}\phi \left ({{ x_{i}^{s} }}\right )-} \sum \nolimits _{x_{j}^{t}\in D^{T}} {w_{j}^{tc}\phi \left ({{ x_{j}^{t} }}\right )} }}\right \|_{H}^{2},} \tag {12}\end{align*} View Source where, $ D^{S}$ and $D^{T}$ represent the feature matrix of the source and target domain, respectively, $T$ is the number of classifications, and $\phi $ represents a kernel function, such as a Gaussian kernel.

The softmax classifier is a classical classifier used for multiclass classification, with focal loss ($\boldsymbol {\mathcal {L}}_{\boldsymbol {FL}}$ ) as the diagnostic target for unbalanced data. $\boldsymbol {\mathcal {L}}_{\boldsymbol {FL}}$ is shown in Eq. (13).\begin{equation*} \boldsymbol {\mathcal {L}}_{\boldsymbol {FL}}\left ({{ p_{t,i} }}\right )=-\alpha _{i}\left ({{ 1-p_{t,i} }}\right )^{\gamma }\log \left ({{ p_{t,i} }}\right ), \tag {13}\end{equation*} View Source where $p_{t,i}$ is the cross-entropy loss between the predicted probability of class $i$ and the true label. Factor $\alpha $ is a used to adjust the weight of each category. Factor $\gamma $ is a used to adjust the weight relationship between samples.

The objective function comprises three learning tasks, which can be divided into the preprocessing part ${\mathcal {T}}_{1}$ , the FVL part ${\mathcal {T}}_{2}$ , and the diagnostic part ${\mathcal {T}}_{3}$ , as shown in Eqs. (14)-(16). \begin{align*} {\mathcal {T}}_{1}& =\boldsymbol {\mathcal {L}}_{D} \tag {14}\\ {\mathcal {T}}_{2}& =\boldsymbol {\mathcal {L}}_{FVL} \tag {15}\\ {\mathcal {T}}_{3}& =\lambda \boldsymbol {\mathcal {L}}_{LMMD}\mathbf {+}\boldsymbol {\mathcal {L}}_{FL}, \tag {16}\end{align*} View Source where $\lambda $ is the balancing coefficient.

Algorithm 1. Proposed Transfer Learning ModelInput: source data

Input: source data $S_S$ , source label $Y_s$ , target data $S_t$ , task ${\mathcal{T}}_1$ and its threshold $\psi_1$ , limit of epochs $\mathrm{L}_{\mathrm{ep} 1}$ , task $\mathcal{T}_3$ and its threshold $\psi_3$ , limit of epochs $\mathrm{L}_{\mathrm{ep} 2}$

Output: Predict the results of target data $\hat{y}_t$

Epoch=0

While $\mathcal{T}_1 > \psi_1 or Epoch < \mathrm{L}_{\mathrm{ep1}} do$

Based on $S_s, S_t$ , train decomposition layer with task $\mathcal{T}_1=\mathcal{L}_D$

Epoch = Epoch +1

end while

Obtain the subsignals set $S_s, S_t$ of $S_S, S_T$

Based on $S_s$ and $y_s$ , group $S_s$ as $\mathrm{G}\left(S_s\right)$

Epoch = 0

While $\mathcal{T}_3 > \psi_3 or Epoch < L_{\text {ep2 }} do$

Based on the transfer learning model (Model), calculate $F^S$ of $\mathrm{G}\left(S_s\right)$ with minimize task $\mathcal{T}_2=\mathcal{L}_{F V L}$

Based on Model, source label $Y_s$ , calculate $F^S, F^T$ , $\hat{Y}^T$ of $S_s, S_t$ , with minimize $\mathcal{L}_{L M M D}$

Based on $F^S$ and source label $Y_s$ , conduct classification diagnosis with minimize $\mathcal{L}_{F L}$

Task $\mathcal{T}_3=\lambda \mathcal{L}_{L M M D}+\mathcal{L}_{F L}$

Epoch = Epoch +1

end while

Based on the trained modelandtarget data $S_t$ , predict the results of target data $\hat{y}_t$

For these three learning tasks, a multiobjective stepwise training method was designed. First, the source and target domain data were trained using a multi-impact vibration signal decomposition layer to achieve signal decomposition based on the minimization objective ${\mathcal {T}}_{1}$ . After training, the decomposed source domain signals were further grouped and trained, primarily driving the model parameters to acquire domain-invariant features for operating conditions by minimizing the ${\mathcal {T}}_{2}$ . Subsequently, DA and diagnosis were performed on both the source and target domain data, (i.e., adjusting the model parameters by minimizing objective ${\mathcal {T}}_{3}$ ). The model then alternately underwent training to minimize objectives ${\mathcal {T}}_{2}$ and ${\mathcal {T}}_{3}$ until objective ${\mathcal {T}}_{3}$ met the requirements.

Furthermore, the training approach for the model is defined by Algorithm 1 below, and the specific model structure is illustrated in Figure 4.

FIGURE 4.

Specific framework of the proposed method based on signal decomposition layer, feature extraction, domain adaptation, and diagnostic modules.

Show All

SECTION III.

Data and Model

A. Data

To validate the transfer diagnostic effectiveness of the proposed method, experiments were conducted involving combustion engine misfires, abnormal valve clearance, and piston collision faults. The misfire fault was achieved by manually cutting off combustion injection into the cylinder, the abnormal valve clearance was introduced by manually adjusting the exhaust valve clearance to increase it by +0.3 mm, and the piston collision fault was created by adding soft copper sheets to the piston head. The source domain data were obtained from a 12-cylinder V-type direct-injection combustion engine, referred to as Group A, whereas the target domain data were obtained from a 6-cylinder Inline-type direct-injection combustion engine, referred to as Group B. The fault experiments on Groups A and B are shown in Figure 5. The sample collection results are shown in Table 1.

FIGURE 5.

Structure of internal combustion engines (unit A and unit B), layout of engine measuring points and installation locations of engine sensors.

Show All

TABLE 1 Sample collection results for the two internal combustion engines.

In Figure 5, the green symbol represents the vibration acceleration sensor installed on the cylinder head to obtain vibration signals. The number and location of sensors may vary depending on the unit structure and installation location conditions. Unit A has installed 12 sensors of this type parallel to the direction of the piston application on the cylinder head; Unit B installed 6 sensors of this type on the cylinder head perpendicular to the direction of the piston operation. The orange symbol represents the key phase sensor, installed on the flywheel to intercept the vibration signal of one working cycle; the Purple symbol represents the instantaneous speed sensor, used to obtain the operating condition label. The key phase sensor (parallel to the flywheel axis) and instantaneous speed sensor (perpendicular to the flywheel axis) of both units are installed on the flywheel fluted-disc structure. The key phase sensor is used to capture vibration signals of a complete working cycle, while the instantaneous speed sensor is used to obtain operating condition labels. The faulty cylinders of Units A (faulty cylinder A3) and B (faulty cylinder 2#) are used to collect vibration signal samples. These samples are applied as the training and validation dataset for the proposed method.

To better collect high-frequency vibration information caused by mechanical faults, the sampling frequency was set to 25.6 kHz during our experiment (the sensor manual recommends the highest sampling rate). The sample signals are shown in Figure 6.

FIGURE 6.

Schematic diagram of vibration signals for normal, misfire, valve, and collision faults in Unit A and Unit B.

Show All

As shown in Table 1 and Figure 6, the experiments for Group A were distributed between 1000–1500 rpm and 0–400 Nm, while Group B covered the range of 1000–1800 rpm and 0–600 Nm, complying with variable-condition criteria. Observing the vibration signal samples from both groups, the signals from Group A are clearer, with distinct impacts from valve seating and ignition. However, the signals from Group B exhibited numerous unidentified impacts. Our sensor has a sampling frequency of 25.6 kHz and collected data from 1000–1800 rpm. Given that a complete working cycle of an internal combustion engine requires the main shaft to rotate twice, the number of sampling sequence points for a sample is 1700–3072. To minimize information loss, a sampling number of 3072 is taken as the interpolation length of the overall signal.

B. Model

The model constructed here includes four modules or methods: the signal decomposition layer for signal preprocessing, the modified VGG module for feature extraction and diagnosis, the LMMD method for DA, and the FVL method for feature selection.

The parameters of the modified VGG module are shown in Table 2. The model structure and parameters refer to the commonly used VGG structure [28]. It should be noted that for calculation convenience, the length of the input signal in this article was uniformly interpolated to 3072, and the training method used was train-on-batch, which randomly extracts a batch of samples from the dataset for one-step training. Therefore, the $K$ value depends on the maximum number of decompositions of the input signal in a dataset after being decomposed by the signal decomposition layer, and the remaining signals are zeroed to satisfy the same shape of a batch input signal (3072, K).For calculation convenience, the length of the input signal in this article was uniformly interpolated to 3,072, and the training method used was train-on-batch, which randomly extracts a batch of samples from the dataset for one-step training. Therefore, the $K$ value depends on the maximum number of decompositions of the input signal in a dataset after being decomposed by the signal decomposition layer, and the remaining signals are zeroed to satisfy the same shape of a batch input signal (3072, K).

TABLE 2 Structure of VGG in the transfer diagnosis model.

It should be pointed out that the parameter settings in Table 3 are based on the following criteria.

TABLE 3 Each module and its parameter values in the transfer diagnosis model.

Referring to the [1], the parameter “$p$ ” was set to “1” in the signal decomposition layer, which has been validated by actual vibration signal cases of internal combustion engines. Referring to the [29], $\gamma $ was set to “2” to focus the model more on difficult-to-classify samples. Since the dataset contains four states: normal, misfire, valve fault, and collision, the parameter “T” was set to “4”. The balancing coefficient $\lambda $ was set to balance the numerical magnitudes of various loss terms in the total loss function of the model. Under the conditions of the dataset used in this study, setting the parameter “$\lambda $ ” to “0.1” can ensure that the specific values of $\boldsymbol {\mathcal {L}}_{\boldsymbol {LMMD}}$ and $\boldsymbol {\mathcal {L}}_{\boldsymbol {FL}}$ differ within one order of magnitude during the training process.

In addition, the settings for the number of signal decomposition and transfer training epochs thresholds $\mathrm {L}_{\mathrm {ep1}}, \mathrm {L}_{\mathrm {ep2}}$ , and the model signal decomposition and transfer task loss thresholds $\mathrm {\psi }_{1},\mathrm {\psi }_{3}$ , were determined based on multiple training iterations of the model. These settings were based on the convergence range of the model’s final loss (loss fluctuating around a certain value) and the number of training steps required to reach this convergence threshold, selecting a set of parameter values suitable for the dataset used in this study.

SECTION IV.

Results

A. Effect of the Decomposition Layer of Multi-Impact Vibration Signals

This study measured the decomposition effectiveness using the decomposition metrics. The reconstruction energy loss (EL) and orthogonal loss (OL) coefficient between the decomposed subsignals were provided as a metric for assessing the decomposition method, as shown in Eqs. (17)–(18).\begin{align*} \mathrm {EL}& =\frac {\left |{{ \mathrm {RMS}\left ({{ \sum \nolimits _{i=1}^{K} {s_{i}\left ({{ t }}\right )} }}\right )\mathrm {-RMS}\left ({{ S\left ({{ t }}\right ) }}\right ) }}\right |}{\mathrm {RMS}\left ({{ S\left ({{ t }}\right ) }}\right )}\ast 100\% \tag {17}\\ \cos _{i, j}& =\frac {\left |{{\left \langle {{ s_{i}(t), s_{j}(t)}}\right \rangle }}\right |}{\left \|{{s_{i}(t)}}\right \|_{2}\left \|{{s_{j}(t)}}\right \|_{2}}, s_{i}(t), s_{j}(t) \in \left \{{{s_{k}(t)}}\right \}, \\ \mathrm {OL}& =\frac {1}{K(K-1)}\left ({{\sum _{i=1}^{K} \sum _{j=1}^{K} \cos _{i, j}-K}}\right ), \tag {18}\end{align*} View Source where EL represents the reconstruction energy loss between the decomposed and original signals, RMS represents the root-mean-square value, and OL reflects the orthogonal loss between each subsignal. The smaller OL represents better orthogonality.

Taking a valve fault sample of the source unit and a cylinder collision fault sample of the target unit as examples, the decomposition results are shown in Figures 7 and 8, and the indicators are shown in Table 4.

FIGURE 7.

Decomposition results of a cylinder valve fault sample of the source unit.

Show All

FIGURE 8.

Decomposition results of a cylinder collision fault sample of the target unit.

Show All

TABLE 4 Average decomposition metrics (EL and ol) of unit A and unit b.

Table 4 shows that the proposed method achieves an average reconstruction EL of 1.54% and an OL of 0.023 on Unit A and an average reconstruction EL of 2.72% and an OL of 0.028 on Unit B.

B. Effect of the FVL

The proposed FVL drives the minimization of the same feature variances under the same fault conditions reflected in the fault-related feature maps. Specifically, it reduces the number of valid features and increases the number of zero features. Zero features can be compressed (squeezed) and omitted in the DA, which helps reduce the dimensionality of DA calculations.

The comparative process is as follows: Using the VGG model, the $\boldsymbol {\mathcal {L}}_{\boldsymbol {FVL}}$ loss was introduced in the feature layers. Diagnostic classification and visualization of feature maps were performed using the decomposed signal set from Group A. The proportion of zero features to the total number of features was then calculated to verify the increase in zero features and the decrease in valid features introduced by $\boldsymbol {\mathcal {L}}_{\boldsymbol {FVL}}$ . Figure 9 depicts the visualized results after taking the absolute value of the feature maps in one training process.

FIGURE 9.

Introduction of FVL increased the proportion of zero features and decreased average feature variance under same fault-operating condition label in the signal feature map.

Show All

Based on the decomposed signal set from Group A, Figure 9 and Table 5 reveal that introducing FVL increased the proportion of zero features from 62.50% to 84.38% and decreased average feature variance from 0.026 to 0.011 under the same fault-operating condition.

TABLE 5 Effect of FVL on zero features ratio and feature variance.

C. Ablation Experiments Using the Proposed Method

The previous section verified FVL role in feature selection and in limiting the number of features. We conducted ablation experiments to further validate the effectiveness of the remaining parts of the constructed diagnostic model. As shown in Figure 4, after all sample signals are decomposed using the signal decomposition layer, the structure of each part of the proposed model can be simplified, as shown in Figure 10.

FIGURE 10.

Structure of the proposed method using the decomposition layer for signal decomposition, VGG as the feature extraction module, FVL for feature optimization, and LMMD for domain adaptation.

Show All

The VGG model was used as the feature extraction module, with $\boldsymbol {\mathcal {L}}_{\boldsymbol {LMMD}}$ employed as the target loss function and FVL introduced to enhance DA. Softmax was used as the classifier and label predictor.

According to Table 6, retaining only VGG for training and transfer diagnosis resulted in a diagnostic accuracy of only 24.98% and an F1 score of only 24.19% for the target internal combustion unit, indicating poor diagnostic performance. However, retaining the signal decomposition layer + VGG for training and transfer diagnosis resulted in diagnostic accuracy of only 48.95% and an F1 score of only 32.41% for the target internal combustion unit, indicating poor diagnostic performance. After introducing only LMMD in VGG, the diagnostic results of the target domain significantly improved, achieving an accuracy of 93.44% and an F1 score of 93.17%. Furthermore, after introducing LMMD, VGG further introduced FVL to optimize the features of the input MMD, achieving an accuracy of 94.81% and an F1 score of 94.42%, improving the diagnostic results.

TABLE 6 Ablation experiments results of proposed transfer diagnosis model.

D. Comparison with Other Methods

To validate the effectiveness of the proposed model in both diagnosis and DA—to achieve effective transfer fault diagnosis for combustion engines under varying operating conditions—we selected the samples from Table 1, with 80% used for training and 20% used for testing in both source and target domains. Furthermore, the following typical methods were employed for comparison:

VGG: VGG is a classic convolutional deep learning model. This study used the VGG11 model, which was retrained using the source domain training dataset and transferred to the target domain training dataset using a freeze-tune approach.
DAN: For comparison, this study used the VGG11 model as the feature extraction module, and MMD was added to the VGG11 model feature layers for DA, thereby constructing the DAN model.
DANN: This model includes a feature extractor that maps data to a specific feature space, a label predictor that classifies data from the source domain, and a domain discriminator that classifies data in the feature space. This study used the VGG11 model as the feature extraction module and a gradient reversal layer to build the domain discriminator.
CDAN: CDAN employs two novel conditional adjustment strategies—multilinear and entropy—to enhance DA. For comparison, this study used the VGG11 model as the feature extraction module for CDAN.
DAN + FVL: This study used the VGG11 model as the feature extraction module. FVL was introduced to enhance the DA process with the MMD function. Softmax was used as the classifier and label predictor.
Proposed method: This study used the VGG11 model as the feature extraction module. FVL was introduced to enhance the DA process with the LMMD function. Softmax was used as the classifier and label predictor.

Considering the data sample imbalance between the source (Unit A) and target (Unit B) domains, accuracy and F1-score (Macroaverage) were selected as the evaluation metrics to further demonstrate the performance of the aforementioned methods. The evaluation was conducted over 10 iterations to comprehensively assess the diagnostic performance of these methods, as shown in Table 7 and Figures 11 and 12.

TABLE 7 Results of the various methods used in the present study.

FIGURE 11.

Comparison of the accuracy obtained from VGG, DAN, DANN, CDAN, DAN+FVL, and the proposed method.

Show All

FIGURE 12.

Comparison of the F1 scores obtained from VGG, DAN, DANN, CDAN, DAN+FVL, and the proposed method.

Show All

Table 7 and Figures 11 and 12 show that all diagnostic models constructed by these methods could recognize the source domain A-group test data and the training time for all methods is acceptable. However, the effectiveness of the VGG, DAN, DANN, CDAN, and DAN + FVL methods for the target domain B-group dropped significantly (below 80%), with only the proposed method maintaining an accuracy of 94.81% and an F1-score of 94.42%.

To further analyze the reasons for this phenomenon, we extracted the results from a single training run for each method and displayed their structural differences. Specific diagnostic results based on confusion matrices, and feature visualization based on t-distributed stochastic neighbor embedding (t-SNE) are presented in Figure 13 and Figure 14.

FIGURE 13.

Confusion matrix diagram of the diagnostic results for normal(T1), misfire(T2), valve-fault(T3), and collision(T4) faults in the target domain internal combustion engine:(a) Confusion matrix of VGG, (b) Confusion matrix of DAN, (c) Confusion matrix of DANN, (d) Confusion matrix of CDAN, (e) Confusion matrix of DAN+FVL, (f) Confusion matrix of proposed method.

Show All

FIGURE 14.

TSNE visualization results of source and target domain data features in the feature space: (a) Visualization results of VGG, (b) Visualization results of DAN, (c) Visualization results of DANN, (d) Visualization results of CDAN, (e) Visualization results of DAN+FVL, (f) Visualization results of proposed method.

Show All

From the confusion matrix perspective, the sharp drop in fault recognition performance can be attributed to the difficulty in clearly distinguishing between normal and misfire samples in the target domain group. This phenomenon is exhibited to varying degrees in VGG, DAN, DANN, CDAN, DAN + FVL, and the proposed method. The VGG and DANN methods further undermine the overall fault recognition performance due to their inability to identify knocking. Consequently, the proposed method exhibits the strongest recognition performance.

From a feature visualization clustering perspective, both VGG and DANN show a lack of overlap between the source domain and target domain data in the feature space. The proposed method has a better overlap compared to any other method in this paper. In summary, the above findings validate the effectiveness of the proposed method in transfer diagnosis.

SECTION V.

Conclusion

This study proposes a new model for transfer diagnosis of internal combustion engines based on a DSAN framework, achieving transfer diagnosis between different internal combustion engine units under variable operating conditions.

First, a new multi-impact vibration signal decomposition layer is designed to minimize the signal moment to decompose multiple impact signals into single-impact modes, thereby alleviating the problem of feature aliasing in internal combustion engine vibration signals and extracting signal features rich in equipment operating state information. Second, the FVL constraint is proposed to calculate the feature variance under all operating conditions. This drives the minimization of the feature variance distribution of different operating condition subsignal features with the same fault label, reducing the sensitivity of sample features to operating conditions and exploring domain-invariant features. Third, the proposed vibration signal decomposition layer and domain FVL constraint are embedded based on the DSAN framework to construct the transfer diagnosis model. Finally, four fault experiments were conducted on the V12 high-power internal combustion engine test bench, and the same experiments were replicated on another V6 high-power internal combustion engine test bench, serving as the target domain for transfer. The experiments validated that the proposed model achieved an accuracy of 94.81% and an F1 score of 94.42% in the target domain under variable operating conditions, confirming the transfer diagnostic performance of the proposed method.

However, our proposed method has limitations. For example, it can only identify fault types existing in the source domain, which must cover those found in the target data to achieve better transfer results in the target domain. To address this issue, current research is exploring methods such as assigning multiple sublabel dictionaries to samples capable of expressing their features to describe unknown faults or using adversarial learning methods to identify such faults. These are directions for future research.

ACKNOWLEDGMENT

Declaration of competing interest:

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References is not available for this document.

Transfer Diagnosis Model of Internal Combustion Engine With Embedded Vibration Signal Impact Decomposition

Abstract:

Metadata

Abstract:

Funding Agency:

Introduction

Proposed Method