Introduction
Reciprocating internal combustion engines are the key power source in fields such as marine vessels, vehicles, and electricity generation, characterized by their compact structure, numerous components, and high failure rates. During normal operation, internal combustion engines’ shell vibration signals exhibit characteristics of multi-source impact signal coupling. For instance, impacts from the opening and closing of the intake and exhaust valves in each operating cycle, combustion impacts within the cylinder, and impacts from piston reversal. The opening and closing impacts of the valves, combustion impacts within the cylinder, and piston reversal impacts directly affect the cylinder head shell. The piston reversal impact is also transmitted to the cylinder head shell through certain signal paths. Therefore, monitoring the vibration signals of the cylinder head shell of the internal combustion engine can obtain quasi-periodic multi-impact coupling signals in the time domain, thus reflecting the working status of different components. There are many studies extracting fault characteristics from the vibration signals of the cylinder head shell of internal combustion engines and constructing diagnostic models [1], [2].
Mechanical faults, such as wear on connecting rod bearings and valves, produce additional impact signals. These signals overlap with the normal impact signals mentioned above in the time and frequency domains, posing challenges for fault feature extraction. Meanwhile, variations in engine speed and load conditions are common. Under variable operating conditions, changes occur in the dynamic behavior of engine moving parts, resulting in corresponding changes in the time–frequency features of vibration signals [3], [4]. The features of vibration signals from the engine casing generally exhibit nonlinear distributions during variable operating conditions. Therefore, the feature aliasing and the effects of variable operating conditions are two problems in internal combustion engine fault diagnosis.
To address the problem of feature aliasing, signal modal decomposition methods (i.e., empirical mode decomposition, variational mode decomposition, and wavelet decomposition) are commonly conducted to obtain multiscale single-modal components containing mechanical operational state information [5], [6]. Then the components are applied to extract dynamics and thermodynamic features for diagnosis [7]. For example, Bi et al. [8] used variational mode decomposition and expectation maximization method to analyze multi-channel vibration signals and extract knowledge features for internal combustion engine state recognition. Further, deep learning techniques, including convolutional neural networks [9], [10], [11], graph attention networks [12] and autoencoders [13] are employed to explore the deep features of internal combustion engine vibration signals. For instance, Liang et al. [10] and Xie et al. [11] constructed internal combustion engine vibration signal feature extraction models based on autoencoders and graph attention networks, respectively, using vibration signals directly as input. However, internal combustion engine faults often manifest as abnormal impacts in time domain signals, with a particularly severe feature aliasing problem [15]. The abovementioned research on feature extraction of internal combustion engines did not consider feature aliasing problem and directly used deep networks for feature extraction, resulting in insufficient correlation between feature extraction results and faults, decreased generalization, and limited application.
To alleviate the effect of variable operating conditions, the transfer learning (TL) method has been proposed to further improve the generalization of diagnostic models [16]. TL aims to adjust existing model parameters to construct models that adapt to the diagnostic needs of new engine units. However, the nonlinear variation of signal features under different operating conditions of internal combustion engines makes it challenging to extract domain-invariant features for operating conditions, decreasing the generalization ability of fault transfer diagnostic models between units. Currently, limited research exists on transfer diagnostic models between different internal combustion engine units under variable operating conditions. But in other mechanical domains, such as bearings, gears, and turbines, domain adaptation (DA) methods have been proposed to reduce the feature distribution difference between the source and target domains [17], [18], [19], [20], [21]. DA reduces feature distribution discrepancy by introducing a distance function into the model’s objective function to drive multidomain feature alignment [22]. Additionally, incorporating a loss constraint related to label prediction results, combined with the aforementioned feature distribution distance function, DA can reduce the feature distribution difference between the source and target domains while enhancing the diversity of distribution results among multiple classes of source domain features, thereby improving model generalization [23]. Lu et al. [24] used a model with maximum mean discrepancy (MMD) to minimize differences between various machine data instances and achieved an accurate transfer diagnosis of typical gearbox-bearing faults. Shen et al. [25] proposed a deep subdomain adaptation network (DSAN) to extract features of multiscale vibration signals using local maximum mean discrepancy (LMMD) loss to reduce the distance between source and target domains and achieve transfer diagnosis of wind turbine system faults. Li et al. [26] introduced an adversarial DA method based on conditional adversarial DA (CDAN), which enforces better intraclass compactness and interclass separability of label-related prediction results to improve model generalization, enabling transfer diagnosis of typical bearing faults under variable conditions. The above studies have shown that constructing feature distribution constraint functions effectively obtains domain-invariant features under variable operating conditions.
In summary, when constructing internal combustion engine fault transfer diagnostic models, challenges arise such as feature aliasing of vibration multisource impact signals, difficulty in feature extraction, and nonlinear distributions in multidomain feature distribution under variable operating conditions. These challenges lead to difficulty extracting domain-invariant features for operating conditions and a decrease in the generalization ability of fault transfer diagnostic models between engine units.
In this study, first, to address the difficulty in feature extraction caused by feature aliasing, a new decomposition layer of multi-impact vibration signals in internal combustion engines is proposed. This layer is based on the periodic multi-impact interval distribution and rapid decay characteristics of impacts in vibration signals. This proposed layer has a signal moment minimization decomposition objective, enabling the decomposition of multisource impact vibration signals into single-impact modalities. Second, to address the problem of signal feature shift under variable operating conditions, which makes it challenging to mine domain-invariant features, a new feature variance loss (FVL) constraint is established. By introducing source domain labels to group source domain samples and calculating the variance of each feature under all operating conditions, this constraint drives the minimization of the variance of feature distribution for different operating condition subsignals with the same fault label, reducing the sensitivity of sample features to operating conditions. Furthermore, the decomposition layer of the impact vibration signals and the FVL constraint are integrated into the signal processing and DA modules of the DSAN framework, respectively. This integration leads to the construction of a DSAN framework for internal combustion engine transfer diagnosis, enabling fault transfer diagnosis between internal combustion engine units.
Our contributions are as follows:
Proposed a new decomposition layer based on signal amplitude moment and reconstruction loss minimization of internal combustion engine vibration signals, and the signal moment minimization decomposition objective. This layer is embedded in DSANs and decomposes multiple impact signals into single-impact modes. It also adopts a unified optimization algorithm, parameter update strategy, and learning rate adjustment method with other modules in the model to improve the model efficiency.
Proposed a new FVL constraint to mine invariant features in the operating condition domain. This constraint introduces source domain labels to group source domain samples and calculates the variance of each feature under all operating conditions, driving the minimization of the variance of feature distribution for different operating condition subsignals with the same fault label to reduce the sensitivity of sample features to operating conditions.
Constructed a new internal combustion engine transfer diagnostic model based on a DSAN framework. This framework incorporates the proposed multi-impact vibration signal decomposition layer and FVL constraint and establishes an algorithm for multiobjective optimization of the model.
Based on experiments conducted on two internal combustion engine test benches, simulating three typical faults under different operating conditions, the performance of the proposed model is validated.
The remaining content of this study is as follows: Chapter 2 presents the proposed decomposition layer, the FVL, and the constructed transfer diagnostic model. Chapter 3 describes the experiments and the data set. Chapter 4 presents the comparative validation results of the proposed method, and Chapter 5 provides the conclusions of our study.
Proposed Method
This study constructed a DSAN framework for the internal combustion engine transfer diagnosis model with embedded vibration signal impact decomposition, and Figure 1 shows the model’s overall structure. This structure included a preprocessing module for the signal decomposition layer, a feature extraction module, a DA module with the proposed embedded FVL, and a fault diagnosis module.
Framework of the proposed method is based on signal decomposition, feature extraction, domain adaptation, and diagnostic modules.
A. Decomposition Layer of Multi-Impact Vibration Signals
The decomposition layer aims to separate the impulse waveforms within the vibration signal, forming multiple subsignals containing only individual impulse waveforms. Its decomposition objective can be summarized as minimizing the information loss in reconstructing the source signal, with the subsignals conforming to the morphology of single impulse waveforms [1]. Additionally, for ease of calculation, the signal amplitude and time range need to be transformed to [0,1], and after decomposition, inverse transformation can restore the original feature range.
The design of the decomposition window and the calculation process of window parameters are as follows.
To construct the decomposition window \begin{align*} & F_{g}\left ({{ w_{c},w_{l};x }}\right ) \\ & =\frac {1}{1+\exp \left ({{ w_{c}-w_{l}-x }}\right )} \tag {1}\\ & W\left ({{ w_{c},w_{l};x }}\right ) \\ & = \mathrm {ReLU}\left ({{ F_{g}\left ({{ w_{c},w_{l};x }}\right )-F_{g}\left ({{ w_{c},-w_{l};x }}\right )-\varepsilon }}\right ), \tag {2}\end{align*}
The formula for calculating (\begin{align*} w_{c}& =\text {sigmoid}\left ({{ w_{w_{c}}^{\prime }S }}\right ) \tag {3}\\ w_{l}& =\text {sigmoid}\left ({{ w_{w_{l}}^{\prime }S }}\right ), \tag {4}\end{align*}
The calculation process of subsignals is shown in Eq. (5) \begin{equation*} s_{k}\left ({{ x }}\right )=W_{k}\left ({{ w_{c_{k}},w_{l_{k}};x }}\right )S\left ({{ x }}\right ), \tag {5}\end{equation*}
The decomposition objectives are as follows.
Considering that the decomposition target needs to minimize the information loss of the reconstructed source signal, the decomposition and reconstruction loss \begin{equation*} \delta =\frac {1}{K}\left ({{ \sum \limits _{k=1}^{K} {s_{k}\left ({{ x }}\right )-S\left ({{ x }}\right )} }}\right )^{2}, \tag {6}\end{equation*}
Considering the morphological characteristics of a single impact waveform, the decomposition layer introduces the concepts of impact amplitude moment and impact time domain center.
The \begin{equation*} M_{t}\left ({{ x\vert x_{k},s\left ({{ x }}\right ),p }}\right )=\int _{0}^{+\infty } {\left ({{ x-x_{k} }}\right )^{p}s\left ({{ x }}\right )dx}, \tag {7}\end{equation*}
The smaller the reconstruction loss, the less information lost during decomposition. The smaller the amplitude moment \begin{equation*} \min _{\left \{{{s_{k}}}\right \},\left ({{t_{k}}}\right \}} \boldsymbol {\mathcal {L}}_{D}=\delta +\beta \sum _{k=1}^{K} M_{t}\left ({{x \mid x_{k}, s_{k}(x), p=1}}\right )_{2}^{2} \tag {8}\end{equation*}
The optimization calculation process for the decomposition number
The proposed signal decomposition layer is shown in Figure 2.
Embed the above signal decomposition layer into the DSAN framework. Compared to conventional decomposition methods that are independent of diagnostic networks for signal decomposition, the signal decomposition layer in this study utilizes the common Adam optimization algorithm in deep learning. This maintains the same parameter update strategy and learning rate adjustment method as other modules in the framework, thereby improving the efficiency of the transfer diagnosis model.
B. Feature Variance Loss
In the current study, computing feature variance has been utilized in model regularization loss to drive the model to generate feature distributions closer to both the existing training set and unobserved real-world data, thereby enhancing model generalization [27]. The proposed FVL in this paper further incorporates information on operating conditions and fault labels for sample grouping, aiming to reduce the sensitivity of the model’s feature extraction results to operating conditions. It should be pointed out that FVL loss is applied to the DA module and works together with the distance function of the DA module itself.
The rationale for introducing labels is as follows. Although minimizing the variance of feature dimensions calculated from samples under different operating conditions can decrease the feature variation caused by operating conditions, the prerequisite for minimizing the variance of feature dimensions is sampled with the same fault label, given that the samples required during model training are randomly selected. Otherwise, the reduction in feature variance for samples with different fault labels will result in similar feature values, making fault diagnosis difficult for the model.
Therefore, FVL groups the source domain samples by introducing label information. Furthermore, based on the pre-grouped samples and the fault label information from the source domain, FVL calculates the variance of each feature under all operating conditions, driving the minimization of the variance of feature distribution for different operating condition sub-signals with the same fault label, (i.e., minimizing the sensitivity of features to operating conditions).
Consider a batch of samples with n samples containing T classes of faults. The number of samples for each class of fault is \begin{equation*} F=\left \{{{f_{i,j} }}\right \},F\in \mathbb {R}^{\left ({{ \sum \nolimits _{t=1}^{T} n_{t} }}\right )\ast m}, \tag {9}\end{equation*}
For features belonging to the same fault class, the goal is to minimize feature variance, which drives the model to learn noise features as zero features. Therefore, the variance-based penalty constraint is as shown in Eq. (10).\begin{equation*} \boldsymbol {\mathcal {L}}_{F V L}=\min _{W_{E}} \frac {1}{M T} \sum _{m=1}^{M} \sum _{t=1}^{T} \mathrm {var}\left ({{f_{n_{t-1} \sim n_{t}, m}}}\right ), \tag {10}\end{equation*}
After feature extraction, further evaluation of each feature dimension in the feature map is required. If any feature value in a dimension is zero, that feature dimension will be removed.
The process of DA with FVL is shown in Figure 3.
Schematic diagram of using FVL to reduce the impact of variable operating conditions in the DA process.
C. Model Framework for Transfer Learning
This study adopts a DSAN framework to achieve transfer fault diagnosis, embedding the proposed signal decomposition layer and FVL constraint. The feature extraction module of the framework consists of the classical VGG structure [28], while the DA module employs LMMD as the distance loss function. For the diagnostic module, a classical softmax classifier is applied.
LMMD aims to drive the alignment of sample features in each subdomain. Usually, the source domain data are divided into subdomains based on class labels, whereas the target domain data is divided into subdomains using the probability distribution predicted by the network. The definition of LMMD is shown in Eqs. (11)-(12). \begin{align*} w_{i}^{c}\mathbf {}& =\frac {y_{ic}}{\sum \nolimits _{\left ({{ x_{j},y_{j} }}\right )\in D} y_{ic}} \tag {11}\\ \boldsymbol {\mathcal {L}}_{LMMD}\mathbf {}& =\frac {1}{T} \sum \nolimits _{c=1}^{T} \\ & \quad \times {\left \|{{ \sum \nolimits _{x_{i}^{s}\in D^{S}} {w_{i}^{sc}\phi \left ({{ x_{i}^{s} }}\right )-} \sum \nolimits _{x_{j}^{t}\in D^{T}} {w_{j}^{tc}\phi \left ({{ x_{j}^{t} }}\right )} }}\right \|_{H}^{2},} \tag {12}\end{align*}
The softmax classifier is a classical classifier used for multiclass classification, with focal loss (\begin{equation*} \boldsymbol {\mathcal {L}}_{\boldsymbol {FL}}\left ({{ p_{t,i} }}\right )=-\alpha _{i}\left ({{ 1-p_{t,i} }}\right )^{\gamma }\log \left ({{ p_{t,i} }}\right ), \tag {13}\end{equation*}
The objective function comprises three learning tasks, which can be divided into the preprocessing part \begin{align*} {\mathcal {T}}_{1}& =\boldsymbol {\mathcal {L}}_{D} \tag {14}\\ {\mathcal {T}}_{2}& =\boldsymbol {\mathcal {L}}_{FVL} \tag {15}\\ {\mathcal {T}}_{3}& =\lambda \boldsymbol {\mathcal {L}}_{LMMD}\mathbf {+}\boldsymbol {\mathcal {L}}_{FL}, \tag {16}\end{align*}
Algorithm 1. Proposed Transfer Learning ModelInput: source data
Input: source data
Output: Predict the results of target data
Epoch=0
While
Based on
Epoch = Epoch +1
end while
Obtain the subsignals set
Based on
Epoch = 0
While
Based on the transfer learning model (Model), calculate
Based on Model, source label
Based on
Task
Epoch = Epoch +1
end while
Based on the trained modelandtarget data
For these three learning tasks, a multiobjective stepwise training method was designed. First, the source and target domain data were trained using a multi-impact vibration signal decomposition layer to achieve signal decomposition based on the minimization objective
Furthermore, the training approach for the model is defined by Algorithm 1 below, and the specific model structure is illustrated in Figure 4.
Specific framework of the proposed method based on signal decomposition layer, feature extraction, domain adaptation, and diagnostic modules.
Data and Model
A. Data
To validate the transfer diagnostic effectiveness of the proposed method, experiments were conducted involving combustion engine misfires, abnormal valve clearance, and piston collision faults. The misfire fault was achieved by manually cutting off combustion injection into the cylinder, the abnormal valve clearance was introduced by manually adjusting the exhaust valve clearance to increase it by +0.3 mm, and the piston collision fault was created by adding soft copper sheets to the piston head. The source domain data were obtained from a 12-cylinder V-type direct-injection combustion engine, referred to as Group A, whereas the target domain data were obtained from a 6-cylinder Inline-type direct-injection combustion engine, referred to as Group B. The fault experiments on Groups A and B are shown in Figure 5. The sample collection results are shown in Table 1.
Structure of internal combustion engines (unit A and unit B), layout of engine measuring points and installation locations of engine sensors.
In Figure 5, the green symbol represents the vibration acceleration sensor installed on the cylinder head to obtain vibration signals. The number and location of sensors may vary depending on the unit structure and installation location conditions. Unit A has installed 12 sensors of this type parallel to the direction of the piston application on the cylinder head; Unit B installed 6 sensors of this type on the cylinder head perpendicular to the direction of the piston operation. The orange symbol represents the key phase sensor, installed on the flywheel to intercept the vibration signal of one working cycle; the Purple symbol represents the instantaneous speed sensor, used to obtain the operating condition label. The key phase sensor (parallel to the flywheel axis) and instantaneous speed sensor (perpendicular to the flywheel axis) of both units are installed on the flywheel fluted-disc structure. The key phase sensor is used to capture vibration signals of a complete working cycle, while the instantaneous speed sensor is used to obtain operating condition labels. The faulty cylinders of Units A (faulty cylinder A3) and B (faulty cylinder 2#) are used to collect vibration signal samples. These samples are applied as the training and validation dataset for the proposed method.
To better collect high-frequency vibration information caused by mechanical faults, the sampling frequency was set to 25.6 kHz during our experiment (the sensor manual recommends the highest sampling rate). The sample signals are shown in Figure 6.
Schematic diagram of vibration signals for normal, misfire, valve, and collision faults in Unit A and Unit B.
As shown in Table 1 and Figure 6, the experiments for Group A were distributed between 1000–1500 rpm and 0–400 Nm, while Group B covered the range of 1000–1800 rpm and 0–600 Nm, complying with variable-condition criteria. Observing the vibration signal samples from both groups, the signals from Group A are clearer, with distinct impacts from valve seating and ignition. However, the signals from Group B exhibited numerous unidentified impacts. Our sensor has a sampling frequency of 25.6 kHz and collected data from 1000–1800 rpm. Given that a complete working cycle of an internal combustion engine requires the main shaft to rotate twice, the number of sampling sequence points for a sample is 1700–3072. To minimize information loss, a sampling number of 3072 is taken as the interpolation length of the overall signal.
B. Model
The model constructed here includes four modules or methods: the signal decomposition layer for signal preprocessing, the modified VGG module for feature extraction and diagnosis, the LMMD method for DA, and the FVL method for feature selection.
The parameters of the modified VGG module are shown in Table 2. The model structure and parameters refer to the commonly used VGG structure [28]. It should be noted that for calculation convenience, the length of the input signal in this article was uniformly interpolated to 3072, and the training method used was train-on-batch, which randomly extracts a batch of samples from the dataset for one-step training. Therefore, the
It should be pointed out that the parameter settings in Table 3 are based on the following criteria.
Referring to the [1], the parameter “
In addition, the settings for the number of signal decomposition and transfer training epochs thresholds
Results
A. Effect of the Decomposition Layer of Multi-Impact Vibration Signals
This study measured the decomposition effectiveness using the decomposition metrics. The reconstruction energy loss (EL) and orthogonal loss (OL) coefficient between the decomposed subsignals were provided as a metric for assessing the decomposition method, as shown in Eqs. (17)–(18).\begin{align*} \mathrm {EL}& =\frac {\left |{{ \mathrm {RMS}\left ({{ \sum \nolimits _{i=1}^{K} {s_{i}\left ({{ t }}\right )} }}\right )\mathrm {-RMS}\left ({{ S\left ({{ t }}\right ) }}\right ) }}\right |}{\mathrm {RMS}\left ({{ S\left ({{ t }}\right ) }}\right )}\ast 100\% \tag {17}\\ \cos _{i, j}& =\frac {\left |{{\left \langle {{ s_{i}(t), s_{j}(t)}}\right \rangle }}\right |}{\left \|{{s_{i}(t)}}\right \|_{2}\left \|{{s_{j}(t)}}\right \|_{2}}, s_{i}(t), s_{j}(t) \in \left \{{{s_{k}(t)}}\right \}, \\ \mathrm {OL}& =\frac {1}{K(K-1)}\left ({{\sum _{i=1}^{K} \sum _{j=1}^{K} \cos _{i, j}-K}}\right ), \tag {18}\end{align*}
Taking a valve fault sample of the source unit and a cylinder collision fault sample of the target unit as examples, the decomposition results are shown in Figures 7 and 8, and the indicators are shown in Table 4.
Table 4 shows that the proposed method achieves an average reconstruction EL of 1.54% and an OL of 0.023 on Unit A and an average reconstruction EL of 2.72% and an OL of 0.028 on Unit B.
B. Effect of the FVL
The proposed FVL drives the minimization of the same feature variances under the same fault conditions reflected in the fault-related feature maps. Specifically, it reduces the number of valid features and increases the number of zero features. Zero features can be compressed (squeezed) and omitted in the DA, which helps reduce the dimensionality of DA calculations.
The comparative process is as follows: Using the VGG model, the
Introduction of FVL increased the proportion of zero features and decreased average feature variance under same fault-operating condition label in the signal feature map.
Based on the decomposed signal set from Group A, Figure 9 and Table 5 reveal that introducing FVL increased the proportion of zero features from 62.50% to 84.38% and decreased average feature variance from 0.026 to 0.011 under the same fault-operating condition.
C. Ablation Experiments Using the Proposed Method
The previous section verified FVL role in feature selection and in limiting the number of features. We conducted ablation experiments to further validate the effectiveness of the remaining parts of the constructed diagnostic model. As shown in Figure 4, after all sample signals are decomposed using the signal decomposition layer, the structure of each part of the proposed model can be simplified, as shown in Figure 10.
Structure of the proposed method using the decomposition layer for signal decomposition, VGG as the feature extraction module, FVL for feature optimization, and LMMD for domain adaptation.
The VGG model was used as the feature extraction module, with
According to Table 6, retaining only VGG for training and transfer diagnosis resulted in a diagnostic accuracy of only 24.98% and an F1 score of only 24.19% for the target internal combustion unit, indicating poor diagnostic performance. However, retaining the signal decomposition layer + VGG for training and transfer diagnosis resulted in diagnostic accuracy of only 48.95% and an F1 score of only 32.41% for the target internal combustion unit, indicating poor diagnostic performance. After introducing only LMMD in VGG, the diagnostic results of the target domain significantly improved, achieving an accuracy of 93.44% and an F1 score of 93.17%. Furthermore, after introducing LMMD, VGG further introduced FVL to optimize the features of the input MMD, achieving an accuracy of 94.81% and an F1 score of 94.42%, improving the diagnostic results.
D. Comparison with Other Methods
To validate the effectiveness of the proposed model in both diagnosis and DA—to achieve effective transfer fault diagnosis for combustion engines under varying operating conditions—we selected the samples from Table 1, with 80% used for training and 20% used for testing in both source and target domains. Furthermore, the following typical methods were employed for comparison:
VGG: VGG is a classic convolutional deep learning model. This study used the VGG11 model, which was retrained using the source domain training dataset and transferred to the target domain training dataset using a freeze-tune approach.
DAN: For comparison, this study used the VGG11 model as the feature extraction module, and MMD was added to the VGG11 model feature layers for DA, thereby constructing the DAN model.
DANN: This model includes a feature extractor that maps data to a specific feature space, a label predictor that classifies data from the source domain, and a domain discriminator that classifies data in the feature space. This study used the VGG11 model as the feature extraction module and a gradient reversal layer to build the domain discriminator.
CDAN: CDAN employs two novel conditional adjustment strategies—multilinear and entropy—to enhance DA. For comparison, this study used the VGG11 model as the feature extraction module for CDAN.
DAN + FVL: This study used the VGG11 model as the feature extraction module. FVL was introduced to enhance the DA process with the MMD function. Softmax was used as the classifier and label predictor.
Proposed method: This study used the VGG11 model as the feature extraction module. FVL was introduced to enhance the DA process with the LMMD function. Softmax was used as the classifier and label predictor.
Considering the data sample imbalance between the source (Unit A) and target (Unit B) domains, accuracy and F1-score (Macroaverage) were selected as the evaluation metrics to further demonstrate the performance of the aforementioned methods. The evaluation was conducted over 10 iterations to comprehensively assess the diagnostic performance of these methods, as shown in Table 7 and Figures 11 and 12.
Comparison of the accuracy obtained from VGG, DAN, DANN, CDAN, DAN+FVL, and the proposed method.
Comparison of the F1 scores obtained from VGG, DAN, DANN, CDAN, DAN+FVL, and the proposed method.
Table 7 and Figures 11 and 12 show that all diagnostic models constructed by these methods could recognize the source domain A-group test data and the training time for all methods is acceptable. However, the effectiveness of the VGG, DAN, DANN, CDAN, and DAN + FVL methods for the target domain B-group dropped significantly (below 80%), with only the proposed method maintaining an accuracy of 94.81% and an F1-score of 94.42%.
To further analyze the reasons for this phenomenon, we extracted the results from a single training run for each method and displayed their structural differences. Specific diagnostic results based on confusion matrices, and feature visualization based on t-distributed stochastic neighbor embedding (t-SNE) are presented in Figure 13 and Figure 14.
Confusion matrix diagram of the diagnostic results for normal(T1), misfire(T2), valve-fault(T3), and collision(T4) faults in the target domain internal combustion engine:(a) Confusion matrix of VGG, (b) Confusion matrix of DAN, (c) Confusion matrix of DANN, (d) Confusion matrix of CDAN, (e) Confusion matrix of DAN+FVL, (f) Confusion matrix of proposed method.
TSNE visualization results of source and target domain data features in the feature space: (a) Visualization results of VGG, (b) Visualization results of DAN, (c) Visualization results of DANN, (d) Visualization results of CDAN, (e) Visualization results of DAN+FVL, (f) Visualization results of proposed method.
From the confusion matrix perspective, the sharp drop in fault recognition performance can be attributed to the difficulty in clearly distinguishing between normal and misfire samples in the target domain group. This phenomenon is exhibited to varying degrees in VGG, DAN, DANN, CDAN, DAN + FVL, and the proposed method. The VGG and DANN methods further undermine the overall fault recognition performance due to their inability to identify knocking. Consequently, the proposed method exhibits the strongest recognition performance.
From a feature visualization clustering perspective, both VGG and DANN show a lack of overlap between the source domain and target domain data in the feature space. The proposed method has a better overlap compared to any other method in this paper. In summary, the above findings validate the effectiveness of the proposed method in transfer diagnosis.
Conclusion
This study proposes a new model for transfer diagnosis of internal combustion engines based on a DSAN framework, achieving transfer diagnosis between different internal combustion engine units under variable operating conditions.
First, a new multi-impact vibration signal decomposition layer is designed to minimize the signal moment to decompose multiple impact signals into single-impact modes, thereby alleviating the problem of feature aliasing in internal combustion engine vibration signals and extracting signal features rich in equipment operating state information. Second, the FVL constraint is proposed to calculate the feature variance under all operating conditions. This drives the minimization of the feature variance distribution of different operating condition subsignal features with the same fault label, reducing the sensitivity of sample features to operating conditions and exploring domain-invariant features. Third, the proposed vibration signal decomposition layer and domain FVL constraint are embedded based on the DSAN framework to construct the transfer diagnosis model. Finally, four fault experiments were conducted on the V12 high-power internal combustion engine test bench, and the same experiments were replicated on another V6 high-power internal combustion engine test bench, serving as the target domain for transfer. The experiments validated that the proposed model achieved an accuracy of 94.81% and an F1 score of 94.42% in the target domain under variable operating conditions, confirming the transfer diagnostic performance of the proposed method.
However, our proposed method has limitations. For example, it can only identify fault types existing in the source domain, which must cover those found in the target data to achieve better transfer results in the target domain. To address this issue, current research is exploring methods such as assigning multiple sublabel dictionaries to samples capable of expressing their features to describe unknown faults or using adversarial learning methods to identify such faults. These are directions for future research.
ACKNOWLEDGMENT
Declaration of competing interest:
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.