[go: up one dir, main page]

CN116701998A - An Intelligent Fault Migration Diagnosis Method Based on Generalized Networks in Heterogeneous Federal Domains - Google Patents

An Intelligent Fault Migration Diagnosis Method Based on Generalized Networks in Heterogeneous Federal Domains Download PDF

Info

Publication number
CN116701998A
CN116701998A CN202310628150.6A CN202310628150A CN116701998A CN 116701998 A CN116701998 A CN 116701998A CN 202310628150 A CN202310628150 A CN 202310628150A CN 116701998 A CN116701998 A CN 116701998A
Authority
CN
China
Prior art keywords
fault
features
domain
model
source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310628150.6A
Other languages
Chinese (zh)
Other versions
CN116701998B (en
Inventor
秦毅
钱泉
蒲华燕
毛永芳
周江洪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing University
Original Assignee
Chongqing University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing University filed Critical Chongqing University
Priority to CN202310628150.6A priority Critical patent/CN116701998B/en
Publication of CN116701998A publication Critical patent/CN116701998A/en
Application granted granted Critical
Publication of CN116701998B publication Critical patent/CN116701998B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T90/00Enabling technologies or technologies with a potential or indirect contribution to GHG emissions mitigation

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Test And Diagnosis Of Digital Computers (AREA)

Abstract

The invention relates to an intelligent fault migration diagnosis method based on heterogeneous federal domain generalized network, belonging to the field of mechanical fault migration diagnosis. The method comprises the following steps: constructing a DDA model; building HFDGN based on MIWM mechanism; s4: inputting the training samples divided in all auxiliary domains of the source client and the source domain of the target client into the constructed DDA model, and training the corresponding local client model by utilizing an optimized objective function in the DDA model; s5: uploading the parameters of the trained DDA model to a central server, and then performing federal migration fault by using the constructed HFDGN network; s6: after repeated iterative training, the error curve tends to be stable, the HFDGN network training is completed, and the trained HFDGN network is used for heterogeneous multi-source federal migration diagnosis. The invention can meet the application requirements of high data utilization rate and real-time diagnosis in actual engineering.

Description

Intelligent fault migration diagnosis method based on heterogeneous federal domain generalization network
Technical Field
The invention belongs to the field of mechanical fault migration diagnosis, and relates to an intelligent fault migration diagnosis method based on heterogeneous federal domain generalized network.
Background
In the existing mechanical fault migration diagnosis technology, although some federal migration learning intelligent diagnosis methods exist to solve the distribution difference between a source domain and a target domain and the problem of data privacy protection. However, their performance is entirely dependent on the source client and the destination client's corresponding mechanical devices being homogenous, i.e. the source client and the destination client's mechanical data should come from the same mechanical part. At the same time, these methods also require that test target domain data samples in the target client be available during the training process. Considering that in practical engineering, the target domain data is usually invisible and heterogeneous with the source client data, the existing federal migration diagnostic method is difficult to meet the application requirements of high data utilization and real-time diagnosis.
In order to solve the above problems, a novel heterogeneous federal domain generalization network (Heterogeneous federated domain generalization network, HFDGN) is needed to fill the gap of heterogeneous multi-source federal diagnostic methods.
Disclosure of Invention
In view of the above, the present invention aims to provide an intelligent fault migration diagnosis method based on heterogeneous federal domain generalization network, in the HFDGN network, a heterogeneous migration learning framework (Heterogeneous federated transfer learning, HFTL) is adopted, which can implement generalization fault diagnosis of a target client through public knowledge characterization mapping of heterogeneous source clients. In addition, a decoupling domain adaptive base model (Disentangled domain adaptation, DDA) is employed to remove negative effects of noise while at the same time enhancing the ability of domain aliasing and extracting inherent fault-related features.
In order to achieve the above purpose, the present invention provides the following technical solutions:
an intelligent fault migration diagnosis method based on heterogeneous federal domain generalization network specifically comprises the following steps:
s1: acquiring an original vibration signal on mechanical equipment through an acceleration sensor so as to facilitate construction of a subsequent migration diagnosis task; the collected original vibration signal is subjected to sample expansion by utilizing a sliding window sampling technology;
s2: constructing a decoupling domain self-adaptive basic model, namely a DDA model;
s3: constructing a heterogeneous federal migration learning network based on a mutual information weight matching mechanism, wherein the mutual information weight matching mechanism is abbreviated as MIWM; the heterogeneous federal transfer learning network is called HFDGN for short, and adopts a heterogeneous transfer learning framework;
s4: inputting the training samples divided in all auxiliary domains of the source client and the source domain of the target client into the constructed DDA model, and training the corresponding local client model by utilizing an optimized objective function in the DDA model;
s5: uploading the parameters of the trained DDA model to a central server, and then performing federal migration fault diagnosis by using the constructed HFDGN network;
s6: after repeated iterative training, the error curve tends to be stable, the HFDGN network training is completed, and the trained HFDGN network is used for heterogeneous multi-source federal migration diagnosis.
Further, in step S2, the backbone network of the DDA model includes four parts: feature extractor G FEFE ) Decoupler G DD ) Reconstructor G RR ) And fault classifier G FCFC), wherein ,θFE 、θ D 、θ R and θFC Respectively representing trainable weights of the corresponding network models; feature extractors are used to mine distribution difference knowledge and reduce distribution differencesDifferent and then obtain general feature F G The method comprises the steps of carrying out a first treatment on the surface of the The fault classifier is aimed at extracting the tag features F L To identify the type of fault; the decoupler is used for decoupling and separating out the fault related characteristic F FR Fault uncorrelated feature F FI The method comprises the steps of carrying out a first treatment on the surface of the The reconstructor reconstructs the common features from the fault-related features and the fault-uncorrelated features, so the decoupler and the reconstructor can be regarded as the encoder and decoder structures in the self-encoder.
The optimization objective of the DDA model mainly comprises the following three parts: 1) Decoupling characterization: learning a decoupling characterization to separate noise-induced fault uncorrelated features from the generic features; 2) Separation and reconstruction: maximizing a distribution distance between the fault-related features and the fault-unrelated features so as to ensure independence of the extracted fault-related features; 3) Distribution self-adaption: minimizing the distribution difference between any two fault-related features.
Further, in step S2, in the optimization objective 1) of the DDA model, the decoupling characterization is specifically: removing noise-induced fault uncorrelated features in an anti-training manner with decouplers and fault classifiers; first, using cross entropy loss L C Training a feature extractor, a decoupler and a fault classifier to accurately identify fault types;
wherein ,representing a desire for a sample domain; x is X i ,Y i Respectively representing an ith data sample and a corresponding label;Representing a sample field, C representing a fault type; i () represents an indication function, when argmax (Y i ) When=c, i=0;
the weight parameters of the feature extractor and fault classifier are then fixed, trained by maximizing the information entropy lossThe decoupler spoofs the fault classifier to learn fault uncorrelated features, wherein the information entropy loss is a measure that can reflect the purity of the sample predictive label; the smaller the information entropy loss, the higher the predictive tag probability purity. Information entropy (Information entropy, IE) loss L IE The expression is as follows:
wherein ,representing a desire for a sample domain;Representing sample X i Is>Representing the c-th element in the predictive label vector, the predictive label vector is obtained by:
by the countermeasure training between the formulas (1) and (2), the final failure-uncorrelated feature and the failure-correlated feature can be obtained by the decoupler separation.
Further, in step S2, in the optimization objective 2) of the DDA model, the separation and reconstruction are specifically: by maximizing the average difference loss L D Training the decoupler to broaden the distribution difference between the fault-related and fault-uncorrelated features:
wherein ,respectively representing expectations of fault uncorrelated features and fault correlated features; phi (·) is a high-dimensional mapping function in the regenerated hilbert space;Representing the 2 norms of the regenerated hilbert space;
fault-related feature F FR Fault uncorrelated feature F FI Obtained by the formula:
{F FR ,F FI }=G D (G FE (X)) (5)
at the same time, in order to avoid that the excessive training of formula (5) destroys the intrinsic properties of the extracted fault-related features, the loss L is lost through reconstruction R Training a reconstructor to reconstruct generic features from the fault-related features and the fault-uncorrelated features;
wherein ,representing the desire for a sample field.
Further, in step S2, in the optimization objective 3) of the DDA model, the distribution adaptation is specifically: using minimized average difference index as domain aliasing loss L DC Training the feature extractor and decoupler to achieve distributed adaptation;
wherein ,representing the expectation of fault-uncorrelated features, phi (·) being a high-dimensional mapping function in the regenerated hilbert space;Representing the 2 norms of the regenerated hilbert space;
suppose the client contains Q m There will be K domain confusion losses for the auxiliary domains wherein According to the migration learning theory, it can be known that +.>Representing the similarity of distribution of the corresponding two auxiliary domains, i.e.>Smaller represents that the feature extractor will focus too much on the corresponding +.>Is aligned with the distribution of the two auxiliary domains. Therefore, in order for the feature extractor to learn more general domain invariant features, the purity loss L is constructed using information entropy P Feature extractor and decoupler are de-trained to reduce purity between all domain aliasing losses, i.e. to maximize purity loss L P
Further, in step S2, the data of the plurality of auxiliary domains of the source client will completely participate in three optimization targets of the DDA model; because the target client assumes that the target domain data is inaccessible in the training process and can only be used for testing links, the target client only participates in two optimization targets, namely 'decoupling characterization' and 'separation and reconstruction'.
Further, in step S3, the MIWM mechanism is used to evaluate the contribution of each source client to the target client during the weight parameter allocation process;
contribution MI of ith source client to target client i The expression of (2) is:
wherein , andA fault classifier and decoupler in the DDA model trained on behalf of the ith source client; f (F) G and FL Respectively representing the obtained general characteristics and the label characteristics of the source domain data training DDA model in the target client; i Θ Representing two data sample fields +.>Mutual information between the two is expressed as follows:
wherein T (θ) represents a function set formed by network parameters; andRespectively represent-> andEdge probability distribution of>Is a joint probability distribution between two domains;Representing the product of the edge probability distribution between the two domains; theta represents a weight parameter set, theta represents a weight parameter,/->
Representing the expectation of joint probabilities of two domains,representing the product of the two-domain edge probabilities.
Further, in step S5, the central server aggregates the weight parameters of the feature extractors from all source clients, and the calculation formula is:
wherein M represents the number of source clients;a weight parameter representing a feature extractor from an ith source client;Representing the aggregated weight parameters; w (w) i Acquiring weight parameters matched with a corresponding source client and a target client through an MIWM mechanism;
wherein τ represents a temperature coefficient;
the central server then assigns the aggregated weight parameters to each of the source client and the target client.
Further, in step S5, the data is uploaded to the central serverDDA model parameters of (c) include: for all source clients, the weight parameters of the feature extractor, the decoupler and the fault classifier in the DDA model need to be uploaded; for the target client, the general feature F extracted from the source domain data in the DDA model needs to be uploaded G And tag feature F L
The invention has the beneficial effects that: the invention can solve the problem of meeting the application requirements of high data utilization rate and real-time diagnosis in actual engineering. In the Heterogeneous Federal Domain Generalized Network (HFDGN), the invention adopts a heterogeneous migration learning framework (HFTL), and can realize the generalized fault diagnosis of the target client through the public knowledge characterization mapping of the heterogeneous source client. In addition, the invention adopts a decoupling domain adaptive basic model (DDA) to remove negative influence caused by noise, and meanwhile, the basic model can enhance the domain confusion capacity and extract the inherent fault related characteristics.
Additional advantages, objects, and features of the invention will be set forth in part in the description which follows and in part will become apparent to those having ordinary skill in the art upon examination of the following or may be learned from practice of the invention. The objects and other advantages of the invention may be realized and obtained by means of the instrumentalities and combinations particularly pointed out in the specification.
Drawings
For the purpose of making the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in the following preferred detail with reference to the accompanying drawings, in which:
FIG. 1 is a diagram of a decoupling domain adaptive base model (DDA);
FIG. 2 is a schematic diagram of a heterogeneous Federal transfer learning framework (HFTL);
FIG. 3 is a DDS test stand;
FIG. 4 is a CWRU test stand;
FIG. 5 is an RDS test stand;
FIG. 6 is a SWJTU bench;
FIG. 7 shows the results of comparative experiments.
Detailed Description
Other advantages and effects of the present invention will become apparent to those skilled in the art from the following disclosure, which describes the embodiments of the present invention with reference to specific examples. The invention may be practiced or carried out in other embodiments that depart from the specific details, and the details of the present description may be modified or varied from the spirit and scope of the present invention. It should be noted that the illustrations provided in the following embodiments merely illustrate the basic idea of the present invention by way of illustration, and the following embodiments and features in the embodiments may be combined with each other without conflict.
Wherein the drawings are for illustrative purposes only and are shown in schematic, non-physical, and not intended to limit the invention; for the purpose of better illustrating embodiments of the invention, certain elements of the drawings may be omitted, enlarged or reduced and do not represent the size of the actual product; it will be appreciated by those skilled in the art that certain well-known structures in the drawings and descriptions thereof may be omitted.
The same or similar reference numbers in the drawings of embodiments of the invention correspond to the same or similar components; in the description of the present invention, it should be understood that, if there are terms such as "upper", "lower", "left", "right", "front", "rear", etc., that indicate an azimuth or a positional relationship based on the azimuth or the positional relationship shown in the drawings, it is only for convenience of describing the present invention and simplifying the description, but not for indicating or suggesting that the referred device or element must have a specific azimuth, be constructed and operated in a specific azimuth, so that the terms describing the positional relationship in the drawings are merely for exemplary illustration and should not be construed as limiting the present invention, and that the specific meaning of the above terms may be understood by those of ordinary skill in the art according to the specific circumstances.
Referring to fig. 1 to 7, an embodiment of the present invention provides a cross-bearing migration diagnosis method based on heterogeneous federal domain generalization network, which specifically includes the following steps:
step 1: the original vibration signals on the mechanical equipment are collected through the acceleration sensor so as to facilitate the construction of a subsequent migration diagnosis task. And then, expanding the sample by utilizing a sliding window sampling technology from the acquired original vibration signal.
Step 2: a decoupling domain adaptive basis model (DDA) based on a one-dimensional convolutional neural network is constructed.
As shown in fig. 1, the backbone network of the decoupling domain adaptive base model (DDA) includes four parts: feature extractor G FEFE ) Decoupler G DD ) Reconstructor G RR ) And fault classifier G FCFC), wherein θFE 、θ D 、θ R and θFC Respectively representing trainable weights of the corresponding network model. The feature extractor is used to mine the knowledge of the distribution differences and reduce the distribution differences, and then obtain the general feature F G The method comprises the steps of carrying out a first treatment on the surface of the The fault classifier is aimed at extracting the tag features F L To identify the type of fault; the decoupler is used for decoupling and separating out the fault related characteristic F FR Fault uncorrelated feature F FI The method comprises the steps of carrying out a first treatment on the surface of the The reconstructor reconstructs the common features from the fault-related features and the fault-uncorrelated features, so the decoupler and the reconstructor can be regarded as the encoder and decoder structures in the self-encoder.
The optimization objective of the decoupling domain adaptive basic model mainly comprises the following three parts: (1) decoupling characterization: learning a decoupling characterization to separate noise-induced fault uncorrelated features from the generic features; (2) separation and reconstruction: maximizing a distribution distance between the fault-related features and the fault-unrelated features so as to ensure independence of the extracted fault-related features; (3) distribution self-adaption: minimizing the distribution difference between any two fault-related features. These three parts will be described in detail below:
(1) decoupling characterization: to achieve feature decoupling, noise-induced fault-uncorrelated features are removed in an anti-training manner with decouplers and fault classifiers. First, using cross entropy loss L C Feature extractors, decouplers, and fault classifiers are trained to accurately identify fault categories.
wherein ,Xi ,Y i Respectively representing an ith data sample and a corresponding label;representing a sample field, C representing a fault type; i () represents an indication function, when argmax (Y i ) When=c, i=0.
The weight parameters of the feature extractor and fault classifier are then fixed, and the decoupler spoofs the fault classifier by maximizing the information entropy loss, which is a purity that can reflect the sample prediction labels, to learn the fault uncorrelated features. The smaller the information entropy loss, the higher the predictive tag probability purity. Information entropy (Information entropy, IE) loss L IE The expression is as follows:
wherein ,representing sample X i Is>Representing the c-th element in the predictive label vector, the predictive label vector is obtained by:
by the countermeasure training between the formulas (1) and (2), the final failure-uncorrelated feature and the failure-correlated feature can be obtained by the decoupler separation.
(2) Separation and reconstruction: to enhance the independence of the extracted fault-related features, the average difference loss L is maximized by D Training the decoupler to broaden the distribution difference between the fault-related and fault-uncorrelated features:
wherein φ (·) is a high-dimensional mapping function in regenerated Hilbert space; wherein the fault-related feature F FR Fault uncorrelated feature F FI Obtained by the formula:
{F FR ,F FI }=G D (G FE (X)) (5)
at the same time, in order to avoid that the excessive training of formula (5) destroys the intrinsic properties of the extracted fault-related features, the loss L is lost through reconstruction R Training a reconstructor to reconstruct generic features from the fault-related features and the fault-uncorrelated features;
(3) distribution self-adaption: after the two processes described above, in order to align the distribution of the extracted fault-related features between different domains, the minimized average difference index is also used as the domain aliasing loss L DC To train the feature extractor and decoupler to achieve distributed adaptation.
Suppose the client contains Q m There will be K domain confusion losses for the auxiliary domains wherein According to the migration learning theory, it can be known that +.>Representing the similarity of distribution of the corresponding two auxiliary domains, i.e.>Smaller represents that the feature extractor will focus too much on the corresponding +.>Is aligned with the distribution of the two auxiliary domains. Therefore, in order for the feature extractor to learn more general domain invariant features, the purity loss L is constructed using information entropy P Feature extractor and decoupler are de-trained to reduce purity between all domain aliasing losses, i.e. to maximize purity loss L P
The data of the multiple auxiliary domains of the source client will participate in the above three optimization processes in its entirety. Because the target client assumes that the target domain data is inaccessible in the training process and can only be used for a test link, the target client only participates in two optimization processes of decoupling characterization and separation and reconstruction of the source domain data.
Step 3: a heterogeneous federal migration learning network (HFDGN) based on a mutual information weight matching mechanism (MIWM) was constructed, and as shown in fig. 2, a heterogeneous migration learning framework (HFTL) was adopted.
The heterogeneous migration learning framework proposed by the present invention is explained in detail from three aspects:
(1) heterogeneous migration learning frame structure
As shown in fig. 2, the heterogeneous federal migration learning framework is composed of a plurality of source clients, a target domain client, a central server, and a mutual information weight matching (Mutual information weight matching, MIWM) mechanism. Wherein the source client and the target client may be heterogeneous; the central server is used for realizing the distribution processing of decoupling domain self-adaptive basic model (DDA) weight parameters trained by all source clients; a mutual information weight matching mechanism (MIWM) is used to evaluate the contribution of each source client to the target client during the weight parameter assignment process.
(2) Federal communication paradigm
Through the federal communication paradigm demonstrated in fig. 2, it can be known that all source domain clients need to upload the weight parameters of the feature extractor, the decoupler and the fault classifier in the decoupling domain adaptive base model (DDA) to the central server; for a target client, decoupling common features F extracted from source domain data in a domain adaptive base model (DDA) G And tag feature F L And need to be uploaded to a central server. Because the high-dimensional features extracted by the feature extractor are more generic and migratable, the central server only aggregates the weight parameters of the feature extractor from all source clients.
Wherein M represents the number of source clients;a weight parameter representing a feature extractor from an ith source client; w (w) i The weight parameters matched with the corresponding source client and the target client are obtained through a mutual information weight matching mechanism (MIWM). Finally, the central server assigns the aggregated weight parameters to each of the source client and the target client.
(3) Mutual information weight matching mechanism (MIWM)
In the mutual information weight matching mechanism, mutual information is used to evaluate the contribution of each source client to the target client. The mutual information may reflect the similarity between two data sample fields, with greater mutual information having greater similarity. Two data sample fieldsThe definition of mutual information between them is as follows:
wherein , andRespectively represent-> andIs a boundary probability distribution of (1);Is a joint probability distribution between two domains;Representing the product of the edge probability distribution between the two domains. The relationship between the joint probability distribution and the edge probability distribution is as follows:
because equation (10) is difficult to calculate the mutual information between unknown continuous variables, we re-express equation (10) by the Donsker-vardhan variation theory as:
where T (θ) represents the set of functions formed by the network parameters. By mutual information evaluation of equation (12), the contribution MI of the ith source client to the target client i Can be obtained by the following formula:
wherein , andA fault classifier and decoupler in a decoupling domain adaptive base model trained on behalf of an ith source client; f (F) G and FL And respectively representing the obtained general characteristics and the label characteristics of the source domain data training decoupling domain self-adaptive basic model in the target client. Finally, the weight w in formula (9) i Can be expressed as:
where τ represents the temperature coefficient, the smaller the temperature coefficient, the larger { MI } will be 1 ,MI 2 ,…,MI M Differences between.
In combination with the decoupling domain adaptive basic model and the heterogeneous federation migration learning framework, a final Heterogeneous Federation Domain Generalized Network (HFDGN) is constructed to implement data federation under privacy protection.
Step 4: and inputting the training samples divided in all auxiliary domains of the source client and the source domain of the target client into a built decoupling domain self-adaptive basic model (DDA), and training a corresponding local client model by utilizing an optimized objective function in the decoupling domain self-adaptive basic model (DDA).
Step 5: uploading parameters of a decoupling domain self-adaptive basic model of a trained target client and a source client to a central server, and then performing federal migration fault diagnosis by using a built heterogeneous federal migration learning network (HFDGN).
Step 6: after repeated iterative training, the error curve tends to be stable, the model training is completed, and the trained heterogeneous federal domain generalization network is used for heterogeneous multi-source federal migration diagnosis.
The effectiveness of the above-described intelligent diagnosis method is described below by experimental results.
Verification experiment: the experiment collects vibration signals (CWRU, RDS, SWJTU) of three bearing fault simulation experiment tables and vibration signals (DDS) of one gear fault simulation experiment table.
(1) As shown in fig. 3, the DDS test stand mainly comprises five parts, namely a motor, a planetary gear box, a parallel gear box and a magnetic powder brake. The different working condition signals can be obtained through magnetic powder brake loading simulation. Five kinds of health information are shared by the planetary gear boxes: normal, surface wear, root breakage, tooth defect, tooth breakage. Furthermore, three conditions were simulated: 0 N.m (G1), 1.4 N.m (G2), 2.8 N.m (G3).
(2) The CWRU bearing data set is a CWRU standard bearing data set disclosed by Kassi kitchen university, U.S. and a schematic diagram of the CWRU bearing data set is shown in FIG. 4, which is composed of a motor, bearings at both ends of the motor, a torque sensor and a power meter. Four faults, namely a normal fault, an inner ring fault, a rolling body fault and an outer ring fault, are simulated. It contains four kinds of operating mode information: hp (C1), 1hp (C2), 2hp (C3), and 3hp (C4).
(3) A schematic diagram of a test bed for collecting RDS bearing data sets is shown in FIG. 5, and the test bed consists of a servo motor, a coupler, a rotor, bearings at two ends of the rotor and bearing blocks. As with the CWRU bearing dataset, four health states (normal, inner ring failure, rolling body failure, and outer ring failure) were also simulated. By applying rotors of different weights, four load conditions were simulated: 0N (R1), 14N (R2) 28N (R3), and 44N (R4).
(4) The SWJTU bearing data set is from southwest university of traffic, and a schematic diagram of a test bed is shown in FIG. 6, and consists of a motor, a support bearing, a fault bearing for testing, and a loading system. Four health conditions (normal, inner ring failure, rolling body failure, and outer ring failure) and four load conditions (0 kN (B1), 1kN (B2), 2kN (B3), and 3kN (B4)) were simulated.
From the four data sets described above, a cwru+rds+swjtu→dds heterogeneous multisource federal diagnostic example was constructed to evaluate the diagnostic performance of HFDGN.
Comparison experiment:
to demonstrate the superiority of the Heterogeneous Federal Domain Generalized Network (HFDGN) -based migration diagnostic method of the present invention, experimental results at six generalized migration diagnostic tasks are shown in fig. 7, compared to the current typical domain generalized migration diagnostic model (WDCNN, whitening-net) and the classical domain adaptive diagnostic model (DANN, DDC). From fig. 7, it can be derived that the heterogeneous federal domain generalization network proposed by the present invention has higher migration diagnosis precision and stronger generalization capability. In addition, the average diagnosis accuracy of the HFDGN on six generalized migration diagnosis tasks reaches more than 89.85%, and the contrast domain adaptive diagnosis model (DANN, DDC) has approximately the same diagnosis effect. In particular, for the G1-G3, G2-G3 and G3-G2 migration diagnostic tasks, HFDGN has a more significant improvement in diagnostic accuracy than all other comparative models.
Finally, it is noted that the above embodiments are only for illustrating the technical solution of the present invention and not for limiting the same, and although the present invention has been described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications and equivalents may be made thereto without departing from the spirit and scope of the present invention, which is intended to be covered by the claims of the present invention.

Claims (9)

1.一种基于异构联邦域泛化网络的智能故障迁移诊断方法,其特征在于,该方法具体包括以下步骤:1. A method for intelligent fault migration diagnosis based on heterogeneous federated domain generalized networks, characterized in that the method specifically includes the following steps: S1:通过传感器采集机械设备上的原始振动信号,并将采集到的原始振动信号进行样本扩充;S1: Collect raw vibration signals from mechanical equipment using sensors, and then amplify the collected raw vibration signals into samples; S2:构建解耦域自适应基础模型,简称DDA模型;S2: Construct the Decoupled Domain Adaptive Base Model, abbreviated as DDA model; S3:搭建基于互信息权重匹配机制的异构联邦迁移学习网络,其中,互信息权重匹配机制简称MIWM;异构联邦迁移学习网络简称HFDGN,采用异构迁移学习框架;S3: Construct a heterogeneous federated transfer learning network based on the mutual information weight matching mechanism, where the mutual information weight matching mechanism is abbreviated as MIWM; the heterogeneous federated transfer learning network is abbreviated as HFDGN, and a heterogeneous transfer learning framework is adopted. S4:将源客户端的所有辅助域中和目标客户端的源域中划分好的训练样本输入到构建好的DDA模型中,利用DDA模型中的优化目标函数去训练对应的本地客户端模型;S4: Input the training samples from all auxiliary domains of the source client and the source domain of the target client into the constructed DDA model, and use the optimization objective function in the DDA model to train the corresponding local client model; S5:将训练好的DDA模型的参数上传到中央服务器,然后利用搭建的HFDGN网络进行联邦迁移故障诊断;S5: Upload the parameters of the trained DDA model to the central server, and then use the built HFDGN network to perform federated migration fault diagnosis. S6:经过多次迭代训练,误差曲线趋于稳定,HFDGN网络训练完成,将训练好的HFDGN网络用于异构多源联邦迁移诊断。S6: After multiple iterations of training, the error curve tends to stabilize, and the HFDGN network training is completed. The trained HFDGN network is then used for heterogeneous multi-source federated migration diagnosis. 2.根据权利要求1所述的智能故障迁移诊断方法,其特征在于,步骤S2中,DDA模型的主干网络包括四个部分:特征提取器GFEFE)、解耦器GDD)、重构器GRR)和故障分类器GFCFC),其中,θFE、θD、θR和θFC分别表示对应网络模型的可训练权重;特征提取器是被用于挖掘分布差异知识并减小分布差异,然后获得通用特征FG;故障分类器是致力于提取标签特征FL来识别故障种类;解耦器是用于解耦分离出故障相关特征FFR和故障不相关特征FFI;重构器是从故障相关特征和故障不相关特征中重构出通用特征;2. The intelligent fault migration diagnosis method according to claim 1, characterized in that, in step S2, the backbone network of the DDA model comprises four parts: a feature extractor GFE ( θFE ), a decoupler GD ( θD ), a reconstructor GR ( θR ), and a fault classifier GFC ( θFC ), wherein θFE , θD , θR , and θFC represent the trainable weights of the corresponding network model; the feature extractor is used to mine knowledge of distribution differences and reduce distribution differences, and then obtain general features FG ; the fault classifier is dedicated to extracting label features FL to identify fault types; the decoupler is used to decouple and separate fault-related features FFR and fault-independent features FFI ; the reconstructor reconstructs general features from fault-related features and fault-independent features; DDA模型的优化目标包括以下三个部分:1)解耦表征:学习一个解耦表征从通用特征中分离出噪声导致的故障不相关特征;2)分离与重构:最大化故障相关特征和故障不相关特征之间的分布距离;3)分布自适应:最小化任意两个故障相关特征之间的分布差异。The optimization objectives of the DDA model include the following three parts: 1) Decoupling representation: learn a decoupling representation to separate fault-irrelevant features caused by noise from general features; 2) Separation and reconstruction: maximize the distribution distance between fault-related features and fault-irrelevant features; 3) Distribution adaptation: minimize the distribution difference between any two fault-related features. 3.根据权利要求2所述的智能故障迁移诊断方法,其特征在于,步骤S2中,DDA模型的优化目标1)中,解耦表征具体是:用解耦器和故障分类器以对抗训练的方式去移除噪声导致的故障不相关的特征;首先,用交叉熵损失LC训练特征提取器、解耦器和故障分类器来精确识别故障种类;3. The intelligent fault migration diagnosis method according to claim 2, characterized in that, in step S2, the decoupling representation in the optimization objective 1) of the DDA model is specifically: using a decoupler and a fault classifier in an adversarial training manner to remove fault-irrelevant features caused by noise; firstly, using cross-entropy loss LC to train the feature extractor, decoupler and fault classifier to accurately identify fault types; 其中,表示对样本域求期望;Xi,Yi分别代表第i个数据样本和对应的标签;表示样本域,C表示故障种类;I()表示指示函数,当argmax(Yi)=c时,I=0;in, This represents the expectation over the sample domain; Xi and Yi represent the i-th data sample and its corresponding label, respectively. Let C represent the sample domain and C represent the fault type; I() represents the indicator function, where I = 0 when argmax(Y i ) = c; 然后,固定特征提取器和故障分类器的权重参数,通过最大化信息熵损失来训练解耦器欺骗故障分类器,从而学习到故障不相关特征;信息熵损失LIE表示如下:Then, with the weight parameters of the feature extractor and the fault classifier fixed, the decoupler is trained to deceive the fault classifier by maximizing the information entropy loss, thereby learning fault-independent features; the information entropy loss LIE is expressed as follows: 其中,表示对样本域求期望;表示样本Xi的预测标签向量,表示预测标签向量中的第c个元素,预测标签向量由下式获得:in, This indicates the expectation over the sample domain; This represents the predicted label vector of sample Xi . This represents the c-th element in the predicted label vector, which is obtained by the following formula: 通过公式(1)和(2)之间的对抗训练,最终的故障不相关特征和故障相关特征通过解耦器分离获得。Through adversarial training between formulas (1) and (2), the final fault-independent features and fault-related features are obtained by decoupling. 4.根据权利要求2所述的智能故障迁移诊断方法,其特征在于,步骤S2中,DDA模型的优化目标2)中,分离与重构具体是:通过最大化平均差异损失LD来训练解耦器去扩大故障相关特征和故障不相关特征之间的分布差异:4. The intelligent fault migration diagnosis method according to claim 2, characterized in that, in step S2, the separation and reconstruction in the optimization objective 2) of the DDA model specifically involves: training the decoupler to amplify the distribution difference between fault-related features and fault-independent features by maximizing the average difference loss LD . 其中,分别表示对故障不相关特征和故障相关特征求期望;φ(·)是再生希尔伯特空间中的高维映射函数;表示再生希尔伯特空间的2范数;in, Let represent the expectation of fault-independent features and fault-dependent features, respectively; φ(·) is a high-dimensional mapping function in the regenerated Hilbert space. The 2-norm of the regenerated Hilbert space; 故障相关特征FFR和故障不相关特征FFI通过下式获得:Fault-related characteristics FFR and fault-independent characteristics FFI are obtained by the following formula: {FFR,FFI}=GD(GFE(X)) (5){F FR ,F FI }=G D (G FE (X)) (5) 同时,通过重构损失LR来训练重构器,从故障相关特征和故障不相关特征中重构出通用特征;Meanwhile, the reconstructor is trained by reconstructing the loss L R to reconstruct general features from fault-related features and fault-independent features; 其中,表示对样本域求期望。in, This indicates the expectation over the sample domain. 5.根据权利要求2所述的智能故障迁移诊断方法,其特征在于,步骤S2中,DDA模型的优化目标3)中,分布自适应具体是:用最小化平均差异指标作为域混淆损失LDC来训练特征提取器和解耦器,从而实现分布自适应;5. The intelligent fault migration diagnosis method according to claim 2, characterized in that, in step S2, the distribution adaptation in the optimization objective 3) of the DDA model is: using the minimization of the average difference index as the domain confusion loss LDC to train the feature extractor and decoupler, thereby achieving distribution adaptation; 其中,表示对故障不相关特征求期望,φ(·)是再生希尔伯特空间中的高维映射函数;表示再生希尔伯特空间的2范数;in, φ(·) represents the expectation of fault-independent features, and is a high-dimensional mapping function in the regenerated Hilbert space. The 2-norm of the regenerated Hilbert space; 假设客户端含有Qm个辅助域,将有K个域混淆损失其中利用信息熵构建的纯度损失LP去训练特征提取器和解耦器来减小所有域混淆损失之间的纯度,即最大化纯度损失LPAssuming the client has Qm auxiliary fields, there will be K field confusion losses. in The purity loss LP, constructed using information entropy, is used to train the feature extractor and decoupler to reduce the purity among all domain confusion losses, i.e., to maximize the purity loss LP . 6.根据权利要求2所述的智能故障迁移诊断方法,其特征在于,步骤S2中,源客户端的多个辅助域的数据将完整参与DDA模型的三个优化目标;目标客户端仅源域数据参与“解耦表征”和“分离与重构”两个优化目标。6. The intelligent fault migration diagnosis method according to claim 2, characterized in that, in step S2, the data from multiple auxiliary domains of the source client will fully participate in the three optimization objectives of the DDA model; the source domain data of the target client will only participate in the two optimization objectives of "decoupling representation" and "separation and reconstruction". 7.根据权利要求1所述的智能故障迁移诊断方法,其特征在于,步骤S3中,MIWM机制被用于评估权重参数分配处理过程中每个源客户端对目标客户端的贡献度;7. The intelligent fault migration diagnosis method according to claim 1, wherein in step S3, the MIWM mechanism is used to evaluate the contribution of each source client to the target client during the weight parameter allocation process; 第i个源客户端对目标客户端的贡献度MIi的表达式为:The expression for the contribution MI of the i-th source client to the target client is: 其中,代表第i个源客户端训练的DDA模型中的故障分类器和解耦器;FG和FL分别代表目标客户端中源域数据训练DDA模型的得到的通用特征和标签特征;IΘ表示两个数据样本域之间的互信息,表达式为:in, and Represents the fault classifier and decoupler in the DDA model trained on the i-th source client; F<sub>G</sub> and FL<sub> L </sub> represent the general features and label features obtained from training the DDA model on the source domain data in the target client, respectively; I <sub>Θ </sub> represents the two data sample domains. The mutual information between them is expressed as: 其中,T(θ)表示网络参数形成的函数集;分别代表的边缘概率分布,是两个域之间的联合概率分布;代表两个域之间边缘概率分布的积;Θ表示权重参数集,θ表示权重参数,表示两域联合概率求期望,表示对两域边缘概率的积求期望。Where T(θ) represents the set of functions formed by the network parameters; and Represent and Marginal probability distribution, It is the joint probability distribution between two domains; This represents the product of the marginal probability distributions between two domains; Θ represents the set of weight parameters, and θ represents the weight parameters. To express the expectation of the joint probability of two domains, This represents the expectation of the product of the marginal probabilities of the two domains. 8.根据权利要求7所述的智能故障迁移诊断方法,其特征在于,步骤S5中,中央服务器聚合来自所有源客户端的特征提取器的权重参数,计算公式为:8. The intelligent fault migration diagnosis method according to claim 7, characterized in that, in step S5, the central server aggregates the weight parameters of the feature extractors from all source clients, and the calculation formula is: 其中,M代表源客户端的数量;表示来自于第i个源客户端的特征提取器的权重参数;表示聚合后的权重参数;wi是通过MIWM机制获取对应源客户端与目标客户端匹配的权重参数;Where M represents the number of source clients; This represents the weight parameters of the feature extractor from the i-th source client; This represents the aggregated weight parameters; w<sub>i</sub> is the weight parameter obtained by the MWIM mechanism to match the source client and the target client. 其中,τ代表温度系数;Where τ represents the temperature coefficient; 然后,中央服务器将聚合后的权重参数分配给每个源客户端和目标客户端。Then, the central server assigns the aggregated weight parameters to each source client and target client. 9.根据权利要求1或8所述的智能故障迁移诊断方法,其特征在于,步骤S5中,上传到中央服务器的DDA模型参数包括:对于所有源客户端,需要上传DDA模型中特征提取器、解耦器和故障分类器的权重参数;对于目标客户端,需要上传DDA模型中从源域数据中提取的通用特征FG和标签特征FL9. The intelligent fault migration diagnosis method according to claim 1 or 8, characterized in that, in step S5, the DDA model parameters uploaded to the central server include: for all source clients, the weight parameters of the feature extractor, decoupler and fault classifier in the DDA model need to be uploaded; for the target client, the general features FG and label features FL extracted from the source domain data in the DDA model need to be uploaded.
CN202310628150.6A 2023-05-30 2023-05-30 Intelligent fault migration diagnosis method based on heterogeneous federal domain generalization network Active CN116701998B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310628150.6A CN116701998B (en) 2023-05-30 2023-05-30 Intelligent fault migration diagnosis method based on heterogeneous federal domain generalization network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310628150.6A CN116701998B (en) 2023-05-30 2023-05-30 Intelligent fault migration diagnosis method based on heterogeneous federal domain generalization network

Publications (2)

Publication Number Publication Date
CN116701998A true CN116701998A (en) 2023-09-05
CN116701998B CN116701998B (en) 2025-04-29

Family

ID=87836727

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310628150.6A Active CN116701998B (en) 2023-05-30 2023-05-30 Intelligent fault migration diagnosis method based on heterogeneous federal domain generalization network

Country Status (1)

Country Link
CN (1) CN116701998B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119474786A (en) * 2025-01-16 2025-02-18 西安邮电大学 A method and system for equipment fault diagnosis based on federated domain generalization

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220327035A1 (en) * 2020-06-03 2022-10-13 Soochow University Intra-class adaptation fault diagnosis method for bearing under variable working conditions
CN115420490A (en) * 2022-09-01 2022-12-02 重庆大学 Mechanical fault intelligent diagnosis method based on relationship migration domain generalization network
CN115560983A (en) * 2022-09-30 2023-01-03 哈尔滨理工大学 Rolling bearing fault diagnosis method and system under different working conditions based on federal feature transfer learning

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220327035A1 (en) * 2020-06-03 2022-10-13 Soochow University Intra-class adaptation fault diagnosis method for bearing under variable working conditions
CN115420490A (en) * 2022-09-01 2022-12-02 重庆大学 Mechanical fault intelligent diagnosis method based on relationship migration domain generalization network
CN115560983A (en) * 2022-09-30 2023-01-03 哈尔滨理工大学 Rolling bearing fault diagnosis method and system under different working conditions based on federal feature transfer learning

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
LU SHIXIANG 等: "Class-Imbalance Privacy-Preserving Federated Learning for Decentralized Fault Diagnosis With Biometric Authentication", 《IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS》, vol. 18, no. 12, 12 July 2022 (2022-07-12), pages 9101 - 9111, XP011921604, DOI: 10.1109/TII.2022.3190034 *
QIAN QUAN 等: "Heterogeneous Federated Domain Generalization Network With Common Representation Learning for Cross-Load Machinery Fault Diagnosis", 《IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS》, vol. 54, no. 09, 25 June 2024 (2024-06-25), pages 5704 - 5716 *
李响 等: "保证数据隐私的装备协同智能故障诊断联邦迁移学习方法", 《机械工程学报》, vol. 59, no. 06, 20 March 2023 (2023-03-20), pages 1 - 9 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN119474786A (en) * 2025-01-16 2025-02-18 西安邮电大学 A method and system for equipment fault diagnosis based on federated domain generalization

Also Published As

Publication number Publication date
CN116701998B (en) 2025-04-29

Similar Documents

Publication Publication Date Title
Zhang et al. Data privacy preserving federated transfer learning in machinery fault diagnostics using prior distributions
Peng et al. Fault feature extractor based on bootstrap your own latent and data augmentation algorithm for unlabeled vibration signals
CN117349749B (en) Multi-source domain bearing fault diagnosis method based on hybrid convolution
CN113191215A (en) Rolling bearing fault diagnosis method integrating attention mechanism and twin network structure
CN114004252A (en) Method, device and equipment for bearing fault diagnosis
CN108830127A (en) A kind of rotating machinery fault feature intelligent diagnostic method based on depth convolutional neural networks structure
Qi et al. A review on data-driven condition monitoring of industrial equipment
CN115420490B (en) Intelligent diagnosis method of mechanical faults based on generalized network of relation transfer domain
He et al. Alzheimer's disease diagnosis model based on three-dimensional full convolutional DenseNet
CN112633195A (en) Myocardial infarction identification and classification method based on frequency domain features and deep learning
CN117556317A (en) Rotary bearing fault diagnosis method based on time-frequency image
CN116701998A (en) An Intelligent Fault Migration Diagnosis Method Based on Generalized Networks in Heterogeneous Federal Domains
Wu et al. Multichannel vibration signal fusion based on rolling bearings and MRST-transformer fault diagnosis model
Guo et al. A domain generalization network for imbalanced machinery fault diagnosis
Neili et al. Addressing varying lengths in PCG signal classification with BiLSTM model and MFCC features
Yau et al. Bearing fault detection system based on a deep diffusion model
CN114118149A (en) Induction motor fault diagnosis system based on finite element simulation and symmetric feature migration
Shafiullah et al. Detection of skin cancer: A deep learning approach
Chen et al. Wheel fault diagnosis model based on multichannel attention and supervised contrastive learning
CN117313251A (en) Global fault diagnosis method of train transmission device based on non-lag progressive learning
CN117454082A (en) Quantitative diagnosis method of imbalance fault based on ECA-ResNet and deep reinforcement learning
CN115753036A (en) Fan fault diagnosis method fusing image domain coding and residual error neural network
CN115510902A (en) Rotor fault diagnosis method and device based on multi-source heterogeneous data
CN115048960A (en) Equipment state detection method
CN119646620A (en) Self-lifting diagnosis method of mechatronic transmission system based on memory subnetwork

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant