
CN110533071B - SMT production tracing method based on self-encoder and ensemble learning - Google Patents


Info

Publication number: CN110533071B (application CN201910688024.3A; earlier publication CN110533071A)
Authority: CN (China)
Original language: Chinese (zh)
Prior art keywords: value, data, autoencoder, attribute, sequence
Inventors: 常建涛, 张凯磊, 孔宪光, 王佩
Current and original assignee: Xidian University
Application filed by Xidian University
Legal status: Active (granted)


Classifications

    • G06F18/24323 — Pattern recognition; classification techniques; tree-organised classifiers
    • G06N20/20 — Machine learning; ensemble learning
    • G06Q30/0185 — Commerce; certifying business or products; product, service or business identity fraud
    • G06Q50/04 — ICT specially adapted for business processes of specific sectors; manufacturing
    • Y02P90/30 — Climate change mitigation technologies in the production or processing of goods; computing systems specially adapted for manufacturing


Abstract

The invention discloses an SMT production tracing method based on a self-encoder and ensemble learning, which comprises the following steps: (1) constructing the self-encoders; (2) acquiring an SPI defect tracing data set; (3) normalizing the SPI defect tracing data set; (4) training the self-encoders; (5) obtaining a set of classification trees by an ensemble learning method; (6) obtaining the SMT production tracing sequence. In the invention, the normalized SPI defect tracing data set is fed into the trained self-encoders to generate a classification data set, classification trees are trained by ensemble learning, and the trained trees are traversed to obtain the SMT production tracing sequence, thereby locating the key factors that cause product defects and improving the accuracy of SMT production tracing.

Description

SMT production tracing method based on self-encoder and ensemble learning
Technical Field
The invention belongs to the technical field of electronics, and more particularly relates to a Surface Mount Technology (SMT) production tracing method based on a self-encoder and ensemble learning, within the informatization of the electronics manufacturing industry. The invention can be applied to tracing defects of the Printed Circuit Board (PCB) of an electronic product during surface-mount production, quickly locating the key factors that cause product defects.
Background
SMT is an electronic assembly technique that mounts surface-mount components onto a printed board. ISO 9000:2000 defines "traceability" as the ability to trace the history, application or location of that which is under consideration. Tracing makes it possible to control and adjust the unstable technical, human or management factors that cause defect points, and to continuously improve product quality. Among the various tracing methods, those based on machine learning and deep learning effectively exploit the diverse data generated during SMT production, alleviating the problem of under-utilized data and realizing SMT production tracing.
A Shanghai technology company discloses an SMT production tracing method in the patent document "An SMT production intelligent error-proofing tracing method and technology" (application No. 201810719538.6, publication No. CN109911365A). In that method, an intelligent warehouse is established, its operation flow is standardized, and the warehouse is used jointly with an intelligent production module to achieve real-time feedback and optimization of the SMT production line, thereby realizing SMT production tracing. The drawback of this method is that it can only collect data related to the production process; it cannot deeply mine the relationship between SMT production information and production defects, and therefore cannot promptly and accurately locate the key factors causing product defects.
Ban X et al., in the paper "Quality tracking of converter steel based adaptive feature selection and multiple linear regression" (2018 IEEE International Conference on Big Data and Smart Computing (BigComp), IEEE, 2018: 462-468), disclose a method for tracing abnormal production data in the converter steel-making process: an adaptive feature selection method based on correlation and deviation matching is used for feature selection, multiple linear regression is used to analyze the causal relationships between parameters, and the feature with the largest coefficient in the regression equation is taken as the key factor causing the production abnormality. The drawback of this method is that the adaptive feature selection can only find features linearly related to the dependent variable, so too many features are discarded, and multiple linear regression cannot describe the nonlinear relationships between independent and dependent variables, so its ability to explain those relationships is weak.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides an SMT production tracing method based on an auto-encoder and ensemble learning so as to locate key factors causing product defects.
The idea for realizing the purpose of the invention is as follows: construct 18 self-encoders with the same structure but different parameters, process the normalized SPI defect tracing data set with them to obtain a classification data set, then obtain a set of classification trees using an ensemble learning method, and finally traverse the classification tree set to obtain an SMT production tracing sequence, locating the production information that strongly influences SMT production defects.
The method comprises the following specific steps:
(1) constructing an auto encoder:
(1a) build 18 self-encoders with the same structure but different parameters, wherein each self-encoder has three layers, structured as: input layer → fully-connected layer → output layer;
(1b) set the number of nodes of the input layer and of the output layer to 76;
(1c) set the number of fully-connected-layer nodes of each self-encoder according to a formula (rendered as an image in the source) in which n_i denotes the number of fully-connected-layer nodes of the i-th self-encoder, i ∈ {1, 2, …, 18}, ⌊·⌋ denotes the rounding-down operation, and % denotes the remainder operation;
(1d) calculate the activation value of each node of the fully-connected layer in the 1st to 9th self-encoders according to the sigmoid formula

T_mj = 1 / (1 + e^(−x_mj))

where T_mj is the activation value of the j-th node in the fully-connected layer of the m-th self-encoder, m ∈ {1, 2, …, 9}, j ∈ {1, 2, …, N_m}, N_m is the total number of fully-connected-layer nodes of the m-th self-encoder, e^(·) denotes the exponential operation with base the natural constant e, x_mj is the input value of the j-th node in the fully-connected layer of the m-th self-encoder, x_mj = W_mj^T X_m, W_mj is the weight vector of the network between the input layer of the m-th self-encoder and the j-th node of the fully-connected layer (each element initialized from a standard normal distribution), T denotes the transposition operation, and X_m is the vector of the input values of the 76 input-layer nodes of the m-th self-encoder;
(1e) calculate the activation value of each node of the fully-connected layer in the 10th to 18th self-encoders according to the following formula:

R_ln = max(0, x_ln)

where R_ln is the activation value of the n-th node in the fully-connected layer of the l-th self-encoder, l ∈ {10, …, 18}, n ∈ {1, 2, …, N_l}, N_l is the total number of fully-connected-layer nodes of the l-th self-encoder, max(·) denotes the maximum operation, x_ln is the input value of the n-th node in the fully-connected layer of the l-th self-encoder, x_ln = W_ln^T X_l, W_ln is the weight vector of the network between the input layer of the l-th self-encoder and the n-th node of the fully-connected layer (each element initialized from a standard normal distribution), and X_l is the vector of the input values of the 76 input-layer nodes of the l-th self-encoder;
(1f) calculate the loss error value between the output values of each self-encoder's output layer and the input values of its input layer according to the mean-squared-error formula

L_i = (1 / N_i) · Σ_{k=1}^{N_i} (y_ik − ŷ_ik)²

where L_i is the loss error value between the output values of the i-th self-encoder's output layer and its input-layer input values, i ∈ {1, 2, …, 18}, N_i is the number of input-layer nodes (equal to the number of output-layer nodes) of the i-th self-encoder, Σ denotes the summation operation, y_ik is the input value of the k-th node of the i-th self-encoder's input layer, ŷ_ik is the output value of the k-th node of the i-th self-encoder's output layer, and k ∈ {1, 2, …, N_i};
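As a sketch of steps (1a)–(1f), the forward pass and reconstruction loss of one such three-layer self-encoder can be written in NumPy. The hidden size of 38 is a placeholder (the node-count formula is rendered as an image in the source); weights are drawn from a standard normal distribution as in steps (1d)–(1e), and the loss is taken to be the mean squared error of step (1f).

```python
import numpy as np

def sigmoid(x):
    # Step (1d): activation used by self-encoders 1-9.
    return 1.0 / (1.0 + np.exp(-x))

def relu(x):
    # Step (1e): activation used by self-encoders 10-18.
    return np.maximum(0.0, x)

def build_self_encoder(n_in=76, n_hidden=38, seed=0):
    # n_hidden = 38 is a hypothetical value; the patent sets it per self-encoder
    # by an image-rendered formula. Weights follow a standard normal distribution.
    rng = np.random.default_rng(seed)
    return {"W1": rng.standard_normal((n_in, n_hidden)),
            "W2": rng.standard_normal((n_hidden, n_in))}

def forward(params, x, activation=sigmoid):
    # x: the 76 input-layer values; hidden node j receives x_j = W_j^T x.
    hidden = activation(x @ params["W1"])
    return hidden, hidden @ params["W2"]

def loss(y, y_hat):
    # Step (1f): mean squared reconstruction error over the N_i nodes.
    return float(np.mean((y - y_hat) ** 2))
```

Autoencoders 1–9 would pass `activation=sigmoid`, and 10–18 `activation=relu`, matching the two activation formulas above.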
(2) acquiring an SPI defect tracing data set:
randomly extract at least 5,320,000 pieces of SPI tracing data from the database of a Manufacturing Execution System (MES) to form an M×N-dimensional SPI defect tracing data set, where M is at least 70,000 and N is at least 76; each row of data represents one SPI defect tracing record containing production information, each column represents the sequence of all values of one attribute in the SPI tracing data set, and at least 20,000 rows of the SPI defect tracing data set are defective detection data;
(3) according to the following formula, normalizing the data of each attribute in the SPI defect tracing data set to obtain a normalized SPI defect tracing data set:
x'_qp = (x_qp − min(x_q)) / (max(x_q) − min(x_q))

where x'_qp is the normalized value of the p-th data item of the q-th attribute in the SPI defect tracing data set, x_qp is the p-th data item of the q-th attribute of the SPI defect tracing data set, min(·) denotes the minimum operation, x_q is all data of the q-th attribute of the SPI defect tracing data set, and max(·) denotes the maximum operation;
(4) training the self-encoder:
input the normalized SPI defect tracing data set into the input layer of each of the 18 self-encoders, and train each self-encoder with the stochastic gradient descent method, obtaining 18 trained self-encoders in total;
(5) obtaining a set of classification trees using ensemble learning:
(5a) input all data of the normalized M×N-dimensional SPI defect tracing data set, row by row, to the fully-connected layer of each trained self-encoder, and form the output data of all fully-connected-layer nodes into an M×N′-dimensional classification data set, where N′ equals the number of fully-connected-layer nodes;
(5b) select A rows of data from the classification data set to form a training set, where A is computed from M by a rounding-down formula (rendered as an image in the source), ⌊·⌋ denotes the rounding-down operation, and M is the number of rows of the classification data set; the remaining data of the classification data set form a test set;
(5c) train a classification tree on the training set with the classification and regression tree (CART) training method;
(5d) classify the test set with the trained classification tree to obtain the classification accuracy of the tree;
(6) obtaining an SMT production tracing sequence:
(6a) for each trained classification tree, take the root node as the start node of each traversal and, in turn, take every leaf node as the target node of one traversal; the attribute names passed by each traversal form one tracing sequence of the classification tree;
(6b) take the classification accuracy of each classification tree as the credibility of all tracing sequences of that tree;
(6c) find the self-encoder corresponding to each tracing sequence; then, for each attribute name in the tracing sequence, find the corresponding node in the fully-connected layer of that self-encoder, and form the network weight vector of the attribute name from the network weights connecting all input-layer nodes of the self-encoder to that fully-connected-layer node; the number of elements of the network weight vector of each attribute name equals the number of input-layer nodes of the corresponding self-encoder;
(6d) arrange the network weight vectors of the attribute names of each tracing sequence by rows to form a C×D-dimensional risk matrix of the tracing sequence, where C is the total number of attribute names in the tracing sequence and D is the total number of attributes in the SPI defect tracing data set;
(6e) sum the risk matrix of each tracing sequence by columns to form the tracing vector of the tracing sequence, in which each entry represents the importance of the corresponding attribute of the SPI defect tracing data set;
(6f) sort all data of the tracing vector of each tracing sequence in descending order to form the SMT production tracing sequence; the higher the importance of an item of SMT production information, the stronger its influence on SMT production defects.
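Step (6a)'s traversal — collecting the attribute names on every root-to-leaf path — can be sketched with scikit-learn's tree internals (`tree_.feature`, `tree_.children_left`, `tree_.children_right`). The feature names and toy data below are hypothetical; the patent's trees are trained on the encoded SPI classification data set.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def tracing_sequences(tree, feature_names):
    """Collect, for every leaf, the attribute names on the root-to-leaf path (step (6a))."""
    t = tree.tree_
    sequences = []

    def walk(node, path):
        if t.children_left[node] == t.children_right[node]:  # leaf: both children are -1
            sequences.append(path)
            return
        name = feature_names[t.feature[node]]
        walk(t.children_left[node], path + [name])
        walk(t.children_right[node], path + [name])

    walk(0, [])
    return sequences

# Toy stand-in for the encoded classification data set (hypothetical values and names).
X = np.array([[0.1, 0.9], [0.2, 0.8], [0.9, 0.1], [0.8, 0.2]])
y = np.array([0, 0, 1, 1])
clf = DecisionTreeClassifier(random_state=0).fit(X, y)
seqs = tracing_sequences(clf, ["blade_pressure", "pad_volume"])
```

Each returned list is one tracing sequence; per step (6b), the tree's test accuracy would then serve as the credibility of all of its sequences.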
Compared with the prior art, the invention has the following advantages:
First, by constructing and training self-encoders, the invention retains independent variables that are both linearly and nonlinearly related to the dependent variable, overcoming the prior-art limitation of selecting only linearly related independent variables, so the SMT production tracing sequence obtained by the invention covers the SMT production information more comprehensively.
Second, the invention obtains a classification tree set by ensemble learning and derives the SMT production tracing sequence from it, describing both the linear and the nonlinear relationships between independent and dependent variables; this overcomes the prior-art limitation of describing only linear relationships, so the invention identifies the key factors causing product defects more accurately.
Third, because the ensemble-learned classification tree set deeply mines the relationship between SMT production information and SMT product defects, the invention overcomes the prior-art limitation of merely collecting production-process data, and can uncover hidden factors that cause product defects.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a schematic diagram of the self-encoder structure of the present invention;
FIG. 3 is a schematic diagram of production information for the SPI production traceability dataset of the present invention;
FIG. 4 is a classification tree of the present invention.
Detailed Description
The invention is further described below with reference to the accompanying drawings.
The specific steps of the present invention will be described in further detail with reference to fig. 1.
Step 1: construct the self-encoders.
Build 18 self-encoders with the same structure but different parameters, wherein each self-encoder has three layers, structured as: input layer → fully-connected layer → output layer.
The number of nodes of the input layer and the output layer is set to 76.
The number of fully-connected-layer nodes of each self-encoder is set according to a formula (rendered as an image in the source) in which n_i denotes the number of fully-connected-layer nodes of the i-th self-encoder, i ∈ {1, 2, …, 18}, ⌊·⌋ denotes the rounding-down operation, and % denotes the remainder operation.
The structure of the constructed self-encoder will be further described with reference to fig. 2.
In fig. 2, the circles marked X in the leftmost column represent input-layer nodes, the circles marked h in the middle column represent fully-connected-layer nodes, and the circles marked y in the right column represent output-layer nodes. Each arrowed line indicates that the value of the node at its left end, multiplied by the corresponding weight, contributes to the input value of the node at its right end; the arrows indicate the direction of data flow during prediction.
The activation value of each node of the fully-connected layer in the 1st to 9th self-encoders is calculated according to the sigmoid formula

T_mj = 1 / (1 + e^(−x_mj))

where T_mj is the activation value of the j-th node in the fully-connected layer of the m-th self-encoder, m ∈ {1, 2, …, 9}, j ∈ {1, 2, …, N_m}, N_m is the total number of fully-connected-layer nodes of the m-th self-encoder, e^(·) denotes the exponential operation with base the natural constant e, x_mj is the input value of the j-th node in the fully-connected layer of the m-th self-encoder, x_mj = W_mj^T X_m, W_mj is the weight vector of the network between the input layer of the m-th self-encoder and the j-th node of the fully-connected layer (each element initialized from a standard normal distribution), T denotes the transposition operation, and X_m is the vector of the input values of the 76 input-layer nodes of the m-th self-encoder.
The activation value of each node of the fully-connected layer in the 10th to 18th self-encoders is calculated according to the following formula:

R_ln = max(0, x_ln)

where R_ln is the activation value of the n-th node in the fully-connected layer of the l-th self-encoder, l ∈ {10, …, 18}, n ∈ {1, 2, …, N_l}, N_l is the total number of fully-connected-layer nodes of the l-th self-encoder, max(·) denotes the maximum operation, x_ln is the input value of the n-th node in the fully-connected layer of the l-th self-encoder, x_ln = W_ln^T X_l, W_ln is the weight vector of the network between the input layer of the l-th self-encoder and the n-th node of the fully-connected layer (each element initialized from a standard normal distribution), T denotes the transposition operation, and X_l is the vector of the input values of the 76 input-layer nodes of the l-th self-encoder.
In an embodiment of the present invention, the parameters of the 18 self-encoders are shown in Table 1, "Parameter table of the 18 self-encoders" (rendered as an image in the source).
The loss error value between the output values of each self-encoder's output layer and the input values of its input layer is calculated according to the mean-squared-error formula

L_i = (1 / N_i) · Σ_{k=1}^{N_i} (y_ik − ŷ_ik)²

where L_i is the loss error value between the output values of the i-th self-encoder's output layer and its input-layer input values, i ∈ {1, 2, …, 18}, N_i is the number of input-layer nodes (equal to the number of output-layer nodes) of the i-th self-encoder, Σ denotes the summation operation, y_ik is the input value of the k-th node of the i-th self-encoder's input layer, ŷ_ik is the output value of the k-th node of the i-th self-encoder's output layer, and k ∈ {1, 2, …, N_i}.
Step 2: acquire an SPI defect tracing data set.
At least 5,320,000 pieces of SPI tracing data are randomly extracted from the database of a Manufacturing Execution System (MES) to form an M×N-dimensional SPI defect tracing data set, where M is at least 70,000 and N is at least 76; each row of data represents one SPI defect tracing record containing production information, each column represents the sequence of all values of one attribute in the SPI tracing data set, and at least 20,000 rows of the SPI defect tracing data set are defective detection data.
The five types of production information included in the SPI defect tracing data are further described with reference to fig. 3. The box labeled "process parameters" in fig. 3 represents production information on process parameters, including blade classification speed, blade classification distance, platen print height compensation, platen separation speed, platen separation distance, blade pressure, and cleaning speed. The box labeled "printing process status parameters" represents production information on printing-process status parameters, including print time, work file, production count, squeegee count, MASK count, squeegee mean pressure, squeegee minimum pressure, squeegee maximum pressure, auto clean count, manual clean count, print direction, and platen separation delay. The box labeled "intermediate product inspection parameter" represents production information on intermediate-product inspection results, including pad volume, pad area, pad height, and inspection result. The box labeled "environmental parameter" represents production information on environmental parameters, namely humidity and temperature. The box labeled "raw material property parameter" represents production information on raw-material property parameters, including PCB bar code, PCB length, PCB width, PCB thickness, pad number, package type, doctor blade ID, and steel mesh ID. The box in the middle marked "MES system" represents the MES system, and the lines in the figure indicate that these five aspects of production information all come from the MES system.
Step 3: normalize the data of each attribute in the SPI defect tracing data set according to the following formula, obtaining a normalized SPI defect tracing data set:
x'_qp = (x_qp − min(x_q)) / (max(x_q) − min(x_q))

where x'_qp is the normalized value of the p-th data item of the q-th attribute in the SPI defect tracing data set, x_qp is the p-th data item of the q-th attribute of the SPI defect tracing data set, min(·) denotes the minimum operation, x_q is all data of the q-th attribute of the SPI defect tracing data set, and max(·) denotes the maximum operation.
In the embodiment of the present invention, the SPI defect tracing data set before normalization is shown in Table 2. Each row contains 7 types of production information: blade pressure, blade speed, separation speed, pad volume, pad area, pad height, and the SPI detection result. The serial number is the number of the row in which the data is located; blade pressure is in newtons per square centimetre, blade speed in millimetres per second, and separation speed in centimetres per second. Pad volume, pad area, and pad height are the relative values automatically calculated by the SPI detection device. The SPI detection result is the result reported by the SPI detection device: 0 means no defect, and 1 means a continuous-tin (solder bridging) defect.
Table 2, a partial data table of the SPI defect tracing data set, is rendered as an image in the source.
The normalization process in the embodiment of the present invention is illustrated with the blade-pressure value in the first row of Table 2. Among all blade-pressure values listed in Table 2, the maximum is 13 and the minimum is 8. The blade-pressure value in the first row is 11, which is normalized as follows:

x' = (11 − 8) / (13 − 8)

yielding a normalized blade-pressure value of 0.6 for the first row of the data table.
The results obtained after normalizing all the data of Table 2 are shown in Table 3, a partial normalized SPI defect tracing data table (rendered as an image in the source).
And 4, training the self-encoder.
The normalized SPI defect tracing data set is input into the input layer of each of the 18 self-encoders, and each self-encoder is trained with the stochastic gradient descent method, giving 18 trained self-encoders in total.
The steps of the stochastic gradient descent method are as follows:
Step 1: randomly select a not-yet-selected data item from the normalized SPI defect tracing data set;
Step 2: after the selected data item is input into the input layer of the self-encoder, calculate the loss error value between the output data of the self-encoder's output layer and the selected data item according to

L_i = (1 / N_i) · Σ_{k=1}^{N_i} (y_ik − ŷ_ik)²

where L_i is the loss error value between the output values of the i-th self-encoder's output layer and its input-layer input values, i ∈ {1, 2, …, 18}, N_i is the number of input-layer nodes (equal to the number of output-layer nodes) of the i-th self-encoder, Σ denotes the summation operation, y_ik is the input value of the k-th node of the i-th self-encoder's input layer, ŷ_ik is the output value of the k-th node of the i-th self-encoder's output layer, and k ∈ {1, 2, …, N_i};
Step 3: update each parameter of the self-encoder network according to

ω′_t = ω_t − l · ∂L_i/∂θ_t

where ω′_t is the t-th parameter of the self-encoder after updating, t ∈ {1, 2, …, 2 × N × (num + 1)}, num is the total number of fully-connected-layer nodes of the self-encoder, ω_t is the t-th parameter of the self-encoder before updating, l is the learning rate with value range [0, 1], ∂/∂θ_t denotes the partial-derivative operation, and θ_t is the t-th parameter of the self-encoder before the parameter update;
Step 4: input the data item selected in Step 1 into the input layer of the self-encoder after the parameter update, and calculate the loss error value between the updated self-encoder's output-layer output and the selected data item according to

L_i = (1 / N_i) · Σ_{k=1}^{N_i} (y_ik − ŷ_ik)²

where L_i is the loss error value between the output values of the i-th self-encoder's output layer and its input-layer input values, i ∈ {1, 2, …, 18}, N_i is the number of input-layer nodes (equal to the number of output-layer nodes) of the i-th self-encoder, Σ denotes the summation operation, y_ik is the input value of the k-th node of the i-th self-encoder's input layer, ŷ_ik is the output value of the k-th node of the i-th self-encoder's output layer, and k ∈ {1, 2, …, N_i};
in the embodiment of the present invention, the training error values of 18 autoencoders are shown in table 4:
table 418 loss error values table from encoder
Self encoder sequence number Error value of training Self encoder sequence number Error value of training
1 0.0093 10 0.0158
2 0.0159 11 0.0100
3 0.0095 12 0.0034
4 0.0061 13 0.0081
5 0.0036 14 0.0075
6 0.0067 15 0.0030
7 0.0119 16 0.0017
8 0.0195 17 0.0075
9 0.0194 18 0.0151
Step 5: judge whether the loss error value between the updated self-encoder's output-layer values and the selected data item is smaller than the current loss-error threshold; if so, the self-encoder is trained; otherwise, return to Step 1. The threshold is a value chosen from the range [0, 300] according to the required training precision of the self-encoder network: the larger the chosen value, the lower the training precision, and the smaller the chosen value, the higher the training precision.
In the embodiment of the present invention, the threshold value is set to 0.02.
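The stochastic gradient descent procedure of steps 1-5 can be sketched in NumPy as follows. This is a minimal illustration, not the patented implementation: the dimensions (8 inputs, 4 hidden nodes), the random stand-in data, and the learning rate are all assumptions; the patent's self-encoders use 76 input/output nodes and the normalized SPI data set.

```python
import numpy as np

rng = np.random.default_rng(0)
N, H = 8, 4                                  # toy input/output and fully-connected node counts
W1 = 0.1 * rng.standard_normal((H, N))       # input -> fully-connected weights
W2 = 0.1 * rng.standard_normal((N, H))       # fully-connected -> output weights
lr, threshold = 0.5, 0.02                    # learning rate in [0,1]; loss threshold as in step 5

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

def forward(x):
    h = sigmoid(W1 @ x)                      # hidden activation (self-encoders 1-9 use sigmoid)
    return h, W2 @ h                         # linear reconstruction at the output layer

def mse(y_hat, x):
    return np.mean((x - y_hat) ** 2)         # L = (1/N) * sum_k (y_k - y_hat_k)^2

data = rng.random((50, N))                   # stand-in for normalized SPI rows
init_loss = mse(forward(data[0])[1], data[0])

for _ in range(1000):
    x = data[rng.integers(len(data))]        # step 1: randomly pick a row
    h, y_hat = forward(x)                    # step 2: forward pass
    # step 3: gradients of the MSE loss (manual backpropagation)
    err = (y_hat - x) * (2.0 / N)
    gW2 = np.outer(err, h)
    gh = W2.T @ err
    W1 -= lr * np.outer(gh * h * (1.0 - h), x)
    W2 -= lr * gW2
    # steps 4-5: recompute the loss; stop once it drops below the threshold
    if mse(forward(x)[1], x) < threshold:
        break

final_loss = mse(forward(data[0])[1], data[0])
```

The stopping rule mirrors step 5: training ends as soon as a per-sample reconstruction loss falls below the chosen threshold, otherwise another random sample is drawn.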
Step 5, obtaining a classification tree set by using an ensemble learning method.
All data of the normalized M×N-dimensional SPI defect tracing data set are input row by row into the fully connected layer of each trained self-encoder, and the output data of all nodes of the fully connected layer form an M×N'-dimensional classification data set, where the value of N' equals the number of nodes of the fully connected layer.
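The construction of the classification data set in the paragraph above amounts to a single forward pass through the trained fully connected layer. A minimal sketch, assuming a sigmoid fully connected layer and toy dimensions in place of the 76-node input and actually trained weights:

```python
import numpy as np

rng = np.random.default_rng(1)
M, N, H = 6, 8, 4                  # toy sizes; the patent uses M x 76 inputs
W1 = rng.standard_normal((H, N))   # stand-in for trained input -> fully-connected weights

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

X = rng.random((M, N))             # normalized M x N SPI rows
features = sigmoid(X @ W1.T)       # M x N' classification data set, N' = H
```

Each row of `features` is the fully-connected-layer output for one SPI record, i.e. one row of the classification data set fed to the CART trees.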
A rows of data are selected from the classification data set to form a training set, wherein ⌊·⌋ represents the rounding-down operation and M represents the number of rows of the classification data set; the remaining data in the classification data set form the test set.
The training set is trained using the classification and regression tree (CART) training method to obtain a trained classification tree.
The CART training method comprises the following steps:
step 1, taking the sequence number of each column in the training set as an attribute of the training set, and forming a value sequence of the attribute corresponding to the training set by all elements of each column in the training set;
step 2, deleting repeated numerical values in the value sequence of each attribute to obtain a numerical value set of each attribute;
step 3, for each value in the value set of each attribute, counting the number of times the value appears in the value sequence of the corresponding attribute of the training set, and taking this count as the frequency of the value;
and step 4, calculating, from the value sequence and the value set of each attribute, the Gini index value of the attribute according to the following formula:
g_b = 1 − Σ_{s=1}^{N_b} (n_bs / n_b)^2
wherein g_b represents the Gini index value of the b-th attribute, N_b represents the total number of values in the value set of the b-th attribute, Σ represents the summation operation, s represents the sequence number of a value in the value set, s ∈ [1, N_b], n_bs represents the frequency of the s-th value in the value set of the b-th attribute, and n_b represents the total number of values in the value sequence of the b-th attribute;
step 5, taking the attribute with the maximum Gini index value as the optimal attribute;
step 6, adding the attribute name of the optimal attribute into a base classifier;
step 7, arranging all numerical values of the value sequence of the optimal attribute from small to large as the optimal attribute sequence;
step 8, sequentially taking the average value of each pair of adjacent numerical values in the optimal attribute sequence from left to right as a segmentation point of the optimal attribute sequence, forming all the numerical values smaller than the segmentation point in the sequence into a left sequence of the segmentation point, and forming all the numerical values larger than the segmentation point in the sequence into a right sequence of the segmentation point;
and step 9, respectively calculating the Gini index value of each segmentation point according to the following formula:
g = 1 − (c/(c+d))^2 − (d/(c+d))^2
wherein g represents the importance score of the segmentation point, c represents the number of values in the left sequence of the segmentation point, and d represents the number of values in the right sequence of the segmentation point;
step 10, selecting the value of the segmentation point with the largest Gini index value as the segmentation threshold of the optimal attribute;
step 11, taking the row elements of each row in the training set as one piece of classification data;
step 12, forming a left sub-training set from the classification data whose values of the optimal attribute are less than or equal to the segmentation threshold, and forming a right sub-training set from the classification data whose values of the optimal attribute are greater than the segmentation threshold;
and step 13, respectively training the left sub-training set and the right sub-training set using the same CART training method as in step (5c), until the SPI detection results of all data in the left sub-training set and the right sub-training set are the same; all attribute names of the base classifier then form a classification tree.
The trained classification tree is further described with reference to fig. 4, where each box in fig. 4 represents a node of the classification tree. The box labeled "X[2]" indicates that the value of the node is the attribute name of the output data sequence of the 2nd node of the fully connected layer of the self-encoder corresponding to the classification tree; likewise, the boxes labeled "X[16]", "X[4]", "X[7]" and "X[5]" correspond to the 16th, 4th, 7th and 5th nodes of that fully connected layer, respectively. The box labeled "tin connection" indicates that the SPI detection result of the node is tin connection (solder bridging); the box labeled "non-defective" indicates that the SPI detection result of the node is non-defective. In fig. 4, the starting node of an arrowed line is the parent node, and the destination node of the arrowed line is the child node.
The test set is classified using the trained classification tree to obtain the classification accuracy of the classification tree.
Step 6, obtaining the SMT production tracing sequence.
And for each trained classification tree, taking a root node of the classification tree as a starting node of each traversal, sequentially taking all leaf nodes of the classification tree as destination nodes of each traversal, and taking all attribute names passed by each traversal as a tracing sequence of the classification tree.
And taking the classification accuracy of each classification tree as the credibility of all the tracing sequences of the classification tree.
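The root-to-leaf traversal described above can be sketched with a toy tree. The tuple encoding, the node labels, and the helper `tracing_sequences` are illustrative assumptions, not structures from the patent:

```python
# Toy classification tree: internal nodes are (attribute_name, left_child, right_child)
# tuples, leaves are SPI result labels -- an assumed encoding for illustration only.
tree = ("X[2]",
        ("X[16]", "non-defective", "tin connection"),
        ("X[4]", "tin connection", "non-defective"))

def tracing_sequences(node, path=()):
    """Collect the attribute names passed on every root-to-leaf traversal."""
    if isinstance(node, str):                       # leaf reached: one traversal ends here
        return [list(path)]
    name, left, right = node
    return (tracing_sequences(left, path + (name,)) +
            tracing_sequences(right, path + (name,)))

paths = tracing_sequences(tree)
```

Each of the four leaves yields one tracing sequence; the first is ['X[2]', 'X[16]'].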
Searching a self-encoder corresponding to each tracing sequence, then searching a node corresponding to each attribute name in the tracing sequence corresponding to the self-encoder in a full connection layer of the self-encoder, and forming a network weight vector of the attribute name by using network weight values from all nodes of an input layer of the self-encoder to the corresponding node in the full connection layer, wherein the total number of elements of the network weight vector corresponding to each attribute name is the same as the number of nodes of the input layer of the corresponding self-encoder.
The network weight vectors of the attribute names of each tracing sequence are arranged in rows to form a C×D-dimensional risk matrix of the tracing sequence, where C represents the total number of attribute names in the tracing sequence and D represents the total number of attributes in the SPI defect tracing data set.
All data obtained by summing the risk matrix of each tracing sequence by columns form the tracing vector of that tracing sequence, and each datum in the tracing vector represents the importance of the corresponding attribute in the SPI defect tracing data set.
All data of the tracing vector of each tracing sequence are sorted from large to small to form the SMT production tracing sequence.
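The weight-vector lookup, risk matrix, column sums, and sorting of the preceding paragraphs can be sketched as follows; the dimensions, the random stand-in weights, and the chosen fully-connected nodes are assumptions:

```python
import numpy as np

rng = np.random.default_rng(2)
D, H = 5, 3                              # D input attributes, H fully-connected nodes (toy sizes)
W1 = rng.standard_normal((H, D))         # stand-in for trained input -> fully-connected weights
sequence_nodes = [0, 2]                  # fully-connected nodes matched to the C attribute names

risk = W1[sequence_nodes, :]             # C x D risk matrix: one network weight vector per name
trace_vector = risk.sum(axis=0)          # column sums: importance of each of the D attributes
order = np.argsort(trace_vector)[::-1]   # attribute indices from most to least important
```

The descending `order` is the SMT production tracing sequence for this (toy) tracing sequence: attributes ranked by their summed connection weight into the selected fully-connected nodes.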
In the embodiment of the invention, the finally obtained SMT production trace sequence is shown in Table 5.
TABLE 5. SMT production tracing sequence list
No. | SMT production tracing sequence
1 | Distance of blade separation > Width of the board > Thickness of the board > Tin connection
2 | Distance of blade separation > Width of the board > Automatic cleaning and counting > Tin connection
3 | Speed of blade separation > Automatic cleaning and counting > Tin connection
4 | Speed of blade separation > Speed of blade separation > Tin connection
5 | Length of the board > Automatic cleaning > Separating speed of the working table > Tin connection
6 | Speed of blade separation > Separating speed of the working table > Pressure of the scraper > Tin connection
7 | Separating speed of the working table > Tin connection

Claims (4)

1. An SMT production tracing method based on a self-encoder (autoencoder) and ensemble learning, characterized in that autoencoders are constructed, an ensemble learning method is used to obtain a set of classification trees, and an SMT production tracing sequence is obtained, the method comprising the following specific steps:
(1) constructing the autoencoders:
(1a) building 18 autoencoders with the same structure but different parameters, each autoencoder having three layers arranged in order as: input layer → fully connected layer → output layer;
(1b) setting the number of nodes of the input layer and of the output layer to 76;
(1c) setting the number of fully connected layer nodes of each autoencoder according to the following formula:
Figure FDA0002146985580000011
wherein n_i represents the number of fully connected layer nodes of the i-th autoencoder, i ∈ {1,2,…,18}, ∈ represents set membership, ⌊·⌋ represents the rounding-down operation, and % represents the remainder operation;
(1d) calculating the activation value of each node of the fully connected layer in the 1st to 9th autoencoders according to the following formula:
T_mj = 1 / (1 + e^(−x_mj))
wherein T_mj represents the activation value of the j-th node in the fully connected layer of the m-th autoencoder, m ∈ {1,2,…,9}, j ∈ {1,2,…,N_m}, N_m represents the total number of fully connected layer nodes of the m-th autoencoder, e(·) represents the exponential operation with the natural constant e as base, x_mj represents the input value of the j-th node in the fully connected layer of the m-th autoencoder, x_mj = W_mj^T X_m, W_mj represents the weight matrix of the network between the input layer of the m-th autoencoder and the j-th node in its fully connected layer, the initial value of each element of this matrix obeying the standard normal distribution, T represents the transpose operation, and X_m represents the vector composed of the input values of the 76 nodes in the input layer of the m-th autoencoder;
(1e) calculating the activation value of each node of the fully connected layer in the 10th to 18th autoencoders according to the following formula:
R_ln = max(0, x_ln)
wherein R_ln represents the activation value of the n-th node in the fully connected layer of the l-th autoencoder, l ∈ {10,…,18}, n ∈ {1,2,…,N_l}, N_l represents the total number of nodes of the fully connected layer of the l-th autoencoder, max(·) represents the maximum-value operation, x_ln represents the input value of the n-th node in the fully connected layer of the l-th autoencoder, x_ln = W_ln^T X_l, W_ln represents the weight matrix of the network between the input layer of the l-th autoencoder and the n-th node of its fully connected layer, the initial value of each element of this matrix obeying the standard normal distribution, and X_l represents the vector composed of the input values of the 76 nodes in the input layer of the l-th autoencoder;
(1f) calculating the loss error value between the output value of the output layer and the input value of the input layer of each autoencoder according to the following formula:
L_i = (1/N_i) · Σ_{k=1}^{N_i} (y_ik − ŷ_ik)^2
wherein L_i represents the loss error value between the output value of the output layer of the i-th autoencoder and the input value of its input layer, i ∈ {1,2,…,18}, N_i represents the number of input layer nodes and of output layer nodes of the i-th autoencoder, Σ represents the summation operation, y_ik represents the input value of the k-th node of the input layer of the i-th autoencoder, ŷ_ik represents the output value of the k-th node of the output layer of the i-th autoencoder, and k ∈ {1,2,…,N_i};
(2) obtaining an SPI defect tracing data set:
randomly extracting at least 5,320,000 pieces of SPI tracing data from the database of the manufacturing execution system MES to form an SPI defect tracing data set of dimension M×N, M being at least 70,000 and N being at least 76, wherein each row of data represents one piece of SPI defect tracing data containing production information, each column of data represents the sequence composed of all values of one attribute in the SPI tracing data set, and at least 20,000 rows of SPI tracing data in the SPI defect tracing data set are defective detection data;
(3) normalizing the data of each attribute in the SPI defect tracing data set according to the following formula to obtain a normalized SPI defect tracing data set:
x'_qp = (x_qp − min(x_q)) / (max(x_q) − min(x_q))
wherein x'_qp represents the normalized value of the p-th datum of the q-th attribute in the SPI defect tracing data set, x_qp represents the p-th datum of the q-th attribute of the SPI defect tracing data set, min(·) represents the minimum-value operation, x_q represents all data of the q-th attribute of the SPI defect tracing data set, and max(·) represents the maximum-value operation;
(4) training the autoencoders:
inputting the normalized SPI defect tracing data set into the input layer of each of the 18 autoencoders, and training each autoencoder separately using the stochastic gradient descent method, to obtain 18 trained autoencoders;
(5) obtaining a set of classification trees using an ensemble learning method:
(5a) inputting all data of the normalized M×N-dimensional SPI defect tracing data set row by row into the fully connected layer of each trained autoencoder, and forming an M×N'-dimensional classification data set from the output data of all nodes of the fully connected layer, the value of N' being equal to the number of nodes of the fully connected layer;
(5b) selecting A rows of data from the classification data set to form a training set, wherein
A is given by the formula of Figure FDA0002146985580000031, ⌊·⌋ represents the rounding-down operation, and M represents the number of rows of the classification data set; the remaining data of the classification data set form the test set;
(5c) training the training set using the classification and regression tree CART training method to obtain a trained classification tree;
(5d) classifying the test set using the trained classification tree to obtain the classification accuracy of the classification tree;
(6) obtaining the SMT production tracing sequence:
(6a) for each trained classification tree, taking the root node of the classification tree as the starting node of each traversal, taking all leaf nodes of the classification tree in turn as the destination node of each traversal, and taking all attribute names passed by each traversal as one tracing sequence of the classification tree;
(6b) taking the classification accuracy of each classification tree as the credibility of all tracing sequences of the classification tree;
(6c) finding the autoencoder corresponding to each tracing sequence, then finding the node in the fully connected layer of this autoencoder corresponding to each attribute name in the tracing sequence, and forming the network weight vector of the attribute name from the network weight values from all nodes of the input layer of the autoencoder to the corresponding node in the fully connected layer, the total number of elements of the network weight vector corresponding to each attribute name being the same as the number of input layer nodes of the corresponding autoencoder;
(6d) arranging the network weight vectors of the attribute names of each tracing sequence in rows to form the C×D-dimensional risk matrix of the tracing sequence, wherein C represents the total number of attribute names in the tracing sequence and D represents the total number of attributes in the SPI defect tracing data set;
(6e) forming the tracing vector of each tracing sequence from all data obtained by summing the risk matrix of the tracing sequence by columns, each datum in the tracing vector representing the importance of the corresponding attribute in the SPI defect tracing data set;
(6f) sorting all data of the tracing vector of each tracing sequence from large to small to form the SMT production tracing sequence.
2. The SMT production tracing method based on a self-encoder and ensemble learning according to claim 1, characterized in that the stochastic gradient descent method in step (4) comprises the following steps:
first step: randomly selecting one previously unselected piece of data from the normalized SPI defect tracing data set;
second step: inputting the selected data into the input layer of the autoencoder, and calculating the loss error value between the output data of the output layer of the autoencoder and the selected data according to the following formula:
L = (1/N) · Σ_{k=1}^{N} (y_k − ŷ_k)^2
wherein L represents the loss error value between the output value of the output layer after the selected data is input into the autoencoder and the selected data, N represents the total number of input layer nodes of the autoencoder, the total number of output layer nodes of the autoencoder being equal to the total number of input layer nodes and the input layer nodes corresponding one to one with the output layer nodes in node order, Σ represents the summation operation, y_k represents the input value of the k-th node of the input layer of the autoencoder, ŷ_k represents the output value of the k-th node of the output layer of the autoencoder, k ∈ {1,2,…,N}, and ∈ represents set membership;
third step: updating each parameter of the autoencoder network according to the following formula:
ω'_t = ω_t − l · (∂L/∂θ_t)
wherein ω'_t represents the t-th parameter of the autoencoder after updating, t ∈ {1,2,…,2×N×(num+1)}, num represents the total number of fully connected layer nodes of the autoencoder, ω_t represents the t-th parameter of the autoencoder before updating, l represents the learning rate with value range [0,1], ∂ represents the partial-derivative operation, and θ_t represents the t-th parameter of the autoencoder before the parameter update;
fourth step: inputting the data selected in the first step into the input layer of the autoencoder with updated parameters, and calculating the loss error value between the output data of the output layer of the updated autoencoder and the selected data according to the following formula:
L' = (1/N) · Σ_{k=1}^{N} (y_k − ŷ'_k)^2
wherein L' represents the loss error value between the output data of the output layer of the autoencoder after the parameter update and the selected data, and ŷ'_k represents the output value of the k-th node of the output layer of the autoencoder after the parameter update;
fifth step: judging whether the loss error value between the output value of the output layer of the updated autoencoder and the selected data is smaller than the current loss-error threshold; if so, a trained autoencoder is obtained; otherwise, the first step is executed; the threshold is a value selected from the range [0,300] according to the required training precision of the autoencoder network: the larger the selected value, the lower the training precision of the network, and the smaller the selected value, the higher the training precision of the network.
3. The SMT production tracing method based on a self-encoder and ensemble learning according to claim 1, characterized in that the production information in step (2) covers five aspects: raw materials, process, printing process, environment, and detection results.
4. The SMT production tracing method based on a self-encoder and ensemble learning according to claim 1, characterized in that the CART training method in step (5c) comprises the following steps:
first step: taking the sequence number of each column of the training set as one attribute of the training set, and forming the value sequence of the corresponding attribute from all elements of that column;
second step: deleting the repeated values in the value sequence of each attribute to obtain the value set of each attribute;
third step: for each value in the value set of each attribute, counting the number of times it appears in the value sequence of the corresponding attribute of the training set as the frequency of that value;
fourth step: calculating, according to the following formula, the Gini index value of each attribute from its value sequence and value set:
g_b = 1 − Σ_{s=1}^{N_b} (n_bs / n_b)^2
wherein g_b represents the Gini index value of the b-th attribute, N_b represents the total number of values in the value set of the b-th attribute, Σ represents the summation operation, s represents the sequence number of a value in the value set, s ∈ [1, N_b], n_bs represents the frequency of the s-th value in the value set of the b-th attribute, and n_b represents the total number of values in the value sequence of the b-th attribute;
fifth step: taking the attribute with the largest Gini index value as the optimal attribute;
sixth step: adding the attribute name of the optimal attribute to the base classifier;
seventh step: arranging all values of the value sequence of the optimal attribute from small to large as the optimal attribute sequence;
eighth step: taking, from left to right, the average of each pair of adjacent values in the optimal attribute sequence as one segmentation point of the optimal attribute sequence, forming the left sequence of the segmentation point from all values of the sequence smaller than the segmentation point, and forming the right sequence of the segmentation point from all values of the sequence larger than the segmentation point;
ninth step: calculating the Gini index value of each segmentation point according to the following formula:
g = 1 − (c/(c+d))^2 − (d/(c+d))^2
wherein g represents the importance score of the segmentation point, c represents the number of values in the left sequence of the segmentation point, and d represents the number of values in the right sequence of the segmentation point;
tenth step: selecting the value of the segmentation point with the largest Gini index value as the segmentation threshold of the optimal attribute;
eleventh step: taking the row elements of each row of the training set as one piece of classification data;
twelfth step: forming a left sub-training set from all classification data whose value of the optimal attribute is smaller than or equal to the segmentation threshold, and forming a right sub-training set from all classification data whose value of the optimal attribute is larger than the segmentation threshold;
thirteenth step: training the left sub-training set and the right sub-training set respectively using the same CART training method as in step (5c), until the SPI detection results of all data in the left sub-training set and the right sub-training set are the same, and forming a classification tree from all attribute names of the base classifier.
CN201910688024.3A 2019-07-29 2019-07-29 SMT production tracing method based on self-encoder and ensemble learning Active CN110533071B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910688024.3A CN110533071B (en) 2019-07-29 2019-07-29 SMT production tracing method based on self-encoder and ensemble learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910688024.3A CN110533071B (en) 2019-07-29 2019-07-29 SMT production tracing method based on self-encoder and ensemble learning

Publications (2)

Publication Number Publication Date
CN110533071A CN110533071A (en) 2019-12-03
CN110533071B true CN110533071B (en) 2022-03-22

Family

ID=68660567

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910688024.3A Active CN110533071B (en) 2019-07-29 2019-07-29 SMT production tracing method based on self-encoder and ensemble learning

Country Status (1)

Country Link
CN (1) CN110533071B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20140057753A (en) * 2012-11-04 2014-05-14 박호열 Production management system of surface mounted tfchnology
CN109597968A (en) * 2018-12-29 2019-04-09 西安电子科技大学 Paste solder printing Performance Influence Factor analysis method based on SMT big data
CN109657718A (en) * 2018-12-19 2019-04-19 广东省智能机器人研究院 SPI defect classification intelligent identification Method on a kind of SMT production line of data-driven
CN110021341A (en) * 2019-02-21 2019-07-16 华东师范大学 A kind of prediction technique of GPCR drug based on heterogeneous network and targeting access

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7366728B2 (en) * 2004-04-27 2008-04-29 International Business Machines Corporation System for compressing a search tree structure used in rule classification

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20140057753A (en) * 2012-11-04 2014-05-14 박호열 Production management system of surface mounted tfchnology
CN109657718A (en) * 2018-12-19 2019-04-19 广东省智能机器人研究院 SPI defect classification intelligent identification Method on a kind of SMT production line of data-driven
CN109597968A (en) * 2018-12-29 2019-04-09 西安电子科技大学 Paste solder printing Performance Influence Factor analysis method based on SMT big data
CN110021341A (en) * 2019-02-21 2019-07-16 华东师范大学 A kind of prediction technique of GPCR drug based on heterogeneous network and targeting access

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Design and Implementation of a Quality Traceability System for Electronic Component Products; Qu Zhenglong; China Masters' Theses Full-text Database, Information Science and Technology; 2016-03-15 (No. 03); I138-2892 *

Also Published As

Publication number Publication date
CN110533071A (en) 2019-12-03

Similar Documents

Publication Publication Date Title
CN109597968B (en) SMT big data-based solder paste printing performance influence factor analysis method
CN110543616B (en) SMT solder paste printing volume prediction method based on industrial big data
Tsai Modeling and optimization of stencil printing operations: A comparison study
CN104407589B (en) Workshop manufacturing process-oriented active sensing and anomaly analysis method of real-time production performance
DE112020001874T5 (en) DATA EXTRACTION SYSTEM
CN111242363A (en) A method and system for predicting order combination and layout of PCB boards based on machine learning
CN113601261B (en) Monitoring method of online rapid optimization model for cutter
WO2022267509A1 (en) Method for training smt printing parameter optimization model, device, and storage medium
CN114375107B (en) Method, device and equipment for reconstructing unstructured influencing factors of solder paste printing on SMT production lines
CN111832432A (en) A real-time prediction method of tool wear based on wavelet packet decomposition and deep learning
CN114330549A (en) Chemical process fault diagnosis method based on depth map network
CN107728589B (en) A method for on-line monitoring of flexible IC substrate etching and development process
US20190095876A1 (en) Method and system for determining maintenance policy of complex forming device
CN106055579B (en) Vehicle performance data cleaning system and method based on artificial neural network
CN115099147A (en) A process analysis and intelligent decision-making method based on SMT production line
CN115587543A (en) Tool Remaining Life Prediction Method and System Based on Federated Learning and LSTM
CN113822499A (en) Train spare part loss prediction method based on model fusion
WO2020162884A1 (en) Parameter suggestion system
CN111177495A (en) Method for intelligently identifying data content and generating corresponding industry report
CN114820569A (en) PCB surface defect classification method based on improved ResNet34 network
CN115017671A (en) Industrial process soft sensing modeling method and system based on data flow online cluster analysis
Lawrence et al. On the distribution of performance from multiple neural-network trials
CN118037112A (en) Tread quality prediction model construction method based on data driving
CN110533071B (en) SMT production tracing method based on self-encoder and ensemble learning
Digiesi et al. A model to evaluate the human error probability in inspection tasks of a production system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant