CN109620244B

CN109620244B - A Conditional Generative Adversarial Network and SVM-Based Approach for Infant Abnormal Behavior Detection

Info

Publication number: CN109620244B
Application number: CN201811494749.0A
Authority: CN
Inventors: 王世刚; 戴晓辉; 赵岩; 韦健
Original assignee: Jilin University
Current assignee: Jilin University
Priority date: 2018-12-07
Filing date: 2018-12-07
Publication date: 2021-07-30
Anticipated expiration: 2038-12-07
Also published as: CN109620244A

Abstract

A method for detecting abnormal infant behavior based on a conditional generative adversarial network (CGAN) and a support vector machine (SVM), belonging to the technical field of video image processing and deep learning. The invention judges whether an infant's behavior is abnormal by analyzing the infant's motion trajectory in video. First, infant video is acquired, clipped to a suitable length, and converted into frame images, and the limbs and whole body are labeled to build a sample library. The CGAN is then used to track the infant's limbs and whole body. Next, the wavelet approximation waveform and wavelet power spectrum of each tracked motion trajectory are computed, and the resulting features are classified by the SVM and judged comprehensively. Because the invention tracks the motion trajectories of both the limbs and the whole body, it captures more complete information than single-limb detection, and training on both the wavelet domain and the power spectrum domain improves detection accuracy. Detecting abnormal infant behavior so that intervention can begin as early as possible is of great significance for preventing conditions such as infantile cerebral palsy.

Description

Method for detecting abnormal infant behavior based on a conditional generative adversarial network and SVM

Technical Field

The invention belongs to the technical field of video image processing and deep learning, and particularly relates to a method for detecting abnormal infant behavior based on a conditional generative adversarial network and an SVM.

Background

Abnormal infant behavior mainly means that the infant's movements lack small-amplitude, moderate-speed motion in various directions with variable acceleration; that the movements are not appropriate for the infant's age; that other movement patterns (such as limb midline movement, hand-to-knee contact, visual searching, finger scratching, and grasping at clothes) are absent; and that overall movement fluency is poor. Such abnormal behavior corresponds to brain damage and, in severe cases, can lead to cerebral palsy. Because cerebral palsy can only be diagnosed after the child is one to two years old, detecting abnormal infant behavior early and intervening in time has strong practical significance.

In response to this problem, researchers have proposed several methods for detecting abnormal infant behavior, which can be broadly divided into three types: whole-body movement quality assessment, wearable sensor assessment, and assessment combined with pattern recognition. The first method records video of the infant in a prescribed way and judges whether the behavior is abnormal by applying whole-body movement quality criteria to the recording; it relies mainly on observation and is therefore somewhat subjective. The second method attaches sensor devices to the infant to observe movement parameters, but wearing the devices itself interferes with the infant's movement to some extent, making the prediction result inaccurate. The third method uses a computer to extract the infant's motion features and analyze them by pattern recognition; it does not interfere with the infant's movement and is objective, but during feature extraction and recognition only a limited number of body parts are observed and the whole-body motion is not analyzed, so the approach is somewhat one-sided.

Because of these shortcomings, the above methods struggle to achieve the desired effect in practical applications, so improvement is necessary.

Disclosure of Invention

The invention aims to provide a method for detecting abnormal infant behavior based on a conditional generative adversarial network (CGAN), combined with a supervised SVM classification method, to improve the accuracy of abnormal infant behavior detection.

A method for detecting abnormal infant behavior based on a conditional generative adversarial network and SVM, characterized in that a training sample library required for target tracking is constructed in advance, the conditional generative adversarial network is used to track the infant's four limbs and whole body, the training sample library comprises the labeled limbs and whole body of the infant, motion trajectory information is extracted by wavelet approximation waveform and wavelet power spectrum analysis, and the features of the motion trajectory information are classified by a support vector machine (SVM); the method comprises the following steps:

1.1 Acquire infant video and apply unified preprocessing;

1.2 Clip the infant video obtained in step 1.1 to a 15 s segment, name the clips uniformly, and uniformly name the frame images converted from them;

1.3 Infant motion trajectory tracking: for the frame images obtained in step 1.2, use the conditional generative adversarial network (CGAN) to track the motion trajectories of the infant's four limbs and whole body respectively, specifically comprising the following steps:

1.3.1 Construct the training sample library required for target tracking: label the infant's left hand, right hand, left leg, right leg, and whole body in the frame images obtained in step 1.2; the labeled limbs and whole body form the training sample library, which serves as the target data set input to the CGAN, with the corresponding labels used as condition Y;

1.3.2 Generative model design: randomly segment each frame image containing the infant to form a pseudo-target data set, and pass it through the convolutional layers into the discriminative model together with condition Y;

1.3.3 Discriminative model design: feed the target data set and condition Y into the discriminative model to judge the limbs and whole body, then feed the pseudo-target data set into the discriminator to judge whether each candidate is a target;

1.3.4 After judging whether a candidate is a target, compute the error so that it conforms to the following formulas:

Optimizing D:

$$\max_D V(D, G) = \mathbb{E}_{x \sim p_{\text{data}}(x)}[\log D(x)] + \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z)))]$$

Optimizing G:

$$\min_G V(D, G) = \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z)))]$$

where: V(D, G) is the loss function; p_data(x) is the real sample distribution; p_z(z) is the pseudo-sample distribution; D(x) is the discriminator's output for real sample data; D(G(z)) is the discriminator's output for pseudo-sample data; E denotes expectation;

the model parameters are adjusted according to these optimization conditions, with the parameters of the generative model G and the discriminative model D shared;

1.3.5 If the error is too large, feed it back to the input of the generative model, reconstruct the pseudo-target data set, and judge again, repeating until the positions of the infant's four limbs and whole body are found in the pseudo-target data set; record the positions and motion trajectories of the infant's left hand, right hand, left leg, right leg, and whole body in each frame;

1.4 Motion trajectory analysis: for the motion trajectories of the infant's four limbs and whole body tracked in step 1.3, store the continuously changing y-axis coordinate positions during movement, and compute the wavelet approximation waveform and wavelet power spectrum of the continuous waveform formed by those y-axis positions, specifically comprising the following steps:

1.4.1 Because the x-axis coordinate changes little, only the y-axis coordinate curve is analyzed; first, wavelet approximation waveform analysis is performed: the tracked waveform is analyzed with the Haar wavelet to obtain the wavelet approximation waveform;

1.4.2 For the y-axis coordinate curves of the limbs and whole body, obtain power spectrum information using a wavelet-based power spectrum;

1.5 Extract feature vectors from the resulting wavelet approximation waveforms and wavelet power spectra, and train a support vector machine (SVM) on them, specifically comprising the following steps:

1.5.1 Divide the samples into normal and abnormal samples and label them, setting the label of normal samples to 1 and the label of abnormal samples to -1;

1.5.2 Divide the samples into a training set and a test set, normalize the data, and tune the SVM parameters c and g to reach the highest accuracy, thereby obtaining the optimal trained model;

1.6 Comprehensive judgment of abnormal infant behavior: based on the optimal trained models obtained in step 1.5.2, set different weights according to their different accuracies and make a weighted judgment, specifically comprising the following steps:

1.6.1 For the SVM model trained on the wavelet approximation waveforms obtained in step 1.4.1, set different weight coefficients according to the different accuracies for the limbs and whole body, specifically: left upper limb A1: 0.35; right upper limb A2: 0.01; left lower limb A3: 0.2; right lower limb A4: 0.35; whole body A5: 0.09; the judgment result vectors of the four limbs and whole body are denoted Y1 to Y5 respectively and are calculated as follows:

Y1 = (test label + predict label) / 2

where: test label is the actual label of the test sample; predict label is the label predicted for the test sample; Y2 to Y5 are calculated in the same way;

the five result vectors are weighted as follows:

Y = 0.35*Y1 + 0.01*Y2 + 0.2*Y3 + 0.35*Y4 + 0.09*Y5

where: * denotes multiplication, and Y is the judgment value predicted from the wavelet details; the judgment criterion is: if -1 < Y < -0.3, the infant's behavior is judged abnormal; if 0.3 < Y < 1, the infant's behavior is judged normal; all other values are treated as a judgment error;

1.6.2 For the SVM model trained on the wavelet power spectra obtained in step 1.4.2, set different weight coefficients according to the different accuracies for the limbs and whole body, specifically: left upper limb P1: 0.35; right upper limb P2: 0.01; left lower limb P3: 0.35; right lower limb P4: 0.2; whole body P5: 0.09; the judgment result vectors of the four limbs and whole body are denoted X1 to X5 respectively and are calculated as follows:

X1 = (test label + predict label) / 2

where: test label is the actual label of the test sample; predict label is the label predicted for the test sample; X2 to X5 are calculated in the same way;

the five result vectors are weighted as follows:

X = 0.35*X1 + 0.01*X2 + 0.35*X3 + 0.2*X4 + 0.09*X5

where: * denotes multiplication, and X is the judgment value predicted from the wavelet power spectrum; the judgment criterion is: if -1 < X < -0.3, the infant's behavior is judged abnormal; if 0.3 < X < 1, the infant's behavior is judged normal; all other values are treated as a judgment error;

X and Y are then judged together: if a test sample satisfies at least one of the X and Y conditions, the judgment result is considered correct, and whether the infant's behavior is normal can be distinguished.

The generative model design and discriminative model design of steps 1.3.2 and 1.3.3 specifically comprise the following steps:

2.1 Generative model design: 6 convolutional layers with a stride of 1, and 6 pooling layers with a 2 x 2 pooling window; the network uses the rectified linear unit (ReLU) activation function, which gives good results and faster convergence; the specific operation is:

F(Z) = σ(W * Z + b)

where: W is the convolution kernel; * is the convolution operation; Z is the feature vector; b is the bias; σ is the ReLU activation function;

2.2 Discriminative model design: 5 convolutional layers with a stride of 1, and 5 pooling layers with a 2 x 2 pooling window; the network uses the rectified linear unit (ReLU) activation function.
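As a rough illustration of the operation F(Z) = σ(W * Z + b) followed by 2 x 2 pooling, the following Python sketch applies one convolutional layer with a stride of 1, a ReLU activation, and one pooling layer to a small feature map. The function names, the absence of padding, the pooling stride of 2, and the choice of max pooling are assumptions made for illustration; the patent does not specify them.

```python
def conv2d_valid(feature_map, kernel, bias=0.0):
    """Stride-1 convolution without padding (computed as cross-correlation,
    as is usual in deep learning frameworks), plus a scalar bias b."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(feature_map) - kh + 1
    out_w = len(feature_map[0]) - kw + 1
    return [
        [
            sum(
                feature_map[i + di][j + dj] * kernel[di][dj]
                for di in range(kh)
                for dj in range(kw)
            )
            + bias
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def relu(feature_map):
    """sigma in F(Z) = sigma(W * Z + b): negative values become zero."""
    return [[max(0.0, value) for value in row] for row in feature_map]


def max_pool_2x2(feature_map):
    """2 x 2 pooling window; a stride of 2 and max pooling are assumed."""
    return [
        [
            max(
                feature_map[i][j],
                feature_map[i][j + 1],
                feature_map[i + 1][j],
                feature_map[i + 1][j + 1],
            )
            for j in range(0, len(feature_map[0]) - 1, 2)
        ]
        for i in range(0, len(feature_map) - 1, 2)
    ]


# Toy 5 x 5 input Z and 2 x 2 kernel W (illustrative values only).
Z = [[1.0] * 5 for _ in range(5)]
W = [[1.0, 1.0], [1.0, 1.0]]
pooled = max_pool_2x2(relu(conv2d_valid(Z, W, bias=-1.0)))
print(pooled)
```

Stacking six such convolution and pooling pairs for the generative model, or five for the discriminative model, would follow the same pattern with learned kernels.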

The specific calculation of the wavelet approximation waveform and the wavelet power spectrum in step 1.4 comprises the following steps:

3.1 Analyze the tracked waveform with the Haar wavelet: construct a five-level pyramid according to the Mallat pyramid decomposition algorithm of the discrete wavelet transform and extract the fifth-level wavelet approximation signal for each of the four limbs and the whole body, denoted respectively: abnormal left upper limb: A01; abnormal right upper limb: A02; abnormal left lower limb: A03; abnormal right lower limb: A04; abnormal whole body: A05; normal left upper limb: A11; normal right upper limb: A12; normal left lower limb: A13; normal right lower limb: A14; normal whole body: A15;

3.2 For the y-axis coordinate curves of the limbs and whole body, compute a wavelet-based power spectrum, with the sampling length set to the total video frame count of 375, the sampling frequency set to 1000, and the sampling interval set to 1/1000; the resulting power spectra are denoted respectively: abnormal left upper limb: P01; abnormal right upper limb: P02; abnormal left lower limb: P03; abnormal right lower limb: P04; abnormal whole body: P05; normal left upper limb: P11; normal right upper limb: P12; normal left lower limb: P13; normal right lower limb: P14; normal whole body: P15.

For the generative model design and discriminative model design, the method comprises the following steps:

Step A1, generative model design: 6 convolutional layers with a stride of 1, and 6 pooling layers with a 2 x 2 pooling window; the network uses the rectified linear unit (ReLU) activation function, which gives good results and faster convergence; the specific operation is:

F(Z) = σ(W * Z + b)

Step A2, discriminative model design: 5 convolutional layers with a stride of 1, and 5 pooling layers with a 2 x 2 pooling window; the network uses the rectified linear unit (ReLU) activation function.

Further, the specific calculation of the wavelet approximation waveform and the wavelet power spectrum comprises the following steps:

Step B1, analyze the tracked waveform with the Haar wavelet: construct a five-level pyramid according to the Mallat pyramid decomposition algorithm of the discrete wavelet transform and extract the fifth-level wavelet approximation waveform for each of the limbs and the whole body, denoted respectively: abnormal left upper limb: A01; abnormal right upper limb: A02; abnormal left lower limb: A03; abnormal right lower limb: A04; abnormal whole body: A05; normal left upper limb: A11; normal right upper limb: A12; normal left lower limb: A13; normal right lower limb: A14; normal whole body: A15;

Step B2, for the y-axis coordinate curves of the limbs and whole body, compute a wavelet-based power spectrum, with the sampling length set to the total video frame count of 375, the sampling frequency set to 1000, and the sampling interval set to 1/1000; the resulting power spectra are denoted respectively: abnormal left upper limb: P01; abnormal right upper limb: P02; abnormal left lower limb: P03; abnormal right lower limb: P04; abnormal whole body: P05; normal left upper limb: P11; normal right upper limb: P12; normal left lower limb: P13; normal right lower limb: P14; normal whole body: P15.

The invention adopts a method for detecting abnormal infant behavior based on a conditional generative adversarial network and SVM. The acquired infant video is first preprocessed. The conditional generative adversarial network (CGAN) is then used to track the motion trajectories of the infant's four limbs and whole body in the video, and the resulting trajectory information is stored. Features are extracted from the trajectory information by wavelet transform: one sample set is built from the extracted wavelet approximation waveforms and used to train a configured SVM, and a second sample set is built from the wavelet power spectra of the trajectory information and used to train a configured SVM in the same way. The two trained models are tested, and different weight parameters are set according to their different accuracies for weighted judgment, so as to obtain the optimal result.

The invention detects motion trajectories using information from both the infant's four limbs and whole body, which is more comprehensive than the information obtained from single-limb detection. Semi-supervised learning with the CGAN makes trajectory tracking more accurate, and combining the wavelet domain with the power spectrum domain makes the features more abstract and specific. The SVM classifies the features, and the weighted judgment of the detection results reduces the false detection rate. Detecting whether an infant's behavior is abnormal so that intervention can begin as early as possible is of great significance for preventing conditions such as infantile cerebral palsy.

Drawings

FIG. 1 is a flow chart of the method for detecting abnormal infant behavior based on a conditional generative adversarial network and SVM

FIG. 2 is a flow chart of motion trajectory tracking using the CGAN

FIG. 3 is a single-frame image of tracking the infant's left upper limb

FIG. 4 is an image of the detected y-axis motion trajectory of the infant

FIG. 5 is a wavelet approximation waveform obtained by wavelet transform

FIG. 6 is a schematic diagram of the wavelet power spectrum

FIG. 7 is a flow chart for judging whether the infant's behavior is abnormal

Detailed Description

The following describes the implementation process of the present invention with reference to the attached drawings.

A method for detecting abnormal infant behavior based on a conditional generative adversarial network and SVM, whose overall flow is shown in FIG. 1, comprises the following steps:

1. Acquire infant video and apply unified preprocessing.

2. Clip the infant video from step 1 to a 15 s segment, name the clips uniformly, and uniformly name the frame images converted from them.
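One possible way to name the frames uniformly is sketched below in Python. The naming pattern, the `frame_names` helper, and the frame rate of 25 frames per second are assumptions; the 25 fps figure is chosen only because it makes a 15 s clip yield the 375 frames that the description later gives as the total video frame length.

```python
def frame_names(video_id, duration_s=15, fps=25):
    """Return uniformly formatted frame file names for one clipped video.

    The pattern '<video_id>_frameNNNN.png' is a hypothetical convention.
    """
    total_frames = duration_s * fps
    # Zero-padded indices keep the frames in order when sorted by name.
    return [
        f"{video_id}_frame{index:04d}.png"
        for index in range(1, total_frames + 1)
    ]


names = frame_names("infant001")
print(len(names), names[0], names[-1])
```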

3. Infant motion trajectory tracking: for the frame images obtained in step 2, use the conditional generative adversarial network (CGAN) to track the motion trajectories of the infant's four limbs and whole body; the flow chart is shown in FIG. 2, and the process specifically comprises the following steps:

3.1 Construct the training sample library required for target tracking: label the infant's left hand, right hand, left leg, right leg, and whole body in the frame images obtained in step 2; the labeled limbs and whole body form the training sample library, which serves as the target data set input to the CGAN, with the corresponding labels used as condition Y;

3.2 Generative model design: randomly segment each frame image containing the infant to form a pseudo-target data set, and pass it through the convolutional layers into the discriminative model together with condition Y;

3.3 Discriminative model design: feed the target data set and condition Y into the discriminative model to judge the limbs and whole body, then feed the pseudo-target data set into the discriminator to judge whether each candidate is a target;

3.4 After judging whether a candidate is a target, compute the error so that it conforms to the following formulas:

Optimizing D:

$$\max_D V(D, G) = \mathbb{E}_{x \sim p_{\text{data}}(x)}[\log D(x)] + \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z)))]$$

Optimizing G:

$$\min_G V(D, G) = \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z)))]$$

where: V(D, G) is the loss function; p_data(x) is the real sample distribution; p_z(z) is the pseudo-sample distribution; D(x) is the discriminator's output for real sample data; D(G(z)) is the discriminator's output for pseudo-sample data; E denotes expectation.

The purpose of the formulas is to minimize the error of the generative model, so that the generated pseudo-targets are as close to real targets as possible (that is, so that the target position is found as reliably as possible), and to maximize the error of the discriminative model.
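To make the two objectives concrete, the following Python sketch estimates V(D, G) and the generator term from batches of discriminator outputs by replacing each expectation with a sample mean. The function names and the toy output values are assumptions for illustration; the discriminator outputs are taken to be probabilities strictly between 0 and 1.

```python
import math


def value_function(d_real, d_fake):
    """Sample estimate of V(D, G).

    d_real: discriminator outputs D(x) for real (labeled) samples.
    d_fake: discriminator outputs D(G(z)) for pseudo-samples.
    """
    real_term = sum(math.log(d) for d in d_real) / len(d_real)
    fake_term = sum(math.log(1.0 - d) for d in d_fake) / len(d_fake)
    return real_term + fake_term


def generator_objective(d_fake):
    """Sample estimate of E_z[log(1 - D(G(z)))], the term G minimizes."""
    return sum(math.log(1.0 - d) for d in d_fake) / len(d_fake)


# An undecided discriminator versus a confident one (toy numbers).
undecided = value_function([0.5, 0.5], [0.5, 0.5])
confident = value_function([0.9, 0.9], [0.1, 0.1])
print(undecided, confident)

# The generator's term is lower when D scores its pseudo-targets as real.
print(generator_objective([0.9]), generator_objective([0.1]))
```

A discriminator that separates real targets from pseudo-targets well pushes V(D, G) up toward 0, while a generator whose pseudo-targets are scored as real pushes its own term down, which matches the max and min in the formulas.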

3.5 If the error is too large, feed it back to the input of the generative model, reconstruct the pseudo-target data set, and judge again, repeating until the positions of the infant's limbs and whole body are found in the pseudo-target data set; record the positions and motion trajectories of the infant's left hand, right hand, left leg, right leg, and whole body in each frame. FIG. 3 shows a single frame of tracking the infant's left upper limb.

4. Motion trajectory analysis: for the motion trajectories of the infant's four limbs and whole body tracked in step 3, store the continuously changing y-axis coordinate positions during movement (as shown in FIG. 4), and compute the wavelet approximation waveform and wavelet power spectrum of the continuous waveform formed by those y-axis positions, specifically comprising the following steps:

4.1 Because the x-axis coordinate changes little, only the y-axis coordinate curve is analyzed; first, wavelet analysis is performed: the tracked waveform is analyzed with the Haar wavelet to obtain the wavelet approximation waveform;

4.2 For the y-axis coordinate curves of the limbs and whole body, obtain power spectrum information using a wavelet-based power spectrum.

5. Extract feature vectors from the resulting wavelet approximation waveforms and wavelet power spectra, and train a support vector machine (SVM) on them, specifically comprising the following steps:

5.1 Divide the samples into normal and abnormal samples and label them, setting the label of normal samples to 1 and the label of abnormal samples to -1;

5.2 Divide the samples into a training set and a test set, normalize the data, and tune the SVM parameters c and g to reach the highest accuracy, thereby obtaining the optimal trained model;
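Step 5.2 can be sketched as a normalization followed by a grid search over the two parameters. The sketch below is only a skeleton under stated assumptions: c and g are taken to be the penalty parameter and the RBF kernel gamma (as in LIBSVM's `-c` and `-g` options), the normalization is assumed to be per-feature min-max scaling, the grid of powers of two is an arbitrary choice, and `toy_accuracy` is a hypothetical stand-in for training an SVM and measuring its test accuracy, which the patent does not detail.

```python
import math


def min_max_normalize(rows):
    """Scale each feature column to [0, 1] (assumed normalization)."""
    columns = list(zip(*rows))
    lows = [min(column) for column in columns]
    highs = [max(column) for column in columns]
    return [
        [
            (value - low) / (high - low) if high > low else 0.0
            for value, low, high in zip(row, lows, highs)
        ]
        for row in rows
    ]


def grid_search(evaluate, exponents=range(-5, 6)):
    """Try c = 2**i and g = 2**j and keep the pair with the best accuracy.

    evaluate(c, g) must train a model with those parameters and return
    its accuracy; here it is supplied by the caller.
    """
    best = None
    for c_exp in exponents:
        for g_exp in exponents:
            c, g = 2.0 ** c_exp, 2.0 ** g_exp
            accuracy = evaluate(c, g)
            if best is None or accuracy > best[0]:
                best = (accuracy, c, g)
    return best


def toy_accuracy(c, g):
    """Hypothetical accuracy surface peaking at c = 4, g = 0.25."""
    return 1.0 / (1.0 + abs(math.log2(c) - 2) + abs(math.log2(g) + 2))


print(min_max_normalize([[1, 10], [3, 20], [2, 30]]))
print(grid_search(toy_accuracy))
```

In practice `evaluate` would wrap whatever SVM implementation is used, training on the normalized training set and scoring on the test set.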

6. Comprehensive judgment of abnormal infant behavior: based on the optimal trained models obtained in step 5.2, set different weights according to their different accuracies and make a weighted judgment, specifically comprising the following steps:

6.1 For the SVM model trained on the wavelet approximation waveforms obtained in step 4.1, set different weight coefficients according to the different accuracies for the limbs and whole body, specifically: left upper limb A1: 0.35; right upper limb A2: 0.01; left lower limb A3: 0.2; right lower limb A4: 0.35; whole body A5: 0.09; the judgment result vectors of the four limbs and whole body are denoted Y1 to Y5 respectively and are calculated as follows:

Y1 = (test label + predict label) / 2

The five result vectors are weighted as follows:

Y = 0.35*Y1 + 0.01*Y2 + 0.2*Y3 + 0.35*Y4 + 0.09*Y5

where: * denotes multiplication, and Y is the judgment value predicted from the wavelet details; the judgment criterion is: if -1 < Y < -0.3, the infant's behavior is judged abnormal; if 0.3 < Y < 1, the infant's behavior is judged normal; all other values are treated as a judgment error.

6.2 For the SVM model trained on the wavelet power spectra obtained in step 4.2, set different weight coefficients according to the different accuracies for the limbs and whole body, specifically: left upper limb P1: 0.35; right upper limb P2: 0.01; left lower limb P3: 0.35; right lower limb P4: 0.2; whole body P5: 0.09; the judgment result vectors of the four limbs and whole body are denoted X1 to X5 respectively and are calculated as follows:

X1 = (test label + predict label) / 2

where: test label is the actual label of the test sample; predict label is the label predicted for the test sample; X2 to X5 are calculated in the same way.

The five result vectors are weighted as follows:

X = 0.35*X1 + 0.01*X2 + 0.35*X3 + 0.2*X4 + 0.09*X5

where: * denotes multiplication, and X is the judgment value predicted from the wavelet power spectrum; the judgment criterion is: if -1 < X < -0.3, the infant's behavior is judged abnormal; if 0.3 < X < 1, the infant's behavior is judged normal; all other values are treated as a judgment error.

X and Y are then judged together, following the flow chart in FIG. 7: if a test sample satisfies at least one of the X and Y conditions, the judgment result is considered correct, and whether the infant's behavior is normal can be distinguished.
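The weighted judgment of step 6 can be sketched in Python as follows. The weights and thresholds are those given above; the function names are illustrative. One assumption is labeled in the code: the stated ranges are strict, so a score of exactly 1 or -1 (all five body parts predicted correctly) would fall outside them as written, and this sketch treats those end points as included.

```python
# Weights in the order: left upper limb, right upper limb,
# left lower limb, right lower limb, whole body.
WEIGHTS_APPROX = (0.35, 0.01, 0.2, 0.35, 0.09)  # A1..A5, gives Y
WEIGHTS_POWER = (0.35, 0.01, 0.35, 0.2, 0.09)   # P1..P5, gives X
EPS = 1e-9  # tolerance for floating-point rounding at +/-1


def weighted_score(test_labels, predict_labels, weights):
    """Sum of w_i * (test label + predict label) / 2 over the five parts."""
    return sum(
        w * (t + p) / 2
        for w, t, p in zip(weights, test_labels, predict_labels)
    )


def judge(score):
    """Map a score to 'abnormal', 'normal', or 'error'.

    Assumption: the end points -1 and 1 are accepted, although the
    stated ranges are strict.
    """
    if -1 - EPS <= score < -0.3:
        return "abnormal"
    if 0.3 < score <= 1 + EPS:
        return "normal"
    return "error"


def combined_judgment(y_score, x_score):
    """Accept the result if at least one of Y and X gives a verdict."""
    for verdict in (judge(y_score), judge(x_score)):
        if verdict != "error":
            return verdict
    return "error"


# Abnormal sample (label -1) whose left upper limb and right lower limb
# were mispredicted as normal (toy example).
test_labels = [-1, -1, -1, -1, -1]
predicted = [1, -1, -1, 1, -1]
y = weighted_score(test_labels, predicted, WEIGHTS_APPROX)
x = weighted_score(test_labels, predicted, WEIGHTS_POWER)
print(y, x, combined_judgment(y, x))
```

In this toy case Y sits at the -0.3 boundary while X is about -0.45, so the power spectrum model still yields a verdict, which illustrates why the two scores are judged together.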

The generative model design and the discriminative model design in the invention use the following operation formula:

F(Z) = σ(W * Z + b)

The specific calculation of the wavelet approximation waveform and the wavelet power spectrum in the invention comprises the following steps:

Step B1, analyze the tracked waveform with the Haar wavelet: construct a five-level pyramid according to the Mallat pyramid decomposition algorithm of the discrete wavelet transform and extract the fifth-level wavelet approximation waveform (as shown in FIG. 5) for each of the four limbs and the whole body, denoted respectively: abnormal left upper limb: A01; abnormal right upper limb: A02; abnormal left lower limb: A03; abnormal right lower limb: A04; abnormal whole body: A05; normal left upper limb: A11; normal right upper limb: A12; normal left lower limb: A13; normal right lower limb: A14; normal whole body: A15.
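Step B1 can be sketched as a repeated Haar low-pass step, which is what the Mallat pyramid computes on the approximation branch. The Python below is a minimal sketch: the function name, the orthonormal 1/sqrt(2) scaling, and the handling of odd lengths by repeating the last sample are assumptions, and wavelet libraries may use other boundary rules and so return slightly different lengths or edge values.

```python
import math


def haar_approximation(signal, levels=5):
    """Return the level-`levels` Haar approximation coefficients.

    Each level replaces neighbouring pairs (a, b) with (a + b) / sqrt(2),
    halving the length; only the approximation branch is kept.
    """
    approx = [float(value) for value in signal]
    for _ in range(levels):
        if len(approx) % 2 == 1:
            # Assumption: pad odd-length signals by repeating the last sample.
            approx.append(approx[-1])
        approx = [
            (approx[i] + approx[i + 1]) / math.sqrt(2)
            for i in range(0, len(approx), 2)
        ]
    return approx


# Toy y-axis trajectory with 375 samples, one per frame (synthetic data).
trajectory = [100 + 20 * math.sin(2 * math.pi * t / 75) for t in range(375)]
level5 = haar_approximation(trajectory)
print(len(level5))
```

With 375 input samples and this padding rule, the lengths run 188, 94, 47, 24, 12, so each limb or whole-body trajectory is reduced to a 12-value approximation that can serve as a feature vector.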

Step B2, for the y-axis coordinate curves of the limbs and whole body, compute a wavelet-based power spectrum, with the sampling length set to the total video frame count of 375, the sampling frequency set to 1000, and the sampling interval set to 1/1000; the resulting power spectra (as shown in FIG. 6) are denoted respectively: abnormal left upper limb: P01; abnormal right upper limb: P02; abnormal left lower limb: P03; abnormal right lower limb: P04; abnormal whole body: P05; normal left upper limb: P11; normal right upper limb: P12; normal left lower limb: P13; normal right lower limb: P14; normal whole body: P15.
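The patent does not spell out how the wavelet-based power spectrum is computed. To show only how the stated sampling parameters (length 375, sampling frequency 1000) fix the frequency axis, the Python sketch below uses a plain DFT periodogram, which is a substitute technique and not the wavelet-based spectrum itself. The function name, the mean removal, and the synthetic test signal are assumptions.

```python
import cmath
import math


def periodogram(samples, fs=1000.0):
    """Plain DFT periodogram (not wavelet-based; illustration only).

    Returns (freqs, power) for bins 0 .. N/2, where bin k corresponds
    to the frequency k * fs / N.
    """
    n = len(samples)
    mean = sum(samples) / n
    centered = [s - mean for s in samples]  # drop the constant offset
    freqs, power = [], []
    for k in range(n // 2 + 1):
        coeff = sum(
            value * cmath.exp(-2j * math.pi * k * t / n)
            for t, value in enumerate(centered)
        )
        freqs.append(k * fs / n)
        power.append(abs(coeff) ** 2 / n)
    return freqs, power


# Synthetic 375-sample trajectory with exactly 10 cycles (toy data).
N = 375
trajectory = [50 + math.sin(2 * math.pi * 10 * t / N) for t in range(N)]
freqs, power = periodogram(trajectory)
peak = max(range(len(power)), key=power.__getitem__)
print(peak, freqs[peak])
```

With these parameters the bin spacing is 1000 / 375, about 2.67 units of frequency, so the 10-cycle toy signal peaks in bin 10.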

Claims

1. A method for detecting abnormal infant behavior based on a conditional generative adversarial network and SVM, characterized in that a training sample library required for target tracking is built in advance, the conditional generative adversarial network is used to track the infant's limbs and whole body, the training sample library includes the labeled limbs and whole body of the infant, motion trajectory information is extracted by wavelet approximation waveform and wavelet power spectrum analysis, and a support vector machine (SVM) is then used to classify its features, the method including the following steps:

1.1 Obtain baby videos and perform unified preprocessing;

1.2 Clip the infant video of step 1.1 to a 15 s segment, name the clips uniformly, and also name the images converted into frames uniformly;

1.3 Infant motion trajectory tracking: for the frame images obtained in step 1.2, the conditional generative adversarial network (CGAN) is used to track the motion trajectories of the infant's limbs and whole body, including the following steps:

1.3.1 Construct the training sample library required for target tracking: label the left hand, right hand, left leg, right leg, and whole body of the infant in the frame images obtained in step 1.2; the labeled limbs and whole body form the training sample library, which is used as the target data set input to the CGAN, and the corresponding labels are used as condition Y;

1.3.2 Generative model design: randomly segment each frame image containing the infant to form a pseudo-target data set, and then input it into the discriminative model together with condition Y through the convolutional layers;

1.3.3 Discriminant model design: The target data set and condition Y are sent to the discriminant model to judge the limbs and the whole body, and then the pseudo-target data set is sent to the discriminator to determine whether it is a target;

1.3.4 Determine whether it is the target, calculate the error, and make the error conform to the formula. The specific formula is as follows:

Optimize D:

$$\max_D V(D, G) = \mathbb{E}_{x \sim p_{\text{data}}(x)}[\log D(x)] + \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z)))]$$

Optimize G:

$$\min_G V(D, G) = \mathbb{E}_{z \sim p_z(z)}[\log(1 - D(G(z)))]$$

where: V(D, G) represents the loss function; p_data(x) is the real sample distribution; p_z(z) is the pseudo-sample distribution; D(x) represents the discriminator's output for real sample data; D(G(z)) represents the discriminator's output for pseudo-sample data; E represents the expectation;

The model parameters are adjusted according to the optimization conditions, wherein the parameters of the generative model G and the discriminant model D are shared;

1.3.5 If the error is too large, feed it back to the input of the generative model, reconstruct the pseudo-target data set, and judge again until the positions of the infant's limbs and whole body in the pseudo-target data set are found, and record the positions and motion trajectories of the infant's left hand, right hand, left leg, right leg, and whole body in each frame;

1.4 Analyze the motion trajectory information: for the motion trajectories of the infant's limbs and whole body tracked in step 1.3, save the continuously changing y-axis coordinate positions during movement, and compute the wavelet approximation waveform and wavelet power spectrum of the continuous waveform formed by the y-axis coordinate positions, including the following steps:

1.4.1 Since the change of the x-axis coordinate is not obvious, only the y-axis coordinate change graph is selected for analysis. First, wavelet approximation waveform analysis is performed: the Haar wavelet is used to analyze the tracked waveform, and the wavelet approximation waveform is obtained;

1.4.2 For the y-axis coordinate change diagram of the limbs and the whole body, use the wavelet-based power spectrum diagram to obtain the power spectrum information;

1.5 For the obtained wavelet approximation waveform and wavelet power spectrum, extract the feature vector and use the support vector machine SVM for training and learning, which specifically includes the following steps:

1.5.1 Divide the samples into normal and abnormal samples for labeling, set the label of normal samples as 1, and set the label of abnormal samples as -1;

1.5.2 Divide the samples into training sets and test sets, normalize the data, and obtain the highest accuracy by adjusting the values of the parameters c and g in the SVM, thereby obtaining the best training model;

1.6 Comprehensive judgment of abnormal infant behavior: According to the best training model obtained in step 1.5.2, different weights are set for different accuracies, and weighted judgment is carried out, which includes the following steps:

1.6.1 For the SVM models trained on the wavelet approximation waveforms obtained in step 1.4.1, set weight coefficients according to the differing accuracies for the limbs and the whole body, specifically: left upper limb A1: 0.35; right upper limb A2: 0.01; left lower limb A3: 0.2; right lower limb A4: 0.35; whole body A5: 0.09. Y1 to Y5 represent the judgment result vectors of the four limbs and the whole body, respectively. The calculation formula is as follows:

Y1=(test label+predict label)/2

Where: test label is the actual label of the test sample; predict label is the predicted label of the test sample; Y2 to Y5 are calculated in the same way;

The five result vectors obtained are weighted as follows:

Y=0.35*Y1+0.01*Y2+0.2*Y3+0.35*Y4+0.09*Y5

Where: * represents multiplication, and Y is the judgment value predicted from the wavelet details. The judgment standard is as follows: if -1<Y<-0.3, the baby's behavior is judged to be abnormal; if 0.3<Y<1, the baby's behavior is judged to be normal; all other values are regarded as a judgment error;

1.6.2 For the SVM models trained on the wavelet power spectra obtained in step 1.4.2, set weight coefficients according to the differing accuracies for the limbs and the whole body, specifically: left upper limb P1: 0.35; right upper limb P2: 0.01; left lower limb P3: 0.35; right lower limb P4: 0.2; whole body P5: 0.09. X1 to X5 represent the judgment result vectors of the four limbs and the whole body, respectively. The calculation formula is as follows:

X1=(test label+predict label)/2

Where: test label is the actual label of the test sample; predict label is the predicted label of the test sample; X2 to X5 are calculated in the same way;

The five result vectors obtained are weighted as follows:

X=0.35*X1+0.01*X2+0.35*X3+0.2*X4+0.09*X5

Where: * represents multiplication, and X is the judgment value predicted from the wavelet power spectrum. The judgment standard is as follows: if -1<X<-0.3, the baby's behavior is judged to be abnormal; if 0.3<X<1, the baby's behavior is judged to be normal; all other values are regarded as a judgment error;

Comprehensive judgment is made on X and Y. If the test sample meets at least one of the conditions of X and Y, the judgment result is considered to be correct, and it is possible to distinguish whether the baby's behavior is normal.
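The weighted judgment in steps 1.6.1 and 1.6.2 can be sketched for a single test sample as follows. The function names are hypothetical. One assumption departs from the literal text: when all five predictions agree with the test label, the score equals exactly 1 or -1, which the strict inequalities above would exclude, so this sketch checks only the ±0.3 thresholds and treats ±1 as in range.

```python
# Weights for left upper, right upper, left lower, right lower limb, whole body.
WEIGHTS_APPROX = [0.35, 0.01, 0.2, 0.35, 0.09]  # A1..A5, step 1.6.1
WEIGHTS_POWER = [0.35, 0.01, 0.35, 0.2, 0.09]   # P1..P5, step 1.6.2

def weighted_score(test_label, predicted_labels, weights):
    """Sum of w_i * (test label + predict label_i) / 2 for one test sample.

    Labels are 1 (normal) or -1 (abnormal), so each term is either
    w_i * test_label (prediction correct) or 0 (prediction wrong).
    """
    return sum(w * (test_label + p) / 2
               for w, p in zip(weights, predicted_labels))

def judge(score, threshold=0.3):
    """Map a weighted score to a verdict using the 0.3 thresholds."""
    if score < -threshold:
        return "abnormal"
    if score > threshold:
        return "normal"
    return "judgment error"

def combined_judgment(test_label, pred_approx, pred_power):
    """Accept the verdict if at least one of Y and X is decisive."""
    y = judge(weighted_score(test_label, pred_approx, WEIGHTS_APPROX))
    x = judge(weighted_score(test_label, pred_power, WEIGHTS_POWER))
    for verdict in (y, x):
        if verdict != "judgment error":
            return verdict
    return "judgment error"
```

Because each term is either zero or carries the sign of the test label, Y and X can never point in opposite directions; the rule therefore acts as an accuracy check on labeled test samples rather than as a predictor for unlabeled video.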

2. The method for detecting abnormal infant behavior based on conditional generative adversarial network and SVM according to claim 1, characterized in that the generative model design and discriminant model design described in step 1.3.2 and step 1.3.3 specifically include the following steps:

2.1 Generative model design: 6 convolutional layers are set, with the stride set to 1; 6 pooling layers are set, with a pooling window size of 2*2; the ReLU (Rectified Linear Unit) activation function is applied in the network. The specific operation formula is as follows:

F(Z)=σ(W*Z+b)

Where: W is the convolution kernel; * is the convolution operation; Z is the feature vector; b is the offset; σ is the ReLU activation function;

2.2 Discriminant model design: 5 convolutional layers are set, with the stride set to 1; the ReLU (Rectified Linear Unit) activation function is applied;

The described generative model design and discriminant model design include the following steps:

Step A1, generative model design: 6 convolutional layers are set, with the step size set to 1; 6 pooling layers are set, with a pooling window size of 2*2; the ReLU (Rectified Linear Unit) activation function is applied, and the specific operation formula is as follows:

F(Z)=σ(W*Z+b)

Step A2, discriminant model design: 5 convolutional layers are set, with the step size set to 1; the ReLU (Rectified Linear Unit) activation function is applied.
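A single layer of the form F(Z)=σ(W*Z+b) followed by a 2*2 pooling window can be sketched in plain Python. Several details here are assumptions because the text does not state them: no padding is used, the pooling is max pooling with a stride of 2, and, as is common in CNN implementations, the kernel is applied without flipping (cross-correlation). The function names are hypothetical.

```python
def conv2d_relu(z, w, b):
    """Stride-1 convolution of feature map z with kernel w, plus bias b, then ReLU.

    z and w are lists of rows; no padding is applied (assumption), so the
    output shrinks by (kernel size - 1) in each dimension.
    """
    kh, kw = len(w), len(w[0])
    out = []
    for i in range(len(z) - kh + 1):
        row = []
        for j in range(len(z[0]) - kw + 1):
            s = b
            for di in range(kh):
                for dj in range(kw):
                    s += w[di][dj] * z[i + di][j + dj]
            row.append(max(0.0, s))  # ReLU: sigma(v) = max(0, v)
        out.append(row)
    return out

def max_pool_2x2(f):
    """2*2 max pooling with stride 2 (pooling type and stride are assumptions)."""
    return [
        [max(f[i][j], f[i][j + 1], f[i + 1][j], f[i + 1][j + 1])
         for j in range(0, len(f[0]) - 1, 2)]
        for i in range(0, len(f) - 1, 2)
    ]
```

Stacking six such convolution and pooling pairs would correspond to the generative model layout described above, although the kernel sizes and channel counts are not given in the text.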

3. The method for detecting abnormal infant behavior based on conditional generative adversarial network and SVM according to claim 1, characterized in that the specific calculation of the wavelet approximation waveform and the wavelet power spectrum described in step 1.4 comprises the following steps:

3.1 Use the Haar wavelet to analyze the tracked waveforms, build a five-layer pyramid according to the Mallat pyramid decomposition algorithm of the discrete wavelet transform, and extract the fifth-layer wavelet approximation signal for each limb and the whole body, recorded respectively as: abnormal left upper limb: A01; abnormal right upper limb: A02; abnormal left lower limb: A03; abnormal right lower limb: A04; abnormal whole body: A05; normal left upper limb: A11; normal right upper limb: A12; normal left lower limb: A13; normal right lower limb: A14; normal whole body: A15;

3.2 For the y-axis coordinate change graphs of the limbs and the whole body, compute the wavelet-based power spectrum, where the sampling length is set to the total video frame length of 375, the sampling frequency is 1000, and the sampling interval is 1/1000. The obtained power spectra are recorded respectively as: abnormal left upper limb: P01; abnormal right upper limb: P02; abnormal left lower limb: P03; abnormal right lower limb: P04; abnormal whole body: P05; normal left upper limb: P11; normal right upper limb: P12; normal left lower limb: P13; normal right lower limb: P14; normal whole body: P15;

The specific calculation of the wavelet approximation waveform diagram and the wavelet power spectrum diagram includes the following steps:

Step B1, use the Haar wavelet to analyze the tracked waveforms, build a five-layer pyramid according to the Mallat pyramid decomposition algorithm of the discrete wavelet transform, and extract the fifth-layer wavelet approximation waveform for each limb and the whole body, recorded respectively as: abnormal left upper limb: A01; abnormal right upper limb: A02; abnormal left lower limb: A03; abnormal right lower limb: A04; abnormal whole body: A05; normal left upper limb: A11; normal right upper limb: A12; normal left lower limb: A13; normal right lower limb: A14; normal whole body: A15;

Step B2, for the y-axis coordinate change graphs of the limbs and the whole body, compute the wavelet-based power spectrum, where the sampling length is set to the total video frame length of 375, the sampling frequency is 1000, and the sampling interval is 1/1000. The obtained power spectra are recorded respectively as: abnormal left upper limb: P01; abnormal right upper limb: P02; abnormal left lower limb: P03; abnormal right lower limb: P04; abnormal whole body: P05; normal left upper limb: P11; normal right upper limb: P12; normal left lower limb: P13; normal right lower limb: P14; normal whole body: P15.
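The text does not say how the wavelet-based power spectrum is computed. One possible reading, shown here purely as an assumption, is the mean squared Haar detail coefficient at each decomposition level, with each level mapped to a nominal frequency band derived from the stated sampling frequency of 1000. The function names and the odd-length padding rule are hypothetical.

```python
import math

FS = 1000.0  # sampling frequency stated in the text

def haar_details(signal, levels=5):
    """Return the Haar detail coefficients of each level (finest first)."""
    approx = list(signal)
    details = []
    s = math.sqrt(2.0)
    for _ in range(levels):
        if len(approx) % 2:
            approx.append(approx[-1])  # assumed padding for odd lengths
        pairs = [(approx[i], approx[i + 1]) for i in range(0, len(approx), 2)]
        details.append([(a - b) / s for a, b in pairs])
        approx = [(a + b) / s for a, b in pairs]
    return details

def wavelet_band_power(signal, levels=5, fs=FS):
    """Return a list of (f_low, f_high, power) tuples, one per level.

    power is the mean squared detail coefficient of that level; level j
    nominally covers the band from fs / 2**(j + 1) to fs / 2**j.
    """
    bands = []
    for j, d in enumerate(haar_details(signal, levels), start=1):
        power = sum(c * c for c in d) / len(d)
        bands.append((fs / 2 ** (j + 1), fs / 2 ** j, power))
    return bands
```

Under this reading, a trajectory that alternates from frame to frame concentrates its power in the finest band, while a constant trajectory has zero power in every band. Other definitions, such as a scalogram from a continuous wavelet transform, would give different feature vectors.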