CN107992937B

CN107992937B - Unstructured data judgment method and device based on deep learning

Info

Publication number: CN107992937B
Application number: CN201610950494.9A
Authority: CN
Inventors: 朱跃生; 罗桂波
Original assignee: Peking University Shenzhen Graduate School
Current assignee: Peking University Shenzhen Graduate School
Priority date: 2016-10-26
Filing date: 2016-10-26
Publication date: 2021-12-03
Anticipated expiration: 2036-10-26
Also published as: CN107992937A

Abstract

The present invention relates to an unstructured data judgment method based on deep learning, which specifically includes: obtaining a trained deep learning neural network model, wherein the trained deep learning neural network model is the result of training with multi-factor training sample data. A neural network model for multi-factor joint judgment; receive clue data collected online in real time, the clue data includes unstructured data of multiple data types; combine the acquired clue data through a trained deep learning neural network model Analyze and extract characteristic clue information that is beneficial to judgment; make judgment on the clue data according to the characteristic clue information, and generate a judgment result; and feed back the judgment result. The above method can perform more efficient and timely decision analysis on unstructured data, and realize online real-time analysis.

Description

Unstructured data judgment method and device based on deep learning

Technical Field

The invention relates to the technical field of artificial intelligence, in particular to an unstructured data judgment method and device based on deep learning.

Background

Data that is not conveniently represented in a database two-dimensional logical table is referred to as unstructured data, as opposed to structured data (i.e., row data, stored in a database, that can be logically represented in a two-dimensional table structure), which includes office documents of all formats, text, pictures, subsets XML in a standard universal markup language, HTML, various types of reports, images, and audio/video information, among others. The unstructured data has the characteristics of large data volume, fast change, multiple types, rich and complex content and non-uniform structure.

In the traditional technology, due to the characteristics of rapid change, non-uniform structure and the like of unstructured data, analysis time consumption is long when unstructured data are analyzed, off-line analysis is generally performed on historical data, and analysis decision efficiency is low.

Disclosure of Invention

In view of the foregoing, there is a need to provide a method and an apparatus for unstructured data decision based on deep learning, which can perform more efficient and timely online decision analysis on unstructured data.

An unstructured data judgment method based on deep learning, the method comprising:

acquiring a trained deep learning neural network model, wherein the trained deep learning neural network model is a multi-factor joint decision neural network model trained by a multi-factor training sample;

receiving clue data acquired online in real time, wherein the clue data is unstructured data comprising a plurality of data types;

performing joint analysis on the received clue data through the trained deep learning neural network model, and extracting characteristic clue information beneficial to judgment;

judging the clue data according to the characteristic clue information to generate a judgment result;

and feeding back the judgment result.

In one embodiment, the step of performing joint analysis on the received clue data through the trained deep learning neural network model to extract characteristic clue information beneficial to decision includes:

performing feature extraction on the received clue data through the trained deep learning neural network model to obtain a high-dimensional feature vector;

and converting the high-dimensional feature vector into a hash code with each dimension incidence relation by adopting a hash retrieval algorithm, and extracting feature clue information favorable for judgment according to the hash code.

In one embodiment, before the step of obtaining a trained deep learning neural network model, where the trained deep learning neural network model is a multi-factor joint decision neural network model trained by multi-factor training samples, the method further includes:

receiving a training sample uploaded by a terminal, wherein the training sample data is a multi-factor training sample;

under an off-line state, a deep learning neural network sub-model is constructed for the training sample corresponding to each factor in the multi-factor training samples by utilizing a deep learning algorithm;

acquiring an incidence relation between the built deep learning neural network submodels;

and fusing the deep learning neural network submodels according to the incidence relation to generate a deep learning neural network model capable of performing multi-factor joint decision.

In one embodiment, the multi-factor training samples include video data, image data, audio data, text data, and network data.

In one embodiment, when the clue data received within a set time contains a data type, the clue data is analyzed and judged by using the deep learning neural network submodel of the factor corresponding to the data type to obtain a judgment result.

An unstructured data decision device based on deep learning, the device comprising:

the neural network model acquisition module is used for acquiring a trained deep learning neural network model, wherein the trained deep learning neural network model is a multi-factor joint decision neural network model trained by a multi-factor training sample;

the real-time data receiving module is used for receiving clue data acquired online in real time, wherein the clue data is unstructured data comprising a plurality of data types;

the characteristic information extraction module is used for carrying out joint analysis on the received clue data through the trained deep learning neural network model and extracting characteristic clue information beneficial to judgment;

the joint judgment module is used for judging the clue data according to the characteristic clue information to generate a judgment result;

and the judgment result feedback module is used for feeding back the judgment result.

In one embodiment, the feature information extraction module is further configured to perform feature extraction on the received clue data through the trained deep learning neural network model to obtain a high-dimensional feature vector; and converting the high-dimensional feature vector into a hash code with each dimension incidence relation by adopting a hash retrieval algorithm, and extracting feature clue information favorable for judgment according to the hash code.

In one embodiment, the apparatus further comprises:

the training sample data receiving module is used for receiving a training sample uploaded by a terminal, wherein the training sample is a multi-factor training sample;

the sub-model training module is used for constructing a deep learning neural network sub-model for the training sample corresponding to each factor in the multi-factor training samples by utilizing a deep learning algorithm in an off-line state;

the incidence relation analysis module is used for acquiring the incidence relation between the built deep learning neural network submodels;

and the combined decision model building module is used for fusing the deep learning neural network submodel according to the incidence relation to generate a deep learning neural network model capable of carrying out multi-factor combined decision.

In one embodiment, the apparatus further comprises: and the submodel judging module is used for analyzing and judging the clue data by using the deep learning neural network submodel of the factor corresponding to the data type to obtain a judgment result when the clue data received within the set time contains one data type.

According to the unstructured data judgment method and device based on deep learning, the multi-factor training sample is used for training the deep learning neural network model to obtain the neural network model capable of performing multi-factor joint judgment, the trained neural network model is used for performing joint analysis on real-time unstructured data with various data types, and characteristic clue information beneficial to data judgment is extracted so as to screen analysis data, and then data analysis judgment is performed more efficiently and timely to obtain a judgment result. Namely, the correlation among various types of real-time data is analyzed, and then a faster and more intelligent judgment scheme is confirmed, so that the online real-time judgment of the application data with large data volume and fast change is realized.

Drawings

FIG. 1 is a diagram of an application environment of an unstructured data decision method based on deep learning in one embodiment;

FIG. 2 is a flow diagram of a method for deep learning based unstructured data determination in one embodiment;

FIG. 3 is a flow diagram of training a deep learning neural network model in one embodiment;

FIG. 4 is a block diagram of an unstructured data decision method based on deep learning in one embodiment;

FIG. 5 is a block diagram of the architecture involved in training a deep learning neural network model in one embodiment.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

As shown in fig. 1, in one embodiment, an application environment diagram of an unstructured data decision method and apparatus based on deep learning is provided, the application environment includes a data acquisition device 110 and a server 120, wherein the data acquisition terminal 110 can communicate with the server 120 through a network. The data acquisition equipment can comprise a camera terminal, a network terminal, a data storage medium and the like, wherein the camera terminal can capture unstructured data such as video data, image data, audio data and the like; the network terminal can acquire unstructured data such as network data, text data, image data, video data and the like; the data storage medium stores unstructured data such as text data, image data, video data, and audio data collected through other offline channels in advance. The unstructured data including multiple data types acquired by the data acquisition equipment 110 can be received on line in real time through the network connection server 120, and the trained deep learning neural network model capable of performing multi-factor judgment is called to perform rapid judgment on the unstructured data of the multiple data types in real time to obtain a judgment result.

In an embodiment, as shown in fig. 2, an unstructured data decision method based on deep learning is provided, which is described by taking an application in the server 120 shown in fig. 1 as an example, and specifically includes the following steps:

step S202: and acquiring a trained deep learning neural network model, wherein the trained deep learning neural network model is a multi-factor joint decision neural network model trained by a multi-factor training sample.

In particular, deep learning refers to a multi-layer neural network, which may form more abstract high-level representation attribute classes or features by combining low-level features to discover a distributed feature representation of the data. The deep neural network is a process for extracting features layer by layer, a computer automatically extracts the features from data without human intervention, and the essential idea is to stack a plurality of neuron layers, each layer extracts certain features and information, and the output of the layer is used as the input of the next layer. In this way, hierarchical representation of the input information can be achieved. To discover a distributed characterization of the data. Taking image recognition as an example, the first layer extracts boundary information, the second layer extracts boundary contour information, then the contours can be combined into subsections, the subsections are combined into objects, thus the features are extracted layer by layer, a more abstract high-level representation attribute category or feature is formed by combining the features of the lower layer, and which kind of object is in the picture is judged through different combinations of the features or the attributes.

The deep learning neural network model has various types, such as a convolutional neural network model, a denoising autoencoder, a Restricted Boltzmann Machine (RBM) network model, and the like. When the deep learning neural network model is trained, any one of the network models can be selected for training.

In this embodiment, the deep learning neural network model is trained by using a multi-factor training sample, the features of each factor and the association relationship between the features of each factor are learned, and a multi-factor joint decision deep learning neural network model is constructed. The multi-factor here may include any of a video data factor, an image data factor, an audio data factor, a text data factor, and a network data factor.

Step S204: clue data acquired by a line in real time is received, and the clue data is unstructured data comprising a plurality of data types.

The server receives unstructured data acquired by each data acquisition terminal in real time by arranging a plurality of interfaces, wherein the unstructured data received by the server comprise a plurality of data types, specifically, real-time video data, real-time image data, real-time audio data, real-time text data, real-time network data and the like.

Step S206: and performing joint analysis on the received clue data through the trained deep learning neural network model, and extracting characteristic clue information favorable for judgment.

Step S208: and judging the cable data according to the characteristic cable information to generate a judgment result.

Specifically, the server receives unstructured data with various data types in real time, such as video data and network data in real time, the trained deep learning neural network model performs feature extraction and analysis on the video data and the network data received in real time, finds out feature clue information beneficial to decision making, and screens and filters massive video data and/or network data of a real-time structure based on the found feature clue information so as to make a decision more quickly and timely.

Step S210: and feeding back a judgment result.

The feedback of the decision structure can be in the form of triggering alarm or directly pushing the feedback structure to a designated display terminal for display.

For example, the purpose of analyzing unstructured data is to: the method for searching the target object from the video data specifically comprises the following steps:

the method comprises the following steps: continuous video data sent by the camera terminal is received in real time, and data of other data types are received, wherein the other data types can be audio data, text data and the like.

Step two: the method comprises the steps of using a trained deep learning neural network model to carry out feature extraction and analysis on clue data received by video data, text data, audio data and the like, and extracting feature clue information beneficial to retrieving a target from the video data, wherein the feature clue information can be position information of the retrieved target and sound information of the retrieved target, and the deep learning neural network model utilizes the extracted feature clue information beneficial to judging to screen the video data, such as searching the video data near the position information of the target and excluding the video data which do not meet position conditions, so that the data quantity needing judgment and analysis is greatly reduced, the data judgment is quicker, and online real-time judgment can be further realized. For another example, when the audio data is searched for the sound information of the search target, the association relationship between the video data and the audio data in the deep learning neural network model is called to locate the video data segment corresponding to the target sound information, and then the target object of the video is quickly searched.

It should be noted that the deep learning decision method in this embodiment is not limited to the above-mentioned decision task of retrieving the target object from the video data, but may also perform real-time fast decision of classification, identification, tracking, and the like of other application data with large data volume and fast change.

In this embodiment, a multi-factor training sample is used to train a deep learning neural network model to obtain a neural network model capable of performing multi-factor joint decision, the trained neural network model is used to perform joint analysis on real-time unstructured data with multiple data types, and characteristic clue information beneficial to data decision is extracted, so as to screen analysis data, and then more efficiently and timely perform data analysis decision to obtain a decision result. Namely, the correlation among various types of real-time data is analyzed, and then a faster and more intelligent judgment scheme is confirmed, so that the online real-time judgment of the application data with large data volume and fast change is realized.

In one embodiment, step S206: the deep learning neural network model after training is used for carrying out joint analysis on the received clue data, and the extraction of the characteristic clue information favorable for judgment comprises the following steps:

and performing feature extraction on the received clue data through the trained deep learning neural network model to obtain a high-dimensional feature vector.

And converting the high-dimensional feature vector into a hash code with each dimension incidence relation by adopting a hash retrieval algorithm, and extracting the characteristic clue information which is favorable for judgment according to the hash code.

Specifically, the high-dimensional feature vector refers to a feature vector which is generated by performing data analysis on multi-dimensional data (each data type is used as a data dimension) received in real time by using the trained deep learning neural network model and can represent the received unstructured data.

In the embodiment, on the basis of ensuring the proximity relation in the original space of each dimension, the generated high-dimensional feature vector is mapped into the hash code of the binary string (binary code) by adopting the hash retrieval algorithm, so that the storage and communication overhead of data can be obviously reduced, the calculation efficiency and speed are effectively improved, and the real-time analysis and judgment of a large amount of unstructured data are realized.

In one embodiment, as shown in FIG. 3, at step S202: the method comprises the following steps of obtaining a trained deep learning neural network model, wherein before the trained deep learning neural network model is a multi-factor joint decision neural network model trained by a multi-factor training sample, the method also comprises a training step of the deep learning deep neural network model, and specifically comprises the following steps:

step S302: and receiving training samples uploaded by the terminal, wherein the training samples are multi-factor training samples.

Specifically, a multi-factor sample is sample data that includes multiple data types. In one embodiment, the training sample data is multi-factor training sample data including multiple types of data such as video sample data, image sample data, audio sample data, text sample data, and network sample data.

Step S304: and under an off-line state, constructing a deep learning neural network sub-model for training sample data corresponding to each factor in the multi-factor training sample by using a deep learning algorithm.

Specifically, a training sample corresponding to each factor is preprocessed, and features of the training sample corresponding to each factor are respectively extracted by using a deep learning algorithm, for example, moving target features of a video training sample are extracted or target object features of the video training sample are extracted; extracting audio features and the like in the audio data, and respectively establishing a deep learning neural network submodel of training sample data corresponding to each factor, such as a video neural network submodel, an audio neural network submodel, an image neural network submodel, a text neural network submodel and a neural network submodel of network data.

Step S306: and acquiring the association relation between the constructed deep learning neural network submodels.

Step S308: and fusing the deep learning neural network submodels according to the incidence relation to generate a deep learning neural network model capable of performing multi-factor joint decision.

Specifically, the incidence relation among a plurality of built deep learning neural network submodels is mined, the submodels are fused, and then the deep learning neural network model capable of performing multi-factor joint decision is generated.

Specifically, if the fused deep learning neural network model capable of performing multi-factor joint decision is used for retrieving a target object in a video, the essence of mining the association relationship between multiple deep learning neural network submodels is as follows: and establishing an incidence relation between the non-video neural network submodel and the video neural network submodel. The non-video neural network submodel is a submodel constructed based on non-video data and extracted feature information, for example, the image neural network submodel is used for extracting facial features of a target object, and the text neural network submodel is used for extracting position features of the target object. And establishing an incidence relation between the video neural network submodel and the non-video neural network submodel so that the constructed combined decision-making deep learning neural network model can utilize the characteristic information extracted by the non-video neural network submodel to quickly search a target object in the video and improve the analysis and decision efficiency.

In one embodiment, when the clue data collected by the terminal in real time and received within the set time comprises one data type, the clue data is analyzed and judged by using the deep learning neural network submodel of the factor corresponding to the data type.

Specifically, when the real-time clue data received online by the server is only one data type within a period of time, and if the real-time clue data is only video data, the analysis and judgment of the video data are directly performed by using the video deep learning neural network submodel.

In the embodiment, the switching between the joint judgment and the single data form judgment can be carried out according to whether the received data is of multiple data types, and the joint judgment is carried out when the received data is of multiple data types, so that the judgment efficiency is improved; when the data type is a single data type, the single data type is directly analyzed, and data judgment is not stopped because the data type is only the single data type.

In one embodiment, as shown in fig. 4, there is provided an unstructured data decision device based on deep learning, the device comprising:

a neural network model obtaining module 402, configured to obtain a trained deep learning neural network model, where the trained deep learning neural network model is a multi-factor joint decision neural network model trained by a multi-factor training sample.

The real-time data receiving module 404 is configured to receive cue data acquired in real time online, where the cue data is unstructured data including multiple data types.

And the characteristic information extraction module 406 is configured to perform joint analysis on the received clue data through the trained deep learning neural network model, and extract characteristic clue information favorable for judgment.

And the joint decision module 408 is configured to decide the cue data according to the characteristic cue information, and generate a decision result.

And a decision result feedback module 410 for feeding back a decision result.

In one embodiment, the feature information extraction module 406 is further configured to perform feature extraction on the received clue data through the trained deep learning neural network model to obtain a high-dimensional feature vector; and converting the high-dimensional feature vector into a hash code with each dimension incidence relation by adopting a hash retrieval algorithm, and extracting the characteristic clue information which is favorable for judgment according to the hash code.

In one embodiment, as shown in fig. 5, the deep learning decision device further includes:

a training sample data receiving module 502, configured to receive a training sample uploaded by a terminal, where the training sample is a multi-factor training sample.

And the submodel training module 504 is configured to construct a deep learning neural network submodel for the training sample corresponding to each factor in the multi-factor training sample by using a deep learning algorithm in an offline state.

And the incidence relation analysis module 506 is used for acquiring the incidence relation between the built deep learning neural network submodels.

And a joint decision model building module 508, configured to fuse the deep learning neural network submodels according to the association relationship, and generate a deep learning neural network model capable of performing multi-factor joint decision.

In one embodiment, the deep learning decision device further includes: and the submodel judging module is used for analyzing and judging the clue data by using the deep learning neural network submodel of the factor corresponding to the data type to obtain a judgment result when the clue data received in the set time contains one data type.

It will be understood by those skilled in the art that all or part of the processes in the methods of the embodiments described above may be implemented by hardware related to instructions of a computer program, and the program may be stored in a computer readable storage medium, for example, in the storage medium of a computer system, and executed by at least one processor in the computer system, so as to implement the processes of the embodiments including the methods described above. The storage medium may be a magnetic disk, an optical disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), or the like.

The technical features of the embodiments described above may be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the embodiments described above are not described, but should be considered as being within the scope of the present specification as long as there is no contradiction between the combinations of the technical features.

The above-mentioned embodiments only express several embodiments of the present invention, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims

1. A deep learning-based unstructured data decision method, the method comprising:

Obtain a trained deep learning neural network model, wherein the trained deep learning neural network model is a multi-factor joint judgment neural network model trained by multi-factor training samples; Deep learning neural network sub-models corresponding to different factors in relation to each other;

Receive clue data collected online in real time, where the clue data includes video data, text data, and audio data;

Jointly analyze the received clue data through the trained deep learning neural network model, and extract characteristic clue information that is beneficial to retrieving the target from the video data; the characteristic clue information includes the position information of the retrieval target or Retrieve the sound information of the target;

Locate the target video data segment in the video data according to the feature clue information, judge the target video data segment, and generate a decision result; the decision result is the target object in the video data;

Feedback the judgment result.

2 . The method according to claim 1 , wherein the received clue data is analyzed jointly by the deep learning neural network model after training, and extraction is conducive to retrieving the target from the video data. 3 . The steps of characteristic clue information include:

Perform feature extraction on the received clue data through the trained deep learning neural network model to obtain a high-dimensional feature vector;

A hash retrieval algorithm is used to convert the high-dimensional feature vector into a hash code with correlations of various dimensions, and extract feature clue information that is beneficial for retrieving the target from the video data according to the hash code.

3. The method according to claim 1, wherein the deep learning neural network model after the training is obtained, wherein the deep learning neural network model after the training is a multi-factor trained by a multi-factor training sample Before the steps of jointly deciding the neural network model, it also includes:

In an offline state, using a deep learning algorithm to construct a deep learning neural network sub-model for the training samples corresponding to each factor in the multi-factor training samples;

Obtain the association relationship between the constructed deep learning neural network sub-models;

According to the association relationship, the deep learning neural network sub-model is fused to generate a deep learning neural network model capable of multi-factor joint decision-making.

4. The method according to claim 3, wherein the multi-factor training samples comprise video data, image data, audio data, text data and network data.

5 . The method according to claim 2 , wherein when the clue data received within a set time includes a data type, a deep learning neural network of the factor corresponding to the data type is used. 6 . The sub-model analyzes and judges the clue data to obtain a judgment result.

6. An unstructured data judgment device based on deep learning, wherein the device comprises:

A neural network model acquisition module, used for acquiring a trained deep learning neural network model, wherein the trained deep learning neural network model is a multi-factor joint judgment neural network model trained by multi-factor training samples; the multi-factor The joint decision neural network model integrates the deep learning neural network sub-models with correlation and corresponding to different factors;

A real-time data receiving module, configured to receive clue data collected online in real time, where the clue data includes video data, text data, and audio data;

A feature information extraction module, configured to perform a joint analysis on the received clue data through the trained deep learning neural network model, and extract the feature clue information that is conducive to retrieving the target from the video data; the feature clue information Including the location information of the retrieval target or the sound information of the retrieval target;

a joint decision module, configured to locate the target video data segment in the video data according to the feature clue information, judge the target video data segment, and generate a decision result; the decision result is the target video data in the video data. object;

The decision result feedback module is used for feeding back the decision result.

7 . The device according to claim 6 , wherein the feature information extraction module is further configured to perform feature extraction on the received clue data through the trained deep learning neural network model to obtain high-dimensional feature vector; using a hash retrieval algorithm to convert the high-dimensional feature vector into a hash code with the relationship of each dimension, and according to the hash code to help retrieve the target feature clue information from the video data extraction.

8. The apparatus of claim 6, wherein the apparatus further comprises:

a training sample data receiving module, configured to receive a training sample uploaded by a terminal, wherein the training sample is a multi-factor training sample;

a sub-model training module for constructing a deep learning neural network sub-model for the training samples corresponding to each factor in the multi-factor training samples using a deep learning algorithm in an offline state;

an association relationship analysis module, used to obtain the association relationship between the constructed deep learning neural network sub-models;

The joint decision-making model building module is used to fuse the deep learning neural network sub-model according to the association relationship to generate a deep learning neural network model capable of multi-factor joint decision-making.

9. The apparatus according to claim 8, wherein the multi-factor training samples comprise video data, image data, audio data, text data and network data.

10. The apparatus according to claim 7, characterized in that, the apparatus further comprises: a sub-model decision module, configured to use the The deep learning neural network sub-model of the factor corresponding to the data type analyzes and judges the clue data to obtain a judgment result.

11. A computer-readable storage medium storing a computer program, wherein when the computer program is executed by a processor, the steps of the method according to any one of claims 1 to 5 are implemented.