
CN120145397A - Vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement - Google Patents

Vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement

Info

Publication number
CN120145397A
Authority
CN
China
Prior art keywords
vulnerability
code
node
language model
tested
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202510320737.XA
Other languages
Chinese (zh)
Inventor
苏小红
郑伟宁
陶文鑫
董肇会
魏宏巍
蒋远
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin Institute of Technology Shenzhen
Original Assignee
Harbin Institute of Technology Shenzhen
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Institute of Technology Shenzhen
Priority to CN202510320737.XA
Publication of CN120145397A
Legal status: Pending

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/50Monitoring users, programs or devices to maintain the integrity of platforms, e.g. of processors, firmware or operating systems
    • G06F21/57Certifying or maintaining trusted computer platforms, e.g. secure boots or power-downs, version controls, system software checks, secure updates or assessing vulnerabilities
    • G06F21/577Assessing vulnerabilities and evaluating computer system security
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/042Knowledge-based neural networks; Logical representations of neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/04Inference or reasoning models

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Animal Behavior & Ethology (AREA)
  • Databases & Information Systems (AREA)
  • Stored Programmes (AREA)

Abstract

The invention discloses a vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement, which comprises the following steps: integrating vulnerability databases such as CWE and CVE together with external knowledge sources to construct a vulnerability knowledge base that provides expert-knowledge prompt information for a large language model; preprocessing the code to be tested with a code analysis tool and extracting the vulnerability knowledge most relevant to the code to be tested from the vulnerability knowledge base through semantic matching and code matching; and, based on the relevant vulnerability knowledge obtained in the retrieval enhancement stage, generating detailed vulnerability descriptions and repair suggestions with the large language model and optimizing the generated result through a chain-of-thought technique. By combining the semantic understanding capability of the large language model with retrieval enhancement and reasoning enhancement techniques, the invention can quickly generate detailed and targeted vulnerability descriptions and repair suggestions, significantly improves vulnerability repair efficiency, can be flexibly applied to existing vulnerability detection tools, and is applicable to various programming languages and vulnerability types.

Description

Vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement
Technical Field
The invention relates to a description and repair suggestion generation method for software vulnerabilities, in particular to a vulnerability description and repair suggestion generation method based on large language model (Large Language Models, LLMs) reasoning and retrieval enhancement.
Background
In modern software development, the security of a software system is critical, and vulnerability repair is a key link in guaranteeing system security. Although vulnerability detection techniques have made significant progress in efficiently identifying potential vulnerabilities in software, vulnerability remediation remains a complex and time-consuming process. After obtaining the vulnerability detection result, the developer often needs to spend a great deal of time and effort analyzing the cause of the vulnerability, evaluating its potential impact, and formulating a reasonable repair strategy. This process places high demands on the experience and skill of the developer; for less experienced developers in particular, vulnerability remediation can be a difficult task.
Existing vulnerability detection tools (e.g., static analysis tools and dynamic analysis tools) can provide basic information about the location, type, and severity of vulnerabilities, but these tools often lack in-depth analysis of vulnerability causes, scope of impact, and repair methods. For example, an SQL injection vulnerability may be due to a developer failing to adequately validate and filter user input, while a buffer overflow vulnerability may be due to a lack of adequate boundary checking in the code. The causes of these vulnerabilities are often related to details that the developer overlooked during coding, and it is difficult for the developer to determine how to repair them effectively without extensive analysis and understanding. Therefore, after receiving a vulnerability warning, a developer often needs to spend a great deal of time and effort analyzing the deep cause of the vulnerability and finding an appropriate repair scheme. This process is particularly difficult for less experienced developers. Lacking adequate vulnerability analysis and repair experience, they may not understand the cause of the vulnerability accurately, and may even take erroneous repair measures, so that the vulnerability is not thoroughly resolved or new problems are introduced. For example, some developers may simply fix an SQL injection vulnerability by adding input validation but ignore other potential injection points, so that the vulnerability still exists. Similarly, for a buffer overflow vulnerability, the developer may add a boundary check but fail to properly handle all possible boundary conditions, so that the vulnerability can still be exploited. In addition, the existing vulnerability repair process suffers from low efficiency. Because the repair suggestions provided by vulnerability detection tools tend to be too general or lack pertinence, developers need to spend a great deal of time reviewing related documents, consulting reference cases, or discussing with other developers in order to develop reasonable repair strategies. This inefficient repair process not only prolongs the time window of vulnerability repair and increases the time the system is exposed to security risks, but may also cause delays or errors in the repair process, thereby further increasing the security risks of the system.
In order to solve the above-mentioned problems, researchers have recently been exploring the use of artificial intelligence techniques, particularly natural language processing (NLP) techniques, to assist in generating repair suggestions for vulnerabilities. Large language models, as a deep-learning-based natural language processing technology, have strong semantic understanding and natural language generation capabilities. These models can generate richer and more detailed vulnerability descriptions based on the characteristics, contextual information, and prompt information of a vulnerability, and propose feasible repair suggestions. For example, a large language model can help a developer understand and repair vulnerabilities faster by analyzing the contextual information of a vulnerability, deducing its potential causes, and generating targeted repair suggestions.
However, while large language models perform well in vulnerability description and repair suggestion generation, their performance is still subject to some limitations. First, the results generated by large language models often depend on the quality and coverage of their training data. If certain specific types of vulnerability cases are lacking in the training data, the model may not accurately generate relevant repair suggestions. Second, large language models have limited reasoning capabilities, making it difficult to capture complex logical relationships, especially because the causation and repair logic of vulnerabilities may take different forms in different contexts. Therefore, how to design an inference mechanism so that the model can accurately capture these logical relationships and generate targeted repair suggestions remains an open problem. In addition, retrieval enhancement techniques (Retrieval-Augmented Generation, RAG) have been introduced into the task of vulnerability repair suggestion generation. Retrieval enhancement improves the accuracy and practicality of the generated content by retrieving relevant information from an external knowledge base and combining it with the model's generated result. However, existing retrieval enhancement techniques still face challenges in practical applications. For example, the information extracted from the vulnerability database may be redundant or irrelevant, and how to design efficient retrieval algorithms to ensure that the extracted information is highly relevant to the description and repair suggestions of the current vulnerability remains a key issue.
Disclosure of Invention
In order to solve the problems of low efficiency and insufficient pertinence in generating vulnerability repair suggestions in the prior art, the invention provides a vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement. For the vulnerable code detected by a vulnerability detection tool, the method aims to generate detailed and accurate vulnerability descriptions and repair suggestions by combining the semantic understanding capability of a large language model with retrieval enhancement technology, so as to help developers quickly understand the causes of vulnerabilities and formulate efficient repair strategies. Specifically, the method utilizes the natural language generation capability of a large language model, combines reasoning enhancement and retrieval enhancement techniques, extracts context information related to the vulnerability from a global perspective, and generates targeted repair suggestions. The method can be flexibly applied to existing vulnerability detection tools and is applicable to various programming languages and vulnerability types.
The invention aims at realizing the following technical scheme:
a vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement comprises the following steps:
step 1, constructing a vulnerability knowledge base:
constructing a structured vulnerability knowledge base for providing expert-knowledge prompt information to a large language model, wherein the vulnerability knowledge base comprises vulnerability definitions, vulnerability classifications, semantic descriptions, vulnerability repair suggestions, and related context information, and the specific steps are as follows:
step 11, integrating CWE and CVE databases, extracting definitions, types and relevant information of the loopholes, and forming a loophole classification system;
step 12, extracting the vulnerability codes and the corresponding repair patches from an external knowledge source, and enriching the content of a vulnerability knowledge base;
Step 13, generating function semantic description for the vulnerability example codes by using a large language model, and generating a repair suggestion by combining the repair cases in an external knowledge base;
Step 14, storing vulnerability definition, classification, semantic description, repair suggestion and related context information in a structured manner, and generating a code attribute graph of vulnerability example codes by using a code analysis tool;
Step 2, retrieval enhancement phase:
Calculating the similarity between semantic descriptions of codes to be tested and semantic descriptions of vulnerability example codes in a vulnerability knowledge base by using a preliminary screening and fine-ranking model, and screening vulnerability examples related to the semantics of the codes to be tested, wherein the specific steps are as follows:
Step 21, preprocessing the input code to be tested, wherein the preprocessing comprises the generation of a function semantic description and the extraction of a code attribute graph;
Step 22, matching and searching:
Step 221, respectively obtaining embedded vectors of semantic descriptions of codes to be tested and semantic descriptions of vulnerability examples through a preliminary screening model, calculating semantic matching degree by utilizing cosine similarity, and screening vulnerability examples related to the codes to be tested in terms of semantics through setting a threshold value;
Step 222, after the primary screening is completed, the residual vulnerability code examples enter a fine-ranking model, the fine-ranking model splices semantic descriptions of the codes to be tested with semantic descriptions of the vulnerability examples to form a new input vector, and then the new input vector is sent into a RoBERTa model to obtain matching scores, and the vulnerability examples are ranked according to the matching scores to ensure that the most relevant vulnerability examples are ranked in front;
step 223, in the code matching stage, calculating the similarity between the code to be tested and the vulnerability example by utilizing a twin graph neural network and a weighted graph embedding and matching mechanism;
Step 23, retrieving results:
Integrating semantic matching and code matching results, and extracting vulnerability examples most relevant to the code to be tested, vulnerability descriptions, repairing suggestions and vulnerability related statement information;
step 3, reasoning enhancement stage:
based on the relevant vulnerability knowledge obtained in the retrieval enhancement stage, detailed vulnerability descriptions and repair suggestions are generated by using a large language model, and the specific steps are as follows:
step 31, carrying out semantic understanding on the code to be detected by the large language model;
step 32, after the code semantic understanding is completed, the large language model is combined with the vulnerability related sentences to carry out deep analysis on the identified potential vulnerabilities;
step 33, after completing the vulnerability analysis, the large language model generates detailed vulnerability descriptions;
Step 34, on the basis of generating the vulnerability descriptions, the large language model further generates targeted repair suggestions;
and step 4, outputting the generated vulnerability description and the generated repair suggestions to a developer, and enabling the developer to verify and adjust the repair scheme.
Compared with the prior art, the invention has the following advantages:
(1) By combining the semantic understanding capability of the large language model with retrieval enhancement and reasoning enhancement technologies, the method can quickly generate detailed and targeted vulnerability descriptions and repair suggestions, so that vulnerability repair efficiency is remarkably improved. Compared with prior art that provides only simple information about the location and type of a vulnerability, the method can deeply analyze the cause and impact of the vulnerability and provide a feasible repair scheme.
(2) The invention utilizes the reasoning capability and the retrieval enhancement technology of the large language model to realize the intellectualization and automation of vulnerability description and restoration suggestion generation. The developer can automatically generate detailed repair suggestions without manually analyzing the deep cause of the vulnerability, so that the technical threshold of vulnerability repair is reduced.
(3) The retrieval enhancement technology and the reasoning enhancement mechanism are highly extensible, and the generation quality can be continuously improved as the vulnerability knowledge base is updated and model training is optimized.
Drawings
FIG. 1 is an overall framework diagram of the vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement of the present invention.
FIG. 2 shows the prompt for code semantic understanding in the reasoning enhancement step.
FIG. 3 shows the prompt for code vulnerability analysis in the reasoning enhancement step.
FIG. 4 shows the prompt for vulnerability description generation in the reasoning enhancement step.
FIG. 5 shows the prompt for the repair suggestion generation stage in the reasoning enhancement step.
FIG. 6 is an example code and vulnerability description thereof.
FIG. 7 is a vulnerability description generated using only a large language model.
FIG. 8 is a repair suggestion generated using only a large language model.
FIG. 9 is a vulnerability description generated by the large language model using the method of the present invention.
FIG. 10 is a repair suggestion generated by the large language model using the method of the present invention.
Detailed Description
The invention is further described below with reference to the accompanying drawings, but is not limited to the following description; any modification or equivalent substitution that does not depart from the spirit and scope of the invention shall fall within the scope of protection of the invention.
The invention provides a vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement. First, by integrating vulnerability databases such as CWE and CVE with external knowledge sources, a structured vulnerability knowledge base is constructed to provide expert-knowledge prompt information for the large language model. Then, the code to be tested is preprocessed with a code analysis tool, and the vulnerability knowledge most relevant to the code to be tested is extracted from the vulnerability knowledge base through semantic matching and code matching. Finally, based on the relevant vulnerability knowledge obtained in the retrieval enhancement stage, detailed vulnerability descriptions and repair suggestions are generated with the large language model, and the generated result is optimized through the chain-of-thought technique. As shown in FIG. 1, the method specifically comprises the following steps:
step 1, constructing a vulnerability knowledge base:
A structured vulnerability knowledge base is constructed for providing expert-knowledge prompt information to the large language model, and comprises vulnerability definitions, vulnerability classifications, semantic descriptions, vulnerability repair suggestions, and related context information. The specific steps are as follows:
Step 11, integrating the CWE (Common Weakness Enumeration) and CVE (Common Vulnerabilities and Exposures) databases, and extracting the definitions, types, and related information of vulnerabilities to form a vulnerability classification system.
Step 12, extracting vulnerability code and the corresponding repair patches from external knowledge sources such as vulnerability reports, open-source code repositories, and vulnerability datasets provided in vulnerability-related papers, to enrich the content of the vulnerability knowledge base.
Step 13, generating function semantic descriptions for the vulnerability example code with the large language model, and generating repair suggestions in combination with the repair cases in the external knowledge base.
Step 14, storing the vulnerability definitions, classifications, semantic descriptions, repair suggestions, and related context information in a structured manner, and generating code attribute graphs of the vulnerability example code with a code analysis tool (such as Joern) for use in the subsequent retrieval enhancement stage.
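As an illustration of the structured storage in step 14, the following Python sketch shows one possible record layout for a knowledge-base entry; the field names and types are assumptions made for readability, not a prescribed schema.

    from dataclasses import dataclass, field

    @dataclass
    class VulnerabilityEntry:
        # Illustrative record for one vulnerability example in the knowledge base.
        cwe_id: str                     # e.g. "CWE-416" from the CWE taxonomy
        cve_ids: list                   # related CVE identifiers
        definition: str                 # vulnerability definition (step 11)
        classification: str             # position in the vulnerability classification system
        example_code: str               # vulnerable code extracted from external sources (step 12)
        patch: str                      # corresponding repair patch (step 12)
        semantic_description: str       # LLM-generated function semantic description (step 13)
        repair_suggestion: str          # repair suggestion built from repair cases (step 13)
        cpg_path: str                   # path to the Joern-generated code attribute graph (step 14)
        context: dict = field(default_factory=dict)   # any related context information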
Step 2, retrieval enhancement phase:
Using the preliminary screening and fine-ranking models, the similarity between the semantic description of the code to be tested and the semantic descriptions of the vulnerability example code in the vulnerability knowledge base is calculated, and vulnerability examples semantically related to the code to be tested are screened out. The specific steps are as follows:
Step 21, preprocessing of the code to be tested: the input code to be tested is preprocessed; the preprocessing mainly comprises two parts, the generation of the function semantic description and the extraction of the code attribute graph. The specific steps are as follows:
Step 211, generation of the function semantic description: an advanced large language model (such as GPT-4o) is used to generate the function semantic description of the code to be tested, the purpose of which is to express the functionality and logic of the code to be tested in natural language so that it can be matched and retrieved against the function semantic descriptions of the other vulnerability code examples in the vulnerability knowledge base.
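A minimal sketch of step 211 is given below; it assumes only a generic llm callable that maps a prompt string to the model's text output (for example, a thin wrapper around a GPT-4o client), and the prompt wording itself is an illustrative assumption.

    SEMANTIC_PROMPT = (
        "Summarize, in natural language, the functionality and logic of the "
        "following function so that it can be matched against descriptions of "
        "known vulnerable code:\n\n{code}"
    )

    def describe_function(code: str, llm) -> str:
        # `llm` is any callable prompt -> completion; the returned text is the
        # function semantic description used for matching and retrieval.
        return llm(SEMANTIC_PROMPT.format(code=code))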
Step 212, extraction of the code attribute graph: the code attribute graph (Yamaguchi F., Golde N., Arp D., Rieck K. Modeling and discovering vulnerabilities with code property graphs. In 2014 IEEE Symposium on Security and Privacy, pp. 590-604. IEEE, 2014.) is extracted with the code parsing tool Joern; it exposes the control and data flows of the code and better reflects the code's structure and execution paths.
Step 22, matching and searching:
Step 221, preliminary screening is performed with the preliminary screening model. At this stage a threshold is set, and only vulnerability code examples whose semantic matching degree exceeds the threshold are retained. The preliminary screening model is a fine-tuned RoBERTa model: the embedding vectors of the semantic description of the code to be tested and of the semantic descriptions of the vulnerability examples are obtained separately, and the semantic matching degree is computed with cosine similarity. In this way, vulnerability examples semantically related to the code to be tested can be screened out quickly. Because the embedding vectors of the vulnerability example descriptions can be computed in advance and stored in the vulnerability knowledge base, the preliminary screening model is computationally efficient and is suitable for large-scale screening over all examples in the knowledge base.
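A minimal sketch of this preliminary screening follows, assuming a locally saved fine-tuned RoBERTa checkpoint; the checkpoint path, mean-pooling strategy, and threshold value are illustrative assumptions.

    import torch
    from transformers import AutoTokenizer, AutoModel

    tokenizer = AutoTokenizer.from_pretrained("./roberta-prescreen")   # assumed checkpoint path
    encoder = AutoModel.from_pretrained("./roberta-prescreen")

    def embed(description: str) -> torch.Tensor:
        # Mean-pool the last hidden states into one embedding vector.
        inputs = tokenizer(description, return_tensors="pt", truncation=True, max_length=512)
        with torch.no_grad():
            hidden = encoder(**inputs).last_hidden_state      # (1, seq_len, dim)
        return hidden.mean(dim=1).squeeze(0)                  # (dim,)

    def prescreen(query_description, kb_embeddings, threshold=0.7):
        # kb_embeddings: {example_id: precomputed embedding tensor} from the knowledge base.
        q = embed(query_description)
        keep = []
        for example_id, e in kb_embeddings.items():
            if torch.cosine_similarity(q, e, dim=0).item() >= threshold:
                keep.append(example_id)
        return keep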
Step 222, after the preliminary screening is completed, the remaining vulnerability code examples enter the fine-ranking model for ranking. The fine-ranking model is also based on a fine-tuned RoBERTa model, but is used differently from the preliminary screening model: it concatenates the semantic description of the code to be tested with the semantic description of each vulnerability example to form a new input vector, which is then fed into the RoBERTa model to obtain a matching score. Finally, the vulnerability examples are ranked by matching score, ensuring that the most relevant examples are ranked first. Because the concatenated input must be re-encoded each time a score is computed, the fine-ranking model is less computationally efficient, but it is suitable for ranking the small number of samples retained after the preliminary screening.
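The fine-ranking step can be sketched as a cross-encoder, again under the assumption of a fine-tuned RoBERTa checkpoint exposed as a single-logit relevance scorer; the checkpoint name and scoring head are assumptions.

    import torch
    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    tokenizer = AutoTokenizer.from_pretrained("./roberta-reranker")    # assumed checkpoint path
    reranker = AutoModelForSequenceClassification.from_pretrained("./roberta-reranker", num_labels=1)

    def rank(query_description, candidates):
        # candidates: {example_id: example semantic description} retained after prescreening.
        scores = {}
        for example_id, example_description in candidates.items():
            # Sentence-pair encoding concatenates the two descriptions into one input.
            inputs = tokenizer(query_description, example_description,
                               return_tensors="pt", truncation=True, max_length=512)
            with torch.no_grad():
                scores[example_id] = reranker(**inputs).logits.squeeze().item()
        # Most relevant vulnerability examples first.
        return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)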
Step 223, in the code matching stage, the similarity between the code to be tested and the vulnerability example is computed mainly with a Siamese graph neural network and a weighted graph embedding and matching mechanism. The specific steps are as follows:
Step 2231, for any statement node v_i of the code attribute graph, its initial feature vector representation is generated with CodeBERT and denoted x_i.
Step 2232, the hidden vector representations of the nodes are obtained with the Siamese graph neural network; the general calculation formulas are as follows:
h_i^(l) = f(h_i^(l-1), {h_j^(l-1) | (v_j, v_i) ∈ E}),  o_i = z(h_i^(L))
where h_i^(l-1) and h_i^(l) are the hidden vector representations of node v_i after l-1 and l layers of the Siamese-GNN, respectively; h_j^(l-1) is the hidden vector representation of node v_j, where v_j is a neighbor of v_i and there is an edge from v_j to v_i; f is the propagation function of the Siamese-GNN model, used to collect neighbor node information to update the state of the current node; and z is the output function, used to compute the final output feature vector o_i of node v_i. The calculation of f and z differs for different types of Siamese-GNN. The present invention is a general method that is not limited to a specific type of Siamese-GNN, so only the general formulas are given here.
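As a concrete illustration of the general propagation formula above, the following sketch performs one message-passing step over the code attribute graph; the sum aggregator, the tanh update, and the identity output function are arbitrary choices standing in for a specific Siamese-GNN variant.

    import numpy as np

    def gnn_layer(h, edges):
        # h: {node_id: hidden vector}; edges: list of (src, dst) with an edge from v_src to v_dst.
        agg = {i: np.zeros_like(v) for i, v in h.items()}
        for src, dst in edges:
            agg[dst] = agg[dst] + h[src]          # collect neighbor information
        # f(h_i^(l-1), aggregated neighbors): here a simple tanh update (an assumption).
        return {i: np.tanh(h[i] + agg[i]) for i in h}

    def gnn_output(h_final):
        # z(.): identity output function chosen for the sketch, giving o_i = h_i^(L).
        return dict(h_final)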
Step 2233, after the code attribute graph statement node representations of the code to be tested and the vulnerability example are obtained, a weighted graph embedding mechanism is adopted to further compute the final vector representations of the two and to compute the similarity between the code to be tested and the vulnerability example. The specific steps are as follows:
Step 22331, for the known vulnerability example code, weights are first computed based on its graph structure to highlight the vulnerability information in it. Specifically, the invention computes data dependency weights and control dependency weights, denoted α and β respectively. For the data dependency weights, the vulnerability node is selected as the root node and assigned weight α_r. If a node is connected to the root node by at least k data dependency edges, its data dependency weight is α_i = α_r·(L_α)^k, where L_α ∈ (0,1) is the decay coefficient controlling the decay rate of the data dependency weights. For the control dependency weights, the same vulnerability node is chosen as the root node with initial weight β_r; if a node can be connected to the root node by at least k control dependency edges, its control dependency weight is β_i = β_r·(L_β)^k, where L_β ∈ (0,1) is the decay factor of the control dependency weights.
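The decayed weights can be computed with a breadth-first traversal from the vulnerability node over the corresponding dependency edges, a node at distance k receiving weight w_root·decay^k; treating the edges as undirected, and the concrete root-weight and decay values, are assumptions of this sketch, as are the names in the usage comment.

    from collections import deque

    def dependency_weights(root, edges, w_root=1.0, decay=0.8):
        # edges: list of (a, b) dependency edges of one type (data or control).
        adj = {}
        for a, b in edges:
            adj.setdefault(a, []).append(b)
            adj.setdefault(b, []).append(a)
        weights = {root: w_root}
        queue = deque([(root, 0)])
        seen = {root}
        while queue:
            node, k = queue.popleft()
            for nbr in adj.get(node, []):
                if nbr not in seen:
                    seen.add(nbr)
                    weights[nbr] = w_root * decay ** (k + 1)   # alpha_i or beta_i
                    queue.append((nbr, k + 1))
        return weights

    # Usage (names hypothetical): alpha = dependency_weights(vuln_node, data_dep_edges)
    #                             beta  = dependency_weights(vuln_node, ctrl_dep_edges)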
From α_i and β_i, the weight of each statement node in the vulnerability example code is obtained; the weights of all nodes form the weight matrix W_S of node set V_S, where n_s is the number of statement nodes in the vulnerability example code.
Then, the final vector representation of the vulnerability example code can be obtained by combining the weights and the node representation matrix, and the calculation formula is as follows:
σ(·) = MaxPool(ReLU(Conv(·)))
z_s = AVG(MLP(σ(W_S * O_S)))
where σ(·) is defined as a one-dimensional convolution layer Conv followed by max pooling MaxPool, ReLU is the activation function, AVG is average pooling, MLP denotes a multi-layer perceptron, O_S is the set of output feature vectors of the nodes, and z_s is the final vector representation of the vulnerability example code.
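A minimal PyTorch sketch of the weighted graph embedding z = AVG(MLP(σ(W·O))) follows; the feature dimension, convolution kernel size, and MLP width are illustrative assumptions.

    import torch
    import torch.nn as nn

    class WeightedGraphEmbedding(nn.Module):
        def __init__(self, dim=128, hidden=256):
            super().__init__()
            self.conv = nn.Conv1d(dim, dim, kernel_size=3, padding=1)   # Conv in sigma(.)
            self.pool = nn.MaxPool1d(kernel_size=2)                     # MaxPool in sigma(.)
            self.mlp = nn.Sequential(nn.Linear(dim, hidden), nn.ReLU(), nn.Linear(hidden, dim))

        def forward(self, node_feats, node_weights):
            # node_feats O: (n_nodes, dim); node_weights W: (n_nodes,)
            weighted = node_feats * node_weights.unsqueeze(-1)   # W * O, row-wise scaling
            x = weighted.t().unsqueeze(0)                        # (1, dim, n_nodes) for Conv1d
            x = self.pool(torch.relu(self.conv(x)))              # sigma(.) = MaxPool(ReLU(Conv(.)))
            x = x.squeeze(0).t()                                 # back to (n_nodes', dim)
            return self.mlp(x).mean(dim=0)                       # AVG(MLP(.)) -> graph vector z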
Step 22332, for the code to be tested, the invention assigns node weights using the vulnerability information known from the vulnerability example together with a node attention mechanism. Specifically, for any node in the code to be tested, the previously obtained z_s is concatenated with that node's output vector and fed into a linear layer to compute the node's attention score, which is then used as the node's weight. Finally, the final vector representation of the code to be tested is obtained in the same way as for the vulnerability example, with the calculation formula as follows:
z_f = AVG(MLP(σ(W_F * O_F)))
where the weight of each node in the code to be tested is the attention score produced by the fully connected layer Linear, W_F is the weight matrix of node set V_F, n_f is the number of statement nodes in the code to be tested, and z_f is the final vector representation of the code to be tested.
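The attention-based weighting of the code to be tested can be sketched as follows; the single-logit linear scorer and the feature dimension are assumptions of the sketch.

    import torch
    import torch.nn as nn

    class NodeAttention(nn.Module):
        def __init__(self, dim=128):
            super().__init__()
            self.score = nn.Linear(2 * dim, 1)   # the fully connected layer "Linear"

        def forward(self, z_s, node_feats):
            # z_s: (dim,) final vector of the vulnerability example;
            # node_feats: (n_f, dim) output vectors of the nodes in the code to be tested.
            paired = torch.cat([z_s.expand(node_feats.size(0), -1), node_feats], dim=-1)
            return self.score(paired).squeeze(-1)   # attention scores used as node weights W_F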
Step 22333, the code similarity Code_similarity between the vulnerability example code and the code to be tested is computed with cosine similarity:
Code_similarity = (z_s · z_f) / (||z_s|| · ||z_f||)
The retrieval results are sorted according to Code_similarity.
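Expressed in code, the similarity and ranking step looks like the sketch below; the small epsilon guarding against zero-norm vectors is an implementation choice, not part of the formula.

    import numpy as np

    def code_similarity(z_s, z_f):
        # Cosine similarity between the final embeddings of the vulnerability
        # example code (z_s) and the code to be tested (z_f).
        return float(np.dot(z_s, z_f) / (np.linalg.norm(z_s) * np.linalg.norm(z_f) + 1e-12))

    def rank_by_code_similarity(z_f, example_embeddings):
        # example_embeddings: {example_id: z_s vector}; most similar first.
        return sorted(example_embeddings.items(),
                      key=lambda kv: code_similarity(kv[1], z_f), reverse=True)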
Step 23, retrieving results:
The semantic matching and code matching results are integrated, and the vulnerability examples most relevant to the code to be tested are extracted together with their vulnerability descriptions, repair suggestions, and vulnerability-related statement information, for use in the subsequent reasoning enhancement stage.
Step 3, reasoning enhancement stage:
Based on the relevant vulnerability knowledge obtained in the retrieval enhancement stage, detailed vulnerability descriptions and repair suggestions are generated with the large language model. The specific steps are as follows:
Step 31, in the first step of the reasoning enhancement stage, the large language model performs deep semantic understanding of the code to be tested. This is not merely a surface-level analysis of the code: the model must combine the contextual information and the vulnerability detection results to fully understand the functionality and potential risks of the code, while also being allowed to attempt to locate potential security issues on its own. The prompt designed for this stage is shown in FIG. 2, in which the ##CODE## placeholder is to be replaced with the code to be tested.
Step 32, after the code semantic understanding is completed, if the vulnerability detection tool used has statement-level vulnerability localization capability, its localization result can be fed back to the large language model to further refine the model's understanding of the code to be tested. In addition, the large language model can perform an in-depth analysis of the identified potential vulnerabilities in combination with the vulnerability-related statements. The prompt constructed for this stage is shown in FIG. 3, in which ##STATEMENTS## is a placeholder for the vulnerability-related statements located by the vulnerability detection tool.
Step 33, after completing the vulnerability analysis, the large language model generates a detailed vulnerability description. This process not only summarizes the basic information of the vulnerability but also incorporates the retrieved related information to ensure that the description is comprehensive and accurate. The prompt constructed for this stage is shown in FIG. 4, in which ##EXAMPLE CODE## and ##EXAMPLE DESCRIPTION## are placeholders for the vulnerability example with the highest semantic relevance and code matching degree to the code to be tested, and for its vulnerability description, taken from the retrieval enhancement ranking results.
Step 34, on the basis of the generated vulnerability description, the large language model further generates targeted repair suggestions. This process likewise relies on the deep understanding and analysis of the vulnerability. The prompt constructed for this stage is shown in FIG. 5, in which ##EXAMPLE CODE## and ##FIX RECOMMENDATION## are placeholders for the vulnerability example with the highest semantic relevance and code matching degree to the code to be tested, and for its repair suggestion, taken from the retrieval enhancement ranking results.
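The four reasoning-enhancement stages can be chained as shown below. The exact prompt wording lives in FIGS. 2-5, so the templates here are paraphrased assumptions, and llm is again any prompt-to-completion callable.

    def generate_report(code, statements, example_code, example_description,
                        fix_recommendation, llm):
        # Step 31: semantic understanding of the code to be tested.
        understanding = llm("Analyze the functionality and potential security risks "
                            "of the following code:\n" + code)
        # Step 32: in-depth vulnerability analysis using the located statements.
        analysis = llm("Given this analysis:\n" + understanding +
                       "\nand these vulnerability-related statements:\n" + statements +
                       "\nanalyze the potential vulnerability in depth.")
        # Step 33: vulnerability description grounded in the retrieved example.
        description = llm("Example code:\n" + example_code +
                          "\nExample description:\n" + example_description +
                          "\nAnalysis:\n" + analysis +
                          "\nWrite a detailed vulnerability description for the code.")
        # Step 34: targeted repair suggestion grounded in the retrieved fix.
        suggestion = llm("Example code:\n" + example_code +
                         "\nFix recommendation:\n" + fix_recommendation +
                         "\nVulnerability description:\n" + description +
                         "\nPropose a targeted repair suggestion.")
        return {"description": description, "repair_suggestion": suggestion}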
And step 4, outputting the generated vulnerability description and the generated repair suggestions to a developer, and enabling the developer to verify and adjust the repair scheme.
Examples:
Taking the code vulnerability shown in FIG. 6 as an example, the figure marks the vulnerable statements and the patch statements that repair the vulnerability, shows the portion of the ion_ioctl function that handles the ion_ioc_free command, and also contains the reference vulnerability description. Specifically, the root cause of the vulnerability is that, when ion_ioc_free is invoked concurrently, multiple threads may access the same ion_handle instance at the same time, resulting in a use-after-free vulnerability. Without the method of the invention, the vulnerability description and repair suggestion generated by the large language model are shown in FIG. 7 and FIG. 8. The description generated with the large language model alone focuses on a large number of potential problems in the code, such as improper error handling, memory leaks, and insufficient validation of user input. Although broad, it fails to accurately identify the core use-after-free problem; the overly general description and the lack of analysis of the specific vulnerability mechanism cause the generated description to deviate considerably from the actual situation. Likewise, the repair suggestions produced with the large language model alone fail to address the core use-after-free problem. After applying the method of the invention, the vulnerability description and repair suggestion generated by the model are shown in FIG. 9 and FIG. 10. The generated description not only accurately identifies the concurrent-access problem but also analyzes the specific mechanism and potential impact of the use-after-free vulnerability in detail, pointing out that the ion_handle may still be accessed after it is released, leading to memory corruption or a security vulnerability. The model also provides concrete repair measures: access to the ion_handle instance should be synchronized through reference counting and mutex locking to ensure that a handle is no longer used by other threads before it is released; the handle's state should be verified before operating on it; and the release of the handle can be delayed so that ongoing operations can complete. These measures significantly reduce the risk of use-after-free vulnerabilities and enhance the overall security of the ION driver. By combining the advantages of retrieval enhancement and reasoning enhancement, the method produces descriptions that are accurate and detailed, providing developers with a clear understanding of the vulnerability and actionable repair suggestions.

Claims (7)

1. A vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement, characterized in that the method comprises the following steps:
Step 1: vulnerability knowledge base construction: constructing a structured vulnerability knowledge base for providing expert-knowledge prompt information to the large language model, the vulnerability knowledge base comprising vulnerability definitions, classifications, semantic descriptions, repair suggestions, and related context information;
Step 2: retrieval enhancement stage: using preliminary screening and fine-ranking models, calculating the similarity between the semantic description of the code to be tested and the semantic descriptions of the vulnerability example code in the vulnerability knowledge base, and screening out vulnerability examples semantically related to the code to be tested;
Step 3: reasoning enhancement stage: based on the relevant vulnerability knowledge obtained in the retrieval enhancement stage, generating detailed vulnerability descriptions and repair suggestions with the large language model;
Step 4: outputting the generated vulnerability descriptions and repair suggestions to the developer, so that the developer can verify and adjust the repair plan.
2. The method according to claim 1, characterized in that the specific steps of step 1 are as follows:
Step 11: integrating the CWE and CVE databases, extracting the definitions, types, and related information of vulnerabilities, and forming a vulnerability classification system;
Step 12: extracting vulnerability code and the corresponding repair patches from external knowledge sources to enrich the content of the vulnerability knowledge base;
Step 13: generating function semantic descriptions for the vulnerability example code with the large language model, and generating repair suggestions in combination with the repair cases in the external knowledge base;
Step 14: storing the vulnerability definitions, classifications, semantic descriptions, repair suggestions, and related context information in a structured manner, and generating code attribute graphs of the vulnerability example code with a code parsing tool.
3. The method according to claim 1, characterized in that the specific steps of step 2 are as follows:
Step 21: preprocessing of the code to be tested: preprocessing the input code to be tested, the preprocessing comprising generation of the function semantic description and extraction of the code attribute graph;
Step 22: matching and retrieval:
Step 221: obtaining the embedding vectors of the semantic description of the code to be tested and of the semantic descriptions of the vulnerability examples through the preliminary screening model, computing the semantic matching degree with cosine similarity, and screening out vulnerability examples semantically related to the code to be tested by setting a threshold;
Step 222: after the preliminary screening is completed, the remaining vulnerability code examples enter the fine-ranking model; the fine-ranking model concatenates the semantic description of the code to be tested with the semantic description of each vulnerability example to form a new input vector, which is fed into the RoBERTa model to obtain a matching score, and the vulnerability examples are ranked by matching score so that the most relevant examples are ranked first;
Step 223: in the code matching stage, computing the similarity between the code to be tested and the vulnerability examples with a Siamese graph neural network and a weighted graph embedding and matching mechanism;
Step 23: retrieval results: integrating the semantic matching and code matching results, and extracting the vulnerability examples most relevant to the code to be tested together with their vulnerability descriptions, repair suggestions, and vulnerability-related statement information.
4. The method according to claim 3, characterized in that the specific steps of step 21 are as follows:
Step 211: generation of the function semantic description: generating the function semantic description of the code to be tested with a large language model, expressing the functionality and logic of the code to be tested in natural language, for subsequent matching and retrieval against the function semantic descriptions of other vulnerability code examples in the vulnerability knowledge base;
Step 212: extraction of the code attribute graph: extracting the code attribute graph with the code parsing tool Joern.
5. The method according to claim 3, characterized in that the specific steps of step 223 are as follows:
Step 2231: for any statement node v_i of the code attribute graph, generating its initial feature vector representation with CodeBERT, denoted x_i;
Step 2232: obtaining the hidden vector representations of the nodes with the Siamese graph neural network, according to
h_i^(l) = f(h_i^(l-1), {h_j^(l-1) | (v_j, v_i) ∈ E}),  o_i = z(h_i^(L))
where h_i^(l-1) and h_i^(l) are the hidden vector representations of node v_i after l-1 and l layers of the Siamese-GNN, respectively; h_j^(l-1) is the hidden vector representation of node v_j, v_j being a neighbor of v_i with an edge from v_j to v_i; f is the propagation function of the Siamese-GNN model, used to collect information from neighboring nodes to update the state of the current node; and z is the output function, used to compute the final output feature vector o_i of node v_i;
Step 2233: after obtaining the code attribute graph statement node representations of the code to be tested and the vulnerability example, further computing their final vector representations with a weighted graph embedding mechanism, and computing the similarity between the code to be tested and the vulnerability example.
6. The method according to claim 5, characterized in that the specific steps of step 2233 are as follows:
Step 22331: for the known vulnerability example code, first computing data dependency weights and control dependency weights based on its graph structure, denoted α and β respectively; for the data dependency weights, selecting the vulnerability node as the root node and assigning it weight α_r, and if a node is connected to the root node by at least k data dependency edges, its data dependency weight is α_i = α_r·(L_α)^k, where L_α ∈ (0,1) is the decay coefficient controlling the decay rate of the data dependency weights; for the control dependency weights, selecting the same vulnerability node as the root node with initial weight β_r, and if a node can be connected to the root node by at least k control dependency edges, its control dependency weight is β_i = β_r·(L_β)^k, where L_β ∈ (0,1) is the decay factor of the control dependency weights;
obtaining the weight of each node from α_i and β_i, where W_S is the weight matrix of node set V_S and n_s is the number of statement nodes in the vulnerability example code;
then obtaining the final vector representation of the vulnerability example code by combining the weights with the node representation matrix:
σ(·) = MaxPool(ReLU(Conv(·)))
z_s = AVG(MLP(σ(W_S * O_S)))
where σ(·) is defined as a one-dimensional convolution layer Conv followed by max pooling MaxPool, ReLU is the activation function, AVG is average pooling, MLP denotes a multi-layer perceptron, O_S is the set of output feature vectors of the nodes, and z_s is the final vector representation of the vulnerability example code;
Step 22332: for the code to be tested, assigning weights using the vulnerability information known from the vulnerability example and a node attention mechanism: for any node in the code to be tested, concatenating the previously obtained z_s with the output vector of that node, inputting the result into a linear layer to compute the node's attention score, and using that score as the node's weight; finally, obtaining the final vector representation of the code to be tested in the same way as for the vulnerability example:
z_f = AVG(MLP(σ(W_F * O_F)))
where the weight of each node in the code to be tested is the attention score produced by the fully connected layer Linear, W_F is the weight matrix of node set V_F, n_f is the number of statement nodes in the code to be tested, and z_f is the final vector representation of the code to be tested;
Step 22333: computing the code similarity Code_similarity between the vulnerability example code and the code to be tested with cosine similarity:
Code_similarity = (z_s · z_f) / (||z_s|| · ||z_f||)
and sorting the retrieval results according to Code_similarity.
7. The method according to claim 1, characterized in that the specific steps of step 3 are as follows:
Step 31: the large language model performs semantic understanding of the code to be tested;
Step 32: after the code semantic understanding is completed, the large language model performs an in-depth analysis of the identified potential vulnerabilities in combination with the vulnerability-related statements;
Step 33: after completing the vulnerability analysis, the large language model generates a detailed vulnerability description;
Step 34: on the basis of the generated vulnerability description, the large language model further generates targeted repair suggestions.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202510320737.XA CN120145397A (en) 2025-03-18 2025-03-18 Vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202510320737.XA CN120145397A (en) 2025-03-18 2025-03-18 Vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement

Publications (1)

Publication Number Publication Date
CN120145397A true CN120145397A (en) 2025-06-13

Family

ID=95955251

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202510320737.XA Pending CN120145397A (en) 2025-03-18 2025-03-18 Vulnerability description and repair suggestion generation method based on large language model reasoning and retrieval enhancement

Country Status (1)

Country Link
CN (1) CN120145397A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN120373474A (en) * 2025-06-26 2025-07-25 上海交通大学 Logical vulnerability knowledge base construction method and device based on large language model, electronic equipment and storage medium
CN120373474B (en) * 2025-06-26 2025-09-02 上海交通大学 Logical vulnerability knowledge base construction method and device based on large language model, electronic equipment and storage medium
CN120541851A (en) * 2025-07-25 2025-08-26 天津开发区先特网络系统有限公司 A code evaluation and repair method based on large model technology
CN120611390A (en) * 2025-08-11 2025-09-09 杭州孝道科技有限公司 An AI-based vulnerability repair rule generation method and related equipment
CN120611388A (en) * 2025-08-11 2025-09-09 杭州孝道科技有限公司 A method, system and storage medium for generating open source component repair suggestions
CN120611388B (en) * 2025-08-11 2025-10-28 杭州孝道科技有限公司 Method, system and storage medium for generating open source component restoration opinion
CN121051762A (en) * 2025-11-03 2025-12-02 季华实验室 Method, device, equipment and storage medium for detecting cross-package vulnerability of supply chain

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination