JP2022119668A

JP2022119668A - Code change method and code change program

Info

Publication number: JP2022119668A
Application number: JP2021016952A
Authority: JP
Inventors: 晴樹横山; Haruki Yokoyama; 訓広野田; Kunihiro Noda
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2021-02-04
Filing date: 2021-02-04
Publication date: 2022-08-17

Abstract

[Problem] To provide a code change method and program that can determine, based on a pattern, the connection relationship between the AST nodes of the change target program code and the AST nodes of the post-change subgraph. [Solution] A method for changing code based on a code change pattern extracted from the program dependence graphs of a program before and after a change. In each of the abstract syntax tree of the change target program code (the change target AST) and the abstract syntax tree of the post-change program code (the post-change AST), the method identifies a change target induced subtree and a post-change induced subtree, each of which has the nodes of the pre-change subgraph or the post-change subgraph and has, as its root node or a leaf node, a map node that has a correspondence between the change target AST and the post-change AST. The method adds the post-change induced subtree to the change target AST from which the change target induced subtree has been deleted, and connects the boundary nodes in the post-change induced subtree to nodes in the change target AST after the deletion. [Selected drawing] FIG. 30

Description

The present invention relates to a code change method and a code change program.

A code change method and a code change program are one function of a development support program that supports program development. During program development, similar changes are sometimes made at multiple locations in the source code. For example, when an API (Application Programming Interface) used in the source code is upgraded or changed, every place that uses the API is changed in one batch; or, when a bug in the source code is fixed, every occurrence of the same kind of bug is changed in one batch.

Making similar changes at different locations in the source code in this way is called a systematic edit (a systematic edit, change, or transformation). In the development history of the source code, systematic edits may occur at the same location or at different locations in different versions, or at different locations within the same version.

When a systematic edit has many locations to change, some changes may be missed and the cost of rewriting the code increases. To address these problems, systematic edit patterns (hereinafter referred to as code change patterns, edit patterns, editing patterns, or simply patterns) are collected or extracted from the source code of past development history, and the patterns are used to automate the detection of locations to change, the code change itself, and similar code change work. Such code changes make it possible to support program development.

The flow of change support and automatic correction based on systematic edits is, for example: (1) mine code change patterns from a group of source code development histories; (2) detect the locations in the change target program to which a code change pattern applies; and (3) change the code at each detected location based on the code change pattern.

The code change pattern described above is expressed, for example, as a change in a PDG (Program Dependence Graph). Frequently occurring subgraphs are then mined as code change patterns from a set of change graphs, which are graphs that express changes in PDGs. Using change graphs makes it possible to express code change patterns that take the semantic connections of the program into account, and therefore to express code change patterns more flexibly than, for example, with AST (Abstract Syntax Tree) edit scripts.

A PDG change graph is generated, for example, by: (1) converting the source programs before and after the code change into ASTs; (2) computing the correspondence between AST nodes with an AST differencing algorithm; (3) converting the code of the function or method before and after the change into fgPDGs (fine-grained PDGs) and placing the pre-change and post-change fgPDGs side by side; and (4) attaching correspondence information between the nodes of the pre-change and post-change fgPDGs based on the correspondence between AST nodes.

Subgraphs that occur frequently in the multiple change graphs generated from the development history are mined as code change patterns. A code change pattern has a pre-change subgraph, a post-change subgraph, and a correspondence (map) between the nodes of the two graphs. A subgraph that matches the pre-change subgraph of the code change pattern is then detected in the fgPDG of the change target program code as a pattern application location, and that location is changed to the post-change subgraph of the code change pattern. Changing code with a code change pattern in this way is pattern-based code change processing.
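The structure of a code change pattern and the detection of a pattern application location can be sketched as follows. This is a minimal illustration, not the method of the embodiment: the class names `Graph` and `CodeChangePattern`, the function `find_application_sites`, the node identifiers, and the example graphs are all assumptions, and the brute-force search is only practical for very small graphs.

```python
from dataclasses import dataclass
from itertools import permutations


@dataclass
class Graph:
    labels: dict  # node id -> label, e.g. {"p1": "File"}
    edges: set    # {(source id, dependency name, destination id)}


@dataclass
class CodeChangePattern:
    before: Graph   # pre-change subgraph
    after: Graph    # post-change subgraph
    node_map: dict  # pre-change node id -> post-change node id


def find_application_sites(pattern: Graph, target: Graph) -> list:
    """Return every binding of pattern nodes to target nodes that keeps labels and edges."""
    pattern_ids = list(pattern.labels)
    sites = []
    # Try every injective assignment of pattern nodes to target nodes.
    for candidate in permutations(target.labels, len(pattern_ids)):
        binding = dict(zip(pattern_ids, candidate))
        same_labels = all(pattern.labels[p] == target.labels[t]
                          for p, t in binding.items())
        same_edges = all((binding[s], dep, binding[d]) in target.edges
                         for s, dep, d in pattern.edges)
        if same_labels and same_edges:
            sites.append(binding)
    return sites


# Hypothetical pattern: a call to File.read whose result is assigned becomes IO.read.
pattern = CodeChangePattern(
    before=Graph({"p1": "File", "p2": "read", "p3": "="},
                 {("p1", "recv", "p2"), ("p2", "para", "p3")}),
    after=Graph({"q1": "IO", "q2": "read", "q3": "="},
                {("q1", "recv", "q2"), ("q2", "para", "q3")}),
    node_map={"p3": "q3"},
)

# Hypothetical fgPDG of the change target statement u = File.read(path).
target = Graph({"t1": "File", "t2": "read", "t3": "path", "t4": "=", "t5": "u"},
               {("t1", "recv", "t2"), ("t3", "para", "t2"),
                ("t2", "para", "t4"), ("t4", "def", "t5")})

sites = find_application_sites(pattern.before, target)
print(sites)
```

The binding that is found tells which target nodes play the role of each pre-change pattern node; the nodes that the pattern does not cover (here `path` and `u`) are the ones that must later be reconnected to the post-change subgraph.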

The patent literature listed below describes methods for supporting program editing. The non-patent literature describes expressing code change patterns as changes in a PDG.

Japanese Translation of PCT International Application Publication No. 2011-513824
Japanese Laid-open Patent Publication No. 2008-225932

Graph-based Mining of In-the-Wild, Fine-grained, Semantic Code Change Patterns, Hoan Nguyen, Tien N. Nguyen, Danny Dig, Son Nguyen, Hieu Tran, Michael Hilton, International Conference on Software Engineering, 2019

It is difficult to perform the pattern-based code change processing described above at the level of fgPDG nodes, because fgPDG nodes, unlike AST nodes, do not correspond one-to-one to the source code. If pattern-based code change processing is performed at the fgPDG node level, it is therefore difficult to connect the fgPDG nodes of the change target program code to the nodes of the post-change subgraph of the code change pattern.

Pattern-based code change processing is therefore performed at the level of AST nodes. Even at the AST node level, however, the connection relationship between the AST nodes of the change target program code and the AST nodes of the post-change subgraph of the code change pattern sometimes cannot be determined.

An object of a first aspect of the present embodiment is therefore to provide a code change method and a code change program that, in pattern-based code change processing, can determine the connection relationship between the AST nodes of the change target program code and the AST nodes of the post-change subgraph of the code change pattern.

A first aspect of the present embodiment is a code change method for changing change target program code based on a code change pattern that has a pre-change subgraph and a post-change subgraph extracted from a pre-change program dependence graph and a post-change program dependence graph of pre-change program code and post-change program code, the method comprising:
in each of a change target AST, which is an abstract syntax tree (hereinafter referred to as AST) of the change target program code that matches the pre-change subgraph, and a post-change AST, which is an AST of the post-change program code having the post-change subgraph,
identifying a change target induced subtree and a post-change induced subtree, each of which has the nodes of the pre-change subgraph or the post-change subgraph and has, as its root node or a leaf node, a map node that has a correspondence between the change target AST and the post-change AST;
deleting the change target induced subtree from the change target AST;
adding the post-change induced subtree to the change target AST after the deletion; and
connecting a boundary node in the post-change induced subtree to a node in the change target AST after the deletion.
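The delete, add, and connect steps can be illustrated on a toy AST. The sketch below is a simplified reading that covers only one situation, in which the whole subtree under a single node is exchanged and the root of the new subtree is the only boundary node; it does not model how the induced subtrees are identified, nor boundary nodes on the leaf side. The names `Node`, `find_parent`, `replace_subtree`, and `preorder`, and the example trees, are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    label: str
    children: list = field(default_factory=list)


def find_parent(root, wanted):
    """Return the parent of `wanted` (compared by identity), or None."""
    for child in root.children:
        if child is wanted:
            return root
        found = find_parent(child, wanted)
        if found is not None:
            return found
    return None


def replace_subtree(ast_root, old_root, new_root):
    parent = find_parent(ast_root, old_root)
    if parent is None:
        raise ValueError("old_root is not a proper descendant of ast_root")
    index = next(i for i, c in enumerate(parent.children) if c is old_root)
    del parent.children[index]               # delete the change target subtree
    parent.children.insert(index, new_root)  # add the new subtree and connect its root


def preorder(node):
    yield node.label
    for child in node.children:
        yield from preorder(child)


# Hypothetical change target AST for the statement send(readLines("a.csv")).
old_subtree = Node("MI: readLines(...)", [Node("SN: readLines"), Node('L: "a.csv"')])
target_ast = Node("ES: send(...)", [Node("MI: send(...)", [Node("SN: send"), old_subtree])])

# Hypothetical post-change subtree taken from the pattern.
new_subtree = Node("MI: readCSV(...)", [Node("SN: readCSV"), Node('L: "a.csv"')])

replace_subtree(target_ast, old_subtree, new_subtree)
labels = list(preorder(target_ast))
print(labels)
```

Keeping the child index of the deleted subtree is one simple way to decide where the boundary node is connected; the embodiment determines this connection from the map nodes.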

According to the first aspect, it is possible to provide a code change method and a code change program that, in pattern-based code change processing, can determine the connection relationship between the AST nodes of the change target program code and the AST nodes of the post-change subgraph of the code change pattern.

FIG. 1 is a diagram showing an example of a systematic edit.
FIG. 2 is a diagram showing an example of a flowchart of code change support and automatic correction by systematic edits.
FIG. 3 is a flowchart of processing for mining code change patterns from change graphs that represent changes in PDGs.
FIG. 4 is a flowchart of the detailed processing for mining code change patterns as changes in PDGs.
FIG. 5 is a diagram showing an example AST for an example source program.
FIG. 6 is a diagram showing a specific example of the AST before a code change, the AST after the change, and the correspondence between the two ASTs.
FIG. 7 is a diagram showing an example of an fgPDG converted from source program code.
FIG. 8 is a diagram showing an example of a change graph.
FIG. 9 is a diagram showing an example of extracting, as code change patterns, subgraphs that occur with at least a predetermined frequency in multiple change graphs.
FIG. 10 is a diagram showing an example of detecting a location to which a code change pattern applies and changing the code.
FIG. 11 is a diagram showing a configuration example of a development support apparatus that performs pattern-based code changes in the present embodiment.
FIG. 12 is a schematic flowchart of the code change processing in the present embodiment.
FIG. 13 is a diagram explaining the detection of pattern application locations in an fgPDG and the code change processing.
FIG. 14 is a diagram showing a specific example of the pattern application location detection processing S2 in an fgPDG.
FIG. 15 is a diagram showing, with a specific example, the problem with the pattern-based code change processing S3 on an AST.
FIG. 16 is a flowchart of the AST-based code change processing in the present embodiment.
FIG. 17 is a flowchart of the pattern-based code change processing executed for each of multiple code change patterns, multiple pattern application locations, and multiple change graphs.
FIG. 18 is a flowchart of generating the minimal induced subtree MIDS.
FIG. 19 is a flowchart of the processing for obtaining A(M_H) in the pattern-based code change.
FIG. 20 is a diagram explaining the deletion processing S51 with a specific example.
FIG. 21 is a diagram explaining the addition processing S52 with a specific example.
FIG. 22 is a diagram showing the processing for determining the connection relationship of boundary nodes in case (a), where the root node is a mapped node.
FIG. 23 is a diagram showing a specific example of the processing for determining the connection relationship of boundary nodes in case (a), where the root node is a mapped node.
FIG. 24 is a diagram showing a specific example of the processing for determining the connection relationship of boundary nodes in case (a), where the root node is a mapped node.
FIG. 25 is a diagram showing the processing for determining the connection relationship of boundary nodes in case (b), where the root node is an unmapped statement node.
FIG. 26 is a diagram showing a specific example of the processing for determining the connection relationship of boundary nodes in case (b), where the root node is an unmapped statement node.
FIG. 27 is a diagram showing the processing for determining the connection relationship of boundary nodes in case (c), where a leaf node is a mapped node.
FIG. 28 is a diagram showing a specific example of the processing for determining the connection relationship of boundary nodes in case (c), where a leaf node is a mapped node.
FIG. 29 is a diagram showing a specific example of the processing for determining the connection relationship of boundary nodes in case (c), where a leaf node is a mapped node.
FIG. 30 is a diagram showing a specific example of the processing for determining the connection relationship of boundary nodes in case (c), where a leaf node is a mapped node.
FIG. 31 is a diagram showing an example user screen of the development support apparatus 1 that performs systematic edits in the present embodiment.

The embodiments are described below using the Java language as an example (Java is a registered trademark of Oracle and its affiliates), but the embodiments are not limited to Java and can also be applied to C and other languages. In the following, systematic edits and techniques for mining code change patterns are first described as background, and then the mining of code change patterns in the embodiment is described.

[Code change support and automatic correction based on systematic edits]
FIG. 1 is a diagram showing an example of a systematic edit. In the figure, the horizontal axis shows the version numbers v1 to v4 of the development history, and the source programs SP1 to SP4 corresponding to the respective version numbers are shown.

As described above, during program development, similar changes are sometimes made at multiple locations in the source code. For example, when an API used in the source code is upgraded or changed, every place that uses the API is changed in one batch; or, when a bug in the source code is fixed, every occurrence of the same kind of bug is changed in one batch. Making similar changes at different locations in the source code in this way is called a systematic edit. In the development history of the source code, systematic edits may occur at the same location or at different locations in different versions, or at different locations within the same version.

In FIG. 1, each of the source programs SP1 to SP4 contains statements to be deleted, whose lines begin with "-", and statements to be inserted, whose lines begin with "+" (both shown underlined). In this example, the API method File.read() is changed to the API method IO.read(). This code change is made at different locations in the source programs SP1 to SP3 of different versions.
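The "-" and "+" notation is the usual line-diff notation. As a small illustration, the snippet below produces such a diff with Python's standard `difflib` module for a hypothetical two-line fragment; the Java statements are invented for the example and are not taken from FIG. 1.

```python
import difflib

# Hypothetical source fragment before and after the systematic edit.
before = ['String s = File.read("a.txt");', 'send(s);']
after = ['String s = IO.read("a.txt");', 'send(s);']

# Keep only the changed and context lines, dropping the diff headers.
diff = [line
        for line in difflib.unified_diff(before, after, lineterm="", n=1)
        if not line.startswith(("---", "+++", "@@"))]
print("\n".join(diff))
```

The deleted statement appears with a leading "-", the inserted statement with a leading "+", and the unchanged line with a leading space.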

In the example of FIG. 1, while source program SP4 of version v4 is being developed, a processor executing a code change program based on code change patterns mines, from a development history group that includes the previously developed source programs SP1 and SP2, the systematic edit pattern (code change pattern) of changing the API method File.read() to the API method IO.read(). The processor then applies the same code change to the newly developed source program SP4 based on the mined code change pattern.

FIG. 2 is a diagram showing an example of a flowchart of code change support and automatic correction by systematic edits. Systematic editing (systematic code editing) includes processing S1, which mines code change patterns 11 that occur with at least a predetermined frequency from a development history group 10 containing multiple previously developed source codes. One example of a code change pattern 11 is the pattern in FIG. 1 that changes the API method File.read() to the API method IO.read(). Here, a method is a function in Java (registered trademark). Functions such as methods are used as the unit of code change.

Systematic code change further includes processing S2, which detects code change pattern application locations 14 in the change target program 12 that match a code change pattern 11, and processing S3, which changes the code at each code change pattern application location 14 in the change target program 12 based on the code change pattern. The present embodiment relates to an improvement of processing S3, the code change based on a code change pattern.

A code change pattern can be expressed as line-level or token-level changes in the source program, as an AST edit script, as a change in a program dependence graph (PDG), and so on. A token is the smallest unit of a program that cannot be decomposed further, and a programming statement consists of multiple tokens. An AST edit script expresses the difference between source codes as a difference between ASTs. The present embodiment deals with code change patterns expressed as changes in a PDG. Changes in a PDG have the advantage of expressing change patterns that take the semantic connections of the program into account, and can express the code change patterns that occur frequently in a code change history more flexibly than the other two representations.

FIG. 3 is a flowchart of processing for mining code change patterns from change graphs that represent changes in PDGs. A change graph is a graph computed from the difference between a method of the program before a code change and the same method after the change. The mining processing repeats, for each version in the development history (S10) and for each changed method (S11), processing that computes a change graph CGm from the change difference of the changed method m (S12). The mining processing then mines, from the computed group of change graphs, frequent subgraphs that occur with at least a predetermined frequency (S13). These frequent subgraphs correspond to code change patterns.
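The loop structure of S10 to S13 can be sketched as follows. The sketch is a deliberately reduced stand-in for frequent-subgraph mining: a change graph is modelled as a set of dependency edges tagged with the side they belong to, map edges are omitted, and "frequent subgraph" is approximated by counting individual edges. The function names, the data layout, and the example history are assumptions.

```python
from collections import Counter


def compute_change_graph(method_diff):
    """S12 (toy version): tag each dependency edge with the side it belongs to."""
    before_edges, after_edges = method_diff
    return ({("before",) + edge for edge in before_edges}
            | {("after",) + edge for edge in after_edges})


def mine_patterns(history, min_frequency):
    change_graphs = []
    for version in history:              # S10: each version in the development history
        for method_diff in version:      # S11: each changed method m
            change_graphs.append(compute_change_graph(method_diff))  # S12
    counts = Counter()
    for graph in change_graphs:          # S13 (simplified): count each edge once per graph
        counts.update(graph)
    return {edge for edge, n in counts.items() if n >= min_frequency}


# Hypothetical history: two versions, each with one changed method.
history = [
    [([("File", "recv", "read"), ('"a.txt"', "para", "read")],
      [("IO", "recv", "read"), ('"b.txt"', "para", "read")])],
    [([("File", "recv", "read"), ("path", "para", "read")],
      [("IO", "recv", "read"), ("path", "para", "read")])],
]

frequent = mine_patterns(history, min_frequency=2)
print(sorted(frequent))
```

Only the edges shared by both change graphs survive the frequency threshold, which is the intuition behind extracting the File.read() to IO.read() change as a pattern while dropping arguments that differ from one edit to the next.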

[Detailed processing for mining code change patterns as changes in PDGs]
FIG. 4 is a flowchart of the detailed processing for mining code change patterns as changes in PDGs. The flowchart of FIG. 4 details the code change pattern mining processing S1 of FIG. 2, and processing S20 to S23 in FIG. 4 corresponds to the processing in FIG. 3 that computes the change graph CGm from the change difference of method m. The flowchart of FIG. 4 is described below with specific examples.

FIG. 4, S20: The code change pattern mining processing converts each of the multiple source codes in the development history group 10 into an AST and stores the ASTs in the AST group 50.

FIG. 5 is a diagram showing an example AST for an example source program. The example source program SP10 contains the method call statement send(readLines("a.csv")) inside the block {} of the main method m(). FIG. 5 shows an example AST of this source program SP10. An AST is obtained by parsing the source code according to the syntactic structure defined by the programming language and representing the result as a tree. In general, elements that have no meaning in the programming language, such as whitespace in the source code, are removed from an AST. Each AST node has a one-to-one relationship with a meaningful element of the source code.

In the AST example of FIG. 5, each node begins with an abbreviation for the type of its syntax element. The abbreviations are as follows.
MD: method declaration
BLK: block (Java block)
ES: expression statement
MI: method invocation
A: assignment
SN: simple name
L: literal (fixed value)

The AST of FIG. 5 contains the following nodes, from the top: method declaration MD: m(), block BLK: {...}, expression statement ES: send(...), method invocation MI: send(...), name SN: send, method invocation MI: readLines(...), name SN: readLines, and literal L: "a.csv". For each arrow between nodes, the node the arrow points to is the child node, and the node at the tail of the arrow is the parent node.
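As an illustration, the node list above can be written down as a small tree and printed in top-down order. The class `AstNode` and the helper `preorder` are assumptions, and the exact nesting is inferred from the order of the node list rather than read from FIG. 5 itself.

```python
from dataclasses import dataclass, field


@dataclass
class AstNode:
    kind: str   # abbreviation of the syntax element type, e.g. "MI"
    label: str
    children: list = field(default_factory=list)


def preorder(node, depth=0):
    """Yield (depth, node) pairs from the root downwards, parents before children."""
    yield depth, node
    for child in node.children:
        yield from preorder(child, depth + 1)


# AST of: m() { send(readLines("a.csv")); }
tree = AstNode("MD", "m()", [
    AstNode("BLK", "{...}", [
        AstNode("ES", "send(...)", [
            AstNode("MI", "send(...)", [
                AstNode("SN", "send"),
                AstNode("MI", "readLines(...)", [
                    AstNode("SN", "readLines"),
                    AstNode("L", '"a.csv"'),
                ]),
            ]),
        ]),
    ]),
])

for depth, node in preorder(tree):
    print("  " * depth + f"{node.kind}: {node.label}")
```

Whitespace and punctuation of the source statement do not appear anywhere in the tree; only the meaningful syntax elements become nodes.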

FIG. 4, S21: In the code change pattern mining processing, an AST differencing algorithm computes the correspondence 51 between the ASTs before and after the code change.

FIG. 6 is a diagram showing a specific example of the AST before a code change, the AST after the change, and the correspondence between the two ASTs. The AST before the code change, ASTb, is the same as in FIG. 5. The AST after the code change, ASTa, is the AST of the source code obtained by changing the source code of ASTb as shown in code difference CD1, that is, by replacing the statement whose line begins with "-" with the statement whose line begins with "+".

The AST differencing algorithm combines multiple similarity criteria, such as similarity of node type and similarity of tree structure, to identify pairs of corresponding nodes. For example, the algorithm assigns a correspondence (map edge) to a pair of nodes whose node type similarity and tree structure similarity are both 0.7 or higher. In FIG. 6, the node SN: readLines and the node SN: readCSV are similar in several respects: for example, both are at distance 5 from the AST root node MD: m(), their labels (readLines and readCSV) are similar strings, and both have the same number of child nodes, zero. A map edge indicating the correspondence is therefore assigned between the two nodes. The other two map edges are assigned because of comparable similarity. Diff/TS is one known example of an AST differencing algorithm.
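The thresholded matching can be illustrated with a toy scoring scheme. The sketch below is not Diff/TS and not the algorithm of the embodiment: the two similarity functions, their equal weighting, the greedy matching order, and the post-change node list are all assumptions chosen only to show how a threshold of 0.7 on two scores yields map edges.

```python
from difflib import SequenceMatcher
from typing import NamedTuple


class Node(NamedTuple):
    kind: str      # syntax element type, e.g. "SN"
    label: str
    depth: int     # distance from the AST root
    children: int  # number of child nodes


def type_similarity(a: Node, b: Node) -> float:
    return 1.0 if a.kind == b.kind else 0.0


def structure_similarity(a: Node, b: Node) -> float:
    # Assumed scoring: equal weight for depth, child count and label similarity.
    same_depth = 1.0 if a.depth == b.depth else 0.0
    same_arity = 1.0 if a.children == b.children else 0.0
    label = SequenceMatcher(None, a.label, b.label).ratio()
    return (same_depth + same_arity + label) / 3


def match_nodes(before, after, threshold=0.7):
    """Greedily pair each pre-change node with its best unused post-change node."""
    pairs, unused = [], list(after)
    for b in before:
        candidates = [a for a in unused
                      if type_similarity(b, a) >= threshold
                      and structure_similarity(b, a) >= threshold]
        if candidates:
            best = max(candidates, key=lambda a: structure_similarity(b, a))
            pairs.append((b.label, best.label))
            unused.remove(best)
    return pairs


before = [Node("SN", "send", 4, 0), Node("SN", "readLines", 5, 0), Node("L", '"a.csv"', 5, 0)]
# Hypothetical post-change nodes; only SN: readCSV is named in the text.
after = [Node("SN", "send", 4, 0), Node("SN", "readCSV", 5, 0), Node("L", '"a.csv"', 5, 0)]

map_edges = match_nodes(before, after)
print(map_edges)
```

With this scoring, readLines and readCSV are paired because they share type, depth, and child count, even though their labels are only partly alike, while send and readCSV stay unpaired because their depths differ.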

As described above, the AST differencing algorithm assigns a map edge indicating a correspondence only between nodes that satisfy the correspondence criterion (similarity at or above a certain level).

FIG. 4, S22: The code change pattern mining processing converts the code before and after the change into fine-grained PDGs (fgPDG: fine-grained Program Dependence Graph) and stores them in the fgPDG group 52.

FIG. 7 is a diagram showing an example of an fgPDG converted from source program code. Source program SP11 has the assignment statement n = graph.getName() in the main method m(). A PDG is a graph representation of elements in the program code, such as statements and expressions, and of the dependencies between those elements. In an ordinary PDG, the nodes are statements and conditional expressions, and the dependencies between nodes are data dependencies and control dependencies. In a fine-grained PDG (fgPDG), by contrast, the nodes are expressions and operators, which are finer units than statements and conditional expressions, and the dependencies are subdivisions of data and control dependencies such as receive (recv), parameter (para), define (def), and control (cont).

Unlike an ordinary PDG, a fine-grained PDG can express data and control dependencies between pieces of code finer than statements and conditional expressions, so code change patterns in which only part of a statement or conditional expression changes can also be extracted easily.

The fgPDG converted from the source program code SP11 shown in FIG. 7 has nodes such as graph, getName, =, and n, obtained by subdividing the assignment statement n = graph.getName(), and the data dependencies recv, para, and def between those nodes.
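One simple way to hold such an fgPDG in memory is a list of labelled edges. In the sketch below, the edge from graph to getName with the dependency recv follows the description of the pre-change statement; the endpoints of the para and def edges are assumptions inferred from the dependency names, and the helper names are invented for the example.

```python
from collections import defaultdict

# fgPDG of `n = graph.getName()` as (source node, dependency, destination node).
FGPDG_BEFORE = [
    ("graph", "recv", "getName"),  # graph is the receiver of the call getName()
    ("getName", "para", "="),      # assumed: the call result feeds the assignment
    ("=", "def", "n"),             # assumed: the assignment defines the variable n
]


def nodes_of(edges):
    """List the nodes in order of first appearance."""
    seen = []
    for src, _, dst in edges:
        for node in (src, dst):
            if node not in seen:
                seen.append(node)
    return seen


def outgoing(edges):
    """Group the dependency edges by their source node."""
    table = defaultdict(list)
    for src, dep, dst in edges:
        table[src].append((dep, dst))
    return dict(table)


print(nodes_of(FGPDG_BEFORE))
print(outgoing(FGPDG_BEFORE))
```

A single statement thus becomes four nodes and three dependency edges, which is what allows a pattern to cover only part of a statement.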

FIG. 4, S23: The code change pattern mining processing generates a change graph by adding the map edges between AST nodes to the pre-change and post-change fgPDGs converted from the code before and after the change, and stores the change graph in the change graph group 53.

FIG. 8 is a diagram showing an example of a change graph. The pre-change program code SP11 and the pre-change fgPDGb converted from it are the same as in the example of FIG. 7. In the post-change program code SP12, the assignment statement n = graph.getName() of SP11 has been changed to n = resolve(graph). The pre-change statement n = graph.getName() is an instruction in which the method getName() is invoked on the instance graph and the returned name is assigned to the variable n. In the post-change statement n = resolve(graph), the method resolve takes the expression graph as its argument (parameter). As a result, fgPDGa for the post-change code differs from fgPDGb for the pre-change code in that the data dependency between the argument graph and the method resolve is the parameter dependency para.

The change graph CG1 is a graph in which map edges indicating the correspondence between nodes of the two fgPDGs are added to fgPDGb for the pre-change code and fgPDGa for the post-change code. These map edges are those entries of the node correspondence map, computed between the AST of the pre-change code (ASTb) and the AST of the post-change code (ASTa), neither of which is shown, that connect nodes inside the change graph CG1.
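As a rough illustration of how map edges are restricted to nodes that actually appear in the change graph, the following Python sketch models CG1 with plain sets. The function names, the dictionary layout, the exact edge triples, and the contents of `ast_map` are all assumptions made for illustration; FIG. 8 is not reproduced here, and the specification does not prescribe a data structure.

```python
def nodes_of(edges):
    """Collect every node label that appears as an endpoint of a (src, label, dst) edge."""
    return {node for src, _, dst in edges for node in (src, dst)}


def build_change_graph(before_edges, after_edges, ast_map):
    """Combine two fgPDGs and keep only the AST map pairs whose ends are fgPDG nodes."""
    before_nodes = nodes_of(before_edges)
    after_nodes = nodes_of(after_edges)
    map_edges = {(b, a) for b, a in ast_map if b in before_nodes and a in after_nodes}
    return {"before": before_edges, "after": after_edges, "map": map_edges}


# Assumed edge sets for n = graph.getName() and n = resolve(graph).
fgpdg_b = {("graph", "recv", "getName"), ("getName", "para", "="), ("=", "def", "n")}
fgpdg_a = {("graph", "para", "resolve"), ("resolve", "para", "="), ("=", "def", "n")}

# Assumed AST-level correspondence; "block" has no fgPDG node, so it is dropped.
ast_map = {("=", "="), ("n", "n"), ("graph", "graph"), ("block", "block")}

cg1 = build_change_graph(fgpdg_b, fgpdg_a, ast_map)
```

Keeping the map edges as a separate set makes it easy to ask later which nodes are mapped and which are not, which is the distinction the MIDS conditions rely on.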

A change graph such as the one shown in FIG. 8 is generated for each of the multiple pairs of pre-change and post-change program code in the development history group.

FIG. 4, S24: In the code change pattern mining process, based on the correspondences (map edges) between PDG nodes of the change graphs, those subgraphs that have PDG nodes with map edges and that occur in the change graphs at or above a predetermined frequency are extracted as code change patterns. As a result, one or more code change patterns 11 are generated.

FIG. 9 is a diagram showing an example in which subgraphs occurring at or above a predetermined frequency are extracted from multiple change graphs as a code change pattern. FIG. 9 shows the source programs SP1 and SP2 of versions v1 and v2 from FIG. 1, together with the change graphs CG11 and CG12 for their code before and after the change. The change graph CG11 for version v1 has fgPDGb and fgPDGa for the pre-change assignment statement s=File.read("a.txt") and the post-change assignment statement s=IO.read("b.txt"), and a map edge between the "=" nodes. Likewise, the change graph CG12 for version v2 has fgPDGb and fgPDGa for the pre-change assignment statement u=File.read(path) and the post-change assignment statement u=IO.read(path), and a map edge between the "=" nodes.

The part common to the two change graphs CG11 and CG12 is then extracted as a frequent subgraph, that is, as the code change pattern PTN.
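The idea of keeping only what the change graphs have in common can be sketched with a deliberately simplified stand-in: instead of real frequent-subgraph mining, the Python code below counts labelled edges and keeps those that reach a support threshold. The edge encoding, the function name `frequent_edges`, and the edge sets for CG11 and CG12 are assumptions for illustration only and are not taken from FIG. 9.

```python
from collections import Counter


def frequent_edges(change_graphs, min_support):
    """Keep labelled edges that occur in at least min_support change graphs."""
    counts = Counter(edge for cg in change_graphs for edge in set(cg))
    return {edge for edge, n in counts.items() if n >= min_support}


# Each edge is (side, source, dependence, target); side "map" links before and after.
cg11 = {
    ("before", '"a.txt"', "para", "File.read"),
    ("before", "File.read", "para", "="),
    ("before", "=", "def", "s"),
    ("after", '"b.txt"', "para", "IO.read"),
    ("after", "IO.read", "para", "="),
    ("after", "=", "def", "s"),
    ("map", "=", "map", "="),
}
cg12 = {
    ("before", "path", "para", "File.read"),
    ("before", "File.read", "para", "="),
    ("before", "=", "def", "u"),
    ("after", "path", "para", "IO.read"),
    ("after", "IO.read", "para", "="),
    ("after", "=", "def", "u"),
    ("map", "=", "map", "="),
}

pattern = frequent_edges([cg11, cg12], min_support=2)
```

With these inputs the variable names and arguments differ between the two versions and drop out, while the File.read and IO.read edges into "=" and the map edge survive. A real miner would also have to handle node identity and connectivity, which this sketch ignores.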

As shown in S2 and S3 of FIG. 2, based on the mined code change pattern, the locations in the pre-change program code to which the code change pattern applies are detected (S2), and the code at each detected location is changed according to the code change pattern (S3).

FIG. 10 is a diagram showing an example of detecting a location to which a code change pattern applies and changing the code there. FIG. 10 shows the source program SP4 of version v4 from FIG. 1 and the code change pattern PTN. As the post-change (after) version of SP4 shows, the assignment statement t=File.read("b.txt") in SP4 needs the same change as in versions v1 and v2.

First, in the code of the source program SP4 before the change, the code fragment t=File.read("b.txt"), which matches the pre-change (before) PDG of the code change pattern PTN, is detected as a location to which PTN applies (S2). Next, the code t=File.read("b.txt") at that location is replaced with t=IO.read("b.txt") based on the post-change (after) PDG of PTN. In other words, the node connected to the "=" node that carries the map edge is replaced, from File.read to IO.read.
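The detect-then-replace step can be illustrated on a syntax tree with Python's standard `ast` module (Python 3.9 or later for `ast.unparse`). This is only an analogy: Python stands in for the target language, the matching is done directly on the AST rather than on an fgPDG, and the class name `FileReadToIORead` and the helper `apply_pattern` are hypothetical.

```python
import ast


class FileReadToIORead(ast.NodeTransformer):
    """Rewrite calls File.read(...) into IO.read(...), leaving the arguments untouched."""

    def visit_Call(self, node):
        self.generic_visit(node)  # handle nested calls first
        func = node.func
        if (
            isinstance(func, ast.Attribute)
            and func.attr == "read"
            and isinstance(func.value, ast.Name)
            and func.value.id == "File"
        ):
            # Only the receiver changes; the enclosing assignment node is kept as is.
            func.value = ast.Name(id="IO", ctx=ast.Load())
        return node


def apply_pattern(source):
    """Parse the source, apply the rewrite to every matching call, and print it back."""
    tree = FileReadToIORead().visit(ast.parse(source))
    ast.fix_missing_locations(tree)
    return ast.unparse(tree)


changed = apply_pattern('t = File.read("b.txt")\nn = graph.getName()')
```

The second statement does not match the pattern and passes through unchanged, which mirrors the requirement that only the detected application location is modified.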

As described above, the code change pattern PTN mined from the change graph group can be used to detect missed changes in a program whose code has not yet been changed and to change that code automatically.

[Pattern-based code change in the present embodiment]
FIG. 11 is a diagram showing a configuration example of a development support device that performs pattern-based code change in the present embodiment. The development support device 1 is a server, a personal computer, a tablet terminal, or the like. It has a processor (CPU) 30, a main memory 32, a network interface 34, a bus 36, and an auxiliary storage device 20, which is large-capacity storage. The storage 20 stores a development support program 21, a development history group (source code group) 10, code change patterns 11, a change target program 12 that is systematically changed by the code change patterns, and a changed program 13.

The development support program 21, which supports program development, includes a code change pattern mining program 21A that mines code change patterns, a pattern application location detection program 21B that detects locations in the change target program that match the pre-change subgraph of a code change pattern, and a code change program 21C that changes the code at the pattern application locations within the change target program 12 based on the code change pattern and generates the changed program 13.

The network interface 34 can be accessed from client terminal devices 40 and 42 via a network NW such as the Internet. A user who wants support for program development accesses the development support device 1 from a client terminal device and receives support for mining code change patterns, detecting pattern application locations, and changing code based on the code change patterns.

[Detection of pattern application locations and code change processing]
FIG. 12 is a diagram showing a schematic flowchart of the code change processing in the present embodiment. As described with reference to FIG. 2, the processor 30 executing the development support program detects, based on the code change pattern 11 mined from the fgPDG change graphs, a pattern application location 14 in the change target program 12 that matches the pre-change code of the code change pattern 11. That is, in the process of detecting pattern application locations, the processor searches the fgPDG of the change target program for locations that match the pre-change fgPDG subgraph of the fgPDG code change pattern mined from the fgPDG change graphs.

Next, as shown in FIG. 12, the processor executes the code change based on the code change pattern 11 (S3). This code change is referred to as pattern-based code change. The pattern-based code change S3 is performed on the AST, not on the fgPDG. As described above, an AST is a tree whose nodes correspond one-to-one to the program elements of the source code, so the process of changing a pattern application location into the post-change code of the code change pattern is preferably performed on the AST.

As shown in FIG. 12, the processor generates the AST of the changed code, 13_AST, from the AST of the change target program, 12_AST, which is obtained by converting the source code of the change target program 12, together with the code change pattern 11 and the pattern application location 14 (S3). The AST of the changed code 13_AST is converted into the changed code 13, which is the changed source code, and displayed on the screen of the client terminal. Conversion from source code to an AST and from an AST back to source code is performed by functions that a typical compiler provides.

FIG. 13 is a diagram explaining the detection of pattern application locations and the code change processing on the fgPDG. In FIG. 13, M_L and M_R are the fgPDGs of the method before and after the code change in the fgPDG change graph. L and R inside M_L and M_R are the pre-change and post-change fgPDGs of the code change pattern, each a part (subgraph) of the method's fgPDG. M_G is the fgPDG of a method of the change target program, and G inside M_G is the fgPDG of the pattern application location, a part (subgraph) of the method's fgPDG. M_H is the fgPDG of the method after the code change. Here, L and R denote the left and right sides of the change graph, and G and H denote the state before the change (G) and after the change (H, the letter following G) in the code change.

The pattern application location detection S2 in FIG. 13 indicates that the nodes of the pre-change subgraph (L) of the code change pattern correspond to the nodes of the subgraph (G) in the change target program M_G, that is, that the fgPDG nodes of the two subgraphs match. The pattern-based code change S3 indicates that the subgraph (G) in the change target program M_G is deleted and the post-change subgraph (R) of the code change pattern is added in its place, generating the fgPDG of the method after the code change (M_H).

As described with reference to FIG. 12, the code change processing S3 is performed on the AST. The problem of determining the connection relationships between AST nodes in the AST-based code change processing S3 is therefore explained below using a specific example.

FIG. 14 is a diagram showing a specific example of the pattern application location detection S2 on the fgPDG. As the source code SC of this example shows, the code changes in versions v1 and v2 in the development history group are as follows.
Code change in version v1
- a = m1(p1,p2);
+ a = m2(p1,p4) + p3;
Code change in version v2
- a = m1(r1,r2);
+ a = m2(r1,r4) + r3;
In this notation, "-" marks deleted code and "+" marks added code; that is, the "-" code was changed into the "+" code.

The code change pattern extracted from this code change history is therefore as follows.
- a = m1( , );
+ a = m2( , ) + ;
The fgPDG subgraphs corresponding to this code change pattern are shown as L and R inside the methods M_L and M_R. L is the fgPDG of a = m1, and R is the fgPDG of a = m2.
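To make the step from the two concrete changes to the pattern with blanks more tangible, the Python sketch below compares the v1 and v2 statements token by token and replaces every position where they differ with a hole. This token-level comparison is a simplification chosen for illustration; the specification mines patterns from fgPDG change graphs, and the names `tokenize` and `generalize` and the "_" hole marker are assumptions.

```python
import re


def tokenize(statement):
    """Split a statement into identifiers and single punctuation characters."""
    return re.findall(r"[A-Za-z_]\w*|\S", statement)


def generalize(stmt_a, stmt_b, hole="_"):
    """Keep tokens on which both statements agree and put a hole where they differ."""
    tokens_a, tokens_b = tokenize(stmt_a), tokenize(stmt_b)
    if len(tokens_a) != len(tokens_b):
        raise ValueError("statements have different shapes")
    return [a if a == b else hole for a, b in zip(tokens_a, tokens_b)]


before_pattern = generalize("a = m1(p1,p2);", "a = m1(r1,r2);")
after_pattern = generalize("a = m2(p1,p4) + p3;", "a = m2(r1,r4) + r3;")
```

The holes land exactly where the pattern above shows blanks: two arguments on the deleted side, and two arguments plus the added operand on the added side.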

Meanwhile, the method M_G of the change target code contains a subgraph G (the fgPDG of a = m1) that matches the subgraph L (the fgPDG of a = m1), and G is detected as a pattern application location.

FIG. 15 is a diagram showing, with a specific example, the problem with the pattern-based code change processing S3 on the AST. The processing of S3 on the AST is as follows.
(1) Delete the AST nodes of A(G), which correspond to the fgPDG nodes of G, from A(M_G).
(2) Add the AST nodes of A(R), which correspond to the fgPDG nodes of R, to A(M_G) after the deletion to obtain A(M_H) after the code change.

In FIG. 15, A(M_L) and A(M_R) are the ASTs of the methods M_L and M_R, and A(L) and A(R) are the pre-change and post-change ASTs of the code change pattern within those methods. A(M_G) and A(M_H) are the ASTs of the methods M_G and M_H, respectively. In step (2) of the code change processing S3 above, when the AST nodes of A(R) corresponding to the fgPDG nodes of the post-change subgraph R are added to A(M_G) after the deletion to obtain the post-change AST A(M_H), there are nodes at the boundary between A(M_H) and A(R) whose connection relationships cannot be determined. In the figure, the connections marked "?" are those that are difficult to determine. Specifically, A(R) needs three child nodes to be connected (one for the node "+" and two for the node m2), whereas A(M_H) has only two nodes, p1 and p2, that need a parent node. This happens because, while L and R of the fgPDG in FIG. 14 are isomorphic, A(G) and A(R) of the AST in FIG. 15 are not necessarily the same.

[Code change processing that makes the connection relationships between AST nodes determinable]
In the present embodiment, the AST-based code change processing makes it possible, or easier, to determine the connection relationships between the boundary nodes of the subtree A(R) and the nodes in A(M_H). To this end, in the code change processing, the processor finds, for each AST node in A(R) and A(G) that corresponds to an fgPDG node, the minimum induced subtree for which the node connection relationships at the boundary can be determined more reliably. This minimum induced subtree is called a MIDS (Minimum Insertable/Deletable induced Subtree). Then, instead of A(G) and A(R), the processor deletes the minimum induced subtree MIDS(G) from A(M_G) and adds MIDS(R) to obtain A(M_H).

FIG. 16 is a diagram showing a flowchart of the AST-based code change processing in the present embodiment. In the pattern-based code change (S3), the processor generates the MIDSs (S60), performs the pattern-based code change using the generated MIDS(G) and MIDS(R) (S3), and generates A(M_H) 13_AST as the post-change AST.

In the MIDS generation S60, the processor generates a MIDS for each AST node in A(R) and A(G) that corresponds to an fgPDG node of the code change pattern (S60), from (1) the pre-change PDG (CG_L_PDG) and pre-change AST (CG_L_AST) of the change graph CG from which the code change pattern 11 was extracted, (2) the post-change PDG (CG_R_PDG) and post-change AST (CG_R_AST) of that change graph CG, and (3) the PDG (14_PDG) and AST (14_AST) of the pattern application location 14. The MIDS generation method is described in detail later.

In FIG. 16, there are multiple code change patterns 11, which are the output of process S1, and multiple pattern application locations 14, which are the output of process S2. Furthermore, as shown in FIG. 9, there are also multiple source change graphs CG from which each code change pattern 11 was extracted. Accordingly, the processor executes the pattern-based code change S3 for each of the code change patterns 11, each of the pattern application locations 14, and each of the change graphs CG.

FIG. 17 is a diagram showing a flowchart of the pattern-based code change processing executed for each of the multiple code change patterns, pattern application locations, and change graphs. As described above, the processor repeats the pattern-based code change processing S3, which includes the MIDS generation S60, for each code change pattern 11 (S40), for each pattern application location 14 (S41), and for each change graph CG from which the code change pattern 11 was extracted.

In the pattern-based code change processing S3 with the MIDS generation S60, the processor generates the induced subtrees MIDS (AST) from the PDG and AST of the pattern application location (S51_1), and deletes the MIDS (AST) for each fgPDG node of G from the AST of the pattern application location, A(M_G) (S51_2). The processor then generates the induced subtrees MIDS (AST) from the post-change PDG and AST of the change graph (S52_1), and adds the MIDS (AST) for each fgPDG node of R to the AST of the pattern application location, A(M_G) (S52_2). Finally, the processor determines how the boundary nodes of the added MIDS (AST) connect to the nodes of A(M_G), connects those nodes, and generates the post-change AST, A(M_H) (S53).

The Change graph in S52_1 of FIG. 17 is the graph shown in FIG. 8, in which the pre-change and post-change fgPDG subgraphs are associated by map edges. As described with reference to FIG. 4, an fgPDG is generated from an AST, so the ASTs before and after each change can be obtained from the fgPDG Change graphs from which the code change pattern was extracted.

[Generation of the minimum induced subtree MIDS]
FIG. 18 is a diagram showing a flowchart of the generation of the minimum induced subtree MIDS. Hereinafter, the minimum induced subtree MIDS is abbreviated as the subtree MIDS. The flowchart for generating MIDS(N) for an AST node N consists of the definition S61 of the subtree MIDS and the generation process S62 of MIDS(N).

According to the definition S61, the subtree MIDS(N) of an AST node N is the smallest AST subtree ST that contains the AST node N and satisfies all of the conditions C1 to C3.
Condition C1: the root of the AST subtree ST (MIDS) is a mapped node or an unmapped statement node.
Condition C2: each leaf of the AST subtree ST (MIDS) is a mapped node or an unmapped node that has no child nodes.
Condition C3: at least one mapped node exists in the subtree MIDS(N) of the node N of A(R). Condition C3 is needed when both the root and the leaves are unmapped nodes.

In condition C1, the root of the AST subtree ST (MIDS) is the boundary node on the ancestor side. If the root is a mapped node, a node associated with the root (connected to it by a map edge) exists in A(M_G), so the parent of that associated node in A(M_G) becomes the connection destination of the root of the AST subtree ST (MIDS). Even if the root is unmapped, when it is a statement the parent of the root can sometimes be found by reconstructing the control flow, as described later.

In condition C2, if a leaf is a mapped node, a node associated with the leaf (connected to it by a map edge) exists in A(M_G), so the child nodes of that associated node in A(M_G) become the connection destinations of the leaf of the AST subtree ST (MIDS). If a leaf is an unmapped node with no child nodes, there are no child nodes for that leaf in A(M_G), so its connection relationships do not need to be determined.

If condition C3 is satisfied, the connection destinations of the boundary nodes of the subtree MIDS(N) of a node N in A(R) can be determined by using as a clue the mapped node in A(M_G) that corresponds to a mapped node in MIDS(N).

A mapped node is a node that has a map edge indicating a correspondence, and an unmapped node is a node that has no map edge. In the case of Java (registered trademark), statement nodes are BLK (Block), ES (expression statement), if, while, and so on. A more detailed explanation of statements is given at the end of the specification.

Starting from the AST node N corresponding to an fgPDG node, the processor repeatedly follows parent nodes and child nodes until conditions C1 to C3 of the subtree MIDS hold, and collects the visited nodes to generate the subtree MIDS(N) (S62). The lower part of FIG. 18 shows the subtree MIDS(m2) for the AST node m2. Following the parents of node m2 leads to the mapped node "=", and following the children of node m2 leads to the unmapped nodes "p1" and "p4", which have no child nodes. Likewise, following the children of node "=" leads to the mapped node "a", and following the children of node "+" leads to the unmapped node "p3", which has no child nodes.
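One possible reading of this traversal can be sketched in Python as follows. It is a simplified interpretation, not the method as claimed: the `Node` class, its `mapped` and `statement` flags, and the function `mids` are hypothetical, and the sketch always descends into every child of the chosen root, so for other trees it may return more nodes than a truly minimal induced subtree would.

```python
class Node:
    """AST node with a label, mapped/statement flags, and parent/child links."""

    def __init__(self, label, mapped=False, statement=False, children=()):
        self.label = label
        self.mapped = mapped
        self.statement = statement
        self.children = list(children)
        self.parent = None
        for child in self.children:
            child.parent = self


def mids(start):
    """Collect a subtree around start whose boundary satisfies conditions C1 to C3."""
    if start.mapped:
        return {start}  # a mapped node forms a MIDS by itself
    root = start
    while True:
        # C1: climb until the root is a mapped node or an unmapped statement node.
        while not (root.mapped or root.statement) and root.parent is not None:
            root = root.parent
        members, stack = {root}, list(root.children)
        while stack:
            node = stack.pop()
            members.add(node)
            if not node.mapped:  # C2: mapped nodes and childless nodes end a branch
                stack.extend(node.children)
        # C3: at least one mapped node is required; otherwise widen the subtree.
        if any(n.mapped for n in members) or root.parent is None:
            return members
        root = root.parent


# AST of a = m2(p1, p4) + p3, with "=" and "a" mapped as in the FIG. 18 description.
p1, p4, p3 = Node("p1"), Node("p4"), Node("p3")
m2 = Node("m2", children=[p1, p4])
plus = Node("+", children=[m2, p3])
a = Node("a", mapped=True)
assign = Node("=", mapped=True, children=[a, plus])

labels_m2 = {n.label for n in mids(m2)}
labels_plus = {n.label for n in mids(plus)}
labels_assign = {n.label for n in mids(assign)}
```

For this tree the sketch reproduces the behaviour described in the text: the subtree for m2 reaches up to "=" and down to p1, p4, p3, and a, while the mapped node "=" yields a single-node subtree.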

In the AST shown in the lower part of FIG. 18, the mapped nodes "=" and "a" each satisfy conditions C1 to C3 and therefore each form a subtree MIDS on their own. The subtree MIDS of node "+" is the same as the subtree MIDS of node "m2".

As shown in FIG. 18, an induced subtree (MIDS) differs from an ordinary subtree in that its root node c does not need to include all of the descendant nodes d and e below c; in an induced subtree MIDS, the root node c may include only the descendant node d. In an ordinary subtree, the root node c includes all of the descendant nodes d and e below it.

[Processing for obtaining A(M_H) by pattern-based code change]
FIG. 19 is a diagram showing a flowchart of the processing for obtaining A(M_H) by pattern-based code change. Processes S51, S52, and S53 in FIG. 19 correspond to processes S51_1 and S51_2, processes S52_1 and S52_2, and process S53 in FIG. 17, respectively.

For each AST node N_G corresponding to an fgPDG node of G, the processor obtains MIDS(N_G) and deletes it from A(M_G) (S51). However, the mapped nodes in MIDS(N_G) are not deleted and remain in A(M_G). A(M_G) after this deletion has been completed for all fgPDG nodes of G is called A_d(M_G). The deletion process S51 is explained later using a specific example.

The MIDS(N_G) obtained for the AST nodes N_G corresponding to the fgPDG nodes of G may include several MIDS(N_G) that stand in a containment relationship. In that case, it suffices to delete the MIDS(N_G) with the widest extent from A(M_G). For MIDS(m2), MIDS(=), and MIDS(a) shown in FIG. 18, the widest one, MIDS(m2), is deleted. Any two MIDSs either stand in a containment relationship or are independent and do not intersect; a relationship in which they intersect without one containing the other does not occur. Two MIDSs intersect when parts of them overlap.
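Because any two MIDSs are either nested or disjoint, picking the ones to delete reduces to discarding every MIDS that is strictly contained in another. A minimal Python sketch, with each MIDS represented as a set of node labels and the function name `maximal_subtrees` chosen for illustration:

```python
def maximal_subtrees(subtrees):
    """Return the subtrees that are not strictly contained in any other subtree."""
    unique = {frozenset(s) for s in subtrees}
    return [s for s in unique if not any(s < other for other in unique)]


# The three subtrees named in the text, using the node labels of FIG. 18.
mids_assign = {"="}
mids_a = {"a"}
mids_m2 = {"=", "a", "+", "m2", "p1", "p4", "p3"}

to_process = maximal_subtrees([mids_assign, mids_a, mids_m2])
```

Here MIDS(=) and MIDS(a) are both contained in MIDS(m2), so only MIDS(m2) remains. Duplicates, such as MIDS(+) being identical to MIDS(m2), collapse automatically because the sets are compared by content.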

Next, for each AST node N_R corresponding to an fgPDG node of R, the processor obtains MIDS(N_R) and adds it to A_d(M_G) (S52). At this time, each mapped node that was left undeleted in A_d(M_G) is replaced with the corresponding mapped node in MIDS(N_R). The addition process S52 is also explained later using a specific example.

Then, for each added MIDS(N_R), the processor determines how its boundary nodes (root node and leaf nodes) connect to the nodes of A_d(M_G) and connects the nodes (S53). This connection determination process is also explained later using a specific example.

[Specific examples of the deletion and addition processes]
FIG. 20 is a diagram explaining the deletion process S51 using a specific example. FIG. 21 is a diagram explaining the addition process S52 using a specific example. Both explanations are based on the specific example of FIG. 15.

As shown in FIG. 20, the processor deletes from A(M_G) the MIDS(N_G) for each AST node N_G of A(G) to generate A_d(M_G) (S51). In the specific example of FIG. 20, the nodes "=" and "a" in A(G) are mapped nodes and each satisfies the MIDS conditions by itself, so the processor generates MIDS(=), which has only the node "=", and MIDS(a), which has only the node "a", as the subtrees to delete from A(M_G). Because "=" and "a" are mapped nodes, however, they are not deleted from A(M_G) and remain in A_d(M_G) (S51_a). The processor also generates MIDS(m1) for the node "m1" in A(G), as shown at the lower left of FIG. 20, and deletes from A(M_G) the nodes "m1", "p1", and "p2", that is, the nodes of MIDS(m1) other than the mapped nodes "=" and "a" (S51_b). As a result, A_d(M_G) at the lower right of FIG. 20 is generated.

As shown in FIG. 21, the processor adds to A_d(M_G) the MIDS(N_R) for each AST node N_R of A(R) to generate A(M_H) (S52). In the specific example of FIG. 21, the nodes "=" and "a" in A(R) are mapped nodes and each satisfies the MIDS conditions by itself, so the processor generates MIDS(=), which has only the node "=", and MIDS(a), which has only the node "a", and adds them to A_d(M_G). Because "=" and "a" are mapped nodes, they replace the corresponding nodes "=" and "a" in A_d(M_G) (S52_a). The processor further generates MIDS(+) for the node "+" and MIDS(m2) for the node "m2"; since the two MIDSs are identical, it adds either one to A_d(M_G) to generate A(M_H) (S52_b). In this addition as well, the mapped nodes ("=" and "a") replace their counterparts.

In the specific example of FIGS. 20 and 21, the root node "=" of MIDS(+) and MIDS(m2) is a mapped node, so the processor replaces it with the corresponding mapped node and thereby connects the root node "=" to the parent node "..." in A_d(M_G). The leaf nodes "p1", "p3", and "p4" in MIDS(N_R), on the other hand, have no child nodes, so no leaf-node connection arises. The process S53 of determining the connection relationship of boundary nodes is therefore unnecessary in this example. That process is described in detail below on the basis of other specific examples.

[Specific examples of the process of determining the connection relationship of boundary nodes]
The process of determining the connection relationship of the boundary nodes of the MIDS(N_R) of A(R) added to A_d(M_G) is described below for each of the following four kinds of boundary node.
(a) The root node is a mapped node.
In this case, the connection is determined from the connection relationship of the node in A_d(M_G) that corresponds to the mapped node.
(b) The root node is an unmapped statement node.
In this case, the connection is determined by the control-flow reconstruction described later.
(c) The leaf node is a mapped node.
In this case, the connection is determined from the connection relationship of the node in A_d(M_G) that corresponds to the mapped node.
(d) The leaf node has no child node. (No connection is needed in this case, so it is not described further.)
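The four kinds of boundary node can be told apart with a small dispatch function. This is a hedged sketch: `boundary_case` and its flag parameters are hypothetical names, and it assumes that an unmapped leaf of the MIDS has no child nodes, which is how case (d) is read here.

```python
def boundary_case(is_root, is_mapped, is_statement=False):
    """Return which of cases (a)-(d) applies to a boundary node of MIDS(N_R)."""
    if is_root:
        if is_mapped:
            return "a"  # connect via the corresponding node in A_d(M_G)
        if is_statement:
            return "b"  # connect via control-flow reconstruction
        raise ValueError("unmapped, non-statement root is not covered by cases (a)-(d)")
    # Leaf node: mapped leaves need a connection, childless leaves do not.
    return "c" if is_mapped else "d"


print(boundary_case(is_root=True, is_mapped=False, is_statement=True))
```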

[(a) When the root node is a mapped node]
FIG. 22 shows the process of determining the connection relationship of a boundary node in case (a), where the root node is a mapped node. FIGS. 23 and 24 each show a specific example of that process.

In FIG. 22, (a) when the root node N_r of the added MIDS(N_R) is a mapped node, the processor does the following.
(a-1) The parent node, in A_d(M_G), of the node corresponding to the root node N_r of the MIDS is connected so that it is also the parent node of N_r in A(M_H).
(a-2) If the node in A_d(M_G) corresponding to the root node N_r of the MIDS has a child node N_c, and N_c can be added as a child of N_r in A(M_H), N_c is connected as a child node of N_r. If N_c cannot be added, either the nodes at and below N_c are overwritten with the child node of the root node N_r, or the code is treated as not modifiable and the process ends (S53_1).

In the specific example S53_1(1) of FIG. 23, the code change pattern is if( ){ } before the change and if( ){m1(); m2(p1,p2);} after the change. In this case, the processor adds the illustrated MIDS(N_R) to A_d(M_G), from which the MIDS(N_G) (not shown; it contains only "if") has been deleted while the mapped node "if" has been left, and thereby generates A(M_H).

In this example, the processor determines the following connections for the root node "if" of MIDS(N_R).
(a-1) The parent node of the root node N_r "if" of MIDS(N_R) is the node "while", which is the parent of the node "if" in A_d(M_G) corresponding to the root node "if".
(a-2) The child nodes of the root node N_r "if" of MIDS(N_R) are determined as follows. The node "if" in A_d(M_G) corresponding to N_r has a child node N_c "cond", whereas the root node "if" of MIDS(N_R) has the statement "BLK" as a child but no condition node. The child node "cond" in A_d(M_G) is therefore added as a child node of the root node N_r "if".

In the specific example S53_1(2) of FIG. 24, the code change pattern and so on are the same as in FIG. 23. Here, however, the node "if" in A_d(M_G) corresponding to the root node N_r "if" of MIDS(N_R) has a child node "cond", while the root node N_r "if" of MIDS(N_R) has as child nodes both the statement "BLK" and a condition node "cond2". In this case, the processor determines the following connections for the root node N_r "if" of MIDS(N_R).
(a-1) Same as in FIG. 23.
(a-2) This corresponds to the case in FIG. 22 where the child cannot be added. The node "cond" at and below N_c in A_d(M_G) is overwritten with the child node "cond2" of the root node N_r "if" in MIDS(N_R). Alternatively, a comment indicating that the code cannot be modified is output.
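Rule (a-2) and the two examples of FIGS. 23 and 24 can be sketched as a merge of child slots. Modeling the children of "if" as named slots ("cond", "body") is an assumption made for illustration, as are the names `connect_mapped_root` and `CannotModify`; the text itself only says that N_c is added when it "can be added".

```python
class CannotModify(Exception):
    """Raised when the code change is abandoned instead of overwriting."""


def connect_mapped_root(target_children, mids_children, overwrite=True):
    """Sketch of (a-2): decide the children of the mapped root N_r in A(M_H).

    target_children -- slot -> child of the corresponding node in A_d(M_G)
    mids_children   -- slot -> child of the root N_r inside MIDS(N_R)
    """
    merged = dict(mids_children)
    for slot, n_c in target_children.items():
        if slot not in merged:
            merged[slot] = n_c  # the slot is free, so N_c can be added
        elif not overwrite:
            raise CannotModify(slot)
        # otherwise the MIDS child already in `merged` overwrites N_c
    return merged


fig23 = connect_mapped_root({"cond": "cond"}, {"body": "BLK"})
fig24 = connect_mapped_root({"cond": "cond"}, {"cond": "cond2", "body": "BLK"})
print(fig23, fig24)
```

In the FIG. 23 situation the existing "cond" is carried over; in the FIG. 24 situation "cond2" wins, or `CannotModify` is raised when overwriting is disabled.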

[(b) When the root node is an unmapped statement node]
FIG. 25 shows the process of determining the connection relationship of a boundary node in case (b), where the root node is an unmapped statement node. FIG. 26 shows a specific example of that process.

In FIG. 25, (b) when the root node N_r of the added MIDS(N_R) is an unmapped statement node, the processor reconstructs the control flow as follows.
(b-1) Let N_m be a mapped node that is a descendant of the root node N_r in MIDS(N_R).
(b-2) In A_d(M_G), let N_ac be the nearest ancestor control node of the mapped node N_m_1 that corresponds to the mapped node N_m of MIDS(N_R). Here:
・A control node is a block node or a node that expresses conditional branching or iteration, such as if, while, or do.
・If A_d(M_G) cannot be traversed because MIDS(N_G) has been deleted, A(M_G) is traversed instead to identify the control node satisfying the above condition as N_ac.
(b-3) In A(M_H), the root node N_r is connected as a child node of the control node N_ac (S53_2).

If more than one N_m exists in (b-1), the node nearest to N_r is chosen as N_m and the control flow is reconstructed with it. The parent-child relationships from N_m up to N_ac in A_d(M_G) are ignored in A(M_H), that is, they are not recreated (S53_2).

In the control-flow reconstruction described above, the processor detects a mapped node N_m that is a descendant of the root node N_r in MIDS(N_R); detects, in A_d(M_G), the control node N_ac that is the nearest ancestor of the mapped node N_m_1 corresponding to N_m; and connects the control node N_ac in A_d(M_G) to the root node N_r in MIDS(N_R) as its parent node.

In the specific examples S53_2 and S53_3 of FIG. 26, the code change pattern is m1( ); before the change and if(cond){m1();} after the change. The processor adds MIDS(N_R) to A_d(M_G), from which MIDS(N_G) (containing only m1) has been deleted while the mapped node m1 has been left, and thereby generates A(M_H). In doing so, the processor reconstructs the control flow as follows.
(b-1) In MIDS(N_R), the mapped node "m1", a descendant of the root node N_r "If", is taken as N_m.
(b-2) The control node "while", the nearest ancestor of the mapped node N_m_1 "m1" in A_d(M_G) corresponding to the mapped node N_m "m1" in MIDS(N_R), is taken as N_ac.
(b-3) In A(M_H), the root node N_r "If" of MIDS(N_R) is connected as a child node of N_ac "while". The parent-child relationships from the mapped node N_m "m1" to the control node N_ac "while" in A_d(M_G) are ignored in A(M_H).
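The control-flow reconstruction of case (b) can be sketched on the FIG. 26 example. The child-to-parent dictionaries, the `kind` table, the contents of `CONTROL_KINDS` (chosen from the block, if, while, and do nodes named in the text), and the helper names are assumptions made for illustration.

```python
CONTROL_KINDS = {"Block", "IfStatement", "WhileStatement", "DoStatement"}


def nearest_control_ancestor(node, parent, kind):
    """Walk up from `node` and return the first ancestor that is a control node."""
    cur = parent.get(node)
    while cur is not None:
        if kind.get(cur) in CONTROL_KINDS:
            return cur
        cur = parent.get(cur)
    return None


def find_n_ac(n_m1, a_d_parent, a_m_g_parent, kind):
    """(b-2): search A_d(M_G) first, then fall back to A(M_G) if it cannot be traversed."""
    n_ac = nearest_control_ancestor(n_m1, a_d_parent, kind)
    if n_ac is None:
        n_ac = nearest_control_ancestor(n_m1, a_m_g_parent, kind)
    return n_ac


# FIG. 26 example: `m1();` inside a while loop becomes `if (cond) { m1(); }`.
kind = {"while": "WhileStatement", "If": "IfStatement", "m1": "MethodInvocation"}
a_m_g = {"while": None, "m1": "while"}
a_d = dict(a_m_g)  # m1 is a mapped node, so it is left in A_d(M_G)
mids_r = {"If": None, "cond": "If", "m1": "If"}

n_ac = find_n_ac("m1", a_d, a_m_g, kind)
# (b-3): attach the root "If" under N_ac; the old m1 -> while edge is not recreated.
a_m_h = {**a_d, **mids_r, "If": n_ac}
print(n_ac, a_m_h)
```

Overwriting the entry for "m1" with its parent inside the MIDS is what drops the old parent-child relationship between N_m and N_ac.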

[(c) When the leaf node is a mapped node]
FIG. 27 shows the process of determining the connection relationship of a boundary node in case (c), where the leaf node is a mapped node. FIGS. 28, 29, and 30 show specific examples of that process. As described above, a MIDS is an induced subtree, so unlike an ordinary subtree, a node in A_d(M_G) may be connected as a child node to a leaf node of the MIDS. The connection process for case (c), where the leaf node is a mapped node, must therefore be considered.

In FIG. 27, (c) when a leaf node N_l of the added MIDS(N_R) is a mapped node, the processor executes the following.
(c-1) The leaf node N_l is connected so that the child nodes of the mapped node in A_d(M_G) corresponding to N_l are also child nodes of N_l in A(M_H).
(c-2) If the following condition is satisfied, the leaf node N_l in A(M_H) is replaced with the qualified-name (qualifier + simple name) node group of N_l_1, the mapped node in A(M_G) corresponding to N_l.
・Condition: in A(M_G), the mapped node N_l_1 corresponding to the leaf node N_l is a child node of a qualifier (qualifiers may be chained).
(c-3) Where the condition of (c-2) is not satisfied, the relationship between N_l_1 and the parent of N_l_1 in A_d(M_G) need not be explicitly reflected in A(M_H). (The leaf node N_l of MIDS(N_R) is exchanged with N_l_1 of A_d(M_G), and the parent-child relationship of the root node N_r is added by its own connection process, so nothing needs to be done here.)

In the specific example S53_4(c-1) of FIG. 28, the code change pattern is if (cond) {m1( );} before the change and do if(cond){m1();} after the change. The processor adds MIDS(N_R)1 to A_d(M_G), obtained by deleting MIDS(N_G) "If" from A(M_G) while leaving its mapped node "If", and thereby generates A(M_H). In this case, the processor performs the following processing, shown as S53_4(c-1) in FIG. 30.
For the node "If", which is a leaf node of MIDS(N_R)1:
・The child nodes BLK and cond of the node "If" in A_d(M_G) that corresponds to the leaf node "If" of MIDS(N_R)1 are connected so that they are also child nodes of the leaf node "If" in A(M_H).
・The parent node While in A_d(M_G) is connected, through the control-flow reconstruction used to determine root-node connections, so that in A(M_H) it becomes the parent node of the root node "Do" of MIDS(N_R)1.

In the specific example S53_4(c-2) of FIG. 29, the code change pattern is if (cond) {m2(p2);} before the change and if(cond){m3(p2);} after the change. The processor adds MIDS(N_R)2 to A_d(M_G), obtained by deleting MIDS(N_G) "ES～p1" from A(M_G) while leaving the mapped node "p1", and thereby generates A(M_H). In this case, the processor performs the following processing, shown as S53_4(c-2) in FIG. 30.
For the leaf node N_l "p2" of MIDS(N_R)2:
In A(M_G), the parent node of the node N_l_1 "p1", which corresponds to the leaf node N_l "p2" of MIDS(N_R)2, is a qualifier. In A(M_H), therefore, the leaf node N_l "p2" of MIDS(N_R)2 is replaced with the qualified-name node group (qualifier + qualifier + p1) of the node N_l_1 "p1" of A(M_G).

In JAVA (registered trademark), qualifier + qualifier + p1 and qualifier + qualifier + p2 denote different method calls, so the qualifier + qualifier + p1 of A(M_G) is preserved.
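The qualified-name handling of (c-2) and (c-3) can be sketched as a walk up the chain of qualifier nodes. The child-to-parent dictionary, the `kind` table with the value "qualifier", the node labels "q1" and "q2", and the helper name `qualified_name_group` are hypothetical and chosen only for this illustration.

```python
def qualified_name_group(n_l, n_l1, a_m_g_parent, kind):
    """Return the node group that stands in for leaf N_l in A(M_H).

    n_l          -- mapped leaf node of MIDS(N_R)
    n_l1         -- node in A(M_G) corresponding to n_l
    a_m_g_parent -- child-to-parent dict of A(M_G)
    kind         -- node label -> node kind; qualifiers are tagged "qualifier"
    """
    chain = []
    cur = a_m_g_parent.get(n_l1)
    while cur is not None and kind.get(cur) == "qualifier":
        chain.append(cur)  # qualifiers may be chained
        cur = a_m_g_parent.get(cur)
    if not chain:
        return [n_l]  # (c-3): nothing to do, the leaf stays as it is
    # (c-2): outermost qualifier first, then the simple name from A(M_G)
    return chain[::-1] + [n_l1]


parent = {"m2": "BLK", "q1": "m2", "q2": "q1", "p1": "q2"}
kind = {"q1": "qualifier", "q2": "qualifier"}
group = qualified_name_group("p2", "p1", parent, kind)
print(group)
```

With two chained qualifiers above "p1", the leaf "p2" is replaced by the group qualifier + qualifier + p1, matching the FIG. 29 example.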

In this example, for the root node N_r of MIDS(N_R)2, the processor connects the control node "BLK" of A(M_H) as the parent node through the control-flow reconstruction (b-3). However, the control node N_ac "BLK" cannot be reached from the leaf node N_l_1 "p1" in A_d(M_G), so the processor traverses A(M_G) from N_l_1 "p1" to find "BLK" as N_ac, as described in (b-2) of S53_2 in FIG. 25.

As described above, according to the present embodiment, when code is modified by a code change pattern, the induced subtrees MIDS(N_G) and MIDS(N_R) are generated from the pre-change AST (A(G)) corresponding to the pre-change fgPDG subgraph of the fgPDG code change pattern and from the post-change AST (A(R)) corresponding to the post-change fgPDG subgraph. MIDS(N_G) and MIDS(N_R) are then used to modify the pre-modification AST (A(M_G)) into the post-modification AST (A(M_H)). Using MIDS(N_G) and MIDS(N_R) makes it possible to detect the connection relationships of nodes in the code modification.

FIG. 31 shows an example user screen of the development support device 1 performing a systematic edit in the present embodiment. Mining of code change patterns from the development history group 10 has already been completed, and the code change pattern group is stored in storage. The figure shows the screen of the client terminal when, in this state, the user accesses the development support device 1 from the client terminal device 40 and runs the development support program.

On screen S70, when the user selects "Ana.java" as the file of program code to be edited in order to perform a systematic edit, the code of a method "void anaFunc(Arg arg)" in the modification-target program code is displayed. The user then clicks the search button "search", which requests a search for code change patterns that match the program code to be edited.

On screen S71, in response to the click on the search button, the processor searches each pattern in the code change pattern group, detects three matching code change patterns, "Pattern title 1" to "Pattern title 3", and displays them. When the user then selects "Ana.java", which matched "Pattern title 1", the processor presents its proposal in the right-hand pane, showing the code before the change (left) and the code after the change (right) with the changed portions of both highlighted.

When the user has checked the changed portions of both versions and clicks the accept button "Accept" to approve the code change, the processor automatically performs the proposed code change.

[Statements, expressions, and other elements in JAVA (registered trademark)]
Examples of JAVA (registered trademark) Statements, Expressions, and the remaining AST elements are listed below. Control nodes are underlined in the original publication.

Statement
AssertStatement, Block, BreakStatement, ConstructorInvocation, ContinueStatement, DoStatement, EmptyStatement, EnhancedForStatement, ExpressionStatement, ForStatement, IfStatement, LabeledStatement, ReturnStatement, SuperConstructorInvocation, SwitchCase, SwitchStatement, SynchronizedStatement, ThrowStatement, TryStatement, TypeDeclarationStatement, VariableDeclarationStatement, WhileStatement
Expression
Annotation, ArrayAccess, ArrayCreation, ArrayInitializer, Assignment, BooleanLiteral, CastExpression, CharacterLiteral, ClassInstanceCreation, ConditionalExpression, FieldAccess, InfixExpression, InstanceofExpression, LambdaExpression, MethodInvocation, MethodReference, Name, NullLiteral, NumberLiteral, ParenthesizedExpression, PostfixExpression, PrefixExpression, StringLiteral, SuperFieldAccess, SuperMethodInvocation, ThisExpression, TypeLiteral, VariableDeclarationExpression
AST elements other than the above
AnonymousClassDeclaration, BodyDeclaration, CatchClause, Comment, CompilationUnit, Dimension, ImportDeclaration, MemberRef, MemberValuePair, MethodRef, MethodRefParameter, Modifier, ModuleDeclaration, ModuleDirective, ModuleModifier, PackageDeclaration, TagElement, TextElement, Type, TypeParameter, VariableDeclaration
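The three lists above can be turned into a small lookup, for instance to decide whether an unmapped root qualifies as a statement node in case (b). This is a hedged sketch: the element names are copied from the lists, while the function name `category` and the returned strings are hypothetical.

```python
STATEMENTS = frozenset("""
AssertStatement Block BreakStatement ConstructorInvocation ContinueStatement
DoStatement EmptyStatement EnhancedForStatement ExpressionStatement ForStatement
IfStatement LabeledStatement ReturnStatement SuperConstructorInvocation SwitchCase
SwitchStatement SynchronizedStatement ThrowStatement TryStatement
TypeDeclarationStatement VariableDeclarationStatement WhileStatement
""".split())

EXPRESSIONS = frozenset("""
Annotation ArrayAccess ArrayCreation ArrayInitializer Assignment BooleanLiteral
CastExpression CharacterLiteral ClassInstanceCreation ConditionalExpression
FieldAccess InfixExpression InstanceofExpression LambdaExpression MethodInvocation
MethodReference Name NullLiteral NumberLiteral ParenthesizedExpression
PostfixExpression PrefixExpression StringLiteral SuperFieldAccess
SuperMethodInvocation ThisExpression TypeLiteral VariableDeclarationExpression
""".split())

OTHERS = frozenset("""
AnonymousClassDeclaration BodyDeclaration CatchClause Comment CompilationUnit
Dimension ImportDeclaration MemberRef MemberValuePair MethodRef MethodRefParameter
Modifier ModuleDeclaration ModuleDirective ModuleModifier PackageDeclaration
TagElement TextElement Type TypeParameter VariableDeclaration
""".split())


def category(element_name):
    """Classify an AST element name according to the three lists above."""
    if element_name in STATEMENTS:
        return "Statement"
    if element_name in EXPRESSIONS:
        return "Expression"
    if element_name in OTHERS:
        return "Other"
    raise KeyError(element_name)


print(category("IfStatement"), category("MethodInvocation"), category("CatchClause"))
```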

1: Development support device
10: Development history group
11: Code change pattern group
12: Program to be modified
13: Modified program
21: Development support program
21A: Code change pattern mining program
21B: Pattern application location detection program
21C: Code change program
S1: Pattern mining process
S2: Pattern application location detection process
S3: Pattern-based code change
A(M_L): Pre-change program code
A(M_R): Post-change program code
L: Pre-change subgraph
R: Post-change subgraph
A(M_G): Modification-target AST
A(M_R): Post-change AST
MIDS(N_G): Modification-target induced subtree
MIDS(N_R): Post-change induced subtree

Claims

A code modification method for modifying modification-target program code based on a code change pattern having a pre-change subgraph and a post-change subgraph, the subgraphs being extracted from a pre-change program dependence graph and a post-change program dependence graph of pre-change program code and post-change program code, the method comprising:
in each of a modification-target AST, which is an abstract syntax tree (hereinafter, AST) of the modification-target program code matching the pre-change subgraph, and a post-change AST, which is the AST of the post-change program code having the post-change subgraph, identifying a modification-target induced subtree and a post-change induced subtree, each having a node of the pre-change subgraph or the post-change subgraph and having, as a root node or a leaf node, a mapped node for which a correspondence exists between the modification-target AST and the post-change AST;
deleting the modification-target induced subtree from the modification-target AST;
adding the post-change induced subtree to the modification-target AST after the deletion; and
connecting a boundary node in the post-change induced subtree with a node in the modification-target AST after the deletion.

The code modification method according to claim 1, wherein the connecting includes connecting a mapped root node in the post-change induced subtree to the parent node, being a node in the modification-target AST, of the mapped root node in the modification-target induced subtree.

The code modification method according to claim 2, wherein the connecting includes connecting the mapped root node in the post-change induced subtree to a child node, being a node in the modification-target AST, of the mapped root node in the modification-target induced subtree.

The code modification method according to claim 1 or 2, wherein the connecting includes connecting a mapped leaf node in the post-change induced subtree to a child node, being a node in the modification-target AST, of the mapped leaf node in the modification-target induced subtree.

The code modification method according to claim 1, wherein the connecting includes replacing a mapped leaf node in the post-change induced subtree with a qualified-name node group, having a qualifier and a simple name, of the node in the modification-target AST that is associated with the mapped leaf node in the post-change induced subtree.

The code modification method according to claim 1, wherein
the identifying of the post-change induced subtree includes identifying a post-change induced subtree having, as its root node, an unmapped statement node for which no correspondence exists, and
the connecting includes:
identifying a first mapped node that is a descendant of the unmapped statement node in the post-change induced subtree;
identifying a control node that is an ancestor of a second mapped node, in the modification-target AST, corresponding to the first mapped node; and
connecting the unmapped statement node to the control node.

A code modification program for modifying modification-target program code based on a code change pattern having a pre-change subgraph and a post-change subgraph, the subgraphs being extracted from a pre-change program dependence graph and a post-change program dependence graph of pre-change program code and post-change program code, the program causing a computer to execute a process comprising:
in each of a modification-target AST, which is an abstract syntax tree (hereinafter, AST) of the modification-target program code matching the pre-change subgraph, and a post-change AST, which is the AST of the post-change program code having the post-change subgraph, identifying a modification-target induced subtree and a post-change induced subtree, each having a node of the pre-change subgraph or the post-change subgraph and having, as a root node or a leaf node, a mapped node for which a correspondence exists between the modification-target AST and the post-change AST;
deleting the modification-target induced subtree from the modification-target AST;
adding the post-change induced subtree to the modification-target AST after the deletion; and
connecting a boundary node in the post-change induced subtree with a node in the modification-target AST after the deletion.