JP3870937B2

JP3870937B2 - Arithmetic processing device and arithmetic processing method

Info

Publication number: JP3870937B2
Application number: JP2003271526A
Authority: JP
Inventors: 浩美信方
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2003-07-07
Filing date: 2003-07-07
Publication date: 2007-01-24
Anticipated expiration: 2023-07-07
Also published as: JP2005032034A

Description

The present invention relates to a technique for speeding up arithmetic operations. More specifically, it relates to an arithmetic processing device and an arithmetic processing method for processing in which operation parameters and intermediate values are stored in registers, the stored values are read back during the computation, and the next step of the computation is executed on the values read. The device and method achieve faster computation by making data acquisition from the registers efficient.

In arithmetic processing such as cryptographic processing, an operation based on a given sequence is often executed repeatedly using parameters and intermediate values obtained in the previous step. Such processing uses registers to hold the parameters and intermediate values: values are stored in the registers as appropriate and read back as needed as the computation proceeds.

For example, elliptic curve cryptography (ECC) is a known public key cryptosystem, and its computations use registers to hold parameters and intermediate values. Elliptic curve cryptography with a 160-bit key is said to offer strength equivalent to RSA with a 1024-bit key.

In general, elliptic curve cryptography uses an elliptic curve over a prime field, $y^2 = x^3 + ax + b$ ($4a^3 + 27b^2 \neq 0$), or an elliptic curve over an extension field of characteristic 2, $y^2 + xy = x^3 + ax^2 + b$ ($b \neq 0$). The set of points on such a curve together with the point at infinity (O) forms a finite group under addition, with O as the identity element. Below, addition of points in this finite group is written +. The sum P + Q of two distinct points P and Q in this group is called "point addition", and the sum P + P = 2P of a point with itself is called "point doubling". The operation that computes P + P + ... + P = kP, the point P added k times, is called "scalar multiplication of a point".

It is known that scalar multiplication of a point can be built from point addition and point doubling. Methods for point addition, point doubling, and scalar multiplication in affine coordinates (x, y) and projective coordinates (X, Y, Z), for elliptic curves over prime fields and over extension fields of characteristic 2, are described in IEEE P1363/D13, Standard Specifications for Public Key Cryptography.

In addition, a method has been proposed for fast scalar multiplication using the Montgomery-form elliptic curve over a prime field, $By^2 = x^3 + Ax^2 + x$ ($(A^2 - 4)B \neq 0$), which was introduced for integer factorization (P. Montgomery, "Speeding the Pollard and Elliptic Curve Methods of Factorization", Mathematics of Computation, Vol. 48, No. 177, pp. 243-264 (1987)). Below, this technique is called the Montgomery method for elliptic curves over a prime field.

According to this method, in affine coordinates, let $P_2 = P_1 + P_0$ be the sum of two distinct points $P_0(x_0, y_0)$ and $P_1(x_1, y_1)$. If $P_3(x_3, y_3) = P_1(x_1, y_1) - P_0(x_0, y_0)$ is known, then $x_2$ can be obtained from
$$x_2 = \frac{(x_0 x_1 - 1)^2}{x_3 (x_0 - x_1)^2}$$

Likewise, let $P_2(x_2, y_2) = 2P_0$ be the double of the point $P_0(x_0, y_0)$. Then $x_2$ can be obtained from
$$x_2 = \frac{(x_0^2 - 1)^2}{4 x_0 (x_0^2 + A x_0 + 1)}$$
In this way, point addition and point doubling can be carried out without using the y coordinate.
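
The two x-only formulas above can be checked numerically. The following is a minimal Python sketch, assuming a toy Montgomery curve with hypothetical parameters p = 101, A = 6, B = 1 (chosen only for illustration, not taken from the source). It compares each x-only formula with ordinary affine arithmetic on the same curve; all function names are illustrative.

```python
# Toy parameters (assumptions for illustration): B*y^2 = x^3 + A*x^2 + x over GF(p)
p, A, B = 101, 6, 1          # (A^2 - 4) * B != 0 mod p

def inv(a):
    """Modular inverse via Fermat's little theorem (p is prime, a != 0 mod p)."""
    return pow(a % p, p - 2, p)

def affine_add_x(x0, y0, x1, y1):
    """x-coordinate of (x0, y0) + (x1, y1) for x0 != x1, using the y-coordinates."""
    lam = (y1 - y0) * inv(x1 - x0) % p
    return (B * lam * lam - A - x0 - x1) % p

def affine_double_x(x0, y0):
    """x-coordinate of 2 * (x0, y0) for y0 != 0, using the y-coordinate."""
    lam = (3 * x0 * x0 + 2 * A * x0 + 1) * inv(2 * B * y0) % p
    return (B * lam * lam - A - 2 * x0) % p

def xonly_add(x0, x1, x3):
    """x2 = (x0*x1 - 1)^2 / (x3 * (x0 - x1)^2), where x3 = x(P1 - P0)."""
    return pow(x0 * x1 - 1, 2, p) * inv(x3 * pow(x0 - x1, 2, p)) % p

def xonly_double(x0):
    """x2 = (x0^2 - 1)^2 / (4 * x0 * (x0^2 + A*x0 + 1))."""
    return pow(x0 * x0 - 1, 2, p) * inv(4 * x0 * (x0 * x0 + A * x0 + 1)) % p

# All affine points with y != 0, found by brute force.
points = [(x, y) for x in range(p) for y in range(1, p)
          if (B * y * y - (x**3 + A * x * x + x)) % p == 0]
```

For any point in `points`, `xonly_double(x0)` should agree with `affine_double_x(x0, y0)`, and `xonly_add` should agree with `affine_add_x` whenever `x0 != x1` and the difference point has a nonzero x-coordinate.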

When the arithmetic processing for encryption or decryption based on the elliptic curve cryptography (ECC) described above is executed, an operation based on a given sequence is often repeated using parameters and intermediate values obtained in the previous step. To hold multiple values, the register storage is made up of a plurality of registers, each serving as a separate data storage area.

In arithmetic processing that repeatedly stores data to registers and reads data from them, if the data to be read in the next operation cycle is not stored at the expected register position, the data must first be copied to that position. The copy adds processing time, delays the computation, and lowers computational efficiency.

The present invention was made in view of the above problem. It provides an arithmetic processing device and an arithmetic processing method that, in arithmetic processing that repeatedly stores data to and reads data from registers, use address control to obtain data from the registers efficiently even when a parameter or intermediate value is not stored in the predetermined specific register, thereby achieving high-speed arithmetic processing.

A first aspect of the present invention is an arithmetic processing device comprising: a memory unit having memory blocks each containing a plurality of registers serving as data storage areas; an arithmetic unit that receives data read from a register according to the register's designated address and executes arithmetic processing on the input data; and a control unit that controls data input to and output from the arithmetic unit,
the device further comprising an address control unit that receives a register designation address from the control unit and converts the input address,
wherein, when the address control unit receives from the control unit the address of a predetermined specific register, and the data storage register holding the data to be output to the arithmetic unit is a different register in the same memory block as the specific register, the address control unit converts the input address from the control unit into the designated address of the data storage register and outputs the converted address as the read address for the memory block.

Further, in one embodiment of the arithmetic processing device of the present invention, the address control unit comprises: a latch that stores the address of the data storage register; a match detection unit that detects whether the input address from the control unit matches the address of the predetermined specific register; and switch means that, based on the detection result of the match detection unit, selects either the input address from the control unit or the address of the data storage register stored in the latch and outputs it as the read address for the memory block.

Further, in one embodiment of the arithmetic processing device of the present invention, the address control unit further includes an information storage unit that stores information enabling the address conversion operation only when the data storage register holding the data to be output to the arithmetic unit is a different register in the same memory block as the specific register, and the address control unit performs address conversion only when that enabling information is stored in the information storage unit.
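
The address control unit described in these embodiments (latch, match detection, switch, and enable information) can be modeled behaviorally. The following is a minimal Python sketch, not the patented circuit itself: the class name, field names, and register numbers are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass

@dataclass
class AddressControl:
    """Behavioral model of the address control unit (illustrative names)."""
    specific_reg: int       # address of the predetermined specific register
    latch: int = 0          # address of the register that actually holds the data
    enabled: bool = False   # information storage unit: conversion on or off

    def read_address(self, addr_in: int) -> int:
        # Match detection unit: does the input address name the specific register?
        match = (addr_in == self.specific_reg)
        # Switch: output the latched address only when conversion is enabled
        # and the addresses match; otherwise pass the input address through.
        if self.enabled and match:
            return self.latch
        return addr_in

# Hypothetical setup: the controller always asks for register 0, but the
# operand currently sits in register 4 of the same memory block.
ctrl = AddressControl(specific_reg=0, latch=4, enabled=True)
redirected = ctrl.read_address(0)    # converted to the latched address
untouched = ctrl.read_address(2)     # any other address passes through
```

In this model the data never has to be copied into register 0; only the read address changes, which is the saving the text describes.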

Further, in one embodiment of the arithmetic processing device of the present invention, the arithmetic unit includes a multiplier and an adder that execute Montgomery operations in cryptographic processing; in the memory unit, specific registers in two of the plurality of memory blocks are set as the intended storage areas for the parameters or intermediate values of the Montgomery operation that are to be input to the arithmetic unit in the next operation cycle; and, when the address control unit receives from the control unit the address of a specific register set as such an intended storage area, and the data storage register holding the data to be output to the arithmetic unit is a different register in the same memory block as the specific register, the address control unit converts the input address from the control unit into the designated address of the data storage register and outputs the converted address as the read address for the memory block.

Further, in one embodiment of the arithmetic processing device of the present invention, the arithmetic processing device executes the following MonPro operation:
MonPro(a*, b*)
t = a* × b*
for i = 0 to dl-1
  m = t_0 × P'_0 mod r
  t = (t + m × P) / r
next i
if t ≥ P then return t − P
else return t
and specific registers in two of the plurality of memory blocks are set as the intended storage registers for the parameter a* or b* of the MonPro operation.

Further, a second aspect of the present invention is an arithmetic processing method comprising:
an address generation step of generating a register designation address in a control unit that controls data input to and output from an arithmetic unit;
an address control step of, when the address of a predetermined specific register is received from the control unit and the data storage register holding the data to be output to the arithmetic unit is a different register in the same memory block as the specific register, converting the input address from the control unit into the designated address of the data storage register and outputting the converted address as the read address for the memory block; and
a data read step of reading data from a register of the memory block according to the address controlled in the address control step and outputting the data to the arithmetic unit.

Further, in one embodiment of the arithmetic processing method of the present invention, the address control step further comprises: a match detection step of detecting whether the input address from the control unit matches the address of the predetermined specific register; and an output switching step of selecting, based on the detection result of the match detection step, either the input address from the control unit or the address of the data storage register stored in a latch in advance, and outputting it as the read address for the memory block.

Further, in one embodiment of the arithmetic processing method of the present invention, the address conversion in the address control step is performed only when the data storage register holding the data to be output to the arithmetic unit is a different register in the same memory block as the specific register.

Further, in one embodiment of the arithmetic processing method of the present invention, in the address control step, when the address of a specific register set as the intended storage area for a parameter or intermediate value of the Montgomery operation is received from the control unit, and the data storage register holding the data to be output to the arithmetic unit is a different register in the same memory block as the specific register, the input address from the control unit is converted into the designated address of the data storage register and the converted address is output as the read address for the memory block.

Further, one embodiment of the arithmetic processing method of the present invention includes an operation step of executing the following MonPro operation:
MonPro(a*, b*)
t = a* × b*
for i = 0 to dl-1
  m = t_0 × P'_0 mod r
  t = (t + m × P) / r
next i
if t ≥ P then return t − P
else return t
and, in the address control step, when the address of a specific register set as the intended storage area for the parameter a* or b* of the MonPro operation is received from the control unit, and the data storage register holding the data to be output to the arithmetic unit is a different register in the same memory block as the specific register, the input address from the control unit is converted into the designated address of the data storage register and the converted address is output as the read address for the memory block.

According to the configuration of the present invention, in an arithmetic processing device having a memory unit with memory blocks each containing a plurality of registers as data storage areas, an arithmetic unit that receives data read from a register according to the register's designated address and executes arithmetic processing on the input data, and a control unit that controls data input to and output from the arithmetic unit, address control processing is executed such that, when the data storage register holding the data to be output to the arithmetic unit is a different register in the same memory block as the predetermined specific register, the input address from the control unit is converted into the designated address of the data storage register and the converted address is output as the read address for the memory block. Data can therefore be read and processed with fewer data copies into the predetermined specific register, and faster computation is achieved.

Further, according to the configuration of the present invention, in executing the MonPro operation MonPro(a*, b*) of the Montgomery operation, when the data for the parameter a* or b* to be output is stored in a register other than the specific register set as the intended storage area for that parameter, the input address from the control unit is converted into the designated address of the register that holds the data for a* or b*, and the converted address is output as the read address for the memory block. This reduces the data copying in the MonPro operation, which is executed many times in cryptographic computation, and thus speeds up the cryptographic computation.

The arithmetic processing device and arithmetic processing method of the present invention are described in detail below.

The present invention provides an arithmetic processing device and an arithmetic processing method that, in arithmetic processing that repeatedly stores data to and reads data from registers, use address control to obtain data from the registers efficiently even when a parameter or intermediate value is not stored in the predetermined specific register, thereby achieving high-speed arithmetic processing.

As a concrete example of arithmetic processing that uses registers, the following describes the case of executing elliptic curve cryptography (ECC) computations in a public key cryptosystem.

Cryptosystems fall broadly into common key cryptosystems, in which the sender and receiver of a message use the same key, and public key cryptosystems, in which the sender and receiver use different keys. The two have the following characteristics.
Common key cryptography: encryption and decryption are fast.
Public key cryptography: the computation is complex and the keys are long, so breaking it is very difficult (discrete logarithm problem).

Today, the two are commonly combined to exploit the advantages of each: content is encrypted with a common key cryptosystem, and the key is delivered using a public key cryptosystem. Two public key cryptosystems are representative: RSA, long used in the financial sector, and elliptic curve cryptography, which achieves the same security strength as RSA with a shorter key. Because its keys are short, elliptic curve cryptography requires less computation time than RSA. In recent years, elliptic curve cryptography has attracted attention as a cryptosystem for key exchange and authentication on platforms not connected to financial systems.

For example, in key exchange on an elliptic curve, with a prime number "P", an elliptic curve
E: $y^2 = x^3 + ax + b \pmod{P}$
and a point "G" on E defined, sender A and receiver B each generate a suitable number, "r" and "s" respectively.
A: computes rG and sends its coordinates to B.
B: computes sG and sends its coordinates to A.
After receiving each other's coordinates,
A: computes r(sG) = rsG.
B: computes s(rG) = srG.
By these computations, A and B obtain the shared key "rsG". Even if a malicious third party obtains the transmitted data "rG" and "sG", deriving "rsG" from them is practically impossible.
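
The exchange above can be traced end to end on a very small curve. The following is a minimal Python sketch, assuming the toy curve y^2 = x^3 + 2x + 2 mod 17 with G = (5, 1) and hypothetical private values r = 3 and s = 7; none of these values come from the source, and a curve this small offers no security. It requires Python 3.8 or later for `pow(x, -1, m)`.

```python
# Toy curve (assumption for illustration): y^2 = x^3 + 2x + 2 mod 17, G = (5, 1)
P_MOD, A_COEF = 17, 2
G = (5, 1)

def add(p1, p2):
    """Affine point addition; None stands for the point at infinity O."""
    if p1 is None:
        return p2
    if p2 is None:
        return p1
    (x1, y1), (x2, y2) = p1, p2
    if x1 == x2 and (y1 + y2) % P_MOD == 0:
        return None                                   # P + (-P) = O
    if p1 == p2:
        lam = (3 * x1 * x1 + A_COEF) * pow(2 * y1, -1, P_MOD)
    else:
        lam = (y2 - y1) * pow((x2 - x1) % P_MOD, -1, P_MOD)
    x3 = (lam * lam - x1 - x2) % P_MOD
    return (x3, (lam * (x1 - x3) - y1) % P_MOD)

def mul(k, pt):
    """Scalar multiple k*pt, scanning the bits of k from the most significant."""
    acc = None
    for bit in bin(k)[2:]:
        acc = add(acc, acc)
        if bit == "1":
            acc = add(acc, pt)
    return acc

r, s = 3, 7                      # private values of A and B (hypothetical)
rG, sG = mul(r, G), mul(s, G)    # the only values sent over the channel
key_a = mul(r, sG)               # A computes r(sG)
key_b = mul(s, rG)               # B computes s(rG)
```

Both parties end with the same point because r(sG) and s(rG) are both (rs)G; an eavesdropper sees only `rG` and `sG`.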

The following two operations are defined on the elliptic curve.
Doubling: $(x_2, y_2) = 2(x_1, y_1)$
$x_2 = \alpha^2 - 2x_1$
$y_2 = -y_1 + \alpha(x_1 - x_2)$
where $\alpha = (3x_1^2 + a) / (2y_1)$
Addition: $(x_3, y_3) = (x_2, y_2) + (x_1, y_1)$
$x_3 = \alpha^2 - x_1 - x_2$
$y_3 = -y_1 + \alpha(x_1 - x_3)$
where $\alpha = (y_2 - y_1) / (x_2 - x_1)$
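
As a worked check of the doubling and addition formulas, assume the toy curve $y^2 = x^3 + 2x + 2 \pmod{17}$ and the point $(x_1, y_1) = (5, 1)$ (an illustrative choice, not from the source). Division is multiplication by the modular inverse, and $2^{-1} \equiv 9 \pmod{17}$.

```latex
\begin{aligned}
\text{Doubling: } \alpha &= (3\cdot 5^2 + 2)\,(2\cdot 1)^{-1} \equiv 9 \cdot 9 \equiv 13 \pmod{17},\\
x_2 &= 13^2 - 2\cdot 5 \equiv 6, \qquad y_2 = -1 + 13\,(5 - 6) \equiv 3 \pmod{17}.\\
\text{Addition: } \alpha &= (3 - 1)\,(6 - 5)^{-1} \equiv 2 \pmod{17},\\
x_3 &= 2^2 - 5 - 6 \equiv 10, \qquad y_3 = -1 + 2\,(5 - 10) \equiv 6 \pmod{17}.
\end{aligned}
```

So $2(5, 1) = (6, 3)$ and $(6, 3) + (5, 1) = (10, 6)$, and both results satisfy the curve equation modulo 17.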

With these two operations, scalar multiplication of a point on the elliptic curve becomes possible. For example, when the modulus is L bits long, the computation of
$W = sG$, where $s = \sum_{i=0}^{L-1} s_i \cdot 2^i$,
is
W = G
for i = L-1 to 0
  W := 2 × W
  if s_i = 1 then W := W + G
next i

For example, 200G is computed as
200G = 2{2{2{2{2{2{2G + G}}} + G}}}
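
The doubling and addition pattern in the 200G expansion can be reproduced with a short bit-scanning routine. The following is a minimal Python sketch under two stated assumptions: W starts at G for the leading 1 bit of s and the scan covers only the bits below it (this is what the 200G expansion implies), and plain integers under addition stand in for curve points so the result is easy to check. The function and parameter names are illustrative.

```python
def scalar_mul(s, G, add, double):
    """Left-to-right double-and-add. Returns (s*G, doublings, additions).

    Assumption: W starts at G for the leading 1 bit of s, and the loop
    covers the remaining lower bits, which matches the 200G expansion.
    """
    if s < 1:
        raise ValueError("s must be a positive integer")
    W, n_dbl, n_add = G, 0, 0
    for bit in bin(s)[3:]:            # bits below the leading 1, high to low
        W = double(W)
        n_dbl += 1
        if bit == "1":
            W = add(W, G)
            n_add += 1
    return W, n_dbl, n_add

# Stand-in group: integers under addition, so s*G is ordinary multiplication.
result = scalar_mul(200, 1, add=lambda a, b: a + b, double=lambda a: 2 * a)
```

Since 200 is 11001000 in binary, the routine performs 7 doublings and 2 additions, the same counts as the nested expression. Passing real point-addition and point-doubling functions for `add` and `double` gives the elliptic curve version.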

In actual computation, however, every doubling and addition requires a modular inversion, for which the Euclidean algorithm or modular exponentiation is used, and this takes many operation cycles. For this reason, the usual approach is to map the point into three-dimensional Jacobian coordinates, perform the scalar multiplication there, and finally convert back to two-dimensional affine coordinates. The mapping from affine coordinates (x, y) to Jacobian coordinates (X, Y, Z) is given by the following relations.

$x = X / Z^2$ ... (Equation 1)
$y = Y / Z^3$ ... (Equation 2)
Setting Z = 1 here gives
(X, Y, Z) = (x, y, 1)
which maps the point into Jacobian coordinates.

The formulas for doubling and addition in Jacobian coordinates are as follows.

Doubling: $(X_2, Y_2, Z_2) = 2(X_1, Y_1, Z_1)$
$X_2 = M^2 - 2U \pmod{P}$ ... (Equation 3)
$Y_2 = -8Y_1^4 + M(U - X_2) \pmod{P}$ ... (Equation 4)
$Z_2 = 2Y_1 Z_1 \pmod{P}$ ... (Equation 5)
where $M = 3X_1^2 + aZ_1^4$ and $U = 4X_1 Y_1^2$
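
Equations 3 to 5, together with the back-conversion of Equations 1 and 2, can be exercised on a small example. The following is a minimal Python sketch, assuming the toy curve y^2 = x^3 + 2x + 2 mod 17 and the point (5, 1) (illustrative values, not from the source) and reading the Y term in Equation 4 as Y1 to the fourth power. It requires Python 3.8 or later.

```python
P_MOD, A_COEF = 17, 2    # toy curve y^2 = x^3 + 2x + 2 mod 17 (assumption)

def jac_double(pt):
    """(X2, Y2, Z2) = 2 * (X1, Y1, Z1) per Equations 3 to 5; no inversion needed."""
    X1, Y1, Z1 = pt
    M = (3 * X1 * X1 + A_COEF * pow(Z1, 4, P_MOD)) % P_MOD
    U = (4 * X1 * Y1 * Y1) % P_MOD
    X2 = (M * M - 2 * U) % P_MOD
    Y2 = (-8 * pow(Y1, 4, P_MOD) + M * (U - X2)) % P_MOD
    Z2 = (2 * Y1 * Z1) % P_MOD
    return (X2, Y2, Z2)

def to_affine(pt):
    """Equations 1 and 2: x = X / Z^2, y = Y / Z^3, using a single inversion."""
    X, Y, Z = pt
    z_inv = pow(Z, -1, P_MOD)
    return (X * z_inv**2 % P_MOD, Y * z_inv**3 % P_MOD)

G_jac = (5, 1, 1)            # affine point (5, 1) lifted with Z = 1
D = jac_double(G_jac)        # Jacobian result, still without any inversion
d_affine = to_affine(D)      # back to affine coordinates
```

Here `D` is (7, 7, 2) and `d_affine` is (6, 3), which is the same point that affine doubling of (5, 1) gives on this curve.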

Addition: $(X_3, Y_3, Z_3) = (X_2, Y_2, Z_2) + (X_1, Y_1, Z_1)$
$X_3 = R^2 - TW^2 \pmod{P}$ ... (Equation 6)
$Y_3 = \{R(TW^2 - 2X_3) - SW^3\} / 2 \pmod{P}$ ... (Equation 7)
$Z_3 = Z_1 Z_2 W \pmod{P}$ ... (Equation 8)
where $R = Y_2 Z_1^3 - Y_1 Z_2^3$, $S = Y_2 Z_1^3 + Y_1 Z_2^3$,
$W = X_2 Z_1^2 - X_1 Z_2^2$, and $T = X_2 Z_1^2 + X_1 Z_2^2$.
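
Equations 6 to 8 can be checked the same way. The following is a minimal Python sketch, assuming the toy modulus 17 and two hypothetical Jacobian inputs on the curve y^2 = x^3 + 2x + 2 mod 17: (7, 7, 2), which represents the affine point (6, 3), and (11, 10, 3), which represents (5, 1). These values are illustrative and not from the source. It requires Python 3.8 or later.

```python
P_MOD = 17   # toy modulus (assumption); curve y^2 = x^3 + 2x + 2 mod 17

def jac_add(p2, p1):
    """(X3, Y3, Z3) = (X2, Y2, Z2) + (X1, Y1, Z1) per Equations 6 to 8.

    Assumes the points are distinct and not inverses of each other (W != 0).
    """
    X2, Y2, Z2 = p2
    X1, Y1, Z1 = p1
    a = X2 * Z1**2 % P_MOD                 # X2 * Z1^2
    b = X1 * Z2**2 % P_MOD                 # X1 * Z2^2
    c = Y2 * Z1**3 % P_MOD                 # Y2 * Z1^3
    d = Y1 * Z2**3 % P_MOD                 # Y1 * Z2^3
    W, T = (a - b) % P_MOD, (a + b) % P_MOD
    R, S = (c - d) % P_MOD, (c + d) % P_MOD
    X3 = (R * R - T * W * W) % P_MOD
    half = pow(2, -1, P_MOD)               # division by 2 modulo an odd prime
    Y3 = (R * (T * W * W - 2 * X3) - S * W**3) * half % P_MOD
    Z3 = Z1 * Z2 * W % P_MOD
    return (X3, Y3, Z3)

def to_affine(pt):
    """x = X / Z^2, y = Y / Z^3 (Equations 1 and 2)."""
    X, Y, Z = pt
    z_inv = pow(Z, -1, P_MOD)
    return (X * z_inv**2 % P_MOD, Y * z_inv**3 % P_MOD)

S3 = jac_add((7, 7, 2), (11, 10, 3))   # (6, 3) + (5, 1) in Jacobian form
s3_affine = to_affine(S3)
```

The sum comes out as (12, 15, 12) in Jacobian form and (10, 6) in affine form, matching affine addition of (6, 3) and (5, 1) on this curve. Neither input needs Z = 1.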

These formulas are used to perform the scalar multiplication, and the result is finally converted back to affine coordinates using (Equation 1) and (Equation 2). The inverse of Z is needed at that point, but it only has to be computed once per scalar multiplication, so far fewer cycles are required than when the scalar multiplication is carried out in affine coordinates.

The reductions modulo the prime "P" in (Equation 3) through (Equation 8) above can use the Barrett method or the Montgomery method to reduce the number of operation cycles and shorten the computation time.

In modular multiplication using the Montgomery method, for example, to compute
$c = a \times b \bmod P$,
a value "R" larger than P (usually a power of 2) is defined, and a value "P'" satisfying
$R \cdot R^{-1} - P \cdot P' = 1$ ... (Equation 9)
is computed in advance.

The modular multiplication is then carried out as
$a^* = a \times R \bmod P$ ... (Equation 10)
$b^* = b \times R \bmod P$ ... (Equation 11)
$c^* = \mathrm{MonPro}(a^*, b^*)$ ... (Equation 12)
$c = \mathrm{MonPro}(c^*, 1)$ ... (Equation 13)
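
The reason this sequence yields $c = a \times b \bmod P$ can be written out in two lines, using the standard property of the Montgomery product that $\mathrm{MonPro}(x, y) \equiv x\,y\,R^{-1} \pmod{P}$:

```latex
\begin{aligned}
c^* &= \mathrm{MonPro}(a^*, b^*) \equiv (aR)(bR)R^{-1} \equiv abR \pmod{P},\\
c   &= \mathrm{MonPro}(c^*, 1) \equiv (abR)\cdot 1 \cdot R^{-1} \equiv ab \pmod{P}.
\end{aligned}
```

The intermediate value $c^*$ is again in the scaled form $x \cdot R \bmod P$, so it can be fed directly into further MonPro calls without converting back.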

Used for a single modular multiplication this is inefficient, but when applied to, for example, "scalar multiplication of a point on an elliptic curve", computing
$W = sG$, where $s = \sum_{i=0}^{L-1} s_i \cdot 2^i$,
proceeds as
Xg* = xg × R mod P
Yg* = yg × R mod P
Zg* = 1 × R mod P
W* = (Xw*, Yw*, Zw*) = G* = (Xg*, Yg*, Zg*)
for i = L-1 to 0
  W* := 2 × W*
  if s_i = 1 then W* := W* + G*
next i
Xw = MonPro(Xw*, 1)
Yw = MonPro(Yw*, 1)
Zw = MonPro(Zw*, 1)
xw = Xw / Zw^2 mod P
yw = Yw / Zw^3 mod P
In this way, the parameters are converted before and after the loop, which accounts for most of the operation cycles, and the Montgomery operations inside the loop run on the converted values, so the number of operation cycles is reduced substantially.

The MonPro function used in this computation is defined as follows.
MonPro(a*, b*) ... (Equation 14)
t = a* × b*
m = t × P' mod R
u = (t + m × P) / R
if u ≥ P then return u − P
else return u
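
The MonPro definition of Equation 14 translates almost line for line into code. The following is a minimal Python sketch, assuming toy values P = 97 and R = 128 and operands a = 45, b = 76 (all illustrative, not from the source); the helper name `make_monpro` is hypothetical. It requires Python 3.8 or later for `pow(x, -1, m)`.

```python
def make_monpro(P, R):
    """Build MonPro for an odd modulus P and a power of two R > P."""
    # Equation 9: R*R^-1 - P*P' = 1, hence P' = -P^-1 mod R.
    P_prime = (-pow(P, -1, R)) % R

    def monpro(a_star, b_star):
        t = a_star * b_star
        m = (t * P_prime) % R
        u = (t + m * P) // R       # exact: t + m*P is a multiple of R
        return u - P if u >= P else u

    return monpro

P, R = 97, 128                     # toy values (assumptions)
monpro = make_monpro(P, R)
a, b = 45, 76
a_star = a * R % P                 # Equation 10
b_star = b * R % P                 # Equation 11
c_star = monpro(a_star, b_star)    # Equation 12
c = monpro(c_star, 1)              # Equation 13
```

After these steps `c` equals `a * b % P`. Because R is a power of two, the `% R` and `// R` operations reduce to masking and shifting, which is why the method avoids division by P.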

Alternatively, when the data width of the multiplier is r [bit] and the operand data width is dl [words, 1 word = r bits], it is defined as
MonPro(a*, b*) ... (Equation 15)
t = a* × b*
for i = 0 to dl-1
  m = t_0 × P'_0 mod r
  t = (t + m × P) / r
next i
if t ≥ P then return t − P
else return t

In (Equation 15) above, $t_0$ and $P'_0$ denote the values of word 0 of t and P'. This operation can be implemented either as a program defined in software or in hardware. For portable devices with little CPU power and for devices that require high security, a hardware implementation is desirable.
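
The word-serial form of Equation 15 can be sketched in the same style. The following minimal Python example rests on stated assumptions: the "mod r" and "/ r" steps are interpreted as acting on one word, that is, with radix 2^w for a w-bit word; P is odd and smaller than R = 2^(w·dl); and the inputs are smaller than P. The modulus, word size, and operands are toy values, and the function name is hypothetical. It requires Python 3.8 or later.

```python
def monpro_words(a_star, b_star, P, w, dl):
    """Word-serial MonPro: returns a_star * b_star * R^-1 mod P, with R = 2^(w*dl).

    Assumptions: P is odd, P < R, inputs are < P, and one word is w bits,
    so the per-iteration modulus and divisor are radix = 2^w.
    """
    radix = 1 << w
    P0_prime = (-pow(P, -1, radix)) % radix    # word 0 of P'
    t = a_star * b_star
    for _ in range(dl):
        t0 = t % radix                         # word 0 of t
        m = (t0 * P0_prime) % radix
        t = (t + m * P) >> w                   # low word is now zero; shift it out
    return t - P if t >= P else t

P = (1 << 61) - 1          # toy odd modulus (assumption)
w, dl = 16, 4              # 16-bit words, 4 words, so R = 2^64
R = 1 << (w * dl)
x, y = 123456789, 987654321
z = monpro_words(x, y, P, w, dl)
```

Each iteration needs only a one-word multiplication to find `m`, which is what makes this form a good fit for a fixed-width hardware multiplier. The result matches the single-step definition with R = 2^64.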

FIG. 1 shows an example hardware configuration of an arithmetic processing device according to the present invention that executes the MonPro function MonPro(a*, b*) of the Montgomery operation given by (Equation 15) above.

演算処理装置は、メモリ部１１０、演算部１５０を有し、演算部に入力するデータがメモリ部１１０のレジスタ群のいずれかに格納され、図示しない制御部からのアドレス信号に応じてレジスタからデータが取得され、メモリバススイッチ回路１３０、およびバスを介して演算部１５０に入力され、演算部１５０での演算がなされ、その結果が、バスおよびメモリバススイッチ回路１３０を介してメモリ部１１０内のレジスタに格納される。 The arithmetic processing unit includes a memory unit 110 and a calculation unit 150. Data input to the calculation unit is stored in one of the register groups of the memory unit 110, and data is transferred from the register in accordance with an address signal from a control unit (not shown). Is obtained and input to the arithmetic unit 150 via the memory bus switch circuit 130 and the bus, and the arithmetic unit 150 performs an operation, and the result is stored in the memory unit 110 via the bus and the memory bus switch circuit 130. Stored in a register.

本発明の演算処理装置は、さらにアドレス制御回路２００を有し、図示しない制御回路からのアドレスデータを変換して各メモリブロックに出力する。 The arithmetic processing unit of the present invention further includes an address control circuit 200, which converts address data from a control circuit (not shown) and outputs it to each memory block.

上述のモンゴメリ（Montgomery）演算では、複数回のMonPro（ａ^*，ｂ^*）演算を繰り返し実行することになり、各MonPro（ａ^*，ｂ^*）演算毎にレジスタからのデータ取得、レジスタに対するデータ格納処理が実行される。 In the Montgomery operation described above, a plurality of MonPro (a ^* , b ^* ) operations are repeatedly executed, and data acquisition from the register and data for the register are performed for each MonPro (a ^* , b ^* ) operation. Storage processing is executed.

演算部１５０は、乗算器１５２と加算器１５４，１５７、演算のパラメータや途中結果を格納するラッチ１５１，１５３，１５５，１５６，１５８および、これらを制御する図示しないコントロール回路で構成される。 The arithmetic unit 150 includes a multiplier 152 and adders 154 and 157, latches 151, 153, 155, 156, and 158 for storing operation parameters and intermediate results, and a control circuit (not shown) that controls these.

メモリ部１１０は、６個の単位データ格納領域としてのレジスタを有する４つのメモリブロック：メモリ０，１２０、メモリ１，１２１、メモリ２，１２２、メモリ３，１２３を有する。各メモリブロックは行デコーダ、列デコーダを有し、行デコーダが、入力アドレスに基づいて指定レジスタを選択し、指定レジスタから所定ワード単位のデータを列レジスタを介してメモリバススイッチ回路によって接続されたバスに出力、またはバスからのデータ入力を行う。例えばアドレスは、アドレスＡ０〜Ａ５によって構成され、レジスタ選択は、図示しない制御部から行デコーダに入力されるアドレスＡ３〜Ａ５を用いて実行され、多倍長データのワード選択は、図示しない制御部から列デコーダに入力されるアドレスＡ０〜Ａ２を用いて行われる。 The memory unit 110 includes four memory blocks having registers as six unit data storage areas: memories 0 and 120, memories 1 and 121, memories 2 and 122, and memories 3 and 123. Each memory block includes a row decoder and a column decoder. The row decoder selects a designated register based on an input address, and data in a predetermined word unit is connected from the designated register via the column register by a memory bus switch circuit. Output to the bus or input data from the bus. For example, the address is composed of addresses A0 to A5, the register selection is executed using addresses A3 to A5 inputted to the row decoder from the control unit (not shown), and the word selection of the multiple length data is performed by the control unit (not shown). Is performed using addresses A0 to A2 input to the column decoder.
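The address layout can be illustrated with a short Python sketch. Splitting A5..A3 (row decoder) from A2..A0 (column decoder) follows the text directly. The mapping from a register number to a memory block and row is not stated in the description; the function `locate_register` assumes block = n mod 4 and row = n div 4, which is merely consistent with the placements cited in this description (reg.9 in memory 1, reg.15 in memory 3, reg.5 and reg.6 both at row address 001, reg.14 at (0, 1, 1)).

```python
def split_address(addr: int) -> tuple[int, int]:
    """Split a 6-bit address A5..A0 into (row, word)."""
    if not 0 <= addr < 64:
        raise ValueError("address must fit in 6 bits")
    row = (addr >> 3) & 0b111   # A5..A3 go to the row decoder (register select)
    word = addr & 0b111         # A2..A0 go to the column decoder (word select)
    return row, word


def locate_register(reg: int) -> tuple[int, int]:
    """Assumed mapping (inferred, not stated): reg n -> (block n % 4, row n // 4)."""
    if not 0 <= reg < 24:
        raise ValueError("the configuration of FIG. 1 has 24 registers")
    return reg % 4, reg // 4


print(split_address(0b001010))   # row 1, word 2
print(locate_register(5))        # memory 1, row 1 (address bits 001)
print(locate_register(14))       # memory 2, row 3 (address bits 011)
```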

A multiple-precision multiplication requires setting the multiplier and the multiplicand in the multiplier circuit and retrieving the product, so at least three memory blocks are needed. A multiple-precision addition likewise requires three memory blocks. When, for example, four memory blocks are used, the required data capacity is set based on the largest number of data items, including intermediate values, that must be held at any point in the operation flow. FIG. 1 shows the configuration for the case where the largest number of data items in the elliptic-curve operations is 24.

With this configuration, the following describes a processing example in which the value stored in register 9 (reg.9) of memory 1 (121) and the value stored in register 15 (reg.15) of memory 3 (123) are output to the arithmetic unit 150 and multiplied, and the result is split and stored in register 4 (reg.4) and register 8 (reg.8) of memory 0 (120), that is, the multiple-precision multiplication
(reg.9) × (reg.15) = (reg.8, reg.4)
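A multiple-precision multiplication whose double-length product is split across two registers can be sketched as below. The word width of 32 bits, the five-word operands (160 bits), and the reading of (reg.8, reg.4) as (high half, low half) are assumptions made for illustration; the description does not fix them.

```python
WORD_BITS = 32                      # assumed word width
MASK = (1 << WORD_BITS) - 1


def to_words(x: int, n: int) -> list[int]:
    """Split x into n words, least significant word first."""
    return [(x >> (WORD_BITS * i)) & MASK for i in range(n)]


def from_words(words: list[int]) -> int:
    return sum(w << (WORD_BITS * i) for i, w in enumerate(words))


def multiply_registers(a_words: list[int], b_words: list[int]):
    """Schoolbook word-by-word product; returns (high_half, low_half)."""
    n = len(a_words)
    result = [0] * (n + len(b_words))
    for i, a in enumerate(a_words):
        carry = 0
        for j, b in enumerate(b_words):
            acc = result[i + j] + a * b + carry
            result[i + j] = acc & MASK
            carry = acc >> WORD_BITS
        result[i + len(b_words)] = carry
    return result[n:], result[:n]   # assumed: high half -> reg.8, low half -> reg.4


reg9 = to_words(2**159 + 12345, 5)
reg15 = to_words(2**160 - 1, 5)
reg8, reg4 = multiply_registers(reg9, reg15)
print(from_words(reg4 + reg8) == (2**159 + 12345) * (2**160 - 1))
```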

First, a processing example without the address control circuit 200, which is the characteristic feature of the present invention, is described. A processing example with address control by the address control circuit 200 is described afterward.

To execute the multiple-precision multiplication
(reg.9) × (reg.15) = (reg.8, reg.4)
the memory bus switch circuit 130 is controlled based on a control signal from the control unit (not shown) so as to connect memory 1 to bus 0, memory 3 to bus 1, and memory 0 to bus 3.

Next, following the multiple-precision multiplication flow, the control unit (not shown) generates address 1 and address 3 for reading the value stored in register 9 (reg.9) of memory 1 (121) and the value stored in register 15 (reg.15) of memory 3 (123). The values are stored, via the memory bus switch circuit 130 and the buses, in the latch 151 preceding the multiplier 152 of the arithmetic unit 150, and the multiplication starts.

In the arithmetic unit 150, the values stored in register 9 (reg.9) and register 15 (reg.15) are set as the inputs of the multiplier. The product is input to memory 0 (120) via bus 3 and the memory bus switch circuit 130 and is stored in the registers (reg.8, reg.4) designated by the address (address 0) from the address control circuit.

Operations in Jacobian coordinates require multiple Montgomery operations. One way to implement them is to turn the frequently used Montgomery operation into a function that is called as needed, which reduces the circuit scale. In that case, however, the register positions in which the operation parameters are set, that is, the parameters a^* and b^* of the MonPro function MonPro(a^*, b^*) executed in the Montgomery operation, are fixed.

For example, register 0 (reg.0) through register 7 (reg.7) of the memory unit 110 are assigned as the locations for storing the parameters of the MonPro function c^* = MonPro(a^*, b^*) in the Montgomery operation and the intermediate values produced during the operation.

As a specific example, assume a design in which the storage locations of the parameters "a^*" and "b^*" are fixed to register 5 (reg.5) and register 6 (reg.6), and the operation result c^* is stored in the same register 5 (reg.5) and register 6 (reg.6).

With the storage locations of the parameters "a^*" and "b^*" fixed to register 5 (reg.5) and register 6 (reg.6), the arithmetic unit 150 can always fetch the data using the same fixed addresses when it executes the MonPro function c^* = MonPro(a^*, b^*) on those parameters.

The result of the preceding operation cycle is stored in one of the 24 registers of memory 0 (120) through memory 3 (123), so that stored value must be copied to register 5 (reg.5) and register 6 (reg.6), which are set as the storage locations of the parameters "a^*" and "b^*".

For example, as shown in FIG. 2, when the two data items used in the operation are in register 12 (reg.12) of memory 0 and register 19 (reg.19) of memory 3, these two items must be copied to register 5 (reg.5) and register 6 (reg.6), which are set as the storage locations of the parameters "a^*" and "b^*".

When the data to be copied is stored in different memory blocks, as shown in FIG. 2, the copy from register 12 (reg.12) of memory 0 to register 5 (reg.5) of memory 1 and the copy from register 19 (reg.19) of memory 3 to register 6 (reg.6) of memory 2 can be executed in parallel, so the copying finishes in roughly as many cycles as there are words in the data.

In each memory block, only one read or write operation can be executed per cycle, that is, either reading one word or writing one word. Read and write operations on different memory blocks can be executed in parallel.

Consequently, as shown for example in FIG. 3, when at least one of the data items used in the next operation cycle is in the same memory block as a copy destination, that is, as register 5 (reg.5) or register 6 (reg.6) set as the storage locations of the parameters "a^*" and "b^*", the data copy takes more processing cycles.

In FIG. 3, memory 2 contains both register 6 (reg.6), a copy destination, and register 14 (reg.14), which holds copy-source data.

In this case, a read and a write on memory 2 cannot be executed in parallel within the same processing cycle, so the read and the write must be executed sequentially. For example, the copy from register 19 (reg.19) to register 6 (reg.6) must be executed after the copy from register 14 (reg.14) to register 5 (reg.5), which involves reading register 14 (reg.14), has finished. This takes twice as many cycles as the data arrangement shown in FIG. 2.

Likewise, as shown in FIG. 4, when both data items used in the next operation cycle are in the same memory blocks as the copy destinations, that is, as register 5 (reg.5) and register 6 (reg.6) set as the storage locations of the parameters "a^*" and "b^*", the data copy also takes more processing cycles.

In FIG. 4, memory 2 contains register 6 (reg.6), a copy destination, and register 14 (reg.14), which holds copy-source data; memory 1 contains register 5 (reg.5), a copy destination, and register 17 (reg.17), which holds copy-source data.

In this case, a read and a write on memory 2 cannot be executed in parallel within the same processing cycle, and neither can a read and a write on memory 1. These read and write operations must therefore be executed sequentially.

For example, the copy from register 17 (reg.17) to register 6 (reg.6) must be executed after the copy from register 14 (reg.14) to register 5 (reg.5), which involves reading register 14 (reg.14), has finished. This takes twice as many cycles as the data arrangement shown in FIG. 2.
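The cycle counts for the arrangements of FIGS. 2 to 4 can be reproduced with a deliberately simplified model. It assumes that a copy occupies both its source block and its destination block for one cycle per word, that copies touching disjoint blocks run fully in parallel, and that register n sits in memory block n mod 4 (an inferred mapping). The five-word operand length is also an assumption (160 bits at 32 bits per word).

```python
def block_of(reg: int) -> int:
    """Assumed mapping: register n is located in memory block n % 4."""
    return reg % 4


def copy_cycles(copies: list[tuple[int, int]], words: int) -> int:
    """Estimate cycles for register copies given as (source_reg, dest_reg) pairs.

    Copies whose block sets overlap are placed in separate rounds, because a
    block can serve only one read or one write per cycle.
    """
    rounds: list[set[int]] = []          # blocks already busy in each round
    for src, dst in copies:
        needed = {block_of(src), block_of(dst)}
        for busy in rounds:
            if not (busy & needed):      # no conflict: share this round
                busy |= needed
                break
        else:
            rounds.append(set(needed))   # conflict everywhere: new round
    return len(rounds) * words


WORDS = 5
print(copy_cycles([(12, 5), (19, 6)], WORDS))   # FIG. 2 arrangement
print(copy_cycles([(14, 5), (19, 6)], WORDS))   # FIG. 3 arrangement
print(copy_cycles([(14, 5), (17, 6)], WORDS))   # FIG. 4 arrangement
```

Under these assumptions the FIG. 3 and FIG. 4 arrangements each need two rounds, matching the doubling of cycles described above.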

In scalar multiplication of coordinates, the coordinate doubling and addition operations are repeated as many times as the bit width of the modulus "P" (for example, 160 times when the modulus "P" is 160 bits), so the increase in operation cycles caused by additional copy operations has a large impact.

In the present invention, the address control circuit 200 performs address conversion in order to eliminate the increase in operation cycles and the operation delay caused by copy processing that depends on where the data is stored. This configuration reduces the number of operation cycles and enables high-speed operation.

The present invention provides an address control circuit 200 that converts the addresses that are generated by the control unit (not shown), which controls the arithmetic unit 150, and output to the memory blocks. The internal configuration of the address control circuit 200 is described with reference to FIGS. 5 and 6.

As shown in FIG. 5, the address control circuit 200 has two address conversion units 210 and 220. These convert, as necessary, the addresses output to the two memory blocks, out of the four, that contain register 5 (reg.5) and register 6 (reg.6) set as the storage locations of the parameters "a^*" and "b^*", namely memory 1 (121) and memory 2 (122).

Each of the address conversion units 210 and 220 consists of an address-setting latch circuit 211 or 221, a coincidence detection circuit 212 or 222, and an address switching circuit 213 or 223. FIG. 6 shows a specific circuit configuration example of one address conversion unit.

As described above, an address consists of address bits A5 to A0. Address bits A5c to A3c are input from the control unit (not shown) to each address conversion unit of the address control circuit and converted as necessary, and the converted address bits A5m to A3m are input to the row decoder and used as the register selection address. Word selection within multiple-precision data uses address bits A0 to A2, which the control unit (not shown) inputs to the column decoder.

In elliptic curve cryptography operations as well, the MonPro function c^* = MonPro(a^*, b^*) of the Montgomery operation is executed repeatedly. The storage locations of the parameters "a^*" and "b^*" are set to register 5 (reg.5) of memory 1 (121) and register 6 (reg.6) of memory 2 (122), and at the start of each operation cycle that uses these parameters, their read address, that is, A5c to A3c = 001, is input from the control unit to the address control circuit 200.

The address control circuit 200 converts, as necessary, the read address of the parameters "a^*" and "b^*" input from the control unit, that is, A5c to A3c = 001, and outputs it to the row decoders of memory 1 (121) and memory 2 (122). Specifically, when the parameter "a^*" or "b^*" is stored in memory 1 (121) or memory 2 (122) at a register position whose address differs from that of the parameter storage registers (address (A5, A4, A3) = (0, 0, 1)), the circuit converts the read address input from the control unit, A5c to A3c = 001, into the address that designates the register in which the parameter is actually stored, and outputs it to the row decoders of memory 1 (121) and memory 2 (122).

A5_L 314 to A3_L 312 of the latch 211 are latches that hold a register address: when the parameter "a^*" or "b^*" to be used in the next operation cycle was calculated by the arithmetic unit 150 in an earlier operation cycle and is stored in memory 1 (121) or memory 2 (122), the address of that register is set in them.

USE 311 is a latch that sets whether this circuit block is enabled: "1" is set to enable it and "0" to disable it.

The address conversion unit operates in one of the following three modes.
(a) "0" is held in USE 311 of the latch 211.
(b) "1" is held in USE 311 of the latch 211, and the address signal (A5c, A4c, A3c) from the control unit (controller) differs from the designated address (001) of register 5 (reg.5) or register 6 (reg.6), the storage location of the parameter "a^*" or "b^*".
(c) "1" is held in USE 311 of the latch 211, and the address signal (A5c, A4c, A3c) from the control unit (controller) is the designated address (001) of register 5 (reg.5) or register 6 (reg.6), the storage location of the parameter "a^*" or "b^*".

The operation of the address conversion unit in each of these three cases is described below.
(a) "0" is held in USE 311 of the latch 211

When "0" is held in USE 311 of the latch 211, the signal "1" obtained by inverting it in the inverting circuit 334 is input to the OR circuit 335, and the output of the OR circuit 335 becomes 1. The switch elements 341, 342, and 343 of the switch circuit 213 turn ON, and the switch circuit 213 outputs the address signal A5c to A3c input from the control unit (controller) unchanged to the row decoder of each memory block, as the register designation address A5m to A3m for memory 1 (121) and memory 2 (122), the memory blocks that contain register 5 (reg.5) and register 6 (reg.6), the storage locations of the parameters "a^*" and "b^*". In this case, the register designated by the control unit (controller) is selected.

When the output of the OR circuit 335 is 1, the switch circuit 213 turns ON the switch elements 343, 342, and 341 and outputs the address (A5c, A4c, A3c) from the control unit (controller) as the register designation address (A5m, A4m, A3m) for the memory. When the output of the OR circuit 335 is 0, it turns ON the switch elements 346, 345, and 344 and outputs the address stored in A5_L 314 to A3_L 312 of the latch 211 as the register designation address (A5m, A4m, A3m) for the memory.

(b) "1" is held in USE 311 of the latch 211, and the address signal (A5c, A4c, A3c) from the control unit (controller) differs from the designated address (001) of register 5 (reg.5) or register 6 (reg.6), the storage location of the parameter "a^*" or "b^*"

When "1" is held in USE 311 of the latch 211, the signal "0" obtained by inverting it in the inverting circuit 334 is input to the OR circuit 335. In this case the address signal (A5c, A4c, A3c) from the control unit (controller) differs from the designated address (001), that is, (A5c, A4c, A3c) ≠ (0, 0, 1).

One input of each of the EOR circuits 331, 332, and 333 is preset to the value corresponding to the designated address (001) of register 5 (reg.5) or register 6 (reg.6), the storage location of the parameter "a^*" or "b^*". That is, the EOR input data setting units 321 to 323 are set to supply the values corresponding to the address (001).

Therefore, when the address signal (A5c, A4c, A3c) from the control unit (controller) differs from the designated address (001), that is, when (A5c, A4c, A3c) ≠ (0, 0, 1), at least one of the outputs of the EOR circuits 331, 332, and 333 is "1".

As a result, the output of the OR circuit 335, that is, the output of the coincidence detection unit 212, becomes "1". As in case (a) above, the switch elements 343, 342, and 341 of the switch circuit 213 turn ON, and the switch circuit 213 outputs the address signal A5c to A3c input from the control unit (controller) unchanged to the row decoder of each memory block, as the register designation address A5m to A3m for memory 1 (121) and memory 2 (122). In this case, the register designated by the control unit (controller) is selected.

(c) "1" is held in USE 311 of the latch 211, and the address signal (A5c, A4c, A3c) from the control unit (controller) is the designated address (001) of register 5 (reg.5) or register 6 (reg.6), the storage location of the parameter "a^*" or "b^*"

When "1" is held in USE 311 of the latch 211, the signal "0" obtained by inverting it in the inverting circuit 334 is input to the OR circuit 335. When the address signal (A5c, A4c, A3c) from the control unit (controller) matches the designated address (001), that is, when (A5c, A4c, A3c) = (0, 0, 1), the outputs of the EOR circuits 333, 332, and 331 are all "0".

As a result, the output of the OR circuit 335, that is, the output of the coincidence detection unit 212, becomes "0". The switch elements 346, 345, and 344 of the switch circuit 213 therefore turn ON, and the switch circuit 213 outputs the address stored in A5_L 314 to A3_L 312 of the latch 211 to the row decoder of each memory block as the register designation address (A5m, A4m, A3m) for the memory. In this case, data is read from the address stored in A5_L 314 to A3_L 312 of the latch 211.
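The three modes (a) to (c) reduce to a small piece of combinational logic, sketched here in Python as a behavioral model. The function and parameter names are hypothetical; the comments tie each line to the circuit elements named in the text (EOR circuits 331 to 333, inverting circuit 334, OR circuit 335, switch circuit 213).

```python
FIXED_ADDRESS = (0, 0, 1)   # (A5, A4, A3) of the fixed parameter register


def translate(use, latch_addr, ctrl_addr, fixed=FIXED_ADDRESS):
    """Return the row address (A5m, A4m, A3m) sent to the row decoder.

    use        : value of the USE latch (0 = block disabled, 1 = enabled)
    latch_addr : (A5_L, A4_L, A3_L), where the parameter is actually stored
    ctrl_addr  : (A5c, A4c, A3c) issued by the controller
    """
    # EOR circuits: a bit is 1 where the controller address differs from 001
    eor = [c ^ f for c, f in zip(ctrl_addr, fixed)]
    # OR circuit: inverted USE (inverting circuit) OR any mismatch bit
    or_out = (1 - use) | eor[0] | eor[1] | eor[2]
    # Switch circuit: 1 passes the controller address, 0 selects the latch
    return tuple(ctrl_addr) if or_out == 1 else tuple(latch_addr)


print(translate(0, (0, 1, 1), (0, 0, 1)))   # mode (a): pass through
print(translate(1, (0, 1, 1), (0, 1, 0)))   # mode (b): pass through
print(translate(1, (0, 1, 1), (0, 0, 1)))   # mode (c): redirected to the latch
```

Only mode (c) redirects the access, so ordinary reads and writes to other registers in the same block are unaffected while the block is enabled.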

A specific address conversion process in the address control circuit 200, and the data read process that uses the converted address, are described with reference to FIGS. 7 and 8.

FIG. 7 shows the same data storage state as described earlier with reference to FIG. 3, that is, a state in which one of the data items used in the next operation cycle is stored in a register of the same memory block as register 5 (reg.5) or register 6 (reg.6), set as the storage locations of the parameters "a^*" and "b^*". One of the data items used in the next operation cycle is stored in register 14 (reg.14) of memory 2. The other is stored in register 19 (reg.19) of memory 3.

FIG. 8 shows the same data storage state as described earlier with reference to FIG. 4, that is, a state in which both data items used in the next operation cycle are stored in registers of the same memory blocks as register 5 (reg.5) and register 6 (reg.6), set as the storage locations of the parameters "a^*" and "b^*". One of the data items used in the next operation cycle is stored in register 14 (reg.14) of memory 2, and the other is stored in register 17 (reg.17) of memory 1.

In the conventional method, both of these states require two data copy operations to be executed sequentially, which causes a processing delay and hinders high-speed operation. With the address conversion of the present invention, the data can be set and the next operation cycle started after at most one data copy operation.

In arithmetic operations such as multiplication and addition, for example the Montgomery operation described above, swapping the positions of "a^*" and "b^*", that is, the roles of multiplier and multiplicand, does not affect the result, as can be seen from (Formula 14) and (Formula 15).

First, with reference to FIG. 7, the processing sequence is described for the state in which one of the data items used in the next operation cycle is stored in a register of the same memory block as register 5 (reg.5) or register 6 (reg.6), set as the storage locations of the parameters "a^*" and "b^*".

処理シーケンスは次のようになる。
ステップＳ１０１：メモリ３のレジスタ１９（ｒｅｇ．１９）のデータをメモリ１のレジスタ５（ｒｅｇ．５）にコピーする。
ステップＳ１０２：メモリ１に対応するアドレス変換部２２０（図５参照）のラッチ２２１のＵＳＥに不使用を示すデータ"０"をセットし、メモリ２に対応するアドレス変換部２１０のラッチ２１１のＵＳＥに使用を示すデータ"１"をセットし、（Ａ５＿Ｌ，Ａ４＿Ｌ，Ａ３＿Ｌ）に、レジスタ１４（ｒｅｇ．１４）に対応するアドレスデータ（０，１，１）を設定する。 The processing sequence is as follows.
Step S101: The data in the register 19 (reg. 19) in the memory 3 is copied to the register 5 (reg. 5) in the memory 1.
Step S102: Data “0” indicating non-use is set in the USE of the latch 221 of the address conversion unit 220 (see FIG. 5) corresponding to the memory 1, and the USE of the latch 211 of the address conversion unit 210 corresponding to the memory 2 is set. Data “1” indicating use is set, and address data (0, 1, 1) corresponding to the register 14 (reg. 14) is set in (A5_L, A4_L, A3_L).

ステップＳ１０３ａ：メモリ１に対応するアドレス変換部のアドレス一致検出部２２２の出力、すなわち図６に示すＯＲ回路３３５の出力は１となり、スイッチ回路２１３のスイッチ素子３４３，３４２，３４１がＯＮとなり、スイッチ回路２１３は、制御部（コントローラ）から入力されるアドレス信号Ａ５ｃ〜Ａ３ｃをそのまま、パラメータ"ａ^*"の格納場所であるレジスタ５（ｒｅｇ．５）の指定アドレスが、レジスタ指定アドレスＡ５ｍ〜Ａ３ｍとして、メモリ１の行デコーダに出力され、レジスタ５（ｒｅｇ．５）からデータ読み出しが行われ、メモリバススイッチ回路１３０およびバスを介して演算部１５０に出力される。 Step S103a: The output of the address coincidence detection unit 222 of the address conversion unit corresponding to the memory 1, that is, the output of the OR circuit 335 shown in FIG. 6 is 1, the switch elements 343, 342, and 341 of the switch circuit 213 are turned on. The circuit 213 uses the address signals A5c to A3c input from the control unit (controller) as they are, and the designated address of the register 5 (reg. 5), which is the storage location of the parameter “a ^* ”, becomes the register designated addresses A5m to A3m. The data is output to the row decoder of the memory 1, data is read from the register 5 (reg. 5), and is output to the arithmetic unit 150 via the memory bus switch circuit 130 and the bus.

Step S103b: The output of the address coincidence detection unit 212 of the address conversion unit corresponding to memory 2, that is, the output of the OR circuit 335 shown in FIG. 6, becomes 0, and the switch elements 346, 345, and 344 of the switch circuit 213 turn ON. The switch circuit 213 outputs the address stored in A5_L 314 to A3_L 312 of the latch 211 to memory 2 as the register designation address (A5m, A4m, A3m). In this case, data is read from register 14 (reg. 14) based on the stored address, namely the address data (0, 1, 1) corresponding to register 14 (reg. 14), and is output to the arithmetic unit 150 via the memory bus switch circuit 130 and the bus.

As a result, the data in register 5 (reg. 5) of memory 1 and the data in register 14 (reg. 14) of memory 2 are input to the arithmetic unit 150, and an operation based on both data items is executed.

As the above description shows, the only data copy required is the single copy in step S101. Compared with the processing without address conversion described earlier with reference to FIG. 3, in which two copy operations must be executed sequentially, the number of processing cycles is reduced and high-speed operation becomes possible.
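The address selection performed in steps S102 to S103b can be illustrated with a small Python model. This is only a sketch under stated assumptions, not the circuit of FIG. 6 itself: the class name `AddressConversionUnit`, the row address (0, 0, 1) assumed for registers 5 and 6, and the rule that a cleared USE bit forces the OR output to 1 are all assumptions chosen to reproduce the behavior described above.

```python
from dataclasses import dataclass
from typing import Tuple

Addr = Tuple[int, int, int]  # (A5, A4, A3) row-address bits


@dataclass
class AddressConversionUnit:
    """Toy model of one address conversion unit: latch, coincidence detector, switch."""
    specific_addr: Addr                 # row address of the fixed parameter register
    use: int = 0                        # USE bit of the latch (1 = conversion enabled)
    latched_addr: Addr = (0, 0, 0)      # (A5_L, A4_L, A3_L)

    def or_output(self, ctrl_addr: Addr) -> int:
        # EOR stage: a bit is 1 where the controller address differs from the
        # specific-register address.
        mismatch = [c ^ s for c, s in zip(ctrl_addr, self.specific_addr)]
        # Assumption: USE = 0 also drives the OR output to 1, so conversion is off.
        return int(any(mismatch) or self.use == 0)

    def memory_address(self, ctrl_addr: Addr) -> Addr:
        # OR output 1: the controller address passes through (A5c..A3c -> A5m..A3m).
        # OR output 0: the latched address is sent to the memory instead.
        return ctrl_addr if self.or_output(ctrl_addr) else self.latched_addr


# Assumed row address shared by reg. 5 (memory 1) and reg. 6 (memory 2).
PARAM_ROW: Addr = (0, 0, 1)

unit_mem1 = AddressConversionUnit(specific_addr=PARAM_ROW)   # models unit 220
unit_mem2 = AddressConversionUnit(specific_addr=PARAM_ROW)   # models unit 210

# Step S102: conversion off for memory 1, on for memory 2 with reg. 14's address.
unit_mem1.use = 0
unit_mem2.use = 1
unit_mem2.latched_addr = (0, 1, 1)

# Steps S103a / S103b: the controller issues the parameter-register address to both.
addr_mem1 = unit_mem1.memory_address(PARAM_ROW)
addr_mem2 = unit_mem2.memory_address(PARAM_ROW)
print(addr_mem1, addr_mem2)
```

With the settings of step S102, the model forwards the controller address to memory 1 and substitutes the latched address (0, 1, 1) for memory 2, matching the reads from register 5 and register 14 described in the text.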

Next, referring to FIG. 8, the processing sequence is described for the case where both data items used in the next operation cycle are stored in registers of the same memory blocks as register 5 (reg. 5) and register 6 (reg. 6), the registers set as the storage locations of the parameters "a*" and "b*".

The processing sequence is as follows.
Step S201: Data "1", indicating use, is set in USE of the latch 221 of the address conversion unit 220 corresponding to memory 1, and the address data (1, 0, 0) corresponding to register 17 (reg. 17) is set in (A5_L, A4_L, A3_L). In addition, data "1", indicating use, is set in USE of the latch 211 of the address conversion unit 210 corresponding to memory 2, and the address data (0, 1, 1) corresponding to register 14 (reg. 14) is set in (A5_L, A4_L, A3_L).

Step S202a: The output of the address coincidence detection unit 222 of the address conversion unit corresponding to memory 1, that is, the output of the OR circuit 335 shown in FIG. 6, becomes 0, and the switch elements 346, 345, and 344 of the switch circuit 223 turn ON. The switch circuit 223 outputs the address stored in A5_L 314 to A3_L 312 of the latch 221 to memory 1 as the register designation address (A5m, A4m, A3m). In this case, data is read from register 17 (reg. 17) based on the stored address, namely the address data (1, 0, 0) corresponding to register 17 (reg. 17), and is output to the arithmetic unit 150 via the memory bus switch circuit 130 and the bus.

Step S202b: The output of the address coincidence detection unit 212 of the address conversion unit corresponding to memory 2, that is, the output of the OR circuit 335 shown in FIG. 6, becomes 0, and the switch elements 346, 345, and 344 of the switch circuit 213 turn ON. The switch circuit 213 outputs the address stored in A5_L 314 to A3_L 312 of the latch 211 to memory 2 as the register designation address (A5m, A4m, A3m). In this case, data is read from register 14 (reg. 14) based on the stored address, namely the address data (0, 1, 1) corresponding to register 14 (reg. 14), and is output to the arithmetic unit 150 via the memory bus switch circuit 130 and the bus.

As a result, the data in register 17 (reg. 17) of memory 1 and the data in register 14 (reg. 14) of memory 2 are input to the arithmetic unit 150, and an operation based on both data items is executed.

As the above description shows, no data copy processing is required. Compared with the processing without address conversion described earlier with reference to FIG. 4, in which two copy operations must be executed sequentially, the number of processing cycles is greatly reduced and high-speed operation becomes possible.
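The copy counts for the two cases above (one copy for FIG. 7, none for FIG. 8, against two without address conversion) can be reproduced with a short Python sketch. It assumes that registers are interleaved over four memory blocks so that register n sits in block n mod 4 at row n div 4. This mapping is not stated in this passage; it is inferred because it agrees with every address quoted above (reg. 14 in memory 2 at (0, 1, 1), reg. 17 in memory 1 at (1, 0, 0), reg. 19 in memory 3). The helper names `block_of`, `row_of`, and `plan_reads` are hypothetical.

```python
NUM_BLOCKS = 4            # assumed: memory blocks 0..3, registers interleaved
SPECIFIC_REGS = (5, 6)    # registers reserved for the parameters a* and b*


def block_of(reg):
    """Assumed mapping: register n lives in memory block n mod 4."""
    return reg % NUM_BLOCKS


def row_of(reg):
    """Assumed mapping: row n div 4, returned as the bits (A5, A4, A3)."""
    q = reg // NUM_BLOCKS
    return ((q >> 2) & 1, (q >> 1) & 1, q & 1)


def plan_reads(operands, use_conversion=True):
    """Return (copies, latches) needed to feed two operands to the arithmetic unit.

    copies  : list of (source_reg, destination_specific_reg)
    latches : {memory_block: row address to latch with USE = 1}
    """
    free = list(SPECIFIC_REGS)
    copies, latches, leftovers = [], {}, []
    for reg in operands:
        # Look for a still-unused specific register in the operand's own block.
        match = next((s for s in free if block_of(s) == block_of(reg)), None)
        if match is not None and (reg == match or use_conversion):
            free.remove(match)
            if reg != match:
                latches[block_of(reg)] = row_of(reg)   # redirect instead of copying
        else:
            leftovers.append(reg)
    for reg in leftovers:
        copies.append((reg, free.pop(0)))              # fall back to a data copy
    return copies, latches


fig7 = plan_reads((19, 14))
fig8 = plan_reads((17, 14))
fig7_no_conversion = plan_reads((19, 14), use_conversion=False)
print(fig7, fig8, fig7_no_conversion)
```

Under these assumptions the plan for FIG. 7 contains the single copy from register 19 to register 5 plus a latch setting for memory 2, and the plan for FIG. 8 contains only latch settings.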

Next, processing is described for the case where a result computed by the arithmetic unit 150 is stored in a register as an intermediate value, and the intermediate value is then read from the register again and output to the arithmetic unit 150 to compute a square. For example, the squaring computation using the MonPro function described above is expressed as follows.
c* = MonPro(a*, b*)
c2* = MonPro(c*, c*)
Here, assume that the data input as a* and b* is arranged as in FIG. 8 (the same as FIG. 4), that is, the two data items used in the next operation cycle are stored in registers of the same memory blocks as register 5 (reg. 5) and register 6 (reg. 6), the registers set as the storage locations of the parameters "a*" and "b*". Specifically, one of the data items used in the next operation cycle is stored in register 14 (reg. 14) of memory 2, and the other is stored in register 17 (reg. 17) of memory 1.

In this case, the processing is as follows.
Step S301: Data "1", indicating use, is set in USE of the latch 221 of the address conversion unit 220 corresponding to memory 1, and the address data (1, 0, 0) corresponding to register 17 (reg. 17) is set in (A5_L, A4_L, A3_L). In addition, data "1", indicating use, is set in USE of the latch 211 of the address conversion unit 210 corresponding to memory 2, and the address data (0, 1, 1) corresponding to register 14 (reg. 14) is set in (A5_L, A4_L, A3_L).

Step S302a: The output of the address coincidence detection unit 222 of the address conversion unit corresponding to memory 1, that is, the output of the OR circuit 335 shown in FIG. 6, becomes 0, and the switch elements 346, 345, and 344 of the switch circuit 223 turn ON. The switch circuit 223 outputs the address stored in A5_L 314 to A3_L 312 of the latch 221 to memory 1 as the register designation address (A5m, A4m, A3m). In this case, data is read from register 17 (reg. 17) based on the stored address, namely the address data (1, 0, 0) corresponding to register 17 (reg. 17), and is output to the arithmetic unit 150 via the memory bus switch circuit 130 and the bus.

Step S302b: The output of the address coincidence detection unit 212 of the address conversion unit corresponding to memory 2, that is, the output of the OR circuit 335 shown in FIG. 6, becomes 0, and the switch elements 344, 345, and 346 of the switch circuit 213 turn ON. The switch circuit 213 outputs the address stored in A5_L 314 to A3_L 312 of the latch 211 to memory 2 as the register designation address (A5m, A4m, A3m). In this case, data is read from register 14 (reg. 14) based on the stored address, namely the address data (0, 1, 1) corresponding to register 14 (reg. 14), and is output to the arithmetic unit 150 via the memory bus switch circuit 130 and the bus.

Through these processes, the results of the computation up to and including the squaring are stored in register 17 (reg. 17) of memory 1 and register 14 (reg. 14) of memory 2.
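The multiply-then-square pattern c* = MonPro(a*, b*), c2* = MonPro(c*, c*) can be tried out with a minimal Python sketch of the word-serial MonPro loop as written in this document. It assumes the usual Montgomery conventions, which this passage does not restate: r is the word radix, dl is the number of words so that R = r^dl, P is an odd modulus smaller than R, P0' = -P^(-1) mod r, and t0 is the least significant word of t. The function name `mon_pro` and the small numeric values are illustrative only.

```python
def mon_pro(a_star, b_star, P, r, dl):
    """Word-serial Montgomery product: returns a_star * b_star * R^-1 mod P, R = r**dl."""
    # P0' = -P^-1 mod r (assumed definition); pow(x, -1, m) needs Python 3.8+.
    p0_prime = (-pow(P, -1, r)) % r
    t = a_star * b_star
    for _ in range(dl):
        t0 = t % r                      # least significant word of t
        m = (t0 * p0_prime) % r
        t = (t + m * P) // r            # exact division: t + m*P is a multiple of r
    return t - P if t >= P else t


# Illustrative parameters: 2 words of 4 bits, so R = 16**2 = 256.
P, r, dl = 97, 16, 2
R = r ** dl

a, b = 5, 7
a_star = (a * R) % P                    # operands in Montgomery form
b_star = (b * R) % P

c_star = mon_pro(a_star, b_star, P, r, dl)     # c*  = MonPro(a*, b*)
c2_star = mon_pro(c_star, c_star, P, r, dl)    # c2* = MonPro(c*, c*)

# Leaving Montgomery form: MonPro(x*, 1) = x mod P.
c = mon_pro(c_star, 1, P, r, dl)
c2 = mon_pro(c2_star, 1, P, r, dl)
print(c, c2)
```

Because the second call reads the intermediate value c* for both operands, it corresponds to the case in the text where the result of one MonPro operation is fetched again from a register for the squaring.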

The present invention has been described in detail above with reference to specific embodiments. It is obvious, however, that those skilled in the art can modify or substitute the embodiments without departing from the gist of the present invention. In other words, the present invention has been disclosed by way of example and should not be interpreted restrictively. To determine the gist of the present invention, the claims section set forth at the beginning should be taken into consideration.

As described above, the configuration of the present invention applies to an arithmetic processing device that has a memory unit with memory blocks each containing a plurality of registers as data storage areas, an arithmetic unit that receives data read from a register based on the designated address of the register and executes arithmetic processing based on the input data, and a control unit that executes data input/output control for the arithmetic unit. When the data storage register holding data to be output to the arithmetic unit is a different register in the same memory block as a predetermined specific register, address control processing converts the input address from the control unit into the designated address of the data storage register and outputs the converted address as the read address for the memory block. Data reading and arithmetic processing can therefore be executed with fewer data copy operations to the predetermined specific register, which speeds up the operation and makes the invention applicable to cryptographic processing devices that require high-speed processing.

Furthermore, according to the configuration of the present invention, in executing the MonPro operation MonPro(a*, b*) of the Montgomery operation, when the data corresponding to the parameter "a*" or "b*" to be output is stored in a register different from the specific register set as the planned storage area for that parameter, the input address from the control unit is converted into the designated address of the data storage register holding that parameter, and the converted address is output as the read address for the memory block. This reduces the data copy processing of the MonPro operation, which is executed repeatedly in cryptographic processing, speeds up the cryptographic processing operation, and makes the invention applicable to cryptographic processing devices that require high-speed processing.

FIG. 1 is a diagram showing the configuration of the arithmetic processing device of the present invention.
FIG. 2 is a diagram explaining the register usage configuration of the memory and the copy processing in arithmetic processing.
FIG. 3 is a diagram explaining the register usage configuration of the memory and the copy processing in arithmetic processing.
FIG. 4 is a diagram explaining the register usage configuration of the memory and the copy processing in arithmetic processing.
FIG. 5 is a diagram showing the address control circuit configuration of the arithmetic processing device of the present invention.
FIG. 6 is a diagram showing a detailed circuit configuration example of the address control circuit of the arithmetic processing device of the present invention.
FIG. 7 is a diagram explaining register-based arithmetic processing of the memory involving address conversion processing in the address control circuit of the arithmetic processing device of the present invention.
FIG. 8 is a diagram explaining register-based arithmetic processing of the memory involving address conversion processing in the address control circuit of the arithmetic processing device of the present invention.

Explanation of symbols

110 Memory unit
120 to 123 Memory blocks
130 Memory bus switch circuit
150 Arithmetic unit
151, 153, 155, 156, 158 Latches
152 Multiplier
154, 157 Adders
200 Address control circuit
210, 220 Address conversion units
211, 221 Latches
212, 222 Coincidence detection units
213, 223 Switches
311 to 314 Latches
321 to 323 EOR input data setting units
331 to 333 EOR circuits
335 OR circuit
341 to 346 Switch elements

Claims

An arithmetic processing device comprising a memory unit having memory blocks each having a plurality of registers as data storage areas, an arithmetic unit that receives data read from a register based on a designated address of the register and executes arithmetic processing based on the input data, and a control unit that executes data input/output control for the arithmetic unit,
the arithmetic processing device further comprising an address control unit that receives a register designation address from the control unit and executes conversion processing of the input address,
wherein the address control unit is configured to receive the address of a predetermined specific register from the control unit and, when the data storage register storing data to be output to the arithmetic unit is a different register in the same memory block as the specific register, to convert the input address from the control unit into the designated address of the data storage register and output the converted address as a read address for the memory block.

The arithmetic processing device according to claim 1, wherein the address control unit comprises:
a latch storing the address of the data storage register;
a coincidence detection unit that detects whether the input address from the control unit coincides with the address of the predetermined specific register; and
switch means for selecting, based on detection information from the coincidence detection unit, either the input address from the control unit or the address of the data storage register stored in the latch, and outputting the selected address as the read address for the memory block.

The arithmetic processing device according to claim 1, wherein the address control unit further includes an information storage unit storing information that enables the address conversion operation only when the data storage register storing data to be output to the arithmetic unit is a different register in the same memory block as the specific register, and is configured to execute the address conversion processing only when the information enabling the address conversion operation is stored in the information storage unit.

The arithmetic processing device according to claim 1, wherein:
the arithmetic unit includes a multiplier and an adder that execute Montgomery operations in cryptographic processing;
the memory unit is configured such that specific registers of two memory blocks among a plurality of memory blocks are set as storage areas for parameters or intermediate values applied to the Montgomery operation that are scheduled to be input to the arithmetic unit in the next operation cycle; and
the address control unit is configured to receive from the control unit the address of the specific register set as a storage area for parameters or intermediate values applied to the Montgomery operation and, when the data storage register storing data to be output to the arithmetic unit is a different register in the same memory block as the specific register, to convert the input address from the control unit into the designated address of the data storage register and output the converted address as a read address for the memory block.

The arithmetic processing device according to claim 4, wherein the arithmetic processing device executes the following MonPro operation:
MonPro(a*, b*)
t = a* × b*
for i = 0 to dl-1
  m = t₀ × P₀' mod r
  t = (t + m × P) / r
next i
if t ≧ P then return t − P
else return t
and the specific registers of the two memory blocks among the plurality of memory blocks are set as registers for storing the parameter a* or b* in the MonPro operation.

An arithmetic processing method comprising:
an address generation step of generating a designated address of a register in a control unit that executes data input/output control for an arithmetic unit;
an address control step of receiving the address of a predetermined specific register from the control unit and, when the data storage register storing data to be output to the arithmetic unit is a different register in the same memory block as the specific register, converting the input address from the control unit into the designated address of the data storage register and outputting the converted address as a read address for the memory block; and
a data read step of reading data from a register of the memory block based on the address controlled in the address control step and outputting the data to the arithmetic unit.

The arithmetic processing method according to claim 6, wherein the address control step further includes:
a coincidence detection step of detecting whether the input address from the control unit coincides with the address of the predetermined specific register; and
an output switching step of selecting, based on detection information from the coincidence detection step, either the input address from the control unit or the address of the data storage register stored in advance in a latch, and outputting the selected address as the read address for the memory block.

The arithmetic processing method according to claim 6, wherein the address conversion processing in the address control step is executed only when the data storage register storing data to be output to the arithmetic unit is a different register in the same memory block as the specific register.

The arithmetic processing method according to claim 6, wherein, in the address control step, the address of a specific register set as a storage area for parameters or intermediate values applied to a Montgomery operation is received from the control unit and, when the data storage register storing data to be output to the arithmetic unit is a different register in the same memory block as the specific register, the input address from the control unit is converted into the designated address of the data storage register and the converted address is output as a read address for the memory block.

The arithmetic processing method according to claim 9, wherein the arithmetic processing method includes an operation step of executing the following MonPro operation:
MonPro(a*, b*)
t = a* × b*
for i = 0 to dl-1
  m = t₀ × P₀' mod r
  t = (t + m × P) / r
next i
if t ≧ P then return t − P
else return t
and, in the address control step, the address of a specific register set as a storage area for the parameter a* or b* in the MonPro operation is received from the control unit and, when the data storage register storing the data corresponding to the parameter a* or b* to be output to the arithmetic unit is a different register in the same memory block as the specific register, the input address from the control unit is converted into the designated address of the data storage register and the converted address is output as a read address for the memory block.