JPH0283656A

JPH0283656A - Learning processor

Info

Publication number: JPH0283656A
Application number: JP63235441A
Authority: JP
Inventors: Atsunobu Hiraiwa; 平岩　篤信
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1988-09-20
Filing date: 1988-09-20
Publication date: 1990-03-23
Anticipated expiration: 2012-04-30
Also published as: JP2606317B2

Abstract

(57) [Abstract] This publication contains application data from before electronic filing, so no abstract data is recorded.

Description

[Detailed Description of the Invention]

A. Industrial Field of Application

The present invention relates to a learning processing device that applies learning processing in accordance with the back-propagation learning rule to a signal processing unit built from a so-called neural network, which is composed of a plurality of units each performing signal processing corresponding to a neuron.

B. Summary of the Invention

In a learning processing device that applies learning processing in accordance with the back-propagation learning rule to a signal processing unit using a neural network, the present invention learns the coupling strength coefficients while increasing the number of units in the intermediate layer, so that local minimum states in the course of learning can be avoided.

C. Prior Art

The back-propagation learning rule, a learning algorithm for neural networks (see, for example, "Parallel Distributed Processing", Vol. 1, The MIT Press, 1986, and Nikkei Electronics, August 10, 1987 issue, No. 427, pp. 115-124), is applied to a multilayer neural network having an intermediate layer (32) between an input layer (31) and an output layer (33), as shown in FIG. 8. Its application to various kinds of signal processing, such as high-speed image processing and pattern recognition, is being attempted.

That is, as shown in FIG. 8, each unit ($u_j$) constituting this neural network outputs a value $O_j$ obtained by transforming its total input $net_j$ with a predetermined function $f$, such as a sigmoid function. The total input $net_j$ is the sum of the output values $O_i$ of the units ($u_i$) connected to it, each weighted by the coupling coefficient $W_{ji}$ from unit ($u_i$) to unit ($u_j$). Thus, when the values of a pattern $p$ are supplied as input values to the units of the input layer, the output value $O_{pj}$ of each unit ($u_j$) of the intermediate layer and the output layer is expressed by Equation 1:

$$O_{pj} = f_j(net_{pj}) = f_j\Big(\sum_i W_{ji} \cdot O_{pi}\Big) \quad (1)$$
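The unit computation of Equation 1 can be sketched in a few lines of Python. This is only an illustration: the function names `sigmoid`, `unit_output` and `layer_outputs` are hypothetical, and the logistic sigmoid is assumed as the output function $f$ because the text names it only as one possible choice.

```python
import math


def sigmoid(x):
    """Logistic sigmoid, assumed here as the output function f."""
    return 1.0 / (1.0 + math.exp(-x))


def unit_output(weights, inputs, f=sigmoid):
    """Equation 1 for one unit j: O_pj = f(sum_i W_ji * O_pi).

    weights[i] plays the role of W_ji and inputs[i] the role of O_pi.
    """
    net = sum(w * o for w, o in zip(weights, inputs))
    return f(net)


def layer_outputs(weight_rows, inputs, f=sigmoid):
    """Apply Equation 1 to every unit of a layer (one weight row per unit)."""
    return [unit_output(row, inputs, f) for row in weight_rows]
```

Calling `layer_outputs` once for the intermediate layer and once more for the output layer, feeding the first result into the second call, gives the forward computation from the input layer toward the output layer.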

Then, by calculating the output values of the units ($u_j$) corresponding to the neurons in sequence from the input layer (31) toward the output layer (33), the output values $O_{pj}$ of the units ($u_j$) of the output layer (33) are obtained.

In the back-propagation learning algorithm, learning processing that changes the coupling coefficients $W_{ji}$ is carried out in sequence from the output layer (33) toward the input layer (31) so as to minimize the sum of squared errors $E_p$ between the actual output value $O_{pj}$ of each unit ($u_j$) of the output layer (33) when pattern $p$ is presented and the desired output value $t_{pj}$, that is, the teacher signal:

$$E_p = \frac{1}{2}\sum_j (t_{pj} - O_{pj})^2 \quad (2)$$

As a result, output values $O_{pj}$ closest to the teacher signal values $t_{pj}$ come to be output from the units ($u_j$) of the output layer (33).
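As a small illustration of the quantity being minimized, the per-pattern error can be computed as follows. The function name `pattern_error` is hypothetical, and the factor of one half is an assumption taken from the usual statement of the back-propagation rule; the surviving text only describes a sum of squared errors.

```python
def pattern_error(targets, outputs):
    """Sum of squared errors for one pattern p over the output units.

    targets[j] is the teacher signal t_pj, outputs[j] the actual output O_pj.
    The factor 0.5 is an assumed convention, not confirmed by the source text.
    """
    return 0.5 * sum((t - o) ** 2 for t, o in zip(targets, outputs))
```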

If the change $\Delta W_{ji}$ in the coupling coefficient $W_{ji}$ that reduces the sum of squared errors $E_p$ is chosen as

$$\Delta W_{ji} \propto -\frac{\partial E_p}{\partial W_{ji}} \quad (3)$$

then Equation 3 can be rewritten as

$$\Delta W_{ji} = \eta \cdot \delta_{pj} \cdot O_{pi} \quad (4)$$

(see the literature cited above for the derivation).

Here, $\eta$ is the learning rate (a constant), determined empirically from the number of units, the number of layers, the input and output values, and so on. $\delta_{pj}$ is the error value of unit ($u_j$).

Therefore, to determine the change $\Delta W_{ji}$, it suffices to compute the error values $\delta_{pj}$ backward, from the output layer of the network toward the input layer. The error value $\delta_{pj}$ of a unit ($u_j$) in the output layer is given by Equation 5:

$$\delta_{pj} = (t_{pj} - O_{pj})\, f'_j(net_j) \quad (5)$$

The error value $\delta_{pj}$ of a unit ($u_j$) in the intermediate layer is computed recursively by Equation 6, using the coupling coefficients $W_{kj}$ and the error values $\delta_{pk}$ of the units ($u_k$) to which that unit ($u_j$) is connected (in this example, the units of the output layer):

$$\delta_{pj} = f'_j(net_j) \sum_k \delta_{pk} W_{kj} \quad (6)$$

(see the literature cited above for the derivation of Equations 5 and 6).
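The backward computation of the error values in Equations 5 and 6 can be sketched as below. The function names are hypothetical, and the sketch assumes a logistic sigmoid output function, for which the derivative can be written as $O(1 - O)$; the text itself leaves the choice of $f$ open.

```python
def output_deltas(targets, outputs):
    """Equation 5 for each output unit, assuming a logistic sigmoid f.

    For that f, f'(net_j) equals O_pj * (1 - O_pj).
    """
    return [(t - o) * o * (1.0 - o) for t, o in zip(targets, outputs)]


def hidden_deltas(hidden_outputs, next_deltas, next_weights):
    """Equation 6 for each intermediate unit j.

    next_weights[k][j] plays the role of W_kj, the coefficient from
    intermediate unit j to unit k of the following layer, and
    next_deltas[k] is the error value of that unit k.
    """
    deltas = []
    for j, o in enumerate(hidden_outputs):
        back = sum(d * next_weights[k][j] for k, d in enumerate(next_deltas))
        deltas.append(o * (1.0 - o) * back)
    return deltas
```

The output-layer error values are computed first and then passed to `hidden_deltas`, which mirrors the output-to-input direction described in the text.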

Note that $f'_j(net_j)$ is the derivative of the output function $f_j(net_j)$.

The change $\Delta W_{ji}$ is obtained from Equation 4 using the results of Equations 5 and 6, but a more stable result is obtained by also using the previous learning result, as in Equation 7:

$$\Delta W_{ji}(n+1) = \eta \cdot \delta_{pj} \cdot O_{pi} + \alpha \cdot \Delta W_{ji}(n) \quad (7)$$

Here, $\alpha$ is a stabilization constant that reduces oscillation of the error and speeds up convergence.
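A minimal sketch of the update with the stabilization term follows. The function name `momentum_update`, the nested-list weight layout and the default values of `eta` and `alpha` are assumptions made for illustration; the text does not give numerical values for either constant.

```python
def momentum_update(weights, prev_changes, deltas, inputs, eta=0.5, alpha=0.9):
    """Equation 7 for one layer, then apply the change to the weights.

    weights[j][i] is W_ji, prev_changes[j][i] is the previous change to W_ji,
    deltas[j] is the error value of unit j and inputs[i] is O_pi.
    Returns the new weights and the changes to remember for the next step.
    """
    new_changes = [
        [eta * deltas[j] * inputs[i] + alpha * prev_changes[j][i]
         for i in range(len(inputs))]
        for j in range(len(deltas))
    ]
    new_weights = [
        [weights[j][i] + new_changes[j][i] for i in range(len(inputs))]
        for j in range(len(deltas))
    ]
    return new_weights, new_changes
```

Setting `alpha=0.0` reduces the update to the plain rule of Equation 4.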

This learning is repeated, and learning is completed when the sum of squared errors $E_p$ between the output values $O_{pj}$ and the teacher signal values $t_{pj}$ has become sufficiently small.

D. Problems to Be Solved by the Invention

Learning processing in accordance with the back-propagation learning rule for a multilayer neural network as described above can be expected to deliver high functional capability. In the course of learning, however, it often falls into a local minimum state without reaching the optimal minimum (global minimum), so that the sum of squared errors $E_p$ does not become sufficiently small.

Conventionally, when learning fell into such a local minimum state, the learning processing was repeated with different initial values or a different learning rate $\eta$ until the global minimum state was found. Conventional learning processing devices therefore had the problem that the learning time was extremely long and, moreover, varied widely.

In view of these circumstances, the object of the present invention is to enable a learning processing device that applies learning processing in accordance with the back-propagation learning rule to a signal processing unit using a neural network to avoid local minimum states efficiently in the course of learning and to converge stably and quickly to the global minimum state. To that end, the invention provides a learning processing device of novel configuration that performs learning processing while increasing the number of units in the intermediate layer.

E. Means for Solving the Problems

To achieve the above object, the present invention provides a learning processing device comprising a signal processing unit and a learning processing unit. The signal processing unit has an input layer, an intermediate layer and an output layer, each composed of a plurality of units that perform signal processing corresponding to a neuron. The learning processing unit learns the coefficients of the strength of coupling between the units by repeatedly calculating them in sequence from the output layer side toward the input layer side, based on error information between the output values of the output layer for an input signal pattern supplied to the input layer and the desired output values given as a teacher signal. The device is characterized in that the learning processing unit is provided with control means for increasing the number of units in the intermediate layer in the course of learning the coupling strength coefficients, so that the learning processing unit learns the coupling strength coefficients while increasing the number of units in the intermediate layer.

F. Operation

In the learning processing device according to the present invention, the learning processing unit learns the coupling strength coefficients while increasing the number of units in the intermediate layer. Learning in accordance with the back-propagation learning rule thereby avoids local minimum states and converges reliably to the global minimum state.

G. Embodiments

Embodiments of the present invention are described in detail below with reference to the drawings.

As shown in principle in FIG. 1, the learning processing device according to the present invention comprises a signal processing unit (10) and a learning processing unit (20). The signal processing unit (10) is a neural network with a three-layer structure having at least an input layer (11), an intermediate layer (12) and an output layer (13), each composed of a plurality of units that perform signal processing corresponding to a neuron. The learning processing unit (20) applies learning processing to the signal processing unit (10): based on the error information $\delta_{pj}$ between the output values $O_{pj}$ of the output layer for an input signal pattern $p$ supplied to the input layer (11) and the desired output values given as the teacher signal $t_{pj}$, it repeatedly calculates the coupling strength coefficients $W_{ji}$ between the units in sequence from the output layer (13) side toward the input layer (11) side, and thus learns the coupling coefficients $W_{ji}$ in accordance with the back-propagation learning rule.

The learning processing unit (20) learns the coupling coefficients $W_{ji}$ while increasing the number of units in the intermediate layer (12) of the signal processing unit (10), and has a control function for increasing the number of units in the intermediate layer (12) in the course of learning the coupling coefficients $W_{ji}$. For example, for a signal processing unit (10) whose input layer (11), intermediate layer (12) and output layer (13) are composed, as shown in FIG. 2A, of arbitrary numbers x, y and z of units ($u_{I1}$ to $u_{Ix}$), ($u_{H1}$ to $u_{Hy}$) and ($u_{O1}$ to $u_{Oz}$), each corresponding to a neuron, it learns the coupling coefficients $W_{ji}$ while successively increasing the number of units in the intermediate layer (12) from y to (y + m), as shown in FIG. 2B.
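Growing the intermediate layer by one unit can be sketched as follows for a plain three-layer network stored as nested lists. The function name `add_hidden_unit`, the weight layout and the initialization of the new coefficients with small random values are all assumptions; the text does not say how the coefficients of an added unit are initialized.

```python
import random


def add_hidden_unit(w_hidden, w_out, rng=None, scale=0.1):
    """Return copies of the weights with one more intermediate unit.

    w_hidden[j][i]: coefficient from input unit i to intermediate unit j.
    w_out[k][j]:    coefficient from intermediate unit j to output unit k.
    The new unit gets random coefficients in [-scale, scale] (an assumed
    choice); all coefficients learned so far are kept unchanged.
    """
    rng = rng or random.Random(0)
    n_inputs = len(w_hidden[0])
    new_row = [rng.uniform(-scale, scale) for _ in range(n_inputs)]
    grown_hidden = [row[:] for row in w_hidden] + [new_row]
    grown_out = [row[:] + [rng.uniform(-scale, scale)] for row in w_out]
    return grown_hidden, grown_out
```

Calling this function m times takes the intermediate layer from y to (y + m) units while the learned coefficients of the existing units carry over.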

The control that increases the number of units in the intermediate layer (12) may be performed periodically in the course of learning the coupling coefficients $W_{ji}$, or it may be performed each time the occurrence of a local minimum state is detected.
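The two triggering policies can be expressed as a small predicate. The name `should_add_unit` and its parameters are hypothetical, and combining both policies in one function is a choice made only for illustration.

```python
def should_add_unit(epoch, period=None, local_minimum_detected=False):
    """Decide whether to add an intermediate unit at this point in learning.

    period: if given, add a unit every `period` learning passes (periodic policy).
    local_minimum_detected: result of some external detection step
    (detection-based policy).
    """
    if period is not None and epoch > 0 and epoch % period == 0:
        return True
    return local_minimum_detected
```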

The learning processing unit (20), which has the control function for increasing the number of units in the intermediate layer (12) in the course of learning the coupling coefficients $W_{ji}$, learns the coupling coefficients $W_{ji}$ of a signal processing unit (10) configured as a three-layer neural network with an input layer (11), an intermediate layer (12) and an output layer (13) while increasing the number of units in the intermediate layer (12). Consequently, even if a local minimum state occurs in the course of learning the coupling coefficients $W_{ji}$, adding units to the intermediate layer (12) lets the learning escape from that local minimum state and converge quickly and reliably to the global minimum state.

The learning processing unit (20) with this control function was tested on a signal processing unit (100) configured, as shown in FIG. 3, as a three-layer neural network with an input layer ($L_I$), an intermediate layer ($L_H$) and an output layer ($L_O$) made up of arbitrary numbers x, y and z of units ($u_{I1}$ to $u_{Ix}$), ($u_{H1}$ to $u_{Hy}$) and ($u_{O1}$ to $u_{Oz}$), each corresponding to a neuron. Each unit of the intermediate layer ($L_H$) and the output layer ($L_O$) is provided with delay means, and the network is a recurrent network that includes a loop (LP), in which a unit's output value $O_j(t)$ is fed back as its own input via the delay means, and feedback (FB), in which that output is fed to other units as input.

In the experiment, the number of units in the input layer ($L_I$) was 8 (x = 8), the number of units in the output layer ($L_O$) was 3 (z = 3), and the number of delay means in each layer was 2. Twenty-one spatiotemporal patterns with l = 8 × 7 were used as the input signal patterns $p$ for learning. Following the processing algorithm shown in the flowchart of FIG. 4, learning was started with 3 units in the intermediate layer ($L_H$) (y = 3), and experiments in which units were added to the intermediate layer ($L_H$) during learning were carried out repeatedly. By adding units to the intermediate layer ($L_H$) 3 to 5 times, every learning experiment converged to the global minimum state without being trapped in a local minimum state.

FIG. 5 shows one example of the results of this experiment. Units were added to the intermediate layer ($L_H$) at the times marked with arrows in the figure, increasing the number of units in the intermediate layer ($L_H$) from 3 to 6, and the learning converged to the global minimum state. In FIG. 5, the vertical axis indicates the sum of squared errors LMS and the horizontal axis indicates the number of learning iterations.

The processing algorithm shown in the flowchart of FIG. 4 is as follows.

In this processing algorithm, first, in step 1, a variable K, which counts the processing passes for detecting a local minimum state, is initialized to 0, and a first variable Lms, used to judge the convergence condition of the learning processing, is initialized to 1000000000.

In the next step 2, a variable n, which indicates the number of learning passes over the full set of learning patterns, that is, the l input signal patterns $p$, is initialized to 0. The process then moves to step 3, where learning processing of the l input signal patterns $p$ is performed.

In the next step 4, the variable n indicating the number of learning passes is checked. If n is not 3, the process moves to step 5, sets n = n + 1, and returns to step 3 to repeat the learning processing; when n reaches 3, the process moves to step 6.

In step 6, the value of the first variable Lms is saved as the value of a second variable Lms(-1) used to judge the convergence condition of the learning processing. The sum of squared errors between the teacher signal and the output signal at each unit is then calculated by Equation 8, and this value becomes the new value of the first variable Lms.

$$L_{ms} = \sum_p \sum_j (t_{pj} - O_{pj})^2 \quad (8)$$

In the next step 7, the first variable Lms is compared with the second variable Lms(-1). If the value of Lms is smaller than the value of Lms(-1), the process moves to step 8, where it is determined whether the variable K, which counts the processing passes for detecting a local minimum state, is 0.

In step 8, if the variable K is 0, the process returns directly to step 2. If K is not 0, K is set to K + 1 in step 9, and the process then returns to step 2, where n is set to 0, and the learning processing of the l input signal patterns $p$ is performed in step 3.

If, in step 7, the value of the first variable Lms is larger than the value of the second variable Lms(-1), the process moves to step 10, where the variable K is set to K + 1, and it is then determined in step 11 whether the value of K is 2.

In step 11, if the value of K is not 2, the process returns directly to step 2. If K is 2, the learning is judged to have fallen into a local minimum state, and control to add a unit to the intermediate layer ($L_H$) is performed in step 12. K is then set to 0 in step 13, and the process returns to step 2, where n is set to 0, and the learning processing of the l input signal patterns $p$ is performed in step 3.
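The control flow of steps 1 to 13 can be sketched as a loop that takes the actual learning, error measurement and unit addition as callables. Everything named here is hypothetical: `run_growth_schedule`, its three callable parameters and the `rounds` cap are assumptions, since the text describes no stopping rule for the outer loop. The sketch also treats an unchanged error the same as an increased one, which the text does not specify.

```python
def run_growth_schedule(train_all_patterns, total_squared_error,
                        add_hidden_unit, rounds):
    """Follow the FIG. 4 flowchart for a fixed number of outer rounds.

    train_all_patterns():  one learning pass over all patterns (step 3).
    total_squared_error(): current value of Equation 8 (step 6).
    add_hidden_unit():     add one intermediate unit (step 12).
    Returns the last error value and the number of units added.
    """
    k = 0                      # step 1: counter for detecting a local minimum
    lms = 1_000_000_000.0      # step 1: first variable Lms
    units_added = 0
    for _ in range(rounds):
        n = 0                              # step 2
        while True:
            train_all_patterns()           # step 3
            if n == 3:                     # step 4
                break
            n += 1                         # step 5
        lms_prev = lms                     # step 6: keep old value as Lms(-1)
        lms = total_squared_error()        # step 6: new value from Equation 8
        if lms < lms_prev:                 # step 7: error decreased
            if k != 0:                     # step 8
                k += 1                     # step 9, as stated in the text
        else:                              # step 7: error did not decrease
            k += 1                         # step 10
            if k == 2:                     # step 11: local minimum assumed
                add_hidden_unit()          # step 12
                units_added += 1
                k = 0                      # step 13
    return lms, units_added
```

With an error sequence that rises in two consecutive rounds, the sketch adds one unit and resets the counter, matching the judgment made in steps 11 to 13.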

In the signal processing unit (100) shown in FIG. 3, for an input signal pattern $p$ supplied to the units ($u_{I1}$ to $u_{Ix}$) of the input layer ($L_I$), each unit ($u_{H1}$ to $u_{Hy}$) of the intermediate layer ($L_H$) has a total input $net_j$ given by Equation 9. Only the general form of this equation survives in the source text: a double summation of coupling coefficients multiplied by delayed outputs, plus the threshold $\theta_j$.

$$net_j = \sum \sum W \cdot O(t - n) + \theta_j \quad (9)$$ [subscripts and summation limits illegible in the source]

For this total input $net_j$, the unit produces the output value $O_{Hj}(t)$ given by the sigmoid function of Equation 10 (the formula itself is not reproduced in the source text).

Furthermore, each unit ($u_{O1}$ to $u_{Oz}$) of the output layer ($L_O$) has a total input $net_j$ given by Equation 11, of which only the trailing threshold term $+\,\theta_j$ survives in the source text, and produces the corresponding output value for this total input (the output equation is not reproduced in the source text).

Here, $\theta_j$ is a threshold value, and $N_I$, $N_H$ and $N_O$ denote the number of delay means in the layers ($L_I$), ($L_H$) and ($L_O$), respectively.

H. Comparative Examples

[Comparative Example 1]

When learning experiments were carried out on the signal processing unit (100) shown in FIG. 3 with the number of units in the intermediate layer ($L_H$) fixed at 6 (y = 6), the learning processing had to be repeated an extremely large number of times to converge to the global minimum state, which took a great deal of time. Moreover, in 3 of 8 learning experiments the learning fell into a local minimum state without converging to the global minimum state.

FIG. 6 shows one example of the experimental results in which the learning experiment of Comparative Example 1 fell into a local minimum state.

In FIG. 6, the vertical axis indicates the sum of squared errors LMS and the horizontal axis indicates the number of learning iterations.

[Comparative Example 2]

When 30 learning experiments were carried out on the signal processing unit (100) shown in FIG. 3 with the number of units in the intermediate layer ($L_H$) fixed at 3 (y = 3), every experiment fell into a local minimum state without converging to the global minimum state, as in the example result shown in FIG. 7.

In FIG. 7 as well, the vertical axis indicates the sum of squared errors LMS and the horizontal axis indicates the number of learning iterations.

I. Effects of the Invention

In the learning processing device according to the present invention, the learning processing unit learns the coupling strength coefficients while increasing the number of units in the intermediate layer. This makes it possible to perform stable learning in accordance with the back-propagation learning rule that avoids local minimum states and converges quickly and reliably to the global minimum state.

[Brief Description of the Drawings]

FIG. 1 is a block diagram conceptually showing the configuration of a learning processing device according to the present invention. FIGS. 2A and 2B are schematic diagrams showing the state of the signal processing unit at the start of learning and partway through learning by the learning processing device. FIG. 3 is a schematic diagram showing the configuration of the neural network of the signal processing unit to which learning was applied by the learning processing device according to the present invention. FIG. 4 is a flowchart showing one example of the learning process performed by the learning processing unit of the learning processing device. FIG. 5 is a characteristic diagram showing one example of the results of a learning experiment with the learning processing unit. FIG. 6 is a characteristic diagram for Comparative Example 1, showing the result of a learning experiment with the number of units in the intermediate layer of the neural network of the signal processing unit shown in FIG. 3 fixed at 6. FIG. 7 is a characteristic diagram for Comparative Example 2, showing the result of a learning experiment with the number of units in the intermediate layer of the same neural network fixed at 3. FIG. 8 is a schematic diagram showing the general configuration of a neural network to which the back-propagation learning rule is applied.

(10), (100) ... signal processing unit
(20) ... learning processing unit
($L_I$) ... input layer
($L_H$) ... intermediate layer
($L_O$) ... output layer
($u_{I1}$ to $u_{Ix}$), ($u_{H1}$ to $u_{Hy}$), ($u_{O1}$ to $u_{Oz}$) ... units

Claims

[Scope of Claims] A learning processing device comprising: a signal processing unit having an input layer, an intermediate layer and an output layer, each composed of a plurality of units that perform signal processing corresponding to a neuron; and a learning processing unit that performs learning processing of coefficients of the strength of coupling between the units by repeatedly calculating those coefficients in sequence from the output layer side toward the input layer side, based on error information between the output values of the output layer for an input signal pattern supplied to the input layer and the desired output values given as a teacher signal, characterized in that the learning processing unit is provided with control means for increasing the number of units in the intermediate layer in the course of learning the coupling strength coefficients, and the learning processing unit performs the learning processing of the coupling strength coefficients while increasing the number of units in the intermediate layer.