JPH08147411A

JPH08147411A - Character recognition device

Info

Publication number: JPH08147411A
Application number: JP6286287A
Authority: JP
Inventors: Kinya Endo; 欽也遠藤; Hideo Tanimoto; 英雄谷本; Shizuo Nagata; 静男永田
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 1994-11-21
Filing date: 1994-11-21
Publication date: 1996-06-07

Abstract

PURPOSE: To provide the character recognition device which can accurately recognize a character even if the character is inputted slantingly. CONSTITUTION: The character recognition device, which extracts plural feature points according to the strokes of the character inputted by writing and compares a feature quantity showing the distribution of segments connecting the feature points on an xy coordinate system with a previously set reference feature quantity to recognize the character inputted by writing, is equipped with a tilt judgement part 7 which judges how much the character is written and inputted slantingly under specific conditions, partial pattern Q value matching (2) 9 which calculates a feature quantity be excluding components generated by the tilt from the distribution of the segments on the xy coordinate system according to the judgement of the tilt judgement part 7.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、筆記入力される文字の
ストロークから特徴量を算出し、これに基づいて的確な
認識を行う文字認識装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a character recognition device for calculating a feature amount from a stroke of a character input by handwriting and performing accurate recognition based on the feature amount.

【０００２】[0002]

【従来の技術】タブレット等の入力手段により筆記入力
した文字を認識するには、一般的にパターンマッチング
方式が行われている。このパターンマッチング方式で
は、筆記入力された文字のストローク（ペンオンからペ
ンオフまでの筆記部分）の座標データ列より特徴点を抽
出し、その抽出した特徴点の情報を予め同一方法で特徴
点抽出して登録しておいた基準のパターン（以下、登録
パターンと言う。）とマッチングさせ、適合するものを
認識結果として得ている。2. Description of the Related Art A pattern matching method is generally used to recognize a character written and input by an input means such as a tablet. In this pattern matching method, a feature point is extracted from the coordinate data string of the stroke of a character (pen-on to pen-off) of a character input by handwriting, and the extracted feature point information is extracted in advance by the same method. Matching with a registered reference pattern (hereinafter referred to as a registered pattern), a matching pattern is obtained as a recognition result.

【０００３】このパターンマッチング方式では、筆記入
力された文字のストロークを登録パターンのどのストロ
ークとマッチングすべきかの処理に多大な時間を要す
る。また、全体の字のバランスが乱れている場合、例え
ば、「河」という文字における偏の部分が小さく、つく
りの部分が大きい場合には、正確なマッチング結果が得
られないという不都合が生じる。そこで、このような筆
記文字の変形に対応する観点から、特開昭６２−２２９
３８４号公報に示されるオンライン文字認識装置が開示
されている。In this pattern matching method, it takes a lot of time to process which stroke of a registered pattern the stroke of a character input by handwriting should be matched with. Further, when the balance of the entire characters is disturbed, for example, when the uneven portion of the character "kawa" is small and the portion of the character is large, there is a disadvantage that an accurate matching result cannot be obtained. Therefore, from the viewpoint of coping with such deformation of the written characters, Japanese Patent Laid-Open No. 229/1987.
An online character recognition device disclosed in Japanese Patent No. 384 is disclosed.

【０００４】このオンライン文字認識装置では、筆記入
力した文字のストローク数により大分類を行い、筆記上
一連のものとして筆記する部分を部分パターンとする。
そして、この部分パターンの重心間のベクトルにより中
分類を行い、部分パターンの特徴パラメータとしてのＱ
値なる値およびストロークコード分布なる値を用いてマ
ッチングを行う。これにより、文字変形に対応し、しか
も処理量の少ない文字認識を行うようにしている。In this on-line character recognition device, large classification is performed according to the number of strokes of a character input by handwriting, and a part to be written as a series of handwriting is defined as a partial pattern.
Then, the middle classification is performed by the vector between the centers of gravity of the partial patterns, and Q as the characteristic parameter of the partial patterns is
Matching is performed using a value that is a value and a value that is a stroke code distribution. As a result, character recognition is performed in response to character deformation and with a small amount of processing.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、このよ
うな文字認識装置には次のような問題がある。すなわ
ち、マッチングのための情報である特徴パラメータとし
てのＱ値は、各セグメントのｘｙ座標系における±ｘ方
向成分、±ｙ方向成分がｘｙ方向のどの位置に分布して
いるかを示すものである。例えば「土」という文字の各
ストロークにおける基準のＱ値は、各ストロークの＋ｘ
方向成分および−ｙ方向成分のみ構成されており、マッ
チングを行う場合には、この基準となるＱ値と筆記入力
された文字のＱ値とを比較してその差によりマッチング
したか否かを判定している。このため、筆記入力された
文字に傾きが生じていると、基準となるＱ値には存在し
ない方向成分が算出されてしまい、的確なマッチングが
困難となってしまう。However, such a character recognition device has the following problems. That is, the Q value as the characteristic parameter, which is information for matching, indicates at which position in the xy direction the ± x direction component and the ± y direction component in the xy coordinate system of each segment are distributed. For example, the standard Q value for each stroke of the character "Sat" is + x of each stroke.
Only the direction component and the -y direction component are configured, and when matching is performed, the reference Q value and the Q value of the character written and input are compared, and it is determined whether or not matching is performed based on the difference. are doing. For this reason, when a character input by handwriting is tilted, a direction component that does not exist in the reference Q value is calculated, and accurate matching becomes difficult.

【０００６】例えば、「土」という文字を右肩上がりで
筆記した場合には、基準のＱ値にはない＋ｙ方向成分が
生じてしまう。また、「土」という文字を右傾きで筆記
した場合には、基準のＱ値にはない−ｘ方向成分が生じ
てしまう。このように、文字が傾いて記入されると、正
確な文字認識が困難となり、認識率の低下を招くことに
なる。[0006] For example, when the character "Sat" is written upwards, a + y direction component which is not in the standard Q value is generated. Further, when the character "Sat" is written with a right tilt, a component in the -x direction that is not in the reference Q value occurs. As described above, when the characters are written with an inclination, accurate character recognition becomes difficult and the recognition rate is lowered.

【０００７】[0007]

【課題を解決するための手段】本発明は、このような課
題を解決するために成された文字認識装置である。すな
わち、本発明は、筆記入力された文字のストロークに応
じて複数の特徴点を抽出し、その特徴点間をつなぐセグ
メントにおけるｘｙ座標系での分布を示す特徴量と、予
め設定されている基準特徴量とを比較することにより、
筆記入力された文字の認識を行う文字認識装置であり、
筆記入力される文字の入力の際の傾き具合を所定の条件
に基づき判断する傾斜判断部と、この傾斜判断部での判
断に基づきセグメントにおけるｘｙ座標系での分布のう
ち傾きにより生じる成分を除外して特徴量を算出する特
徴量算出部とを備えた構成となっている。SUMMARY OF THE INVENTION The present invention is a character recognition device made to solve such problems. That is, according to the present invention, a plurality of feature points are extracted according to strokes of a character that is written and input, and a feature amount indicating a distribution in an xy coordinate system in a segment connecting the feature points and a preset reference value. By comparing with the feature quantity,
It is a character recognition device that recognizes written characters.
An inclination determining unit that determines the degree of inclination when inputting a character to be written and written based on a predetermined condition, and excludes a component caused by the inclination from the distribution in the xy coordinate system in the segment based on the determination by the inclination determining unit. And a feature amount calculation unit that calculates the feature amount.

【０００８】[0008]

【作用】本発明では、傾斜判断部により筆記入力される
文字の入力の際の傾き具合を判断し、特徴量算出部によ
りこの傾き具合に基づいた特徴量の算出を行っている。
すなわち、特徴量算出部では、筆記入力される文字のセ
グメントにおけるｘｙ座標系での分布のうち、傾きによ
り生じる成分を除去して特徴量の算出を行っているた
め、傾きが無く筆記入力された場合と同等な特徴量を得
ることができるようになる。これによって、不要な成分
を含むことなく算出した特徴量と予め設定されている基
準特徴量とを的確に比較することができるようになる。In the present invention, the inclination determining section determines the inclination degree when a character to be written is input, and the characteristic amount calculating section calculates the characteristic amount based on the inclination degree.
That is, since the feature amount calculation unit calculates the feature amount by removing the component caused by the inclination in the distribution in the xy coordinate system of the segment of the character to be written and input, the input is performed without inclination. It becomes possible to obtain a feature amount equivalent to the case. This makes it possible to accurately compare the calculated feature amount without including unnecessary components and the preset reference feature amount.

【０００９】[0009]

【実施例】以下に、本発明の文字認識装置における実施
例を図に基づいて説明する。図１は本発明の第１実施例
における文字認識装置を説明するブロック図、図２は本
発明第２実施例における文字認識装置を説明するブロッ
ク図、図３は本発明の第３実施例における文字認識装置
を説明するブロック図である。各実施例における文字認
識装置では、タブレット１、前処理部２、特徴抽出部
３、ストロークコード化部４、大分類部５、中分類部
６、傾斜判断部７、部分パターンＱ値マッチング部
８、部分パターンＱ値マッチング部９、部分パターン
ストロークコードマッチング部１０、出力端子１１を備
える点で共通するが、各傾斜判断部７の内部構成におい
て相違する。なお、これら共通する点については第１実
施例の説明で行い、第２、第３実施例では相違する点を
中心とした説明を行う。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS An embodiment of the character recognition device of the present invention will be described below with reference to the drawings. 1 is a block diagram illustrating a character recognition device according to a first embodiment of the present invention, FIG. 2 is a block diagram illustrating a character recognition device according to a second embodiment of the present invention, and FIG. 3 is a third embodiment of the present invention. It is a block diagram explaining a character recognition device. In the character recognition device according to each embodiment, the tablet 1, the preprocessing unit 2, the feature extraction unit 3, the stroke coding unit 4, the large classification unit 5, the middle classification unit 6, the inclination determination unit 7, and the partial pattern Q value matching unit 8 are provided. , The partial pattern Q-value matching unit 9, the partial pattern stroke code matching unit 10, and the output terminal 11 are common, but the internal configurations of the respective inclination determination units 7 are different. The common points will be described in the description of the first embodiment, and the different points will be mainly described in the second and third embodiments.

【００１０】先ず、各実施例の説明を行うに先立ち、文
字の傾きの状態を説明する。図４は文字の傾きの状態を
説明する図であり、「土」という文字を例とした各態様
を（ａ）〜（ｃ）に示している。図４（ａ）に示す態様
は「土」を標準字体で入力した場合の各ストロークを示
している。この場合は、ＡストロークからＣストローク
までの３つのストロークがｘｙ座標系上に示されてお
り、特徴量（以下、Ｑ値と言う。）はＡ、Ｂストローク
では＋ｘ方向成分のみ、Ｃストロークでは−ｙ方向成分
のみで構成される。これが基準Ｑ値となって予め設定さ
れている。First, before describing each embodiment, the state of inclination of a character will be described. FIG. 4 is a diagram for explaining the state of inclination of the characters, and (a) to (c) show respective modes in which the character “soil” is taken as an example. The mode shown in FIG. 4A shows each stroke when "Sat" is input in the standard font. In this case, three strokes from the A stroke to the C stroke are shown on the xy coordinate system, and the feature amount (hereinafter referred to as the Q value) is only the + x direction component in the A and B strokes, and the C stroke in the C stroke. -Y direction component only. This is the reference Q value and is preset.

【００１１】図４（ｂ）に示す態様は「土」を右肩上が
りで筆記入力した場合の各ストロークを示している。こ
の場合は、図４（ａ）に示す標準の場合と比べてＡ、Ｂ
ストロークに＋ｙ方向成分が加わった状態となってい
る。また、図４（ｃ）に示す態様は「土」を右傾きで筆
記入力した場合の各ストロークを示しており、この場合
は、図４（ａ）に示す標準の場合と比べてＣストローク
に−ｘ方向成分が加わった状態となっている。このよう
に、文字が傾く場合には標準の字体に対して特有の方向
成分が加わる状態となる。The mode shown in FIG. 4 (b) shows each stroke when "Sat" is input by writing in an upward slope. In this case, compared to the standard case shown in FIG.
It is in the state where the + y direction component is added to the stroke. In addition, the mode shown in FIG. 4C shows each stroke when “Sat” is written with a right inclination, and in this case, the stroke is C stroke compared to the standard case shown in FIG. 4A. It is in a state where the −x direction component is added. In this way, when a character is tilted, a specific direction component is added to the standard font.

【００１２】各実施例における文字認識装置では、この
ように傾斜して筆記入力された文字を所定の条件で判別
し、傾斜により生じる成分を除去した状態でＱ値の算出
を行い、基準Ｑ値とのマッチングを行うようにする点に
特徴がある。先ず、第１実施例における文字認識装置の
構成を順に説明する。図１に示すタブレット１は、ペン
等を接触移動させることで所望の文字を筆記入力するた
めの手段であり、ここで得た筆記データ列｛（ｘi ，ｙ
i ）、i=１，２，…ｎj ｝j を前処理部２へ送る構成と
なっている。図５は前処理・特徴点抽出を説明する図で
ある。タブレット１にて得られる筆記データ列は図５
（ａ）に示すような生データとして前処理部２へ送られ
る。前処理部２は、タブレット１から送られた筆記デー
タ列に対してノイズ除去処理、移動平均処理、あるいは
平滑化処理を施して特徴抽出部３へ送る構成となってい
る。平滑後は図５（ｂ）に示すようなデータの状態とな
る。In the character recognition device in each of the embodiments, the characters which have been written with inclination as described above are discriminated under a predetermined condition, and the Q value is calculated in a state where the components caused by the inclination are removed, and the reference Q value is obtained. It is characterized in that it is matched with. First, the configuration of the character recognition device in the first embodiment will be described in order. The tablet 1 shown in FIG. 1 is a means for writing and inputting a desired character by touching and moving a pen or the like, and the writing data string {(xi, y
i), i = 1, 2, ... Nj} j are sent to the preprocessing unit 2. FIG. 5 is a diagram for explaining preprocessing / feature point extraction. The writing data string obtained on the tablet 1 is shown in FIG.
It is sent to the preprocessing unit 2 as raw data as shown in (a). The pre-processing unit 2 is configured to perform noise removal processing, moving average processing, or smoothing processing on the writing data string sent from the tablet 1 and send it to the feature extraction unit 3. After smoothing, the data state is as shown in FIG.

【００１３】また、特徴抽出部３は、前処理部２から送
られたデータを用いて特徴点の抽出を行う。これによ
り、図５（ｃ）に示すような特徴点の抽出が行われる。
例えば、この特徴点の抽出を行う方法としては、平滑化
されたデータ列｛（ｘi ，ｙi ）、i=１，２，…ｎj ｝
j のデータ間のｘ、ｙ方向のサイン（正、負、０の符
号）を（１）式により算出し、そのサインの状態の変化
点を特徴点として抽出するようにする。The feature extraction unit 3 also extracts feature points using the data sent from the preprocessing unit 2. As a result, the feature points shown in FIG. 5C are extracted.
For example, as a method of extracting this feature point, a smoothed data string {(xi, yi), i = 1, 2, ... Nj}
The sign (positive, negative, sign of 0) in the x and y directions between the data of j is calculated by the equation (1), and the change point of the state of the sign is extracted as the feature point.

【００１４】[0014]

【数１】 [Equation 1]

【００１５】すなわち、データ間のｘ、ｙ方向のサイン
ＸＳi 、ＹＳi を求めて＋、０、−で表現し、各データ
間のサインを前データ間のサインと比較して同じであれ
ば特徴点として登録せず、異なった場合に状態が変化し
たとして特徴点としての登録を行う。また、このように
して求めた特徴点の他に、始点、終点を加えた点を特徴
点として示す。このようにして求められた特徴点情報
は、図１に示すストロークコード化部４および中分類部
６へ出力される。That is, the signs XSi and YSi in the x and y directions between the data are obtained and expressed by +, 0 and-, and if the signs between the respective data are compared with the signs between the previous data, they are characteristic points. Is registered as a feature point, assuming that the state has changed when different. Further, in addition to the feature points thus obtained, points to which a start point and an end point are added are shown as feature points. The feature point information obtained in this way is output to the stroke encoding unit 4 and the middle classification unit 6 shown in FIG.

【００１６】ストロークコード化部４では、特徴抽出部
３により得られた特徴点情報に基づき、各ストロークを
コード化する。一般的には、例えば各セグメントのＸ、
Ｙサイン、セグメントの角度およびセグメント間の回転
角度により分類し、コード化を行う。図６はストローク
コード化処理を説明する図であり、θ1 、θ2 、θ3は
セグメントの角度（＋ｘ方向となす角度）、θ1 ^-、θ
2 ^-は隣合うセグメント間の回転角度を示している。こ
のようにしてコード化されたストロークデータは、大分
類部５および部分パターンストロークコードマッチング
部１０へ出力される。The stroke coding unit 4 codes each stroke based on the characteristic point information obtained by the characteristic extraction unit 3. Generally, for example, X of each segment,
Classification is performed according to the Y-sign, the angle of the segment, and the rotation angle between the segments, and coding is performed. Figure 6 is a diagram for explaining a stroke coding process, θ1, θ2, (angle between the + x direction) .theta.3 segment angle, θ1 ^-, θ
2 ^- shows the rotation angle between adjacent segments. The stroke data encoded in this way is output to the large classification unit 5 and the partial pattern stroke code matching unit 10.

【００１７】大分類部５では、ストロークコード化部４
からの出力を受け、ストローク数によって対象文字に対
する大分類を行う。このため、予め画数（ストローク
数）毎にその画数となり得る文字を図７に示すような文
字辞書に用意しておく。例えば、筆記入力された文字パ
ターンのストローク数が１０画であった場合、文字辞書
に格納されているものの内の、１０画となり得る文字
（「唖」、「挨」、「逢」…）を候補文字として選択す
る。なお、本実施例中で示す「唖」は図７のＡで示され
る文字を表すものとする。In the large classification unit 5, the stroke coding unit 4
It receives the output from and classifies the target character according to the number of strokes. Therefore, for each stroke number (stroke number), a character that can be the stroke number is prepared in advance in a character dictionary as shown in FIG. For example, if the stroke number of the character pattern input by handwriting is 10 strokes, characters that can be 10 strokes among those stored in the character dictionary (“mush”, “dust”, “a” ...) Select as a candidate character. In addition, the "mute" shown in this embodiment represents the character shown by A in FIG.

【００１８】また、中分類部６では、大分類部５にて画
数により大分類して得た候補文字を、以下に説明する部
分パターン間ベクトルによってさらに中分類する処理を
行う。ここで、部分パターンとは、１つの文字のうち
筆記上一連のものとして筆記する部分をいうものとし、
部分パターン間ベクトルとは、１つの部分パターンの重
心と他の部分パターンの重心とを各々始点、終点とする
ベクトルをいうものとする。Further, the middle classifying section 6 performs a further middle classifying process on the candidate characters obtained by the large classifying section 5 according to the number of strokes, by the partial pattern vectors described below. Here, the partial pattern refers to a portion of one character that is written as a series of characters in writing,
The inter-partial-pattern vector is a vector whose start point and end point are the center of gravity of one partial pattern and the center of gravity of another partial pattern, respectively.

【００１９】部分パターン間ベクトルを算出するには、
先ず、部分パターン中の各セグメントのｘ、ｙ方向成分
を（ｄｘ1 ，ｄｙ1 ）とした場合の、各セグメントの長
さｄＬ1 （ｄＬ1 ＝√（ｄｘ1 ²＋ｄｙ1 ²））を求め
る。また、文字幅ＨＸ、ＨＹで除算することにより正規
化した各セグメントの中心座標を（ｘ1*, ｙ1*）とした
場合、部分パターンの重心位置（Ｘw ，Ｙw ）を（２）
式により求める。To calculate the inter-partition pattern vector,
First, the length dL1 (dL1 = √ (dx1 ² + dy1 ² )) of each segment when the x and y direction components of each segment in the partial pattern are (dx1, dy1) is obtained. If the center coordinates of each segment normalized by dividing by the character widths HX and HY are (x1 *, y1 *), the center of gravity position (Xw, Yw) of the partial pattern is (2).
Calculate by formula.

【００２０】[0020]

【数２】 [Equation 2]

【００２１】このようにして各部分パターンの重心を求
め、１つの部分パターンの重心と他の部分パターンの重
心とをそれぞれ始点、終点として部分パターン間ベクト
ルを求める。なお、ここでは部分パターン間ベクトルは
ｘ方向とｙ方向について各々考えるものとする。In this way, the center of gravity of each partial pattern is obtained, and the vector between partial patterns is obtained with the center of gravity of one partial pattern and the center of gravity of another partial pattern as the starting point and the ending point, respectively. In addition, here, it is assumed that the vectors between the partial patterns are considered in the x direction and the y direction, respectively.

【００２２】図８は部分パターン間ベクトルを説明する
図である。図８（ａ）は文字「唖」を例とした場合の部
分パターン「口」、「亜」の各重心および部分パターン
間ベクトルＢx （口、亜）、Ｂｙ（口、亜）を示してい
る。中分類部６では、この部分パターン間ベクトルによ
り、大分類部５で選択された候補文字を絞り込んで中分
類を行う。例えば、文字「逢」が筆記入力された場合を
考えると、先ず、「逢」は１０画であるため、図７で示
す文字辞書の１０画となる部分を参照し、候補文字を選
択する。ここで第１番目に配置されている「唖」の部分
パターン、部分パターンの筆記順によるカット位置、お
よび部分パターン間ベクトル値を読み出して、筆記入力
された「逢」との比較を行う。また、文字辞書には
「挨」、「逢」…の順に同様な情報が並んでおり、中分
類部６はこの文字順に従って比較を行い、候補とすべき
か否かをそれぞれ判定する。FIG. 8 is a diagram for explaining vectors between partial patterns. FIG. 8A shows the center of gravity of each of the partial patterns “mouth” and “sub” and the inter-partial-pattern vectors Bx (mouth, sub) and By (mouth, sub) in the case where the character “mute” is taken as an example. . The middle classification unit 6 narrows down the candidate characters selected by the large classification unit 5 based on the vector between the partial patterns to perform middle classification. For example, considering the case where the character "Ai" is written and input, first, since "Ai" has 10 strokes, the candidate character is selected by referring to the 10 strokes in the character dictionary shown in FIG. Here, the partial pattern of the "mute" arranged at the first position, the cut position in the writing order of the partial patterns, and the vector value between partial patterns are read out and compared with the "a" that has been manually input. Similar information is lined up in the character dictionary in the order of “dust”, “a”, ... The middle classifying unit 6 makes comparisons according to this character order to determine whether or not they should be candidates.

【００２３】例えば、筆記入力された文字「逢」に対し
て比較の対象が「唖」であった場合、文字辞書にかかれ
ている「唖」のカット位置に基づいた部分パターン間ベ
クトルＢx 、Ｂy と筆記入力された文字「逢」を「唖」
と同じカット位置で分けた場合の部分パターン間ベクト
ルＢx*、Ｂy*（図７（ｂ）参照）との差であるマッチン
グ距離ｄvec を（３）式により求める。For example, when the comparison target for the handwritten character "Ai" is "mute", the partial pattern vectors Bx, By based on the cut position of "mute" written in the character dictionary. The character "A" that is written in
The matching distance dvec, which is the difference between the inter-partial-pattern vectors Bx * and By * (see FIG. 7B) when divided at the same cut position as, is calculated by the equation (3).

【００２４】[0024]

【数３】 (Equation 3)

【００２５】一般的に、筆記入力した文字の部分パター
ンが複数ある場合には、その部分パターン数ＢＰＮで正
規化を行い、マッチング距離ｄvec を求める。この場合
のマッチング距離ｄvec は（４）式により算出する。In general, when there are a plurality of partial patterns of characters written and input, the matching distance dvec is calculated by normalizing the partial pattern number BPN. The matching distance dvec in this case is calculated by the equation (4).

【００２６】[0026]

【数４】 [Equation 4]

【００２７】そして、ここで算出されたマッチング距離
ｄvec と所定のしきい値ＶＥＣＲＥＪとを比較して、マ
ッチング距離ｄvec がしきい値ＶＥＣＲＥＪよりも大き
い場合には、筆記入力した文字が対象文字（この場合、
「唖」）でないとし、次の候補文字における部分パター
ン間ベクトルでのマッチングを行う。また、マッチング
距離ｄvec がしきい値ＶＥＣＲＥＪ以下となった場合に
は、筆記入力した文字が対象文字（「唖」）であるらし
いと判断し、後に説明する部分パターンＱ値の算出およ
びマッチングを行う。Then, the matching distance dvec calculated here is compared with a predetermined threshold value VECREJ. If the matching distance dvec is larger than the threshold value VECREJ, the handwritten character is the target character (this If
If it is not a "mute"), matching is performed with the partial pattern vector for the next candidate character. If the matching distance dvec is less than or equal to the threshold value VECREJ, it is determined that the character input by handwriting seems to be the target character (“mute”), and calculation and matching of the partial pattern Q value described later are performed. .

【００２８】中分類部６における候補文字の中分類を施
した後は、傾斜判断部７による処理を行う。第１実施例
における文字認識装置では、この傾斜判断部７が文字種
設定部７１と、文字種によるＱ値設定部７２と、ＱＦＬ
Ｇ判定部７３とから構成されている。文字種設定部７１
では、筆記入力される文字の種類に応じて例えば「漢
字」、「英字」、「数字」、「ひらがな」、「カタカ
ナ」、「記号」から成る６つの種類の信号を出力する。After the middle classification of the candidate characters in the middle classification unit 6, the inclination judgment unit 7 performs the processing. In the character recognition device according to the first embodiment, the inclination determination unit 7 includes a character type setting unit 71, a character type Q value setting unit 72, and a QFL.
It is composed of a G determination unit 73. Character type setting unit 71
Then, six types of signals consisting of, for example, "Kanji", "English characters", "Numerals", "Hiragana", "Katakana", and "Symbols" are output according to the type of characters input by handwriting.

【００２９】例えば、図９に示すようなタブレット１上
には筆記エリア１０と選択エリア１２とが設けられてお
り、この選択エリア１２内に「漢字」、「英字」、「数
字」、「ひらがな」、「カタカナ」、「記号」に対応し
た６つの選択部１２ａ〜１２ｆが設けられている。文字
種を選択するには、筆記入力する前にこの６つの選択部
１２ａ〜１２ｆのうち筆記入力する文字に対応するもの
を１つを選択する。例えば、漢字であれば「漢字」と表
示されている選択部１２ａをペンで押すことにより選択
する。この時、ペンで指定した位置の座標値（ｘ，ｙ）
をタブレット１から文字種設定部７１へ出力し、文字種
設定部７１ではこの座標値を受けて各選択部１２ａ〜１
２ｆの位置に応じた文字種設定フラグを文字種によるＱ
値設定部７２へ出力する。For example, a writing area 10 and a selection area 12 are provided on the tablet 1 as shown in FIG. 9, and in this selection area 12, "Kanji", "English characters", "Numerals", and "Hiragana" are provided. , "Katakana", and "symbols" are provided in the six selectors 12a to 12f. To select the character type, one of the six selection units 12a to 12f corresponding to the character to be written and input is selected before writing and input. For example, if it is a kanji character, it is selected by pushing the selection part 12a displayed as "kanji" with a pen. At this time, the coordinate value (x, y) of the position specified by the pen
Is output from the tablet 1 to the character type setting unit 71, and the character type setting unit 71 receives the coordinate values and selects each of the selection units 12a to 12a.
Set the character type setting flag according to the position of 2f by Q depending on the character type.
The value is output to the value setting unit 72.

【００３０】文字種によるＱ値設定部７２では、文字種
設定フラグを受けてその値に応じてＱ値における所定成
分削除の有無を判別し、Ｑ値設定フラグ（ＱＦＬＧ）を
設定する処理を行う。例えば、漢字に対応する文字種設
定フラグを受けた場合には、Ｑ値の−ｘ方向成分および
＋ｙ方向成分を削除し、それ以外の時（英字、数字、ひ
らがな、カタカナ、記号の場合）は削除しないとした場
合にはＱＦＬＧ＝１を、これ以外の場合にはＱＦＬＧ＝
０を出力する。The character type Q value setting unit 72 receives the character type setting flag, determines whether or not the predetermined component is deleted in the Q value according to the value, and sets the Q value setting flag (QFLG). For example, when a character type setting flag corresponding to a Chinese character is received, the -x direction component and the + y direction component of the Q value are deleted, and at other times (in the case of letters, numbers, hiragana, katakana, and symbols) they are deleted. If not, QFLG = 1; otherwise, QFLG =
Outputs 0.

【００３１】ＱＦＬＧ判定部７３は、文字種によるＱ値
設定部７２からＱＦＬＧを受けて、これに応じた出力先
にこのＱＦＬＧの値を出力する。すなわち、ＱＦＬＧ＝
０の場合はこの値を部分パターンＱ値マッチング部８
へ、ＱＦＬＧ＝１の場合はこの値を部分パターンＱ値マ
ッチング部９へ出力する。The QFLG determination unit 73 receives the QFLG from the character type Q value setting unit 72 and outputs the value of this QFLG to the output destination corresponding thereto. That is, QFLG =
When it is 0, this value is used as the partial pattern Q value matching unit 8
When QFLG = 1, this value is output to the partial pattern Q value matching unit 9.

【００３２】先ず、部分パターンＱ値マッチング部８
での処理を説明する。部分パターンＱ値マッチング部
８では、中分類部６における部分パターン間ベクトルに
よる中分類で残った候補文字について部分パターンＱ値
を算出し、図１０に示す部分パターン辞書中の部分パタ
ーンＱ値とのマッチングを行う。この部分パターン辞書
における部分パターンＱ値は登録パターンにより予め作
成され、格納されているものである。ここで、部分パタ
ーンＱ値とは、各セグメントの長さ、方向および位置を
表す特徴パラメータをいう。部分パターンＱ値は、例え
ば（５）式〜（１２）式により算出する。First, the partial pattern Q value matching unit 8
The processing in will be described. The partial pattern Q value matching unit 8 calculates partial pattern Q values for the candidate characters remaining in the intermediate classification by the inter-partition pattern vector in the intermediate classification unit 6, and compares them with the partial pattern Q values in the partial pattern dictionary shown in FIG. Match. The partial pattern Q value in this partial pattern dictionary is created and stored in advance by the registered pattern. Here, the partial pattern Q value refers to a characteristic parameter indicating the length, direction, and position of each segment. The partial pattern Q value is calculated by, for example, equations (5) to (12).

【００３３】[0033]

【数５】 (Equation 5)

【数６】 (Equation 6)

【数７】 (Equation 7)

【数８】 (Equation 8)

【数９】 [Equation 9]

【数１０】 [Equation 10]

【数１１】 [Equation 11]

【数１２】 (Equation 12)

【００３４】なお、これら式においてΣは全ストロー
ク、全セグメントに関する加算、ＨＸ、ＨＹは文字幅を
示している。また、（５）式〜（１２）式にて求めたＱ
1 〜Ｑ8 は、文字を中心とした四角形領域の左下を原点
に設定した値となっているため、原点近くにあるものは
乗算に供すると「０」となってしまう。そこで、原点を
四角形領域の右上に入れ替えて、（５）式〜（１２）式
と同様な式によりＱ9 〜Ｑ16を求め、Ｑ1 〜Ｑ16までの
合計１６個の値を算出する。In these equations, Σ is the total stroke, addition for all segments, and HX and HY are character widths. In addition, Q obtained by the equations (5) to (12)
Since 1 to Q8 are values in which the origin is set to the lower left of a rectangular area centered on a character, those near the origin become "0" when subjected to multiplication. Therefore, the origin is replaced with the upper right corner of the quadrangular area, Q9 to Q16 are obtained by the same equations as the equations (5) to (12), and a total of 16 values from Q1 to Q16 are calculated.

【００３５】例えば、筆記入力された文字が「逢」であ
る場合で、候補文字に「挨」が残っていた場合、先ず
「挨」の文字のカット位置は図７に示す文字辞書より読
み取る。この場合のカット位置は（３，２，５）であ
り、その部分パターンは図１１（ａ）に示す状態とな
る。この際、「逢」の文字に対しても「挨」のカット位
置を同じ（３，２，５）で分けると図１１（ｂ）に示す
状態となる。そして、この各々の部分パターンにおける
部分パターンＱ値でのマッチングを行う。For example, when the handwritten character is "O" and there is "dust" remaining in the candidate character, the cut position of the character "dust" is first read from the character dictionary shown in FIG. The cut position in this case is (3, 2, 5), and the partial pattern is in the state shown in FIG. At this time, if the “dust” cut position is divided into the same (3, 2, 5) for the character “Ai”, the state shown in FIG. 11B is obtained. Then, matching is performed with the partial pattern Q value in each of the partial patterns.

【００３６】マッチングでは、各部分パターン毎にＱ1
〜Ｑ16の差を算出し、これを合計したものをマッチング
距離ｄBPとし、このマッチング距離ｄBPを（１３）式を
用いて算出する。In matching, Q1 is set for each partial pattern.
The difference between Q16 and Q16 is calculated, and the sum of the differences is used as the matching distance dBP, and this matching distance dBP is calculated using the equation (13).

【００３７】[0037]

【数１３】 (Equation 13)

【００３８】ただし、（１３）式におけるＱi(j)は、第
j 部分パターンのＱ値である。次いで、求めたマッチン
グ距離ｄBPと、先の処理で求めた部分パターン間ベクト
ルでのマッチング距離ｄvec にαvec で重み付けしたも
のとを加算して距離ｄ1 を（１４）式により求める。However, Qi (j) in the equation (13) is
j is the Q value of the partial pattern. Next, the obtained matching distance dBP and the matching distance dvec in the inter-partial-pattern vector obtained in the previous process weighted by αvec are added to obtain the distance d1 by the equation (14).

【００３９】[0039]

【数１４】 [Equation 14]

【００４０】（１４）式におけるαvec としては、例え
ば筆記文字のストローク数が２〜３ストロークの場合は
αvec ＝１／２、４〜２５ストロークの場合にはαvec
＝１、１ストロークの場合にはαvec ＝０とする。この
ような操作を部分パターン間ベクトルによる中分類で残
った全ての候補文字に対して行い、各距離ｄ1 によるソ
ーティングを行う。As αvec in the equation (14), for example, when the number of strokes of the written character is 2-3, αvec = 1/2, and when it is 4-25 strokes, αvec
In the case of = 1 and 1 stroke, αvec = 0. Such an operation is performed for all the candidate characters remaining in the middle classification by the vector between the partial patterns, and sorting is performed at each distance d1.

【００４１】次に、部分パターンＱ値マッチング部９
での処理を説明する。部分パターンＱ値マッチング部
９では、先に説明した部分パターンＱ値マッチング部
８で行った（５）式〜（１２）式でのＱ値算出のうち、
文字が傾斜した際に生じる成分（例えば、−ｘ方向成分
および＋ｙ方向成分）を除いてＱ値の算出を行い、これ
によってマッチングを行う。つまり、−ｘ方向成分およ
び＋ｙ方向成分を除く場合には、Ｑ1 、Ｑ4 、Ｑ5、Ｑ8
と、原点を入れ替えた場合のこれらに対応するＱ9 、
Ｑ12、Ｑ13、Ｑ16の合計８個のＱ値を使用する。Next, the partial pattern Q value matching unit 9
The processing in will be described. In the partial pattern Q value matching unit 9, among the Q values calculated by the expressions (5) to (12) performed by the partial pattern Q value matching unit 8 described above,
The Q value is calculated excluding the components (for example, the −x direction component and the + y direction component) generated when the character is inclined, and the matching is performed by this. That is, when excluding the -x direction component and the + y direction component, Q1, Q4, Q5, Q8
And Q9 corresponding to these when the origin is replaced,
A total of eight Q values of Q12, Q13 and Q16 are used.

【００４２】よって、部分パターンＱ値マッチング部
で求めるマッチング距離ｄBPは、（１５）式を用いて算
出することになる。Therefore, the matching distance dBP obtained by the partial pattern Q-value matching section is calculated using the equation (15).

【００４３】[0043]

【数１５】 (Equation 15)

【００４４】次いで、求めたマッチング距離ｄBPと、先
の処理で求めた部分パターン間ベクトルでのマッチング
距離ｄvec にαvec で重み付けしたものとを加算して距
離ｄ1 を（１４）式により求める。この操作を部分パタ
ーン間ベクトルにより中分類された全ての候補文字に対
して行い、各距離ｄ1 によるソーティングを行う。この
ように、文字が傾斜した際に生じる方向の成分を除去し
たＱ値の算出により、傾きが無く筆記入力された場合と
同等なＱ値を得ることができるようになる。これによっ
て、傾きによる不要な成分を含むことなく算出したＱ値
と予め設定されている基準のＱ値との的確なマッチング
を行うことができるようになる。Then, the obtained matching distance dBP and the matching distance dvec in the partial pattern vector obtained in the previous process weighted by αvec are added to obtain the distance d1 by the equation (14). This operation is performed for all candidate characters that are moderately classified by the inter-partition pattern vector, and sorting is performed at each distance d1. As described above, by calculating the Q value by removing the component in the direction generated when the character is tilted, it is possible to obtain the Q value equivalent to that when the character is input without tilting. As a result, it is possible to perform accurate matching between the calculated Q value without including unnecessary components due to the slope and the preset reference Q value.

【００４５】次に部分パターンストロークコードマッチ
ング部１０での処理を説明する。部分パターンストロー
クコードマッチング部１０では、部分パターンＱ値マッ
チング部８および部分パターンＱ値マッチング部９
からの出力を受け、中分類により絞られた候補文字につ
き、部分パターンストロークコード分布を求める。そし
て、この分布と、登録パターンより予め作成された図１
０に示す部分パターン辞書に格納されている部分パター
ンストロークコード分布とのマッチングを行い、さらに
上位候補の順位付けを行う。Next, the processing in the partial pattern stroke code matching unit 10 will be described. The partial pattern stroke code matching unit 10 includes a partial pattern Q value matching unit 8 and a partial pattern Q value matching unit 9.
The partial pattern stroke code distribution is obtained for the candidate characters narrowed down by the middle classification. Then, FIG. 1 created in advance from this distribution and the registered pattern
Matching is performed with the partial pattern stroke code distribution stored in the partial pattern dictionary indicated by 0, and the higher-ranked candidates are ranked.

【００４６】この順位付けを行う対象の範囲は、例えば
先に説明した距離ｄ1 のソーティングで得られた第１候
補の距離ｄ1 との比率で決める。すなわち、ｄj ／ｄ1
≦ＺＲＡＴＥの候補文字までを対象範囲として順位付け
を行う。The range of objects to be ranked is determined, for example, by the ratio with the distance d1 of the first candidate obtained by sorting the distance d1 described above. That is, dj / d1
Ranking is performed with the candidate character of ≦ ZRATE as the target range.

【００４７】次に、部分パターンストロークコード分布
の算出法について説明する。一例として筆記入力された
文字が「逢」で、第１候補として選ばれた文字が「逢」
であった場合を説明する。先ず、図７に示す文字辞書よ
り、候補文字「逢」のカット位置（３，４，３）を読み
出す。これにより、文字「逢」の部分コードが図１２
（ａ）〜（ｃ）に示すような３つの部分コードに分けら
れることになる。Next, a method of calculating the partial pattern stroke code distribution will be described. As an example, the character written and input is "Ai", and the character selected as the first candidate is "Ai".
The case will be described. First, the cut position (3, 4, 3) of the candidate character "Ai" is read from the character dictionary shown in FIG. As a result, the partial code of the character "Ai" is shown in FIG.
It will be divided into three partial codes as shown in (a) to (c).

【００４８】次いで、この位置で筆記入力された文字の
パターンをカットして、これにより得た部分パターン
と、先に得ている部分パターンとの各々において、スト
ロークコード化部４により得たストロークコードの本数
の分布を算出する。例えば、図１２（ａ）に示す「逢」
の文字の第１部分パターンを見た場合、図１０における
部分パターンストロークコード分布「０１」に対応する
本数が１本、「０３」に対応する本数が１本、「０５」
に対応する本数が１本というようなストロークコード分
布が得られる。そして、このようにして得られた部分パ
ターンストロークコード分布と、予め登録パターンから
同様な手順により算出および平均化して作成された図１
０に示す部分パターンストロークコード分布とをマッチ
ングさせる。Next, the pattern of the character written and input at this position is cut, and the stroke code obtained by the stroke encoding unit 4 is obtained for each of the partial pattern obtained by this and the previously obtained partial pattern. Calculate the distribution of the number of. For example, "Ai" shown in FIG.
When the first partial pattern of the character is viewed, the number of lines corresponding to the partial pattern stroke code distribution “01” in FIG. 10 is 1, the number of lines corresponding to “03” is 1, and the number is “05”.
A stroke code distribution in which the number of lines corresponding to is one is obtained. Then, the partial pattern stroke code distribution obtained in this manner and the registration pattern are calculated and averaged in advance by the same procedure as in FIG.
The partial pattern stroke code distribution indicated by 0 is matched.

【００４９】図１２（ａ）に示す「逢」の文字の第１部
分パターンの場合には、図１０に示すパターン辞書に登
録されている第１部分パターンと対応する部分パターン
ストロークコード分布（「０１」が０．９本、「０２」
が０．１本、「０３」が０．４本、「０４」が０．６
本、「０５」が１本）とマッチングさせる。すなわち、
各部分パターンストロークコード分布の差を各々求め、
各差を加算して得るマッチング距離ｄs を求める。
「逢」の第１部分パターンの場合には、０．１＋０．１
＋０．６＋０．６＝１．４本がマッチング距離ｄs とし
て得られる。同様に、図１２（ｂ）、（ｃ）で示される
第２部分パターン、第３部分パターンに対してもマッチ
ング距離ｄs を算出し、これらを各部分パターンストロ
ークコード数ＢＳで正規化して、合計した距離Ｄs を
（１６）式により求める。In the case of the first partial pattern of the character "Ai" shown in FIG. 12A, the partial pattern stroke code distribution ("" corresponding to the first partial pattern registered in the pattern dictionary shown in FIG. "01" is 0.9, "02"
Is 0.1, "03" is 0.4, and "04" is 0.6
Book, "05" is 1 book). That is,
Find the difference in each partial pattern stroke code distribution,
A matching distance ds obtained by adding each difference is obtained.
0.1 + 0.1 for the first partial pattern of "Ai"
+ 0.6 + 0.6 = 1.4 lines are obtained as the matching distance ds. Similarly, the matching distance ds is calculated for the second partial pattern and the third partial pattern shown in FIGS. 12B and 12C, and these are normalized by the number of partial pattern stroke codes BS to obtain the total. The distance Ds is calculated by the equation (16).

【００５０】[0050]

【数１６】 [Equation 16]

【００５１】この距離Ｄs を全ての候補文字に対して算
出し、最も小さいものを認識結果として図１に示す出力
端子１１へ出力する。This distance Ds is calculated for all the candidate characters, and the smallest one is output to the output terminal 11 shown in FIG. 1 as the recognition result.

【００５２】このように、第１実施例における文字認識
装置では、文字種によって筆記入力する際の傾きによる
Ｑ値への影響が生じやすいものを判別し、生じやすいと
判別した場合には、傾きにより生じる不要な方向成分を
除外したＱ値の算出を行うことで、基準となるＱ値との
的確なマッチングを行うことができるようになる。すな
わち、文字が傾いて筆記入力された場合であっても、正
確に認識を行うことができるようになる。As described above, in the character recognition apparatus of the first embodiment, it is determined which of the character types is likely to affect the Q value due to the inclination when the handwriting is input. By calculating the Q value excluding the generated unnecessary directional component, it becomes possible to perform accurate matching with the reference Q value. That is, it is possible to accurately recognize even when a character is slanted and is input by handwriting.

【００５３】次に、本発明の第２実施例における文字認
識装置を説明する。図２は第２実施例における文字認識
装置を説明するブロック図である。第２実施例における
文字認識装置では、タブレット１、前処理部２、特徴抽
出部３、ストロークコード化部４、大分類部５、中分類
部６、部分パターンＱ値マッチング部８、部分パター
ンＱ値マッチング部９、部分パターンストロークコー
ドマッチング部１０、出力端子１１が先に説明した第１
実施例における文字認識装置と同じであるが、傾斜判断
部７’における内部構成が相違する。以下、装置する傾
斜判断部７’を中心として説明を行う。Next, a character recognition device according to the second embodiment of the present invention will be described. FIG. 2 is a block diagram illustrating a character recognition device in the second embodiment. In the character recognition device in the second embodiment, the tablet 1, the preprocessing unit 2, the feature extraction unit 3, the stroke coding unit 4, the large classification unit 5, the middle classification unit 6, the partial pattern Q value matching unit 8, the partial pattern Q. The value matching unit 9, the partial pattern stroke code matching unit 10, and the output terminal 11 are the first described above.
The character recognition device is the same as that of the embodiment, but the internal structure of the inclination determination unit 7'is different. Hereinafter, description will be made focusing on the tilt determination unit 7'that is installed.

【００５４】第２実施例における文字認識装置の傾斜判
断部７’は、画数設定部７４と、画数によるＱ値設定部
７５と、ＱＦＬＧ判定部７３とから構成されている。こ
の画数設定部７４では、前処理部２から出力される１文
字分のデータよりストローク数を算出し、そのストロー
ク数を変数Ｓtrsuに代入し、画数によるＱ値設定部７５
へ出力する。The inclination determination unit 7'of the character recognition apparatus in the second embodiment comprises a stroke number setting unit 74, a stroke number Q value setting unit 75, and a QFLG determination unit 73. In the stroke number setting unit 74, the stroke number is calculated from the data for one character output from the preprocessing unit 2, the stroke number is substituted into the variable Strsu, and the Q value depending on the stroke number setting unit 75 is calculated.
Output to.

【００５５】画数によるＱ値設定部７５では、この変数
Ｓtrsuの値に応じて、文字における傾きの具合を判断し
て、その傾きにより生じる成分を除去してＱ値の算出を
行うか否かを示すＱＦＬＧを設定する。例えば、Ｓtrsu
≧７（文字が７ストローク以上）の場合には、Ｑ値の−
ｘ方向成分および＋ｙ方向成分を削除するとしてＱＦＬ
Ｇ＝１を設定し、Ｓtrsu≦６（文字が６ストローク以
下）の場合は削除しないとしてＱＦＬＧ＝０を設定し、
ＱＦＬＧ判定部７３へ出力する。なお、ここで１≦Ｓtr
su≦２５とする。The number-of-strokes Q-value setting unit 75 determines the degree of inclination of the character in accordance with the value of the variable Strsu, and determines whether the Q-value is calculated by removing the component caused by the inclination. Set the indicated QFLG. For example, Strsu
When ≧ 7 (the character is 7 strokes or more), the Q value of −
QFL as deleting x-direction component and + y-direction component
If G = 1 is set and Strutu ≦ 6 (characters are 6 strokes or less), it is not deleted and QFLG = 0 is set.
Output to the QFLG determination unit 73. Here, 1 ≦ Str
Let su ≦ 25.

【００５６】ＱＦＬＧ判定部７３は、画数によるＱ値設
定部７５からＱＦＬＧを受けて、これに応じた出力先に
このＱＦＬＧの値を出力する。すなわち、ＱＦＬＧ＝０
の場合はこの値を部分パターンＱ値マッチング部８
へ、ＱＦＬＧ＝１の場合はこの値を部分パターンＱ値マ
ッチング部９へ出力する。これにより、先に説明した
部分パターンＱ値マッチング部での処理および部分パ
ターンＱ値マッチング部９での処理を行い、さらに部
分パターンストロークコードマッチング部１０での処理
を行って的確な認識結果を出力端子１１から出力する。The QFLG judging unit 73 receives the QFLG from the Q value setting unit 75 based on the number of strokes and outputs the value of this QFLG to the output destination corresponding thereto. That is, QFLG = 0
In the case of, the partial pattern Q value matching unit 8
When QFLG = 1, this value is output to the partial pattern Q value matching unit 9. As a result, the processing by the partial pattern Q value matching unit and the processing by the partial pattern Q value matching unit 9 described above are performed, and further the processing by the partial pattern stroke code matching unit 10 is performed to output an accurate recognition result. Output from the terminal 11.

【００５７】第２実施例における文字認識装置では、画
数が所定ストローク数より多い場合には、文字傾きによ
るＱ値の影響が大きくなるということに着目し、このよ
うな場合には傾きによって生じる不要な成分を除外して
Ｑ値を算出するようにしている。このストローク数とし
て、英字、数字、ひらがな、カナタカ、記号の最大画数
より大きな値に設定することで、文字傾きによりＱ値の
影響を受けやすい漢字が入力されたことを自動的に判別
し、不要な成分を除外したＱ値の算出を的確に行うこと
ができるようになる。これにより、文字が傾いて入力さ
れても正確に認識することが可能となる。In the character recognition apparatus according to the second embodiment, it is noted that when the number of strokes is larger than the predetermined stroke number, the influence of the Q value due to the inclination of the character becomes large. The Q value is calculated by excluding such components. By setting the number of strokes to a value larger than the maximum number of strokes for letters, numbers, hiragana, katakana, and symbols, it is automatically determined that a kanji character that is easily affected by the Q value due to the character inclination has been input, and is unnecessary. It becomes possible to accurately calculate the Q value excluding these components. As a result, it becomes possible to accurately recognize the characters even if the characters are inclined and input.

【００５８】次に、本発明の第３実施例における文字認
識装置を説明する。図３は第３実施例における文字認識
装置を説明するブロック図である。第３実施例における
文字認識装置では、タブレット１、前処理部２、特徴抽
出部３、ストロークコード化部４、大分類部５、中分類
部６、部分パターンＱ値マッチング部８、部分パター
ンＱ値マッチング部９、部分パターンストロークコー
ドマッチング部１０、出力端子１１が先に説明した第１
実施例および第２実施例における文字認識装置と同じで
あるが、傾斜判断部７’’における内部構成が相違す
る。以下、相違する傾斜判断部７’’を中心として説明
を行う。Next, a character recognition device according to the third embodiment of the present invention will be described. FIG. 3 is a block diagram illustrating a character recognition device in the third embodiment. In the character recognition device in the third embodiment, the tablet 1, the preprocessing unit 2, the feature extraction unit 3, the stroke coding unit 4, the large classification unit 5, the middle classification unit 6, the partial pattern Q value matching unit 8, the partial pattern Q. The value matching unit 9, the partial pattern stroke code matching unit 10, and the output terminal 11 are the first described above.
This is the same as the character recognition device in the embodiment and the second embodiment, but the internal configuration of the inclination determination unit 7 ″ is different. Hereinafter, the description will be centered on the different inclination determination unit 7 ″.

【００５９】第３実施例における文字認識装置の傾斜判
断部７’’は、ストロークコード設定部７６と、ストロ
ークコードによるＱ値設定部７７と、ＱＦＬＧ判定部７
３とから構成されている。このストロークコード設定部
７６では、ストロークコード化部４から出力される文字
数分のストロークコードを、予め分類分けした−ｘ方向
成分および＋ｙ方向成分を含むストロークと、含まない
ストロークとの情報を使用し、−ｘ方向成分および＋ｙ
方向成分を含むストロークの個数をＳY 、含まないスト
ロークの個数をＳN とし、これらをストロークコードに
よるＱ値設定部７７に出力する。The inclination judging section 7 '' of the character recognition apparatus in the third embodiment is composed of a stroke code setting section 76, a stroke code Q value setting section 77, and a QFLG judging section 7.
And 3. The stroke code setting unit 76 uses information about strokes including the −x direction component and the + y direction component, which are pre-classified stroke codes for the number of characters output from the stroke encoding unit 4, and strokes not including the stroke components. , -X direction component and + y
The number of strokes including the direction component is SY, the number of strokes not including it is SN, and these are output to the Q value setting unit 77 by the stroke code.

【００６０】ストロークコードによるＱ値設定部７７で
は、（１７）式に示す判定式によって、−ｘ方向成分お
よび＋ｙ方向成分が全体のストローク数に対してどの程
度含まれているかを示す割合によって所定のＱＦＬＧを
出力する。In the stroke value Q value setting unit 77, a predetermined value is determined according to a ratio indicating how much the -x direction component and the + y direction component are included in the total number of strokes according to the determination formula shown in the formula (17). QFLG of is output.

【００６１】[0061]

【数１７】 [Equation 17]

【００６２】すなわち、（１７）式におけるしきい値Ｉ
（ただし、０≦Ｉ≦１）を設定し、（ＳY ／（ＳY ＋Ｓ
N ））がしきい値Ｉ以下の場合には、−ｘ方向成分およ
び＋ｙ方向成分を除外したＱ値の算出を行うものとして
ＱＦＬＧ＝１を出力し、この場合以外にはＱＦＬＧ＝０
を出力する。That is, the threshold value I in equation (17)
(However, 0 ≦ I ≦ 1) is set, and (SY / (SY + S
N)) is less than or equal to the threshold value I, QFLG = 1 is output to calculate the Q value excluding the −x direction component and the + y direction component, and QFLG = 0 otherwise.
Is output.

【００６３】ＱＦＬＧ判定部７３は、ストロークコード
によるＱ値設定部７７からＱＦＬＧを受けて、これに応
じた出力先にこのＱＦＬＧの値を出力する。すなわち、
ＱＦＬＧ＝０の場合はこの値を部分パターンＱ値マッチ
ング部８へ、ＱＦＬＧ＝１の場合はこの値を部分パタ
ーンＱ値マッチング部９へ出力する。これにより、先
に説明した部分パターンＱ値マッチング部での処理お
よび部分パターンＱ値マッチング部９での処理を行
い、さらに部分パターンストロークコードマッチング部
１０での処理を行って的確な認識結果を出力端子１１か
ら出力する。The QFLG judging section 73 receives the QFLG from the Q value setting section 77 based on the stroke code, and outputs the value of this QFLG to the output destination corresponding to this. That is,
When QFLG = 0, this value is output to the partial pattern Q value matching unit 8, and when QFLG = 1, this value is output to the partial pattern Q value matching unit 9. As a result, the processing by the partial pattern Q value matching unit and the processing by the partial pattern Q value matching unit 9 described above are performed, and further the processing by the partial pattern stroke code matching unit 10 is performed to output an accurate recognition result. Output from the terminal 11.

【００６４】第３実施例における文字認識装置では、全
ストローク数に対して−ｘ方向成分および＋ｙ方向成分
がある値以上含まれている場合にはもともと文字に斜め
のセグメントが含まれているものと判断してそのままＱ
値算出を行い、全ストローク数に対して−ｘ方向成分お
よび＋ｙ方向成分がある値よりも少ない場合には本来斜
めのセグメントが少ない文字が傾いているものを判断し
−ｘ方向成分および＋ｙ方向成分を除外したＱ値の算出
を行う点に特徴がある。これによって、傾きによって生
じる不要な成分を除外したＱ値の算出を的確に行うこと
ができ、認識率の向上を図ることが可能となる。In the character recognition apparatus according to the third embodiment, if the -x direction component and the + y direction component are included in a certain number or more with respect to the total number of strokes, the character originally includes an oblique segment. It is judged as it is and Q
When the value is calculated and the -x direction component and the + y direction component are less than a certain value with respect to the total number of strokes, it is determined that the character originally having a small number of diagonal segments is inclined, and the -x direction component and the + y direction component The feature is that the Q value is calculated excluding the components. This makes it possible to accurately calculate the Q value excluding unnecessary components caused by the inclination and improve the recognition rate.

【００６５】なお、本実施例では、部分パターンＱ値マ
ッチング部９がＱ値算出において除外する成分として
−ｘ方向成分および＋ｙ方向成分を例として説明した
が、本発明はこれに限定されず、傾きによって多く生じ
る成分を適宜除外するようにすればよい。例えば、この
除外する成分を任意に設定できるようにしておき、入力
される文字の特徴（個人差等）に応じて変更するように
してもよい。In the present embodiment, the -x direction component and the + y direction component are described as examples of the components excluded by the partial pattern Q value matching unit 9 in the Q value calculation, but the present invention is not limited to this. It is only necessary to appropriately exclude components that are often generated by the inclination. For example, the components to be excluded may be set arbitrarily, and may be changed according to the characteristics (individual difference or the like) of the input character.

【００６６】[0066]

【発明の効果】以上説明したように本発明の文字認識装
置によれば次のような効果がある。すなわち、本発明の
文字認識装置では、文字の傾きの具合により文字が傾い
た際に生じる成分を除外した特徴量算出を行っているた
め、筆記入力される文字が傾いていても、基準特徴量と
の的確なマッチングを行うことが可能となる。これによ
って、文字傾きなどの文字変形に対しても高い認識率を
得ることが可能となる。As described above, the character recognition device of the present invention has the following effects. That is, in the character recognition device of the present invention, since the feature amount calculation excluding the component generated when the character is inclined due to the degree of inclination of the character is calculated, even if the character to be written and input is inclined, the reference feature amount It becomes possible to perform accurate matching with. As a result, it is possible to obtain a high recognition rate even for character deformation such as character inclination.

[Brief description of drawings]

【図１】本発明の第１実施例における文字認識装置を説
明するブロック図である。FIG. 1 is a block diagram illustrating a character recognition device according to a first embodiment of the present invention.

【図２】本発明の第２実施例における文字認識装置を説
明するブロック図である。FIG. 2 is a block diagram illustrating a character recognition device according to a second embodiment of the present invention.

【図３】本発明の第３実施例における文字認識装置を説
明するブロック図である。FIG. 3 is a block diagram illustrating a character recognition device according to a third embodiment of the present invention.

【図４】文字の傾きの状態を説明する図で、（ａ）は標
準、（ｂ）は右肩上がり、（ｃ）は右傾きを示すもので
ある。FIG. 4 is a diagram for explaining a state of inclination of a character, where (a) shows a standard, (b) shows an upward slope, and (c) shows a right inclination.

【図５】前処理・特徴点抽出を説明する図で、（ａ）は
生データ、（ｂ）は平滑後、（ｃ）は特徴抽出を示すも
のである。5A and 5B are diagrams illustrating preprocessing and feature point extraction, in which FIG. 5A shows raw data, FIG. 5B shows smoothed data, and FIG. 5C shows feature extraction.

【図６】ストロークコード化処理を説明する図である。FIG. 6 is a diagram illustrating stroke coding processing.

【図７】文字辞書の一構成例を示す図である。FIG. 7 is a diagram showing a configuration example of a character dictionary.

【図８】部分パターン間ベクトルを説明する図で、
（ａ）は部分パターン間ベクトル、（ｂ）は”逢”の例
を示すものである。FIG. 8 is a diagram for explaining vectors between partial patterns,
(A) shows a vector between partial patterns, and (b) shows an example of "a".

【図９】タブレットの一例を示す図である。FIG. 9 is a diagram showing an example of a tablet.

【図１０】部分パターン辞書の一構成例を示す図であ
る。FIG. 10 is a diagram showing a configuration example of a partial pattern dictionary.

【図１１】部分パターンＱ値の算出を説明する図であ
る。FIG. 11 is a diagram illustrating calculation of a partial pattern Q value.

【図１２】”逢”の部分パターンを（ａ）〜（ｃ）に示
す図である。FIG. 12 is a diagram showing partial patterns of “a” in (a) to (c).

[Explanation of symbols]

１タブレット２前処理部３特徴抽出部４ストロークコード化部５大分類部６中分類部７傾斜判断部８部分パターンＱ値マッチング部９部分パターンＱ値マッチング部１０部分パターンストロークコードマッチング部１１出力端子 1 tablet 2 pre-processing unit 3 feature extraction unit 4 stroke coding unit 5 large classification unit 6 medium classification unit 7 inclination determination unit 8 partial pattern Q value matching unit 9 partial pattern Q value matching unit 10 partial pattern stroke code matching unit 11 output Terminal

Claims

[Claims]

1. A feature quantity indicating a distribution in an xy coordinate system in a segment connecting a plurality of feature points in accordance with strokes of a character input by handwriting, and a preset reference feature. A character recognition device for recognizing a character input by handwriting by comparing with an amount, and an inclination determination unit that determines the degree of inclination at the time of input of a character input by handwriting based on a predetermined condition, A character amount calculating unit that calculates the characteristic amount by excluding the component generated by the inclination in the distribution in the xy coordinate system in the segment based on the determination by the inclination determining unit. apparatus.

2. The character recognition device according to claim 1, wherein the inclination determination unit determines the degree of inclination according to the type of the character input by handwriting.

3. The character recognition device according to claim 1, wherein the inclination determination unit determines the degree of inclination according to the number of strokes of the character input by handwriting.

4. The inclination determining unit determines the inclination depending on how much of the distribution in the xy coordinate system in the segment, which is caused by the inclination, is included in the total stroke number of the character to be written and input. The character recognition device according to claim 1, wherein a condition is judged.

5. The feature quantity calculation unit calculates the feature quantity by excluding the −x direction component and the + y direction component when the lower left corner of the character is the origin in the distribution in the xy coordinate system of the segment. The character recognition device according to claim 1, wherein the character recognition device is a character recognition device.