JPH1034089A

JPH1034089A - Video coding device

Info

Publication number: JPH1034089A
Application number: JP8200322A
Authority: JP
Inventors: Ryosuke Hiramatsu; 良介平松; Hitoshi Yoneda; 等米田; Yasuyuki Masai; 康之正井
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1996-07-30
Filing date: 1996-07-30
Publication date: 1998-02-10

Abstract

PROBLEM TO BE SOLVED: To provide a video coding device which enables an operator to efficiently and rapidly input user codes with a less fatiguing operation and to embody rapid processing. SOLUTION: This video coding device takes in the entire part image on postal items stated with the user codes consisting of postal code numbers and address display numbers and addressee address information and the information on the results of the recognition for the same from a reading section 2, displays the same on a display section 32, inputs the user codes on the postal items by a keyboard 31 in accordance with the displayed images and transmits the user codes to a bar code printing section. The operator inputs town names and bulk posting receiver names by voices from a voice input section 34 to the postal items, such as those not stated with the user codes, those stated with the incomplete user codes and those having a difficulty in reading of the user codes. These voices are then recognized in a voice recognition section 30 and the candidates for voice recognition having a high probability are displayed in the display section 32. The operator inputs the user codes by selecting and determining the user codes out of the displayed candidates for voice recognition.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、たとえば、郵便物
上にあらかじめ設定される利用者コード指定記載領域内
に記載されている、少くとも郵便番号と住所表示番号と
からなる利用者コードを読取り、この読取った利用者コ
ードに対応したバーコード情報を郵便物上に印刷し、そ
のバーコード情報に基づき郵便物を区分処理する郵便物
処理装置において、利用者コードを読取れなかった郵便
物について、表示されたその郵便物上の全体画像を目視
しながら、オペレータが利用者コードを入力し、それを
対応するバーコード情報に変換して郵便物上に印刷する
ことにより、そのバーコード情報によって郵便物を区分
処理するためのビデオコーディング装置に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention reads, for example, a user code consisting of at least a postal code and an address display number, which is described in a user code designation entry area set in advance on a postal matter. In a mail processing device that prints barcode information corresponding to the read user code on a mail and sorts mail based on the barcode information, the mail that cannot read the user code is read. The operator inputs the user code while viewing the displayed whole image on the mail, converts it into the corresponding bar code information, and prints it on the mail, so that the The present invention relates to a video coding device for sorting mail.

【０００２】[0002]

【従来の技術】最近、ビデオコーディング装置を併用す
る郵便物処理装置が開発されている。この種の郵便物処
理装置は、たとえば、郵便物上に記載された宛名住所を
示す利用者コード（たとえば、郵便番号と住所表示番号
とからなる）を光学的文字読取装置によって読取り、そ
れを対応するバーコード情報に変換して郵便物上に印刷
することにより、そのバーコード情報によって郵便物を
配達区域ごとに区分処理するものである。2. Description of the Related Art Recently, mail processing apparatuses using a video coding apparatus have been developed. This type of mail processing apparatus reads, for example, a user code (for example, consisting of a zip code and an address display number) indicating an address written on a mail by an optical character reading apparatus, and reads it. By converting the barcode information into barcode information and printing it on the mail, the mail is sorted according to the barcode information for each delivery area.

【０００３】利用者コードを読取ることのできなかった
郵便物については、その全体画像をビデオコーディング
装置の表示部に表示し、オペレータがその表示画面を目
視しながら利用者コードを入力し、それを対応するバー
コード情報に変換して郵便物上に印刷することにより、
そのバーコード情報によって郵便物を区分処理するもの
である。これにより、光学的文字読取装置による機械区
分の限界を補うことができる。[0003] For mail items for which the user code could not be read, the entire image is displayed on the display unit of the video coding apparatus, and the operator inputs the user code while visually observing the display screen. By converting it to the corresponding barcode information and printing it on mail,
The mail is sorted according to the barcode information. This makes it possible to compensate for the limitations of the machine classification by the optical character reader.

【０００４】ところが、従来のビデオコーディング装置
にあっては、利用者コードを読取れなくてリジェクトさ
れた全ての郵便物の利用者コードを、オペレータがその
イメージ情報（ビデオイメージ）を読取って、全てキー
入力でコード化していた。[0004] However, in the conventional video coding apparatus, the operator reads the user codes of all rejected mail items because the user codes cannot be read, and the image information (video image) is read by the operator. It was coded by key input.

【０００５】[0005]

【発明が解決しようとする課題】上述したように、従来
のビデオコーディング装置にあっては、全てのリジェク
ト郵便物の利用者コードを、オペレータがそのイメージ
情報を読取って、全てキー入力でコード化していた。こ
のため、入力コードの数に比例して打鍵数が増えるた
め、入力速度が上がらず、また、正確性が低下するとい
った問題が生じていた。As described above, in the conventional video coding apparatus, the user codes of all rejected mails are read by an operator by reading the image information, and are all coded by key input. I was For this reason, the number of keystrokes increases in proportion to the number of input codes, so that there has been a problem that the input speed does not increase and the accuracy decreases.

【０００６】また、実際の郵便物は、利用者コードが記
載されていないものや不完全なもの、さらに、判読が困
難なもの、間違った利用者コードが記載されているもの
がある。このような郵便物に対しては、入力速度の向上
が困難であり、また、連続入力の操作リズムが崩れるこ
とにより、システム全体の入力効率が低下していた。[0006] Further, actual postal items include those in which the user code is not described or incomplete, those which are difficult to read, and those in which the wrong user code is described. For such mails, it is difficult to improve the input speed, and the input rhythm of continuous input is disrupted, thereby reducing the input efficiency of the entire system.

【０００７】そこで、本発明は、オペレータが効率よく
短時間で疲労が少い操作で利用者コードを入力でき、正
確性や省力化が向上し、迅速な処理が実現できるビデオ
コーディング装置を提供することを目的とする。Therefore, the present invention provides a video coding apparatus which enables an operator to input a user code efficiently and in a short time with less fatigue, to improve accuracy and labor saving, and to realize quick processing. The purpose is to:

【０００８】[0008]

【課題を解決するための手段】本発明のビデオコーディ
ング装置は、少くとも郵便番号と住所表示番号とからな
る利用者コードおよび宛名住所情報が記載されている郵
便物上の全体画像、および、その郵便物の全体画像に対
する認識結果情報を取込んで表示し、その表示された画
像に基づき前記郵便物上の利用者コードを入力するもの
において、認識手段で認識することができなかった郵便
物の全体画像、および、その郵便物に対して前記認識手
段が認識した利用者コードの候補文字をそれぞれ入力す
る第１の入力手段と、この第１の入力手段で入力された
郵便物の全体画像、および、利用者コードの候補文字を
表示して該利用者コードを入力する利用者コード入力領
域をそれぞれ表示する第１の表示手段と、この第１の表
示手段で表示された郵便物の全体画像を目視しながら、
同時に表示される利用者コード入力領域に利用者コード
を入力したり、あるいは、音声認識対象種別情報を入力
する第２の入力手段と、前記第１の表示手段で表示され
た郵便物の全体画像に基づき音声認識対象名を音声で入
力する音声入力手段と、この音声入力手段で入力された
音声情報を認識する音声認識手段と、音声認識対象種別
ごとに認識対象語彙読み情報と、それに対応する認識対
象登録コードを記憶している第１の記憶手段と、前記第
２の入力手段で入力された音声認識対象種別情報、およ
び、前記音声認識手段の認識結果に基づき前記第１の記
憶手段内を検索することにより、少なくとも１つ以上の
認識対象登録コードの候補を抽出する検索手段と、認識
対象登録コードの候補に対応する認識対象表示情報を記
憶している第２の記憶手段と、前記検索手段から出力さ
れる認識対象登録コードの候補に基づき前記第２の記憶
手段内を検索することにより、認識対象登録コードの候
補に対応する少なくとも１つ以上の認識対象表示情報を
表示する第２の表示手段と、この第２の表示手段で表示
された認識対象表示情報から認識対象候補を選択決定す
る認識対象候補選択手段とを具備している。SUMMARY OF THE INVENTION A video coding apparatus according to the present invention comprises: an entire image on a postal matter in which a user code including at least a postal code and an address display number and address information are described; The system retrieves and displays the recognition result information for the entire image of the postal matter, and inputs the user code on the postal matter based on the displayed image. An entire image, first input means for inputting a candidate character of a user code recognized by the recognition means for the mail, and an entire image of the mail input by the first input means; And first display means for displaying candidate characters of the user code and displaying a user code input area for inputting the user code, respectively; While viewing the whole image of the mailpiece,
A second input means for inputting a user code in a user code input area which is displayed at the same time, or a voice recognition target type information, and an entire image of the mail displayed on the first display means Voice input means for inputting a voice recognition target name based on a voice, voice recognition means for recognizing voice information input by the voice input means, recognition target vocabulary reading information for each voice recognition target type, and corresponding A first storage unit that stores a recognition target registration code, a voice recognition target type information input by the second input unit, and the first storage unit based on a recognition result of the voice recognition unit. Searching means for extracting at least one candidate for the recognition target registration code, and a second storage means for storing recognition target display information corresponding to the candidate for the recognition target registration code. Storage means and at least one or more recognition target display information corresponding to a candidate for a recognition target registration code by searching the second storage means based on a candidate for a recognition target registration code output from the search means. And a recognition target candidate selecting means for selecting and determining a recognition target candidate from the recognition target display information displayed on the second display means.

【０００９】本発明によれば、たとえば、利用者コード
が記載されていない、利用者コードが不完全、利用者コ
ードの判読が困難、さらに、間違った利用者コードが記
載されているなどの郵便物に対して、町名、大口受取人
名などの音声認識対象名を音声で入力することにより、
音声認識され、確度の高い音声認識対象候補を表示す
る。オペレータは、表示された音声認識対象候補の中か
ら選択決定するだけで、正確に利用者コードを入力する
ことができる。したがって、オペレータが効率よく短時
間で疲労が少い操作で利用者コードを入力でき、正確性
や省力化が向上し、迅速な処理が実現できる。According to the present invention, for example, a mail in which a user code is not described, a user code is incomplete, a user code is difficult to read, and an incorrect user code is described, for example. By inputting the voice recognition target name such as the name of the town, the name of the major recipient, etc.
Speech recognition is performed, and a speech recognition target candidate with high accuracy is displayed. The operator can accurately input the user code only by selecting and determining from the displayed voice recognition target candidates. Therefore, the operator can input the user code efficiently in a short period of time with less fatigue, improving accuracy and labor saving, and realizing quick processing.

【００１０】また、音声認識の結果を基に検索するため
の認識対象語彙読み情報とそれに対応する認識対象登録
コードを記憶している第１の記憶手段は、同じ対象情報
に対して複数の認識対象語彙読み情報を持つことが可能
で、さらに、第２の表示手段は、同じ認識対象登録コー
ドの候補が複数存在しても、１つの認識対象表示情報と
して表示することにより、音声認識によるコード化の正
確性および迅速性が向上する。The first storage means for storing recognition target vocabulary reading information for performing a search based on the result of speech recognition and a recognition target registration code corresponding to the recognition target vocabulary reading information includes: It is possible to have the target vocabulary reading information, and the second display means displays the code as a single recognition target display information even if there are a plurality of candidates for the same recognition target registration code, so that the code by voice recognition is displayed. Accuracy and speed of conversion are improved.

【００１１】また、音声認識対象候補の表示に関して
も、認識確度が一番高い音声認識対象候補を、それ以外
の音声認識対象候補と区別して表示することにより、オ
ペレータは、迅速かつ正確に音声認識対象候補の選択が
可能になる。[0011] Also, regarding the display of the voice recognition target candidates, the operator can promptly and accurately recognize the voice recognition by displaying the voice recognition target candidate having the highest recognition accuracy separately from the other voice recognition target candidates. It becomes possible to select target candidates.

【００１２】また、認識確度が一番高い音声認識対象候
補に対応するコードは、最初から利用者コード入力領域
に表示しておき、音声認識対象候補の選択中は、暫定的
に選択されている音声認識対象候補に対応するコードを
逐次、利用者コード入力領域に表示することにより、効
率のよいシステムを構築できる。The code corresponding to the voice recognition target candidate having the highest recognition accuracy is displayed in the user code input area from the beginning, and is temporarily selected during the selection of the voice recognition target candidate. An efficient system can be constructed by sequentially displaying codes corresponding to voice recognition target candidates in a user code input area.

【００１３】さらに、当該音声認識対象の郵便物に対し
て、２回目以降の音声入力処理の有無情報を含んだ音声
認識対象種別情報を入力し、検索手段は、該音声入力処
理の有無情報が有りであった場合、以前の音声入力で音
声認識対象候補として抽出された認識対象語彙読み情報
を除外して第１の記憶手段内を検索することにより、認
識対象登録コードの候補とすることから、効率がよく、
正確性の高いシステムが実現できる。Further, the speech recognition target type information including the presence / absence information of the second and subsequent voice input processes is input to the postal matter to be voice-recognized. If yes, since the recognition target vocabulary reading information extracted as the speech recognition target candidate in the previous speech input is excluded and the first storage means is searched, the recognition target registration code candidate is obtained. , Efficient,
A highly accurate system can be realized.

【００１４】[0014]

【発明の実施の形態】以下、本発明の実施の形態につい
て図面を参照して説明する。図１、図２は、本実施の形
態に係る郵便物を示すものである。図１は、手書き郵便
物（はがき）Ｐの一例を示しており、従来の郵便番号や
宛名情報に代わる利用者コード（市区町村＋丁目番号ま
でを区分したもの）は、新郵便番号（市区町村までの区
分、赤色枠内に記載）と、住所表示番号（丁目番号、棟
室番号までの区分、宛名情報領内に記載）とから構成さ
れている。すなわち、郵便物Ｐの表面の切手貼付位置
（料額印面）に隣接して、たとえば、上位３桁と下位４
桁の合計７桁の数字からなる新郵便番号（市区町村まで
の区分、赤色枠内に記載）が記載される新郵便番号コー
ド指定記載領域６１があらかじめ設定されている。ま
た、６２は宛名情報領域であり、ここには住所表示番号
（丁目番号、棟室番号までの区分）を含む宛名情報が手
書きされている。Embodiments of the present invention will be described below with reference to the drawings. 1 and 2 show a postal matter according to the present embodiment. FIG. 1 shows an example of a handwritten postal matter (postcard) P, in which a user code (division of municipalities + chome numbers) in place of a conventional postal code or address information is a new postal code (city). It is composed of a section up to the ward, town and village, described in a red frame) and an address display number (a section up to a street number and a building room number, described in an address information area). That is, for example, the upper three digits and the lower four
A new zip code designation area 61 in which a new zip code consisting of a total of seven digits (division up to municipalities, described in a red frame) is set in advance. Reference numeral 62 denotes an address information area in which address information including an address display number (division up to a street number and a ridge room number) is handwritten.

【００１５】図２は、印刷活字郵便物（はがき）Ｐの一
例を示しており、その表面には、手書き郵便物と同様
に、新郵便番号コード指定記載領域６１があらかじめ設
定されている。しかし、この場合、新郵便番号は、住所
表示番号と同様に、新郵便番号コード指定記載領域６１
には記載されずに、図示のように宛名情報領域６２内に
宛名情報とともに印刷活字で記載されることが多い。FIG. 2 shows an example of a print type postal matter (postcard) P. On the surface of the postal matter P, a new postal code designation area 61 is set in advance similarly to a handwritten postal matter. However, in this case, the new zip code is stored in the new zip code designation area 61 similarly to the address display number.
Are often described in print type together with the address information in the address information area 62 as shown in the figure.

【００１６】図３は、本実施の形態に係るビデオコーデ
ィング装置が適用される郵便物処理装置の構成を概略的
に示すものである。すなわち、前記したように構成され
た郵便物Ｐは、多数枚一括して供給部１にセットされ
る。供給部１は、セットされた郵便物Ｐを一通ずつ読取
部２に供給する。FIG. 3 schematically shows a configuration of a mail processing apparatus to which the video coding apparatus according to the present embodiment is applied. That is, a large number of postal items P configured as described above are set in the supply unit 1 collectively. The supply unit 1 supplies the set mail items P to the reading unit 2 one by one.

【００１７】読取部２は、供給される郵便物Ｐ上の全体
画像を取込むことにより、その画像に基づき郵便物Ｐ上
に記載された利用者コードを読取る（認識する）。利用
者コードが読取れた場合、その利用者コードをバーコー
ド印刷部３へ出力する。このとき、読取部２は、読取り
の可否にかかわらず、各郵便物Ｐと対応したシリアル番
号（以後、ＩＤ番号と略称する）をバーコード印刷部３
へ出力する。The reading unit 2 reads (recognizes) the user code written on the mail P based on the image by taking in the whole image on the mail P supplied. When the user code is read, the user code is output to the barcode printing unit 3. At this time, regardless of whether or not reading is possible, the reading unit 2 prints a serial number (hereinafter abbreviated as an ID number) corresponding to each mail P on the barcode printing unit 3.
Output to

【００１８】読取部２において利用者コードが読取れた
郵便物Ｐについては、長い搬送路によって形成された、
郵便物Ｐの後段への搬送を遅延せしめる遅延パス４を介
してバーコード印刷部３へ搬送される。バーコード印刷
部３は、読取部２からの利用者コードを対応するバーコ
ード情報に変換して、ＩＤ番号によって対応付けられた
郵便物Ｐを確認しつつ、その郵便物Ｐ上に印刷する。The postal matter P whose user code has been read by the reading unit 2 is formed by a long transport path.
The postal matter P is conveyed to the barcode printing unit 3 via a delay path 4 which delays the conveyance of the postal matter P to the subsequent stage. The barcode printing unit 3 converts the user code from the reading unit 2 into corresponding barcode information, and prints on the mail P while confirming the mail P associated with the ID number.

【００１９】バーコード印刷部３でバーコード情報が印
刷された郵便物Ｐは、バーコード読取部５に送られる。
バーコード読取部５は、郵便物Ｐ上に印刷されたバーコ
ード情報を読取って解読し、その読取結果（利用者コー
ド）を郵便物Ｐとともに区分部６に送る。区分部６は、
バーコード読取部５からの読取結果に基づき郵便物Ｐを
配達区域ごとに区分するようになっている。The postal matter P on which the barcode information is printed by the barcode printing unit 3 is sent to the barcode reading unit 5.
The barcode reading unit 5 reads and decodes the barcode information printed on the mail P, and sends the reading result (user code) to the sorting unit 6 together with the mail P. Division 6
The postal matter P is classified for each delivery area based on the result of reading from the barcode reading unit 5.

【００２０】読取部２において利用者コードが読取れな
い郵便物Ｐについては、読取部２から対応する郵便物Ｐ
の全体画像、ＩＤ番号、属性情報（はがき／はがき以外
の分類、手書き郵便物／印刷活字郵便物の分類、文字行
方向判定結果など）、利用者コードの認識結果、およ
び、利用者コード記載領域の候補領域情報などがビデオ
コーディング装置７に送られるとともに、バーコード印
刷部３に対してリジェクト信号が送られる。For the mail P whose user code cannot be read by the reading section 2, the corresponding mail P
Image, ID number, attribute information (classification other than postcard / postcard, classification of handwritten mail / printed mail, character line direction judgment result, etc.), user code recognition result, and user code description area Is sent to the video coding device 7 and a reject signal is sent to the barcode printing unit 3.

【００２１】ビデオコーディング装置７では、詳細は後
述するが、読取部２からの郵便物Ｐの全体画像が表示部
に表示される。オペレータは、その表示画像を目視しな
がら、キーボードやマウスなどの入力操作によって利用
者コードを入力（あるいは、訂正入力）することによ
り、その利用者コードは正否をチェックされた後、バー
コード印刷部３に送られる。In the video coding device 7, the whole image of the mail P from the reading unit 2 is displayed on the display unit, as will be described in detail later. The operator inputs (or corrects) the user code by input operation with a keyboard, a mouse, or the like while visually checking the display image, and after the user code is checked for correctness, the bar code printing unit. Sent to 3.

【００２２】一方、利用者コードが読取れなかった郵便
物Ｐは、遅延パス４を介してバーコード印刷部３へ搬送
される。バーコード印刷部３は、このとき読取部２から
出力される利用者コードが読取れない郵便物Ｐに対する
リジェクト信号に基づき、ビデオコーディング装置７か
らの利用者コードをバーコード情報に変換し、ＩＤ番号
によって対応付けられた郵便物Ｐ上に印刷する。On the other hand, the postal matter P whose user code could not be read is conveyed to the bar code printing unit 3 via the delay path 4. The bar code printing unit 3 converts the user code from the video coding device 7 into bar code information based on a reject signal for the postal matter P from which the user code output from the reading unit 2 cannot be read at this time. It is printed on the mail P correlated by the number.

【００２３】これ以降は、前述した利用者コードが読取
れた郵便物Ｐの場合と同様であるので、説明は省略す
る。図４は、前記読取部２の構成を示すものである。す
なわち、光電変換部１１は、供給部１から供給される郵
便物Ｐ上に対して光源から光を照射し、その反射光をＣ
ＣＤ形ラインセンサなどによって電気信号に変換した
後、デジタル化することにより、郵便物Ｐの全体画像を
取込む。[0023] The subsequent steps are the same as those of the postal matter P from which the user code has been read, and a description thereof will be omitted. FIG. 4 shows the configuration of the reading unit 2. That is, the photoelectric conversion unit 11 irradiates light on the mail P supplied from the supply unit 1 from a light source, and converts the reflected light into C light.
After being converted into an electric signal by a CD type line sensor or the like, the whole image of the mail P is captured by digitizing.

【００２４】光電変換部１１で取込まれた郵便物Ｐの全
体画像は、画像メモリ１２に一時格納されるとともに、
領域検出部１３に送られる。領域検出部１３は、入力さ
れる郵便物Ｐの画像から、郵便物Ｐ上の新郵便番号コー
ド指定記載領域６１と宛名情報領域６２とを検出すると
ともに、後述する利用者コード認識部１８から再読取命
令信号が入力されると、再度、入力された郵便物Ｐの画
像から、新郵便番号コード指定記載領域６１以外の利用
者コード記載領域を検出する。The entire image of the postal matter P captured by the photoelectric conversion unit 11 is temporarily stored in the image memory 12, and
It is sent to the area detection unit 13. The area detection unit 13 detects a new postcode code designation area 61 and an address information area 62 on the postal matter P from the input image of the postal matter P, and re-reads the information from the user code recognition unit 18 described later. When the read command signal is input, the user code description area other than the new postal code code specification description area 61 is detected again from the input image of the postal matter P.

【００２５】領域検出部１３の出力は、文字行検出部１
４に送られる。文字行検出部１４は、検出された領域内
の画像から、たとえば、水平方向への射影パターンを求
めることにより文字行を検出する。The output of the area detection unit 13 is
4 The character line detection unit 14 detects a character line by obtaining, for example, a horizontal projection pattern from the image in the detected area.

【００２６】文字行検出部１４の出力は、文字検出部１
５に送られる。文字検出部１５は、検出された文字行か
ら、たとえば、垂直方向への射影パターンを求めること
により、文字を１文字ごとに検出する。The output of the character line detector 14 is
Sent to 5. The character detection unit 15 detects a character for each character by obtaining, for example, a projection pattern in the vertical direction from the detected character line.

【００２７】文字検出部１５の出力は、文字認識部１６
に送られる。文字認識部１６は、検出された文字につい
て、文字辞書１７を参照することにより認識処理を行な
う。文字認識部１６の認識結果は、認識手段としての利
用者コード認識部１８に送られる。利用者コード認識部
１８は、文字認識部１６の認識結果に基づき、利用者コ
ード辞書１９を用いて知識処理を行なうことにより、利
用者コードの認識処理を行なう。この認識処理におい
て、利用者コードを認識できた場合、その利用者コード
をバーコード印刷部３に対して出力する。The output of the character detecting unit 15 is
Sent to The character recognizing unit 16 performs a recognition process on the detected character by referring to the character dictionary 17. The recognition result of the character recognition unit 16 is sent to a user code recognition unit 18 as recognition means. The user code recognition unit 18 performs a user code recognition process by performing a knowledge process using the user code dictionary 19 based on the recognition result of the character recognition unit 16. If the user code can be recognized in this recognition processing, the user code is output to the barcode printing unit 3.

【００２８】上記認識処理において、利用者コードを認
識できなかった場合、再読取命令信号を領域検出部１３
に送ることにより、再度、利用者コードの認識処理を行
なう。この認識処理においても、利用者コードを認識で
きなかった場合、ここで最終的に認識不可とし、郵便物
Ｐと対応をとるためのＩＤ番号と、利用者コードが認識
されなかったことを示すリジェクト信号をビデオコーデ
ィング装置７に出力する。In the above recognition process, if the user code cannot be recognized, a re-read command signal is sent to the area detecting unit 13.
, The user code is recognized again. In this recognition process, if the user code cannot be recognized, it is finally determined to be unrecognizable, and an ID number for associating with the postal matter P and a reject indicating that the user code has not been recognized. The signal is output to the video coding device 7.

【００２９】また、利用者コード認識部１８は、利用者
コードを認識できなかった場合、このときの認識結果情
報として、利用者コードの候補文字情報、郵便物Ｐの属
性情報（はがき／はがき以外の分類、手書き郵便物／印
刷活字郵便物の分類、文字行方向判定結果など）、領域
検出部１３で検出された新郵便番号コード指定記載領域
６１の座標情報や、新郵便番号コード指定記載領域６１
以外の利用者コード記載領域の座標情報（つまり、利用
者コード記載候補領域情報）などをビデオコーディング
装置７に出力する。When the user code recognizing unit 18 cannot recognize the user code, the user code candidate character information and the postal matter P attribute information (other than postcard / postcard) are used as recognition result information at this time. Classification, classification of handwritten mail / printed print mail, character line direction determination result, etc.), coordinate information of the new zip code designation area 61 detected by the area detection unit 13, and new zip code designation area 61
It outputs to the video coding device 7 coordinate information of a user code description area other than the above (that is, user code description candidate area information) and the like.

【００３０】ここで、上記手書き郵便物／印刷活字郵便
物の分類は、たとえば、新郵便番号コード指定記載領域
６１内に新郵便番号が記載されている場合は手書き郵便
物と判定し、新郵便番号が記載されていない場合は印刷
活字郵便物と判定することによって分類される。Here, the classification of the handwritten mail / printed print mail is, for example, when the new zip code is described in the new zip code designation area 61, the mail is judged as a handwritten mail, and the new mail is determined. If the number is not described, it is classified by judging it as a print type mail.

【００３１】また、利用者コード認識部１８は、利用者
コード認識の可否にかかわらず、バーコード印刷部３に
対して郵便物Ｐに対応してＩＤ番号を出力する。さら
に、利用者コード認識部１８で利用者コードが認識でき
なかった場合、画像メモリ１２内の対応する郵便物Ｐの
全体画像がビデオコーディング装置７に対して出力され
る。The user code recognizing section 18 outputs an ID number corresponding to the mail P to the bar code printing section 3 regardless of whether the user code can be recognized. Further, if the user code cannot be recognized by the user code recognition unit 18, the entire image of the corresponding mail P in the image memory 12 is output to the video coding device 7.

【００３２】次に、ビデオコーディング装置７について
詳細に説明する。図５は、ビデオコーディング装置７の
構成を概略的に示すものである。すなわち、本装置全体
の制御を司る制御手段としてのＣＰＵ（セントラル・プ
ロセッシング・ユニット）２１には、ＣＰＵ２１の制御
プログラムなどの固定情報を記憶しているＲＯＭ（リー
ド・オンリ・メモリ）２２、ワーキングメモリなどとし
て用いられるＲＡＭ（ランダム・アクセス・メモリ）２
３、記憶手段としての第１の記憶部２４、記憶手段とし
ての第２の記憶部２５、入力手段としての入力インタフ
ェイス２６、キーボード制御部２７、表示制御部２８、
および、印字制御部２９、および、音声認識部３０がそ
れぞれ接続されている。Next, the video coding device 7 will be described in detail. FIG. 5 schematically shows the configuration of the video coding device 7. That is, a CPU (Central Processing Unit) 21 as control means for controlling the entire apparatus includes a ROM (Read Only Memory) 22 storing fixed information such as a control program of the CPU 21, a working memory. RAM (random access memory) 2 used as
3, a first storage unit 24 as a storage unit, a second storage unit 25 as a storage unit, an input interface 26 as an input unit, a keyboard control unit 27, a display control unit 28,
The print control unit 29 and the voice recognition unit 30 are connected to each other.

【００３３】入力インタフェイス２６には前記読取部２
が、キーボード制御部２７には入力手段としてのキーボ
ード３１が、表示制御部２８には表示手段としてのＣＲ
Ｔディスプレイなどの表示部３２が、印字制御部２９に
はプリンタ３３が、それぞれ接続されている。The input unit 26 includes the reading unit 2
However, the keyboard control unit 27 has a keyboard 31 as input means, and the display control unit 28 has a CR as display means.
A display unit 32 such as a T display and a printer 33 are connected to the print control unit 29, respectively.

【００３４】第１の記憶部２４は、画像データを一時記
憶したり、後述する図６に示す特定領域が確保されてお
り、たとえば、ＲＡＭやＥＥＰＲＯＭなどの半導体メモ
リが用いられる。The first storage section 24 temporarily stores image data and secures a specific area shown in FIG. 6, which will be described later. For example, a semiconductor memory such as a RAM or an EEPROM is used.

【００３５】第２の記憶部２５は、たとえば、大容量の
ハードディスク装置によって構成されており、キーボー
ド３１によって利用者コードを入力する際、その入力操
作を支援するための入力された郵便物Ｐの全体画像、読
取部２の認識結果、入力された利用者コード、入力され
た利用者コードをチェックするための住所データベース
などが記憶される。The second storage unit 25 is formed of, for example, a large-capacity hard disk device. When a user code is input by the keyboard 31, the second mail unit 25 stores the input mail P to support the input operation. An overall image, a recognition result of the reading unit 2, an input user code, an address database for checking the input user code, and the like are stored.

【００３６】入力インタフェイス２６は、たとえば、い
わゆるＳＣＳＩと称されるインタフェイスによって構成
されており、前記読取部２で利用者コードが判定できな
かった郵便物Ｐの全体画像、前記読取部２の認識結果と
して、属性情報（はがき／はがき以外の分類、手書き郵
便物／印刷活字郵便物の分類、文字行方向判定結果な
ど）、利用者コードの候補文字情報（新郵便番号、住所
表示番号）、新郵便番号コード指定記載領域６１の座標
情報、および、利用者コード記載候補領域情報などを入
力する。The input interface 26 is constituted by, for example, a so-called SCSI interface. The whole image of the postal matter P whose user code could not be determined by the reading unit 2, As recognition results, attribute information (classification other than postcard / postcard, classification of handwritten mail / printed mail, character line direction judgment result, etc.), candidate character information of user code (new zip code, address display number), The user inputs coordinate information of the new zip code designation area 61 and user code entry candidate area information.

【００３７】キーボード３１は、表示部３２によって表
示された郵便物Ｐの全体画像と利用者コード入力領域に
おいて、オペレータが読取部２の認識結果を参照しなが
ら訂正箇所だけを訂正入力する。もし、読取部２の認識
結果が間違っている場合は、全桁を訂正入力できるよう
になっている。In the keyboard 31, the operator corrects and inputs only the corrected portion in the whole image of the mail P and the user code input area displayed by the display section 32 while referring to the recognition result of the reading section 2. If the recognition result of the reading unit 2 is incorrect, all digits can be corrected and input.

【００３８】表示部３２は、入力インタフェイス２６に
よって入力された郵便物Ｐの全体画像、前記読取部２の
認識結果である利用者コード（新郵便番号＋住所表示番
号）の候補文字を表示して該利用者コードを入力する利
用者コード入力領域、新郵便番号コード指定記載領域６
１内の画像などをそれぞれ表示する。The display section 32 displays the entire image of the mail P inputted by the input interface 26 and the candidate characters of the user code (new post code + address display number) which are the recognition results of the reading section 2. User code input area for inputting the user code, new zip code code specification entry area 6
1 and the like are displayed.

【００３９】プリンタ３３は、本装置の処理内容やその
評価表などを印字出力する。音声入力部３４は、音声認
識対象名（町名、大口受取人名など）などを音声で入力
するものであり、たとえば、マイクロフォンなどが用い
られる。音声認識部３０は、音声入力部３４から入力さ
れた音声情報を認識するものである。The printer 33 prints out the processing contents of this apparatus and its evaluation table. The voice input unit 34 is for inputting voice recognition target names (town names, names of large recipients, etc.) by voice, and for example, a microphone is used. The voice recognition unit 30 recognizes voice information input from the voice input unit 34.

【００４０】図６は、第１の記憶部２４内に確保された
特定領域のメモリマップを示すものである。図６におい
て、入力モード情報領域４０は、本装置で入力するため
の入力モードの設定情報を書込むことが可能で、たとえ
ば、本装置のカスタマイズ機能になる入力者レベル設
定、画像レイアウト変更などがある。FIG. 6 shows a memory map of a specific area secured in the first storage unit 24. In FIG. 6, an input mode information area 40 is capable of writing input mode setting information for inputting with the present apparatus. For example, an input user level setting, an image layout change, and the like, which become customizing functions of the present apparatus, can be performed. is there.

【００４１】郵便物属性情報領域４１は、リジェクト画
像に対しての読取部２の認識結果である郵便物Ｐの属性
情報が書込まれる。この属性情報としては、たとえば、
はがき／はがき以外の分類、手書き郵便物／印刷活字郵
便物の分類、文字行方向判定結果などがある。全体画像
領域４２は、読取部２で取込まれたリジェクト画像が書
込まれる。In the mail attribute information area 41, the attribute information of the mail P, which is the recognition result of the reading unit 2 for the reject image, is written. As this attribute information, for example,
There are classifications other than postcards / postcards, classifications of handwritten mail / printed mail, and character line direction determination results. In the entire image area 42, a reject image captured by the reading unit 2 is written.

【００４２】利用者コード記載候補領域情報領域４３
は、リジェクト画像に対して、読取部２が新郵便番号コ
ード指定記載領域６１以外において、利用者コードが記
載されている領域だと最も高い認識率で認識した領域の
座標情報（つまり、利用者コード記載候補領域情報）が
書込まれる。User code description candidate area information area 43
Is the coordinate information of the area where the reading unit 2 recognizes the area where the user code is described with the highest recognition rate except for the new zip code code specification area 61 (that is, the user Code description candidate area information) is written.

【００４３】新郵便番号コード指定記載領域情報領域４
４は、郵便物Ｐの新郵便番号コード指定記載領域６１の
座標情報が書込まれる。新郵便番号コード結果領域４５
は、読取部２が郵便物Ｐの新郵便番号コード指定記載領
域６１内の画像から新郵便番号を認識した結果が書込ま
れる。住所表示番号コード結果領域４６は、読取部２が
郵便物Ｐの画像から住所表示番号を認識した結果が書込
まれる。New zip code designation area 4
Reference numeral 4 denotes the coordinate information of the new postal code designation area 61 of the postal matter P. New zip code result area 45
Is written as a result of the reading unit 2 recognizing the new zip code from the image in the new zip code designation area 61 of the postal matter P. In the address display number code result area 46, the result of the reading unit 2 recognizing the address display number from the image of the mail P is written.

【００４４】イメージワーク領域４７は、オペレータが
表示部３２の全体画像を目視して、郵便物Ｐによって回
転、縮小・拡大、スクロールなどの画像操作を行なう
際、画像処理を迅速に行なうために用いられる画像処理
用の領域である。The image work area 47 is used to promptly perform image processing when the operator looks at the entire image on the display unit 32 and performs image operations such as rotation, reduction / enlargement, and scrolling with the mail P. Area for image processing to be performed.

【００４５】利用者コード入力結果領域４８は、オペレ
ータが表示部３２の全体画像表示領域、もしくは、新郵
便番号コード指定記載領域の画像表示領域を目視して利
用者コードを認識し、その利用者コードをキーボード３
１で打鍵入力した結果が書込まれる。In the user code input result area 48, the operator visually recognizes the user code by viewing the entire image display area of the display unit 32 or the image display area of the new zip code code designation entry area, and recognizes the user code. Code 3 on keyboard
The result of the key input in step 1 is written.

【００４６】検索結果領域４９は、オペレータが打鍵入
力した利用者コードを第２の記憶部２５内の住所データ
ベースと照合し、その照合結果を書込む領域である。音
声認識対象語彙読み情報領域５０は、たとえば、図７に
示すような町名用ファイル５０ａと、図８に示すような
大口受取人名用ファイル５０ｂとから構成されている。
これらファイル５０ａ，５０ｂ内は、認識対象語彙読み
情報（たとえば、「あけかい」）と、それに対応する認
識対象登録コードの候補（たとえば、１２１０ＸＸ０
３）とから構成されている。さらに、これらは対象とす
る郵便局に関して、各郵便局単位の情報と、それらを一
括した全局情報とから構成されている。The search result area 49 is an area in which the user code input by the operator by key input is checked against the address database in the second storage unit 25, and the check result is written. The speech recognition target vocabulary reading information area 50 includes, for example, a town name file 50a as shown in FIG. 7 and a large recipient name file 50b as shown in FIG.
In these files 50a and 50b, recognition target vocabulary reading information (for example, "Akaikai") and corresponding recognition target registration code candidates (for example, 1210XX0) are stored.
3). Further, these are constituted by information of each post office concerning the target post office and information of all the offices collectively including the information.

【００４７】これら郵便局情報の登録方法としては、同
一の認識対象登録コードの候補に対して、複数の認識対
象語彙読み情報の登録が可能である。また、同一対象に
対して、略称名や簡略名のような別語彙読み情報を登録
可能である。たとえば、「浦田市役所」に対して、語彙
読み情報として、「うらたしやくしょ」、「しやくし
ょ」、「やくしょ」を複数登録可能である。As a method of registering the post office information, a plurality of recognition target vocabulary reading information can be registered for the same recognition target registration code candidate. Further, different vocabulary reading information such as an abbreviated name or abbreviated name can be registered for the same object. For example, a plurality of "Uratashiyakusho", "Shiyakusho", and "Yakusho" can be registered as vocabulary reading information for "Urata City Office".

【００４８】本例では、町名の場合、図７に示すよう
に、郵便局単位の情報（１２１０ＸＸ，１２４０ＸＸな
ど）と全局情報（１ＸＸＸＸＸなど）とから構成され、
さらに別種別の情報も含まれている。In this example, in the case of a street name, as shown in FIG. 7, it is composed of post office unit information (1210XX, 1240XX, etc.) and all station information (1XXXXXX, etc.).
Further, information of another type is also included.

【００４９】コード入力の際、郵便局が特定されている
場合は、町名用ファイル５０ａ内の郵便局単位の情報か
ら検索し、郵便局が特定されていない場合は、町名用フ
ァイル５０ａ内の全局情報から検索する。なお、大口受
取人名に関しても同じ構成である。At the time of inputting the code, if the post office is specified, it is searched from the information of the post office unit in the town name file 50a. If the post office is not specified, all the stations in the town name file 50a are searched. Search from information. It should be noted that the same configuration applies to the name of a large recipient.

【００５０】音声認識対象表示情報領域５１は、たとえ
ば、図９に示すように、音声認識対象種別情報（町名対
象、大口受取人名対象など）に対する郵便局ごとに、認
識対象登録コードの候補に対応する認識対象表示情報が
格納されたファイルによって構成されており、そのファ
イル名に意味を持たせている。以下、その意味付け例に
ついて説明する。As shown in FIG. 9, for example, as shown in FIG. 9, the speech recognition target display information area 51 corresponds to a recognition target registration code candidate for each post office corresponding to speech recognition target type information (for a town name, for a large recipient name, etc.). The file name is made up of a file in which the recognition target display information to be recognized is stored, and the file name has a meaning. Hereinafter, an example of the meaning will be described.

【００５１】すなわち、ファイル名は、たとえば、「上
位１桁」と「３桁／５桁のコード」（３桁の場合はＸＸ
を追加）からなり、上位１桁が「１」のときは町名用で
あることを示し、上位１桁が「２」のときは大口受取人
名用であることを示している。なお、図９の例は、ファ
イル名が町名用（１２１０ＸＸ）で、局番が「２１０」
の場合を示している。That is, the file name is, for example, “high-order 1 digit” and “3-digit / 5-digit code” (for three digits, XX
Is added), and when the upper one digit is "1", it indicates that it is for a street name, and when the upper one digit is "2", it indicates that it is for a large recipient name. In the example of FIG. 9, the file name is for the town name (1210XX) and the station number is "210".
Is shown.

【００５２】図１０は、音声認識部３０の構成例を示す
ものである。すなわち、音声入力部３４から入力された
音声情報は、音響分析部８１で特徴パラメータに変換さ
れる。音声認識に用いられる代表的な特徴パラメータと
しては、バンドパスフィルタやフーリエ変換によって求
めることができるパワースペクトルやＬＰＣ（線形予
測）分析によって求めたケプストラム係数などがよく用
いられる。音響分析部８１は、特徴パラメータ系列をモ
デル照合部８２に入力する。FIG. 10 shows an example of the configuration of the speech recognition section 30. That is, the sound information input from the sound input unit 34 is converted into the characteristic parameter by the sound analysis unit 81. As typical characteristic parameters used for speech recognition, a power spectrum obtained by a band-pass filter or Fourier transform, a cepstrum coefficient obtained by LPC (linear prediction) analysis, and the like are often used. The acoustic analysis unit 81 inputs the feature parameter sequence to the model matching unit 82.

【００５３】モデル照合部８２は、あらかじめ作成して
音声モデル記憶部８３に記憶しておいた認識対象音声の
各音声モデル（辞書情報）と特徴パラメータ系列との類
似度あるいは距離を求める演算を行なう。モデル照合部
８２での照合方法としては、音声モデルも特徴パラメー
タ系列で表現しておき、ＤＰ（動的計画）法で音声モデ
ルの特徴パラメータ系列と入力音声の特徴パラメータ系
列との距離を求める手法や、ＨＭＭ（隠れマルコフモデ
ル）を用いて音声モデルを表現しておき、入力音声の特
徴パラメータ系列が入力されたときの各音声モデルの確
率を計算する手法などが広く用いられている。The model collating unit 82 calculates the similarity or distance between each speech model (dictionary information) of the speech to be recognized and the feature parameter sequence, which are created in advance and stored in the speech model storage unit 83. . As a matching method in the model matching unit 82, a speech model is also represented by a feature parameter sequence, and a distance between the feature parameter sequence of the speech model and the feature parameter sequence of the input speech is determined by a DP (dynamic programming) method. Also, a method of expressing a speech model using an HMM (Hidden Markov Model) and calculating a probability of each speech model when a feature parameter sequence of an input speech is input is widely used.

【００５４】音声モデル記憶部８３に記憶されている辞
書情報としての音声モデルは、性別単位や個人単位で持
つことが可能で、オペレータの識別情報から対象となる
音声モデルを選択して用いるようになっている。この例
では、たとえば、男性用の音声モデルと女性用の音声モ
デルとが設けられていて、入力された音声情報で性別を
判別するか、音声入力の際に性別情報を入力することに
より、対象となる音声モデルを選択するようになってい
る。The voice model as the dictionary information stored in the voice model storage unit 83 can be stored in gender units or individual units, and a target voice model is selected from the operator identification information and used. Has become. In this example, for example, a voice model for men and a voice model for women are provided, and the gender is determined based on the input voice information, or the gender information is input at the time of voice input. Is selected.

【００５５】認識結果出力部８４は、モデル照合部８２
で求めた各音声モデルに対する類似度（あるいは、距
離）をある条件の基でソーティングして、類似度が最大
（あるいは、距離が最小）となる認識対象のカテゴリを
認識結果として出力する。The recognition result output unit 84 includes a model collation unit 82
The similarity (or distance) with respect to each voice model obtained in (1) is sorted under a certain condition, and the recognition target category having the maximum similarity (or minimum distance) is output as a recognition result.

【００５６】図１１は、表示部３２に表示される操作画
面例を示しており、郵便物Ｐの全体画像を表示する全体
画像表示領域７１、および、新郵便番号コード入力領域
７２と住所表示番号コード入力領域７３とからなり、利
用者コードを入力する利用者コード入力領域７４によっ
て構成される。FIG. 11 shows an example of an operation screen displayed on the display unit 32. An entire image display area 71 for displaying the entire image of the mail P, a new zip code input area 72 and an address display number are shown. It comprises a code input area 73 and is constituted by a user code input area 74 for inputting a user code.

【００５７】利用者コード入力領域７４は、新郵便番号
コード入力領域７２と住所表示番号コード入力領域７３
との２つに分割され、それぞれ異なる色表示で区分する
ことにより、オペレータが迅速に認識し易くしている。The user code input area 74 includes a new zip code input area 72 and an address display number code input area 73.
Are divided into two different colors, so that the operator can easily recognize them quickly.

【００５８】新郵便番号コード入力領域７２は、新郵便
番号に対しての読取部２の認識結果の不読桁（読取部２
が認識できなかった桁）パターンからオペレータに告知
する候補文字の表示を行なう。The new zip code input area 72 includes an unread digit (reading unit 2) of the recognition result of the reading unit 2 for the new zip code.
The candidate characters to be notified to the operator are displayed based on the pattern in which the characters cannot be recognized).

【００５９】住所表示番号コード入力領域７３は、住所
表示番号に対しての読取部２の認識結果である候補文字
が表示される。次に、上記のような構成において、リジ
ェクト郵便物が生じた場合の動作を図１２〜図１４に示
すフローチャートを参照して説明する。In the address display number code input area 73, candidate characters as a result of recognition of the address display number by the reading unit 2 are displayed. Next, an operation in the case where a reject mail has occurred in the above-described configuration will be described with reference to flowcharts shown in FIGS.

【００６０】一例として、新郵便番号コード指定記載領
域６１に新郵便番号の下位４桁が記載されていない印刷
活字郵便物の場合について説明する。たとえば、図１５
に示すように、本例で取扱うリジェクト郵便物Ｐは、記
載様式が横置横書である封書とする。対象となる郵便物
Ｐの利用者コードが、たとえば、「２１０−０００１
（新郵便番号）、１−２−３−Ｂ−２０１（住所表示番
号）」であるのに対し、記載情報において新郵便番号の
下位４桁「０００１」が記載されておらず、読取部２の
認識結果がなにも認識できなかった場合について説明す
る。As an example, a case of a printed type mail item in which the lower four digits of the new zip code are not described in the new zip code code designation description area 61 will be described. For example, FIG.
As shown in (1), the reject mail P handled in this example is a sealed letter whose description format is horizontal writing. The user code of the target mail P is, for example, "210-0001
(New postal code) and 1-2-3-B-201 (address display number), the lower four digits “0001” of the new postal code are not described in the description information, and the reading unit 2 A case in which no recognition result is recognized will be described.

【００６１】まず、ステップＳ１において、読取部２で
リジェクトされた郵便物Ｐ、すなわち、読取部２から入
力インタフェイス２６を介して送られる利用者コードが
認識できなかった郵便物Ｐの全体画像、読取部２の認識
結果である郵便物Ｐの各種属性情報、利用者コードの候
補文字情報（新郵便番号、住所表示番号）、利用者コー
ド記載候補領域情報、および、新郵便番号コード指定記
載領域６１の座標情報を、第１の記憶部２４の郵便物属
性情報領域４１、全体画像領域４２、利用者コード記載
候補領域情報領域４３、新郵便番号コード指定記載領域
情報領域４４、新郵便番号コード結果領域４５、住所表
示番号コード結果領域４６にそれぞれ書込む。First, in step S1, the entire image of the mail P rejected by the reading unit 2, that is, the mail P whose user code sent from the reading unit 2 via the input interface 26 could not be recognized, Various attribute information of the postal matter P as a recognition result of the reading unit 2, candidate character information of a user code (new zip code, address display number), candidate code description candidate area information, and new zip code designation area The coordinate information 61 is stored in the mail attribute information area 41, the entire image area 42, the user code description candidate area information area 43, the new zip code code designation area information area 44, and the new zip code in the first storage unit 24. The results are written in the result area 45 and the address display number code result area 46, respectively.

【００６２】次に、ステップＳ２において、第１の記憶
部２４内の全体画像領域４２から郵便物Ｐの全体画像を
読出すことにより、たとえば、図１５に示すように、読
出した郵便物Ｐの全体画像を表示部３２の全体画像表示
領域７１に表示する。この例の場合、読取部２の認識結
果がなにも認識できなかったので、利用者コード入力領
域７４（新郵便番号コード入力領域７２と住所表示番号
コード入力領域７３）にはなにも表示されない。Next, in step S2, by reading the entire image of the mail P from the entire image area 42 in the first storage unit 24, for example, as shown in FIG. The whole image is displayed in the whole image display area 71 of the display unit 32. In the case of this example, since no recognition result of the reading unit 2 was recognized, nothing is displayed in the user code input area 74 (the new postal code code input area 72 and the address display number code input area 73). Not done.

【００６３】次に、ステップＳ３において、オペレータ
は、表示部３２に表示された郵便物Ｐの全体画像を目視
しながら、キーボード３１により、利用者コードが全て
記載されている通常時は、表示部３２の利用者コード入
力領域７４において利用者コードを１桁ずつ順次入力す
るが、図１５の例の場合、新郵便番号が上位３桁の「２
１０」しか記載されていないので、まず「２１０」を入
力した後、音声認識対象種別情報（町名）を入力する町
名キーを押下する。なお、音声認識対象種別情報として
大口受取人名を入力する場合は、大口受取人名キーを押
下する。Next, in step S3, the operator uses the keyboard 31 to display the entire image of the postal matter P displayed on the display unit 32, and at the normal time, the display unit 32 displays all the user codes. The user code is sequentially input one digit at a time in the 32 user code input areas 74. In the example of FIG.
Since only "10" is described, "210" is first input, and then a town name key for inputting speech recognition target type information (town name) is pressed. When a large recipient name is input as the speech recognition target type information, a large recipient name key is pressed.

【００６４】次に、ステップＳ４において、オペレータ
が入力した結果が利用者コードであるか音声認識対象種
別情報であるかを判別する。すなわち、オペレータが利
用者コードを１桁入力するごとに、それが利用者コード
であるか、音声認識対象種別情報（町名、大口受取人
名）であるかを判別する。この判別の結果、利用者コー
ドであれば、ステップＳ５に進み、入力された利用者コ
ードを確定する。Next, in step S4, it is determined whether or not the result input by the operator is a user code or speech recognition target type information. That is, each time the operator inputs the user code by one digit, it is determined whether the user code is the user code or the speech recognition target type information (the name of the town, the name of the large recipient). If the result of this determination is that it is a user code, the flow proceeds to step S5, where the input user code is determined.

【００６５】上記判別の結果、音声認識対象種別情報で
あれば、ステップＳ６に進み、その入力された音声認識
対象種別情報を第１のバッファ（図示しない）に格納す
る。このとき、現行郵便番号が特定されていれば、つま
り、新郵便番号の上位３桁が入力されていれば、その郵
便番号も上記第１のバッファに格納する。If the result of the determination is that the type information is a voice recognition target, the process proceeds to step S6, and the input voice recognition target type information is stored in a first buffer (not shown). At this time, if the current zip code is specified, that is, if the upper three digits of the new zip code are input, the zip code is also stored in the first buffer.

【００６６】次に、ステップＳ７において、上記バッフ
ァ内の音声認識対象種別情報が町名であるか、大口受取
人名であるかを識別し、町名であればステップＳ８に進
み、大口受取人名であればステップＳ９に進む。Next, in step S7, it is determined whether the type information of the voice recognition target in the buffer is a town name or a large recipient name. If the name is a town name, the process proceeds to step S8. Proceed to step S9.

【００６７】また、オペレータは、音声認識対象種別情
報を入力した後、続けて、音声入力部３４から音声認識
対象名を音声により入力する（ステップＳ１４）。図１
５の例の場合、町名である「まいはま」を発声する。After inputting the voice recognition target type information, the operator subsequently inputs a voice recognition target name by voice from the voice input unit 34 (step S14). FIG.
In the case of example 5, the name of the town, "maihama" is uttered.

【００６８】こうして入力された音声情報は、音声認識
部３０に入力されて、認識処理される（ステップＳ１
５）。この場合、前述したように、オペレータが男性で
あれば、男性用の音声モデルを用いて音声認識され、女
性であれば、女性用の音声モデルを用いて音声認識され
る。The voice information thus input is input to the voice recognition unit 30 and subjected to recognition processing (step S1).
5). In this case, as described above, if the operator is male, speech recognition is performed using a male voice model, and if the operator is female, voice recognition is performed using a female voice model.

【００６９】さて、ステップＳ８では、前記第１のバッ
ファ内に現行郵便番号が存在するか否かにより、現行郵
便番号が特定されているか否かを判断し、特定されてい
れば、ステップＳ１０に進む。ステップＳ１０では、音
声認識部３０の認識結果を基に、第１の記憶部２４に設
けられた音声認識対象語彙読み情報領域５０の町名用フ
ァイル５０ａ（図７参照）において、「１＋現行郵便番
号」の情報領域から検索し、ステップＳ１６に進む。In step S8, whether or not the current postal code is specified is determined based on whether or not the current postal code exists in the first buffer. If the current postal code is specified, the process proceeds to step S10. move on. In step S10, based on the recognition result of the voice recognition unit 30, in the file 50a for town name (see FIG. 7) of the vocabulary reading information area 50 for voice recognition provided in the first storage unit 24, "1 + current postal code" , And proceeds to step S16.

【００７０】ステップＳ８で現行郵便番号が特定されて
いなければ、ステップＳ１１に進む。ステップＳ１１で
は、音声認識部３０の認識結果を基に、音声認識対象語
彙読み情報領域５０の町名用ファイル５０ａにおいて、
「１ＸＸＸＸＸ」の情報領域から検索し、ステップＳ１
６に進む。If the current postal code is not specified in step S8, the process proceeds to step S11. In step S11, based on the recognition result of the voice recognition unit 30, in the file 50a for the town name in the vocabulary reading information area 50 for voice recognition,
Searching from the information area of “1XXXXXX”, step S1
Proceed to 6.

【００７１】また、ステップＳ９では、ステップＳ８と
同様に、現行郵便番号が特定されているか否かを判断
し、特定されていれば、ステップＳ１２に進む。ステッ
プＳ１２では、音声認識部３０の認識結果を基に、音声
認識対象語彙読み情報領域５０の大口受取人名用ファイ
ル５０ｂ（図８参照）において、「２＋現行郵便番号」
の情報領域から検索し、ステップＳ１６に進む。In step S9, similarly to step S8, it is determined whether or not the current postal code is specified, and if specified, the process proceeds to step S12. In step S12, based on the recognition result of the voice recognition unit 30, "2 + current postal code" is stored in the large recipient name file 50b (see FIG. 8) of the vocabulary reading information area 50 for voice recognition.
, And proceeds to step S16.

【００７２】ステップＳ９で現行郵便番号が特定されて
いなければ、ステップＳ１３に進む。ステップＳ１３で
は、音声認識部３０の認識結果を基に、音声認識対象語
彙読み情報領域５０の大口受取人名用ファイル５０ｂに
おいて、「２ＸＸＸＸＸ」の情報領域から検索し、ステ
ップＳ１６に進む。If the current postal code is not specified in step S9, the process proceeds to step S13. In step S13, based on the recognition result of the voice recognition unit 30, a search is performed from the information area "2XXXXXX" in the large recipient name file 50b of the vocabulary reading information area 50 for voice recognition, and the process proceeds to step S16.

【００７３】ステップＳ１６では、ステップＳ１０，Ｓ
１１，Ｓ１２またはＳ１３の検索処理に基づき、音声認
識対象語彙読み情報領域５０から、第１〜第Ｎの認識対
象登録コードの候補を抽出して、第２のバッファ（図示
しない）に格納し、ステップＳ１７に進む。At step S16, steps S10, S
Based on the search processing in step S11, S12 or S13, the first to Nth recognition target registration code candidates are extracted from the speech recognition target vocabulary reading information area 50 and stored in a second buffer (not shown). Proceed to step S17.

【００７４】ステップＳ１７では、上記第２のバッファ
内の認識対象登録コードの候補に基づき、第１の記憶部
２４に設けられた音声認識対象表示情報領域５１（図９
参照）を検索することにより、対応する音声認識対象表
示情報を抽出し、図１６に示すように、表示部３２にお
ける利用者コード入力領域７４の上部に表示する。この
ときの表示方法としては、認識確度が一番高い第１候補
をそれ以外の候補と区別して表示する。区別して表示す
る方法としては、たとえば、第１候補を枠で囲んだり、
ブリンキングしたり、表示色を変えたり、表示サイズを
変えたり、表示位置を変えるなどが考えられる。In step S17, based on the candidate for the recognition target registration code in the second buffer, the voice recognition target display information area 51 (FIG. 9) provided in the first storage unit 24.
16), the corresponding display information for voice recognition is extracted and displayed above the user code input area 74 on the display unit 32, as shown in FIG. As a display method at this time, the first candidate having the highest recognition accuracy is displayed separately from the other candidates. As a method of displaying the first candidate separately, for example, the first candidate is surrounded by a frame,
It is possible to blink, change the display color, change the display size, change the display position, and the like.

【００７５】オペレータは、図１６の画面を目視して、
音声認識対象候補の選択操作、最初の検索では目的の候
補が得られなかったため再度検索を行なうための音声認
識対象種別情報の入力操作、あるいは、入力モードの切
換え操作などを行なう。The operator looks at the screen shown in FIG.
An operation of selecting a voice recognition target candidate, an input operation of voice recognition target type information for performing a search again since the target candidate was not obtained in the first search, or an input mode switching operation is performed.

【００７６】次に、ステップＳ１８において、候補にな
っている利用者コードをリアルタイムに利用者コード入
力領域７４に表示する。すなわち、第１音声認識対象候
補に対応するコードは、最初から利用者コード入力領域
７４に表示しておき、音声認識対象候補の選択中は、暫
定的に選択されている音声認識対象候補に対応するコー
ドを逐次、利用者コード入力領域７４に表示するもので
ある。Next, in step S18, the candidate user codes are displayed in the user code input area 74 in real time. That is, the code corresponding to the first voice recognition target candidate is displayed in the user code input area 74 from the beginning, and during the selection of the voice recognition target candidate, the code corresponding to the provisionally selected voice recognition target candidate is displayed. Are sequentially displayed in the user code input area 74.

【００７７】次に、ステップＳ１９において、図１６の
画面を表示した後のオペレータの入力結果を監視し、こ
こで音声認識対象候補の選択操作による選択コードが入
力された場合、ステップＳ２０に進む。ステップＳ２０
では、ステップＳ１９で選択された音声認識対象候補に
対応する利用者コードを自動入力し、ステップＳ５で入
力された利用者コードを確定する。Next, in step S19, the input result of the operator after displaying the screen of FIG. 16 is monitored, and if a selection code has been input by the operation of selecting a candidate for speech recognition, the process proceeds to step S20. Step S20
Then, the user code corresponding to the voice recognition target candidate selected in step S19 is automatically input, and the user code input in step S5 is determined.

【００７８】また、ステップＳ１９において、再び音声
認識対象種別情報が入力された場合、ステップＳ６に戻
り、前述した動作を繰り返す。また、ステップＳ１９に
おいて、入力モードの切換えコードが入力された場合、
ステップＳ２１に進む。すなわち、キーボード３１に
は、利用者コードの入力をキー入力モードで行なうか、
音声入力モードで行なうかを切換えるモード切換スイッ
チ（図示しない）が設けられており、音声入力モードを
中止してキー入力モードに切換える場合は、上記モード
切換スイッチをキー入力モードに切換える。これによ
り、入力モード切換えコードが入力される。If the speech recognition target type information is input again in step S19, the process returns to step S6, and the above-described operation is repeated. If the input mode switching code is input in step S19,
Proceed to step S21. That is, whether the input of the user code is performed in the key input mode on the keyboard 31 or not.
A mode changeover switch (not shown) for switching between the voice input mode and the input mode is provided. When the voice input mode is stopped and the mode is switched to the key input mode, the mode changeover switch is switched to the key input mode. Thus, the input mode switching code is input.

【００７９】ステップＳ２１では、前記第１のバッファ
の内容（音声認識対象種別情報、現行郵便番号など）を
参照し、ステップＳ２２に進む。ステップＳ２２では、
ステップＳ２１で参照した第１のバッファ内の情報に対
応したキー入力用のメニュー画面を表示部３２に表示
し、ステップＳ２３に進む。なお、上記キー入力用のメ
ニュー画面は、たとえば、図１５とほぼ同様な構成とな
っている。In step S21, the contents of the first buffer (speech recognition target type information, current postal code, etc.) are referred to, and the flow advances to step S22. In step S22,
A menu screen for key input corresponding to the information in the first buffer referred to in step S21 is displayed on the display unit 32, and the process proceeds to step S23. The key input menu screen has, for example, substantially the same configuration as that of FIG.

【００８０】オペレータは、キー入力用のメニュー画面
を目視して、音声認識対象候補の選択操作、あるいは、
利用者コードのキー入力操作などを行なう。ステップＳ
２３では、キー入力用のメニュー画面を表示した後のオ
ペレータの入力結果を監視し、ここで音声認識対象候補
の選択操作による選択コードが入力された場合、ステッ
プＳ２０に進み、上記選択された音声認識対象候補に対
応する利用者コードを自動入力し、ステップＳ５で入力
された利用者コードを確定する。The operator looks at the menu screen for key input and selects a speech recognition target candidate, or
Performs a key input operation of a user code. Step S
At 23, the operator's input result after the menu screen for key input is displayed is monitored, and if a selection code is input by the operation of selecting a voice recognition target candidate, the process proceeds to step S20 and the selected voice The user code corresponding to the recognition target candidate is automatically input, and the input user code is determined in step S5.

【００８１】また、ステップＳ２３において、利用者コ
ードが入力された場合、ステップＳ５に進み、入力され
た利用者コードを確定する。以下、具体例を用いて要部
を説明する。If a user code has been input in step S23, the flow advances to step S5 to determine the input user code. Hereinafter, the main part will be described using a specific example.

【００８２】オペレータは、図１５に示すように表示さ
れた郵便物Ｐの全体画像を目視しながら、本例では新郵
便番号が「２１０」しか記載されていないので、まず
「２１０」を入力し、その後、町名キーを押下する。続
けて、オペレータは、図１５の例の場合、「まいはま」
と発声し、音声入力する。The operator inputs "210" while watching the entire image of the postal matter P displayed as shown in FIG. 15, since only the new postal code "210" is described in this example. Then, press the town name key. Subsequently, in the case of the example of FIG.
And input a voice.

【００８３】こうして入力された音声情報は、音声認識
部３０において認識され、その認識結果から、音声認識
対象語彙読み情報領域５０の町名用ファイル５０ａ（図
７参照）が検索される。この例では、たとえば、「あけ
かい→うらやす→ほりえ」と認識し、認識対象登録コー
ドの候補は、以下に示す音声認識対象候補情報として抽
出される。The voice information thus input is recognized by the voice recognition unit 30, and the recognition result is searched for the file 50a for town name (see FIG. 7) in the vocabulary reading information area 50 for voice recognition. In this example, for example, the recognition is performed as “Akaikai → Urasasu → Horie”, and the candidates for the recognition target registration code are extracted as the following voice recognition target candidate information.

【００８４】認識対象登録コードの候補：１２１０ＸＸ
０３，１２１０ＸＸ０５，１２１０ＸＸ０２次に、上記抽出された認識対象登録コードの候補から、
音声認識対象表示情報領域５１（図９参照）を検索する
ことにより、対応する音声認識対象表示情報を抽出し
て、図１６に示すように表示部３２に表示する。音声認
識対象候補の表示は、利用者コード入力領域７４の上部
に表示し、第１音声認識対象候補の「明海」が枠表示さ
れている。これにより、他の音声認識対象候補と区別さ
れている。Recognition candidate registration code candidate: 1210XX
03,1210XX05,1210XX02 Next, from the extracted candidates for the recognition target registration code,
By searching the voice recognition target display information area 51 (see FIG. 9), the corresponding voice recognition target display information is extracted and displayed on the display unit 32 as shown in FIG. The display of the voice recognition target candidate is displayed above the user code input area 74, and the first voice recognition target candidate “Minghai” is displayed in a frame. This distinguishes it from other speech recognition target candidates.

【００８５】さらに、表示部３２の利用者コード入力領
域７４には、上記第１音声認識対象候補の「明海」に対
応する利用者コード「２１０−０００３」が表示され
る。なお、音声認識対象候補の表示方法として、音声認
識対象候補の横にインデックス情報や利用者コードなど
を表示してもよい。また、図１７に示すように、音声認
識対象候補の先頭にインデックス番号を表示すること
で、このインデックス番号を入力して音声認識対象候補
を選択決定してもよい。Further, in the user code input area 74 of the display unit 32, a user code "210-0003" corresponding to the first voice recognition target candidate "Akai" is displayed. Note that as a method of displaying the voice recognition target candidate, index information, a user code, or the like may be displayed next to the voice recognition target candidate. Alternatively, as shown in FIG. 17, an index number may be displayed at the beginning of the candidate for speech recognition, and the index number may be input to select and determine the candidate for speech recognition.

【００８６】オペレータは、図１６の画面を目視して、
この例の場合、目的の音声認識対象候補が存在しないの
で、再度音声入力を行なうため、再び町名キーを押下
し、続けて、「まいはま」と発声する。The operator looks at the screen shown in FIG.
In this example, since there is no target voice recognition target candidate, the town name key is pressed again to perform voice input again, and then "maihama" is uttered.

【００８７】こうして入力された音声情報は、音声認識
部３０において再度認識され、その認識結果から、音声
認識対象語彙読み情報領域５０の町名用ファイル５０ａ
が検索される。ここでは、前回の検索で音声認識対象候
補になった「あけかい、うらやす、ほりえ」を除外し
て、町名用ファイル５０ａを検索する。この例では、た
とえば、「まいはま→ねこざね→まえはま」と認識し、
認識対象登録コードの候補は、以下に示す音声認識対象
候補情報として抽出される。The voice information thus input is recognized again by the voice recognition unit 30, and from the recognition result, the file 50a for the town name in the vocabulary reading information area 50 for voice recognition is obtained.
Is searched. Here, the file for town name 50a is searched, excluding “Akaikai, Urasuyasu, Horie”, which has become a candidate for speech recognition in the previous search. In this example, for example, it recognizes "Maihama → Nekozane → Maehama"
Recognition candidate registration code candidates are extracted as speech recognition target candidate information described below.

【００８８】認識対象登録コードの候補：１２１０ＸＸ
０１，１２１０ＸＸ１０，１２１０ＸＸ０１次に、上記抽出された認識対象登録コードの候補から、
音声認識対象表示情報領域５１を検索することにより、
対応する音声認識対象表示情報を抽出して、図１８に示
すように表示部３２に表示する。この例では、同じ認識
対象登録コードの候補が複数あるので、１つの認識対象
登録コードとして表示される。Recognition registration code candidate: 1210XX
01,1210XX10,1210XX01 Next, from the extracted candidates for the recognition target registration code,
By searching the voice recognition target display information area 51,
The corresponding voice recognition target display information is extracted and displayed on the display unit 32 as shown in FIG. In this example, since there are a plurality of candidates for the same recognition target registration code, they are displayed as one recognition target registration code.

【００８９】図１８に示すように、第１音声認識対象候
補の「舞浜」が枠表示されており、さらに、利用者コー
ド入力領域７４には、上記第１音声認識対象候補の「舞
浜」に対応する利用者コード「２１０−０００１」が表
示される。As shown in FIG. 18, the first voice recognition target candidate “Maihama” is displayed in a frame, and the user code input area 74 is displayed with the first voice recognition target candidate “Maihama”. The corresponding user code "210-0001" is displayed.

【００９０】オペレータは、図１８の画面において、た
とえば、キーボード３１に設けられた候補選択キーによ
り、目的の音声認識対象候補として「舞浜」を選択決定
することにより、このとき利用者コード入力領域７４に
表示されている「舞浜」に対応する利用者コード「２１
０−０００１」が自動入力され、入力された利用者コー
ドとして確定される。The operator selects and determines “Maihama” as a target voice recognition target candidate using, for example, a candidate selection key provided on the keyboard 31 on the screen of FIG. User code "21" corresponding to "Maihama" displayed in
"0-0001" is automatically input and determined as the input user code.

【００９１】次に、オペレータは、続けて、次の住所表
示番号（１−２−３−Ｂ−２０１）を１桁ずつ入力する
が、その住所表示番号の１桁目を入力したタイミングで
音声認識対象候補の表示画面を消去し、図１９に示すよ
うな利用者コード入力画面に切換える。Next, the operator continuously inputs the next address display number (1-2-3-B-201) one digit at a time. The display screen of the recognition target candidates is deleted, and the screen is switched to a user code input screen as shown in FIG.

【００９２】このようにして入力された利用者コードの
入力結果は、第１の記憶部２４内の利用者コード入力結
果領域４８に書込まれる。次に、第２の記憶部２５に格
納されている住所データベースと、第１の記憶部２４内
の利用者コード入力結果領域４８に書込まれた利用者コ
ードの入力結果とを照合し、その照合結果を第１の記憶
部２４内の検索結果領域４９に書込む。[0092] The input result of the user code input as described above is written in the user code input result area 48 in the first storage unit 24. Next, the address database stored in the second storage unit 25 is compared with the input result of the user code written in the user code input result area 48 in the first storage unit 24. The collation result is written into the search result area 49 in the first storage unit 24.

【００９３】上記照合結果が正解であれば、第１の記憶
部２４内の利用者コード入力結果領域４８の利用者コー
ドがバーコード印刷部３へ転送され、バーコード情報に
変換されて、対応する郵便物Ｐ上に印刷される。If the collation result is correct, the user code in the user code input result area 48 in the first storage unit 24 is transferred to the barcode printing unit 3 and is converted into barcode information. Is printed on the postal matter P to be printed.

【００９４】上記照合結果が不正解であれば、不正解の
データがバーコード印刷部３へ転送され、やはりバーコ
ード情報に変換されて、対応する郵便物Ｐ上に印刷され
る。以上説明したように上記実施の形態によれば、利用
者コードが記載されていない、利用者コードが不完全、
利用者コードの判読が困難、さらに、間違った利用者コ
ードが記載されているなどの郵便物に対して、町名、大
口受取人名などの音声認識対象名を音声で入力すること
により、音声認識され、確度の高い音声認識対象候補を
表示する。オペレータは、表示された音声認識対象候補
の中から選択決定するだけで、正確に利用者コードを入
力することができる。If the collation result indicates an incorrect answer, the incorrect answer data is transferred to the barcode printing unit 3 and is also converted into barcode information and printed on the corresponding mail P. As described above, according to the embodiment, the user code is not described, the user code is incomplete,
It is difficult to read the user code, and it is also possible to input a voice recognition target name such as the name of a town or a large recipient for voice mail, such as an incorrect user code. , And display a voice recognition target candidate with high accuracy. The operator can accurately input the user code only by selecting and determining from the displayed voice recognition target candidates.

【００９５】また、音声認識するための辞書情報につい
ても、性別や個人別ごとに持つことにより、音声認識率
が向上する。また、音声認識の結果を基に検索するため
の音声認識対象語彙読み情報領域が、同じ対象情報に対
して、複数の認識対象語彙読み情報を持つことが可能
で、さらに、同じ認識対象登録コードの候補が複数抽出
されても、１つの認識対象表示情報として表示すること
から、音声認識によるコード化の正確性および迅速性が
向上する。Also, by providing dictionary information for voice recognition for each gender or individual, the voice recognition rate is improved. Also, the speech recognition target vocabulary reading information area for searching based on the result of speech recognition can have a plurality of recognition target vocabulary reading information for the same target information, and furthermore, the same recognition target registration code Even if a plurality of candidates are extracted, they are displayed as one piece of recognition target display information, so that the accuracy and speed of encoding by voice recognition are improved.

【００９６】また、音声認識対象候補の表示に関して
も、認識確度が一番高い第１音声認識対象候補とそれ以
外とを区別して表示することで、オペレータは、迅速か
つ正確に音声認識対象候補の選択が可能になる。Further, regarding the display of the voice recognition target candidate, the first voice recognition target candidate having the highest recognition accuracy is distinguished from the other voice recognition target candidates so that the operator can quickly and accurately identify the voice recognition target candidate. Selection becomes possible.

【００９７】また、第１音声認識対象候補に対応するコ
ードは、最初から利用者コード入力領域に表示してお
き、音声認識対象候補の選択中は、暫定的に選択されて
いる音声認識対象候補に対応するコードを、逐次、利用
者コード入力領域に表示することで、効率のよいシステ
ムになる。The code corresponding to the first candidate for speech recognition is displayed in the user code input area from the beginning, and while the candidate for speech recognition is selected, the temporarily selected candidate for speech recognition is selected. Are displayed in the user code input area one after another, thereby providing an efficient system.

【００９８】さらに、当該音声認識対象の郵便物に対し
て、２回目以降の音声入力処理の有無情報を含んだ音声
認識対象種別情報を入力できるようにし、該音声入力処
理の有無情報が有りであった場合、以前の音声入力で音
声認識対象候補として抽出された語棄読み情報を除外し
て、音声認識対象語彙読み情報領域から検索し、認識対
象登録コードの候補にすることから、効率がよく、正確
性の高いシステムが実現できる。Further, the voice recognition target type information including the presence / absence information of the second or later voice input process can be input to the voice mail to be subjected to the voice recognition. If there is, the word abandonment information extracted as a candidate for speech recognition in the previous speech input is excluded, and it is searched from the vocabulary reading information area for speech recognition and becomes a candidate for a recognition target registration code. A good and accurate system can be realized.

【００９９】[0099]

【発明の効果】以上詳述したように本発明によれば、オ
ペレータが効率よく短時間で疲労が少い操作で利用者コ
ードを入力でき、正確性や省力化が向上し、迅速な処理
が実現できるビデオコーディング装置を提供できる。As described above in detail, according to the present invention, an operator can input a user code efficiently in a short time and with less fatigue, thereby improving accuracy and labor saving, and speeding up processing. A video coding device that can be realized can be provided.

[Brief description of the drawings]

【図１】手書き郵便物の一例を示す平面図。FIG. 1 is a plan view showing an example of a handwritten mail.

【図２】印刷活字郵便物の一例を示す平面図。FIG. 2 is a plan view showing an example of a print type mail.

【図３】本発明の実施の形態に係るビデオコーディング
装置が適用される郵便物処理装置の構成を概略的に示す
ブロック図。FIG. 3 is a block diagram schematically showing a configuration of a mail processing apparatus to which the video coding apparatus according to the embodiment of the present invention is applied.

【図４】読取部の構成を示すブロック図。FIG. 4 is a block diagram illustrating a configuration of a reading unit.

【図５】本発明の実施の形態に係るビデオコーディング
装置の構成を概略的に示すブロック図。FIG. 5 is a block diagram schematically showing a configuration of a video coding device according to the embodiment of the present invention.

【図６】第１の記憶部内に確保された特定領域のメモリ
マップを示す図。FIG. 6 is a diagram showing a memory map of a specific area secured in a first storage unit.

【図７】第１の記憶部における音声認識対象語彙読み情
報領域の町名用ファイルの構成例を示す図。FIG. 7 is a view showing a configuration example of a town name file in a vocabulary reading information area for speech recognition in a first storage unit.

【図８】第１の記憶部における音声認識対象語彙読み情
報領域の大口受取人名用ファイルの構成例を示す図。FIG. 8 is a diagram showing a configuration example of a large recipient name file in a vocabulary reading information area for speech recognition in a first storage unit.

【図９】第１の記憶部における音声認識対象表示情報領
域の構成例を示す図。FIG. 9 is a diagram showing a configuration example of a speech recognition target display information area in a first storage unit.

【図１０】音声認識部の構成例を示すブロック図。FIG. 10 is a block diagram illustrating a configuration example of a speech recognition unit.

【図１１】表示部における表示画面例を示す図。FIG. 11 illustrates an example of a display screen on a display unit.

【図１２】ビデオコーディング装置の動作を説明するた
めのフローチャート。FIG. 12 is a flowchart for explaining the operation of the video coding device.

【図１３】ビデオコーディング装置の動作を説明するた
めのフローチャート。FIG. 13 is a flowchart for explaining the operation of the video coding device.

【図１４】ビデオコーディング装置の動作を説明するた
めのフローチャート。FIG. 14 is a flowchart for explaining the operation of the video coding device.

【図１５】表示部におけるリジェクト郵便物に対する具
体的な表示画面例を示す図。FIG. 15 is a diagram showing an example of a specific display screen for a rejected mail on the display unit.

【図１６】表示部におけるリジェクト郵便物に対する具
体的な表示画面例を示す図。FIG. 16 is a view showing a specific example of a display screen for a rejected mail on the display unit.

【図１７】表示部におけるリジェクト郵便物に対する具
体的な表示画面例を示す図。FIG. 17 is a diagram showing an example of a specific display screen for a rejected mail on the display unit.

【図１８】表示部におけるリジェクト郵便物に対する具
体的な表示画面例を示す図。FIG. 18 is a view showing an example of a specific display screen for a rejected mail on the display unit.

【図１９】表示部におけるリジェクト郵便物に対する具
体的な表示画面例を示す図。FIG. 19 is a view showing an example of a specific display screen for a rejected mail on the display unit.

[Explanation of symbols]

Ｐ……郵便物、１……供給部、２……読取部、３……バ
ーコード印刷部、４……遅延パス、５……バーコード読
取部、６……区分部、７……ビデオコーディング装置、
１１……光電変換部、１６……文字認識部、１８……利
用者コード認識部、２１……ＣＰＵ、２４……第１の記
憶部、２５……第２の記憶部、２６……入力インタフェ
イス、２７……キーボード制御部、２８……表示制御
部、３０……音声認識部、３１……キーボード、３２…
…表示部、３４……音声入力部、５０……音声認識対象
語彙読み情報領域、５０ａ……町名用ファイル、５０ｂ
……大口受取人名用ファイル、５１……音声認識対象表
示情報領域、６１……新郵便番号コード指定記載領域、
６２……宛名情報領域、７１……全体画像表示領域、７
２……新郵便番号コード入力領域、７３……住所表示番
号コード入力領域、７４……利用者コード入力領域。P: mail, 1 ... supply unit, 2 ... reading unit, 3 ... barcode printing unit, 4 ... delay path, 5 ... barcode reading unit, 6 ... sorting unit, 7 ... video Coding equipment,
11: photoelectric conversion unit, 16: character recognition unit, 18: user code recognition unit, 21: CPU, 24: first storage unit, 25: second storage unit, 26: input Interface 27 Keyboard control unit 28 Display control unit 30 Voice recognition unit 31 Keyboard 32
... display unit, 34 ... voice input unit, 50 ... vocabulary reading information area for voice recognition, 50a ... town name file, 50b
... Large recipient name file, 51... Voice recognition target display information area, 61...
62 ... address information area, 71 ... whole image display area, 7
2 ... New zip code input area, 73 ... Address display number code input area, 74 ... User code input area

Claims

[Claims]

1. An overall image on a postal matter in which a user code and address information including at least a postal code and an address display number, and recognition result information on the entire image of the postal matter are fetched. And inputting the user code on the mail based on the displayed image, wherein the entire image of the mail that could not be recognized by the recognition means, and the mail A first input unit for inputting candidate characters of a user code recognized by the recognition unit, an entire image of a mail item input by the first input unit, and a candidate character of the user code; A first display unit for displaying a user code input area for inputting the user code, and a display unit for simultaneously displaying the entire image of the postal matter displayed on the first display unit while viewing the entire image. Second input means for inputting a user code in a user code input area or inputting speech recognition target type information, and voice based on the entire image of the mail displayed on the first display means. Voice input means for inputting a recognition target name by voice; voice recognition means for recognizing voice information input by the voice input means; recognition target vocabulary reading information for each type of voice recognition target; A first storage unit that stores a code, a type of the voice recognition target input by the second input unit, and a search in the first storage unit based on a recognition result of the voice recognition unit. By at least one
Search means for extracting one or more candidates for the recognition target registration code; second storage means for storing recognition target display information corresponding to the candidates for the recognition target registration code; and a recognition target output from the search means By searching the second storage means based on the registration code candidate,
Second display means for displaying at least one or more recognition target display information corresponding to a candidate for a recognition target registration code; and selecting and determining a recognition target candidate from the recognition target display information displayed on the second display means. A video coding apparatus comprising: a recognition target candidate selecting unit.

2. The information stored in the first storage means is:
2. The video coding apparatus according to claim 1, wherein the video coding apparatus comprises information on each post office, and information on all the offices in which the post offices are collected.

3. The video coding apparatus according to claim 1, wherein said first storage means stores a plurality of recognition target vocabulary reading information for the same recognition target registration code.

4. The video coding apparatus according to claim 1, wherein said second display means displays the speech recognition target candidate having the highest recognition accuracy separately from other speech recognition target candidates.

5. The apparatus according to claim 1, wherein the second display unit displays additional information corresponding to the voice recognition target candidate from the recognition target display information near the voice recognition target candidate to be displayed. A video coding device as described.

6. The video coding apparatus according to claim 1, wherein said second display means displays the index information in the vicinity of the voice recognition target candidate to be displayed.

7. The video coding apparatus according to claim 1, wherein said second display means displays, as a single piece of recognition target display information, a plurality of candidates for the same recognition target registration code.

8. The second display means displays an input code corresponding to a voice recognition target candidate having the highest recognition accuracy in the user code input area before selecting a recognition target candidate, and performs recognition. 2. The video coding apparatus according to claim 1, wherein when a target candidate is selected, an input code corresponding to the user code input area is displayed.

9. The recognition target candidate selecting unit selects and determines a recognition target candidate by inputting index information corresponding to the recognition target candidate.
A video coding device as described.

10. The second input means includes an input mode switching means for performing a key input or a voice input and a holding means for holding the voice recognition target type information and post office information. When the input mode is switched by the input mode switching means, the voice recognition means transmits the same information as the voice recognition target type information and post office information held in the holding means of the input mode before the input mode switching. The video coding apparatus according to claim 1, wherein the video coding apparatus is targeted.

11. The search means searches from the information of each post office in the first storage means when the post office is specified, and when the post office is not specified,
3. The video coding apparatus according to claim 2, wherein a search is performed from the all-station information in the first storage unit.

12. The second input means inputs voice recognition target type information including the presence / absence of the second or subsequent voice input processing for the postal matter to be voice-recognized, and the search means comprises: If there is the presence / absence information of the voice input processing,
The recognition target registration code candidate is obtained by excluding the recognition target vocabulary reading information extracted as a voice recognition target candidate in a previous voice input and searching the first storage unit. 2. The video coding device according to 1.