JP2001350621A

JP2001350621A - Method for inputting or starting coordinate position on display screen, and device for inputting or starting coordinate position on display screen

Info

Publication number: JP2001350621A
Application number: JP2000168793A
Authority: JP
Inventors: Kiyoyuki Suzuki; 清幸鈴木
Original assignee: Advanced Media Inc
Current assignee: Advanced Media Inc
Priority date: 2000-06-06
Filing date: 2000-06-06
Publication date: 2001-12-21

Abstract

PROBLEM TO BE SOLVED: To provide a method for inputting coordinate position on display screen and a device for inputting coordinate position on display screen in which the disadvantage of a conventional pointing device is removed by paying attention to a figure, alphabet, katakana (the square form of the Japanese syllabary) (hiragana (the cursive form of the Japanese syllabary)) or command and performing only its voice recognition. SOLUTION: A figure, alphabet or katakana (hiragana) is attached to the sensitive area on a display screen displayed according to the content of a program. The figure, alphabet or katakana (hiragana) attached to the area shows that this area is the sensitive area. The figure or the like, for example, attached to the area to be clicked is called with a clear voice. After the voice is recognized, the program of the sensitive area with the figure is started. Accordingly, the area can be instantaneously clicked without requiring the time and labor of moving the pointing device to the sensitive area.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、情報処理装置また
は携帯電話等における表示画面の感応領域にマウスのカ
ーソルを移動させた後、クリックする代わりに、音声の
みによってクリックすることができる表示画面上の座標
位置を入力または起動する方法、および表示画面上の座
標位置を入力または起動する装置に関するものである。
特に、本発明は、表示画面が小さくて見に難い、あるい
はカーソルの位置合わせが困難な情報処理装置等に都合
のよい表示画面上の座標位置を入力または起動する方
法、および表示画面上の座標位置を入力または起動する
装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an information processing apparatus, a mobile phone, and the like, in which a mouse cursor is moved to a sensitive area of a display screen, and then the display can be clicked only by voice instead of clicking. And a device for inputting or starting a coordinate position on a display screen.
In particular, the present invention relates to a method of inputting or starting a coordinate position on a display screen which is convenient for an information processing apparatus or the like in which a display screen is small and is difficult to see or a cursor is difficult to position, and a coordinate on the display screen. It concerns a device for inputting or activating a position.

【０００２】[0002]

【従来の技術】座標位置を入力する装置としてのポイン
ティグ・デバイスは、たとえば、トラックボール、ライ
トペン、マウス等がある。前記トラックボールは、ラッ
プトップ型情報処理装置のポインティグ・デバイスとし
て使用されている。トラックボールは、ボールの回転に
よって回転する側面にパターンホイールを備えたＸ軸ロ
ーラと、Ｙ軸ローラとからなり、発光ダイオードとフォ
トトランジスタによって各ローラが回転するとパルスが
発生するようになっている。そして、トラックボール
は、前記各軸のローラの移動量で１パルス出力した場
合、１ドットと定義しておくことで、１ドット単位の位
置指定が可能である。2. Description of the Related Art A pointing device as a device for inputting a coordinate position includes, for example, a trackball, a light pen, a mouse, and the like. The trackball is used as a pointing device of a laptop information processing device. The trackball includes an X-axis roller having a pattern wheel on a side surface rotated by the rotation of the ball, and a Y-axis roller, and a pulse is generated when each roller is rotated by a light emitting diode and a phototransistor. When one pulse is output with the movement amount of the roller of each axis, the trackball is defined as one dot, and the position can be designated in units of one dot.

【０００３】ライトペンは、ディスプレイの画面から入
力したい位置を指すことにより、ライトペンの輝点を発
したビームをフォトトランジスタで検出し、その位置情
報をコンピュータに知らせることができる。また、マウ
スは、マウスに設けられているボールの回転が伝達され
るＸ軸およびＹ軸方向に設けられた検出車輪と同軸の検
出器によって、その移動量が検出される。A light pen can detect a beam emitted from a bright point of the light pen by a phototransistor by pointing to a position to be input from a screen of a display, and inform the computer of the position information. Further, the amount of movement of the mouse is detected by a detector coaxial with detection wheels provided in the X-axis and Y-axis directions to which rotation of a ball provided on the mouse is transmitted.

【０００４】図７は従来のポインティグ・デバイスで表
示画面の感応領域をクリックして感応領域の情報を表示
させるためのフローチャートである。図７において、プ
ログラムを起動（ステップ７１）させて、所望の情報を
表示画面に表示させる（ステップ７２）。表示画面にお
ける感応領域の情報を開きたい場合、オペレータは、ポ
インティグ・デバイスを前記感応領域がある位置まで移
動させる（ステップ７３）。FIG. 7 is a flowchart for displaying information on a sensitive area by clicking a sensitive area on a display screen with a conventional pointing device. In FIG. 7, the program is started (step 71), and desired information is displayed on the display screen (step 72). When the operator wants to open the information of the sensitive area on the display screen, the operator moves the pointing device to a position where the sensitive area is located (step 73).

【０００５】オペレータは、ポインティグ・デバイスが
感応領域に達したか否かを見（ステップ７４）ながら、
移動を続ける（ステップ７３）。オペレータは、ポイン
ティグ・デバイスが感応領域に達したと判断した場合
（ステップ７４）、クリックを行う（ステップ７５）
と、表示画面に感応領域の情報が表示される（ステップ
７６）。[0005] The operator checks whether the pointing device has reached the sensitive area (step 74).
The movement is continued (step 73). When the operator determines that the pointing device has reached the sensitive area (step 74), the operator clicks (step 75).
Is displayed on the display screen (step 76).

【０００６】図８は従来の音声認識技術を説明するため
のブロック構成図である。図８において、音声は、マイ
クロホン８１によって入力された後、Ａ／Ｄ変換器８２
によってデジタル信号（ＰＣＭ）に変換される。前記デ
ジタル信号は、たとえば、１秒間に１１０２５個の値を
サンプリングする。次に、前記デジタル信号は、周波数
分析器８３によって分析された結果、周波数スペクトル
を表す値が得られる。FIG. 8 is a block diagram for explaining a conventional speech recognition technique. In FIG. 8, after a voice is input by a microphone 81, an A / D converter 82
Is converted into a digital signal (PCM). The digital signal samples, for example, 11025 values per second. Next, the digital signal is analyzed by the frequency analyzer 83 to obtain a value representing a frequency spectrum.

【０００７】前記周波数スペクトルは、ベクトル量子化
手段８５によって、量子化され、１００種類程度のグル
ープに分類され、符号帳８４に記憶されたラベルと照合
することにより、前記周波数分析を行う毎に音声の特徴
が抽出されたラベル列ができる。このラベル列は、ＨＭ
Ｍ（ＨｉｄｄｅｎＭａｒｋｏｖＭｏｄｅｌ）と呼ば
れる確率的なモデルを使用して尤度計算手段８７により
尤度を求める。単語や言葉の認識を行う場合、前記単語
や言葉としての文法を認識するＨＭＭ８６を用意する必
要がある。そのため、前記ＨＭＭ８６および尤度計算手
段８７は、メモリ等演算手段が大きなものとなる。[0007] The frequency spectrum is quantized by a vector quantization means 85, classified into about 100 types of groups, and collated with a label stored in a codebook 84, so that each time the frequency analysis is performed, a voice A label string is created in which the features of are extracted. This label column is HM
The likelihood is calculated by the likelihood calculating means 87 using a probabilistic model called M (Hidden Markov Model). When recognizing a word or a word, it is necessary to prepare an HMM 86 for recognizing the grammar as the word or the word. Therefore, the HMM 86 and the likelihood calculating means 87 require a large calculating means such as a memory.

【０００８】前記単語ＨＭＭ８６は、それぞれラベル列
がどのくらいの確率で出現するのかを表したテーブルを
持っている。そして、単語は、前記テーブルを参照し
て、単語毎の出現率を求め、この出現率が最大になった
単語を選んで認識結果８８とする。The word HMM 86 has a table indicating the probability of occurrence of each label string. For the words, the appearance rate of each word is obtained by referring to the table, and the word having the highest appearance rate is selected as the recognition result 88.

【０００９】[0009]

【発明が解決しようとする課題】マウス、トラックボー
ル、グライド・ポインターのような従来のポインティグ
・デバイスは、所望の感応領域や文字入力領域に移動す
ることが困難な場合があった。特に、マウス等を動かす
場所が狭い場合、オペレータは、マウスを何回か持ち上
げながら所望位置に移動するため、時間がかかるだけで
なく、面倒である。また、オペレータは、カーソルの位
置が最初何処にあるのか判らない場合があり、マウスを
所望位置と違う方向に移動している場合がある。Conventional pointing devices such as a mouse, a trackball, and a glide pointer sometimes have difficulty moving to a desired sensitive area or character input area. In particular, when the place where the mouse or the like is moved is small, the operator moves the mouse to a desired position while lifting the mouse several times, which takes time and is troublesome. The operator may not know where the cursor is initially located, and may move the mouse in a direction different from the desired position.

【００１０】特に、ラップトップ型の情報処理装置、モ
バイル型情報処理装置、携帯電話等小型の機器は、カー
ソルの位置が判り難いだけでなく、カーソルの位置を所
望の位置に移動することが容易でない場合がしばしばあ
った。また、年寄や目の弱い者は、ポインティグ・デバ
イスを所望の位置に移動し、その場所をクリックするこ
とが容易でない場合がある。さらに、オペレータは、ポ
インティグ・デバイスを感応領域まで移動してみて、初
めてその場所が感応領域であることが判る。In particular, small devices such as a laptop type information processing device, a mobile type information processing device, and a mobile phone not only make it difficult to determine the position of the cursor, but also easily move the position of the cursor to a desired position. Often it was not. In addition, it may not be easy for an elderly person or a person with weak eyes to move the pointing device to a desired position and click that position. Further, when the operator moves the pointing device to the sensitive area, the operator can recognize that the location is the sensitive area for the first time.

【００１１】従来の音声認識は、単語ＨＭＭや符号帳に
膨大なデータを備えておく必要がある。音声認識は、た
とえば、一つの単語に対して、助詞、助動詞等の付き
方、文節、複文節、接頭語付き文節等の変換によって、
変わる。これらのデータを全て備えたソフトウェアは、
大きくなり、モバイル機器や携帯電話のように小型のも
のに応用することができなかった。In conventional speech recognition, it is necessary to prepare a huge amount of data in a word HMM or a codebook. Speech recognition, for example, for a single word, particle, auxiliary verb attachment, phrase, compound phrase, conversion of prefixed phrases, etc.,
change. Software with all these data,
It could not be applied to small devices such as mobile devices and mobile phones.

【００１２】本発明は、以上のような課題を解決するた
めに、数字、アルファベット、片仮名（平仮名−音声の
場合片仮名と平仮名は同じである）、コマンドに注目
し、これらのみの音声認識を行うことにより、従来のポ
インティグ・デバイスの欠点を除去した表示画面上の座
標位置を入力または起動する方法、および表示画面上の
座標位置を入力または起動する装置を提供することを目
的とする。In order to solve the above problems, the present invention focuses on numbers, alphabets, katakana (in the case of hiragana-speech, katakana and hiragana are the same), and commands, and performs voice recognition of only these. Accordingly, it is an object of the present invention to provide a method of inputting or starting a coordinate position on a display screen and a device for inputting or starting a coordinate position on a display screen, which eliminates the drawbacks of the conventional pointing device.

【００１３】本発明は、数字、アルファベット、片仮名
（平仮名）、コマンドに注目し、これらのみの音声認識
を行うことにより、小型の情報処理装置に使用できる表
示画面上の座標位置を入力または起動する方法、および
表示画面上の座標位置を入力または起動する装置を提供
することを目的とする。The present invention focuses on numbers, alphabets, katakana (hiragana), and commands, and performs voice recognition of only these to input or activate a coordinate position on a display screen that can be used in a small-sized information processing device. It is an object to provide a method and an apparatus for inputting or activating a coordinate position on a display screen.

【００１４】[0014]

【課題を解決するための手段】（第１発明）第１発明の
表示画面上の座標位置を入力または起動する方法は、プ
ログラムの内容にしたがって表示されている表示画面上
で、感応領域となっている領域に付けられている数字、
アルファベット、片仮名（平仮名）およびこれらの組合
せに対して音声で呼びあげると、前記音声の数字、アル
ファベット、片仮名（平仮名）およびこれらの組合せを
認識して、前記感応領域のプログラムが起動することを
特徴とする。According to a first aspect of the present invention, a method for inputting or starting a coordinate position on a display screen is a method of forming a sensitive area on a display screen displayed according to the contents of a program. Number attached to the area
When the alphabet, katakana (hiragana) and a combination thereof are called up by voice, the number of the voice, the alphabet, katakana (hiragana) and the combination thereof are recognized, and the program of the sensitive area is started. And

【００１５】（第２発明）第２発明の表示画面上の座標
位置を入力または起動する方法は、プログラムの内容に
したがって表示されている表示画面上で、文字入力領域
となっている領域に付けられている数字、アルファベッ
ト、片仮名（平仮名）およびこれらの組合せに対して音
声で呼びあげると、前記音声の数字、アルファベット、
片仮名（平仮名）およびこれらの組合せを認識して、前
記文字入力領域にカーソルが移動することを特徴とす
る。(Second invention) The method of inputting or starting a coordinate position on a display screen according to the second invention is to add a character input area on a display screen displayed according to the contents of a program. When voice is called for numbers, alphabets, katakana (hiragana) and combinations thereof, the numbers, alphabets,
The system is characterized in that a katakana (hiragana) and a combination thereof are recognized and a cursor is moved to the character input area.

【００１６】（第３発明）第３発明の表示画面上の座標
位置を入力または起動する方法において、音声は、名称
および／またはコマンドの読みあげであることを特徴と
する。(Third Invention) In the method for inputting or starting a coordinate position on a display screen according to the third invention, the voice is a reading of a name and / or a command.

【００１７】（第４発明）第４発明の表示画面上の座標
位置を入力または起動する装置は、プログラムの内容に
したがって表示されると共に、前記表示されている感応
領域に数字、アルファベット、片仮名（平仮名）および
これらの組合せが付けられている表示画面と、前記感応
領域に付けられている数字、アルファベット、片仮名
（平仮名）おこれらの組合せが音声で入力されるマイク
ロホン２１と、前記マイクロホンの信号をデジタル信号
に変換するＡ／Ｄ変換器２２と、前記Ａ／Ｄ変換器２２
の出力を音声認識する音声認識手段（２３ないし２８）
と、前記音声認識手段（２３ないし２８）の出力によっ
て、認識された前記音声の数字、アルファベット、片仮
名（平仮名）およびこれらの組合せに対応する前記感応
領域のプログラムを起動させることを特徴とする。(Fourth invention) A device for inputting or starting a coordinate position on a display screen according to the fourth invention is displayed according to the contents of a program, and includes numerals, alphabets, and katakana characters in the displayed sensitive area. A display screen to which the hiragana and the combination thereof are attached; a microphone 21 to which numbers, alphabets, katakana (hiragana) and the combination of these are attached in the sensitive region; An A / D converter 22 for converting to a digital signal;
Voice recognition means (23 to 28) for voice recognition of the output of
And the output of the voice recognition means (23 to 28) activates a program of the sensitive area corresponding to the recognized number, alphabet, katakana (hiragana), and a combination thereof.

【００１８】（第５発明）第５発明の表示画面上の座標
位置を入力または起動する装置は、プログラムの内容に
したがって表示されると共に、前記表示されている感応
領域に名称および／またはコマンドが記載されている表
示画面と、前記感応領域に記載されている名称および／
またはコマンドが音声となって入力されるマイクロホン
２１と、前記マイクロホン２１の信号をデジタル信号に
変換するＡ／Ｄ変換器２２と、前記Ａ／Ｄ変換器２２の
出力を音声認識する音声認識手段（２３ないし２８）
と、前記音声認識手段（２３ないし２８）の出力によっ
て、認識された前記名称および／またはコマンドに対応
したプログラムを起動させることを特徴とする。(Fifth invention) A device for inputting or starting a coordinate position on a display screen according to the fifth invention is displayed according to the contents of a program, and a name and / or a command is displayed in the displayed sensitive area. The display screen described, and the name and / or
Alternatively, a microphone 21 to which a command is input as voice, an A / D converter 22 for converting a signal of the microphone 21 into a digital signal, and voice recognition means for voice recognition of an output of the A / D converter 22 ( 23-28)
And a program corresponding to the recognized name and / or command is activated by an output of the voice recognition means (23 to 28).

【００１９】（第６発明）第６発明の表示画面上の座標
位置を入力または起動する装置は、プログラムの内容に
したがって表示されている表示画面上で、文字入力領域
に数字、アルファベット、片仮名（平仮名）およびこれ
らの組合せが記載されている表示画面と、前記文字入力
領域に記載されている数字、アルファベット、片仮名
（平仮名）およびこれらの組合せが音声となって入力さ
れるマイクロホン２１と、前記マイクロホン２１の信号
をデジタル信号に変換するＡ／Ｄ変換器２２と、前記Ａ
／Ｄ変換器２２の出力を音声認識する音声認識手段（２
３ないし２８）と、前記音声認識手段（２３ないし２
８）の出力によって、認識された前記数字、アルファベ
ット、片仮名（平仮名）およびこれらの組合せに対応し
た位置にカーソルが移動することを特徴とする。(Sixth invention) A device for inputting or starting a coordinate position on a display screen according to the sixth invention is characterized in that numerals, alphabets, and katakana characters are entered in a character input area on a display screen displayed according to the contents of a program. A display screen on which hiragana and a combination thereof are described, a microphone 21 in which numerals, alphabets, katakana (hiragana) and a combination thereof described in the character input area are inputted as voice, and the microphone An A / D converter 22 for converting the signal of the A / D converter 21 into a digital signal;
Voice recognition means (2) for voice-recognizing the output of the / D converter 22
3 to 28) and the voice recognition means (23 to 2)
According to the output of 8), the cursor is moved to a position corresponding to the recognized number, alphabet, katakana (hiragana) and a combination thereof.

【００２０】（第７発明）第７発明の表示画面上の座標
位置を入力または起動する装置において、音声認識手段
（２３ないし２８）は、ボードまたはチップに組み込ま
れていることを特徴とする。(Seventh invention) In the device for inputting or starting a coordinate position on a display screen according to the seventh invention, the voice recognition means (23 to 28) is incorporated in a board or chip.

【００２１】（第８発明）第８発明の表示画面上の座標
位置を入力または起動する装置において、感応領域に記
載されている数字、アルファベット、片仮名（平仮名）
およびこれらの組合せ、名称、およびコマンド等は、特
定のフォント、形状、色、網かけ等の修飾により、非感
応領域と区別されていることを特徴とする。(Eighth invention) In the device for inputting or starting a coordinate position on a display screen according to the eighth invention, a numeral, an alphabet, a katakana (hiragana) written in a sensitive area is provided.
And combinations, names, and commands thereof are distinguished from non-sensitive areas by specific font, shape, color, shading, and other modifications.

【００２２】（第９発明）第９発明の表示画面上の座標
位置を入力または起動する装置は、デスクトップ型情報
処理装置、ラップトップ型情報処理装置、モバイル型情
報処理装置、携帯電話のいずれかに備えられていること
を特徴とする。(Ninth Invention) The device for inputting or starting the coordinate position on the display screen according to the ninth invention is any one of a desktop information processing device, a laptop information processing device, a mobile information processing device, and a mobile phone. It is characterized by being provided in.

【００２３】[0023]

【発明の実施の形態】（第１発明）第１発明は、プログ
ラムの内容にしたがって表示されている表示画面上の感
応領域に、数字、アルファベット、片仮名（平仮名）お
よびこれらの組合せが付けられている。前記数字、アル
ファベット、片仮名（平仮名）およびこれらの組合せが
付けられている領域は、感応領域であることを示してい
る。そして、クリックしたい領域に付けられている、た
とえば、数字をハッキリした声で呼びあげる。前記声
は、音声認識された後、数字の付された感応領域のプロ
グラムが起動する。本発明は、ポインティグ・デバイス
を感応領域まで移動する時間と手間が不必要で、瞬時に
クリックされる。感応領域と感応領域でない数字等は、
色、形、フォント等によって予め決めておくことによ
り、区別が容易である。BEST MODE FOR CARRYING OUT THE INVENTION (First Invention) In a first invention, a sensitive area on a display screen displayed according to the contents of a program is provided with numbers, alphabets, katakana (hiragana) and a combination thereof. I have. The area to which the numeral, alphabet, katakana (hiragana) and a combination thereof are attached indicates that the area is a sensitive area. Then, for example, call out the number attached to the area you want to click with a clear voice. After the voice is recognized as a voice, a program in a sensitive area numbered is activated. The present invention does not require the time and effort to move the pointing device to the sensitive area, and is instantaneously clicked. The sensitive area and the numbers that are not in the sensitive area
Discrimination is easy by predetermining colors, shapes, fonts, and the like.

【００２４】音声認識は、数字、アルファベット、片仮
名（平仮名）およびこれらの組合せのみであるため、助
詞、変換、文節等多くのデータが不要であるだけでな
く、小型で認識率を向上させることができる。また、小
型の携帯用情報処理装置は、特定の個人のみが使用する
場合が多いため、学習効果により音声認識率が高くな
る。本発明は、従来のポインティグ・デバイスを使用す
る必要がなく、数字、アルファベット、片仮名（平仮
名）およびこれらの組合せ等を声を出して読むだけであ
るため、楽しく、早く、しかも自然である。Since speech recognition is limited to numbers, alphabets, katakana (hiragana), and combinations thereof, not only are many data such as particles, conversions, and phrases unnecessary, but also small, and the recognition rate can be improved. it can. In addition, since a small portable information processing device is often used only by a specific individual, the speech recognition rate increases due to a learning effect. The present invention is fun, fast and natural, since it does not require the use of a conventional pointing device and only reads aloud numbers, alphabets, katakana (hiragana) and combinations thereof.

【００２５】（第２発明）第２発明は、プログラムの内
容にしたがって表示されている表示画面上で、感応領域
が文字入力領域となっている点で第１発明と異なってい
る。本発明は、文字入力領域の近傍に記載された数字、
アルファベット、片仮名（平仮名）およびこれらの組合
せ等を声を出して呼びあげると、文字入力領域内にカー
ソルが入る。本発明は、ポインティグ・デバイスによる
カーソル位置を探したり、移動する時間と手間が不要で
あり、瞬時に文字を入力することができる。(Second invention) The second invention is different from the first invention in that the sensitive area is a character input area on the display screen displayed according to the contents of the program. The present invention provides a number described near a character input area,
When the alphabet, katakana (hiragana), a combination thereof, and the like are called out aloud, the cursor enters the character input area. According to the present invention, it is not necessary to search for a cursor position by a pointing device or to move and move the cursor, and characters can be input instantaneously.

【００２６】（第３発明）第３発明は、感応領域が表示
画面上の名称および／またはコマンドである点で、第１
発明および第２発明と異なっている。本発明は、表示画
面上に記載されている全ての感応文字（クリックすると
当該部分のプログラムが開く）を名称および／またはコ
マンドとして声を出して呼び（読み）あげることによっ
て、ポインティグ・デバイスのクリックと同じ動作を行
う。名称および／またはコマンドは、普通の言葉と違
い、数が制限されること、および特定の呼び方を予め決
めて置くことによって、音声認識率を向上させることが
できると共に、データを少なくすることができる。(Third invention) The third invention is the first invention in that the sensitive area is a name and / or a command on a display screen.
Different from the invention and the second invention. According to the present invention, a pointing device can be clicked by calling out (reading) all the sensitive characters described on the display screen (clicking to open the program of the corresponding portion) as names and / or commands. Performs the same operation as. Names and / or commands, unlike ordinary words, have a limited number and, by pre-determining a specific name, can improve speech recognition rate and reduce data. it can.

【００２７】（第４発明）第４発明は、表示画面上にプ
ログラムの内容にしたがって表示されると共に、前記表
示されている感応領域に数字、アルファベット、片仮名
（平仮名）およびこれらの組合せが付けられている。オ
ペレータは、前記表示画面上の感応領域に付けられてい
る数字、アルファベット、片仮名（平仮名）およびこれ
らの組合せを声を出してマイクロホンに向かって呼びあ
げる。前記マイクロホンから入力された音声のアナログ
信号は、Ａ／Ｄ変換器によってデジタル信号に変換され
る。(Fourth Invention) According to a fourth invention, a display screen is displayed according to the contents of a program, and a number, an alphabet, katakana (hiragana) and a combination thereof are added to the displayed sensitive area. ing. The operator calls out the numbers, alphabets, katakana (hiragana), and combinations thereof attached to the sensitive area on the display screen aloud to the microphone. An analog audio signal input from the microphone is converted into a digital signal by an A / D converter.

【００２８】音声認識手段は、前記Ａ／Ｄ変換器の出力
を、たとえば、周波数分析、ベクトル量子化、尤度計算
等を行い、予め単語の特徴が記憶されている単語ＨＭＭ
のデータを基にして出現確率の最大のものを認識結果と
して出力する。そして、前記音声認識手段の認識によっ
て、認識された前記音声の数字、アルファベット、片仮
名（平仮名）およびこれらの組合せに対応する前記感応
領域のプログラムを起動させる。The speech recognition means performs, for example, frequency analysis, vector quantization, likelihood calculation, etc. on the output of the A / D converter, and outputs a word HMM in which the features of the word are stored in advance.
The data having the largest appearance probability is output as a recognition result based on the data of Then, by the recognition of the voice recognition means, the program of the sensitive area corresponding to the recognized number, alphabet, katakana (hiragana) and the combination of the voice is activated.

【００２９】前記尤度計算手段と数字、アルファベット
等ＨＭＭは、省略することができる。この場合、音を認
識する符号帳に音の特徴をラベル列として記憶してお
き、周波数分析器により分析されたスペクトルをベクト
ル量子化手段でベクトル量子化し、前記記憶されている
ラベル列と比較して音を認識する。本発明は、単語を認
識するのではなく、数字、アルファベット、片仮名（平
仮名）およびこれらの組合せ等の音を認識すれば良く、
音声認識装置が簡単になる。The likelihood calculating means and HMMs such as numerals and alphabets can be omitted. In this case, the characteristics of the sound are stored in the codebook for recognizing the sound as a label sequence, the spectrum analyzed by the frequency analyzer is vector-quantized by the vector quantization means, and the spectrum is compared with the stored label sequence. To recognize the sound. In the present invention, instead of recognizing words, it is sufficient to recognize sounds such as numbers, alphabets, katakana (hiragana), and combinations thereof.
The speech recognition device is simplified.

【００３０】（第５発明）第５発明は、感応領域に名称
および／またはコマンドが記載されている点で、第４発
明と異なっている。本発明は、表示画面上に記載されて
いる名称および／またはコマンド、たとえば、「次」、
「ホーム」等に対して、声を出して呼びあげることによ
って、ポインティグ・デバイスのクリックと同じ動作を
行う。コマンドは、数が少なく、呼び方を省略すること
も可能であり、音声認識率を向上させることができると
共に、データを少なくすることができる。(Fifth invention) The fifth invention is different from the fourth invention in that a name and / or a command is described in the sensitive area. The present invention relates to names and / or commands described on a display screen, for example, "next",
By calling out aloud to "Home" or the like, the same operation as clicking the pointing device is performed. The number of commands is small, and it is possible to abbreviate the calling method, so that the voice recognition rate can be improved and the data can be reduced.

【００３１】（第６発明）第６発明は、プログラムの内
容にしたがって表示されている表示画面上で、感応領域
が文字入力領域となっている点で第４発明および第５発
明と異なっている。本発明は、文字入力領域の近傍に記
載された数字、アルファベット、片仮名（平仮名）およ
びこれらの組合せ等を声を出して呼びあげると、その文
字入力領域内にカーソルが入る。本発明は、瞬時に文字
を入力することができるため、ポインティグ・デバイス
によるカーソル位置を探したり、移動する時間と手間、
およびポインティグ・デバイスからキーボードに片手を
移動する必要がなく、情報処理装置の操作が早くなる。(Sixth invention) The sixth invention is different from the fourth invention and the fifth invention in that the sensitive area is a character input area on the display screen displayed according to the contents of the program. . According to the present invention, when a number, alphabet, katakana (hiragana), a combination thereof, or the like written near the character input area is called out aloud, a cursor is placed in the character input area. According to the present invention, since characters can be input instantaneously, it is necessary to search for the position of the cursor by the pointing device, to move and to move,
In addition, it is not necessary to move one hand from the pointing device to the keyboard, and the operation of the information processing device is quickened.

【００３２】（第７発明）第７発明は、限られた数字、
アルファベット、片仮名（平仮名）およびこれらの組合
せ、あるいは名称および／またはコマンドだけであるた
め、音声認識のためのソフトウエアが簡単で、音声認識
ボード、音声認識チップという部品を既存の情報処理装
置に組み込むだけで済む。(Seventh invention) The seventh invention is characterized by a limited number,
Since only alphabets, katakana (hiragana) and combinations thereof, or names and / or commands are used, software for voice recognition is simple, and components such as a voice recognition board and a voice recognition chip are incorporated into an existing information processing device. It only needs to.

【００３３】（第８発明）第８発明は、感応領域に記載
する数字、アルファベット、片仮名（平仮名）およびこ
れらの組合せ、あるいは名称および／またはコマンド
に、特定のフォントを使用したり、あるいは形状、色、
網かけ等の修飾により、他の文字と異ならしめ、感応領
域を表していることが一目で判るようにしている。(Eighth invention) An eighth invention is to use a specific font, a shape, or a number, an alphabet, a katakana (hiragana) and a combination thereof, or a name and / or a command described in the sensitive area. color,
Modifications, such as shading, make it different from other characters so that it can be seen at a glance that it represents the sensitive area.

【００３４】（第９発明）第９発明は、音声による座標
位置を入力または起動する装置をデスクトップ型情報処
理装置、ラップトップ型情報処理装置、モバイル型情報
処理装置、携帯電話のいずれかに適用することができ
る。本発明は、表示画面を見ながら声を出すだけで、カ
ーソルを移動させることができるため、キーボードから
指を離すことがなく、早い操作が可能である。(Ninth Invention) According to a ninth invention, a device for inputting or starting a coordinate position by voice is applied to any of a desktop information processing device, a laptop information processing device, a mobile information processing device, and a mobile phone. can do. According to the present invention, the cursor can be moved only by speaking while watching the display screen, so that quick operation is possible without releasing the finger from the keyboard.

【００３５】[0035]

【実施例】図１は本発明の音声による座標位置を入
力または起動する装置で表示画面の感応領域をクリック
して感応領域の情報を表示させるためのフローチャート
である。図１において、プログラムは、情報を表示画面
に表示するために起動される（ステップ１１）。プログ
ラムの実行により、前記情報は、表示画面に表示される
（ステップ１２）。表示画面における感応領域の情報を
開きたい場合、オペレータは、感応領域に記載されてい
る数字、アルファベット、片仮名（平仮名）およびこれ
らの組合せ、あるいは名称および／またはコマンド等を
声に出して呼びあげる（ステップ１３）。DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS FIG. 1 is a flow chart for displaying information of a sensitive area by clicking a sensitive area on a display screen with the apparatus for inputting or starting a coordinate position by voice according to the present invention. In FIG. 1, the program is started to display information on a display screen (step 11). By executing the program, the information is displayed on the display screen (step 12). When the operator wants to open the information of the sensitive area on the display screen, the operator calls out the numbers, alphabets, katakana (hiragana) and the combination thereof, or the names and / or commands, etc. described in the sensitive area aloud ( Step 13).

【００３６】オペレータの声は、マイクロホンによって
アナログ信号として検出される（ステップ１４）。前記
アナログ信号は、Ａ／Ｄ変換器によってデジタル信号に
変換される（ステップ１５）。前記デジタル信号は、音
声認識ボードまたは音声認識チップに埋め込まれた演算
回路および数字、アルファベット、片仮名（平仮名）お
よびこれらの組合せ、あるいは名称および／または、コ
マンド等の特徴等を記憶したデータ等に基づいて演算さ
れる（ステップ１６）。The voice of the operator is detected by the microphone as an analog signal (step 14). The analog signal is converted into a digital signal by an A / D converter (step 15). The digital signal is based on an arithmetic circuit embedded in a voice recognition board or a voice recognition chip, data including numbers, alphabets, katakana (hiragana) and a combination thereof, or data storing characteristics such as names and / or commands. Is calculated (step 16).

【００３７】数字、アルファベット、片仮名（平仮名）
およびこれらの組合せ、あるいは名称および／または、
コマンド等の音声を認識した音声認識ボードまたは音声
認識チップの出力は、その感応領域をマウスでクリック
するのと同じになる（ステップ１７）。そして、クリッ
クされた感応領域のプログラムが開かれて、その情報が
表示画面に表示される（ステップ１８）。Numbers, alphabets, katakana (hiragana)
And their combinations, or names and / or
The output of the voice recognition board or the voice recognition chip which has recognized the voice such as the command becomes the same as clicking the sensitive area with the mouse (step 17). Then, the program of the clicked sensitive area is opened, and the information is displayed on the display screen (step 18).

【００３８】図２は本発明の音声認識技術を説明するた
めのブロック構成図である。図２において、音声は、マ
イクロホン２１によって入力された後、Ａ／Ｄ変換器２
２によってデジタル信号（ＰＣＭ）に変換される。前記
デジタル信号は、たとえば、１秒間に１１０２５個の値
をサンプリングする。次に、前記デジタル信号は、周波
数分析器２３によって分析された結果、周波数スペクト
ルを表す値が得られる。FIG. 2 is a block diagram for explaining the speech recognition technique of the present invention. In FIG. 2, after the voice is input by the microphone 21, the A / D converter 2
2 to a digital signal (PCM). The digital signal samples, for example, 11025 values per second. Next, the digital signal is analyzed by the frequency analyzer 23 to obtain a value representing a frequency spectrum.

【００３９】前記周波数スペクトルは、ベクトル量子化
手段２５によって、量子化され、１００種類程度のグル
ープに分類され、符号帳２４に記憶されたラベルと照合
することにより、前記周波数分析を行う毎に音声の特徴
が抽出されたラベル列ができる。また、数字、アルファ
ベット程度の認識は、音を認識するだけで済むため、前
記符号帳２４の記憶容量を少なくして、非常に簡単な音
声認識ボードまたは音声認識チップで構成できる。The frequency spectrum is quantized by the vector quantization means 25, classified into about 100 types of groups, and collated with a label stored in the codebook 24, so that each time the frequency analysis is carried out, A label string is created in which the features of are extracted. In addition, since recognition of numbers and alphabets can be performed only by recognizing sounds, the storage capacity of the code book 24 can be reduced and a very simple voice recognition board or voice recognition chip can be used.

【００４０】さらに、前記ラベル列は、数字、アルファ
ベット、片仮名（平仮名）およびこれらの組合せ、ある
いは名称および／またはコマンド等ＨＭＭ（Ｈｉｄｄｅ
ｎＭａｒｋｏｖＭｏｄｅｌ）２６と呼ばれる確率的な
モデルを使用して尤度計算手段２７により尤度を求め
る。Furthermore, the label string may be a number, alphabet, katakana (hiragana) and a combination thereof, or a name and / or a command such as HMM (Hide).
The likelihood is calculated by the likelihood calculating means 27 using a probabilistic model called “nMarkov Model” 26.

【００４１】本発明は、数字、アルファベット、片仮名
（平仮名）およびこれらの組合せ、あるいは名称および
／またはコマンド等限られた単語のみを認識できるよう
にするだけなので、数字、アルファベット、片仮名（平
仮名）およびこれらの組合せ、あるいは名称および／ま
たはコマンド等ＨＭＭ２６および前記周波数分析器２
３、ベクトル量子化手段２５、尤度計算手段２７を音声
認識ボード、あるいは音声認識チップ内に組み込むこと
が可能になった。The present invention only recognizes numbers, alphabets, katakana (hiragana) and combinations thereof, or limited words such as names and / or commands. The HMM 26 and the frequency analyzer 2 such as a combination of these or names and / or commands
3. The vector quantization means 25 and the likelihood calculation means 27 can be incorporated in a speech recognition board or a speech recognition chip.

【００４２】そして、前記尤度計算手段２７によって計
算された尤度は、最大になった数字、アルファベット、
片仮名（平仮名）およびこれらの組合せ、あるいは名称
および／またはコマンド等が認識手段２８から認識結果
として出力される。この出力は、プログラムの感応領域
でクリックされた場合と同等の情報を有し、同様の操作
を行うことができる。The likelihood calculated by the likelihood calculating means 27 is the maximum number, alphabet,
Katakana (hiragana) and combinations thereof, or names and / or commands, etc. are output from the recognition means 28 as recognition results. This output has the same information as when the click is made in the sensitive area of the program, and the same operation can be performed.

【００４３】図３は特許庁のホームページの一部を開い
た状態を示す図である。図３において、感応領域の一部
に数字が付けられている。たとえば、「最近の特許庁」
には「１」が、「制度紹介」には「２」が、「お知ら
せ」には「３」がそれぞれ付けられている。図３におい
て、「最近の特許庁」・・・「１」を開きたい場合、従
来は、「最近の特許庁」の部分に網がかけられて領域に
カーソルを移動させた後、マウスのボタンをクリックす
る。本実施例は、マウスを移動させることなく、声を出
して「１」と言うだけで、「最近の特許庁」が開く。FIG. 3 is a diagram showing a state where a part of the JPO homepage is opened. In FIG. 3, a number is given to a part of the sensitive area. For example, "Recent Patent Office"
, "2" for "system introduction", and "3" for "news". In FIG. 3, when it is desired to open “Recent Patent Office”... “1”, conventionally, a portion of “Recent Patent Office” is shaded, the cursor is moved to an area, and then a mouse button is pressed. Click. In the present embodiment, the "latest patent office" is opened simply by saying "1" without moving the mouse.

【００４４】図４は特許庁のホームページの一部で、
「最近の特許庁」・・・「１」が開かれた状態を示す図
である。図４において、「最近の特許庁」・・・「１」
が開かれている。「最近の特許庁」は、さらに、感応領
域があり、「長官からのメッセージ」・・・「Ａ」、
「プレス発表」・・・「Ｂ」、「特許行政の動き」・・
・「Ｃ」がある。FIG. 4 shows a part of the JPO homepage.
It is a figure which shows the state where "Recent Patent Office" ... "1" was opened. In FIG. 4, "Recent Patent Office" ... "1"
Is open. The “Recent Patent Office” has a further sensitive area, “Message from the Secretary” ...
"Press announcement" ... "B", "Patent administration movement" ...
・ There is "C".

【００４５】図５は特許庁のホームページの一部で、
「制度紹介」・・・「２」が開かれた状態を示す図であ
る。図５において、「制度紹介」・・・「２」が開かれ
ている。「制度紹介」・・・「２」は、さらに、感応領
域があり、「制度概要」・・・「Ａ」、「注目特許」・
・・「Ｂ」、「権利を取るためには」・・・「Ｃ」、
「権利を巡るトラブル」・・・「Ｄ」、「特許庁の紹
介」・・・「Ｅ」、「よくある質問」・・・「Ｆ」、
「問い合わせ先」・・・「Ｇ」がある。FIG. 5 shows a part of the JPO homepage.
It is a figure which shows the state where "system introduction" ... "2" was opened. In FIG. 5, “system introduction”... “2” is open. "Introduction of system" ... "2" has a further sensitive area, "Overview of system" ... "A", "Patent attention"
・・ "B", "To get the right" ... "C",
"Troubles over rights" ... "D", "Introduction to the Patent Office" ... "E", "FAQ" ... "F",
"Contact" ... "G".

【００４６】図６は特許庁のホームページの一部で、
「お知らせ」・・・「３」が開かれた状態を示す図であ
る。図６において、「お知らせ」・・・「３」が開かれ
ている。「お知らせ」・・・「３」は、さらに、感応領
域があり、「制度・運用改正」・・・「Ａ」、「出願手
続ニュース」・・・「Ｂ」、「審査情報」・・・
「Ｃ」、「審判情報」・・・「Ｄ」、「物品・役務調達
情報」・・・「Ｅ」、「職員採用案内」・・・「Ｆ」、
「説明会・セミナー・シンポジウム」・・・「Ｇ」、
「弁理士試験情報」・・・「Ｈ」、「その他」・・・
「Ｉ」がある。FIG. 6 shows a part of the JPO homepage.
It is a figure which shows the state which "notice" ... "3" was opened. In FIG. 6, “Notification”... “3” is open. "Notice" ... "3" has a further sensitive area, "System and operation revision" ... "A", "Application news" ... "B", "Examination information" ...
"C", "Judgment information" ... "D", "Article / service procurement information" ... "E", "Employee recruitment information" ... "F",
"Information Session / Seminar / Symposium" ... "G",
"Patent Attorney Test Information" ... "H", "Other" ...
There is an "I".

【００４７】図３における「ＩＮＤＥＸ」には、「１」
から「３」までの数字が付けられているが、「長官から
のメッセージ」・・・「Ａ」、「プレス発表」・・・
「Ｂ」、「特許行政の動き」・・・「Ｃ」に対してもア
ルファベットを付けておき、一度アルファベットを声を
出して呼びあげるだけで、図４を経ることなく、一度に
開けることも可能である。"INDEX" in FIG. 3 indicates "1".
Numbers from "to" 3 are given, but "Message from the Secretary" ... "A", "Press announcement" ...
"B", "Patent Administration Movement" ... Also add an alphabet to "C" and call it out aloud once, and open it all at once without going through Figure 4. It is possible.

【００４８】図３ないし図６において、感応領域を開く
際に、数字とアルファベットを付しておき、これらを声
を出して呼びあげることにより実行したが、前記以外
に、片仮名（平仮名）あるいは簡単な名称および／また
はコマンドにすることもできる。また、前記名称および
／またはコマンドは、表示画面に出ている「ファイ
ル」、「編集」、「表示」、「移動」、「ホーム」、
「次」、「左」、「右」、「上」、「下」等の簡単な言
葉を音声認識ボード等に登録しておくことができる。In FIG. 3 to FIG. 6, when the sensitive area is opened, numbers and alphabets are added and these are called out aloud, but in addition to the above, katakana (hiragana) or simple Names and / or commands. In addition, the names and / or commands are “file”, “edit”, “display”, “move”, “home”,
Simple words such as "next", "left", "right", "up", and "down" can be registered in a voice recognition board or the like.

【００４９】本実施例は、通常のデスクトップ型情報処
理装置について説明したが、ラップトップ型情報処理装
置、モバイル型情報処理装置、あるいはｉモード付き携
帯電話に適用することができる。特に、ｉモード付き携
帯電話の表示装置は、小型であるため、従来のポインテ
ィグ・デバイスを使用することが困難である。しかし、
本発明の表示画面上の座標位置を入力または起動する方
法または表示画面上の座標位置を入力または起動する装
置を使用すれば、音声のみで所望の場所をクリックする
ことができる。Although the present embodiment has been described with respect to a normal desktop information processing apparatus, it can be applied to a laptop information processing apparatus, a mobile information processing apparatus, or a mobile phone with an i-mode. In particular, since the display device of the i-mode mobile phone is small, it is difficult to use a conventional pointing device. But,
By using the method for inputting or activating the coordinate position on the display screen or the apparatus for inputting or activating the coordinate position on the display screen according to the present invention, a desired place can be clicked only by voice.

【００５０】また、ｉモード付き携帯電話は、他人が使
用することは稀であるため、音声認識ボード等が学習効
果により認識率を高め、使用勝手が良くなる。Since the i-mode mobile phone is rarely used by others, a voice recognition board or the like enhances the recognition rate by a learning effect, thereby improving usability.

【００５１】本発明の他の実施例は、表示画面に文字入
力領域がある場合、マウス等によってカーソルを移動さ
せるのではなく、文字入力領域付近に数字、アルファベ
ット、片仮名（平仮名）等を付けておき、これらを声に
出して呼みあげるだけで、カーソルが瞬時に所望の文字
入力領域に移動する。In another embodiment of the present invention, when a character input area is present on the display screen, a numeral, alphabet, katakana (hiragana), etc. are attached near the character input area instead of moving the cursor with a mouse or the like. Just by calling them up aloud, the cursor instantly moves to the desired character input area.

【００５２】以上、本実施例を詳述したが、本発明は、
前記実施例に限定されるものではない。そして、特許請
求の範囲に記載された本発明を逸脱することがなけれ
ば、種々の設計変更を行なうことが可能である。本発明
は、公知の音声認識技術を利用して、ポインティグ・デ
バイスと組み合わせたものである。As described above, the present embodiment has been described in detail.
It is not limited to the above embodiment. Various design changes can be made without departing from the present invention described in the appended claims. The present invention utilizes a well-known speech recognition technique and combines it with a pointing device.

【００５３】[0053]

【発明の効果】本発明によれば、音声認識が数字、アル
ファベット、片仮名（平仮名）およびこれらの組合せ、
あるいは名称および／またはコマンド等のみであるた
め、助詞、助動詞、変換、文節、接頭語等多くの組み合
わせからなるデータが不要であるだけでなく、小型のボ
ードあるいはチップに搭載でき、かつ認識率を向上させ
ることができる。According to the present invention, voice recognition can be performed for numbers, alphabets, katakana (hiragana), and combinations thereof,
Alternatively, since it is only a name and / or a command, not only is data composed of many combinations of particles, auxiliary verbs, conversions, phrases, prefixes, etc. unnecessary, it can be mounted on a small board or chip and the recognition rate can be reduced. Can be improved.

【００５４】本発明によれば、パーソナル機器に適用す
ると、音声認識ボード等が持主の声を学習し高い認識率
となる。本発明によれば、従来のポインティグ・デバイ
スを使用する必要がなく、数字、アルファベット、片仮
名（平仮名）およびこれらの組合せ、あるいは名称およ
び／またはコマンド等を声を出して読むだけであるた
め、楽しく、早く、しかも自然である。According to the present invention, when applied to a personal device, a voice recognition board or the like learns the owner's voice and has a high recognition rate. According to the present invention, it is not necessary to use a conventional pointing device, and it is only necessary to read aloud a number, an alphabet, katakana (hiragana) and a combination thereof, or a name and / or a command. Fast and natural.

【００５５】本発明によれば、従来のポインティグ・デ
バイスを移動する必要がなく、声を出すだけなので、瞬
時にクリックが可能である。すなわち、本発明は、表示
画面上における座標位置の入力または起動が簡単にでき
るようになる。According to the present invention, it is not necessary to move the conventional pointing device, but only to make a voice, so that an instant click is possible. That is, according to the present invention, the input or activation of the coordinate position on the display screen can be easily performed.

【００５６】本発明によれば、モバイル型情報処理装置
あるいは携帯電話は、マイクロホンを予め備えているた
め、数字、アルファベット、片仮名（平仮名）およびこ
れらの組合せ、あるいは名称および／またはコマンドを
認識する非常に簡単な音声認識ボード等を取り付けるだ
けで、安価な表示画面上の座標位置を入力または起動す
る装置を提供することができる。According to the present invention, since the mobile information processing apparatus or the mobile phone is provided with the microphone in advance, it is possible to recognize numbers, alphabets, katakana (hiragana) and combinations thereof, or names and / or commands. It is possible to provide an inexpensive device for inputting or starting a coordinate position on a display screen simply by attaching a simple voice recognition board or the like to the device.

【００５７】本発明によれば、感応領域に付けられてい
る数字、アルファベット、片仮名（平仮名）およびこれ
らの組合せ、あるいは名称および／またはコマンドのフ
ォント、形状、色等を予め決めておいたり、あるいは表
示画面の一部に列挙しておくと、他の文章と明らかに見
分けが付き、感応領域を認識し易く、かつ瞬時にクリッ
クが可能である。According to the present invention, numerals, alphabets, katakana (hiragana) and combinations thereof, or fonts, shapes, colors and the like of names and / or commands assigned to the sensitive area are determined in advance, or By listing them in a part of the display screen, they can be clearly distinguished from other sentences, the sensitive area can be easily recognized, and clicks can be made instantaneously.

【００５８】本発明によれば、音声認識が数字、アルフ
ァベット等限られた音に限定すると、音声認識に必要で
あった周波数分析器、ベクトル量子化手段、尤度計算手
段等が簡易なものとなるだけでなく、単語ＨＭＭを省略
することも可能である。According to the present invention, if speech recognition is limited to limited sounds such as numbers and alphabets, the frequency analyzer, vector quantization means, likelihood calculation means, etc. required for speech recognition can be simplified. Not only that, the word HMM can be omitted.

【００５９】本発明によれば、パーソナルコンピュー
タ、携帯電話等を特定の個人のみが利用する機器に適用
すると、近くにマイクロホンがあるという条件と、音声
認識ボード等の学習効果が発揮され、音声認識率が非常
に高くなる。According to the present invention, when a personal computer, a mobile phone, or the like is applied to a device used only by a specific individual, the condition that a microphone is nearby and the learning effect of a voice recognition board and the like are exhibited, and the voice recognition is performed. The rate will be very high.

[Brief description of the drawings]

【図１】本発明の音声による座標位置を入力または起動
する装置で表示画面の感応領域をクリックして感応領域
の情報を表示させるためのフローチャートである。FIG. 1 is a flowchart for displaying information on a sensitive area by clicking a sensitive area on a display screen with the apparatus for inputting or starting a coordinate position by voice according to the present invention.

【図２】本発明の音声認識技術を説明するためのブロッ
ク構成図である。FIG. 2 is a block diagram illustrating a speech recognition technique according to the present invention.

【図３】特許庁のホームページの一部を開いた状態を示
す図である。図３において、感応領域の一部に数字が付
けられている。FIG. 3 is a diagram showing a state in which a part of the JPO homepage is opened. In FIG. 3, a number is given to a part of the sensitive area.

【図４】特許庁のホームページの一部で、「最近の特許
庁」・・・「１」が開かれた状態を示す図である。FIG. 4 is a view showing a state in which “Recent Patent Office”.

【図５】特許庁のホームページの一部で、「制度紹介」
・・・「２」が開かれた状態を示す図である。[Figure 5] A part of the JPO's website, "Introduction to the system"
.. Is a view showing a state where “2” is opened.

【図６】特許庁のホームページの一部で、「お知らせ」
・・・「３」が開かれた状態を示す図である。[Fig. 6] A part of the JPO home page, "Notice"
.. Is a view showing a state where “3” is opened.

【図７】従来のポインティグ・デバイスで表示画面の感
応領域をクリックして感応領域の情報を表示させるため
のフローチャートである。FIG. 7 is a flowchart for displaying information on a sensitive area by clicking a sensitive area on a display screen using a conventional pointing device.

【図８】従来の音声認識技術を説明するためのブロック
構成図である。FIG. 8 is a block diagram illustrating a conventional speech recognition technique.

[Explanation of symbols]

２１・・・マイクロホン２２・・・Ａ／Ｄ変換器２３・・・周波数分析器２４・・・符号帳２５・・・ベクトル量子化手段２６・・・アルファベット、数字等ＨＭＭ２７・・・尤度計算手段２８・・・認識手段２９・・・認識結果によりプログラム起動手段 DESCRIPTION OF SYMBOLS 21 ... Microphone 22 ... A / D converter 23 ... Frequency analyzer 24 ... Codebook 25 ... Vector quantization means 26 ... HMM such as alphabets and numerals 27 ... Likelihood Calculation means 28 ... Recognition means 29 ... Program starting means based on recognition results

Claims

[Claims]

1. On a display screen displayed in accordance with the contents of a program, numbers, alphabets, katakana (hiragana), and a combination thereof attached to an area which is a sensitive area are voiced. And a method of inputting or starting a coordinate position on a display screen, wherein a program of the sensitive area is started by recognizing the numbers, alphabets, katakana (hiragana), and combinations thereof.

2. On a display screen displayed in accordance with the contents of a program, numbers, alphabets, katakana (hiragana), and a combination thereof are assigned by voice to a character input area. A method of inputting or starting a coordinate position on a display screen characterized by recognizing the numbers, alphabets, katakana (hiragana), and combinations thereof, of the voice and moving a cursor to the character input area.

3. The method according to claim 1, wherein the voice is a reading of a name and / or a reading of a command.

4. A display screen which is displayed in accordance with the contents of a program and in which a number, an alphabet, katakana (hiragana) and a combination thereof are attached to the displayed sensitive area, Numbers, alphabets,
A microphone in which katakana (hiragana) and a combination thereof are input as voice, and an A / A for converting a signal of the microphone into a digital signal
A D converter; voice recognition means for recognizing the output of the A / D converter; and an output of the voice recognition means corresponding to the number, alphabet, katakana (hiragana) and the combination of the recognized voice. A device for inputting or starting a coordinate position on a display screen, wherein the device activates a program in the sensitive area.

5. A display according to the contents of a program, and a name and / or
Or a display screen on which a command is described, and reading and / or reading a name described in the sensitive area.
Or a microphone in which a command is input as voice, and an A / A for converting a signal of the microphone into a digital signal.
A D converter; voice recognition means for voice recognition of an output of the A / D converter; and a program corresponding to the recognized name and / or command is activated by an output of the voice recognition means. A device for inputting or starting a coordinate position on the display screen to be displayed.

6. A display screen in which numbers, alphabets, katakana (hiragana) and a combination thereof are described in a character input area on a display screen displayed according to the contents of the program; And a microphone for inputting a number, an alphabet, katakana (hiragana) and a combination thereof as a voice, and an A / A for converting a signal of the microphone into a digital signal.
A D converter; voice recognition means for recognizing the output of the A / D converter; and a position corresponding to the number, alphabet, katakana (hiragana) and a combination thereof recognized by the output of the voice recognition means. A device for inputting or starting a coordinate position on a display screen, characterized in that a cursor is moved to a position.

7. The input or activation of a coordinate position on a display screen according to claim 4, wherein the voice recognition means is incorporated in a board or a chip. Equipment to do.

8. A non-sensitive number, alphabet, katakana (hiragana), a combination thereof, a name, a command, and the like described in the sensitive area may be modified by a specific font, shape, color, shading, or the like. The apparatus for inputting or starting a coordinate position on a display screen according to any one of claims 4 to 6, wherein the apparatus is distinguished from an area.

9. The information processing apparatus according to claim 4, wherein the information processing apparatus is provided in one of a desktop information processing apparatus, a laptop information processing apparatus, a mobile information processing apparatus, and a mobile phone. A device for inputting or starting a coordinate position on the display screen described in the section.