DE19636452A1

DE19636452A1 - Multiple user speech input system

Info

Publication number: DE19636452A1
Application number: DE1996136452
Authority: DE
Inventors: Dietmar Kirsten; Dietmar Koehler
Original assignee: Altenburger Industrienaehmaschinen GmbH
Current assignee: KOEHLER, DIETMAR, DIPL.-ING, 04600 ALTENBURG, DE K
Priority date: 1996-09-07
Filing date: 1996-09-07
Publication date: 1998-03-12

Abstract

The speech recognition system (1) accepts spoken inputs from several users. The process begins with entry of a spoken identification word, which the recognizer (1) compares with the stored identification words (2). A match gives access to one of several user-specific speech pattern vocabularies (3) held in memory. The number of words in each vocabulary, and their associated meanings, can be selected by the user. An optical display (5) can be provided with the system.

Description

A speech input system is described that uses a speaker-dependent speech recognition system to recognize voice commands with high reliability. Strong speaker dependence, achieved by exactly comparing the spoken word with the pattern previously produced by the speaker and stored in the speech pattern vocabulary, has a positive effect on recognition reliability. Ambient noise, the speech of third parties near the speaker, and words spoken by the speaker that are not stored in the speech pattern vocabulary are largely ignored by the speech recognition system. These properties are particularly important when the speech recognition system is used as an input system at the human-machine interface.

A decisive disadvantage is that the system can be used by only one person.

To circumvent this disadvantage, it is known (e.g. DE 38 03 220 A1) to store speech patterns of different speakers for each word to be recognized. A decisive disadvantage of this approach is that the size of the speech pattern vocabulary is multiplied by the number of potential users. As a result, recognition reliability decreases to the same extent.

To create a multi-user system without the disadvantage of reduced recognition reliability, according to the invention a separate speech pattern vocabulary is stored for each potential user. During operation, only the speech pattern vocabulary of the respective user is then active.
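The size argument can be made concrete with a small calculation. The following is a minimal sketch only: the function names and the example figures (4 users, 20 command words each) are illustrative assumptions and do not come from the application.

```python
from typing import Tuple


def pooled_active_patterns(num_users: int, words_per_user: int) -> int:
    """Known approach: every user's pattern for every word is active at once."""
    return num_users * words_per_user


def per_user_active_patterns(num_users: int, words_per_user: int) -> Tuple[int, int]:
    """Approach described here.

    Returns (patterns active during identification,
             patterns active during operation).
    """
    return num_users, words_per_user


# Assumed example figures: 4 users, 20 command words each.
print(pooled_active_patterns(4, 20))    # 80
print(per_user_active_patterns(4, 20))  # (4, 20)
```

Under these assumed figures, the recognizer never has to discriminate among more than 20 patterns at a time, instead of 80.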

According to the invention, the user's own speech pattern vocabulary is assigned to the respective user by means of an identification word that the user speaks at the start. The identification words of all potential users are held in a common speech pattern vocabulary, which is always the first to be active at the start and remains active until an identification word is recognized and the corresponding user-specific speech pattern vocabulary is thereby activated. To inform the user that the vocabulary change has taken place, according to the invention a greeting text is output via a voice output unit. The respective operating state is additionally signaled optically.
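The two-stage sequence described above can be sketched as a small state machine. This is an illustration under stated assumptions, not the implementation of the application: plain strings stand in for acoustic speech patterns, and the class name `MultiUserRecognizer`, the method `hear`, and the `messages` list (a stand-in for the voice output unit (4)) are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class MultiUserRecognizer:
    # Common vocabulary (2): identification word -> user name.
    identification_words: Dict[str, str]
    # User-specific vocabularies (3): user name -> command words.
    user_vocabularies: Dict[str, Set[str]]
    active_user: Optional[str] = None
    # Hypothetical stand-in for the voice output unit (4).
    messages: List[str] = field(default_factory=list)

    def hear(self, word: str) -> Optional[str]:
        """Process one spoken word; return the accepted command, if any."""
        if self.active_user is None:
            # Only the common identification vocabulary is active.
            user = self.identification_words.get(word)
            if user is not None:
                self.active_user = user
                self.messages.append(f"Welcome, {user}")
            return None
        # Only the active user's own vocabulary is consulted.
        if word in self.user_vocabularies.get(self.active_user, set()):
            return word
        return None  # Words outside the active vocabulary are ignored.
```

For example, with `MultiUserRecognizer({"alpha": "setter"}, {"setter": {"start"}})`, the word "start" is ignored until "alpha" has been spoken, after which it is accepted as a command.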

Forming the user-specific speech pattern vocabularies, according to the invention, from freely selectable words that can trigger their own actions according to their meaning provides, in a simple manner, a powerful tool for structuring and interlocking in the voice-controlled operation of machines and equipment. For example, by structuring the user-specific vocabularies, it is possible to create different operating platforms for machine setters and machine operators.
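The structuring and interlocking effect can be illustrated with a simple lookup. This is a sketch under assumptions: the user names, the command words, and the action names are all hypothetical examples, and the application does not specify how words are bound to actions.

```python
from typing import Dict, Optional

# Hypothetical user-specific vocabularies: spoken word -> action name.
# Each user chooses their own words; the setter has an extra command.
VOCABULARIES: Dict[str, Dict[str, str]] = {
    "setter": {
        "start": "run_machine",
        "stop": "halt_machine",
        "calibrate": "enter_setup_mode",
    },
    "operator": {
        "go": "run_machine",
        "stop": "halt_machine",
    },
}


def resolve_action(user: str, word: str) -> Optional[str]:
    """Return the action bound to `word` for `user`.

    Returns None when the word is not in that user's vocabulary,
    which means the command is ignored (the interlock).
    """
    return VOCABULARIES.get(user, {}).get(word)
```

Here the operator reaches `run_machine` with a self-chosen word ("go"), while "calibrate" resolves to an action only for the setter, so setup functions stay locked for the operator without any separate permission mechanism.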

List of reference numerals

1 speaker-dependent speech recognizer
2 speech pattern vocabulary of identification words
3 application-specific speech pattern vocabulary
4 voice output unit
5 optical display

Claims

1. Multi-user system for voice input, characterized in that, in a speaker-dependent speech recognition system (1), a separate speech pattern vocabulary (2) common to all users and containing the identification words of the users is provided, and for each user a further speech pattern vocabulary (3) containing the application-specific words is provided.

2. Multi-user system for voice input, characterized in that the application-specific speech pattern vocabulary (3) is assigned to the user by speech recognition of the user-specific identification word.

3. Multi-user system for voice input, characterized in that the number of words of the application-specific speech pattern vocabulary (3) and their meaning are freely selectable for each user.

4. Multi-user system for voice input, characterized in that a voice output unit (4) and an optical display (5), or only one of the two, is present for indicating the respectively active speech pattern vocabulary (3).