DE10017503A1

DE10017503A1 - Speech recognition method in wireless communication terminal, involves recognizing words held on voice server, and digitally transferring recognized results over Internet

Info

Publication number: DE10017503A1
Application number: DE2000117503
Authority: DE
Inventors: Manuel Duque-Anton
Original assignee: DUQUE ANTON MANUEL
Current assignee: comlet Verteilte Systeme GmbH
Priority date: 2000-04-07
Filing date: 2000-04-07
Publication date: 2001-10-25

Abstract

A voice client and voice server are respectively installed at a wireless communication terminal and a WAP server. Words held on the voice server, are recognized by the communication terminal. Recognized results are digitally transferred over the Internet, and corresponding data transfer information is provided to the terminal.

Description

Field of the Invention

Die vorliegende Erfindung bezieht sich auf ein drahtloses Kommunikationssy stem und insbesondere auf ein Verfahren, welches die sprachgesteuerte Kommu nikation zwischen dem mobilen, drahtlosen Kommunikationsendgerät und dem Internet vereinfachen kann.The present invention relates to a wireless communication system stem and in particular on a method that the voice-controlled commu nication between the mobile, wireless communication terminal and the Internet can simplify.

State of the art

Aktuelle Mobilfunknetze werden in der Regel nur zur Übertragung von Sprach daten verwendet. Wie in Abb. 1 gezeigt ist, umfaßt das drahtlose Kommu nikationssystem drahtlose Kommunikationsendgeräte 100, Basistationen 102, eine Basisstationssteuerung 104 und ein Vermittlungssystem 106.Current mobile networks are generally only used for the transmission of voice data. As shown in FIG. 1, the wireless communication system comprises wireless communication terminals 100 , base stations 102 , a base station controller 104 and a switching system 106 .

Jede der Basisstationen 102 verwaltet drahtlose Kommunikationsendgeräte in nerhalb eines vorbestimmten Gebietes.Each of the base stations 102 manages wireless communication terminals within a predetermined area.

Wenn sich ein drahtloses Kommunikationsendgerät 100 von einem Gebiet zu ei nem anderen bewegt, wählt die Basisstationssteuerung 104 eine passende Basis tation 102 für das drahtlose Kommunikationsendgerät aus.When a wireless communication terminal 100 moves from one area to another, the base station controller 104 selects an appropriate base station 102 for the wireless communication terminal.

Das Vermittlungssystem 106, das Verbindungen zwischen einem drahtlosen Kommunikationsendgerät 100 und einem anderen drahtgebundenen oder drahtlo sen Kommunikationsendgerät herstellt, bildet einen Teil eines öffentlichen Tele fonnetzes. Dieses Telefonnetz errichtet Verbindungen zwischen drahtlosen Kommunikationsendgeräten, zwischen drahtgebunden und drahtlosen Kommu nikationsendgeräten und zwischen Engeräten mit drahtgebundener Übertragung. Um ein drahtloses Kommunikationsendgerät mit Informationen wie beispiels weise HTML-Seiten zu beliefern, wird ein Zwischensystem für die Verbindung benötigt, welches die Informationen aus dem Internet in darstellbare Informatio nen am drahtlosen Kommunikationsendgerät umwandelt. The switching system 106 , which establishes connections between a wireless communication terminal 100 and another wired or wireless communication terminal, forms part of a public telephone network. This telephone network establishes connections between wireless communication terminals, between wired and wireless communication terminals and between terminals with wired transmission. In order to supply a wireless communication terminal with information such as HTML pages, an intermediate system is required for the connection, which converts the information from the Internet into representable information on the wireless communication terminal.

Dazu werden mögliche unterschiedliche Daten und Protokolle zwischen Internet und drahtlosen Kommunikationsendgerät angepaßt. Diese Aufgabe wird aktuell von einem sogenannten WAP-Server (Wireless Application Protocol) realisiert. In Abb. 2 ist diese Architektur gezeigt.Possible different data and protocols between Internet and wireless communication terminal are adapted for this. This task is currently being implemented by a so-called WAP server (Wireless Application Protocol). This architecture is shown in Fig. 2.

Die Darstellung der Eingabe und der Ausgabe am drahtlosen Kommunikations endgerät ist aufgrund der geringen Ausmaße des Kommunikationsendgeräts stark eingeschränkt.The representation of the input and output on wireless communication Terminal is strong due to the small size of the communication terminal limited.

Die Eingabe der Internet-Adressen beispielsweise am drahtlosen Kommunikati onsendgerät erfolgt über die alphanumerischen Zifferntasten am Endgerät. Zum Versenden von elektronischen Briefen müssen ebenfalls die alphanumerischen Eingabetasten verwendet werden. Dieses Verfahren ist sehr mühsam und wenig bedienerfreundlich und stellt eine große Hürde bei der Verbreitung des mobilen Internets dar.Entering the Internet addresses, for example on wireless communication Onsendgerät takes place via the alphanumeric numeric keys on the end device. To the Sending electronic letters must also be alphanumeric Enter keys are used. This process is very tedious and little user-friendly and represents a major hurdle in the spread of the mobile Internet.

Die Ausgabe der Internet-Information ist ebenfalls durch die Größe des Displays am drahtlosen Kommunikationsendgerät erheblich eingeschränkt. Die im WAP- Protokoll vorgeschlagenen Ansätze, die Information in Kartenform darzustellen und zu überlagen reicht nicht aus um komplexe Internet-Seiten geeignet darzu stellen. Die Anzeige eines mittelgroßen elektronischen Briefes erfordert schon eine enorme Geduld des Benutzers.The output of internet information is also due to the size of the display on the wireless communication terminal considerably restricted. The in the WAP Protocol proposed approaches to present the information in card form and overlaying is not enough to make complex internet pages suitable put. The display of a medium-sized electronic letter already requires a tremendous amount of patience from the user.

Summary of the invention

Um das obige Problem optimal zu lösen, besteht die Aufgabe der vorliegenden Erfindung darin, ein Verfahren anzugeben, welches die am drahtlosen Kommu nikationsendgerät vorhandenen Hardware-Komponenten optimal für die Verbin dung mit dem Internet ausnutzt.In order to optimally solve the above problem, there is the task of the present Invention to provide a method that the wireless commu existing hardware components optimal for the connection using the Internet.

Eine Aufgabe der Erfindung besteht darin, die Eingabe von Information, die zur Steuerung im Internet benötigt wird, am drahtlosen Kommunikationsendgerät mit Hilfe von Spracherkennung und Sprachverarbeitung zu unterstützen.An object of the invention is to enter information that is used for Control in the Internet is required on the wireless communication terminal Supporting speech recognition and speech processing.

Typischerweise wird in der Telefonie Spracherkennung am Kommunikations endgerät realisiert und wird vornehmlich zur Erkennung von Rufnummer-Infor mation und Steuerung von Telefonbücher und ähnlichen verwendet. In diesem Fall wird die Spracherkennung durch eine lokale Hard- und Software am Kom munikationsgerät realisiert. Dies verursacht zusätzliche Kosten für das Kommu nikationsendgerät und bietet nur eine eingeschränkte Flexibilität. Unter Umständen müssen Teile des Kommunikationsendgerätes ausgetauscht werden, falls sich die Spracherkennungskomponente ändert.Typically in telephony, speech recognition is used on communications terminal device and is primarily used to recognize phone number information mation and control of phone books and similar used. In this In this case, speech recognition is carried out by local hardware and software on the comm communication device realized. This causes additional costs for the commu nication terminal and offers only limited flexibility. Under Parts of the communication terminal may need to be replaced, if the speech recognition component changes.

Eine weitere Aufgabe der Erfindung besteht darin, ein Client-Server-Modell für die Spracherkennungs-Unterstützung am drahtlosen Kommunikationsendgerät zu realisieren. Dabei wird der vorwiegende Anteil der Spracherkennung im Netz (Spracherkennungs-Server) realisiert, das drahtlose Kommunikationsendgerät (Spracherkennungs-Client) wird nur zur Initialisierung, Aktivierung und Einstel lung der sprachgesteuerten Eingabe verwendet.Another object of the invention is to develop a client-server model for the speech recognition support on the wireless communication terminal to realize. The majority of speech recognition in the network (Speech recognition server), the wireless communication terminal (Speech recognition client) is only used for initialization, activation and setting voice-activated input.

In Abb. 3 ist eine solche Architektur gezeigt. Dabei werden insbesondere zwei Varianten beschrieben. Die erste Variante bietet die Spracherkennung inner halb des WAP-Servers 204 an. In diesem Fall kann das in GSM zur Zeit realisierte leitungsvermittelnde Funknetz verwendet werden, um mit dem Internet sprachge steuert zu kommunizieren. Gegenstand dieser Aufgabe ist es auch insbesondere eine Entwicklungsumgebung für das drahtlose Kommunikationsendgerät zur Verfügung zu stellen, um eine bestimmte Internet-Seite mit individueller Sprach erweiterung herzustellen. Die zweite Variante verwendet das aktuell im Aufbau befindliche Paket-Vermitteltes Funknetz von GSM, wobei nun die Basistations- Steuerung 200 zunächst mit einem Serving GPRS (General Packet Radio Service) Knoten SGSN 206 verbunden wird, auf dem unter anderen die Spracherkennung realisiert wird. Vom SGSN wird dann eine Verbindung zum Internet Server 210 über den Verbindungsknoten GGSN (Gateway GPRS Node) 208 hergestellt.Such an architecture is shown in Fig. 3. Two variants are described in particular. The first variant offers speech recognition within the WAP server 204 . In this case, the circuit-switching radio network currently implemented in GSM can be used to communicate with the Internet in a voice-controlled manner. The object of this task is in particular to provide a development environment for the wireless communication terminal in order to produce a specific Internet page with individual language extension. The second variant uses the packet-switched radio network from GSM that is currently being set up, the base station controller 200 now being initially connected to a serving GPRS (General Packet Radio Service) node SGSN 206 on which, among other things, the speech recognition is implemented. A connection to the Internet server 210 is then established by the SGSN via the connection node GGSN (Gateway GPRS Node) 208 .

Eine weitere Aufgabe der Erfindung besteht darin, den an jedem drahtlosen Kom munikationsendgerät vorhanden Lautsprecher zur Unterstützung der Ausgabe von Internet-Information zu verwenden. Dazu muß in Abb. 3 der WAP-Ser ver bzw. der SGSN um eine zusätzliche Komponente zur Sprach-Synthese erwei tert werden. Die entsprechenden Internet-Seiten müssen so aufbereitet werden, daß neben einer visuellen Ausgabe nun auch eine akustische Ausgabe der Infor mation automatisch angefordert werden kann.Another object of the invention is to use the loudspeaker present on each wireless communication terminal to support the output of Internet information. For this purpose, the WAP server or the SGSN must be expanded by an additional component for speech synthesis in Fig. 3. The corresponding Internet pages must be prepared so that, in addition to a visual output, an acoustic output of the information can now be requested automatically.

Die Realisierung im Netz hat den Vorteil, dass sie damit allen Teilnehmern unab hängig von ihrem Endgerät zur Verfügung gestellt werden kann, und vom Netz- oder Dienst-Anbieter basierend auf der Spracherkennung spezielle Dienstange bote realisiert werden kann. Auf diese Weise kann auch der Preis der benötigten drahtlosen Kommunikationsendgeräte minimiert werden. Die Realisierung im Netz hat auch den Vorteil, dass ohne den Austausch der drahtlosen Kommunika tionsendgeräte neue Spracherkenner im Netz genutzt werden können. The realization in the network has the advantage that it is independent of all participants depending on your device, and from the network or service providers special services based on speech recognition messenger can be realized. In this way, the price of what is needed wireless communication terminals can be minimized. The realization in Network also has the advantage that without the exchange of wireless communications end devices new speech recognizers can be used in the network.

Description of the Preferred Embodiments

Üblicherweise muß sich ein mobiler Teilnehmer in einem Mobilfunknetz mit Hil fe seines drahtlosen Kommunikationsendgerät im Mobilfunknetz mittels einer Einbuchung anmelden.Usually, a mobile subscriber in a mobile network with Hil fe of its wireless communication terminal in the cellular network using a Register your booking.

Im Falle der spracherweiterten Ein- und Ausgabe am drahtlosen Kommunikati onsendgerät muß nach der Einbuchung im System bei Bedarf die Sprach-Kom munikation mit dem Internet aktiviert werden. Dazu wird ein Verfahren vorgeschlagen, welches im drahtlosen Kommunikationsgerät implementiert wird. In Abb. 4 wird gezeigt, wie das drahtlose Kommunikationsendgerät um einen Sprach-Klienten erweitert wird, der mit dem Sprach-Manager im Fest netz-Server kommuniziert. Im folgenden Teil der Erfindung wird abkürzend Fest netz-Server geschrieben, wenn entweder ein WAP-Server oder ein SGSN Server oder ein zukünftiger UMTS-Server gemeint ist.In the case of language-enhanced input and output on the wireless communication terminal, the voice communication with the Internet must be activated after logging in to the system, if necessary. For this purpose, a method is proposed which is implemented in the wireless communication device. Fig. 4 shows how the wireless communication terminal is expanded to include a voice client that communicates with the voice manager in the fixed network server. In the following part of the invention, fixed network server is abbreviated if either a WAP server or an SGSN server or a future UMTS server is meant.

Die Aufgabe des Sprach-Klienten auf der drahtlosen Teilnehmer-Seite sind die Teilaufgaben, entweder eine individuell gespeicherte Sprachumgebung einer In ternet-Seite, die zusammen mit dem Teilnehmer-Profil in einem dem Mobilfunk teilnehmer zugeordneten Register gespeichert ist (Home Location Register), auf den aktuellen Festnetz-Server zu laden, oder ein Werkzeug zur Erstellung von Sprach-Erweiterten Internet-Seiten zu laden, um damit anschließend eine spezi elle spracherweiterte Internet-Seite zu produzieren.The task of the voice client on the wireless subscriber side is that Subtasks, either an individually saved language environment of an In ternet page, which together with the subscriber profile in a the mobile network subscriber-assigned register is stored (home location register) to load the current landline server, or a tool for creating Language-enhanced Internet pages to load a special to produce a language-enhanced website.

Zur Realisierung dieser Aufgabe wird auf das bekannte Client/Server-Modell zu rückgegriffen. Der Client, das drahtlose Kommunikationsendgerät, besitzt Befeh le, um die Dienste des Servers, die Festnetz-Komponente, zu verwenden. Die in der vorliegenden Erfindung zur Kommunikation mit dem Internet vorgesehene Spracherkennung erfolgt nicht am drahtlosen Kommunikationsendgerät, son dern an der Festnetz-Komponente. Dazu müssen am Festnetz-Server Komponenten zur Sprach-Erkennung untergebracht sein.The well-known client / server model is used to implement this task resorted to. The client, the wireless communication terminal, has commands le to use the services of the server, the landline component. In the of the present invention for communicating with the Internet Speech recognition does not take place on the wireless communication terminal, son on the landline component. This requires components on the fixed network server for speech recognition.

Die vorliegende Erfindung sieht die Möglichkeit vor, am drahtlosen Kommuni kationsendgerät die Sprach-Erweiterte Eingabe für Internet-Seite zu aktivieren bzw. zu deaktivieren. Dazu wird ein allgemeiner Soft- oder Hardware Button am drahtlosen Kommunikationsendgerät bereit gestellt. Die vorliegende Erfindung beschreibt ein Verfahren, welches dem Teilnehmer erlaubt durch Betätigung des Sprach-Buttons nun zusätzlich mit Hilfe von Sprach-Kommandos innerhalb der aufgerufenen Internet-Seite zu navigieren, statt nur wie bisher üblich mit Hilfe von "Klicks" und Listen-Aufrufen. Das gesprochene Sprach-Kommando wird dann nicht direkt am drahtlosen Kommunikationsendgerät umgesetzt, sondern zunächst in Form digitalisierter Sprache zum Festnetz-Server übertragen. Dort er folgt dann die Sprach-Erkennung und das anschließende Sprach-Verstehen. Das erkannte und verstandene Kommando kann dann im üblichen Internet-Standard übertragen werden, und lokal am drahtlosen Kommunikationsendgerät umgesetzt werden. Zur Beschleunigung des Verfahrens kann am Festnetz-Server ein Abbild der lokal aufgerufenen Internet-Seite gehalten werden. Das verstandene Sprach- Kommando kann dann zunächst innerhalb dieser Kopie in eine Navigationsan weisung umgesetzt werden, und die veränderte Internet-Seiten-Information an schließend zum drahtlosen Kommunikationsendgerät übertragen werden.The present invention provides the possibility of wireless communication kationsendgerät to activate the voice-extended input for the website or deactivate. For this purpose, a general soft or hardware button on wireless communication terminal provided. The present invention describes a method that allows the participant by pressing the Language buttons now also with the help of voice commands within the to navigate to the accessed website, instead of just using help as was previously the case of "clicks" and list calls. The spoken voice command is then not implemented directly on the wireless communication terminal, but first transmitted to the landline server in the form of digitized speech. There he then follows the speech recognition and the subsequent speech understanding. The recognized and understood command can then be in the usual Internet standard are transmitted, and implemented locally on the wireless communication terminal become. To accelerate the process, an image can be displayed on the fixed network server the locally accessed website. The understood speech Command can then first be navigated to within this copy instructions are implemented, and the changed website information are finally transmitted to the wireless communication terminal.

In Abb. 5 wird das Client/Server- Verhalten zur Spracherweiterten Eingabe im mobilen Internet in Form eines Flußdiagramms beschrieben, welches die Kommunikation und Synchronisation der beiden beteiligten drahtloses Kommu nikationsendgerät und Festnetz-Server beschreibt.In Fig. 5, the client / server behavior for language-enhanced input in the mobile Internet is described in the form of a flowchart which describes the communication and synchronization of the two wireless communication terminals involved and the fixed network server.

Die typischen Befehle, die für eine Internet-Seite sprachbasiert einzustellen sind, hängen natürlich von der konkreten Internet-Seite ab. In der Abb. 6 wird ein typisches Beispiel für eine Bank-Anwendung gezeigt. Die drei Schaltflächen: The typical commands that have to be set for a website based on language depend, of course, on the specific website. Fig. 6 shows a typical example of a bank application. The three buttons:

Konto-Übersicht, Überweisung und Lastschrift können in diesem Fall beispiels weise Sprach-Basiert aufgerufen werden, da sie zuvor mit Hilfe des Werkzeugs zur Sprachbasierten Eingabe so eingestellt worden sind. Nach Betätigung des Sprach-Aktivierungs-Buttons am drahtlosen Kommunikationsendgerät kann zum Beispiel Konten-Anzeige gesprochen werden. In diesem Fall wird das digitali sierte Sprachsignal per Funkschnittstelle zum Festnetz-Server übertragen, und anschließend erkannt. Als Folge davon, erscheinen zum Beispiel zwei Konten- Namen am Display des drahtlosen Kommunikationsendgeräts, die ebenfalls sprachgesteuert aktivierbar sind. Durch Aktivierung von Konto-Name 2 werden dann die letzten Konto-Bewegungen und andere Konto-relevanten Daten des an gegeben Kontos im Display angezeigt.Account overview, transfer and direct debit can be called up in this case, for example, language-based, since they were previously set up using the language-based input tool. After pressing the voice activation button on the wireless communication terminal, account display can be spoken, for example. In this case, the digitized voice signal is transmitted to the fixed network server via radio interface and then recognized. As a result, two account names appear on the display of the wireless communication terminal, for example, which can also be activated by voice control. By activating account name 2 , the last account movements and other account-relevant data of the specified account are shown on the display.

Eine weitere Aufgabe der Erfindung besteht in der Sicherung von Zugriffen mit Hilfe der Sprach-Erkennung. Die Aufgabe der Erfindung besteht darin, eine Si cherheitsrelevante Internet-Seite mit Hilfe von einem Sprach-Kommando zu si chern. Durch die Einstellung auf Sprecher-Abhängige Erkennung am Spracherkenner im Festenetz-Server, kann dann nur die Person die Internet-Seite betreten, die dieselbe Seite zuvor mit demselben Sprach-Kommando eingestellt hatte. Typische Fallandwendungen dieser Form der Authentifikation sind Bank- Seiten oder Seiten von Vertriebs-Beauftragten, die per drahtlosen Kommunikati onsendgerät auf ihre Informationen komfortabel und sicher zugreifen wollen. Genau wie die Eingabe per Sprache erweiterbar ist, kann auch die Ausgabe per Sprache erweitert werden. Für diesen Fall wird in der vorliegenden Erfindung im Festnetz-Server zusätzlich eine Komponente zur Sprach-Synthese installiert, wie in Abb. 7 gezeigt. Desweiteren wird ein Sprach-Ausgabe-Button am draht losen Kommunikationsendgerät vorgesehen (Hard- oder Software-mäßig).Another object of the invention is to secure access using voice recognition. The object of the invention is to secure a security-relevant Internet site with the aid of a voice command. By setting the speaker-dependent recognition on the speech recognizer in the fixed network server, only the person who has previously set the same page with the same voice command can then access the website. Typical use cases of this form of authentication are bank pages or pages of sales representatives who want to access their information comfortably and securely via wireless communication terminal. Just as the input can be expanded by voice, the output by voice can also be expanded. For this case, a component for voice synthesis is additionally installed in the fixed network server in the present invention, as shown in FIG. 7. Furthermore, a voice output button is provided on the wireless communication terminal (hardware or software).

Durch entsprechende Aktivierung erfolgt zusätzlich oder alternativ eine Sprach- Ausgabe. Im letzten Beispiel würden die Konto-Bewegungen des entsprechenden Kontos angesagt werden. Dazu wird die Internet-Seite, die auf dem Festnetz-Ser ver gespiegelt ist, gelesen und die entsprechenden Texte (Folge von Wörter) durch einen Sprach-Synthesizer geschickt und anschließend über die Funk schnittstelle als digitale Sprachsignale an das drahtlose Kommunikationsendgerät übertragen.Appropriate activation additionally or alternatively gives a voice Output. In the last example, the account movements would be the corresponding Account will be announced. For this purpose, the Internet site on the landline ser is mirrored, read and the corresponding texts (sequence of words) sent through a speech synthesizer and then over the radio interface as digital voice signals to the wireless communication terminal transfer.

Die besten Ergebnisse bei der Ausgabe können durch eine Kombination von Text, Grafik und Sprache realisiert werden. Dazu kann während der Bearbeitung der speziellen Internet-Seiten eine grobe Skizze als Grafik deklariert werden. Die De tails-Angaben sind dann per Sprachausgabe erreichbar.A combination of text, Graphics and language can be realized. This can be done while editing the a rough sketch can be declared as a graphic on special Internet pages. The De Tails information can then be reached via voice output.

Eine weitere Aufgabe der Erfindung besteht darin, Sprach-Befehle unabhängig von einer speziellen Internet-Seite zu erlernen. Dazu muß ebenfalls das Werk zeug zur Erstellung von Internet-Seiten geladen werden. Typische Beispiele sol cher Befehle sind Adressen von Internet-Seiten (Unified Ressource Locator URL), deren Spracheingabe gelernt wird und die anschließend zusammen mit den Teilnehmer spezifischen Daten an der Heimat-Adresse des Teilnehmers (Home Location Register) im Mobilfunknetz abgelegt werden. Um eine sichere und komfortable Arbeitsweise sicherzustellen wird vorgeschlagen zur Eingabe von URLs Buchstabenerkenner statt Worterkenner einzusetzen.Another object of the invention is to make voice commands independent to learn from a special website. The factory must also do this stuff for creating web pages. Typical examples are sol The commands are addresses of Internet pages (Unified Resource Locator URL), the input of which is learned and which is then carried out together with the Participant-specific data at the participant's home address (Home Location register) in the mobile network. To be safe and secure To ensure comfortable working is suggested for entering Use URLs letter recognizer instead of word recognizer.

Eine weitere Aufgabe besteht darin, Listen elektronischer Adressen (email) von Internet-Benutzern sprachgesteuert zu lernen und zusammen mit den Teilnehmer spezifischen Daten an der Heimat-Adresse des Teilnehmers (Home Location Re gister) im Mobilfunknetz zur späteren Verwendung abzulegen. Für den Fall, dass der Teilnehmer eine Email-Nachricht senden will, baut er zunächst die Adresse der Email-Nachricht auf, in dem er den Namen der Person in das drahtlose Kom munikationsendgerät hineinspricht. Der Spracherkenner im Netz erhält die digitalen Sprachsignale, die über die Funkschnittstelle übertragen werden und erkennt daraufhin die richtige email-Adresse, die im Mail-Programm eingebaut wird. Diese Information wird dem drahtlosen Kommunikationsgerät angezeigt. Eine weitere Aufgabe der Erfindung besteht darin, den Inhalt der Email per Spracheingabe automatisch zu erkennen und aufzuschreiben. Dazu wird per Soft- oder Hardware-Taste am drahtlosen Kommunikationsendgerät die Spracheinga be aktiviert. Die gesprochenen Wörter und/oder Sätze, die danach über das Mi krofon des drahtlosen Kommunikationsendgeräts empfangen werden, werden als digitale Sprachsignale zum Sprach-Server im Netz übertragen. Dort werden diese erkannt und in den Inhaltsbereich der Email als normale (ASCII) Zeichen einge tragen. Gleichzeitig werden die digitalen ASCII Zeichen dem drahtlosen Kom munikationsendgerät übertragen, so dass der Teilnehmer die Möglichkeit der Korrektur hat. Durch einen besonderen Befehl wird die auf diese Weise erstellte Email versendet.Another task is to keep lists of electronic addresses (email) from Internet users learn by voice and together with the participants specific data at the home address of the participant (Home Location Re gister) in the mobile network for later use. In case that If the participant wants to send an email message, he first builds the address the email message in which he entered the person's name into the wireless comm communication terminal. The speech recognizer in the network receives the digital one Speech signals that are transmitted via the radio interface and will then recognize the correct email address that is built into the mail program becomes. This information is displayed to the wireless communication device. Another object of the invention is to send the content of the email via Automatically recognize and write down voice input. To do this, or hardware button on the wireless communication terminal be activated. The spoken words and / or sentences that are then said about the Mi of the wireless communication terminal are received as transmit digital voice signals to the voice server in the network. There are these recognized and entered in the content area of the email as normal (ASCII) characters wear. At the same time, the digital ASCII characters are sent to the wireless comm communication terminal device, so that the participant the possibility of Has correction. With a special command, the one created in this way Email sent.

Die vorliegende Erfindung beschreibt auf die gleiche Weise die Ausgabe von empfangenen Emails. Durch Aktivierung eines Soft- oder Hardware-Buttons werden die empfangenen Emails im Sprach-Server im Netz in Sprache umgewan delt und dann als digitale Sprachsignale dem drahtlosen Kommunikationsgerät übertragen und dort als analoges Sprachsignal wiedergegeben. The present invention describes the output of received emails. By activating a soft or hardware button the received emails are converted into speech in the voice server on the network delt and then as digital voice signals to the wireless communication device transmitted and reproduced there as an analog voice signal.

Description of the pictures

Abb. 1 ist ein schematisches Diagramm, das ein allgemeines drahtloses Kommunikationssystem darstellt. Fig. 1 is a schematic diagram illustrating a general wireless communication system.

Abb. 2 ist ein schematisches Diagramm, das ein drahtloses Kommunikati onssystem darstellt, welches mit Hilfe eines WAP-Servers mit dem Internet ver bunden ist. Fig. 2 is a schematic diagram illustrating a wireless communication system that is connected to the Internet using a WAP server.

Abb. 3 und 4 sind Strukturblockdiagramme, welche die drahtlose Kom munikationseinrichtungen im Zusammenhang mit der Behandlung von Internet- Seiten um Spracherkennung anreichert, wobei die Realisierung der Spracherken nung im Festnetz gezeigt ist gemäß der Erfindung als Client/Server-Modell. Abb. 5 ist ein Flußdiagramm, welches gemäß der vorliegenden Erfindung die Kooperation der Sprach-Klienten und Sprach-Server im drahtlosen Kommu nikationsendgerät und dem Festnetz-Server zeigt, zur Aktivierung, Navigation und Bearbeitung von Internet-Seiten mit zusätzlicher Sprach-Erweiterung. Fig. 3 and 4 are structural block diagrams showing the wireless communication facilities in connection with the treatment of internet pages to accumulate voice recognition wherein the realization of the voice recognition is shown in the fixed network according to the invention as a client / server model. Fig. 5 is a flowchart which shows the cooperation of the voice clients and voice servers in the wireless communication terminal and the landline server, according to the present invention, for activating, navigating and editing Internet pages with additional voice extension.

In Abb. 6 wird ein typisches Fallbeispiel der Erfindung gezeigt, welches die sprach-erweiterte Navigation in Bank-Internet-Seiten betrifft.In Fig. 6 a typical case example of the invention is shown, which concerns the language-extended navigation in bank Internet pages.

Abb. 7 zeigt das um Sprach-Synthese erweiterte Strukturblockdiagramm von Abb. 4. Fig. 7 shows the structure block diagram of Fig. 4 expanded by speech synthesis.

Claims

1. Method in a wireless communication terminal that is connected to a fixed network server to display information from the Internet on the wireless communication terminal. The method uses the client / server model to implement flexible speech recognition on the wireless communication terminal.
The method includes the voice client to be installed on the wireless communication terminal and the voice server to be installed in the fixed network server. The method comprises the steps for activating the speech recognition on the wireless communication terminal in order to recognize spoken letters, words or sentences in the speech server of the network, which are transmitted digitally via the radio interface, and the subsequent navigation in the Internet as a result of Voice recognition. The information recognized from the digitally transmitted voice signal is converted into navigation in the Internet on the fixed network server and the result is communicated to the wireless communication end device.

2.Procedure as under 1, but now the output by automatic voice Synthesis is supplemented. Exactly as under 1, is now additionally on the fixed network ser ver uses a speech synthesis tool that can output speech on wireless communication terminal.

3. Procedure as under 1, with a list of so-called In Learned ternet addresses (Unified Resource Locators) using voice commands and together with the participant specific data at home Address of the subscriber (home location register) stored in the mobile network become. After activating this functionality on the wireless communication tion device, the subscriber can use a voice command to send the desired Internet address Enter the spoken internet address over the radio interface to the fixed network server is transferred, recognized and understood there and the corresponding recognized website is loaded from the Internet on the landline Server copied and over the radio interface for wireless communication transferred to the terminal and shown there on the display.

4. The procedure as under 1, but now the procedure for editing of conc Internet pages are described. As a result, the special internet Pages expanded by voice input functionality and together with the part specific data at the participant's home address (Home Lo cation register) in the mobile network. After that, the participant is at Pressing the voice button on his wireless communication terminal this saved website is always given, if he has the corresponding In ternet address.

5. The procedure as under 2, but now the procedure for editing conc Internet pages are described. As a result, the special internet Pages expanded and added with voice input and voice output functionality together with the participant specific data at the home address of the part stored in the mobile network. After that the participant when pressing the voice button on his wireless comm communication terminal always specifies this saved website, if he indicates the corresponding Internet address.

6. Procedure as under 1, with an additional list of so-called Email addresses can be learned via voice commands, and along with that Participant-specific data at the participant's home address (Home Location register) in the mobile network. After activating this Functionality on the wireless communication terminal, the subscriber can Enter the desired email address by voice command.

7. Procedure as under 4 and 5, with the additional option of speaking controlled speaker-dependent authentication is installed. The procedure ren includes the steps of installing and using them Access control. The edited Inter net page secured with a voice command, the voice server in the Network must be set to speaker-dependent. For use is after Calling up the website of the participants prompted the voice command to speak, which is transmitted as a digital voice signal via the radio interface is recognized and correctly or incorrectly recognized as a speaker on the fixed network server. Only in the positive case is the further processing of the website via the Radio interface approved.

8. Procedure as under 1 and 2, but now electronic messages (emails) produced on the wireless communication terminal by spoken language and automatically converted into characters (ASCII) sequences in the language server the and are sent as such, and vice versa received electronic Messages automatically converted from (ASCII) characters to digital language are in the voice server on the network and the wireless communication device Output via loudspeaker can be transmitted via the radio interface.

9. Procedure as under 1, 2, 3, 4, 5, 6 and 7 extended to wired commu nication terminals, which are just like the wireless communication terminals have a display.

10. Procedure as under 1, 2, 3, 4, 5, 6 and 7 extended to wired commu nikationsendgeräte that have no display, but now no combination of Voice input and output and website for communication with the part is used, but only pure voice input and only synthetic Narrator is used.

11. Procedure as under 1, 2, 3, 4, 5, 6 and 7 extended to any package or lei switching networks that offer access to the Internet, the just like the wireless communication terminals have a display.

12. Procedure as under 4 and 5, but now the Internet pages are not part but from the network operator or service provider for language processing processing can be expanded. The Internet pages thus expanded not together with participant data but globally available for all Inter net users filed on the Internet.

13. Procedure as in 12, the participant using the connection request an Internet page is asked whether he wants language extension. In positi If necessary, the language-extended pages are fetched, otherwise the normal pages, whereby both sides variants in the global for all Internet users Internet servers stores are.