[go: up one dir, main page]

WO2000024198A1 - An interactive television system and method for converting non-textual information to textual information by a remote server - Google Patents

An interactive television system and method for converting non-textual information to textual information by a remote server Download PDF

Info

Publication number
WO2000024198A1
WO2000024198A1 PCT/US1999/024710 US9924710W WO0024198A1 WO 2000024198 A1 WO2000024198 A1 WO 2000024198A1 US 9924710 W US9924710 W US 9924710W WO 0024198 A1 WO0024198 A1 WO 0024198A1
Authority
WO
WIPO (PCT)
Prior art keywords
textual information
television system
top box
server
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US1999/024710
Other languages
French (fr)
Inventor
Vincent Dureau
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
OpenTV Inc
Original Assignee
OpenTV Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=22645087&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=WO2000024198(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Priority to DE69910813T priority Critical patent/DE69910813T2/en
Priority to EP99962656A priority patent/EP1127462B1/en
Priority to JP2000577835A priority patent/JP4555480B2/en
Priority to CA002347006A priority patent/CA2347006A1/en
Priority to BR9914740-8A priority patent/BR9914740A/en
Application filed by OpenTV Inc filed Critical OpenTV Inc
Priority to AU19057/00A priority patent/AU755646B2/en
Priority to HK02101415.3A priority patent/HK1042001B/en
Priority to AT99962656T priority patent/ATE248483T1/en
Publication of WO2000024198A1 publication Critical patent/WO2000024198A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234336Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by media transcoding, e.g. video is transformed into a slideshow of still pictures or audio is converted into text
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/16Analogue secrecy systems; Analogue subscription systems
    • H04N7/173Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
    • H04N7/17309Transmission or handling of upstream communications
    • H04N7/17318Direct or substantially direct transmission and handling of requests

Definitions

  • TITLE AN INTERACTIVE TELEVISION SYSTEM AND METHOD FOR CONVERTING NONTEXTUAL INFORMATION TO TEXTUAL INFORMATION BY A REMOTE SERVER
  • the invention relates generally to interactive television systems and more particularly to means and methods for using a server to convert user-provided information into a data format which can be used by an interactive television application.
  • Interactive television systems can be used to provide a variety of services to users. These systems are capable of displaying text and graphic images to facilitate viewer interaction as well as the audio and video streams associated with ordinary television programs. Interactive television systems enable viewer interaction and thereby allow the systems to be used for marketing and educational purposes in addition to extending the entertainment capabilities of an ordinary television. Viewers can order advertised products or services, request information regarding particular programs, or send electronic messages (e-mail).
  • a broadcast service provider In a typical interactive television system, a broadcast service provider generates an interactive television signal for transmission to a viewer's television.
  • the interactive television signal includes an audio- video portion consisting of a television program, as well as an interactive portion consisting of application code or control information.
  • the broadcast service provider combines the audio-video and interactive portions into a single signal for transmission to a receiver connected to the user's television.
  • the signal is typically compressed prior to transmission and transmitted through broadcast channels such as cable television (CATV) lines or direct satellite transmission systems.
  • CATV cable television
  • the interactive functionality of the interactive television system is controlled by a set-top box connected to the television.
  • the set-top box receives the signal transmitted by the broadcast service provider, separates the interactive portion from the audio-video portion and decompresses the respective portions of the signal.
  • the interactive portion of the signal comprises an interactive application
  • the application may be executed while the audio-video information (the normal television signal) is conveyed to the television.
  • the set-top box may combine the audio-video information with interactive graphics or audio generated by the interactive application prior to conveying the information to the television.
  • the interactive graphics and audio may present additional information to the viewer or may prompt the viewer for input and may be designed to function in response to signals in the broadcast or actions taken by the viewer.
  • An interactive television system may also ran applications that present the user only with the audio and graphics relating to the interactive application. In other words, this information may be displayed in place of the normal television signal, rather than adding to the signal.
  • the invention comprises a system and method for enabling a user to provide non-textual information which is converted by the system to a textual form in which it can be used by the interactive application
  • the non-textual information is entered by the user at the set-top box of a receiving station and this information is transmitted to a server which may be located at a broadcast station
  • the server converts the information into textual data so that it can be used by the system
  • the server transmits the textual data back to the receiving station, where it can be used by an application executing in the set-top box
  • the textual data can be used at the server or transmitted to a part of the system other than the set- top box
  • One embodiment comprises an interactive television system comprising a broadcast station and a receiving station
  • the broadcast station transmits an interactive television application to the receiving station, which then executes the application
  • the application requires textual data from the user
  • the receiving station includes a set-top box which executes the application and a graphics tablet which is coupled to the set-top box for entering information
  • the user provides the information by writing on the graphics tablet, which generates an image file
  • the image file is transmitted from the set-top box to the broadcast station
  • the broadcast station includes a server which uses handwriting recognition software to convert the image file into character or textual data
  • the textual data is transmitted back to the application at the set-top box, which uses the data as if it had been typed in directly by the user
  • the handwriting recognition software resides on the server computer instead of the set-top box, more resources are available for execution of the software
  • the server typically has more available memory and more processing power than the set-top box and consequently provides much faster recognition of handwriting images
  • the greater available resources enable the software to provide for recognition of additional, complex languages (e g , Chinese) and extended character sets (e g , Unicode) Since the recognition software is maintained on the server, the software can be quickly and easily updated with the latest handwriting recognition technology, and it does not have to be distributed to the individual subscribers' set-top boxes
  • the textual data need not be sent back to the set-top box, but may instead be used at the server or another location remote from the set-top box
  • a user may place an order for a product by entering oral or handwritten information and the order may be confirmed later by e-mail or other means
  • the server could consist of a human operator who receives the audio or image data and manually transcribes it into textual information
  • the audio or image data may alternately be converted into textual information by automated means operating under human supervision
  • the interactive television system includes a microphone rather than a graphics tablet for entry of information to the system The microphone is used to provide voice data, which is recorded and transmitted to a server equipped with a voice recognition application
  • the voice recognition application converts the voice data into textual data, which is then transmitted back to the application executing on the set-top box
  • Fig 1 is a block diagram illustrating one embodiment of a system for distribution of interactive television applications and television programs from their sources to a series of viewers
  • Fig 2 is a block diagram of a set-top box m one embodiment of the invention
  • Fig 3 is a block diagram illustrating the components of a graphics tablet in one embodiment of the invention
  • Fig 4 is a flow diagram illustrating the flow of handwritten textual information in one embodiment of the invention
  • Fig 5 is a flow diagram illustrating the manner in which the server's handwriting recognition application processes the image data to produce textual data in one embodiment of the invention
  • an interactive television system transmits an audio-video-mteractive signal from a broadcast station to a receiving station via a broadcast channel
  • broadcast channel may comprise a direct satellite transmission channel or any known means for broadcasting a signal, including non-satellite, cable, telco, MMDS (microwave) and terrestrial transmissions (A "direct" satellite transmission as used herein means a transmission received by the interactive television receiver directly from the satellite )
  • the receiving station in this embodiment is additionally configured to receive signals via a modem connection to the broadcast station
  • the audio-video-interactive signal broadcast to the receiving station may contain both television programming and interactive information such as control signals or interactive applications When the broadcast signal is received, it is separated into its components and processed (e g , decompressed) to reconstruct the respective television programming and interactive signals
  • An interactive application which
  • the set-top box of the receiving station is configured with a device for non-textual data entry
  • this device is a graphics tablet on which the user can inscribe (write) the information required by the application
  • the graphics tablet digitizes the user's handwriting and stores the digitized data as an image file
  • the set-top box transmits the file containing the digitized data to the broadcast station for conversion into a textual form (The image data need not all be contained in a single file, and may be transmitted to the server in segments to spread out the processing load )
  • the broadcast station includes a server which is configured to transform image data into textual data using handwriting recognition algorithms
  • the broadcast station then transmits the textual data back to the receiving station
  • the application executing on the set- top box identifies the received textual data as corresponding to the image data which it earlier transmitted to the broadcast station
  • the set-top box accordingly utilizes the textual data as if it had been entered as text by the user of the set-top box and continues execution of the interactive television application Referring to
  • the broadcast signal is relayed by communications satellite 28 and received by receiving station 20 Although the figure illustrates a satellite transmission, it is contemplated that any broadcast medium (e g , CATV or direct satellite transmission) may be used
  • Receiving station 20 is contemplated to be one of a number of such stations which are subscribers of the broadcast service provider operating broadcast station 10
  • the broadcast signal is collected by receiving antenna 21 and fed to set-top box 22
  • Set-top box 22 processes the packetized signal to reconstruct the television programs and interactive applications embodied in the signal
  • the reconstructed applications are executed m the set-top box, while the reconstructed television programs are passed to television 23, where they are displayed (It is noted that television 23 can be any suitable monitor or display device )
  • the interactive applications may generate graphics or audio which are combined with the television program prior to being displayed
  • the interactive applications may also be displayed m place of the television programming
  • a modem channel which may also be referred to as an http channel, or hypertext transfer protocol channel
  • These types of channels serve two functions in the system they allow the set-top box to provide data to the broadcast station, and they provide an alternate path for from sources 11-13 to be delivered to receiving station 20
  • the server is not constrained to be located at the broadcast station It may be more convenient to locate server 13 at a site which is separate from broadcast station 10 in order to reduce the workload of the broadcast station, to more efficiently convey data between the receiving station and the server, or for other reasons
  • Tuner 31 selects the channel on which the broadcast audio-video- mteractive signal is transmitted and passes the signal to processing unit 32 (Tuner 31 may be replaced by other means, all collectively referred to herein as input ports, for receiving signals from various signal sources )
  • Processing unit 32 demultiplexes the packets from the broadcast signal if necessary and reconstructs the television programs and/or interactive applications embodied in the signal
  • the programs and applications are then decompressed by decompression unit 33
  • the audio and video information associated with the television programs embodied m the signal is then conveyed to display unit 34, which may perform further processing and conversion of the information into a suitable television format, such as NTSC or HDTV audio/video
  • Graphics tablet 39 provides a means for the user to supply handwritten information for conversion to text and subsequent use by the applications (In other embodiments, graphics tablet 39 may be replaced by a microphone for supplying voice data or some other type of input device for supplying non-textual information to the system )
  • Microprocessor 35 may comprise various types of microprocessors, micro-controllers, digital signal processors (DSPs), or other types of software instruction processing devices, as are appropriate to the particular design
  • RAM 37 may include memory units which are static (e g , SRAM), dynamic (e g , DRAM), volatile or non-volatile (e g , Flash memory), as required to support the functions of the set- top box
  • microprocessor 35 executes operating system code which is stored in ROM 36 (In some embodiments, ROM 36 may comprise Flash memory or EEPROMs )
  • the operating system code executes continuously while the set-top box is powered m the same manner the operating system code of a typical personal computer (PC) and enables the set-top box to act on control information and execute interactive and other applications
  • the set- top box also includes modem 38 Modem 38 provides both a return path by which viewer data can be transmitted to the broadcast station and an alternate path by which the broadcast station can transmit data to the set-top box
  • set-top box refers to any receiver or processing unit for receiving and processing a transmitted signal and conveying the processed signal to a television or other monitor
  • the set-top box may be in a housing which physically sits on top of a television, it may be in some other location external to the television (e g , on the side or back of the television or remotely located from the television), or it may be incorporated into the television itself Alternatively, the functionality of the set-top box may be entirely removed from the television and placed in a more remote location, such as outside a house in which the set-top box is located
  • Set-top box 22 serves to demodulate (if necessary) the signal received from broadcast station 10 and to separate the components of the signal, such as different television programs and interactive applications
  • Other embodiments of the set-top box may have different components or interconnections than those shown in Fig 2
  • television 23 may be a television or a video monitor employing any suitable television format (e g , NTSC or HDTV), or it may be
  • the combination of display 51 and digitizer 52 should be selected to provide sufficient resolution to allow an accurate digital representation of the user's handwriting It is contemplated that the resolution of the digitizer and display should be roughly equivalent to that of a fairly high-quality printer The combination should also provide display quality (e g , brightness and contrast) which makes the handwritten information easy for the user to see
  • Microprocessor 53 should be fast enough that the path of the digitizer pen is displayed without any substantial delay as it is drawn by the user Although microprocessor 53 is shown separately from microprocessor 35, it is contemplated that a single device may be sufficient to perform both functions
  • Digitizer 52 may also be constructed using one of a number of technologies Although early technologies often used opaque structures, technologies used m modern digitizers are more likely to incorporate transparent panels which are designed to be used in conjunction with overlaid displays Digitizer 52 may use electrostatic, resistive film or capacitive/electrostatic film technologies Various embodiments of the invention may incorporate different ones of the technologies which are best suited to the particular embodiments Digitizer 52 sense
  • digitizer 52 may sense not only X-Y position, but also the direction of the pen's path, the pen's angle, speed and acceleration, and other information which forms a vectorized representation of the user's handwriting This vectorized information can be transmitted to the server as it is produced so that recognition can be performed as the user is writing.
  • the use of vectorized information to recognize handwritten information is sometimes referred to as on-line recognition
  • the receiving station is operatively connected to the broadcast station by a broadcast channel
  • This broadcast channel can utilize various transmission media and is contemplated to include media such as coaxial cable and free space (e g , as used for direct satellite transmissions )
  • the broadcast channel forms a transmission path between the broadcast station and the receiving station
  • the broadcast station and receiving station are also connected by a return path
  • the return path typically consists of a pair of modems, one in the receiving station and one in the broadcast station, each connected to a standard telephone line
  • Other means for establishing a return path e g , using a portion of the bandwidth of the broadcast channel
  • the system illustrated m Fig 1 utilizes the broadcast channel to establish a return path for communicating data from the receiving station to the server
  • an application is transmitted from the broadcast station to the receiving station, where it is reconstructed and executed
  • the application may be resident in the set-top box or may be provided to the set-top box by means other than the broadcast channel (e g , flash card )
  • Fig 4 is a flow diagram illustrating the flow of handwritten textual information m the system While the application is executing 61, it prompts the user for some sort of textual information 62
  • the user enters the information on the graphics tablet 63, writing the information in his or her normal handwriting
  • the graphics tablet digitizes the image of the user's writing and conveys the digitized data to the set-top box, which transmits the information to the server 64 via the return path to the broadcast station
  • the image data is received by the broadcast station and passed on to the server
  • the server processes the image data 65, recognizing characters and/or words in the image and producing the equivalent textual data
  • This textual data is then transmitted back to the receiving station 66, either via the broadcast channel or via the return channel
  • Fig 5 is a flow diagram illustrating the manner in which the server's handwriting recognition application processes the image data to produce textual data
  • the handwriting image is transmitted to the server in segments as they are generated by the graphics tablet This may serve both to reduce the peak loading of the server resulting from recognition of the image and to allow the server to provide simultaneous feedback (l e , recognized characters) to the user ("Simultaneous" as used here means that some recognized text is transmitted to the set-top box as the user continues to write on the graphics tablet so that the user can see the results of the handwriting recognition )
  • the image instead combines text with a known form, the form is first identified and removed Then, if the handwriting was confined to several boxes or fields on the form, these fields are isolated and handled individually (Free-form handwritten entries may be distinguished from each other in the same
  • the handwriting in a field is recognized by first breaking the image into segments showing characters or pieces of characters 71.
  • the individual image segments and combinations of these image segments are selected as character candidates 72.
  • These character candidates are assigned character classes and associated values representative of the confidence with which the recognition application places the segment in the associated character class 73.
  • the character candidates are organized into groups for which a dictionary lookup algorithm can be performed 74.
  • a dictionary look-up determines the word entries which best match the groups of character candidates 75 and may assign a level of confidence to each word. If desired, an acceptability check can be performed to determine whether the confidence level of the recognized words is sufficiently high 76. If the confidence level is too low, the word can be rejected and the image will be considered unrecognizable.
  • the user can also be given the option of checking the textual output of the recognition application and accepting or rejecting all or part of the output 77. After the user has verified the accuracy of the recognized text, the data can be provided to the application executing in the set-top box. It should be noted that the description is intended to be illustrative rather than restrictive and that the process of recognizing vectorized handwriting data or voice data will differ from the foregoing description.
  • the recognition of the user's handwriting may be assisted by the association of contextual information with the handwriting. If the user's handwriting comprises entries on a form, identifying the type of information requested for each entry may make it easier to interpret the entry itself. For example, an entry in a box requesting a social security number should contain nine digits. A character in this entry which might be interpreted as a "1", an "I” or and "1" must be the numeral "1". The identification of keywords such as "to” or "cc:” may likewise distinguish the handwriting which follows as a name or address.
  • an interactive television service provider may wish to provide e-mail service to subscribers.
  • the user can select the e-mail application furnished by the service provider and proceed to write the message which he or she wishes to send on the graphics tablet.
  • the user writes the address of the intended recipient and the message to be sent to the recipient.
  • the graphical data is transmitted to the server, which may segment the image data and then convert the data to text, or it may convert the entire image to text and then parse the text to determine the recipient 's address.
  • only the address of the message is converted to text while the body of the message is transmitted to the recipient as an image.
  • the body of the message need not be converted to text — the recipient can read the message embodied in the image whether or not it is converted to text data.
  • the user may be given the option of selecting an addressee from an address book or a list of previous addressees.
  • a menu can be presented to the user via the graphics tablet to allow selection of one of these addressees. New addressees which are handwritten can be added to the user's address book.
  • the user can also select other options (e.g., sending an image versus sending text only) in the same manner as selecting an addressee.
  • the information which the user enters via the graphics tablet may therefore be a mix of handwriting and selection of particular predefined inputs
  • a system which uses a microphone for non- textual input may also be configured to allow selection of predefined inputs by using audio prompts and corresponding menus
  • the system may be configured to send faxes instead of e-mail
  • the foregoing description of the e-mail-configured system is in large part applicable to the fax-configured system
  • the user can enter a destination fax number and/or addressee via the graphics tablet, and this information can then be transmitted to the server for conversion into textual information
  • the textual information is then transmitted back to the set-top box and used by the fax application to dial the destination fax
  • the transmitted fax may be an image of the user's handwriting, or it may be an image of textual information corresponding to the user's handwriting (It should be noted that in this instance, the system may convert the user's handwriting to textual information so that the information is more legible than handwriting or so that more information will fit on a page, but the transmitted fax will by its nature consist of image data rather than textual data )
  • Electronic commerce services are those through which the user can buy items or otherwise conduct business These services include online catalogs and home shopping services
  • online catalog the user can browse through the catalog to determine which products he or she would like to order
  • the user can provide item information via menu entries, but must enter non-standardized information such as a shipping address via the graphics tablet
  • the user can enter his information by voice
  • the user can use a microphone or a telephone handset to provide voice data to the system
  • the microphone may a special-purpose microphone for use with the interactive television system or it may be a telephone handset
  • a special-purpose microphone may be connected to the set-top box, or it may be built into a remote control for the system
  • a telephone handset may be connected to the set-top box, or it may be connected directly to the return path (I e , telephone line )
  • the voice data is transmitted to the server, which uses voice recognition software to convert the voice data into textual data
  • the textual data is returned to the set-top box, where it can be displayed to the user

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Television Systems (AREA)
  • Selective Calling Equipment (AREA)
  • Character Discrimination (AREA)
  • Information Transfer Between Computers (AREA)
  • Facsimiles In General (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

A system and method for providing user input to an application executing on an interactive television system wherein a user provides non-textual information to the interactive television system and this information is converted by a server to textual information which is provided to the application. In one embodiment, a digitizer pad coupled to a set-top box is used to digitize the user's handwriting. The digitized information is conveyed to a remote server which converts the digitized handwriting data into textual information. The textual information is conveyed to the set-top box, where it is input to an application executing on the set-top box. In another embodiment, a microphone is coupled to a set-top box. The microphone allows the user to input voice information which is digitized and conveyed to the server for conversion into textual information. The textual information is conveyed back to the set-top box and is input to an application executing on the set-top box.

Description

TITLE: AN INTERACTIVE TELEVISION SYSTEM AND METHOD FOR CONVERTING NONTEXTUAL INFORMATION TO TEXTUAL INFORMATION BY A REMOTE SERVER
BACKGROUND OF THE INVENTION
1. Field of the Invention
The invention relates generally to interactive television systems and more particularly to means and methods for using a server to convert user-provided information into a data format which can be used by an interactive television application.
2. Description of the Relevant Art
Interactive television systems can be used to provide a variety of services to users. These systems are capable of displaying text and graphic images to facilitate viewer interaction as well as the audio and video streams associated with ordinary television programs. Interactive television systems enable viewer interaction and thereby allow the systems to be used for marketing and educational purposes in addition to extending the entertainment capabilities of an ordinary television. Viewers can order advertised products or services, request information regarding particular programs, or send electronic messages (e-mail).
In a typical interactive television system, a broadcast service provider generates an interactive television signal for transmission to a viewer's television. The interactive television signal includes an audio- video portion consisting of a television program, as well as an interactive portion consisting of application code or control information. The broadcast service provider combines the audio-video and interactive portions into a single signal for transmission to a receiver connected to the user's television. The signal is typically compressed prior to transmission and transmitted through broadcast channels such as cable television (CATV) lines or direct satellite transmission systems.
The interactive functionality of the interactive television system is controlled by a set-top box connected to the television. The set-top box receives the signal transmitted by the broadcast service provider, separates the interactive portion from the audio-video portion and decompresses the respective portions of the signal. If the interactive portion of the signal comprises an interactive application, the application may be executed while the audio-video information (the normal television signal) is conveyed to the television. The set-top box may combine the audio-video information with interactive graphics or audio generated by the interactive application prior to conveying the information to the television. The interactive graphics and audio may present additional information to the viewer or may prompt the viewer for input and may be designed to function in response to signals in the broadcast or actions taken by the viewer. An interactive television system may also ran applications that present the user only with the audio and graphics relating to the interactive application. In other words, this information may be displayed in place of the normal television signal, rather than adding to the signal.
There are various examples of interactive applications which require viewer input. One example is a televised buying service in which a series of products are displayed via a television program and the viewer utilizes the interactive functions of the television system to purchase the displayed products. The viewer must information descriptive of the product This information may be provided by selecting the appropriate choice from a menu The application may, however, also require the user's name or shipping information which cannot simply be selected from a menu Another example of an application which requires textual information is a messaging application for which the viewer must provide the message, as well as information identifying the addressee of the message Some means is therefore necessary to enter this textual information
One option for entering text would be to use a keyboard This might not be the best solution, however, because some users may not feel comfortable using a keyboard One of the attractions of interactive television systems is the ease with which they may be used Interactive television applications are designed to provide simplified user interfaces and many require no more input than can be provided using a basic remote control The requirement of entering textual information via a keyboard may make the interactive television system less attractive to users because of the additional hardware which is required for the system, the perceived complexity of the system or the difficulty which may be experienced by some users in typing the necessary information Further, in some languages (e g , Chinese,) the complexity of the written language makes text entry via a keyboard difficult even for experienced users Other options for providing textual information to an application may involve means for converting non-textual information into a textual form For example, using voice recognition technology, a user's voice can be sampled and compared to previously sampled speech patterns to determine the words spoken by the user The words can then be output by the speech recognition system as text Handwriting recognition systems could also be used to generate textual information for the application These systems perform the same process on images of the user's handwriting (entered via a graphics tablet or similar input device) to determine the text written by the user While these technologies have evolved to the point that they can reliably generate accurate textual information from the user's voice or handwriting, they require a great deal of computer resources The applications may be very large and they need large amounts of processing time to perform the required pattern matching These technologies therefore cannot be implemented in current set-top boxes, which have very limited storage and processing capacity
SUMMARY OF THE INVENTION
One or more of the problems outlined above may be solved by the various embodiments of the invention The invention comprises a system and method for enabling a user to provide non-textual information which is converted by the system to a textual form in which it can be used by the interactive application The non-textual information is entered by the user at the set-top box of a receiving station and this information is transmitted to a server which may be located at a broadcast station The server converts the information into textual data so that it can be used by the system In one embodiment, the server transmits the textual data back to the receiving station, where it can be used by an application executing in the set-top box In other embodiments, the textual data can be used at the server or transmitted to a part of the system other than the set- top box
One embodiment comprises an interactive television system comprising a broadcast station and a receiving station The broadcast station transmits an interactive television application to the receiving station, which then executes the application The application requires textual data from the user The receiving station includes a set-top box which executes the application and a graphics tablet which is coupled to the set-top box for entering information The user provides the information by writing on the graphics tablet, which generates an image file The image file is transmitted from the set-top box to the broadcast station The broadcast station includes a server which uses handwriting recognition software to convert the image file into character or textual data The textual data is transmitted back to the application at the set-top box, which uses the data as if it had been typed in directly by the user
Because the handwriting recognition software resides on the server computer instead of the set-top box, more resources are available for execution of the software The server typically has more available memory and more processing power than the set-top box and consequently provides much faster recognition of handwriting images The greater available resources enable the software to provide for recognition of additional, complex languages (e g , Chinese) and extended character sets (e g , Unicode) Since the recognition software is maintained on the server, the software can be quickly and easily updated with the latest handwriting recognition technology, and it does not have to be distributed to the individual subscribers' set-top boxes
In one embodiment, the textual data need not be sent back to the set-top box, but may instead be used at the server or another location remote from the set-top box For example, a user may place an order for a product by entering oral or handwritten information and the order may be confirmed later by e-mail or other means In another embodiment, the server could consist of a human operator who receives the audio or image data and manually transcribes it into textual information The audio or image data may alternately be converted into textual information by automated means operating under human supervision In an alternate embodiment, the interactive television system includes a microphone rather than a graphics tablet for entry of information to the system The microphone is used to provide voice data, which is recorded and transmitted to a server equipped with a voice recognition application The voice recognition application converts the voice data into textual data, which is then transmitted back to the application executing on the set-top box
BRIEF DESCRIPTION OF THE DRAWINGS
Other objects and advantages of the invention will become apparent upon reading the following detailed description and upon reference to the accompanying drawings in which
Fig 1 is a block diagram illustrating one embodiment of a system for distribution of interactive television applications and television programs from their sources to a series of viewers Fig 2 is a block diagram of a set-top box m one embodiment of the invention Fig 3 is a block diagram illustrating the components of a graphics tablet in one embodiment of the invention
Fig 4 is a flow diagram illustrating the flow of handwritten textual information in one embodiment of the invention
Fig 5 is a flow diagram illustrating the manner in which the server's handwriting recognition application processes the image data to produce textual data in one embodiment of the invention
While the invention is susceptible to various modifications and alternative forms, specific embodiments thereof are shown by way of example in the drawings and will herein be described in detail It should be understood, however, that the drawmg and detailed description thereto are not intended to limit the invention to the particular form disclosed, but on the contrary, the intention is to cover all modifications, equivalents and alternatives falling within the spirit and scope of the present invention as defined by the appended claims
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS One embodiment of the inventive method is described in detail below In this embodiment, an interactive television system transmits an audio-video-mteractive signal from a broadcast station to a receiving station via a broadcast channel ("Broadcast" is used herein to refer to transmission of a single signal to all subscribing receivers ) The broadcast channel may comprise a direct satellite transmission channel or any known means for broadcasting a signal, including non-satellite, cable, telco, MMDS (microwave) and terrestrial transmissions (A "direct" satellite transmission as used herein means a transmission received by the interactive television receiver directly from the satellite ) The receiving station in this embodiment is additionally configured to receive signals via a modem connection to the broadcast station The audio-video-interactive signal broadcast to the receiving station may contain both television programming and interactive information such as control signals or interactive applications When the broadcast signal is received, it is separated into its components and processed (e g , decompressed) to reconstruct the respective television programming and interactive signals An interactive application which was broadcast to the receiving station or which was resident in the receiving station is executed on the microprocessor of the set-top box The application is one which requires textual data from the user The user, however, is allowed to input the data in a non-textual form "Non-textual" as used herein means a form handled by the system as something other than a series of alphanumeric characters, such as the ASCII character set A graphical representation of handwriting, although textual in nature, is handled as an image or as vectorized handwriting information and not as the characters or words which are written "Textual," on the other hand, means one or more characters or words If the system is set up for an English-speaking user, the characters may be from the ASCII character set If the system is set up for a non-English-speaking user, the characters will be from the character set of the user's language (e g , Kanji characters for a Japanese user)
The set-top box of the receiving station is configured with a device for non-textual data entry In one embodiment, this device is a graphics tablet on which the user can inscribe (write) the information required by the application The graphics tablet digitizes the user's handwriting and stores the digitized data as an image file The set-top box transmits the file containing the digitized data to the broadcast station for conversion into a textual form (The image data need not all be contained in a single file, and may be transmitted to the server in segments to spread out the processing load ) The broadcast station includes a server which is configured to transform image data into textual data using handwriting recognition algorithms The broadcast station then transmits the textual data back to the receiving station The application executing on the set- top box identifies the received textual data as corresponding to the image data which it earlier transmitted to the broadcast station The set-top box accordingly utilizes the textual data as if it had been entered as text by the user of the set-top box and continues execution of the interactive television application Referring to Fig 1, a block diagram illustrating a system for distribution of interactive television applications and television programs from their sources to a series of viewers is shown Broadcast station 10 has a television program source 11 and an interactive application source 12 The television program source may include remote broadcast network feeds, videotape recorders, computers, data storage devices, and the like Interactive application sources 12 may provide interactive applications, control information or audio or video information which is to be included in the interactive television signal Additionally, broadcast station 10 includes a server 13, which is used to process non-textual data received at the broadcast station and generate textual data The information generated by the television program source 11, interactive application source 12 and server 13 is typically processed by compression/packetization units 14-16 before it is broadcast Usually, the information is also compressed in order to conserve bandwidth (It should be noted that while this embodiment comprises a digital broadcast system, other embodiments may comprise analog broadcast systems in conjunction with means for transmitting digital application data The analog broadcast systems may include existing television networks, and the means for transmitting the application data may include transmissions m the vertical blanking interval or parallel transmissions via telephone lines or out-of-band cable signals ) Any of a number of compression algorithms, such as one of the Motion Picture Expert Group (MPEG) compression standards, may be used if appropriate for a particular program or application The information is packetized to enable error checking, interleaving of data and other transmission-related functions Additional data may accordingly be appended to the application and programming data For example, error check sums may be appended for error detection/correction and time stamps may be included for the purpose of synchronizing associated audio and video signals The packetized information from compression/packetization units 14-16 is fed into multiplexing unit 17, which intersperses the packets prior to transmission The interspersed packets are then broadcast to the receiving stations 20 (Although only one receiving station is shown in the figure, it is contemplated that the audio-video-interactive signal is broadcast to a group of subscribing receiving stations ) In the figure, the audio-video-mteractive signal is depicted as being transmitted via satellite broadcast through antenna 19
The broadcast signal is relayed by communications satellite 28 and received by receiving station 20 Although the figure illustrates a satellite transmission, it is contemplated that any broadcast medium (e g , CATV or direct satellite transmission) may be used Receiving station 20 is contemplated to be one of a number of such stations which are subscribers of the broadcast service provider operating broadcast station 10 The broadcast signal is collected by receiving antenna 21 and fed to set-top box 22 Set-top box 22 processes the packetized signal to reconstruct the television programs and interactive applications embodied in the signal The reconstructed applications are executed m the set-top box, while the reconstructed television programs are passed to television 23, where they are displayed (It is noted that television 23 can be any suitable monitor or display device ) The interactive applications may generate graphics or audio which are combined with the television program prior to being displayed The interactive applications may also be displayed m place of the television programming
In addition to the broadcast channel between the broadcast station and receiving station, there may be other channels, such as a modem channel (which may also be referred to as an http channel, or hypertext transfer protocol channel ) These types of channels serve two functions in the system they allow the set-top box to provide data to the broadcast station, and they provide an alternate path for from sources 11-13 to be delivered to receiving station 20 It should also be noted that, if receiving station 20 and server 13 are connected via a transmission medium other than the broadcast station's broadcast channel and return path, the server is not constrained to be located at the broadcast station It may be more convenient to locate server 13 at a site which is separate from broadcast station 10 in order to reduce the workload of the broadcast station, to more efficiently convey data between the receiving station and the server, or for other reasons
Referring to Fig 2, a block diagram of a set-top box 22 in one embodiment is shown The broadcast signal is received and fed into tuner 31 Tuner 31 selects the channel on which the broadcast audio-video- mteractive signal is transmitted and passes the signal to processing unit 32 (Tuner 31 may be replaced by other means, all collectively referred to herein as input ports, for receiving signals from various signal sources ) Processing unit 32 demultiplexes the packets from the broadcast signal if necessary and reconstructs the television programs and/or interactive applications embodied in the signal The programs and applications are then decompressed by decompression unit 33 The audio and video information associated with the television programs embodied m the signal is then conveyed to display unit 34, which may perform further processing and conversion of the information into a suitable television format, such as NTSC or HDTV audio/video
Applications reconstructed from the broadcast signal are routed to random access memory (RAM) 37 and are executed by microprocessor 35 Graphics tablet 39 provides a means for the user to supply handwritten information for conversion to text and subsequent use by the applications (In other embodiments, graphics tablet 39 may be replaced by a microphone for supplying voice data or some other type of input device for supplying non-textual information to the system )
Microprocessor 35 may comprise various types of microprocessors, micro-controllers, digital signal processors (DSPs), or other types of software instruction processing devices, as are appropriate to the particular design RAM 37 may include memory units which are static (e g , SRAM), dynamic (e g , DRAM), volatile or non-volatile (e g , Flash memory), as required to support the functions of the set- top box When power is applied to the set-top box, microprocessor 35 executes operating system code which is stored in ROM 36 (In some embodiments, ROM 36 may comprise Flash memory or EEPROMs ) The operating system code executes continuously while the set-top box is powered m the same manner the operating system code of a typical personal computer (PC) and enables the set-top box to act on control information and execute interactive and other applications The set- top box also includes modem 38 Modem 38 provides both a return path by which viewer data can be transmitted to the broadcast station and an alternate path by which the broadcast station can transmit data to the set-top box
Although the term "set-top box" is used herein, it is understood that this term refers to any receiver or processing unit for receiving and processing a transmitted signal and conveying the processed signal to a television or other monitor The set-top box may be in a housing which physically sits on top of a television, it may be in some other location external to the television (e g , on the side or back of the television or remotely located from the television), or it may be incorporated into the television itself Alternatively, the functionality of the set-top box may be entirely removed from the television and placed in a more remote location, such as outside a house in which the set-top box is located Set-top box 22 serves to demodulate (if necessary) the signal received from broadcast station 10 and to separate the components of the signal, such as different television programs and interactive applications Other embodiments of the set-top box may have different components or interconnections than those shown in Fig 2 Similarly, television 23 may be a television or a video monitor employing any suitable television format (e g , NTSC or HDTV), or it may be replaced by other devices, such as a video recorder, depending on the particular embodiment Referring to Fig 3, a block diagram illustrating the components of a graphics tablet 50 is shown The mam components of graphics tablet 50 are display 51, digitizer 52 and processor 53 Graphics tablet 50 can have a wide range of sizes, from several inches across (as used in pen computing systems) to several feet across (as used in some CAD systems) It is contemplated that the most convenient size for the graphics tablet will be the size of a small or medium sized notepad The surface of graphics tablet 50 should be flat and smooth to facilitate use as a writing surface Although graphics tablet 50 is contemplated to be connected to set-top box 22 by a data cable, infrared transmission or any other suitable transmission means may also be used to transmit data between graphics tablet 50 and set-top box 22 To the extent that graphics tablet 50 is not constrained by these data transmission means, the user should be able to move the tablet to a convenient and comfortable writing position Display 51 and digitizer 52 are overlaid so that the path of a digitizer pen (not shown) can be displayed as the user writes on the tablet, thereby simulating writing on a notepad Display 51 is contemplated to be a flat panel display using LCD (liquid crystal diode) or similar technologies These technologies use substances this those between two plates of glass to control the amount of light which passes through the display Depending upon the chosen technology, the display may simply control the amount of light from any source behind the display, or it may generate varying amounts of light from each pixel in the display The display can be configured to provide feedback to the user by displaying his or her handwriting, or it may display icons or other images representative of the user's input The display may also be configured to display forms, or other prompts, m response to which the user can provide handwritten information In an alternate embodiment, graphics tablet 50 may be simplified by eliminating display 51 In such an embodiment, the user could instead receive visual feedback, including prompts, forms, handwriting and other images through television 23
The combination of display 51 and digitizer 52 should be selected to provide sufficient resolution to allow an accurate digital representation of the user's handwriting It is contemplated that the resolution of the digitizer and display should be roughly equivalent to that of a fairly high-quality printer The combination should also provide display quality (e g , brightness and contrast) which makes the handwritten information easy for the user to see Microprocessor 53 should be fast enough that the path of the digitizer pen is displayed without any substantial delay as it is drawn by the user Although microprocessor 53 is shown separately from microprocessor 35, it is contemplated that a single device may be sufficient to perform both functions Digitizer 52 may also be constructed using one of a number of technologies Although early technologies often used opaque structures, technologies used m modern digitizers are more likely to incorporate transparent panels which are designed to be used in conjunction with overlaid displays Digitizer 52 may use electrostatic, resistive film or capacitive/electrostatic film technologies Various embodiments of the invention may incorporate different ones of the technologies which are best suited to the particular embodiments Digitizer 52 senses the position of the digitizer pen and transmits the corresponding X-Y coordinates to the set- top box In one embodiment, this X-Y data is recorded as a series of darkened pixels m a bitmap image of the text written by the user The image is then transmitted to the server for processing and recognition of the written data The recognition of the bitmap image is sometimes referred to as off-line recognition
In an alternate embodiment, digitizer 52 may sense not only X-Y position, but also the direction of the pen's path, the pen's angle, speed and acceleration, and other information which forms a vectorized representation of the user's handwriting This vectorized information can be transmitted to the server as it is produced so that recognition can be performed as the user is writing The use of vectorized information to recognize handwritten information is sometimes referred to as on-line recognition
The receiving station is operatively connected to the broadcast station by a broadcast channel This broadcast channel can utilize various transmission media and is contemplated to include media such as coaxial cable and free space (e g , as used for direct satellite transmissions ) The broadcast channel forms a transmission path between the broadcast station and the receiving station The broadcast station and receiving station are also connected by a return path The return path typically consists of a pair of modems, one in the receiving station and one in the broadcast station, each connected to a standard telephone line Other means for establishing a return path (e g , using a portion of the bandwidth of the broadcast channel) are also contemplated The system illustrated m Fig 1 utilizes the broadcast channel to establish a return path for communicating data from the receiving station to the server
In one embodiment, an application is transmitted from the broadcast station to the receiving station, where it is reconstructed and executed In alternate embodiments, the application may be resident in the set-top box or may be provided to the set-top box by means other than the broadcast channel (e g , flash card ) Fig 4 is a flow diagram illustrating the flow of handwritten textual information m the system While the application is executing 61, it prompts the user for some sort of textual information 62 The user enters the information on the graphics tablet 63, writing the information in his or her normal handwriting The graphics tablet digitizes the image of the user's writing and conveys the digitized data to the set-top box, which transmits the information to the server 64 via the return path to the broadcast station The image data is received by the broadcast station and passed on to the server The server processes the image data 65, recognizing characters and/or words in the image and producing the equivalent textual data This textual data is then transmitted back to the receiving station 66, either via the broadcast channel or via the return channel The textual data is then utilized by the application executing m the set-top box 67 as if the text had been directly entered by the user
Fig 5 is a flow diagram illustrating the manner in which the server's handwriting recognition application processes the image data to produce textual data An off-line, bitmap recognition system is used in this embodiment It is assumed in this instance that the user's handwriting is not contained in a form, but is instead free-form input to the graphics tablet Further, in one embodiment, the handwriting image is transmitted to the server in segments as they are generated by the graphics tablet This may serve both to reduce the peak loading of the server resulting from recognition of the image and to allow the server to provide simultaneous feedback (l e , recognized characters) to the user ("Simultaneous" as used here means that some recognized text is transmitted to the set-top box as the user continues to write on the graphics tablet so that the user can see the results of the handwriting recognition ) If the image instead combines text with a known form, the form is first identified and removed Then, if the handwriting was confined to several boxes or fields on the form, these fields are isolated and handled individually (Free-form handwritten entries may be distinguished from each other in the same manner as form entries by providing different dialog boxes in which the user can write the entries.)
The handwriting in a field is recognized by first breaking the image into segments showing characters or pieces of characters 71. The individual image segments and combinations of these image segments are selected as character candidates 72. These character candidates are assigned character classes and associated values representative of the confidence with which the recognition application places the segment in the associated character class 73. The character candidates are organized into groups for which a dictionary lookup algorithm can be performed 74. A dictionary look-up determines the word entries which best match the groups of character candidates 75 and may assign a level of confidence to each word. If desired, an acceptability check can be performed to determine whether the confidence level of the recognized words is sufficiently high 76. If the confidence level is too low, the word can be rejected and the image will be considered unrecognizable. The user can also be given the option of checking the textual output of the recognition application and accepting or rejecting all or part of the output 77. After the user has verified the accuracy of the recognized text, the data can be provided to the application executing in the set-top box. It should be noted that the description is intended to be illustrative rather than restrictive and that the process of recognizing vectorized handwriting data or voice data will differ from the foregoing description.
The recognition of the user's handwriting may be assisted by the association of contextual information with the handwriting. If the user's handwriting comprises entries on a form, identifying the type of information requested for each entry may make it easier to interpret the entry itself. For example, an entry in a box requesting a social security number should contain nine digits. A character in this entry which might be interpreted as a "1", an "I" or and "1" must be the numeral "1". The identification of keywords such as "to" or "cc:" may likewise distinguish the handwriting which follows as a name or address.
The system described above can be used with a number of different applications. For example, an interactive television service provider may wish to provide e-mail service to subscribers. The user can select the e-mail application furnished by the service provider and proceed to write the message which he or she wishes to send on the graphics tablet. In the message, the user writes the address of the intended recipient and the message to be sent to the recipient. The graphical data is transmitted to the server, which may segment the image data and then convert the data to text, or it may convert the entire image to text and then parse the text to determine the recipient 's address. In an alternate embodiment, only the address of the message is converted to text while the body of the message is transmitted to the recipient as an image. Although the address of the e- mail must be computer-readable so it can be properly routed to the recipient, the body of the message need not be converted to text — the recipient can read the message embodied in the image whether or not it is converted to text data. In fact, it may in some cases be preferable to deliver some messages as image data so that the sender can communicate drawings or other non-textual data. Delivery of the image of a handwritten note may also be preferable because simple text messages may be considered impersonal.
The user may be given the option of selecting an addressee from an address book or a list of previous addressees. A menu can be presented to the user via the graphics tablet to allow selection of one of these addressees. New addressees which are handwritten can be added to the user's address book. The user can also select other options (e.g., sending an image versus sending text only) in the same manner as selecting an addressee The information which the user enters via the graphics tablet may therefore be a mix of handwriting and selection of particular predefined inputs A system which uses a microphone for non- textual input may also be configured to allow selection of predefined inputs by using audio prompts and corresponding menus
In another embodiment, the system may be configured to send faxes instead of e-mail The foregoing description of the e-mail-configured system is in large part applicable to the fax-configured system The user can enter a destination fax number and/or addressee via the graphics tablet, and this information can then be transmitted to the server for conversion into textual information The textual information is then transmitted back to the set-top box and used by the fax application to dial the destination fax As for the e-mail-configured system, the transmitted fax may be an image of the user's handwriting, or it may be an image of textual information corresponding to the user's handwriting (It should be noted that in this instance, the system may convert the user's handwriting to textual information so that the information is more legible than handwriting or so that more information will fit on a page, but the transmitted fax will by its nature consist of image data rather than textual data )
Another example of an application with which the system can be employed is electronic commerce service Electronic commerce services are those through which the user can buy items or otherwise conduct business These services include online catalogs and home shopping services When using an online catalog, the user can browse through the catalog to determine which products he or she would like to order The user can provide item information via menu entries, but must enter non-standardized information such as a shipping address via the graphics tablet In another embodiment, the user can enter his information by voice The user can use a microphone or a telephone handset to provide voice data to the system The microphone may a special-purpose microphone for use with the interactive television system or it may be a telephone handset A special-purpose microphone may be connected to the set-top box, or it may be built into a remote control for the system A telephone handset may be connected to the set-top box, or it may be connected directly to the return path (I e , telephone line ) The voice data is transmitted to the server, which uses voice recognition software to convert the voice data into textual data The textual data is returned to the set-top box, where it can be displayed to the user The user can correct the text or confirm that the text has been accurately generated from the voice data
While the present invention has been described with reference to particular embodiments, it will be understood that the embodiments described above are illustrative and that the scope of the invention is not limited to these embodiments Many variations, modifications, additions and improvements to the described embodiments are possible These variations, modifications, additions and improvements are intended to be within the scope of the invention as detailed within the following claims

Claims

WHAT IS CLAIMED IS:
1. An interactive television system comprising: a receiving station having an input device configured to receive non-textual information from a user; and a server coupled to said receiving station by a transmission medium, wherein said server is configured to receive said non-textual information from said receiving station and to convert said non-textual information into textual information, wherein said server is further configured to provide said textual information to one or more interactive applications executing withm said interactive television system
2. The interactive television system of claim 1 wherein said receiving station includes a microprocessor configured to execute one of said one or more interactive applications, wherein said server is configured to transmit said textual information to said receiving station, and wherein said receiving station is configured to receive said textual information from said server and provide said textual information to said application.
3. The mteractive television system of claim 2 further comprising a broadcast station coupled to said receiving station by a broadcast channel.
4 The interactive television system of claim 3 wherein said server is coupled to said broadcast station and wherein said transmission medium comprises a return path between said broadcast station and said receiving station
5. The interactive television system of claim 4 wherein said receiving station includes a first modem, wherein said broadcast station includes a second modem and wherein said return path comprises a telephone line coupled to said first and second modems
6 The interactive television system of claim 4 wherein said return path comprises a portion of the bandwidth of said broadcast channel between said broadcast station and said receiving station
7 The mteractive television system of claim 2 wherein said input device comprises a graphics tablet.
8. The interactive television system of claim 7 wherein said graphics tablet comprises a digitizer configured to generate data corresponding to handwriting inscribed thereon
9 The interactive television system of claim 8 wherein said graphics tablet further comprises a display overlaid with said digitizer and configured to display an image of said handwriting as said handwriting is mscπbed on said digitizer.
10 The mteractive television system of claim 2 wherein said receiving station further comprises a display and wherein said receiving station is configured to present said textual information on said display for verification of said textual information by said user
11 The interactive television system of claim 2 wherein said input device comprises a microphone
12 A set-top box for use in an interactive television system having a server configured to convert said non-textual information to corresponding textual information, the set-top box comprising a microprocessor configured to execute an interactive application which requires textual information from a user, an input device for receiving non-textual information from said user, transmitting means coupled to said input device for transmitting said non-textual information to a server, and receiving means coupled to said microprocessor for receiving said corresponding textual information from said server, wherein said receiving means is configured to convey said corresponding textual information to said microprocessor for use by said interactive application
13 The set-top box of claim 12 wherein said input device comprises a graphics tablet configured to receive handwriting inscribed thereon by said user and wherein said non-textual information comprises digitized information corresponding to said handwriting
14 The set-top box of claim 12 wherein said input device comprises a microphone configured to receive voice information
15 The set-top box of claim 12 wherein said transmitting means comprises a modem
16 The set- top box of claim 12 wherein said receiving means comprises a broadcast receiver
17 The set-top box of claim 12 wherein said microprocessor is configured to prompt said user for said non- textual information, to receive said non- textual information and to provide said non-textual information to said transmitting means
18 A method implemented m an interactive television system for providing textual input to an interactive application executing in the system, the method comprising providing non- textual information to said interactive television system, conveying said non-textual information to a server, converting said non-textual information to textual information in said server, providing said textual information to said application
19. The method of claim 18 wherein said application is executing on a set-top box and wherein the method further comprises conveying said textual information to said set-top box.
20. The method of claim 19 wherein providing said non-textual information comprises writing characters on a graphics tablet coupled to said interactive television system and wherein said graphics tablet is configured to digitize said characters and produce said non-textual information therefrom.
21. The method of claim 20 wherein converting said non-textual information to said textual information comprises executing a handwriting recognition application on said server and providing said non-textual information to said application to produce said textual information.
22. The method of claim 19 wherein providing said non-textual information comprises speaking into a microphone coupled to said interactive television system and wherein said microphone is configured to produce said non-textual information from spoken words.
PCT/US1999/024710 1998-10-21 1999-10-19 An interactive television system and method for converting non-textual information to textual information by a remote server Ceased WO2000024198A1 (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
AT99962656T ATE248483T1 (en) 1998-10-21 1999-10-19 INTERACTIVE TELEVISION SYSTEM AND METHOD FOR CONVERTING NON-TEXT INFORMATION INTO TEXT INFORMATION BY A REMOTE SERVER
EP99962656A EP1127462B1 (en) 1998-10-21 1999-10-19 An interactive television system and method for converting non-textual information to textual information by a remote server
JP2000577835A JP4555480B2 (en) 1998-10-21 1999-10-19 Interactive television system and method for converting non-text information to text information by a remote server
CA002347006A CA2347006A1 (en) 1998-10-21 1999-10-19 An interactive television system and method for converting non-textual information to textual information by a remote server
BR9914740-8A BR9914740A (en) 1998-10-21 1999-10-19 Interactive television system and method for converting non-text information into text information by a remote server
DE69910813T DE69910813T2 (en) 1998-10-21 1999-10-19 INTERACTIVE TV SYSTEM AND METHOD FOR CONVERTING NON-TEXT INFORMATION INTO TEXT INFORMATION BY A REMOTE SERVER
AU19057/00A AU755646B2 (en) 1998-10-21 1999-10-19 An interactive television system and method for converting non-textual information to textual information by a remote server
HK02101415.3A HK1042001B (en) 1998-10-21 1999-10-19 An interactive television system and method for converting non-textual information to textual information by a remote server

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/176,611 1998-10-21
US09/176,611 US6345389B1 (en) 1998-10-21 1998-10-21 Interactive television system and method for converting non-textual information to textual information by a remote server

Publications (1)

Publication Number Publication Date
WO2000024198A1 true WO2000024198A1 (en) 2000-04-27

Family

ID=22645087

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/024710 Ceased WO2000024198A1 (en) 1998-10-21 1999-10-19 An interactive television system and method for converting non-textual information to textual information by a remote server

Country Status (12)

Country Link
US (1) US6345389B1 (en)
EP (1) EP1127462B1 (en)
JP (1) JP4555480B2 (en)
CN (1) CN1130913C (en)
AT (1) ATE248483T1 (en)
AU (1) AU755646B2 (en)
BR (1) BR9914740A (en)
CA (1) CA2347006A1 (en)
DE (1) DE69910813T2 (en)
ES (1) ES2207315T3 (en)
HK (1) HK1042001B (en)
WO (1) WO2000024198A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001095625A1 (en) * 2000-06-08 2001-12-13 Agile Tv Corporation System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery
WO2002073965A1 (en) * 2001-03-14 2002-09-19 Telefonaktiebolaget Lm Ericsson (Publ) Interactive multimedia system
GB2377336A (en) * 2001-07-04 2003-01-08 Digital Video Networks Ltd Transmitting messages between set top box users via telephone and broadcast networks
DE10340580A1 (en) * 2003-09-01 2005-03-24 Klimek, Winfried M. Set-top box, video recorder or television with speech recognition capability, whereby spoken commands or input are recorded locally and transmitted over a network to a central speech recognition processing unit
US7181759B2 (en) 2000-05-26 2007-02-20 Bigband Networks, Inc. System and method for providing interactivity for end-users over digital broadcast channels
EP1425706A4 (en) * 2001-08-08 2009-06-24 Lenovo Singapore Pte Ltd System and method for collaborative handwriting input
EP2752763A3 (en) * 2013-01-07 2015-06-17 Samsung Electronics Co., Ltd Display apparatus and method of controlling display apparatus
US10856031B2 (en) 2003-04-15 2020-12-01 MedialP, Inc. Method and apparatus for generating interactive programming in a communication network
USRE48579E1 (en) 2002-04-15 2021-06-01 Media Ip, Inc. Method and apparatus for internet-based interactive programming

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7177429B2 (en) 2000-12-07 2007-02-13 Blue Spike, Inc. System and methods for permitting open access to data objects and for securing data within the data objects
US7257132B1 (en) 1998-02-26 2007-08-14 Hitachi, Ltd. Receiver set, information apparatus and receiving system
US7664264B2 (en) 1999-03-24 2010-02-16 Blue Spike, Inc. Utilizing data reduction in steganographic and cryptographic systems
WO2001018628A2 (en) 1999-08-04 2001-03-15 Blue Spike, Inc. A secure personal content server
GB2360903B (en) * 2000-03-30 2004-09-08 Mitel Corp Telephone with handwriting recognition
US7613790B2 (en) * 2000-07-13 2009-11-03 Biap Systems, Inc. Apparatus for and method of executing customized interactive computing services in a broadband network environment
WO2002023389A1 (en) * 2000-09-15 2002-03-21 Robert Fish Systems and methods for translating an item of information using a distal computer
US20020107885A1 (en) * 2001-02-01 2002-08-08 Advanced Digital Systems, Inc. System, computer program product, and method for capturing and processing form data
US20030023981A1 (en) * 2001-07-25 2003-01-30 Thomas Lemmons Method and apparatus for transmission of interactive and enhanced television data
US7324947B2 (en) 2001-10-03 2008-01-29 Promptu Systems Corporation Global speech user interface
US20030142129A1 (en) * 2002-01-31 2003-07-31 Kleven Michael L. Content processing and distribution systems and processes
US7287275B2 (en) 2002-04-17 2007-10-23 Moskowitz Scott A Methods, systems and devices for packet watermarking and efficient provisioning of bandwidth
US7174042B1 (en) * 2002-06-28 2007-02-06 Microsoft Corporation System and method for automatically recognizing electronic handwriting in an electronic document and converting to text
US7185278B1 (en) 2002-06-28 2007-02-27 Microsoft Corporation Separating and moving document objects using the movement of a wiper bar
US7188309B2 (en) 2002-06-28 2007-03-06 Microsoft Corporation Resolving document object collisions
US7079713B2 (en) * 2002-06-28 2006-07-18 Microsoft Corporation Method and system for displaying and linking ink objects with recognized text and objects
US7751623B1 (en) 2002-06-28 2010-07-06 Microsoft Corporation Writing guide for a free-form document editor
US7259752B1 (en) 2002-06-28 2007-08-21 Microsoft Corporation Method and system for editing electronic ink
US20040244031A1 (en) * 2003-02-26 2004-12-02 Peter Martinez System and method for a network of interactive televisions
CA2539442C (en) * 2003-09-17 2013-08-20 Nielsen Media Research, Inc. Methods and apparatus to operate an audience metering device with voice commands
US20080239167A1 (en) * 2004-01-26 2008-10-02 Koninklijke Philips Electronic, N.V. Remote Control of Interactive Television by Telephone
US7358965B2 (en) * 2004-02-18 2008-04-15 Microsoft Corporation Tapping to create writing
US7721226B2 (en) 2004-02-18 2010-05-18 Microsoft Corporation Glom widget
US7659890B2 (en) * 2004-03-19 2010-02-09 Microsoft Corporation Automatic height adjustment for electronic highlighter pens and mousing devices
US7958115B2 (en) * 2004-07-29 2011-06-07 Yahoo! Inc. Search systems and methods using in-line contextual queries
US7684433B2 (en) * 2005-03-31 2010-03-23 Sony Corporation Method and apparatus for bi-directional communication between analog and digital devices
US7526737B2 (en) * 2005-11-14 2009-04-28 Microsoft Corporation Free form wiper
US9135809B2 (en) * 2008-06-20 2015-09-15 At&T Intellectual Property I, Lp Voice enabled remote control for a set-top box
US20090320076A1 (en) * 2008-06-20 2009-12-24 At&T Intellectual Property I, L.P. System and Method for Processing an Interactive Advertisement
US9124769B2 (en) 2008-10-31 2015-09-01 The Nielsen Company (Us), Llc Methods and apparatus to verify presentation of media content
US8275623B2 (en) 2009-03-06 2012-09-25 At&T Intellectual Property I, L.P. Method and apparatus for analyzing discussion regarding media programs
US20120297418A1 (en) * 2011-05-20 2012-11-22 Sony Corporation External application processor for multimedia devices
KR101309794B1 (en) * 2012-06-27 2013-09-23 삼성전자주식회사 Display apparatus, method for controlling the display apparatus and interactive system
US10194284B2 (en) * 2012-09-12 2019-01-29 Digit International Inc. Embedded communication in message based transports
CN104469544A (en) * 2014-11-07 2015-03-25 重庆晋才富熙科技有限公司 Video marking method based on voice technology
US10348690B2 (en) * 2016-07-08 2019-07-09 Xattic, Inc. Secure message inoculation

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0633661A1 (en) * 1993-07-05 1995-01-11 Texas Instruments Incorporated Remote control systems
EP0689155A2 (en) * 1994-06-21 1995-12-27 Canon Kabushiki Kaisha Character recognizing apparatus and method of controlling the same
EP0838945A2 (en) * 1996-10-25 1998-04-29 Matsushita Electric Industrial Co., Ltd. Video user's environment

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0594564A (en) * 1991-10-01 1993-04-16 Toshiba Corp Data processing system
US5861881A (en) * 1991-11-25 1999-01-19 Actv, Inc. Interactive computer system for providing an interactive presentation with personalized video, audio and graphics responses for multiple viewers
US5528743A (en) * 1993-05-27 1996-06-18 Apple Computer, Inc. Method and apparatus for inserting text on a pen-based computer system
US5546538A (en) * 1993-12-14 1996-08-13 Intel Corporation System for processing handwriting written by user of portable computer by server or processing by the computer when the computer no longer communicate with server
KR100383713B1 (en) * 1994-08-12 2003-07-07 에이씨티브이, 인크. Interactive program reception and generation system, and how to provide a single complex interactive signal
JPH0964840A (en) * 1995-08-24 1997-03-07 Sony Corp Information processing apparatus and method, broadcasting apparatus, and data collection method
JPH0993555A (en) * 1995-09-28 1997-04-04 Toshiba Corp TV receiver with telephone ordering function
GB9523869D0 (en) * 1995-11-22 1996-01-24 Philips Electronics Nv Interactive television
US5831664A (en) * 1995-12-15 1998-11-03 Mediaone Group, Inc. Method and system for synchronizing data between at least one mobile interface device and an interactive terminal
CN1096796C (en) * 1996-02-23 2002-12-18 E盖德公司 Device and method for two-way interactive television
US5875448A (en) * 1996-10-08 1999-02-23 Boys; Donald R. Data stream editing system including a hand-held voice-editing apparatus having a position-finding enunciator
JPH10124291A (en) * 1996-10-17 1998-05-15 Casio Comput Co Ltd Mobile terminal speech recognition communication system
JPH10174082A (en) * 1996-12-13 1998-06-26 Toshiba Corp Interactive TV broadcasting system and file access method applied to the system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0633661A1 (en) * 1993-07-05 1995-01-11 Texas Instruments Incorporated Remote control systems
EP0689155A2 (en) * 1994-06-21 1995-12-27 Canon Kabushiki Kaisha Character recognizing apparatus and method of controlling the same
EP0838945A2 (en) * 1996-10-25 1998-04-29 Matsushita Electric Industrial Co., Ltd. Video user's environment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"SPEECH RECOGNITION METHODS FOR CONTROLLING CABLE TELEVISION", IBM TECHNICAL DISCLOSURE BULLETIN,US,IBM CORP. NEW YORK, vol. 38, no. 8, 1 August 1995 (1995-08-01), pages 285 - 287, XP000534519, ISSN: 0018-8689 *

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7181759B2 (en) 2000-05-26 2007-02-20 Bigband Networks, Inc. System and method for providing interactivity for end-users over digital broadcast channels
WO2001095625A1 (en) * 2000-06-08 2001-12-13 Agile Tv Corporation System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery
US7047196B2 (en) 2000-06-08 2006-05-16 Agiletv Corporation System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery
USRE44326E1 (en) 2000-06-08 2013-06-25 Promptu Systems Corporation System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery
WO2002073965A1 (en) * 2001-03-14 2002-09-19 Telefonaktiebolaget Lm Ericsson (Publ) Interactive multimedia system
GB2377336A (en) * 2001-07-04 2003-01-08 Digital Video Networks Ltd Transmitting messages between set top box users via telephone and broadcast networks
GB2377336B (en) * 2001-07-04 2005-07-20 Digital Video Networks Ltd A system and method for transmission of data
EP1425706A4 (en) * 2001-08-08 2009-06-24 Lenovo Singapore Pte Ltd System and method for collaborative handwriting input
USRE48579E1 (en) 2002-04-15 2021-06-01 Media Ip, Inc. Method and apparatus for internet-based interactive programming
US11477506B2 (en) 2003-04-15 2022-10-18 MediaIP, LLC Method and apparatus for generating interactive programming in a communication network
US11575955B2 (en) 2003-04-15 2023-02-07 MediaIP, LLC Providing interactive video on demand
US10856031B2 (en) 2003-04-15 2020-12-01 MedialP, Inc. Method and apparatus for generating interactive programming in a communication network
US11076190B2 (en) 2003-04-15 2021-07-27 MedialP, Inc. Method and apparatus for generating interactive programming in a communication network
DE10340580A1 (en) * 2003-09-01 2005-03-24 Klimek, Winfried M. Set-top box, video recorder or television with speech recognition capability, whereby spoken commands or input are recorded locally and transmitted over a network to a central speech recognition processing unit
EP2752763A3 (en) * 2013-01-07 2015-06-17 Samsung Electronics Co., Ltd Display apparatus and method of controlling display apparatus

Also Published As

Publication number Publication date
AU1905700A (en) 2000-05-08
HK1042001B (en) 2004-07-23
EP1127462B1 (en) 2003-08-27
AU755646B2 (en) 2002-12-19
DE69910813T2 (en) 2004-06-17
BR9914740A (en) 2001-07-03
CA2347006A1 (en) 2000-04-27
EP1127462A1 (en) 2001-08-29
US6345389B1 (en) 2002-02-05
JP2002528808A (en) 2002-09-03
CN1326641A (en) 2001-12-12
HK1042001A1 (en) 2002-07-26
ES2207315T3 (en) 2004-05-16
ATE248483T1 (en) 2003-09-15
DE69910813D1 (en) 2003-10-02
CN1130913C (en) 2003-12-10
JP4555480B2 (en) 2010-09-29

Similar Documents

Publication Publication Date Title
US6345389B1 (en) Interactive television system and method for converting non-textual information to textual information by a remote server
US5561708A (en) Method and apparatus for interactive television through use of menu windows
US5900905A (en) System and method for linking video, services and applications in an interactive television system
US7243127B2 (en) Network-based document delivery system with receipt and display verification
US6928652B1 (en) Method and apparatus for displaying HTML and video simultaneously
US20030043190A1 (en) Website chat room having images displayed simultaneously with interactive chatting
US20110001758A1 (en) Apparatus and method for manipulating an object inserted to video content
US20020175944A1 (en) System and method for providing a context-sensitive instructional user interface icon in an interactive television system
EP2268052A1 (en) Cradle for mobile telephone, videophone system, karaoke system, car navigation system, and emergency information notification system
US20040008229A1 (en) Reconfigurable user interface
CN113157157B (en) Live broadcast-based interactive image management method, device, equipment and storage medium
MXPA02004025A (en) Virtual on demand electronic book.
US9565466B2 (en) Video processing method and video processing system
CN118250480A (en) Live broadcast interaction method, device, equipment and storage medium
CN116057936A (en) Barrage art management system, barrage art management method, barrage art management program, and computer-readable recording medium
CN1432254A (en) Providing advertising information with video content
CN111510372A (en) Multimedia publicity page promotion method and system with electronic business card
US20060253880A1 (en) System for adapting television entertainment network setup in remote locations
US20090234723A1 (en) Publicly generated advertisement system and method
MXPA01010299A (en) Electronic book alternative delivery methods.
KR100638348B1 (en) Set-top box system for handwritten e-mail
Gilligan et al. Videotext technology: An overview with special reference to transaction processing as an interactive service or everything you never wanted to know about text on TV and were terrified that someone would tell you
EP1189441A2 (en) Context-sensitive information delivery service
US20140215511A1 (en) Television interactive system
US11574134B2 (en) Systems and methods of processing a document in an imaging device

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 99813413.9

Country of ref document: CN

WWE Wipo information: entry into national phase

Ref document number: 19057/00

Country of ref document: AU

AK Designated states

Kind code of ref document: A1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
ENP Entry into the national phase

Ref document number: 2347006

Country of ref document: CA

Ref document number: 2347006

Country of ref document: CA

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 1999962656

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2000 577835

Country of ref document: JP

Kind code of ref document: A

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP Wipo information: published in national office

Ref document number: 1999962656

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 19057/00

Country of ref document: AU

WWG Wipo information: grant in national office

Ref document number: 1999962656

Country of ref document: EP