KR20140042637A

KR20140042637A - Image processing apparatus and control method thereof, image processing system

Info

Publication number: KR20140042637A
Application number: KR1020130057262A
Authority: KR
Inventors: 이주영; 박상신
Original assignee: 삼성전자주식회사
Priority date: 2012-09-28
Filing date: 2013-05-21
Publication date: 2014-04-07
Anticipated expiration: 2032-10-18
Also published as: RU2013103490A; KR101877430B1; JP2014149548A; BR102013002349A2; RU2571520C2; KR20140039946A; MX2015003890A; JP2022008691A; MX341560B

Abstract

본 발명의 실시예에 따른 영상처리장치는, 외부로부터 수신되는 방송신호를 영상으로 표시되게 처리하는 영상처리부와; 서버에 통신 가능하게 접속되는 통신부와; 사용자의 발화가 입력되는 음성입력부와; 발화에 대응하는 음성 명령에 따라서 기 설정된 대응 동작이 수행되게 처리하는 음성처리부와; 음성입력부를 통해 발화가 입력되면 음성처리부 및 서버 중 어느 하나에 의해 발화에 대응하는 음성 명령이 처리되게 제어하는 제어부를 포함하며, 제어부는, 음성 명령이 방송 채널의 콜사인(call sign)에 관련된 키워드를 포함하는 경우에 음성처리부 및 서버 중 어느 하나에 의해 키워드에 대응하는 추천 콜사인이 기 설정된 선택조건에 따라서 선택되게 제어하고, 추천 콜사인의 방송 채널에 대하여 음성 명령에 따른 대응 동작을 수행하는 것을 특징으로 한다.An image processing apparatus according to an embodiment of the present invention includes an image processing unit for processing a broadcast signal received from the outside to be displayed as an image; A communication unit communicatively connected to the server; A voice input unit to which a user's utterance is input; A voice processing unit for processing a predetermined corresponding action according to a voice command corresponding to the utterance; And a controller for controlling a voice command corresponding to a utterance by any one of the voice processor and the server when a utterance is input through the voice input unit, wherein the controller includes a keyword related to a call sign of a broadcast channel. In the case of including the control unit to control the recommended call sign corresponding to the keyword selected by any one of the voice processing unit and the server according to the preset selection conditions, and performs a corresponding operation according to the voice command for the broadcast channel of the recommended call sign It is done.

Description

TECHNICAL FIELD [0001] The present invention relates to an image processing apparatus, a control method thereof, and an image processing system,

본 발명은 외부로부터 수신되는 방송신호 등의 영상신호를 영상으로 표시되게 처리하는 영상처리장치 및 그 제어방법, 영상처리 시스템에 관한 것으로서, 상세하게는 사용자의 음성 명령을 인식함으로써 해당 음성 명령에 대응하는 기능 또는 동작을 실행할 수 있는 구조의 영상처리장치 및 그 제어방법, 영상처리 시스템에 관한 것이다.The present invention relates to an image processing apparatus for processing a video signal such as a broadcast signal received from the outside to be displayed as an image, a control method thereof, and an image processing system. More particularly, And a control method thereof, and an image processing system.

영상처리장치는 외부로부터 수신되는 영상신호/영상데이터를 다양한 영상처리 프로세스에 따라서 처리한다. 영상처리장치는 처리된 영상신호를 자체 구비한 디스플레이 패널 상에 영상으로 표시하거나, 또는 패널을 구비한 타 디스플레이장치에서 영상으로 표시되도록 이 처리된 영상신호를 해당 디스플레이장치에 출력할 수 있다. 즉, 영상처리장치는 영상신호를 처리 가능한 장치라면 영상을 표시 가능한 패널을 포함하는 경우 및 패널을 포함하지 않는 경우 모두 포함할 수 있는 바, 전자의 경우의 예시로는 TV가 있으며, 후자의 경우의 예시로는 셋탑박스(set-top box)가 있다.The image processing apparatus processes image signal / image data received from the outside according to various image processing processes. The image processing apparatus can display the processed video signal on the display panel on its own display panel or output the processed video signal to the corresponding display device so as to be displayed as an image on the other display device having the panel. That is, the image processing apparatus can include both a case including a panel capable of displaying an image and a case not including a panel, as long as the apparatus can process a video signal. An example of the former case is a TV, An example of a set-top box is a set-top box.

영상처리장치는 기술의 발전에 따라서 다양한 기능의 추가 및 확장이 계속적으로 반영되고 있는 바, 이러한 추세에 따라서 영상처리장치에 있어서 사용자의 의도를 반영한 커맨드를 영상처리장치에 입력하는 구성도 다양한 구조 또는 방법이 제안되고 있다. 예를 들면, 종래에는 사용자 리모트 컨트롤러(remote controller) 상의 키/버튼을 누르면 리모트 컨트롤러가 사용자가 원하는 동작이 실행되도록 하는 제어신호를 영상처리장치에 전송하는 구성이었으나, 근래에는 영상처리장치가 사용자에 의한 모션 또는 발화 등을 감지하고, 감지된 내용을 분석하여 대응 동작을 실행시키는 등, 사용자의 의도를 반영하여 영상처리장치를 제어하는 다양한 구성이 제안되고 있다.As image processing apparatuses continue to reflect the addition and expansion of various functions in accordance with the development of the technology, a configuration in which a command reflecting the intention of the user in the image processing apparatus is input to the image processing apparatus in accordance with this trend may be variously structured A method has been proposed. For example, conventionally, when the user presses a key / button on a remote controller, the remote controller transmits a control signal to the image processing apparatus to allow the user to perform an operation desired by the user. In recent years, however, There have been proposed various configurations in which the image processing apparatus is controlled to reflect the user's intention, such as detecting motion or ignition by the user, analyzing the detected content, and executing a corresponding operation.

본 발명의 실시예에 따른 영상처리장치는, 외부로부터 수신되는 방송신호를 영상으로 표시되게 처리하는 영상처리부와; 서버에 통신 가능하게 접속되는 통신부와; 사용자의 발화가 입력되는 음성입력부와; 상기 발화에 대응하는 음성 명령에 따라서 기 설정된 대응 동작이 수행되게 처리하는 음성처리부와; 상기 음성입력부를 통해 상기 발화가 입력되면 상기 음성처리부 및 상기 서버 중 어느 하나에 의해 상기 발화에 대응하는 상기 음성 명령이 처리되게 제어하는 제어부를 포함하며, 상기 제어부는, 상기 음성 명령이 방송 채널의 콜사인(call sign)에 관련된 키워드를 포함하는 경우에 상기 음성처리부 및 상기 서버 중 어느 하나에 의해 상기 키워드에 대응하는 추천 콜사인이 기 설정된 선택조건에 따라서 선택되게 제어하고, 상기 추천 콜사인의 방송 채널에 대하여 상기 음성 명령에 따른 대응 동작을 수행하는 것을 특징으로 한다.
An image processing apparatus according to an embodiment of the present invention includes an image processing unit for processing a broadcast signal received from the outside to be displayed as an image; A communication unit communicatively connected to the server; A voice input unit to which a user's utterance is input; A voice processing unit for performing a predetermined corresponding operation according to a voice command corresponding to the utterance; And a controller configured to control the voice command corresponding to the utterance by any one of the voice processor and the server when the utterance is input through the voice input unit. In the case of including a keyword related to a call sign, one of the voice processor and the server controls the recommended call sign corresponding to the keyword to be selected according to a preset selection condition, and to the broadcast channel of the recommended call sign. It is characterized in that for performing the corresponding operation according to the voice command.

여기서, 상기 키워드에 대응하는 적어도 하나의 콜사인 후보의 데이터베이스가 상기 영상처리장치 및 상기 서버에 저장되며, 상기 추천 콜사인은 상기 데이터베이스로부터 검색된 복수의 상기 콜사인 후보 중에서 상기 선택조건에 따라서 선택될 수 있다.Here, a database of at least one call sign candidate corresponding to the keyword is stored in the image processing apparatus and the server, and the recommendation call sign may be selected according to the selection condition from the plurality of call sign candidates searched from the database.

여기서, 상기 선택조건은, 상기 영상처리장치의 사용 이력 정보에 기초하여 상기 복수의 콜사인 후보 중에서 선택 빈도가 기 설정 순위 이상인 콜사인 후보가 상기 추천 콜사인으로 선택될 수 있다.The selection condition may include selecting a call sign candidate having a selection frequency equal to or greater than a predetermined rank from among the plurality of call sign candidates based on usage history information of the image processing apparatus as the recommended call sign.

또는, 상기 선택조건은, 상기 복수의 콜사인 후보 중에서 상기 서버와 통신하는 복수의 타 영상처리장치에서의 선택 빈도가 기 설정 순위 이상인 콜사인 후보가 상기 추천 콜사인으로 선택될 수 있다.Alternatively, the selection condition may include selecting a call sign candidate having a predetermined frequency or more from a plurality of other image processing apparatuses communicating with the server from among the plurality of call sign candidates as the recommended call sign.

또한, 상기 추천 콜사인은 상기 복수의 콜사인 후보 중에서 하나 이상을 선택 가능하며, 상기 제어부는, 복수의 상기 콜사인 후보가 선택되면, 상기 선택된 복수의 콜사인 후보 중에서 어느 하나를 선택 가능하도록 제공하는 유아이 영상을 표시할 수 있다.The recommendation callsign may select one or more of the plurality of callsign candidates, and the controller may provide an image for the infant to select one of the plurality of selected callsign candidates when a plurality of callsign candidates are selected. I can display it.

여기서, 상기 제어부는, 상기 유아이 영상이 표시된 이후 기 설정된 시간 동안에 어느 하나의 상기 콜사인 후보를 선택하는 입력이 수행되지 않은 경우에, 상기 기 설정된 선택조건에 기초하여 어느 하나의 상기 추천 콜사인을 선택할 수 있다.The controller may select one of the recommended call signs based on the preset selection condition when the input for selecting one of the call sign candidates is not performed for a preset time after the image is displayed. have.

또한, 상기 통신부는 상기 발화를 텍스트의 음성 명령으로 변환하는 STT(speech-to-text)서버와 통신하며, 상기 제어부는, 상기 음성입력부에 상기 발화가 입력되면 상기 발화의 음성신호를 상기 STT서버로 전송하며, 상기 STT서버로부터 상기 발화에 대응하는 상기 음성 명령을 수신할 수 있다.The communication unit may communicate with a speech-to-text (STT) server that converts the speech into a voice command of text, and the controller may transmit the speech signal of the speech to the STT server when the speech is input to the voice input unit. The voice command corresponding to the utterance may be received from the STT server.

여기서, 상기 제어부는, 상기 음성 명령이 단문일 경우에 상기 음성 명령을 상기 음성처리부에 의해 처리되고, 상기 음성 명령이 대화문일 경우에 상기 음성 명령을 상기 서버에 의해 처리되게 제어할 수 있다.Here, the controller may control the voice command to be processed by the voice processing unit when the voice command is a short text, and to process the voice command by the server when the voice command is a conversation text.

또한, 상기 영상처리부에 의해 처리되는 방송신호를 영상으로 표시하는 디스플레이부를 더 포함할 수 있다.The display apparatus may further include a display unit configured to display a broadcast signal processed by the image processor as an image.

또한, 본 발명의 실시예에 따른 서버와 통신하는 영상처리장치의 제어방법은, 사용자의 발화가 입력되는 단계와; 상기 영상처리장치 및 상기 서버 중 어느 하나에 의해 상기 발화에 대응하는 음성 명령이 처리되게 제어하고, 상기 음성 명령에 따라서 기 설정된 대응 동작을 수행하는 단계를 포함하며, 상기 음성 명령에 따라서 기 설정된 대응 동작을 수행하는 단계는, 상기 음성 명령이 방송 채널의 콜사인에 관련된 키워드를 포함하는 경우에, 상기 영상처리장치 및 상기 서버 중 어느 하나에 의해 상기 키워드에 대응하는 추천 콜사인이 기 설정된 선택조건에 따라서 선택되게 제어하는 단계와; 상기 추천 콜사인의 방송 채널에 대하여 상기 음성 명령에 따른 대응 동작을 수행하는 단계를 포함하는 것을 특징으로 한다.In addition, the control method of the image processing apparatus communicating with the server according to an embodiment of the present invention, the step of inputting the user's speech; Controlling the voice command corresponding to the speech to be processed by any one of the image processing apparatus and the server, and performing a preset corresponding operation according to the voice command, wherein the preset response according to the voice command In the performing of the operation, when the voice command includes a keyword related to a call sign of a broadcast channel, the recommendation call sign corresponding to the keyword is set by one of the image processing apparatus and the server according to a preset condition. Controlling to be selected; And performing a corresponding operation according to the voice command on the broadcast channel of the recommended call sign.

또한, 상기 추천 콜사인은 상기 복수의 콜사인 후보 중에서 하나 이상을 선택 가능하며, 상기 음성 명령에 따라서 기 설정된 대응 동작을 수행하는 단계는, 복수의 상기 콜사인 후보가 선택된 경우에 상기 선택된 복수의 콜사인 후보 중에서 어느 하나를 선택 가능하도록 제공하는 유아이 영상을 표시하는 단계를 포함할 수 있다.The recommendation callsign may select one or more of the plurality of callsign candidates, and the performing of a preset corresponding operation according to the voice command may include selecting among the selected callsign candidates when a plurality of callsign candidates are selected. The infant may be configured to display an image.

여기서, 상기 유아이 영상을 표시하는 단계는, 상기 유아이 영상이 표시된 이후 기 설정된 시간 동안에 어느 하나의 상기 콜사인 후보를 선택하는 입력이 수행되지 않은 경우에, 상기 기 설정된 선택조건에 기초하여 어느 하나의 상기 추천 콜사인을 선택하는 단계를 포함할 수 있다.The displaying of the image by the infant may include performing any one of the ones based on the predetermined selection condition when an input for selecting one of the call sign candidates is not performed for a preset time after the image is displayed. The method may include selecting a recommended call sign.

또한, 상기 영상처리장치는 상기 발화를 텍스트의 음성 명령으로 변환하는 STT서버와 통신하며, 상기 사용자의 발화가 입력되는 단계는, 상기 발화의 음성신호를 상기 STT서버로 전송하는 단계와; 상기 STT서버로부터 상기 발화에 대응하는 상기 음성 명령을 수신하는 단계를 포함할 수 있다.In addition, the image processing apparatus communicates with the STT server for converting the speech into a voice command of the text, the step of inputting the user's speech, the step of transmitting the speech signal of the speech to the STT server; And receiving the voice command corresponding to the utterance from the STT server.

여기서, 상기 음성 명령에 따라서 기 설정된 대응 동작을 수행하는 단계는, 상기 음성 명령이 단문일 경우에 상기 음성 명령을 상기 영상처리장치에 의해 처리되고, 상기 음성 명령이 대화문일 경우에 상기 음성 명령을 상기 서버에 의해 처리되게 제어하는 단계를 포함할 수 있다.The performing of the preset corresponding operation according to the voice command may include processing the voice command by the image processing apparatus when the voice command is a short message, and executing the voice command when the voice command is a conversation text. And controlling to be processed by the server.

또한, 본 발명의 실시예에 따른 영상처리 시스템은, 외부로부터 수신되는 방송신호를 영상으로 표시되게 처리하는 영상처리장치와; 상기 영상처리장치와 통신하는 서버를 포함하며, 상기 영상처리장치는, 사용자의 발화가 입력되는 음성입력부와; 상기 발화에 대응하는 음성 명령에 따라서 기 설정된 대응 동작이 수행되게 처리하는 음성처리부와; 상기 음성입력부를 통해 상기 발화가 입력되면 상기 음성처리부 및 상기 서버 중 어느 하나에 의해 상기 발화에 대응하는 상기 음성 명령이 처리되게 제어하는 제어부를 포함하며, 상기 제어부는, 상기 음성 명령이 방송 채널의 콜사인에 관련된 키워드를 포함하는 경우에 상기 음성처리부 및 상기 서버 중 어느 하나에 의해 상기 키워드에 대응하는 추천 콜사인이 기 설정된 선택조건에 따라서 선택되게 제어하고, 상기 추천 콜사인의 방송 채널에 대하여 상기 음성 명령에 따른 대응 동작을 수행하는 것을 특징으로 한다.In addition, the image processing system according to an embodiment of the present invention, the image processing apparatus for processing a broadcast signal received from the outside to be displayed as an image; And a server for communicating with the image processing apparatus, wherein the image processing apparatus comprises: a voice input unit to which a user's utterance is input; A voice processing unit for performing a predetermined corresponding operation according to a voice command corresponding to the utterance; And a controller configured to control the voice command corresponding to the utterance by any one of the voice processor and the server when the utterance is input through the voice input unit. In the case of including a keyword related to a call sign, either the voice processor or the server controls the recommended call sign corresponding to the keyword to be selected according to a preset selection condition, and the voice command is directed to a broadcast channel of the recommended call sign. It characterized in that to perform a corresponding operation according to.

여기서, 상기 발화를 텍스트의 음성 명령으로 변환하는 STT서버를 더 포함하며, 상기 제어부는, 상기 음성입력부에 상기 발화가 입력되면 상기 발화의 음성신호를 상기 STT서버로 전송하며, 상기 STT서버로부터 상기 발화에 대응하는 상기 음성 명령을 수신할 수 있다.The STT server may further include converting the utterance into a voice command of text, wherein the controller transmits the voice signal of the utterance to the STT server when the utterance is input to the voice input unit. The voice command corresponding to the utterance may be received.

도 1은 본 발명의 제1실시예에 따른 디스플레이장치의 구성 블록도,
도 2는 키워드 및 콜사인 후보에 관한 데이터베이스의 구조를 개략적으로 나타내는 예시도,
도 3은 도 1의 디스플레이장치 및 서버의 인터랙션 구조를 나타내는 구성 블록도,
도 4는 도 3의 디스플레이장치 및 서버의 인터랙션 과정을 나타내는 예시도,
도 5 및 도 6은 도 1의 디스플레이장치에서 복수의 추천 콜사인 중 어느 하나를 선택 가능하게 제공하는 유아이 영상의 예시도,
도 7은 본 발명의 제2실시예에 따른 디스플레이장치 및 서버의 인터랙션 과정을 나타내는 예시도,
도 8은 본 발명의 제3실시예에 따른 디스플레이장치 및 서버의 인터랙션 구조를 나타내는 구성 블록도,
도 9는 도 8의 디스플레이장치 및 서버의 인터랙션 과정을 나타내는 예시도,
도 10은 본 발명의 제4실시예에 따른 디스플레이장치의 음성처리부의 신호 전달 구조를 나타내는 구성 블록도이다.1 is a block diagram of a display device according to a first embodiment of the present invention;
2 is an exemplary diagram schematically showing the structure of a database relating to keyword and callsign candidates;
3 is a block diagram illustrating an interaction structure between the display apparatus and the server of FIG. 1;
4 is an exemplary diagram illustrating an interaction process between a display apparatus and a server of FIG. 3;
5 and 6 are views illustrating an example of an infant image to selectably provide any one of a plurality of recommended call signs in the display device of FIG. 1;
7 is an exemplary view illustrating an interaction process between a display apparatus and a server according to a second embodiment of the present invention;
8 is a block diagram illustrating an interaction structure between a display apparatus and a server according to a third exemplary embodiment of the present invention;
9 is an exemplary diagram illustrating an interaction process between a display apparatus and a server of FIG. 8;
10 is a block diagram illustrating a signal transmission structure of a voice processing unit of a display device according to a fourth embodiment of the present invention.

이하에서는 첨부도면을 참조하여 본 발명에 대해 상세히 설명한다. 이하 실시예에서는 본 발명의 사상과 직접적인 관련이 있는 구성들에 관해서만 설명하며, 그 외의 구성에 관해서는 설명을 생략한다. 그러나, 본 발명의 사상이 적용된 장치 또는 시스템을 구현함에 있어서, 이와 같이 설명이 생략된 구성이 불필요함을 의미하는 것이 아님을 밝힌다.Hereinafter, the present invention will be described in detail with reference to the accompanying drawings. In the following embodiments, only configurations directly related to the concept of the present invention will be described, and description of other configurations will be omitted. However, it is to be understood that, in the implementation of the apparatus or system to which the spirit of the present invention is applied, it is not meant that the configuration omitted from the description is unnecessary.

도 1은 본 발명의 제1실시예에 따른 영상처리장치(100)의 구성 블록도이다.1 is a block diagram of a configuration of an image processing apparatus 100 according to a first embodiment of the present invention.

이하 실시예는 영상처리장치(100)가 자체적으로 영상을 표시할 수 있는 구조의 디스플레이장치인 경우에 관해 설명하나, 본 발명의 사상은 영상처리장치(100)가 자체적으로 영상을 표시하지 않고 타 디스플레이장치에 영상신호/제어신호를 출력 가능한 구조의 장치인 경우에도 적용이 가능한 바, 이하 설명하는 실시예에 한정되지 않는다. 본 실시예는 영상처리장치(100)가 TV인 경우에 관해 설명하지만, 이러한 이유에 따라서 그 구현 방식이 다양하게 변경되어 적용될 수 있다.Although the embodiment will be described with reference to the case where the image processing apparatus 100 is a display apparatus having a structure capable of displaying images on its own, the idea of the present invention is that the image processing apparatus 100 does not display images But the present invention is not limited to the embodiments described below as long as it is a device capable of outputting a video signal / control signal to a display device. The present embodiment describes a case where the image processing apparatus 100 is a TV, but the implementation method may be variously modified and applied according to the reasons.

도 1에 도시된 바와 같이, 본 실시예에 따른 영상처리장치(100) 또는 디스플레이장치(100)는 영상공급원(미도시)으로부터 영상신호를 수신한다. 디스플레이장치(100)가 수신 가능한 영상신호는 그 종류 또는 특성이 한정되지 않으며, 예를 들면 디스플레이장치(100)는 방송국의 송출장비(미도시)로부터 송출되는 방송신호를 수신하고, 해당 방송신호를 튜닝하여 방송영상을 표시할 수 있다.As shown in FIG. 1, the image processing apparatus 100 or the display apparatus 100 according to the present embodiment receives a video signal from a video source (not shown). For example, the display device 100 receives a broadcasting signal transmitted from a transmission device (not shown) of a broadcasting station, and transmits the broadcasting signal to the display device 100 The broadcast image can be displayed by tuning.

디스플레이장치(100)는 영상공급원(미도시)으로부터 영상신호를 수신하는 영상수신부(110)와, 영상수신부(110)에 수신되는 영상신호를 기 설정된 영상처리 프로세스에 따라서 처리하는 영상처리부(120)와, 영상처리부(120)에서 처리되는 영상신호에 기초하여 영상을 표시하는 디스플레이부(130)와, 서버(10)와 같은 외부장치와 통신하는 통신부(140)와, 사용자에 의해 조작되는 사용자입력부(150)와, 외부로부터의 음성 또는 소리가 입력되는 음성입력부(160)와, 음성입력부(160)에 입력되는 음성/소리를 해석 및 처리하는 음성처리부(170)와, 데이터/정보가 저장되는 저장부(180)와, 디스플레이장치(100)의 제반 동작을 제어하는 제어부(190)를 포함한다.The display apparatus 100 includes an image receiving unit 110 for receiving a video signal from a video source (not shown), an image processing unit 120 for processing a video signal received by the image receiving unit 110 according to a predetermined image processing process, A display unit 130 for displaying an image based on the image signal processed by the image processing unit 120, a communication unit 140 for communicating with an external device such as the server 10, A sound processing unit 170 for analyzing and processing the sound / sound input to the sound input unit 160, and a sound processing unit 170 for storing the data / A storage unit 180, and a control unit 190 for controlling various operations of the display device 100. [

영상수신부(110)는 영상신호/영상데이터를 유선 또는 무선으로 수신하여 영상처리부(120)에 전달한다. 영상수신부(110)는 수신하는 영상신호의 규격 및 디스플레이장치(100)의 구현 형태에 대응하여 다양한 방식으로 마련될 수 있다. 예를 들면, 영상수신부(110)는 RF(radio frequency)신호를 수신하거나, 컴포지트(composite) 비디오, 컴포넌트(component) 비디오, 슈퍼 비디오(super video), SCART, HDMI(high definition multimedia interface), 디스플레이포트(DisplayPort), UDI(unified display interface), 또는 와이어리스(wireless) HD 규격 등에 의한 영상신호를 수신할 수 있다. 영상수신부(110)는 영상신호가 방송신호인 경우, 이 방송신호를 채널 별로 튜닝하는 튜너(tuner)를 포함한다.The image receiving unit 110 receives the image signal / image data by wire or wireless and transmits the image signal / image data to the image processing unit 120. The image receiving unit 110 may be provided in various ways corresponding to the standard of the image signal to be received and the implementation form of the display device 100. [ For example, the image receiving unit 110 may receive a radio frequency (RF) signal, or may be a composite video, a component video, a super video, a SCART, a high definition multimedia interface (HDMI) A display port, a unified display interface (UDI), or a wireless HD standard. The image receiving unit 110 includes a tuner for tuning the broadcast signal for each channel when the image signal is a broadcast signal.

영상처리부(120)는 영상수신부(110)에 수신되는 영상신호에 대해 다양한 영상처리 프로세스를 수행한다. 영상처리부(120)는 이러한 프로세스를 수행한 영상신호를 디스플레이부(130)에 출력함으로써, 디스플레이부(130)에 해당 영상신호에 기초하는 영상이 표시되게 한다. 예를 들면, 영상처리부(120)는 영상수신부(110)에서 특정 채널로 방송신호가 튜닝되면, 방송신호로부터 해당 채널에 대응하는 영상, 음성 및 부가데이터를 추출하고 기 설정된 해상도로 조정하여 디스플레이부(130)에 표시한다.The image processing unit 120 performs a variety of image processing processes on the image signal received by the image receiving unit 110. The image processor 120 outputs a video signal that has undergone such a process to the display unit 130 so that an image based on the video signal is displayed on the display unit 130. For example, when the broadcast signal is tuned to a specific channel in the image receiving unit 110, the image processing unit 120 extracts video, audio, and additional data corresponding to the channel from the broadcast signal, adjusts the video, (130).

영상처리부(120)가 수행하는 영상처리 프로세스의 종류는 한정되지 않으며, 예를 들면 영상데이터의 영상 포맷에 대응하는 디코딩(decoding), 인터레이스(interlace) 방식의 영상데이터를 프로그레시브(progressive) 방식으로 변환하는 디인터레이싱(de-interlacing), 영상데이터를 기 설정된 해상도로 조정하는 스케일링(scaling), 영상 화질 개선을 위한 노이즈 감소(noise reduction), 디테일 강화(detail enhancement), 프레임 리프레시 레이트(frame refresh rate) 변환 등을 포함할 수 있다.The type of the image processing process performed by the image processing unit 120 is not limited. For example, the decoding and interlace image data corresponding to the image format of the image data are converted into a progressive method. De-interlacing, scaling to adjust image data to a preset resolution, noise reduction to improve image quality, detail enhancement, and frame refresh rate conversion And the like.

영상처리부(120)는 이러한 여러 기능을 통합시킨 SOC(system-on-chip), 또는 이러한 각 프로세스를 독자적으로 수행할 수 있는 개별적인 구성들이 인쇄회로기판 상에 장착됨으로써 영상처리보드(미도시)로 구현되어 디스플레이장치(100)에 내장된다.The image processor 120 may be a system-on-a-chip (SOC) that integrates various functions, or an individual configuration capable of independently performing each of the processes, And is embedded in the display device 100.

디스플레이부(130)는 영상처리부(120)로부터 출력되는 영상신호에 기초하여 영상을 표시한다. 디스플레이부(130)의 구현 방식은 한정되지 않는 바, 액정(liquid crystal), 플라즈마(plasma), 발광 다이오드(light-emitting diode), 유기발광 다이오드(organic light-emitting diode), 면전도 전자총(surface-conduction electron-emitter), 탄소 나노 튜브(carbon nano-tube), 나노 크리스탈(nano-crystal) 등의 다양한 디스플레이 방식으로 구현될 수 있다.The display unit 130 displays an image based on the image signal output from the image processing unit 120. [ The display unit 130 may be implemented in various forms including, but not limited to, a liquid crystal, a plasma, a light-emitting diode, an organic light-emitting diode, electron conduction electron-emitter, carbon nano-tube, nano-crystal, and the like.

디스플레이부(130)는 그 구현 방식에 따라서 부가적인 구성을 추가적으로 포함할 수 있다. 예를 들면, 디스플레이부(130)가 액정 방식인 경우, 디스플레이부(130)는 액정 디스플레이 패널(미도시)과, 이에 광을 공급하는 백라이트유닛(미도시)과, 패널(미도시)을 구동시키는 패널구동기판(미도시)을 포함한다.The display unit 130 may further include an additional configuration depending on the implementation method. For example, when the display unit 130 is a liquid crystal type, the display unit 130 includes a liquid crystal display panel (not shown), a backlight unit (not shown) for supplying light thereto, and a panel (Not shown).

통신부(140)는 디스플레이장치(100)가 서버(10)와 양방향 통신을 수행하도록 데이터의 송수신을 수행한다. 통신부(140)는 서버(10)의 통신 프로토콜(protocol)에 따라서, 유선/무선을 통한 광역/근거리 네트워크나 또는 로컬 접속 방식으로 서버(10)에 접속한다.The communication unit 140 performs transmission and reception of data so that the display device 100 performs bidirectional communication with the server 10. The communication unit 140 connects to the server 10 via a wired / wireless wide area / local area network or a local connection method according to a communication protocol of the server 10.

사용자입력부(150)는 사용자의 조작 및 입력에 따라서 기 설정된 다양한 제어 커맨드 또는 정보를 제어부(190)에 전달한다. 사용자입력부(150)는 디스플레이장치(100) 외측에 설치된 메뉴 키(menu-key) 또는 입력 패널(panel)이나, 디스플레이장치(100)와 분리 이격된 리모트 컨트롤러(remote controller) 등으로 구현된다. 또는, 사용자입력부(150)는 디스플레이부(130)와 일체형으로 구현될 수 있는 바, 디스플레이부(130)가 터치스크린(touch-screen)인 경우에 사용자는 디스플레이부(130)에 표시된 입력메뉴(미도시)를 터치함으로써 기 설정된 커맨드를 제어부(190)에 전달할 수 있다.The user input unit 150 transmits various preset control commands or information to the controller 190 according to a user's operation and input. The user input unit 150 is realized by a menu-key or an input panel installed outside the display device 100 or a remote controller separated from the display device 100. Alternatively, the user input unit 150 may be integrated with the display unit 130, and when the display unit 130 is a touch-screen, the user may select the input menu 130 displayed on the display unit 130 (Not shown) to the controller 190. The controller 190 may be configured to receive the command.

음성입력부(160)는 마이크로 구현되며, 디스플레이장치(100)의 외부 환경에서 발생하는 다양한 소리를 감지한다. 음성입력부(160)가 감지하는 소리는 사용자에 의한 발화와, 사용자 이외에 다양한 요인에 의해 발생하는 소리를 포함한다.The voice input unit 160 is micro-implemented and detects various sounds generated in the external environment of the display device 100. [ The sound sensed by the voice input unit 160 includes voice uttered by the user and sound generated by various factors other than the user.

음성처리부(170)는 디스플레이장치(100)에서 수행되는 다양한 기 설정된 프로세스 중에서, 음성입력부(160)에 입력되는 음성/소리에 대한 프로세스를 수행한다. 여기서, 음성처리부(170)가 처리하는 "음성"은 음성입력부(160)에 입력되는 음성을 의미한다. 영상처리부(120)가 영상신호를 처리할 때에 해당 영상신호는 음성데이터를 포함할 수 있는 바, 영상신호에 포함된 음성데이터는 영상처리부(120)에 의해 처리된다.The voice processing unit 170 performs processes for voice / sound input to the voice input unit 160 among various preset processes performed in the display device 100. [ Here, the "voice" processed by the voice processing unit 170 means a voice input to the voice input unit 160. [ When the video processing unit 120 processes the video signal, the video signal may include audio data, and the audio data included in the video signal is processed by the video processing unit 120.

음성처리부(170)는 음성입력부(160)에 음성/소리가 입력되면, 입력된 음성/소리가 사용자에 의한 발화인지 아니면 기타 요인에 의하여 발생한 소리인지 여부를 판단한다. 이러한 판단 방법은 다양한 구조가 적용될 수 있으므로 특정할 수 없으며, 예를 들면 입력된 음성/소리가 사람의 목소리에 대응하는 파장/주파수 대역에 해당하는지 판단하거나, 또는 사전에 지정된 사용자의 음성의 프로파일에 해당하는지 판단하는 등의 방법이 가능하다.When the voice / sound is input to the voice input unit 160, the voice processing unit 170 determines whether the voice / sound is a voice generated by a user or other factors. This determination method can not be specified because various structures can be applied. For example, it can be determined whether the inputted voice / sound corresponds to the wavelength / frequency band corresponding to the voice of the person, or the voice / It is possible to determine whether it is applicable or not.

음성처리부(170)는 사용자의 발화가 입력된 것으로 판단하면, 해당 발화에 대응하는 음성 명령에 따라서 기 설정된 대응 동작이 수행되게 처리한다. 여기서, 음성 명령은 사용자의 발화의 내용을 의미한다. 이에 관한 자세한 내용은 후술한다.If the voice processing unit 170 determines that the user's utterance has been input, the voice processing unit 170 processes the predetermined corresponding operation according to the voice command corresponding to the utterance. Here, the voice command means the contents of the utterance of the user. Details of this will be described later.

저장부(180)는 제어부(190)의 제어에 따라서 한정되지 않은 데이터가 저장된다. 저장부(180)는 플래시메모리(flash-memory), 하드디스크 드라이브(hard-disc drive)와 같은 비휘발성 메모리로 구현된다. 저장부(180)는 제어부(190), 영상처리부(120) 또는 음성처리부(170) 등에 의해 액세스되며, 데이터의 독취/기록/수정/삭제/갱신 등이 수행된다.The storage unit 180 stores unlimited data under the control of the controller 190. The storage unit 180 is implemented as a non-volatile memory such as a flash memory, a hard-disc drive, or the like. The storage unit 180 is accessed by the control unit 190, the image processing unit 120 or the voice processing unit 170 and reads / writes / corrects / deletes / updates data.

제어부(190)는 음성입력부(160)를 통해 사용자의 발화가 입력되면, 입력된 발화를 처리하도록 음성처리부(170)를 제어한다. 영상처리부(120)가 영상수신부(110)에 수신되는 방송신호를 처리함으로써 디스플레이부(130)에 방송영상이 표시될 때, 제어부(190)는 음성입력부(160)를 통해 채널 전환을 명령하는 사용자의 발화가 수신되면, 해당 발화의 내용에 따라서 채널을 변경시킨다.The control unit 190 controls the voice processing unit 170 to process the input utterance when the utterance of the user is input through the voice input unit 160. [ When the broadcast image is displayed on the display 130 by processing the broadcast signal received by the image receiver 110 by the image processor 120, the controller 190 commands a user to switch channels through the voice input unit 160. When an utterance is received, the channel is changed according to the content of the utterance.

"채널 전환"에 관련된 음성 명령의 방식은, 사용자가 원하는 방송채널의 채널번호를 말하거나, 또는 원하는 방송채널의 콜사인(call sign)을 발화하는 방법이 가능하다. 채널번호 및 콜사인은 어느 한 채널을 타 채널과 구분하는 미리 약속된 표현방식이다. 채널번호는 6, 7, 11 등과 같은 정수로 표현한다.The method of voice command related to "channel switching" may be a method of speaking a channel number of a broadcast channel desired by a user or uttering a call sign of a desired broadcast channel. Channel numbers and callsigns are pre-defined expressions that distinguish one channel from another. The channel number is expressed by an integer such as 6, 7, 11, or the like.

콜사인은 특정 채널을 제공하는 제공자(provider)의 식별명이며, 일반적으로 해당 채널을 방송하는 방송국의 식별명이다. 여기서, 하나의 채널의 콜사인은 복수 개가 있을 수 있으며, 또한 하나의 방송국이 복수의 채널을 제공하는 경우에 각 채널은 상호 구분을 위해 서로 상이한 콜사인을 가진다.The call sign is an identification name of a provider providing a specific channel, and is generally an identification name of a broadcasting station broadcasting the channel. Here, there may be a plurality of callsigns of one channel, and when one broadcasting station provides a plurality of channels, each channel has a different callsign for mutual distinction.

전자의 예를 들면 다음과 같다. 소정의 제1채널의 콜사인이 "KBS"라고 할 때에, "한국방송"이라는 콜사인 또한 "KBS"와 동일하게 제1채널을 지칭하는 것일 수 있다. 또는, 어느 지역에서는 소정의 제2채널의 콜사인이 "MBC"인 것에 비해, 타 지역에서는 제2채널의 콜사인이 이와 상이한 "TNN"일 수도 있다. 즉, 특정 채널의 콜사인은 하나가 아닌 복수 개가 있을 수 있다.An example of the former is as follows. When the call sign of a predetermined first channel is "KBS", the call sign of "Korean broadcast" may also refer to the first channel in the same way as "KBS". Alternatively, the call sign of the second channel may be different from the call sign of the second channel in another region, whereas the call sign of the second channel may be different from that of the second channel. That is, there may be a plurality of call signs of a specific channel instead of one.

후자의 예를 들면 다음과 같다. "KBS"라는 콜사인을 가지는 방송국은 소정의 제3채널 및 제4채널에 각기 방송신호를 제공할 수 있다. 이 경우, 제3채널의 콜사인은 "KBS-1"이고, 제4채널의 콜사인은 "KBS-2"로 각기 상이하다. "KBS"는 해당 방송국의 대표 콜사인으로 볼 수 있으며, "KBS-1" 및 "KBS-2"는 "KBS"와 관련된 하위 콜사인이다. 즉, "KBS"라는 콜사인은 제3채널 및 제4채널과 모두 관련된다.An example of the latter is as follows. Broadcasting stations having a call sign of "KBS" may provide broadcast signals to predetermined third and fourth channels, respectively. In this case, the call sign of the third channel is "KBS-1", and the call sign of the fourth channel is different from "KBS-2". "KBS" may be regarded as a representative call sign of the broadcasting station, and "KBS-1" and "KBS-2" are sub-call signs related to "KBS". That is, a call sign of "KBS" is associated with both the third channel and the fourth channel.

따라서, 만일 음성입력부(160)를 통해 입력된 사용자의 발화가 "KBS 틀어줘"라는 음성 명령이라면, "KBS-1"의 제3채널 및 "KBS-2"의 제4채널 중에서 어떠한 채널을 의미하는 것인지 불명료할 수 있다.Therefore, if the user's speech input through the voice input unit 160 is a voice command of "KBS turn on", it means any channel among the third channel of "KBS-1" and the fourth channel of "KBS-2". It may be unclear.

이에, 본 실시예에 따르면, 제어부(190)는 사용자의 발화에 대응하는 음성 명령이 방송채널의 콜사인에 관련된 키워드(key-word)를 포함하는지 여부를 판단한다.Accordingly, according to the present exemplary embodiment, the controller 190 determines whether a voice command corresponding to a user's speech includes a keyword (key-word) related to a call sign of a broadcast channel.

제어부(190)는 음성 명령이 콜사인 관련 키워드를 포함하는 것으로 판단하면, 복수의 콜사인을 포함하는 데이터베이스에서 해당 키워드에 대응하는 콜사인을 검색하도록 음성처리부(170)를 제어한다. 여기서, 데이터베이스는 저장부(180)에 저장되며, 이와 같이 검색된 콜사인을 콜사인 후보라고 지칭한다. 이 때, 제어부(190)는 해당 키워드에 대응하는 복수의 콜사인 후보가 검색된 경우, 기 설정된 선택조건에 기초하여 복수의 콜사인 후보 중에서 추천 콜사인을 선택한다.If it is determined that the voice command includes a call sign related keyword, the controller 190 controls the voice processing unit 170 to search for a call sign corresponding to the keyword in a database including a plurality of call signs. Here, the database is stored in the storage unit 180 and the retrieved call sign is referred to as a call sign candidate. At this time, when a plurality of call sign candidates corresponding to the corresponding keyword are found, the controller 190 selects a recommended call sign from among the plurality of call sign candidates based on a preset selection condition.

또는, 제어부(190)는 상기한 데이터베이스가 저장된 서버(10)에 키워드 및 음성 명령을 전송할 수도 있다. 이 경우, 서버(10)는 앞서 설명한 구성과 유사한 원리로 추천 콜사인을 선택하며 음성 명령에 따른 대응 동작을 분석하고, 이러한 선택 및 분석 결과에 따른 제어신호를 디스플레이장치(100)에 전송한다.Alternatively, the controller 190 may transmit a keyword and a voice command to the server 10 in which the database is stored. In this case, the server 10 selects the recommended call sign on the principle similar to the above-described configuration, analyzes the corresponding operation according to the voice command, and transmits a control signal according to the selection and analysis result to the display apparatus 100.

제어부(190)는 이와 같이 선택된 추천 콜사인의 방송 채널에 대하여 음성 명령에 따른 대응 동작을 수행한다.The controller 190 performs a corresponding operation according to a voice command on the broadcast channel of the selected recommended call sign.

이하, 데이터베이스(200)의 구성과, 데이터베이스(200)를 검색하여 콜사인 후보(230)를 검색하는 방법에 관해 도 2를 참조하여 설명한다.Hereinafter, a configuration of the database 200 and a method of searching the database 200 to search the call sign candidate 230 will be described with reference to FIG. 2.

도 2는 데이터베이스(200)의 구조를 개략적으로 나타내는 예시도이다.2 is an exemplary view schematically showing the structure of the database 200.

도 2에 도시된 바와 같이, 제어부(190)는 사용자의 발화를 텍스트로 변환한 음성 명령인 "KBS 틀어줘"에 콜사인 관련 키워드가 포함되는지 판단한다. 저장부(180)가 키워드 및 콜사인이 상호 대응하게 맵핑(mapping)된 관계 데이터베이스(200)를 저장하고 있으며, 제어부(190)는 소정 키워드를 가지고 데이터베이스(200)를 검색함으로써 해당 키워드가 콜사인 관련 키워드인지 여부를 판단할 수 있다.As illustrated in FIG. 2, the controller 190 determines whether a call sign-related keyword is included in the " KBS " The storage unit 180 stores the relational database 200 in which keywords and callsigns are mapped corresponding to each other, and the controller 190 searches the database 200 with a predetermined keyword so that the corresponding keyword is a callsign related keyword. It can be determined whether or not.

데이터베이스(200)는 복수의 키워드(220) 및 복수의 콜사인(230)을 상호 맵핑시킴으로써, 어느 하나의 키워드(220)를 가지고 하나 이상의 콜사인 후보(230)를 검색하기 위해 사용된다. 본 도면에서는 데이터베이스(200) 중에서 "KBS" 및 "FTV"의 두 대표 콜사인(210)에 관련된 항목의 관계만을 나타낸 것이다.The database 200 is used to search for one or more callsign candidates 230 with either keyword 220 by mapping a plurality of keywords 220 and a plurality of callsigns 230 to each other. In this figure, only the relation of items related to two representative call signs 210 of "KBS" and "FTV" in the database 200 is shown.

대표 콜사인(210)은 키워드(220) 및 콜사인 후보(230)를 상호 관련시키기 위한 링크 역할을 수행한다. 구체적으로, 소정 키워드(220)가 입력되었을 때에, 우선 해당 키워드(220)가 어느 대표 콜사인(210)과 관련되는지가 데이터베이스(200) 상에서 1차적으로 검색된다. 대표 콜사인(210)이 검색되면, 검색된 대표 콜사인(210)의 하부 콜사인(230) 또는 관련된 콜사인 후보(230)가 2차적으로 검색된다.The representative callsign 210 serves as a link for correlating the keyword 220 and the callsign candidate 230. Specifically, when a predetermined keyword 220 is input, first, the representative call sign 210 related to the keyword 220 is first searched on the database 200. When the representative callsign 210 is searched, the lower callsign 230 or related callsign candidate 230 of the found representative callsign 210 is searched secondarily.

키워드(220)는 대표 콜사인(210)과 관련된 동의어, 유사어 등을 포함하는 다양한 용어가 대표 콜사인(210)에 대해 그룹화/카테고리화된다. 또한, 콜사인 후보(230)는 대표 콜사인(210)과 연관된 하나 이상의 채널의 콜사인을 포함하며, 이러한 콜사인은 대표 콜사인(210)에 대해 그룹화/카테고리화된다.The keyword 220 is grouped / categorized for the representative callsign 210 with various terms, including synonyms, synonyms, and the like associated with the representative callsign 210. In addition, callsign candidate 230 includes callsigns of one or more channels associated with representative callsign 210, which are grouped / categorized relative to representative callsign 210.

예를 들면, "낚시채널"이라는 키워드(220)가 입력되는 경우, "낚시채널"과 연관된 대표 콜사인(210)은 "FTV"이며, 대표 콜사인(210) "FTV"에 관련된 콜사인 후보(230)는 "FTV" 하나이다. 즉, 데이터베이스(200)에 의해 검색된 바로는, "낚시채널"이라는 키워드(220)에 대응하는 채널의 콜사인은 "FTV" 하나이다.For example, when the keyword 220 "fishing channel" is input, the representative call sign 210 associated with the "fishing channel" is "FTV" and the call sign candidate 230 related to the representative call sign 210 "FTV". Is one "FTV". That is, as long as the database 200 searches, the call sign of the channel corresponding to the keyword 220 of "fishing channel" is one "FTV".

한편, "KBS"라는 키워드(220)가 입력되는 경우, "KBS"와 연관된 대표 콜사인(210)은 "KBS"이며, 대표 콜사인 "KBS"에 관련된 콜사인 후보(230)는 "KBS-1", "KBS-2", "KBS-sports", "KBS-movie"의 네 가지가 있다.Meanwhile, when the keyword 220 of "KBS" is input, the representative callsign 210 associated with "KBS" is "KBS", and the callsign candidate 230 related to the representative callsign "KBS" is "KBS-1", There are four types, "KBS-2", "KBS-sports" and "KBS-movie".

이러한 방법으로 데이터베이스(200)를 검색함으로써 키워드(220)와 관련된 적어도 하나 이상의 콜사인 후보(230)를 얻을 수 있다. 다만, 상기한 방법은 데이터베이스(200)를 구현하는 하나의 예시에 불과할 뿐인 바, 데이터베이스(200)의 구현 방식은 다양하게 적용될 수 있으며 상기한 예시로 한정되지 않는다.By searching the database 200 in this manner, at least one callsign candidate 230 associated with the keyword 220 may be obtained. However, the above-described method is only one example of implementing the database 200, and the implementation manner of the database 200 may be variously applied and is not limited to the above-described example.

도 3은 디스플레이장치(100) 및 서버(20, 30)의 인터랙션 구조를 나타내는 구성 블록도이다.3 is a block diagram illustrating an interaction structure between the display apparatus 100 and the servers 20 and 30.

도 3에 도시된 바와 같이, 디스플레이장치(100)는 통신부(140)와, 음성입력부(160)와, 음성처리부(170)와, 제어부(190)를 포함한다. 이러한 구성은 앞선 도 1에서 설명한 바와 같다. 여기서, 통신부(140)는 사용자의 발화를 음성 명령으로 변환하는 STT(speech-to-text)서버(20)와, 음성 명령을 분석함으로써 음성 명령에 대응하는 대응 동작을 판단하는 대화형 서버(30)에 접속된다.As shown in FIG. 3, the display apparatus 100 includes a communication unit 140, a voice input unit 160, a voice processing unit 170, and a controller 190. This configuration is as described in FIG. Here, the communication unit 140 includes a speech-to-text (STT) server 20 for converting a user's utterance into a voice command, an interactive server 30 for determining a corresponding operation corresponding to the voice command by analyzing the voice command .

STT서버(20)는 음성신호가 수신되면 해당 음성신호의 파형을 분석함으로써 음성신호의 내용을 텍스트로 생성한다. STT서버(20)는 디스플레이장치(100)로부터 사용자의 발화의 음성신호를 수신하면, 이를 음성 명령으로 변환한다.When the voice signal is received, the STT server 20 generates the text of the voice signal by analyzing the waveform of the voice signal. The STT server 20 receives a voice signal of a user's utterance from the display device 100 and converts it into a voice command.

대화형 서버(30)는 음성 명령에 대응하는 다양한 디스플레이장치(100)의 동작이 맵핑된 데이터베이스를 포함한다. 대화형 서버(30)는 디스플레이장치(100)로부터 수신한 음성 명령을 분석하고, 분석 결과에 따라서 해당 음성 명령에 대응하는 동작을 수행하기 위한 제어신호를 디스플레이장치(100)에 전송한다.The interactive server 30 includes a database to which operations of various display apparatuses 100 corresponding to voice commands are mapped. The interactive server 30 analyzes the voice command received from the display device 100 and transmits a control signal for performing an operation corresponding to the voice command to the display device 100 according to the analysis result.

제어부(190)는 음성입력부(160)에 사용자의 발화가 입력되면, 해당 발화의 음성신호를 STT서버(20)에 전송하고, STT서버(20)로부터 해당 발화에 대응하는 음성 명령을 수신한다.The control unit 190 transmits the voice signal of the utterance to the STT server 20 and receives the voice command corresponding to the utterance from the STT server 20. [

제어부(190)는 STT서버(20)로부터 수신된 음성 명령이 단문 및 대화문 중에서 어느 쪽에 해당하는지를 판단한다. 제어부(190)는 음성 명령이 단문이면 음성처리부(170)에 의해 처리되도록 하고, 음성 명령이 대화문이면 대화형 서버(30)에 의해 처리되도록 한다.The control unit 190 determines whether the voice command received from the STT server 20 corresponds to the short message or the dialogue. The control unit 190 causes the voice processing unit 170 to process the voice command if the voice command is a short message and allows the voice command to be processed by the interactive server 30 if the voice command is a conversation.

이러한 과정은, 대화문이 자연어이기 때문에, 대화문인 음성 명령 내에서 사용자가 원하는 대응 동작을 기계적으로 추출하는 것이 상대적으로 용이하지 않기 때문이다. 예를 들면, 사용자의 음성 명령이 "KBS 틀어"라는 단문인 경우, 음성처리부(170)는 "KBS"라는 콜사인 키워드와 "틀어"라는 동작 키워드를 가지고 해당 동작을 바로 수행할 수 있다.This is because, since the dialogue is a natural language, it is relatively easy to mechanically extract the corresponding action desired by the user in the dialogue command. For example, when the user's voice command is a short sentence of "KBS", the voice processing unit 170 may directly perform the operation with a call sign keyword of "KBS" and an operation keyword of "wrong".

그런데, 이와 실질적으로 동일한 내용의 음성 명령인 "지금 보고 있는 채널을 한국방송으로 변경해 주세요"와 같은 대화문인 경우, "한국방송"에 대응하는 "KBS"의 콜사인 키워드를 도출하고, "변경해 주세요"에 대응하는 "틀어"라는 동작 키워드를 도출하는 과정이 필요하다. 시스템의 부하 또는 데이터베이스의 정보량 등과 같은 다양한 요인으로 인해, 음성처리부(170)가 이러한 대화문을 처리하는 것은 용이하지 않을 수 있다.However, in the case of a dialogue such as "Please change the channel you are watching to Korean broadcast", which is the same voice command, the call sign keyword of "KBS" corresponding to "Korean broadcast" is derived and "Please change". There is a need for a process of deriving an action keyword corresponding to "twist". Due to various factors such as the load of the system or the amount of information in the database, it may not be easy for the voice processing unit 170 to process such a dialogue.

도 4는 본 실시예에 따른 디스플레이장치(100) 및 서버(20, 30)의 인터랙션 과정을 나타내는 예시도이다.4 is an exemplary diagram illustrating an interaction process between the display apparatus 100 and the servers 20 and 30 according to the present exemplary embodiment.

도 4에 도시된 바와 같이, 디스플레이장치(100)는 사용자로부터 발화가 입력되면(600), 해당 발화의 음성신호를 STT서버(20)에 전달한다(610).As shown in FIG. 4, when a utterance is input from the user (600), the display apparatus 100 transmits a voice signal of the corresponding utterance to the STT server 20 (610).

STT서버(20)는 음성신호를 음성 명령으로 변환하고(620), 변환된 음성 명령을 디스플레이장치(100)에 전달한다(630).The STT server 20 converts the voice signal into a voice command (620), and transmits the converted voice command to the display apparatus 100 (630).

디스플레이장치(100)는 STT서버(30)로부터 수신한 음성 명령을 분석하여, 음성 명령으로부터 콜사인 관련 키워드를 추출한다(640). 여기서, 디스플레이장치(100)는 음성 명령의 단문/대화문 여부를 판단한다.The display apparatus 100 analyzes the voice command received from the STT server 30 and extracts a call sign related keyword from the voice command (640). Here, the display apparatus 100 determines whether a voice command is a short / conversation text.

만일, 음성 명령이 대화문으로 판단되면, 디스플레이장치(100)는 음성 명령 및 콜사인 관련 키워드를 대화형 서버(30)에 전송한다(650).If the voice command is determined to be a conversation, the display apparatus 100 transmits the voice command and the call sign related keyword to the interactive server 30 (650).

대화형 서버(30)는 디스플레이장치(100)로부터 수신된 음성 명령 및 콜사인 관련 키워드에 의한 콜사인 분석 프로세스를 수행한다(660). 콜사인 분석 프로세스에서는 콜사인 관련 키워드에 대응하는 콜사인 후보를 검색하는 단계, 검색된 콜사인 후보들 중에서 추천 콜사인을 선택하는 단계, 텍스트 내에서 추천 콜사인에 대응하는 디스플레이장치(100)의 동작을 판별하는 단계 등이 수행되며, 이에 관한 자세한 내용은 후술한다.The interactive server 30 performs a callsign analysis process based on a voice command and callsign related keywords received from the display apparatus 100 (660). In the call sign analysis process, searching for a call sign candidate corresponding to a call sign related keyword, selecting a recommended call sign from among searched call sign candidates, and determining an operation of the display apparatus 100 corresponding to the recommended call sign in the text are performed. The details thereof will be described later.

대화형 서버(30)는 추천 콜사인의 선택과 음성 명령의 대응 동작의 판별이 완료되면, 이러한 선택 및 판별 결과에 따른 제어신호를 디스플레이장치(100)에 전송한다. 이에, 디스플레이장치(100)는 제어신호에 따라서 추천 콜사인에 대한 대응 동작을 수행할 수 있다.When the selection of the recommended call sign and the determination of the corresponding operation of the voice command are completed, the interactive server 30 transmits a control signal according to the selection and the determination result to the display apparatus 100. Accordingly, the display apparatus 100 may perform a corresponding operation for the recommended call sign according to the control signal.

예를 들면, 추천 콜사인이 "KBS-1"이고, 대응 동작이 채널 전환이라고 분석되면, 대화형 서버(30)는 이러한 내용을 지시하는 제어신호를 디스플레이장치(100)에 전송함으로써 디스플레이장치(100)가 "KBS-1" 채널로 전환하도록 한다.For example, if the recommended call sign is "KBS-1" and the corresponding operation is analyzed to be channel switching, the interactive server 30 transmits a control signal indicating the contents to the display apparatus 100 to display the display apparatus 100. ) Switch to the "KBS-1" channel.

한편, 앞선 640 단계에서 음성 명령이 단문으로 판단되면, 디스플레이장치(100)는 수신된 음성 명령 및 콜사인 관련 키워드에 의한 콜사인 분석 프로세스를 수행한다. 이러한 프로세스는 앞선 대화형 서버(30)에서 수행되는 프로세스와 실질적으로 동일한 원리에 따라서 이루어진다.On the other hand, if it is determined in step 640 that the voice command is short, the display apparatus 100 performs a call sign analysis process based on the received voice command and call sign related keywords. This process is performed according to substantially the same principle as the process performed in the foregoing interactive server 30.

이하, 콜사인 관련 키워드에 의한 콜사인 분석 프로세스에 관해 설명한다.Hereinafter, the callsign analysis process using callsign related keywords will be described.

대화형 서버(30)는 콜사인 관련 키워드에 대응하는 콜사인 후보를 검색하고, 검색된 콜사인 후보가 복수 개인지 판단한다.The interactive server 30 searches for a callsign candidate corresponding to a callsign related keyword, and determines whether there are a plurality of searched callsign candidates.

콜사인에 관련된 키워드 포함 여부의 판단 및 해당 키워드에 대응하는 콜사인 후보의 검색은, 앞선 도 2에서 설명한 바와 같은 방법을 통해 수행될 수 있다. 즉, 대화형 서버(30)는 음성 명령으로부터 추출된 단어를 데이터베이스(200, 도 2 참조) 상에 검색하여 매칭되는 단어(220, 도 2 참조)가 있는지 여부를 판단하고, 매칭되는 단어가 있다면 해당 대표 콜사인(210, 도 2 참조)의 콜사인 후보(230, 도 2 참조)를 얻을 수 있다.The determination of whether to include the keyword related to the call sign and the search of the call sign candidate corresponding to the keyword may be performed by the method described with reference to FIG. 2. That is, the interactive server 30 searches the word extracted from the voice command on the database 200 (see FIG. 2) to determine whether there is a matched word 220 (see FIG. 2), and if there is a matched word, The callsign candidate 230 (see FIG. 2) of the representative callsign 210 (see FIG. 2) may be obtained.

한편, 음성 명령에서 키워드를 추출하는 과정에서, 음성 명령의 오기에 대한 수정 또는 필터링이 수행될 수도 있다. 예를 들면, "안국방송 틀어줘"라는 음성 명령이 있다고 할 때, "안국방송"이란 단어가 데이터베이스 상에 없다고 하더라도, 데이터베이스 상의 "한국방송"이란 단어가 "안국방송"이란 단어와 유사하다고 판단되면 "한국방송"이란 단어가 선택될 수 있다. 단어의 유사도를 판단하는 방법은 다양하게 결정될 수 있는 바, 본 발명의 사상을 한정하지 않는다.On the other hand, in the process of extracting a keyword from the voice command, correction or filtering may be performed for misunderstanding of the voice command. For example, if there is a voice command of "broadcasting Anguk Broadcasting", the word "Korean Broadcasting" in the database is judged to be similar to the word "Anguk Broadcasting" even if the word "Anguk Broadcasting" is not in the database. The word "Korean broadcast" can be selected. The method of determining the similarity of words may be variously determined and does not limit the spirit of the present invention.

한편, 검색된 콜사인 후보가 하나라면, 대화형 서버(30)는 해당 콜사인 후보를 추천 콜사인으로 선택한다.On the other hand, if there is only one retrieved callsign candidate, the interactive server 30 selects the callsign candidate as the recommended callsign.

반면, 검색된 콜사인 후보가 복수 개라면, 대화형 서버(30)는 기 설정된 선택조건에 따라서 추천 콜사인을 선택한다.On the other hand, if there are a plurality of retrieved call sign candidates, the interactive server 30 selects the recommended call sign according to a preset selection condition.

콜사인 후보들 중에서 추천 콜사인을 선택하는 선택조건은 다양한 조건이 미리 설정될 수 있다. 예를 들면, 대화형 서버(30)는 디스플레이장치(100)의 사용 이력 정보에 기초하여, 콜사인 후보 중에서 선택 빈도가 기 설정 순위 이상으로 높은 복수 개의 콜사인을 추천 콜사인으로 선택하거나, 또는 선택 빈도가 가장 높은 하나의 콜사인을 추천 콜사인으로 선택할 수 있다.As the selection condition for selecting the recommended callsign from among the callsign candidates, various conditions may be set in advance. For example, the interactive server 30 selects a plurality of callsigns having a high frequency of selection higher than a predetermined rank from the callsign candidates as recommended callsigns based on the usage history information of the display apparatus 100, or selects a plurality of callsigns. The highest one callsign can be selected as the recommended callsign.

콜사인 후보 내에 "KBS-1", "KBS-2", "KBS-sports", "KBS-movie"의 네 콜사인이 있다고 할 때, 제어부(190)는 사용 이력 정보에 기초하여 이들 채널이 소정 기간 동안에 디스플레이장치(100)에서 선택된 빈도를 판단한다. 예를 들어, "KBS-sports", "KBS-movie", "KBS-2", "KBS-1"의 순서대로 선택 빈도가 높다고 할 때, 제어부(190)는 선택 빈도의 순서대로 복수 개의 콜사인을 선택하거나, 또는 하나의 콜사인을 선택할 수 있다.When there are four callsigns of "KBS-1", "KBS-2", "KBS-sports", and "KBS-movie" in the callsign candidate, the control unit 190 determines that these channels remain for a predetermined period based on the usage history information. The frequency selected by the display apparatus 100 is determined. For example, when the frequency of selection is high in the order of "KBS-sports", "KBS-movie", "KBS-2", and "KBS-1", the controller 190 controls the plurality of call signs in the order of selection frequency. You can select or choose one callsign.

여기서, 제어부(190)는 복수의 콜사인 후보 중에서 어느 하나를 사용자가 선택 가능하도록 제공하는 유아이 영상(UI, user interface)을 표시할 수 있다.Herein, the controller 190 may display a user interface (UI) for providing a user to select one of a plurality of callsign candidates.

도 5 및 도 6은 복수의 추천 콜사인 중 어느 하나를 선택 가능하게 제공하는 유아이 영상(310, 320)의 예시도이다.5 and 6 are exemplary diagrams of an infant image 310 and 320 to selectably provide any one of a plurality of recommended call signs.

도 5에 도시된 바와 같이, 제어부(190)는 사용 이력 정보에 기초하여 디스플레이장치(100)에서 선택 빈도가 가장 높은 "KBS-sports" 및 "KBS-movie"을 추천 콜사인으로 선택하고, 선택한 추천 콜사인 중에서 사용자가 원하는 채널을 선택하도록 유아이 영상(310)을 제공한다. 사용자는 유아이 영상(310)을 통하여 "KBS-sports" 및 "KBS-movie" 중에서 어느 하나의 콜사인 및 방송채널을 선택할 수 있다.As shown in FIG. 5, the controller 190 selects "KBS-sports" and "KBS-movie" having the highest selection frequency as the recommendation call sign on the display apparatus 100 based on the usage history information, and selects the selected recommendation. The infant provides an image 310 to select a desired channel from among the call signs. The user may select one of the call sign and the broadcasting channel from among "KBS-sports" and "KBS-movie" through the image 310.

또는, 도 6에 도시된 바와 같이, 제어부(190)는 "KBS-sports", "KBS-movie", "KBS-2", "KBS-1"의 모든 콜사인 후보 중에서 어느 하나를 선택 가능하게 제공하는 유아이 영상(320)을 제공할 수도 있다. 유아이 영상(320)에 검색된 모든 콜사인 후보를 표시하되, 제어부(190)는 선택 빈도에 따라서 각 콜사인들의 표시 순서를 결정할 수 있다. 예를 들면, 유아이 영상(320)은 가장 선택 빈도가 높은 순서대로 각 콜사인 후보들이 정렬되어 표시될 수 있다.Alternatively, as shown in FIG. 6, the controller 190 selectably provides any one of all call sign candidates of "KBS-sports", "KBS-movie", "KBS-2", and "KBS-1". An infant may provide an image 320. The infant may display all the call sign candidates found in the image 320, but the controller 190 may determine the display order of each call sign according to the selection frequency. For example, the infant image 320 may be displayed by arranging each callsign candidate in the order of the highest selection frequency.

만일, 이와 같은 유아이 영상(310, 320)이 표시된 시점에서 기 설정된 시간 동안에 사용자에 의한 선택이 수행되지 않으면, 제어부(190)는 콜사인 후보들 중에서 최우선순위의 채널, 예를 들면 가장 높은 선택 빈도의 "KBS-sports"를 선택하여 대응 동작을 수행한다.If the infant is not selected by the user for a preset time at the time when the images 310 and 320 are displayed, the controller 190 may determine the channel of the highest priority among the callsign candidates, for example, the highest selection frequency. Select "KBS-sports" to perform the corresponding action.

콜사인 후보들 중에서 추천 콜사인을 선택하는 선택조건은 상기한 예시와 상이한 실시예가 적용될 수 있다. 서버(10, 도 1 참조)에는 디스플레이장치(100) 이외의 다양한 타 디스플레이장치가 접속된다. 여기서, 해당 서버(10)는 STT서버(20) 또는 대화형 서버(30)와 동일한 서버이거나, 상이한 서버일 수 있다. 또한, STT서버(20) 및 대화형 서버(30)는 상이한 서버인 것으로 표현하였으나, 동일한 서버로 구현될 수도 있다.As a selection condition for selecting the recommended callsign from among the callsign candidates, an embodiment different from the above example may be applied. Various other display apparatuses other than the display apparatus 100 are connected to the server 10 (see FIG. 1). Here, the server 10 may be the same server as the STT server 20 or the interactive server 30, or may be a different server. Also, although the STT server 20 and the interactive server 30 are expressed as being different servers, they may be implemented with the same server.

이들 타 디스플레이장치는 각기 사용 이력 정보를 서버(10)에 전송한다. 서버(10)는 각각의 타 디스플레이장치로부터 수집한 사용 이력 정보에 기초하여 콜사인 후보 내의 "KBS-1", "KBS-2", "KBS-sports", "KBS-movie" 각각의 선택 빈도를 판단한다.These other display apparatuses respectively transmit usage history information to the server 10. The server 10 selects the frequency of selection of each of the "KBS-1", "KBS-2", "KBS-sports", and "KBS-movie" in the callsign candidate based on the usage history information collected from each other display apparatus. To judge.

제어부(190)는 "KBS-1", "KBS-2", "KBS-sports", "KBS-movie" 의 콜사인 후보를 서버(10)에 전송하고, 콜사인 후보 내에서 추천 후보를 선택해 줄 것으로 요청할 수 있다.The controller 190 transmits the callsign candidates of "KBS-1", "KBS-2", "KBS-sports", and "KBS-movie" to the server 10, and selects a recommendation candidate within the callsign candidate. You can request

이에, 서버(10)는 콜사인 후보 내에서, 타 디스플레이장치로부터의 사용 이력 정보에 기초한 선택 빈도 순위를 결정한다. 선택 빈도 순위가 "KBS-movie", "KBS-2", "KBS-sports", "KBS-1"이고, 디스플레이장치(100)로부터 요청받은 추천 후보의 수가 하나라고 할 때, 서버(10)는 선택 빈도 순위가 가장 높은 "KBS-movie"를 추천 후보로 결정하여 디스플레이장치(100)에게 알린다. 이에, 제어부(190)는 "KBS-movie"에 관련된 정보를 영상으로 표시할 수 있다.Accordingly, the server 10 determines the ranking of the selection frequency based on the usage history information from another display device in the call sign candidate. When the selection frequency rank is "KBS-movie", "KBS-2", "KBS-sports", "KBS-1", and the number of recommendation candidates requested from the display apparatus 100 is one, the server 10 Determines that the KBS-movie having the highest selection frequency rank is the recommendation candidate and informs the display apparatus 100. Thus, the controller 190 may display information related to "KBS-movie" as an image.

여기서, 제어부(190)는 하나의 추천 콜사인만을 선택하면, 자동으로 추천 콜사인의 방송채널 영상을 표시할 수 있다. 그런데, 지역 별로 콜사인에 대응하는 채널번호가 동일하지 않을 수 있다.In this case, when only one recommendation call sign is selected, the controller 190 may automatically display a broadcast channel image of the recommendation call sign. However, channel numbers corresponding to call signs may not be the same for each region.

따라서, 제어부(190)는 디스플레이장치(100)가 위치하는 지역정보를 취득하여, 해당 징역에 맞는 콜사인의 채널번호를 판단한다. 디스플레이장치(100)의 지역을 판단하는 방법은 다양하게 적용될 수 있는 바, 예를 들면 방송신호의 헤더 또는 메타데이터에 포함된 지역/국가 ID를 추출하거나, 통신부(140)의 맥 어드레스를 기초로 하여 서버(10)가 판단하거나, 또는 디스플레이장치(100)에 미리 사용자가 지역정보를 입력해 둘 수도 있다.Therefore, the controller 190 obtains local information where the display apparatus 100 is located and determines the channel number of the call sign corresponding to the imprisonment. The method of determining the region of the display apparatus 100 may be variously applied. For example, the region / country ID included in the header or metadata of the broadcast signal may be extracted, or based on the MAC address of the communicator 140. The server 10 may determine or the user may input local information in advance in the display apparatus 100.

이상 설명한 바와 같이, 디스플레이장치(100)는 사용자의 발화에 대응하는 음성 명령이 콜사인 관련 키워드를 포함하면 해당 키워드에 대응하는 추천 콜사인이 선택되도록 하고, 선택된 추천 콜사인의 방송 채널에 대하여 음성 명령에 따른 대응 동작을 수행할 수 있다.As described above, if the voice command corresponding to the user's utterance includes a call sign related keyword, the display apparatus 100 selects a recommendation call sign corresponding to the keyword and according to the voice command for the broadcast channel of the selected recommendation call sign. The corresponding operation may be performed.

도 7은 본 발명의 제2실시예에 따른 디스플레이장치(100) 및 서버(20, 30)의 인터랙션 과정을 나타내는 예시도이다.7 is an exemplary diagram illustrating an interaction process between the display apparatus 100 and the servers 20 and 30 according to the second exemplary embodiment of the present invention.

도 7에 도시된 바와 같이, 디스플레이장치(100)는 사용자로부터 발화가 입력되면(700), 해당 발화의 음성신호를 STT서버(20)에 전송한다(710).As illustrated in FIG. 7, when an utterance is input from the user (700), the display apparatus 100 transmits a voice signal of the corresponding utterance to the STT server 20 (710).

STT서버(20)는 수신된 음성신호를 음성 명령으로 변환한다(720). 이 단계까지는 앞선 도 4의 경우와 동일하다.The STT server 20 converts the received voice signal into a voice command (720). This step is the same as in the case of FIG. 4.

STT서버(20)는 음성 명령을 대화형 서버(30)에 전달한다(730).The STT server 20 transmits the voice command to the interactive server 30 (730).

대화형 서버(30)는 콜사인 후보의 검색, 추천 콜사인 선택과 같은 일련의 콜사인 분석 프로세스를 진행한다(740). 이에 관한 자세한 내용은 앞선 실시예의 경우를 응용할 수 있는 바, 자세한 설명을 생략한다. 다만, 본 실시예에서의 콜사인 분석 프로세스는 도 4의 경우와 달리, 대화형 서버(30)가 음성 명령에서 콜사인 키워드의 추출하여 진행된다.The interactive server 30 performs a series of callsign analysis processes, such as searching for callsign candidates and selecting recommended callsigns (740). Details of this can be applied to the case of the previous embodiment, so a detailed description thereof will be omitted. However, unlike the case of FIG. 4, the call sign analysis process according to the present embodiment proceeds by extracting the call sign keyword from the voice command.

대화형 서버(30)는 추천 콜사인 및 대응 동작을 지시하는 제어신호를 디스플레이장치(100)에 전송함으로써, 디스플레이장치(100)가 해당 제어신호에 따라서 동작하도록 한다(750).The interactive server 30 transmits a control signal indicating a recommended call sign and a corresponding operation to the display apparatus 100 so that the display apparatus 100 operates according to the control signal (750).

한편, 앞선 실시예에서는 디스플레이장치(100)에 입력된 사용자의 발화를 STT서버(20)에 의해 음성 명령으로 변환하고, 음성 명령이 단문이면 디스플레이장치(100)에서 처리되며 음성 명령이 대화문이면 대화형 서버(30)에 의해 처리되는 구성에 관하여 설명하였다.On the other hand, in the above embodiment, the user's utterance input to the display apparatus 100 is converted into a voice command by the STT server 20, and if the voice command is a short message, the display device 100 is processed and if the voice command is a dialogue, the conversation The configuration processed by the type server 30 has been described.

그러나, 본 발명의 사상이 이에 한정되지 않으며, 발화를 음성 명령으로 변환하는 구성과, 음성 명령의 단문/대화문 여부에 따라서 해당 음성 명령을 처리하는 주체에 관한 구성은 앞선 실시예와 상이한 구조로 구현될 수도 있다.However, the idea of the present invention is not limited to this, and a structure for converting a speech to a voice command and a subject for processing the voice command according to whether a voice command is a short message or a dialogue is different from the previous embodiment .

도 8은 제3실시예에 따른 디스플레이장치(100a) 및 서버(40)의 인터랙션 구조를 나타내는 구성 블록도이며, 도 9는 도 8의 디스플레이장치(100a) 및 서버(40)의 인터랙션 과정을 나타내는 예시도이다.FIG. 8 is a block diagram illustrating an interaction structure between the display apparatus 100a and the server 40 according to the third embodiment, and FIG. 9 illustrates an interaction process between the display apparatus 100a and the server 40 of FIG. 8. It is an illustration.

도 8에 도시된 바와 같이, 디스플레이장치(100a)는 통신부(140a)와, 음성입력부(160a)와, 음성처리부(170a)와, 제어부(190a)를 포함한다.8, the display device 100a includes a communication unit 140a, a voice input unit 160a, a voice processing unit 170a, and a control unit 190a.

여기서, 음성처리부(170a)는 음성입력부(160a)로부터 전달되는 발화를 음성 명령으로 변환하는 STT변환부(171a)와, 음성 명령이 단문일 경우에 이를 처리하는 단문 명령 처리부(172a)를 포함한다. Here, the voice processing unit 170a includes an STT converting unit 171a for converting the voice transmitted from the voice input unit 160a into a voice command, and a short message processing unit 172a for processing the voice command if the voice command is short .

음성입력부(160a)는 사용자로부터 발화가 입력되면, 입력된 발화의 음성신호를 STT변환부(171a)에 전달한다. STT변환부(171a)는 음성입력부(160a)로부터 전달된 음성신호를 분석하여, 해당 음성의 내용을 포함하는 음성 명령으로 변환한다. STT변환부(171a)는 변환한 음성 명령 제어부(190a)에 전달한다. 즉, STT변환부(171a)는 앞선 제1실시예의 STT서버(20)의 기능을 수행한다.The speech input unit 160a transmits the speech signal of the input speech to the STT conversion unit 171a when the speech is inputted from the user. The STT converting unit 171a analyzes the voice signal transmitted from the voice input unit 160a and converts the voice signal into a voice command including the voice. The STT converter 171a transmits the converted voice command controller 190a. That is, the STT conversion unit 171a performs the function of the STT server 20 of the first embodiment.

제어부(190a)는 음성 명령의 단문/대화문 여부를 판단한다. 제어부(190a)는 음성 명령이 단문이면 음성 명령을 단문 명령 처리부(172a)에 전달한다. 단문 명령 처리부(172a)는 제어부(190a)의 제어에 따라서 음성 명령을 분석하고, 분석 결과에 따라서 대응 동작을 실행한다. 음성 명령의 분석 및 실행에 관한 내용은 앞선 실시예를 응용할 수 있는 바, 자세한 설명을 생략한다.The controller 190a determines whether the voice command is a short / conversation text. The controller 190a transmits the voice command to the short command processor 172a when the voice command is a short text. The short sentence command processing unit 172a analyzes the voice command under the control of the control unit 190a, and executes the corresponding operation according to the analysis result. Since the above embodiment can be applied to the analysis and execution of the voice command, detailed description will be omitted.

반면, 제어부(190a)는 음성 명령이 대화문이면, 해당 음성 명령을 단문 명령 처리부(172a)에 전달하지 않고, 통신부(140a)를 통해 대화형 서버(40)에 전송한다. 대화형 서버(20)는 앞선 제1실시예의 대화형 서버(30)와 동일한 기능을 수행한다.On the other hand, if the voice command is a conversation text, the controller 190a does not transmit the voice command to the short command processor 172a but transmits the voice command to the interactive server 40 through the communication unit 140a. The interactive server 20 performs the same function as the interactive server 30 of the first embodiment described above.

이에, 디스플레이장치(100a)는 대화형 서버(20)로부터 수신한 제어신호에 대응하는 동작을 수행한다.Accordingly, the display apparatus 100a performs an operation corresponding to the control signal received from the interactive server 20.

도 9에 도시된 바와 같이, 디스플레이장치(100a)는 대화형 서버(40)에 통신 가능하게 접속한다. 디스플레이장치(100a)는 사용자로부터 발화가 입력되면(810), 해당 발화에 대응하는 음성 명령의 단문/대화문 여부를 판단한다(820). 디스플레이장치(100a)는 음성 명령이 대화문인 것으로 판단하면, 해당 음성 명령을 대화형 서버(40)에 전송한다(830).As shown in FIG. 9, the display apparatus 100a is communicatively connected to the interactive server 40. When the utterance is input from the user (810), the display apparatus 100a determines whether the voice command corresponding to the utterance is short / conversation 820. If the display apparatus 100a determines that the voice command is a conversation text, the display apparatus 100a transmits the voice command to the interactive server 40 in operation 830.

대화형 서버(40)는 디스플레이장치(100)로부터 음성 명령을 수신하면(910), 음성 명령으로부터 콜사인 관련 키워드를 추출한다(920).When the interactive server 40 receives a voice command from the display apparatus 100 (910), the interactive server 40 extracts a call sign related keyword from the voice command (920).

또는, 콜사인 관련 키워드가 대화형 서버(40)에 의해 추출되는 것이 아닌, 디스플레이장치(100a)에 의해 추출될 수도 있다. 이 경우, 디스플레이장치(100a)는 대화형 서버(40)에 대해 콜사인 관련 키워드 및 음성 명령을 함께 전송한다.Alternatively, the call sign related keyword may not be extracted by the interactive server 40 but may be extracted by the display apparatus 100a. In this case, the display apparatus 100a transmits call sign related keywords and voice commands to the interactive server 40 together.

대화형 서버(40)는 키워드에 대응하는 콜사인 후보를 검색한다(930). 대화형 서버(40)는 복수의 콜사인 후보가 검색되면, 검색된 복수의 콜사인 후보 중에서 앞서 설명한 바와 같은 선택조건에 따라서 추천 콜사인을 선택한다(940). 또한, 대화형 서버(40)는 음성 명령을 분석하여, 음성 명령에 따른 대응 동작을 판별한다.The interactive server 40 searches for a callsign candidate corresponding to the keyword (930). When a plurality of callsign candidates are found, the interactive server 40 selects a recommended callsign from the plurality of searched callsign candidates according to the selection condition as described above (940). In addition, the interactive server 40 analyzes the voice command to determine a corresponding operation according to the voice command.

대화형 서버(40)는 추천 콜사인 및 대응 동작을 지시하는 제어신호를 디스플레이장치(100a)에 전송한다(950).The interactive server 40 transmits a control signal indicating a recommended call sign and a corresponding operation to the display apparatus 100a (950).

디스플레이장치(100a)는 대화형 서버(40)로부터 수신한 제어신호에 따라서, 추천 콜사인의 방송 채널에 대한 대응 동작을 실행한다(840).The display apparatus 100a executes a corresponding operation for the broadcast channel of the recommended call sign according to the control signal received from the interactive server 40 (840).

한편, 디스플레이장치(100a)는 앞선 820 단계에서 음성 명령이 단문인 것으로 판단되면, 디스플레이장치(100a) 자체적으로 추천 콜사인 및 대응 동작을 분석한다.On the other hand, if it is determined in step 820 that the voice command is short, the display apparatus 100a analyzes the recommended call sign and the corresponding operation by the display apparatus 100a itself.

도 10은 본 발명의 제4실시예에 따른 디스플레이장치(100b)의 음성처리부(171b)의 신호 전달 구조를 나타내는 구성 블록도이다.10 is a block diagram illustrating a signal transmission structure of the voice processor 171b of the display apparatus 100b according to the fourth embodiment of the present invention.

도 10에 도시된 바와 같이, 음성처리부(170b)는 음성입력부(160b)로부터 전달되는 사용자의 발화를 음성 명령으로 변환하는 STT변환부(171b)와, STT변환부(171b)에 의해 변환된 음성 명령이 단문일 경우에 이를 처리하는 단문 명령 처리부(172b)와, STT변환부(171b)에 의해 변환된 음성 명령이 대화문/자연어일 경우에 이를 처리하는 대화형 명령 처리부(173b)를 포함한다. 음성처리부(170b)의 구조는 본 예시에 의해 한정되지 않으며, 본 예시는 본 발명의 실시예와 직접적인 연관이 있는 사항만을 간략히 표현한 것이다.As shown in FIG. 10, the voice processing unit 170b includes an STT converter 171b for converting a user's utterance transmitted from the voice input unit 160b into a voice command, and a voice converted by the STT converter 171b. A short command processing unit 172b for processing the command when the command is a short text, and an interactive command processing unit 173b for processing the voice command converted by the STT conversion unit 171b for the conversation / natural language. The structure of the voice processing unit 170b is not limited to this example, and this example is merely a brief representation of matters directly related to the embodiment of the present invention.

음성입력부(160b)는 사용자로부터 발화가 입력되면, 입력된 발화의 음성신호를 STT변환부(171b)에 전달한다. STT변환부(171b)는 음성입력부(160b)로부터 전달된 발화를, 해당 발화의 내용을 포함하는 음성 명령으로 변환한다. STT변환부(171b)는 변환한 음성 명령을 제어부(190b)에 전달한다.The speech input unit 160b, when a speech is input from the user, transmits the speech signal of the input speech to the STT conversion unit 171b. The STT conversion section 171b converts the speech delivered from the speech input section 160b into a speech command containing the content of the corresponding speech. The STT converter 171b transfers the converted voice command to the controller 190b.

제어부(190b)는 음성 명령이 단문인지 아니면 대화문인지 여부를 판단한다. 이 때, 단문 또는 대화문의 판단 여부는 다양한 알고리즘에 따라서 처리될 수 있다.The controller 190b determines whether the voice command is short or conversation. In this case, whether the short sentence or the dialogue sentence is determined may be processed according to various algorithms.

제어부(190b)는 음성 명령이 단문인 경우에는 해당 음성 명령을 단문 명령 처리부(172b)로 전달되게 한다. 반면, 제어부(190b)는 음성 명령이 대화문인 경우에는 해당 음성 명령을 대화형 명령 처리부(173b)로 전달한다.If the voice command is a short message, the controller 190b transmits the voice command to the short command processor 172b. On the other hand, when the voice command is a dialogue, the controller 190b transmits the voice command to the interactive command processor 173b.

단문 명령 처리부(172b)는 앞선 도 8의 단문 명령 처리부(172a)와 실질적으로 동일한 기능을 수행한다. 또한, 대화형 명령 처리부(173b)는 앞선 실시예들의 대화형 서버(30, 40)가 수행하는 기능을 수행한다.The short command processor 172b performs substantially the same function as the short command processor 172a shown in FIG. In addition, the interactive command processing unit 173b performs the functions performed by the interactive servers 30 and 40 of the preceding embodiments.

즉, 본 실시예에 따른 디스플레이장치(100b)는 앞선 실시예들과 달리, 외부 서버(20 내지 40)와의 데이터/신호 전송을 수행하지 않고, 디스플레이장치(100b) 자체적으로 사용자의 발화에 따른 음성 명령의 변환과, 해당 음성 명령에 대응하는 동작을 분석할 수 있다.That is, unlike the previous embodiments, the display apparatus 100b according to the present exemplary embodiment does not perform data / signal transmission with the external servers 20 to 40, and the display apparatus 100b itself generates a voice according to the user's speech. The translation of the command and the operation corresponding to the voice command can be analyzed.

한편, 디스플레이장치가 콜사인 분석 프로세스를 실행하는 별도의 서버(미도시)에 접속되어 있는 경우, 단문 명령 처리부(172a) 또는 대화형 명령 처리부(173b)는 키워드 및 음성 명령을 해당 서버(미도시)에 전송함으로써, 해당 서버(미도시)에서 콜사인 분석 프로세스가 수행되도록 할 수도 있다.On the other hand, when the display device is connected to a separate server (not shown) that executes a callsign analysis process, the short command processing unit 172a or the interactive command processing unit 173b sends keywords and voice commands to the corresponding server (not shown). In this case, the call sign analysis process may be performed at the server (not shown).

즉, 앞선 실시예들에서는 대화형 서버(30, 40) 또는 음성처리부(170, 170a, 170b)에서 콜사인 분석 프로세스가 수행되는 것으로 표현하였으나, 콜사인 분석 프로세스를 수행하는 별도의 서버(미도시)에 의해 해당 프로세스가 수행될 수도 있다.That is, in the above embodiments, although the call sign analysis process is expressed as being performed in the interactive server 30 or 40 or the voice processing units 170, 170a and 170b, a separate server (not shown) that performs the call sign analysis process is performed. The process may be performed by this.

상기한 실시예는 예시적인 것에 불과한 것으로, 당해 기술 분야의 통상의 지식을 가진 자라면 다양한 변형 및 균등한 타 실시예가 가능하다. 따라서, 본 발명의 진정한 기술적 보호범위는 하기의 특허청구범위에 기재된 발명의 기술적 사상에 의해 정해져야 할 것이다.The above-described embodiments are merely illustrative, and various modifications and equivalents may be made by those skilled in the art. Accordingly, the true scope of protection of the present invention should be determined by the technical idea of the invention described in the following claims.

10 : 서버
100 : 영상처리장치/디스플레이장치
110 : 영상수신부
120 : 영상처리부
130 : 디스플레이부
140 : 통신부
150 : 사용자입력부
160 : 음성입력부
170 : 음성처리부
180 : 저장부
190 : 제어부10: Server
100: image processing device / display device
110:
120:
130:
140:
150: User input
160:
170:
180:
190:

Claims

An image processing apparatus comprising:
An image processor which processes a broadcast signal received from the outside to be displayed as an image;
A communication unit communicatively connected to the server;
A voice input unit to which a user's utterance is input;
A voice processing unit for performing a predetermined corresponding operation according to a voice command corresponding to the utterance;
And a control unit for controlling the voice processing unit and the server to process the voice command corresponding to the utterance when the utterance is input through the voice input unit,
The controller may be configured to display at least one call sign corresponding to the keyword stored in either the voice processor or the server when the voice command includes a keyword related to a call sign of a broadcast channel. An image processing apparatus characterized by the above.

The method of claim 1,
At least one callsign candidate database corresponding to the keyword is stored in one of the image processing apparatus and the server, and at least one callsign corresponding to the keyword is selected through a search from the database. Processing unit.

The method of claim 1,
The preset selection condition may include selecting a call sign having the highest frequency of selection from the at least one call sign based on usage history information of the image processing apparatus.

The method of claim 1,
The preset selection condition may include selecting a call sign whose frequency of selection is greater than or equal to a predetermined order in a plurality of other image processing apparatuses communicating with the server.

3. The method of claim 2,
The controller may be configured to display the at least one call sign and to display an image of an infant providing the user to select one of the at least one call sign.

The method of claim 5, wherein
And the control unit displays the at least one call sign in the order of the predetermined selection condition.

The method of claim 6, wherein
And the controller is configured to display a cursor on a callsign having the highest rank among the at least one callsign displayed in the order of the predetermined selection condition.

The method of claim 5, wherein
The controller may be further configured to determine which one of the at least one call sign is selected from the user during a preset time after the image is displayed, based on a selection condition provided to be different from the preset selection condition. The image processing apparatus, characterized in that for selecting one of the call sign.

The method of claim 1,
The communication unit communicates with a speech-to-text (STT) server that converts the utterance into a voice command of text,
Wherein the control unit transmits the voice signal of the utterance to the STT server when the utterance is input to the voice input unit and receives the voice command corresponding to the utterance from the STT server.

10. The method of claim 9,
Wherein the control unit controls the voice command to be processed by the voice processing unit when the voice command is a short message and the voice command to be processed by the server when the voice command is a conversation. Device.

The method of claim 1,
And a display unit for displaying a broadcast signal processed by the image processor as an image.

In the control method of the image processing apparatus to communicate with the server,
Inputting a user's utterance;
Controlling the voice command corresponding to the speech to be processed by any one of the image processing apparatus and the server, and performing a preset corresponding operation according to the voice command;
The performing of the preset corresponding operation according to the voice command may include:
If the voice command includes a keyword related to a call sign of a broadcast channel, displaying at least one call sign corresponding to the keyword stored by either the image processing apparatus or the server. Control method of an image processing apparatus.

The method of claim 12,
At least one callsign candidate database corresponding to the keyword is stored in one of the image processing apparatus and the server, and at least one callsign corresponding to the keyword is selected through a search from the database. Control method of processing device.

The method of claim 12,
The preset selection condition is a control method of the image processing apparatus, characterized in that the call frequency having the highest selection frequency among the at least one call sign is selected based on the usage history information of the image processing apparatus.

The method of claim 12,
The preset selection condition is a control method of an image processing apparatus, characterized in that for selecting a call sign of the frequency of selection in the plurality of other image processing apparatuses communicating with the server or more than a predetermined order.

The method of claim 12,
And displaying the image by the infant providing the user to select any one of the at least one call sign when the at least one call sign is displayed.

The method of claim 16, wherein
And the infant displays the at least one call sign in the order of the predetermined selection condition in the image.

The method of claim 17, wherein
And displaying a cursor on a callsign having the highest rank among the at least one callsign displayed in the order of the predetermined selection condition in the image.

The method of claim 16, wherein
The displaying of the image by the infant may include selecting to be different from the preset selection condition when an input for selecting one of the at least one call sign is not performed by the user during a preset time after the image is displayed. And selecting one of the call signs based on a condition.

The method of claim 12,
The image processing apparatus communicates with the STT server for converting the speech into a voice command of the text,
Wherein the inputting of the user's utterance comprises:
Transmitting the voice signal of the utterance to the STT server;
And receiving the voice command corresponding to the utterance from the STT server.

21. The method of claim 20,
Controlling the voice command to be processed by the image processing apparatus when the voice command is a short text, and processing the voice command by the server when the voice command is a conversation text. Control method of an image processing apparatus.

In an image processing system,
An image processing apparatus for processing a broadcast signal received from the outside to be displayed as an image;
And a server for communicating with the image processing apparatus,
The image processing apparatus comprising:
A voice input unit to which a user's utterance is input;
A voice processing unit for performing a predetermined corresponding operation according to a voice command corresponding to the utterance;
And a control unit for controlling the voice processing unit and the server to process the voice command corresponding to the utterance when the utterance is input through the voice input unit,
The controller may be further configured to display at least one call sign corresponding to the keyword stored in one of the voice processor and the server when the voice command includes a keyword related to a call sign of a broadcast channel. Processing system.

The method of claim 22,
Further comprising a STT server for converting the speech into a voice command of the text,
The controller may be configured to transmit a voice signal of the speech to the STT server when the speech is input to the speech input unit, and receive the speech command corresponding to the speech from the STT server.