TWI770762B - Audio and visual system and control method thereof

Info

Publication number: TWI770762B
Application number: TW110100990A
Authority: TW
Inventors: 潘慶元; 蔡敷恩
Original assignee: 圓展科技股份有限公司
Priority date: 2021-01-11
Filing date: 2021-01-11
Publication date: 2022-07-11
Also published as: TW202228431A

Abstract

An audio and visual system includes a speaker device and a camera device. The speaker device includes an ultrasound emitter. The camera device includes an ultrasound receiver and an audio source tracking circuit. The ultrasound emitter is configured to output an ultrasound signal. The ultrasound receiver is configured to receive the ultrasound signal and to output a notification signal according to the ultrasound signal. The audio source tracking circuit is configured to output a tracking signal according to the notification signal, and the camera device is configured, according to the tracking signal, either not to track the speaker device or to turn to track a sound source different from the speaker device. A control method of the audio and visual system is also disclosed herein.

Description

Audio-visual system and its control method

The present disclosure relates to an audio-visual system, and more particularly to an audio-visual system related to the manner in which a loudspeaker is tracked.

Where a conventional fixed camera cannot cover the entire monitored area, pan/tilt/zoom cameras, which have a larger field of coverage, have gradually replaced fixed cameras or are used together with them to obtain better tracking. In a video conference, the camera usually determines the local sound source and automatically turns to aim at the person speaking, so that remote participants can see that person clearly. In practice, however, the local loudspeaker plays the sound arriving from the far end, so the local end may mistake the loudspeaker for the person speaking and aim the camera at the loudspeaker. How to keep the camera operating correctly is therefore one of the current research and development topics in this field.

An embodiment of the present disclosure relates to an audio-visual system that includes a speaker device and a camera device. The speaker device includes an ultrasonic transmitter. The camera device includes an ultrasonic receiver and an audio source tracking circuit. The ultrasonic transmitter is configured to output an ultrasonic signal. The ultrasonic receiver is configured to receive the ultrasonic signal and to output a notification signal according to the ultrasonic signal. The audio source tracking circuit is configured to output a tracking signal according to the notification signal, and the camera device is configured, according to the tracking signal, either not to track the speaker device or to turn to track a sound source different from the speaker device.

An embodiment of the present disclosure relates to a control method of an audio-visual system. The control method includes: outputting an ultrasonic signal through an ultrasonic transmitter disposed in a speaker device; receiving the ultrasonic signal with an ultrasonic receiver disposed in a camera device, and outputting a notification signal according to the ultrasonic signal; outputting a tracking signal, according to the notification signal, through an audio source tracking circuit disposed in the camera device; and controlling the camera device, according to the tracking signal, either not to track the speaker device or to turn to a sound source different from the speaker device.

100: Audio-visual system

110: Network

112: Computer

114: Network receiver

116: Camera device

118: Audio source tracking circuit

120: Ultrasonic receiver

122: Speaker device

124: Ultrasonic transmitter

200: Audio-visual system

226: Resampler

228: Audio processing circuit

230: Computing circuit

232: Audio receiver

400: Audio-visual system

434: Locator

436: Masker

S302, S304, S306, S308, S310, S312, S314, S316, S318, S502, S504, S506, S508, S510, S512: Steps

US: Ultrasonic signal

NS: Notification signal

TS: Tracking signal

MS: Microphone signal

RS: Reference signal

AS: Audio signal

FAS: Far-end audio signal

APS: Audio processing signal

RNS: Area notification signal

Aspects of the present disclosure are best understood from the following detailed description when read with the accompanying drawings. It should be noted that, in accordance with standard practice in the industry, the various features are not drawn to scale. In fact, the dimensions of the various features may be arbitrarily increased or reduced for clarity of discussion.

FIG. 1 is a block diagram of an audio-visual system according to some embodiments of the present disclosure.

FIG. 2 is a block diagram of an audio-visual system according to some embodiments of the present disclosure.

FIG. 3 is a flowchart of a control method for an audio-visual system according to some embodiments of the present disclosure.

FIG. 4 is a block diagram of an audio-visual system according to some embodiments of the present disclosure.

FIG. 5 is a flowchart of a control method for an audio-visual system according to some embodiments of the present disclosure.

Terms used in this specification generally have their ordinary meanings in the art and in the specific context in which each term is used. The examples used in this specification, including examples of any terms discussed herein, are illustrative only and in no way limit the scope and meaning of the disclosure or of any exemplified term. Likewise, the present disclosure is not limited to the embodiments given in this specification.

As used herein, "comprising," "including," "having," "containing," "involving," and similar terms are to be understood as open-ended, that is, as meaning including but not limited to.

References throughout this specification to "one embodiment," "an embodiment," or "some embodiments" mean that a particular feature, structure, implementation, or characteristic described in connection with the embodiment is included in at least one embodiment of the present disclosure. Thus, the phrases "in one embodiment," "in an embodiment," or "in some embodiments" appearing in various places throughout this specification do not necessarily all refer to the same embodiment. Furthermore, the particular features, structures, implementations, or characteristics may be combined in any suitable manner in one or more embodiments.

FIG. 1 is a block diagram of an audio-visual system 100 according to some embodiments of the present disclosure. As shown in FIG. 1, in some embodiments the audio-visual system 100 includes a network 110, a computer 112, a camera device 116, and a speaker device 122. The computer 112 receives a far-end audio signal FAS transmitted from the network 110 and transmits an audio signal AS to the speaker device 122 through a Universal Serial Bus (USB) channel (not shown). The speaker device 122 plays sound according to the audio signal AS. The computer 112 also captures, through a USB channel (not shown), the images received by the camera device 116. In some embodiments, the camera device 116 may be implemented by, for example but not limited to, a pan/tilt/zoom (PTZ) camera.

As shown in FIG. 1, in some embodiments the computer 112 includes a network receiver 114. The network receiver 114 receives the far-end audio signal FAS transmitted from the network 110 and transmits the audio signal AS to the speaker device 122 through the Universal Serial Bus channel (not shown).

In a conventional approach, the camera device 116 determines the source of a sound and automatically aims at the person speaking, so that remote conference participants know who is currently speaking. However, when the local speaker device 122 plays the audio signal AS sent from the computer 112, the local camera device 116 may judge the speaker device 122 to be the person speaking and unexpectedly turn toward the speaker device 122.

In contrast, in embodiments of the present invention the speaker device 122 uses ultrasound as a medium to notify the camera device 116, so that the camera device 116 avoids tracking the speaker device 122. In some embodiments, the speaker device 122 determines whether it is itself playing sound and notifies the camera device 116 by ultrasound not to track. In some embodiments, the speaker device 122 notifies the camera device 116 continuously by ultrasound, so that the camera device 116 determines the position of the speaker device 122 and is allowed to track only areas other than that of the speaker device 122. The details are described with reference to the embodiments below.

In some embodiments, as shown in FIG. 1, the camera device 116 includes an audio source tracking circuit 118 and an ultrasonic receiver 120, and the speaker device 122 includes an ultrasonic transmitter 124. The ultrasonic transmitter 124 outputs an ultrasonic signal US to the ultrasonic receiver 120, and the ultrasonic receiver 120 generates a notification signal NS according to the ultrasonic signal US. The audio source tracking circuit 118 outputs a tracking signal TS according to the notification signal NS. In some embodiments, the camera device 116 turns, according to the tracking signal TS, to track a sound source different from the speaker device 122. In other embodiments, the camera device 116 refrains, according to the tracking signal TS, from tracking the speaker device 122.
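The signal chain of FIG. 1 (US to NS to TS) can be illustrated with a short sketch. The following Python is only a minimal model under stated assumptions and is not the circuitry of the disclosure: the class and function names are hypothetical, the notification signal NS is reduced to a single flag, and "not tracking" is modelled as holding the current pan angle.

```python
from dataclasses import dataclass


@dataclass
class NotificationSignal:
    """NS: produced by the ultrasonic receiver 120 (modelled as one flag)."""
    speaker_active: bool


@dataclass
class TrackingSignal:
    """TS: tells the camera device 116 whether it may follow the detected sound."""
    allow_tracking: bool


def ultrasonic_receiver(us_detected: bool) -> NotificationSignal:
    # Receiver 120: turn the presence of the ultrasonic signal US into NS.
    return NotificationSignal(speaker_active=us_detected)


def audio_source_tracking_circuit(ns: NotificationSignal) -> TrackingSignal:
    # Circuit 118: suppress tracking while the speaker device reports activity.
    return TrackingSignal(allow_tracking=not ns.speaker_active)


def camera_target(ts: TrackingSignal, detected_angle_deg: float,
                  current_angle_deg: float) -> float:
    # Camera device 116: hold the current pan angle when tracking is suppressed
    # (an assumption; the disclosure only says the speaker is not tracked).
    return detected_angle_deg if ts.allow_tracking else current_angle_deg
```

With the ultrasonic signal present, the camera in this model keeps its current angle; without it, the camera follows the detected sound as usual.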

FIG. 2 is a block diagram of an audio-visual system 200 according to some embodiments of the present disclosure. In some embodiments, the speaker device 122 shown in FIG. 2 may be implemented by, for example but not limited to, one or more loudspeakers, which may be used to play the audio signal AS.

For ease of understanding, elements in FIG. 2 that are similar to those of the embodiment of FIG. 1 are designated by the same reference numerals. Compared with the embodiment shown in FIG. 1, the speaker device 122 of the audio-visual system 200 in FIG. 2 further includes a resampler 226, an audio processing circuit 228, a computing circuit 230, and an audio receiver 232.

In some embodiments, as shown in FIG. 2, the audio receiver 232 receives the sound that the speaker device 122 itself emits when playing the audio signal AS, and generates a microphone signal MS. In some embodiments, the audio receiver 232 may receive the sound emitted by the one or more loudspeakers.

In some embodiments, the resampler 226 resamples the audio signal AS received by the speaker device 122 to generate a reference signal RS.
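As an illustration of the resampling step, the sketch below converts a block of samples from one rate to another. The disclosure does not state the sampling rates or the interpolation method; the function name `resample_linear`, the rates, and the use of linear interpolation are assumptions made here for illustration, and a practical resampler would normally add an anti-aliasing filter when reducing the rate.

```python
def resample_linear(samples, src_rate, dst_rate):
    """Resample a list of floats from src_rate to dst_rate.

    Uses plain linear interpolation between neighbouring input samples
    (an assumption; the disclosure does not name a method).
    """
    if not samples:
        return []
    n_out = max(1, int(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # input samples advanced per output sample
    out = []
    for i in range(n_out):
        pos = i * step
        left = int(pos)
        right = min(left + 1, len(samples) - 1)  # clamp at the last sample
        frac = pos - left
        out.append(samples[left] * (1.0 - frac) + samples[right] * frac)
    return out
```

In this model the output plays the role of the reference signal RS, and the input that of the audio signal AS.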

In some embodiments, the audio processing circuit 228 receives at least one of the microphone signal MS generated by the audio receiver 232 and the reference signal RS generated by the resampler 226, performs echo processing on the at least one signal, and outputs an audio processing signal APS to the computing circuit 230. In other embodiments, the audio processing circuit 228 receives both the reference signal RS and the microphone signal MS, performs echo processing on both, and outputs the audio processing signal APS to the computing circuit 230. In some embodiments, examples of the audio processing circuit 228 include, but are not limited to, an echo canceller (Acoustic Echo Cancellation; AEC), an echo suppressor (Acoustic Echo Suppression; AES), and the like.

In some embodiments, the computing circuit 230 receives the audio processing signal APS to determine whether the speaker device 122 is emitting sound. In some embodiments, when the computing circuit 230 determines that the speaker device 122 is emitting sound, the ultrasonic transmitter 124 in the speaker device 122 outputs the ultrasonic signal US to the ultrasonic receiver 120 of the camera device 116, so that the camera device 116 does not track the speaker device 122. In some embodiments, when the computing circuit 230 determines that the speaker device 122 is not emitting sound, the ultrasonic transmitter 124 in the speaker device 122 does not output the ultrasonic signal US, and the camera device 116 continues tracking.
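The decision made by the computing circuit 230 can be sketched as a simple level test. The disclosure does not specify how the decision is reached, so the following is only one plausible reading: it assumes the audio processing signal APS arrives as a frame of samples that still carries the loudspeaker's own output, and that a root-mean-square (RMS) level above a hypothetical threshold means the speaker device is emitting sound. The function names and the threshold value are illustrative assumptions.

```python
import math


def rms(frame):
    """Root-mean-square level of a frame of samples (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def speaker_is_emitting(aps_frame, threshold=0.01):
    # Computing circuit 230 (assumed rule): the speaker counts as emitting
    # sound when the frame level reaches the threshold.
    return rms(aps_frame) >= threshold


def ultrasonic_transmitter_enabled(aps_frame, threshold=0.01):
    # Transmitter 124 outputs US only while the speaker is judged to be emitting.
    return speaker_is_emitting(aps_frame, threshold)
```

A frame of silence leaves the transmitter off, so the camera keeps tracking; a frame with audible level turns it on.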

FIG. 3 is a flowchart of a control method of an audio-visual system according to some embodiments of the present disclosure. It should be understood that, unless their order is specifically stated, the steps mentioned in this embodiment may be reordered according to actual needs and may even be performed simultaneously or partially simultaneously. In some embodiments, the control method of FIG. 3 can be applied to the audio-visual system 200 shown in FIG. 2, but is not limited thereto. For clarity, the control method of FIG. 3 is described below with reference to the audio-visual system 200 shown in FIG. 2.

First, in step S302, the resampler 226 resamples the audio signal AS to generate the reference signal RS.

Next, in step S304, the audio receiver 232 receives the sound of the speaker device 122 to generate the microphone signal MS.

Next, in step S306, the audio processing circuit 228 performs echo processing on at least one of the microphone signal MS and the reference signal RS to output the audio processing signal APS to the computing circuit 230. As described above, in some embodiments the audio processing circuit 228 performs echo processing on both the microphone signal MS and the reference signal RS to output the audio processing signal APS to the computing circuit 230, thereby reducing interference from ambient sound when the computing circuit 230 operates.

Then, in step S308, the computing circuit 230 determines whether the speaker device 122 is emitting sound. If the speaker device 122 is emitting sound, the flow proceeds to step S310; if the speaker device 122 is not emitting sound, the flow proceeds to step S318.

When the speaker device 122 is emitting sound, in step S310, the ultrasonic transmitter 124 in the speaker device 122 outputs the ultrasonic signal US to the camera device 116.

Next, in step S312, the ultrasonic receiver 120 receives the ultrasonic signal US and outputs the notification signal NS to the audio source tracking circuit 118.

Next, in step S314, the audio source tracking circuit 118 outputs the tracking signal TS to the camera device 116 according to the notification signal NS.

Then, in step S316, the camera device 116 does not track the speaker device 122, according to the tracking signal TS.

On the other hand, when the speaker device 122 is not emitting sound, in step S318, the ultrasonic transmitter 124 does not output the ultrasonic signal US. The camera device 116 therefore continues tracking the target.
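The branch at step S308 and its effect on the camera can be traced with a small simulation. This is a sketch under assumptions, not the claimed method itself: steps S302 to S306 are collapsed into a precomputed flag per frame, "not tracking the speaker device" is modelled as holding the previous pan angle, and the function name `run_fig3_flow` and the input layout are hypothetical.

```python
def run_fig3_flow(frames, start_angle_deg=0.0):
    """Simulate the FIG. 3 flow over a sequence of frames.

    frames: list of (speaker_emitting, detected_angle_deg) tuples, where
    speaker_emitting stands for the outcome of step S308.
    Returns the camera pan angle after each frame.
    """
    angle = start_angle_deg
    history = []
    for speaker_emitting, detected_angle_deg in frames:
        us_sent = speaker_emitting      # S310 if emitting, S318 otherwise
        ns = us_sent                    # S312: receiver 120 outputs NS
        allow_tracking = not ns         # S314: circuit 118 outputs TS
        if allow_tracking:
            angle = detected_angle_deg  # normal tracking continues
        # else: S316, the camera does not follow the speaker's sound
        history.append(angle)
    return history
```

For example, with frames `[(False, 30.0), (True, -60.0), (False, 45.0)]` the camera moves to 30 degrees, stays there while the speaker device is playing, and then moves to 45 degrees.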

FIG. 4 is a block diagram of an audio-visual system 400 according to some embodiments of the present disclosure. As shown in FIG. 4, the audio-visual system 400 also includes the ultrasonic transmitter 124 shown in FIG. 2. In some embodiments, in contrast to the embodiment shown in FIG. 2, the ultrasonic transmitter 124 of the audio-visual system 400 continuously outputs the ultrasonic signal US to the ultrasonic receiver 120. In other embodiments, if the speaker device 122 shown in FIG. 4 is turned off and is not playing the audio signal AS, the ultrasonic transmitter 124 does not output the ultrasonic signal US, and the camera device 116 can continue tracking the target.

For ease of understanding, elements in FIG. 4 that are similar to those of the embodiment of FIG. 1 are designated by the same reference numerals. Compared with the embodiment shown in FIG. 1, the camera device 116 of the audio-visual system 400 in FIG. 4 further includes a locator 434 and a masker 436. The locator 434 receives the notification signal NS output by the ultrasonic receiver 120. The masker 436 determines the area in which the speaker device 122 is not tracked.

In some embodiments, the locator 434 determines, according to the notification signal NS, the relative position of the speaker device 122 with respect to the camera device 116. The relative position is, for example but not limited to, a horizontal angle, a vertical angle, and a distance.

In some embodiments, the masker 436 outputs, according to the relative position, an area notification signal RNS describing a mask area of the speaker device 122. The mask area is the area in which the camera device 116 does not track the speaker device 122. The extent of the mask area is, for example but not limited to, plus or minus 10 degrees horizontally and plus or minus 10 degrees vertically; other mask-area extents also fall within the scope of this disclosure.
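A mask area of this kind amounts to an angular window centred on the speaker device. The sketch below tests whether a detected sound direction falls inside it, using the example half-widths of 10 degrees from the text. The function names are hypothetical, and treating the horizontal angle as wrapping around at 360 degrees is an assumption added here so that directions near 0 and 359 degrees compare correctly.

```python
def angle_diff_deg(a, b):
    """Smallest signed difference a - b in degrees, in the range [-180, 180)."""
    return (a - b + 180.0) % 360.0 - 180.0


def in_mask(src_h, src_v, spk_h, spk_v, half_h=10.0, half_v=10.0):
    """True if a source at (src_h, src_v) lies inside the mask area centred
    on the speaker device at (spk_h, spk_v); all angles in degrees."""
    return (abs(angle_diff_deg(src_h, spk_h)) <= half_h
            and abs(src_v - spk_v) <= half_v)
```

A source 5 degrees away from the speaker device in both directions is inside the mask area; one 15 degrees away horizontally is outside it and remains a valid tracking target.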

In some embodiments, the audio source tracking circuit 118 receives the area notification signal RNS and outputs the tracking signal TS to the camera device 116. In other embodiments, the audio source tracking circuit 118 determines, according to the notification signal NS, the relative position of the camera device 116 and the speaker device 122 and the extent of the mask area.

FIG. 5 is a flowchart of a control method for an audio-visual system according to some embodiments of the present disclosure. It should be understood that, unless their order is specifically stated, the steps mentioned in this embodiment may be reordered according to actual needs and may even be performed simultaneously or partially simultaneously. The control method of FIG. 5 can be applied to the audio-visual system 400 shown in FIG. 4, but is not limited thereto. For clarity, the control method of FIG. 5 is described below with reference to the audio-visual system 400 shown in FIG. 4.

First, in step S502, the ultrasonic transmitter 124 outputs the ultrasonic signal US and transmits it to the ultrasonic receiver 120.

Next, in step S504, the ultrasonic receiver 120 receives the ultrasonic signal US and outputs the notification signal NS to the locator 434.

Next, in step S506, the locator 434 receives the notification signal NS and determines, according to the notification signal NS, the relative position of the speaker device 122 and the camera device 116.

Next, in step S508, the masker 436 outputs, according to the relative position, the area notification signal RNS describing the mask area of the speaker device 122 to the audio source tracking circuit 118.

Then, in step S510, the audio source tracking circuit 118 receives the area notification signal RNS and outputs the tracking signal TS to the camera device 116.

Then, in step S512, the camera device 116 turns, according to the tracking signal TS, to a sound source different from the speaker device 122.
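Steps S508 to S512 can be sketched as selecting a sound source that lies outside the mask area. The following is a minimal illustration under assumptions, not the disclosed implementation: the candidate sources are given as hypothetical `(horizontal_deg, vertical_deg, level)` tuples, the loudest unmasked source is chosen (the disclosure does not state a selection rule), and the function names are invented for this example.

```python
def _wrap_deg(a):
    """Wrap an angle difference into the range [-180, 180) degrees."""
    return (a + 180.0) % 360.0 - 180.0


def pick_target(sources, speaker_h, speaker_v, half_h=10.0, half_v=10.0):
    """Return the loudest source outside the mask area, or None.

    sources: list of (horizontal_deg, vertical_deg, level) tuples.
    The mask area is centred on the speaker device at (speaker_h, speaker_v).
    """
    candidates = [
        s for s in sources
        if abs(_wrap_deg(s[0] - speaker_h)) > half_h
        or abs(s[1] - speaker_v) > half_v
    ]
    if not candidates:
        return None  # every detected source is the masked speaker device
    return max(candidates, key=lambda s: s[2])
```

Even when the speaker device is the loudest source in the room, this selection skips it and returns the next source, which corresponds to the camera turning to a sound source different from the speaker device.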

The foregoing outlines features of several embodiments so that those skilled in the art may better understand the aspects of the present disclosure. Those skilled in the art should appreciate that they may readily use the present disclosure as a basis for designing or modifying other processes and structures to carry out the same purposes and/or achieve the same advantages as the embodiments introduced herein. Those skilled in the art should also realize that such equivalent constructions do not depart from the spirit and scope of the present disclosure, and that various changes, substitutions, and alterations may be made herein without departing from the spirit and scope of the present disclosure.

Claims

An audio-visual system, comprising: a speaker device comprising an ultrasonic transmitter configured to output an ultrasonic signal, wherein the ultrasonic transmitter outputs the ultrasonic signal when the speaker device emits a sound according to an audio signal; and a camera device comprising: an ultrasonic receiver configured to receive the ultrasonic signal and to output a notification signal according to the ultrasonic signal; and an audio source tracking circuit configured to output a tracking signal according to the notification signal, wherein the camera device is configured, according to the tracking signal, not to track the speaker device that emits the sound or to turn to track a sound source different from the speaker device.

The audio-visual system of claim 1, wherein the speaker device further comprises: an audio receiver configured to receive the sound emitted by the speaker device to generate a microphone signal; and a computing circuit configured to determine, according to a reference signal and the microphone signal, whether the speaker device emits the sound, wherein when the computing circuit determines that the speaker device emits the sound, the ultrasonic transmitter in the speaker device outputs the ultrasonic signal to the camera device.

The audio-visual system of claim 2, wherein when the computing circuit determines that the speaker device does not emit the sound, the ultrasonic transmitter in the speaker device does not output the ultrasonic signal, and the camera device continues tracking.

The audio-visual system of claim 2, wherein the speaker device further comprises: a resampler configured to resample an audio signal received by the speaker device to generate the reference signal.

The audio-visual system of claim 4, wherein the speaker device further comprises: an audio processing circuit configured to perform echo processing on at least one of the microphone signal and the reference signal to output an audio processing signal to the computing circuit.

The audio-visual system of claim 1, wherein the camera device further comprises: a locator configured to receive the notification signal output by the ultrasonic receiver and to determine, according to the notification signal, a relative position of the speaker device and the camera device; and a masker configured to output, according to the relative position, an area notification signal about a mask area of the speaker device, wherein the mask area is an area in which the camera device does not track the speaker device, and wherein the audio source tracking circuit is configured to receive the area notification signal to output the tracking signal.

A control method of an audio-visual system, comprising: outputting an ultrasonic signal by an ultrasonic transmitter disposed in a speaker device, wherein the ultrasonic transmitter outputs the ultrasonic signal when the speaker device emits a sound according to an audio signal; receiving the ultrasonic signal by an ultrasonic receiver disposed in a camera device, and outputting a notification signal according to the ultrasonic signal; outputting a tracking signal, according to the notification signal, through an audio source tracking circuit disposed in the camera device; and controlling the camera device, according to the tracking signal, not to track the speaker device that emits the sound or to turn to a sound source different from the speaker device.

The method of claim 7, further comprising: receiving the sound emitted by the speaker device through an audio receiver disposed in the speaker device to generate a microphone signal; determining, by a computing circuit disposed in the speaker device, whether the speaker device emits the sound according to a reference signal and the microphone signal; and when the computing circuit determines that the speaker device emits the sound, outputting the ultrasonic signal to the camera device by the ultrasonic transmitter in the speaker device.

The method of claim 8, further comprising: when the computing circuit determines that the speaker device does not emit the sound, not outputting the ultrasonic signal from the ultrasonic transmitter in the speaker device, wherein the camera device continues tracking.

The method of claim 8, further comprising: resampling an audio signal received by the speaker device, by a resampler disposed in the speaker device, to generate the reference signal.

The method of claim 10, further comprising: performing echo processing on at least one of the microphone signal and the reference signal, by an audio processing circuit disposed in the speaker device, to output an audio processing signal to the computing circuit.

The method of claim 7, further comprising: receiving the notification signal output by the ultrasonic receiver through a locator disposed in the camera device, and determining a relative position of the speaker device and the camera device according to the notification signal; outputting, by a masker disposed in the camera device and according to the relative position, an area notification signal about a mask area of the speaker device, wherein the mask area is an area in which the camera device does not track the speaker device; and receiving the area notification signal through the audio source tracking circuit to output the tracking signal.