TWI820341B - Method for image tracking and display - Google Patents

Info

Publication number
TWI820341B
Authority
TW
Taiwan
Prior art keywords
image
display
trigger
bounding box
camera lens
Prior art date
Application number
TW109123845A
Other languages
Chinese (zh)
Other versions
TW202205850A (en)
Inventor
張森喬
吳明德
蔡茗光
Original Assignee
圓展科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 圓展科技股份有限公司
Priority to TW109123845A
Publication of TW202205850A
Application granted
Publication of TWI820341B

Landscapes

  • Studio Devices (AREA)
  • Burglar Alarm Systems (AREA)
  • Radar Systems Or Details Thereof (AREA)

Abstract

A method for image tracking and display includes the following procedures. A change detection procedure determines whether the number of targets in the image corresponding to a variable trigger area has increased or decreased. A display screen change procedure, when the result of the change detection procedure is "yes", adjusts the camera lens to capture the image within a trigger bounding box and outputs that image to a display. A stability detection procedure determines whether a change parameter of the image corresponding to the trigger bounding box is less than or equal to a preset change value. A target detection procedure performs target detection on the image corresponding to the trigger bounding box and generates an image display frame. An image adjustment procedure outputs the image within the image display frame to the display. The method thereby reduces how often the displayed picture changes.

Description

Image tracking and display method

The present invention relates to an image processing method, and in particular to an image tracking and display method.

As network technology has matured, meetings have shifted from the traditional face-to-face format to remote video conferencing. Beyond sufficient and stable network bandwidth to deliver stable audio and video signals, a video conference generally also needs the camera to present all participants completely on the display screen. Only then can the attendees on both sides know who is taking part in the meeting.

In addition, with advances in camera technology, cameras with a motorized pan-tilt-zoom (PTZ) function are commonly used in video conferencing to capture various images by adjusting the camera's pan angle, tilt angle, and focal length.

In one conventional image tracking technique, the system uses face detection to determine how people are distributed in the meeting space and transmits the images of all attendees to the display. Specifically, when a face is detected to appear, move, or disappear within the field of view of the camera lens, the picture shown on the display is adjusted immediately so that participants can follow the number and movement of attendees.

However, because this conventional approach relies on face detection, attendee movements such as swaying, turning the head, showing a side face, or changing position may all trigger the detection and tracking mechanism. As a result, the displayed picture changes constantly.

Therefore, how to resolve the frequent changes of the displayed picture caused by continuous face tracking is an important current issue.

In view of the above, an object of the present invention is to provide an image tracking and display method that makes the real-time image shown on the display more stable.

To achieve the above object, the present invention provides an image tracking and display method used with a camera lens and a display. The camera lens continuously captures an image, and a trigger bounding box and a variable trigger area are set for the image. The method includes a change detection procedure, a display screen change procedure, a stability detection procedure, a target detection procedure, and an image adjustment procedure. The change detection procedure determines whether the number of targets in the image corresponding to the variable trigger area has increased or decreased. When the result of the change detection procedure is "yes", the display screen change procedure adjusts the camera lens to capture the image within the trigger bounding box and outputs that image to the display. The stability detection procedure determines whether a change parameter of the image corresponding to the trigger bounding box is less than or equal to a preset change value. The target detection procedure performs target detection on the image corresponding to the trigger bounding box and generates an image display frame. The image adjustment procedure outputs the image within the image display frame to the display.

In one embodiment, the change detection procedure is executed again after the image adjustment procedure is completed.

In one embodiment, the image adjustment procedure further includes adjusting a zoom magnification, a pan angle, a tilt angle, or a combination thereof of the camera lens according to the image display frame.

In one embodiment, an initial target detection procedure and an initial image adjustment procedure are further included before the change detection procedure. The initial target detection procedure performs an initial target detection on the image corresponding to the trigger bounding box and generates an initial image display frame. The initial image adjustment procedure outputs the image within the initial image display frame to the display.

In one embodiment, the trigger bounding box corresponds to the widest focal length of the camera lens, or is a user-defined range smaller than the widest focal length of the camera lens.

In one embodiment, the target detection procedure is performed using a human-figure detection technique.

As described above, when the image tracking and display method of the present invention detects a change in the number of targets in the image, it immediately switches the image on the display to the image corresponding to the trigger bounding box, and after the movements of the targets in the image have stabilized, it gradually adjusts to the image corresponding to an appropriate image display frame. Accordingly, the size of the output image changes only when a change in the number of targets in the variable trigger area is detected. In all other, stable states, the image shown on the display is not continually adjusted or changed. Therefore, when the method is applied to a video conference, the image that participants see during the meeting is more stable.

11: Camera lens

12: Display

20: Conference room

F01: Image display frame

F02: Trigger bounding box

A01: Variable trigger area

P01~P07: Procedures

[FIG. 1] is a schematic diagram of an environment in which the image tracking and display method of the present invention is used.

[FIG. 2] is a schematic diagram of the frames that the image tracking and display method of the present invention defines on the image.

[FIG. 3] is a flowchart of the image tracking and display method according to a preferred embodiment of the present invention.

[FIG. 4A] to [FIG. 4C] are schematic diagrams of the frames defined on the image in the respective procedures of the image tracking and display method of the present invention.

To enable a person having ordinary skill in the art to understand and implement the present invention, suitable embodiments are described below with reference to the drawings, in which identical elements are denoted by identical reference signs.

Referring to FIG. 1, the image tracking and display method of the present invention is used with a camera lens 11 and a display 12. This embodiment describes the method as applied to a video conference, so the camera lens 11 and the display 12 are also part of the video conferencing system. The camera lens 11 is installed at the front of a conference room 20 and faces the interior of the conference room 20 to continuously capture an image. The display 12 can show the image captured by the camera lens 11 as well as the image output from the remote end. In this embodiment, the camera lens 11 has a motorized pan-tilt-zoom (PTZ) function and can adjust its zoom magnification, pan angle, and tilt angle.

Referring to FIG. 1 and FIG. 2 together, the video conferencing system sets an image display frame F01, a trigger bounding box F02, and a variable trigger area A01 for the image. The image within the image display frame F01 is the image shown on the display 12. It can be captured by the camera lens 11 using optical zoom together with motor-driven pan or tilt adjustment, or it can be selected by digital PTZ. The trigger bounding box F02 can be the boundary of the widest-angle field of view of the camera lens 11, or a region that the user defines through the video conferencing system. For example, when the camera lens 11 is set so wide that its field of view includes areas where no one will appear, such as the ceiling, the user can define the coverage of the trigger bounding box F02. In this embodiment, the trigger bounding box F02 is the boundary of the widest-angle field of view of the camera lens 11.

The image tracking and display method of the present invention is described below with reference to FIG. 3 and the description above. The method includes an initial target detection procedure P01, an initial image adjustment procedure P02, a change detection procedure P03, a display screen change procedure P04, a stability detection procedure P05, a target detection procedure P06, and an image adjustment procedure P07.

After the video conferencing system starts, the initial target detection procedure P01 is executed first. Referring to FIG. 3 and FIG. 4A together, the initial target detection procedure P01 performs an initial target detection on the image corresponding to the trigger bounding box F02 and generates an initial image display frame F01. The initial target detection detects whether any person is present in the image corresponding to the trigger bounding box F02 and identifies where each person is located. The initial image display frame F01 is then generated according to the detected persons and their locations.
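The text does not say how the frame is derived from the detected persons. As a minimal sketch, assuming the detector returns one rectangle per person, the frame could be the smallest rectangle that encloses every person, padded by a margin and clamped to the trigger bounding box F02. The `Box` type, the `margin` parameter, and the fallback to F02 when nobody is detected are all illustrative assumptions, not details from the patent.

```python
from typing import List, NamedTuple


class Box(NamedTuple):
    """Axis-aligned rectangle in pixel coordinates (hypothetical helper type)."""
    left: int
    top: int
    right: int
    bottom: int


def display_frame(persons: List[Box], trigger_box: Box, margin: int = 20) -> Box:
    """Return a frame enclosing all detected persons, clamped to the trigger box."""
    if not persons:
        # Assumption: with nobody detected, show the whole trigger bounding box.
        return trigger_box
    # Smallest rectangle around every person box, grown by the margin.
    left = min(p.left for p in persons) - margin
    top = min(p.top for p in persons) - margin
    right = max(p.right for p in persons) + margin
    bottom = max(p.bottom for p in persons) + margin
    # The frame must never extend beyond the trigger bounding box F02.
    return Box(
        max(left, trigger_box.left),
        max(top, trigger_box.top),
        min(right, trigger_box.right),
        min(bottom, trigger_box.bottom),
    )
```

With two people detected in a 1920x1080 trigger box, the frame tightens around both of them; a person near the edge simply pushes the frame up against the boundary of F02.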

Here, the target can be preset by the system or set by the user. Because this embodiment uses a video conference as the example, the target is the system default, a human figure. In other applications the target can also be a vehicle, an animal, or the like, and is not limited to people. In this embodiment, human-figure detection is implemented on artificial intelligence (AI) hardware using, for example, a module such as R-FCN (Region-based Fully Convolutional Network), SSD (Single Shot MultiBox Detector), or YOLOv2.

It is also worth noting that, after the initial image display frame F01 is generated, if its aspect ratio does not match the aspect ratio of the display, the method further includes compensating for the resulting image distortion.
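The compensation method itself is not specified. One common way to avoid distortion, shown here purely as an assumption, is to grow the frame about its center until it matches the display's aspect ratio, so the image never has to be stretched. The function name `fit_aspect` and the tuple layout `(left, top, right, bottom)` are hypothetical.

```python
from typing import Tuple

Frame = Tuple[float, float, float, float]  # (left, top, right, bottom)


def fit_aspect(frame: Frame, display_w: int, display_h: int) -> Frame:
    """Expand a frame about its center so its aspect ratio matches the display.

    Assumes the frame has positive width and height.
    """
    left, top, right, bottom = frame
    width = right - left
    height = bottom - top
    target = display_w / display_h
    if width / height < target:
        # Frame is too narrow for the display: widen it, keep the height.
        new_width = height * target
        center_x = (left + right) / 2
        return (center_x - new_width / 2, top, center_x + new_width / 2, bottom)
    # Frame is too wide (or already matches): make it taller, keep the width.
    new_height = width / target
    center_y = (top + bottom) / 2
    return (left, center_y - new_height / 2, right, center_y + new_height / 2)
```

The expanded frame can extend past the trigger bounding box, so a real system would still need to shift or clamp it; that step is left out of this sketch.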

The initial image adjustment procedure P02 outputs the image within the initial image display frame F01 to the display. This image can be captured by physically adjusting the camera lens mechanism or by digital computation. In this embodiment, it is obtained by adjusting the zoom magnification, pan angle, and tilt angle of the camera lens, and the captured image is output to the display continuously. This embodiment also checks whether the zoom magnification, pan angle, and tilt angle of the camera lens have reached their target positions: while adjustment is still in progress the check is repeated, and once adjustment is confirmed complete the next procedure begins.
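The repeated position check can be sketched as a polling loop. The `read_ptz` callable stands in for whatever interface reports the current pan, tilt, and zoom; it, the tolerance, and the timeout are assumptions made for illustration (the text only says the check repeats until adjustment is complete).

```python
import time
from typing import Callable, Tuple

PTZ = Tuple[float, float, float]  # (pan angle, tilt angle, zoom magnification)


def wait_until_settled(
    read_ptz: Callable[[], PTZ],
    target: PTZ,
    tolerance: float = 0.5,
    poll_interval: float = 0.05,
    timeout: float = 5.0,
) -> bool:
    """Poll the camera until pan, tilt, and zoom are all within tolerance of target.

    Returns True once the camera has settled, or False if the timeout
    (an added safeguard, not part of the described method) expires first.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        current = read_ptz()
        if all(abs(c - t) <= tolerance for c, t in zip(current, target)):
            return True
        time.sleep(poll_interval)
    return False
```

Only after this returns True would the flow move on to the next procedure.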

Note that the initial image display frame F01 described above is equivalent to the image display frame mentioned later. The word "initial" only indicates that it results from the first operation performed when the system starts.

Referring to FIG. 3 and FIG. 4B together, the change detection procedure P03 determines whether the number of targets in the image corresponding to the variable trigger area A01 has increased or decreased. The variable trigger area A01 is the region between the image display frame F01 and the trigger bounding box F02, and the change detection procedure P03 determines whether anyone has entered or left the variable trigger area A01. In other words, the result is based on a change in the number of human figures in the variable trigger area A01. Because the determination is based on human figures, the changed area must exceed a threshold, which avoids false triggers from small movements such as turning the head or swaying. In this embodiment, when the number of targets in the variable trigger area A01 has not changed, the change detection procedure P03 continues to run.
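A minimal sketch of this check follows, under two labeled assumptions: a person counts as being in A01 when the center of their detection box lies inside F02 but outside F01, and the area threshold is applied by ignoring detection boxes smaller than `min_area`. Neither rule is stated in the patent, and the `Box` type and function names are hypothetical.

```python
from typing import List, NamedTuple


class Box(NamedTuple):
    """Axis-aligned rectangle in pixel coordinates (hypothetical helper type)."""
    left: int
    top: int
    right: int
    bottom: int


def contains(box: Box, x: float, y: float) -> bool:
    """True if the point (x, y) lies inside the box, edges included."""
    return box.left <= x <= box.right and box.top <= y <= box.bottom


def count_in_trigger_area(persons: List[Box], frame: Box, trigger_box: Box,
                          min_area: int = 2000) -> int:
    """Count persons whose center is inside F02 but outside F01."""
    count = 0
    for p in persons:
        # Assumption: tiny boxes are treated as noise and skipped.
        if (p.right - p.left) * (p.bottom - p.top) < min_area:
            continue
        cx = (p.left + p.right) / 2
        cy = (p.top + p.bottom) / 2
        if contains(trigger_box, cx, cy) and not contains(frame, cx, cy):
            count += 1
    return count


def count_changed(previous_count: int, persons: List[Box], frame: Box,
                  trigger_box: Box, min_area: int = 2000) -> bool:
    """Result of the change detection: True means 'yes, the count changed'."""
    return count_in_trigger_area(persons, frame, trigger_box, min_area) != previous_count
```

People already inside F01 never contribute to the count, so their head turns and small shifts cannot trigger a change of the displayed picture.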

When the result of the change detection procedure P03 is "yes", the display screen change procedure P04 adjusts the camera lens to capture the image within the trigger bounding box F02 and outputs that image to the display. A "yes" result means that someone has entered or left the conference room, changing the number of people, and the camera lens is then adjusted to capture the image within the trigger bounding box F02. In this embodiment, this adjustment can be an instantaneous switch. Switching the display to the wide-angle view lets participants see what has changed in the conference room, and lets a person entering the room see from the display the area covered by the field of view of the camera lens and choose a suitable seat.

The stability detection procedure P05 determines whether a change parameter of the image corresponding to the trigger bounding box F02 is less than or equal to a preset change value. A frame difference algorithm, an erosion algorithm, or a dilation algorithm can be used to determine whether the movement of people has stabilized. This embodiment uses the frame difference algorithm, which analyzes the pixel differences between consecutive frames of the continuous image to obtain the change parameter. A change parameter greater than the preset change value indicates that people are still moving, while a change parameter less than or equal to the preset change value indicates that people have left the conference room (disappeared from the picture) or have settled into their seats. Once it is confirmed that people have settled, the following procedures are performed.
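The frame difference check can be illustrated in plain Python on small grayscale frames. How the change parameter is computed is not defined in the text; this sketch assumes it is the fraction of pixels whose brightness changed by more than `pixel_threshold` between two consecutive frames. Both threshold values are illustrative.

```python
from typing import Sequence

GrayFrame = Sequence[Sequence[int]]  # rows of 0-255 grayscale pixel values


def change_parameter(prev: GrayFrame, curr: GrayFrame, pixel_threshold: int = 25) -> float:
    """Fraction of pixels that differ by more than pixel_threshold between frames."""
    changed = 0
    total = 0
    for prev_row, curr_row in zip(prev, curr):
        for a, b in zip(prev_row, curr_row):
            total += 1
            if abs(a - b) > pixel_threshold:
                changed += 1
    return changed / total if total else 0.0


def is_stable(prev: GrayFrame, curr: GrayFrame, preset_change_value: float = 0.01) -> bool:
    """Stable when the change parameter is less than or equal to the preset value."""
    return change_parameter(prev, curr) <= preset_change_value
```

A production system would more likely run this on full frames with an image library; the nested lists here only keep the example self-contained.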

Next, referring to FIG. 3 and FIG. 4C together, the target detection procedure P06 performs target detection on the image corresponding to the trigger bounding box F02 and generates an image display frame F01. The target detection procedure P06 is similar to the initial target detection procedure P01 described above. The difference is that a different number of people in the image may change the size of the image display frame F01 and therefore the image shown on the display. Note that because the size of the image display frame F01 changes with the number of targets, the extent of the variable trigger area A01 changes accordingly.

The image adjustment procedure P07 outputs the image within the image display frame F01 to the display. It is similar to the initial image adjustment procedure P02 described above and is not described again here. After the image adjustment procedure P07 is completed, the change detection procedure P03 is executed again, forming a loop.
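The loop formed by procedures P03 to P07 can be summarized as a small state machine. This is a sketch of the control flow only: the `Procedure` enum and `next_procedure` function are hypothetical names, and the assumption that P05 repeats until the image is stable is inferred from the description rather than stated outright.

```python
from enum import Enum


class Procedure(Enum):
    CHANGE_DETECTION = "P03"
    DISPLAY_CHANGE = "P04"
    STABILITY_DETECTION = "P05"
    TARGET_DETECTION = "P06"
    IMAGE_ADJUSTMENT = "P07"


def next_procedure(current: Procedure, count_changed: bool = False,
                   stable: bool = False) -> Procedure:
    """Return the procedure that follows `current` in the loop."""
    if current is Procedure.CHANGE_DETECTION:
        # P03 keeps running until the target count in A01 changes.
        return Procedure.DISPLAY_CHANGE if count_changed else Procedure.CHANGE_DETECTION
    if current is Procedure.DISPLAY_CHANGE:
        return Procedure.STABILITY_DETECTION
    if current is Procedure.STABILITY_DETECTION:
        # Assumption: P05 repeats while people are still moving.
        return Procedure.TARGET_DETECTION if stable else Procedure.STABILITY_DETECTION
    if current is Procedure.TARGET_DETECTION:
        return Procedure.IMAGE_ADJUSTMENT
    # After P07 the flow returns to P03, closing the loop.
    return Procedure.CHANGE_DETECTION
```

Only two transitions depend on the image content, which is why the displayed picture stays fixed whenever the count in A01 is unchanged.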

In summary, when the image tracking and display method of the present invention detects a change in the number of targets in the image, it immediately switches the image on the display to the image corresponding to the trigger bounding box, and after the movements of the targets in the image have stabilized, it gradually adjusts to the image corresponding to an appropriate image display frame. Accordingly, the size of the output image changes only when a change in the number of targets in the variable trigger area is detected. In all other, stable states, the image shown on the display is not continually adjusted or changed. Therefore, when the method is applied to a video conference, the image that participants see during the meeting is more stable.

The above description is illustrative only and not restrictive. Any equivalent modification or change made without departing from the spirit and scope of the present invention shall be included in the appended claims.

Claims (8)

1. An image tracking and display method, used with a camera lens and a display, wherein the camera lens continuously captures an image, and a trigger bounding box and a variable trigger area are set for the image, the method comprising:
a change detection procedure, which determines whether the number of targets in the image corresponding to the variable trigger area has increased or decreased;
a display screen change procedure, which, when the result of the change detection procedure is "yes", adjusts the camera lens to capture the image within the trigger bounding box and outputs the image to the display;
a stability detection procedure, which determines whether a change parameter of the image corresponding to the trigger bounding box is less than or equal to a preset change value;
a target detection procedure, which performs target detection on the image corresponding to the trigger bounding box and generates an image display frame; and
an image adjustment procedure, which outputs the image within the image display frame to the display.

2. The image tracking and display method of claim 1, wherein the change detection procedure is executed again after the image adjustment procedure is completed.

3. The image tracking and display method of claim 1, wherein the image adjustment procedure further comprises adjusting a zoom magnification, a pan angle, a tilt angle, or a combination thereof of the camera lens according to the image display frame.

4. The image tracking and display method of claim 1, further comprising, before the change detection procedure:
an initial target detection procedure, which performs an initial target detection on the image corresponding to the trigger bounding box and generates an initial image display frame; and
an initial image adjustment procedure, which outputs the image within the initial image display frame to the display.

5. The image tracking and display method of claim 1, wherein the trigger bounding box corresponds to the widest focal length of the camera lens.

6. The image tracking and display method of claim 1, wherein the size of the trigger bounding box is smaller than the widest focal length of the camera lens.

7. The image tracking and display method of claim 1, wherein the target detection procedure is performed using a human-figure detection technique.

8. The image tracking and display method of claim 1, wherein the variable trigger area is the region between the trigger bounding box and the image display frame.
Priority Applications (1)

Application Number Priority Date Filing Date Title
TW109123845A TWI820341B (en) 2020-07-15 2020-07-15 Method for image tracking and display

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW109123845A TWI820341B (en) 2020-07-15 2020-07-15 Method for image tracking and display

Publications (2)

Publication Number Publication Date
TW202205850A TW202205850A (en) 2022-02-01
TWI820341B true TWI820341B (en) 2023-11-01

Family

ID=81323730

Family Applications (1)

Application Number Title Priority Date Filing Date
TW109123845A TWI820341B (en) 2020-07-15 2020-07-15 Method for image tracking and display

Country Status (1)

Country Link
TW (1) TWI820341B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI888137B (en) * 2024-05-15 2025-06-21 圓展科技股份有限公司 Obstacle avoidance camera system and method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100245532A1 (en) * 2009-03-26 2010-09-30 Kurtz Andrew F Automated videography based communications
TW201246942A (en) * 2011-04-11 2012-11-16 Intel Corp Object of interest based image processing

Also Published As

Publication number Publication date
TW202205850A (en) 2022-02-01
