[go: up one dir, main page]

TWI623889B - 3d hand gesture image recognition method and system thereof - Google Patents

3d hand gesture image recognition method and system thereof Download PDF

Info

Publication number
TWI623889B
TWI623889B TW104108504A TW104108504A TWI623889B TW I623889 B TWI623889 B TW I623889B TW 104108504 A TW104108504 A TW 104108504A TW 104108504 A TW104108504 A TW 104108504A TW I623889 B TWI623889 B TW I623889B
Authority
TW
Taiwan
Prior art keywords
gesture
image
light field
gesture image
dimensional
Prior art date
Application number
TW104108504A
Other languages
Chinese (zh)
Other versions
TW201635196A (en
Inventor
王敬文
劉兆恆
Original Assignee
國立高雄應用科技大學
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 國立高雄應用科技大學 filed Critical 國立高雄應用科技大學
Priority to TW104108504A priority Critical patent/TWI623889B/en
Publication of TW201635196A publication Critical patent/TW201635196A/en
Application granted granted Critical
Publication of TWI623889B publication Critical patent/TWI623889B/en

Links

Landscapes

  • Image Analysis (AREA)

Abstract

一種三維手勢影像辨識系統包含一光場攝影單元、一演算單元及一輸出單元。利用該光場攝影單元攝取一手勢動作,以獲得一3D手勢影像。該演算單元連接至該光場攝影單元,再將該3D手勢影像投影至一預定辨識空間,以獲得一特徵向量,再將該特徵向量與數個樣本特徵向量以進行比對分類,以分類該3D手勢影像,以便辨識該3D手勢影像之種類。該輸出單元連接至該演算單元,以便輸出該3D手勢影像之種類。 A three-dimensional gesture image recognition system includes a light field photographing unit, a calculation unit and an output unit. A light gesture is taken by the light field photographing unit to obtain a 3D gesture image. The calculation unit is connected to the light field photography unit, and then the 3D gesture image is projected into a predetermined recognition space to obtain a feature vector, and then the feature vector and the plurality of sample feature vectors are compared and classified to classify the 3D gesture image to identify the type of the 3D gesture image. The output unit is connected to the calculation unit to output the type of the 3D gesture image.

Description

三維手勢影像辨識方法及其系統 Three-dimensional gesture image recognition method and system thereof

本發明係關於一種三維手勢影像辨識方法及其系統;特別是關於一種採用各種光場攝影〔light field capturing〕之三維手勢影像辨識方法及其系統。 The invention relates to a three-dimensional gesture image recognition method and a system thereof; in particular, to a three-dimensional gesture image recognition method and a system thereof using various light field capturing.

習用手勢影像辨識之相關應用裝置,例如:中華民國專利公告第M382675號之〝以手勢辨識為基礎之監控攝影機操控裝置〞新型專利,其揭示有關於一種可以下達監控攝影機鏡頭上下左右轉向、拉近及拉遠等動作指令的操控裝置。利用手勢取像攝影機所拍攝使用者的手勢影像,辨識出使用者手勢的上、下、左、右、前、後移動情形,並對監控攝影機發出鏡頭往上、往下、往左、往右、拉近、拉遠的操控訊息,而不需藉由操作滑鼠或操縱桿來下達上述操控訊息。 A related application device for recognizing gesture image recognition, for example, the Republic of China Patent Publication No. M382675, a new type of surveillance camera control device based on gesture recognition, which discloses that a camera lens can be turned up and down, left and right, and zoomed in. And remote control and other action command control devices. Using the gesture image of the user captured by the gesture camera to recognize the movement of the user's gestures up, down, left, right, front, and back, and to send the camera up, down, left, and right to the surveillance camera. To zoom in and out, without having to manipulate the mouse or joystick to release the above control message.

前述第M382675號先利用手勢取像攝影機拍攝使用者手部影像,再利用手勢位移偵測模組偵測與辨識使用者的手部位置與趨勢,進而計算出欲控制監控攝影機的方向,再傳遞訊號控制監控攝影機之鏡頭執行往上、往下、往左、往右、拉近及拉遠的動作。該手勢位移偵測模組先計算出手勢取像攝影機中之手部出現的位置與面積,做為手勢辨識的原點,此部份亦可以在手勢取像攝影機的畫面上預先設定一手勢辨識的原點,做為後續手勢辨識的位移偵測參考點,並分別定義出一個手勢最小面積的門檻值、一個手勢最小移動距離的門檻值及一個手勢最小面積 變化的門檻值。 The aforementioned M382675 first uses a gesture camera to capture the user's hand image, and then uses the gesture displacement detection module to detect and recognize the user's hand position and trend, thereby calculating the direction of the camera to be controlled, and then transmitting The signal control monitors the lens of the camera to perform actions of going up, down, left, right, zooming in and out. The gesture displacement detecting module first calculates the position and area of the hand in the gesture camera, and serves as the origin of the gesture recognition. This part can also preset a gesture recognition on the screen of the gesture camera. The origin is used as the displacement detection reference point for subsequent gesture recognition, and defines the threshold value of the minimum area of a gesture, the threshold value of the minimum moving distance of a gesture, and the minimum area of a gesture. The threshold for change.

前述第M382675號在使用者的手勢面積大小大於預先設定之手勢最小面積門檻值後,裝置便會開始啟動手勢辨識功能,利用該手勢位移偵測模組偵測目前手勢位置與手勢原點的位移量。當手勢移動的位移量大於預先設定的門檻值時,則視為有手勢移動發生,反之,當手勢移動的位移量小於門檻值時,則視為沒有手勢移動發生;又,該手勢位移偵測模組亦會偵測目前手勢位置與手勢原點的位移方向,若手勢往上、下、左或右移動,則觸發一控制訊號以驅動監控攝影機鏡頭往上、下、左或右移動。另外,該手勢位移偵測模組以影像辨識技術判斷手勢在活動範圍內的前後移動;手勢往前移時其面積會比面積門檻值大,則觸發一控制訊號以驅動監控攝影機鏡頭拉近影像;手勢往後移時其面積會比面積門檻值小,則觸發一控制訊號以驅動監控攝影機鏡頭拉遠影像。 In the foregoing No. M382675, after the size of the gesture area of the user is greater than a preset minimum threshold value of the gesture, the device starts to activate the gesture recognition function, and uses the gesture displacement detection module to detect the displacement of the current gesture position and the origin of the gesture. the amount. When the displacement of the gesture movement is greater than a preset threshold value, the gesture movement is considered to occur. Conversely, when the displacement amount of the gesture movement is less than the threshold value, no gesture movement occurs; and the gesture displacement detection is performed. The module also detects the current gesture position and the direction of displacement of the gesture origin. If the gesture moves up, down, left or right, a control signal is triggered to drive the camera lens to move up, down, left or right. In addition, the gesture displacement detection module uses image recognition technology to determine the movement of the gesture in the range of motion; when the gesture moves forward, the area thereof is larger than the area threshold, and a control signal is triggered to drive the surveillance camera lens to zoom in. When the gesture moves backwards, its area will be smaller than the area threshold, triggering a control signal to drive the camera lens to zoom out.

另一習用手勢影像辨識之相關應用裝置,例如:中華民國專利公告第I298461號之〝手勢辨識系統及其方法〞發明專利,其揭示一種手勢辨識系統及其方法應用於一具有影像擷取器之筆記型電腦。使用者可直接對準此影像擷取器比出一預設的手勢,而筆記型電腦即會執行此手勢動作相對應之應用程式或是功能選項,以增加使用者執行應用程式或是功能選項時之方便性。 Another application device for recognizing gesture image recognition, for example, the gesture recognition system of the Republic of China Patent No. I298461 and its method and invention patent, which discloses a gesture recognition system and a method thereof applied to an image capture device Notebook computer. The user can directly align the image capture device with a preset gesture, and the notebook computer will execute the corresponding application or function option of the gesture action to increase the user execution application or function options. The convenience of time.

另一習用手勢影像辨識之相關應用裝置,例如:中華民國專利公開第201030630號之〝手勢辨識系統及其方法〞發明專利,其揭示一種手勢辨識系統包括:一攝影裝置用於取得可能含有自然手勢的影像;一處理器用以從影像中找出膚色部份的膚色輪廓〔edge〕,再將膚色輪廓分類為多個不同角度的輪廓碎片;一運算引擎具有數個平行運算單元及數個不同角度類別的手勢模板庫,該數 個平行運算單元分別在不同角度類別的手勢模板庫中找出和輪廓碎片最近似的手勢模板;一最佳模板選取手段,自由該數個平行運算單元找出的數個近似的手勢模板中再選出一個最佳的手勢模板;及一顯示終端用以顯示最佳的手勢模板的影像;藉此達到無需使用任何標記〔marker less〕且能夠即時辨識手勢的目的。 Another application device for recognizing gesture image recognition, for example, the gesture recognition system of the Republic of China Patent Publication No. 201030630 and the method thereof, and a method for identifying a gesture recognition system comprising: a photographing device for obtaining a possible natural gesture An image is used to find the skin contour of the skin color portion from the image, and then classify the skin color contour into a plurality of contour fragments of different angles; an arithmetic engine has a plurality of parallel computing units and a plurality of different angles; Category of gesture template library, the number The parallel computing units respectively find the gesture template that is closest to the contour fragments in the gesture template library of different angle categories; an optimal template selection method, which is free from the several approximate gesture templates found by the plurality of parallel operation units. Selecting an optimal gesture template; and displaying an image of the best gesture template by the display terminal; thereby achieving the purpose of not recognizing the marker without using any marker.

另一習用手勢影像辨識相關方法及系統,例如:中華民國專利第I431538號之〝基於影像之動作手勢辨識方法及系統〞發明專利,其揭示一種基於影像之動作手勢辨識方法及系統。該方法包含下列步驟:接收複數張影像畫面;根據複數張影像畫面執行一手勢偵測,以得到一第一手勢,如果該第一手勢符合一預設開始手勢,則根據複數張影像畫面中手部位置執行一移動追蹤,以取得一移動手勢;於執行移動追蹤之過程中,根據複數張影像畫面執行手勢偵測,以得到一第二手勢,若該第二手勢符合一預設結束手勢,停止移動追蹤。 Another conventional gesture image recognition related method and system, for example, the image-based motion gesture recognition method and system invention patent of the Republic of China Patent No. I431538, which discloses an image-based motion gesture recognition method and system. The method includes the steps of: receiving a plurality of image frames; performing a gesture detection according to the plurality of image images to obtain a first gesture, and if the first gesture conforms to a preset start gesture, according to the plurality of image frames The part position performs a movement tracking to obtain a movement gesture; during the execution of the movement tracking, performing gesture detection according to the plurality of image images to obtain a second gesture, if the second gesture meets a preset end Gesture, stop moving tracking.

另一習用手勢影像辨識相關方法及系統,例如:發明人提出中華民國專利第I444907號之〝採用奇異值分解處理複雜背景之手勢影像辨識方法及其系統〞發明專利,其揭示一種採用奇異值分解處理複雜背景之手勢影像辨識方法。該方法包含:利用一奇異值分解法分解一原始手勢影像,以獲得一增益手勢影像;自該增益手勢紋影像去除深色背景,以獲得至少一類皮膚圖素區塊;及利用一膚色偵測方法於該類皮膚圖素區塊進行膚色偵測,以去除該類皮膚圖素區塊之剩餘背景。該手勢影像辨識系統包含一輸入單元、一演算單元及一輸出單元。該輸入單元用以輸入該原始手勢影像,該演算單元用以自該增益手勢紋影像去除深色背景及剩餘背景,而該輸出單元用以輸出一膚色手勢影像。 Another conventional method and system for recognizing gesture image recognition, for example, the inventor proposes a gesture image recognition method using singular value decomposition to deal with complex backgrounds and a system for inventing the patent of the Republic of China Patent No. I444907, which discloses a singular value decomposition Handling image recognition methods for complex backgrounds. The method comprises: decomposing an original gesture image by using a singular value decomposition method to obtain a gain gesture image; removing a dark background from the gain gesture image to obtain at least one type of skin pixel block; and utilizing a skin color detection method The method performs skin color detection on the skin pixel block to remove the remaining background of the skin pixel block. The gesture image recognition system comprises an input unit, a calculation unit and an output unit. The input unit is configured to input the original gesture image, the calculation unit is configured to remove the dark background and the remaining background from the gain gesture image, and the output unit is configured to output a skin motion gesture image.

另一習用手勢影像辨識相關方法及系統,例如:發明人提出中華民國專利第I444908號之〝採用影像方向對正處理之手勢影像辨識方法及其系統〞發明專利,其揭示一種採用影像方向對正處理之手勢影像辨識方法。該方法包含:輸入一膚色手勢影像;於該膚色手勢影像計算一全域性質心;於該膚色手勢影像選擇一感興趣區塊;利用該感興趣區塊選擇一子區域;於該子區域計算一區域性質心;利用該全域性質心及區域性質心計算一對正角度。該手勢影像辨識系統包含一輸入單元、一演算單元及一輸出單元。該輸入單元用以輸入該膚色手勢影像,該演算單元於該膚色手勢影像選擇該感興趣區塊及子區域,以計算該全域性質心及區域性質心,再計算該對正角度,而該輸出單元用以輸出該對正角度。 Another conventional method and system for recognizing gesture image recognition, for example, the inventor proposes a method for recognizing a gesture image using image direction alignment and a system for inventing the invention after the patent of the Republic of China Patent No. I444908, which discloses a method of correcting image orientation Handling gesture image recognition method. The method includes: inputting a skin color gesture image; calculating a global heart in the skin color gesture image; selecting a region of interest in the skin color gesture image; selecting a sub region using the region of interest; and calculating a subregion in the subregion Regional nature; use this global nature and regional nature to calculate a pair of positive angles. The gesture image recognition system comprises an input unit, a calculation unit and an output unit. The input unit is configured to input the skin color gesture image, and the calculation unit selects the region of interest and the sub-region in the skin color gesture image to calculate the global nature heart and the region property core, and then calculate the alignment angle, and the output is The unit is used to output the pair of positive angles.

另一習用手勢影像辨識相關方法及系統,例如:發明人提出中華民國專利第I444909號之〝採用奇異值分解進行光線補償處理之手勢影像辨識方法及其系統〞發明專利,其揭示一種採用奇異值分解光線補償處理之手勢影像辨識方法。該方法包含:輸入一手勢影像;利用一奇異值分解法應用於該手勢影像;以一光線補償法計算至少一光線補償係數;及利用該光線補償係數進行光線補償處理該手勢影像,以獲得一光線補償手勢影像。該手勢影像辨識系統包含一輸入單元、一演算單元及一輸出單元。該輸入單元用以輸入該原始手勢影像,該演算單元以該光線補償法計算該光線補償係數,並利用該光線補償係數進行光線補償處理該手勢影像,以獲得該光線補償手勢影像,而該輸出單元用以輸出該光線補償手勢影像。 Another conventional method and system for recognizing gesture image recognition, for example, the inventor proposes a method for recognizing a gesture image using singular value decomposition for gamma compensation, and a system for inventing the invention, which discloses a singular value The gesture image recognition method for decomposing the light compensation processing. The method comprises: inputting a gesture image; applying a singular value decomposition method to the gesture image; calculating at least one ray compensation coefficient by a ray compensation method; and performing ray compensation processing on the gesture image by using the ray compensation coefficient to obtain a gesture image Light compensated gesture image. The gesture image recognition system comprises an input unit, a calculation unit and an output unit. The input unit is configured to input the original gesture image, and the calculating unit calculates the light compensation coefficient by using the light compensation method, and performs light compensation processing on the gesture image by using the light compensation coefficient to obtain the light compensation gesture image, and the output is The unit is configured to output the light compensation gesture image.

另外,習用手勢影像辨識之相關應用技術已揭示於部分美國專利,例如:美國專利第7,702,130號之〝User interface apparatus using hand gesture recognition and method thereof〞、第7,680,295號之〝Hand-gesture based interface apparatus〞、第6,215,890號之〝Hand gesture recognizing device〞、第6,002,808號之〝Hand gesture control system〞及第5,594,469號之〝Hand gesture machine control system〞等。前述中華民國專利及美國專利僅為本發明技術背景之參考及說明目前技術發展狀態而已,其並非用以限制本發明之範圍。 In addition, related art techniques for conventional gesture image recognition have been disclosed in some U.S. patents, for example, U.S. Patent No. 7,702,130, User interface apparatus using hand gesture recognition and method After the Hand-gesture based interface apparatus, No. 6, 215, 890, Hand gesture recognizing device 〞, Hand gesture control system 第 No. 6, 002, 808 and Hand gesture machine control system 第 No. 5, 594, 469 The foregoing Japanese patents and U.S. patents are only for the purpose of the present invention, and are not intended to limit the scope of the present invention.

雖然前述專利已揭示相關手勢影像辨識技術,但其並未提供三維手勢影像辨識之相關技術。事實上,就手勢影像辨識技術而言,其必然需要簡化複雜的系統架構或省略複雜的前置處理程序,否則其影響手勢影像辨識的可靠度。因此,習用手勢影像辨識技術必然存在進一步提供三維手勢影像辨識的需求。 Although the aforementioned patents have disclosed related gesture image recognition techniques, they do not provide related techniques for three-dimensional gesture image recognition. In fact, in terms of gesture image recognition technology, it is necessary to simplify the complicated system architecture or omit complicated pre-processing procedures, otherwise it will affect the reliability of gesture image recognition. Therefore, the conventional gesture image recognition technology necessarily has the need to further provide three-dimensional gesture image recognition.

有鑑於此,本發明為了滿足上述需求,其提供一種三維手勢影像辨識方法及其系統,其利用一光場攝影單元攝取一手勢動作,以獲得一3D手勢影像,而將該3D手勢影像投影至一預定辨識空間,以獲得一特徵向量,且將該特徵向量與數個樣本特徵向量進行比對分類,即可完成手勢影像辨識,以提升習用手勢影像辨識技術之可靠度。 In view of the above, the present invention provides a three-dimensional gesture image recognition method and system thereof, which utilizes a light field photographing unit to take a gesture motion to obtain a 3D gesture image, and project the 3D gesture image to A predetermined recognition space is obtained to obtain a feature vector, and the feature vector is compared with a plurality of sample feature vectors to perform gesture image recognition, so as to improve the reliability of the conventional gesture image recognition technology.

本發明較佳實施例之主要目的係提供一種三維手勢影像辨識方法及其系統,其利用一光場攝影單元攝取一手勢動作,以獲得一3D手勢影像,而將該3D手勢影像投影至一預定辨識空間,以獲得一特徵向量,且將該特徵向量與數個樣本特徵向量進行比對分類,以達成提升手勢影像辨識可靠度之目的。 The main purpose of the preferred embodiment of the present invention is to provide a three-dimensional gesture image recognition method and system thereof, which utilizes a light field photographing unit to take a gesture motion to obtain a 3D gesture image, and project the 3D gesture image to a predetermined The space is identified to obtain a feature vector, and the feature vector is compared with a plurality of sample feature vectors to achieve the purpose of improving the reliability of gesture image recognition.

為了達成上述目的,本發明較佳實施例之三維手勢影像辨識方法包含:利用一光場攝影單元攝取一手勢動作,以獲得 一3D手勢影像;將該3D手勢影像投影至一預定辨識空間,以獲得一特徵向量;及將該特徵向量與數個樣本特徵向量以進行比對分類,以分類該3D手勢影像,以便辨識該3D手勢影像之種類。 In order to achieve the above object, a three-dimensional gesture image recognition method according to a preferred embodiment of the present invention includes: capturing a gesture by using a light field photographing unit to obtain a 3D gesture image; projecting the 3D gesture image to a predetermined recognition space to obtain a feature vector; and classifying the feature vector and the plurality of sample feature vectors to classify the 3D gesture image to identify the image The type of 3D gesture image.

本發明較佳實施例之該3D手勢影像包含一平面影像資訊及一深度資訊。 In the preferred embodiment of the present invention, the 3D gesture image includes a plane image information and a depth information.

本發明較佳實施例之該3D手勢影像為一3D手勢輪廓影像或一3D手勢實心影像。 In the preferred embodiment of the present invention, the 3D gesture image is a 3D gesture contour image or a 3D gesture solid image.

本發明較佳實施例利用投影色彩空間方式將一3D手勢實心RGB影像進行投影色彩空間轉換,以獲得一R通道影像資訊、一G通道影像資訊及一B通道影像資訊。 In a preferred embodiment of the present invention, a 3D gesture solid RGB image is converted into a projected color space by using a projected color space method to obtain an R channel image information, a G channel image information, and a B channel image information.

本發明較佳實施例在將該3D手勢影像投影至該預定辨識空間時,採用主成分分析進行投影。 In the preferred embodiment of the present invention, when the 3D gesture image is projected to the predetermined recognition space, the principal component analysis is used for projection.

本發明較佳實施例在將該特徵向量與數個樣本特徵向量以進行比對分類時,採用最近鄰居法進行比對分類。 In the preferred embodiment of the present invention, when the feature vector and the plurality of sample feature vectors are compared for classification, the nearest neighbor method is used for comparison and classification.

為了達成上述目的,本發明較佳實施例之三維手勢影像辨識系統包含:一光場攝影單元,其攝取一手勢動作,以獲得一3D手勢影像;一演算單元,其連接至該光場攝影單元,將該3D手勢影像投影至一預定辨識空間,以獲得一特徵向量,再將該特徵向量與數個樣本特徵向量以進行比對分類,以分類該3D手勢影像,以便辨識該3D手勢影像之種類;及一輸出單元,其連接至該演算單元,以便輸出該3D手勢影像之種類。 In order to achieve the above object, a three-dimensional gesture image recognition system according to a preferred embodiment of the present invention includes: a light field photographing unit that takes a gesture motion to obtain a 3D gesture image; and an arithmetic unit connected to the light field photographing unit Projecting the 3D gesture image to a predetermined recognition space to obtain a feature vector, and then classifying the feature vector and the plurality of sample feature vectors to classify the 3D gesture image to identify the 3D gesture image. And an output unit connected to the calculation unit to output the type of the 3D gesture image.

本發明較佳實施例之該3D手勢影像為一3D手勢輪廓影像或一3D手勢實心影像。 In the preferred embodiment of the present invention, the 3D gesture image is a 3D gesture contour image or a 3D gesture solid image.

本發明較佳實施例利用投影色彩空間方式將一3D手勢實心RGB影像進行投影色彩空間轉換,以獲得一R通道影像資訊、一G通道影像資訊及一B通道影像資訊。 In a preferred embodiment of the present invention, a 3D gesture solid RGB image is converted into a projected color space by using a projected color space method to obtain an R channel image information, a G channel image information, and a B channel image information.

本發明較佳實施例在將該3D手勢影像投影至該預定辨識空間時,採用主成分分析進行投影,而在將該特徵向量與數個樣本特徵向量以進行比對分類時,採用最近鄰居法進行比對分類。 In the preferred embodiment of the present invention, when the 3D gesture image is projected into the predetermined recognition space, the principal component analysis is used for projection, and when the feature vector and the plurality of sample feature vectors are compared for classification, the nearest neighbor method is adopted. Perform a comparison classification.

S1‧‧‧步驟 S1‧‧‧ steps

S2‧‧‧步驟 S2‧‧‧ steps

S3‧‧‧步驟 S3‧‧‧ steps

10‧‧‧光場攝影單元 10‧‧‧Light field photography unit

20‧‧‧演算單元 20‧‧‧ calculus unit

30‧‧‧輸出單元 30‧‧‧Output unit

第1圖:本發明較佳實施例之三維手勢影像辨識方法之流程示意圖。 FIG. 1 is a flow chart showing a method for recognizing a three-dimensional gesture image according to a preferred embodiment of the present invention.

第2圖:本發明較佳實施例之三維手勢影像辨識系統之方塊示意圖。 2 is a block diagram of a three-dimensional gesture image recognition system in accordance with a preferred embodiment of the present invention.

第3A及3B圖:本發明較佳實施例之三維手勢影像辨識方法及其系統攝取九個3D手勢輪廓影像及九個3D手勢實心影像之示意圖。 3A and 3B are diagrams showing a three-dimensional gesture image recognition method and a system thereof for capturing nine 3D gesture contour images and nine 3D gesture solid images.

第4A及4B圖:本發明較佳實施例之三維手勢影像辨識方法及其系統採用各種平面旋轉變異條件分別進行測試3D手勢輪廓影像及3D手勢實心影像後,其辨識率與特徵向量關係之曲線示意圖。 4A and 4B are diagrams showing a three-dimensional gesture image recognition method and a system thereof according to a preferred embodiment of the present invention, and using a variety of plane rotation variation conditions to respectively test a 3D gesture contour image and a 3D gesture solid image, the relationship between the recognition rate and the feature vector schematic diagram.

第5A及5B圖:本發明較佳實施例之三維手勢影像辨識方法及其系統採用各種深度旋轉變異條件分別進行測試3D手勢輪廓影像及3D手勢實心影像後,其辨識率與特徵向量關係之曲線示意圖。 5A and 5B are diagrams showing a three-dimensional gesture image recognition method and a system thereof according to a preferred embodiment of the present invention, which use various depth rotation variation conditions to respectively test the relationship between the recognition rate and the feature vector after testing the 3D gesture contour image and the 3D gesture solid image. schematic diagram.

第6A圖:本發明另一較佳實施例之三維手勢影像辨識方法及其系統攝取九個3D手勢實心RGB影像之示意 圖。 FIG. 6A is a schematic diagram of a three-dimensional gesture image recognition method according to another preferred embodiment of the present invention and a system for capturing the solid RGB images of nine 3D gestures. Figure.

第6B圖:本發明另一較佳實施例之三維手勢影像辨識方法及其系統對3D手勢實心RGB影像進行投影色彩空間轉換後,產生九個PCS投影影像之示意圖。 FIG. 6B is a schematic diagram of a three-dimensional gesture image recognition method and a system thereof for performing projection color space conversion on a solid RGB image of a 3D gesture, and generating nine PCS projection images.

第7(a)至7(d)圖:本發明另一較佳實施例之三維手勢影像辨識方法及其系統由原始3D手勢實心RGB影像處理後,產生一R通道影像、一G通道影像及一B通道影像,再產生一PCS投影影像之示意圖。 7(a) to 7(d): a three-dimensional gesture image recognition method and system thereof according to another preferred embodiment of the present invention, which is processed by an original 3D gesture solid RGB image to generate an R channel image, a G channel image, and A B channel image, and then a schematic diagram of a PCS projection image.

第8圖:本發明另一較佳實施例之三維手勢影像辨識方法及其系統進行投影色彩空間轉換後,將灰階手勢影像及PCS投影影像之辨識率與特徵向量關係之曲線比較示意圖。 FIG. 8 is a schematic diagram showing a curve of a relationship between a recognition rate of a grayscale gesture image and a PCS projection image and a feature vector after performing a projection color space conversion according to another preferred embodiment of the present invention.

第9(a)至9(f)圖:本發明較佳實施例之三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試後,將原始3D手勢輪廓影像與一系列測試3D手勢輪廓影像之比較示意圖。 9(a) to 9(f): a three-dimensional gesture image recognition method and system thereof according to a preferred embodiment of the present invention, after testing with various Gaussian noise variations conditions, the original 3D gesture contour image and a series of test 3D A schematic diagram of the comparison of gesture contour images.

第10圖:本發明較佳實施例之三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試後,一系列3D手勢輪廓影像之辨識率與特徵向量關係之示意圖。 FIG. 10 is a schematic diagram showing the relationship between the recognition rate and the feature vector of a series of 3D gesture contour images after the test is performed by adding various Gaussian noise variations conditions according to the preferred embodiment of the present invention.

第11(a)至11(f)圖:本發明較佳實施例之三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試後,將原始3D手勢實心影像與一系列測試3D手勢實心影像之比較示意圖。 11(a) to 11(f): a three-dimensional gesture image recognition method and system thereof according to a preferred embodiment of the present invention, after adding various Gaussian noise variation conditions for testing, the original 3D gesture solid image and a series of test 3D A schematic diagram of the comparison of gestures and solid images.

第12(a)至12(f)圖:本發明較佳實施例之三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試,並進行投影色彩空間轉換後,將原始PCS投影影像與一系列測試PCS投影影像之比較示意圖。 12(a) to 12(f): a three-dimensional gesture image recognition method and system thereof according to a preferred embodiment of the present invention are tested by adding various Gaussian noise variations conditions, and performing projection color space conversion to project the original PCS A comparison of images and a series of test PCS projection images.

第13圖:本發明較佳實施例之三維手勢影像辨識方法 及其系統採用添加各種高斯雜訊變異條件進行測試後,一系列3D手勢輪廓影像之辨識率與特徵向量關係之示意圖。 Figure 13: Three-dimensional gesture image recognition method in accordance with a preferred embodiment of the present invention The system and its system adopt a variety of Gaussian noise variation conditions to test, the relationship between the recognition rate of a series of 3D gesture contour images and the feature vector.

為了充分瞭解本發明,於下文將舉例較佳實施例並配合所附圖式作詳細說明,且其並非用以限定本發明。 In order to fully understand the present invention, the preferred embodiments of the present invention are described in detail below, and are not intended to limit the invention.

本發明較佳實施例之三維手勢影像辨識方法及其系統適用於各種手勢辨識裝置及其相關應用設備,例如:各類型電腦系統、家電產品控制系統〔如物聯網〕、自動化控制系統、醫療照護系統或保全系統,但其並非用以限定本發明之範圍。 The three-dimensional gesture image recognition method and system thereof according to the preferred embodiment of the present invention are applicable to various gesture recognition devices and related application devices, such as various types of computer systems, home appliance control systems (such as the Internet of Things), automated control systems, and medical care. The system or the security system is not intended to limit the scope of the invention.

第1圖揭示本發明較佳實施例之三維手勢影像辨識方法之流程示意圖。第2圖揭示本發明較佳實施例之三維手勢影像辨識系統之方塊示意圖,其對應於第1圖。請參照第1及2圖所示,舉例而言,本發明較佳實施例之三維手勢影像辨識系統包含一光場攝影〔light field capturing〕單元10、一演算單元20及一輸出單元30,而該演算單元20適當連接至該光場攝影單元10,且該輸出單元30適當連接至該演算單元20。 FIG. 1 is a flow chart showing a method for recognizing a three-dimensional gesture image according to a preferred embodiment of the present invention. FIG. 2 is a block diagram showing a three-dimensional gesture image recognition system according to a preferred embodiment of the present invention, which corresponds to FIG. Referring to FIGS. 1 and 2, for example, a three-dimensional gesture image recognition system according to a preferred embodiment of the present invention includes a light field capturing unit 10, a calculation unit 20, and an output unit 30. The calculation unit 20 is suitably connected to the light field photographing unit 10, and the output unit 30 is suitably connected to the calculation unit 20.

請再參照第1及2圖所示,本發明較佳實施例之三維手勢影像辨識方法包含步驟S1:首先,利用該光場攝影單元10或具類似功能的光場影像輸入單元攝取一手勢動作,以獲得一3D手勢影像。舉例而言,該3D手勢影像包含一平面影像資訊及一深度資訊。該3D手勢影像為一3D手勢輪廓〔contour〕影像或一3D手勢實心〔solid〕影像。 Referring to FIG. 1 and FIG. 2 again, the method for recognizing a three-dimensional gesture image according to a preferred embodiment of the present invention includes the step S1: first, the light field imaging unit 10 or a light field image input unit having a similar function is used to take a gesture. To get a 3D gesture image. For example, the 3D gesture image includes a plane image information and a depth information. The 3D gesture image is a 3D gesture contour image or a 3D gesture solid image.

第3A圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統攝取九個3D手勢輪廓影像之示意圖,其包含零至八的3D手勢影像。相對的,第3B圖揭示 本發明較佳實施例之三維手勢影像辨識方法及其系統攝取九個3D手勢實心影像之示意圖。 FIG. 3A is a schematic diagram showing a three-dimensional gesture image recognition method and a system for capturing nine 3D gesture contour images according to a preferred embodiment of the present invention, which includes zero to eight 3D gesture images. In contrast, Figure 3B reveals A three-dimensional gesture image recognition method and a system thereof for capturing nine solid images of 3D gestures in a preferred embodiment of the present invention.

請再參照第1及2圖所示,本發明較佳實施例之三維手勢影像辨識方法包含步驟S2:接著,將該3D手勢影像投影至一預定辨識空間,以獲得一特徵向量。舉例而言,本發明較佳實施例採用主成分分析〔PCA,principal component analysis〕方法將該3D手勢影像之特徵投影至比較容易辨別的空間。 Referring to FIG. 1 and FIG. 2 again, the three-dimensional gesture image recognition method according to the preferred embodiment of the present invention includes the step S2: then, the 3D gesture image is projected to a predetermined recognition space to obtain a feature vector. For example, the preferred embodiment of the present invention uses a principal component analysis (PCA) method to project features of the 3D gesture image into a relatively easily discernible space.

該主成分分析可運用於高維資料的降維,且保留資料的變異程度,由於一維PCA需要將輸入訓練影像的維度變直。就(m×n)大小的影像為例,在進行共變異矩陣進算時,會產生(m×n)×(m×n)大小的矩陣,對計算特徵向量的步驟上需要極大的計算時間。因此i將原本的共變異矩陣改為下式: The principal component analysis can be applied to the dimensionality reduction of high-dimensional data, and the degree of variation of the data is preserved, because the one-dimensional PCA needs to straighten the dimensions of the input training image. Taking the image of ( m × n ) size as an example, when the covariation matrix is calculated, a matrix of ( m × n ) × ( m × n ) is generated, which requires a great calculation time for the step of calculating the feature vector. . Therefore i changed the original covariation matrix to the following formula:

其中C i 為改變過的共變異矩陣,L為訓練樣本數量,X tr 為訓練影像,X為全部訓練影像的平均值。如此,將共變異矩陣維度大小降為L×L,且大幅減少計算投影基底所需的時間。本發明較佳實施例採用奇異值分解〔SVD〕對共變異矩陣進行運算如下式: Where C i is the changed covariation matrix, L is the number of training samples, X tr is the training image, and X is the average of all training images. In this way, the covariation matrix dimension is reduced to L × L and the time required to calculate the projection substrate is greatly reduced. The preferred embodiment of the present invention uses the singular value decomposition [SVD] to operate the covariation matrix as follows:

上式所求得的特徵值矩陣Σ i 與原本的共變異矩陣C經奇異值分解所得特徵值矩陣Σ相同,而特徵向量矩陣 則須以(X tr -)*U i 方形成特徵向量矩陣U。接著,利用特徵向量矩陣將原本的訓練資料投影至PCA空間,以獲得經過PCA處理過後的訓練樣本特徵向量F tr The formula obtained eigenvalue matrix Σ i resulting eigenvalue matrix [Sigma same original covariance matrix C through singular value decomposition, and the eigenvector matrix shall be to (X tr - ) * U i square forms the eigenvector matrix U . Then, the original training data is projected into the PCA space by using the feature vector matrix to obtain the PCA processed training sample feature vector F tr .

請再參照第1及2圖所示,本發明較佳實施例之三維手勢影像辨識方法包含步驟S3:接著,將該特徵向量與數個樣本特徵向量以進行比對分類,以分類該3D手勢影像,以便辨識該3D手勢影像之種類。舉例而言,在將該特徵向量與數個樣本特徵向量以進行比對分類時,採用最近鄰居法〔KNNk-nearest neighbors〕進行比對分類。利用在特徵空間中最接近測試影像的k個訓練樣本進行分類,將測試影像投影至PCA空間,並與訓練樣本比較,以便進行相似度的計算如下式: Referring to FIG. 1 and FIG. 2 again, the three-dimensional gesture image recognition method according to the preferred embodiment of the present invention includes the step S3: then, the feature vector and the plurality of sample feature vectors are compared and classified to classify the 3D gesture. Image to identify the type of 3D gesture image. For example, when the feature vector is compared with a plurality of sample feature vectors for comparison, the nearest neighbor method ( KNN , k- nearest neighbors) is used for comparison classification. The k training samples closest to the test image in the feature space are classified, the test image is projected into the PCA space, and compared with the training samples, so that the similarity is calculated as follows:

其中S k 為計算出的相似度矩陣,k為所設定最鄰近的訓練樣本數量,N為特徵根數的最大值,F te F tr 為測試及訓練樣本的特徵向量,依照k值選擇相似度最近的k個訓練樣本,判斷測試資料與選定訓練樣本中的哪一類最相近,以進行分類的預測。 Where S k is the calculated similarity matrix, k is the number of the nearest training samples, N is the maximum value of the feature roots, F te and F tr are the feature vectors of the test and training samples, and the similarity is selected according to the k value. The nearest k training samples determine which of the selected training samples is closest to the classification of the selected training samples for classification prediction.

第4A圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統採用各種平面旋轉〔in-plane rotation〕變異條件分別進行測試3D手勢輪廓影像後,其辨識率與特徵向量關係之曲線示意圖。請參照第4A圖所示,本發明較佳實施例採用旋轉角度分別為±3、±5、±8及±10度。雖然該3D手勢輪廓影像在k=1及k=3的最高辨識率皆為100%,但其依特徵向量〔特徵根數,eigen vector〕上升的辨識率趨勢卻以k=1較快收斂,比較兩者的 辨識率下降程度也約略相等。在此種情形下,判定k=1為最適合平面旋轉輪廓手勢的k值,且在取到第17根時可達到最高100%的辨識率。 FIG. 4A is a schematic diagram showing the relationship between the recognition rate and the feature vector after the 3D gesture contour image is tested by using various in-plane rotation variations conditions according to a preferred embodiment of the present invention. . Referring to FIG. 4A, the preferred embodiment of the present invention uses rotation angles of ±3, ±5, ±8, and ±10 degrees, respectively. Although the highest recognition rate of the 3D gesture contour image is 100% for k =1 and k =3, the trend of the recognition rate according to the eigenvector [eigen vector] rises faster with k =1. The degree of decline in the recognition rate of the two is also approximately equal. In this case, it is determined that k =1 is the k value that is most suitable for the plane rotation contour gesture, and the recognition rate of up to 100% can be achieved when the 17th root is taken.

相對的,第4B圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統採用各種平面旋轉變異條件分別進行測試3D手勢實心影像後,其辨識率與特徵向量關係之曲線示意圖。請參照第4B圖所示,本發明較佳實施例採用旋轉角度分別為±3、±5、±8及±10度。該3D手勢實心影像在k=2時的曲線震盪幅度較大,且在辨識率達到最高後也呈現衰減的趨勢。因此,於此僅對k=1與k=3進行比較,且在k=3時,較快收斂且能保持其穩定度,於是判定k=3為最適合平面旋轉輪廓手勢的k值,且在取到第7根時可達到最高100%的辨識率。 In contrast, FIG. 4B discloses a three-dimensional gesture image recognition method according to a preferred embodiment of the present invention, and a system diagram of the relationship between the recognition rate and the feature vector after testing the solid image of the 3D gesture using various plane rotation variation conditions. Referring to FIG. 4B, the preferred embodiment of the present invention uses rotation angles of ±3, ±5, ±8, and ±10 degrees, respectively. The solid image of the 3D gesture has a large amplitude of oscillation when k =2, and also shows a tendency to decay after the recognition rate reaches the highest. Therefore, only k =1 and k = 3 are compared here, and when k = 3, the convergence is faster and the stability is maintained, so that k = 3 is determined to be the k value most suitable for the plane rotation contour gesture, and Up to 100% recognition rate can be achieved when the 7th root is taken.

第5A圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統採用各種深度旋轉〔out-of-plane rotation〕變異條件分別進行測試3D手勢輪廓影像後,其辨識率與特徵向量關係之曲線示意圖。請參照第5A圖所示,本發明較佳實施例採用旋轉角度分別為±15及±30度。在加入深度旋轉變異條件後,該3D手勢輪廓影像的辨識率下降為在k=1時達到最高為94.07%,且在特徵取至第3根時即可達到最高辨識率,而後則呈現逐步緩慢下降的趨勢。 FIG. 5A discloses a three-dimensional gesture image recognition method and a system thereof according to a preferred embodiment of the present invention. After using various out-of-plane rotation conditions to test a 3D gesture contour image, the relationship between the recognition rate and the feature vector is Schematic diagram of the curve. Referring to FIG. 5A, the preferred embodiment of the present invention uses rotation angles of ±15 and ±30 degrees, respectively. After adding the deep rotation variation condition, the recognition rate of the 3D gesture contour image is reduced to reach a maximum of 94.07% when k =1, and the highest recognition rate can be achieved when the feature is taken to the third root, and then the display is gradually slow. Downward trend.

相對的,第5B圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統採用各種深度旋轉變異條件分別進行測試3D手勢實心影像後,其辨識率與特徵向量關係之曲線示意圖。請參照第5B圖所示,本發明較佳實施例採用旋轉角度分別為±15及±30度。該3D手勢實心影像在深度旋轉變異上辨識率有明顯的下降,且其最高辨識率在k=1時僅剩下78.89%。由於將該3D手勢實心影像僅以 深度的灰階圖進行特徵萃取,因此其無法完整保留所有的深度資訊。當深度旋轉時,類別內的間距因而加大,並造成PCA投影後其特徵分布與其他類別影像混雜。 In contrast, FIG. 5B discloses a three-dimensional gesture image recognition method according to a preferred embodiment of the present invention, and a system diagram of the relationship between the recognition rate and the feature vector after testing the solid image of the 3D gesture using various depth rotation variation conditions. Referring to FIG. 5B, the preferred embodiment of the present invention uses rotation angles of ±15 and ±30 degrees, respectively. The 3D gesture solid image has a significant decrease in the recognition rate of the deep rotation variation, and its maximum recognition rate is only 78.89% when k =1. Since the solid image of the 3D gesture is extracted only by the grayscale image of the depth, it cannot completely retain all the depth information. When rotated in depth, the spacing within the category is thus increased, and the characteristic distribution of the PCA projection is mixed with other types of images.

為了解決上述問題,本發明另一較佳實施例採用在擷取該3D手勢實心影像的原始RGB影像,以投影色彩空間〔PCS〕方法將RGB三通道的影像資訊投影至同一空間,藉此保有更多影像資訊。 In order to solve the above problem, another preferred embodiment of the present invention uses the original RGB image of the solid image of the 3D gesture to project the image information of the RGB three channels into the same space by the projection color space [PCS] method, thereby retaining More image information.

就該3D手勢輪廓影像而言,深度的變化對辨識率的影響並不像實心手勢那麼大,因為在僅依輪廓資訊進行辨識時,深度的改變不會對輪廓線條造成太大的變化。反之,就該3D手勢實心影像而言,深度旋轉的變異造成了同一類別內不同深度間特徵分布的間距加大,導致辨識率的下降。 In the case of the 3D gesture contour image, the influence of the depth change on the recognition rate is not as large as the solid gesture, because the depth change does not cause too much change to the contour line when the recognition is performed only by the contour information. On the contrary, in the case of the solid image of the 3D gesture, the variation of the depth rotation causes the spacing of the feature distributions between different depths in the same category to increase, resulting in a decrease in the recognition rate.

第6A圖揭示本發明另一較佳實施例之三維手勢影像辨識方法及其系統攝取九個3D手勢實心RGB影像之示意圖。第6B圖揭示本發明另一較佳實施例之三維手勢影像辨識方法及其系統對3D手勢實心RGB影像進行投影色彩空間轉換後,產生九個PCS投影影像之示意圖,其對應於第6A圖。 FIG. 6A is a schematic diagram showing a three-dimensional gesture image recognition method and a system for capturing nine solid 3D gesture RGB images according to another preferred embodiment of the present invention. FIG. 6B is a schematic diagram showing a method for recognizing a three-dimensional gesture image and a system thereof for performing projection color space conversion on a solid RGB image of a 3D gesture, and generating nine PCS projection images, which corresponds to FIG. 6A.

第7(a)至7(d)圖揭示本發明另一較佳實施例之三維手勢影像辨識方法及其系統由原始3D手勢實心RGB影像處理後,產生一R通道影像、一G通道影像及一B通道影像,再產生一PCS投影影像之示意圖。請參照第7(a)至7(d)圖所示,本發明另一較佳實施例之三維手勢影像辨識方法將選擇擷取至少一3D手勢實心RGB影像,如第7(a)圖所示。接著,將該3D手勢實心RGB影像以投影色彩空間方法取得RGB三通道的影像資訊,如第7(b)、7(c)、7(d)所示。接著,將該RGB三通道的影像資訊投影至同一空間,以獲得一PCS手勢實心投影影像,如第7(d)圖所示。 7(a) to 7(d) show a three-dimensional gesture image recognition method and system thereof according to another preferred embodiment of the present invention, which is processed by the original 3D gesture solid RGB image to generate an R channel image, a G channel image, and A B channel image, and then a schematic diagram of a PCS projection image. Referring to FIG. 7(a) to FIG. 7(d), the three-dimensional gesture image recognition method according to another preferred embodiment of the present invention selects to capture at least one 3D gesture solid RGB image, as shown in FIG. 7(a). Show. Then, the 3D gesture solid RGB image is obtained by the projection color space method to obtain RGB three-channel image information, as shown in FIGS. 7(b), 7(c), and 7(d). Then, the RGB three-channel image information is projected into the same space to obtain a PCS gesture solid projection image, as shown in FIG. 7(d).

第8圖揭示本發明另一較佳實施例之三維手勢影像辨識方法及其系統進行投影色彩空間轉換後,將灰階手勢影像及PCS投影影像之辨識率與特徵向量關係之曲線比較示意圖。請參照第8圖所示,該PCS手勢實心投影影像在第14根的辨識率達到96.67%,其後下降趨勢呈現穩定,且整體而言辨識率都高於灰階實心手勢影像。在將該灰階實心手勢影像與PCS手勢實心投影影像進行比較時,其辨識率顯著提升。由於該PCS手勢實心投影影像保留了RGB三個通道的深度資訊,因此不致隨深度變化而遺失太多資訊。對於深度旋轉的變異而言,可產生較高容忍度。 FIG. 8 is a schematic diagram showing a curve of a relationship between a recognition rate of a grayscale gesture image and a PCS projection image and a feature vector after a projection color space conversion method according to another preferred embodiment of the present invention. Referring to FIG. 8 , the recognition rate of the solid projected image of the PCS gesture at the 14th root reaches 96.67%, and then the downward trend is stable, and the overall recognition rate is higher than the gray-scale solid gesture image. When the gray-scale solid gesture image is compared with the solid projection image of the PCS gesture, the recognition rate is significantly improved. Since the solid projected image of the PCS gesture retains the depth information of the three channels of RGB, no information is lost due to the depth change. For deep rotation variations, higher tolerances can be produced.

第9(a)至9(f)圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統採用添加各種高斯雜訊〔Gaussian noise〕變異條件進行測試後,將原始3D手勢輪廓影像與一系列測試3D手勢輪廓影像之比較示意圖。請參照第9(a)至9(f)圖所示,將該原始3D手勢輪廓影像〔如第9(a)圖所示〕添加的高斯雜訊平均值均為0,以1%、5%、10%、20%、30%的變異數進行辨識,如第9(b)至9(f)圖所示。 9(a) to 9(f) show a three-dimensional gesture image recognition method and system thereof according to a preferred embodiment of the present invention, and after adding various Gaussian noise variation conditions, the original 3D gesture contour image is A series of comparison diagrams for testing 3D gesture contour images. Referring to Figures 9(a) through 9(f), the average of the Gaussian noise added to the original 3D gesture contour image (as shown in Figure 9(a)) is 0, to 1%, 5 The %, 10%, 20%, and 30% variances were identified as shown in Figures 9(b) through 9(f).

第10圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試後,一系列3D手勢輪廓影像之辨識率與特徵向量關係之示意圖,其對應於第9(b)至9(f)圖。請參照第10圖所示,在將該原始3D手勢輪廓影像〔如第9(a)圖所示〕分別加入1%、5%、10%、20%、30%的高斯雜訊〔如第9(b)至9(f)圖所示〕時,添加5%雜訊為止仍保有90%以上的最高辨識率,且趨勢也較為平穩;而當添加雜訊高於10%時,雖然最高辨識率的下降程度不是非常明顯,且在添加30%高斯雜訊時仍有82.22%的最高辨識率,但在所選取的特徵根數增加時,其辨識率的下降速度過快,可顯示其辨識率 較不穩定。 FIG. 10 is a schematic diagram showing the relationship between the recognition rate and the feature vector of a series of 3D gesture contour images after the three-dimensional gesture image recognition method is performed by adding various Gaussian noise variations conditions according to the preferred embodiment of the present invention. Figures 9(b) to 9(f). Please refer to Figure 10 to add the original 3D gesture contour image (as shown in Figure 9(a)) to 1%, 5%, 10%, 20%, 30% Gaussian noise. From 9(b) to 9(f), the highest recognition rate of more than 90% is maintained until 5% of the noise is added, and the trend is relatively stable; when the added noise is higher than 10%, although the highest The degree of decline in the recognition rate is not very obvious, and there is still a maximum recognition rate of 82.22% when adding 30% Gaussian noise. However, when the number of selected features increases, the recognition rate decreases too fast, which can be displayed. Identification rate Less stable.

第11(a)至11(f)圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試後,將原始3D手勢實心影像與一系列測試3D手勢實心影像之比較示意圖。請參照第11(a)至11(f)圖所示,將該原始3D手勢實心影像〔如第11(a)圖所示〕添加的高斯雜訊平均值均為0,以1%、5%、10%、20%、30%的變異數進行辨識,如第11(b)至11(f)圖所示。 11(a) to 11(f) illustrate a three-dimensional gesture image recognition method and system thereof according to a preferred embodiment of the present invention, and after adding various Gaussian noise variations conditions, the original 3D gesture solid image and a series of test 3D are tested. A schematic diagram of the comparison of gestures and solid images. Referring to Figures 11(a) through 11(f), the average of the Gaussian noise added to the original 3D gesture solid image (as shown in Figure 11(a)) is 0, to 1%, 5 The %, 10%, 20%, and 30% variances were identified as shown in Figures 11(b) through 11(f).

第12(a)至12(f)圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試,並進行投影色彩空間轉換後,將原始PCS投影影像與一系列測試PCS投影影像之比較示意圖,其對應於第11(a)至11(f)圖。 12(a) to 12(f) illustrate a three-dimensional gesture image recognition method and system thereof according to a preferred embodiment of the present invention, which adopts adding various Gaussian noise variation conditions for testing, and performs projection color space conversion to project the original PCS. A comparison of images and a series of test PCS projection images, corresponding to Figures 11(a) through 11(f).

第13圖揭示本發明較佳實施例之三維手勢影像辨識方法及其系統採用添加各種高斯雜訊變異條件進行測試後,一系列3D手勢輪廓影像之辨識率與特徵向量關係之示意圖。請參照第10及13圖所示,將該原始3D手勢實心影像〔如第11(a)圖所示〕添加1%至30%的高斯雜訊,再以投影色彩空間進行處理,雖然最高辨識率同樣呈現下降的趨勢,但縱使添加30%的高斯雜訊,其辨識率仍保持震盪在85%以上。由於該3D手勢輪廓影像〔如第9(b)至9(f)圖所示〕僅保留原始手勢的輪廓資訊,因此在對抗雜訊方面並不如該3D手勢實心影像〔如第11(b)至11(f)圖所示〕保有完整手勢的深度資訊優勢。 FIG. 13 is a schematic diagram showing the relationship between the recognition rate and the feature vector of a series of 3D gesture contour images after the test is performed by adding various Gaussian noise variations conditions according to the preferred embodiment of the present invention. Please refer to the 10th and 13th pictures to add 1% to 30% Gaussian noise to the original 3D gesture solid image (as shown in Figure 11(a)), and then process it in the projected color space, although the highest recognition The rate also showed a downward trend, but even with the addition of 30% Gaussian noise, the recognition rate remained volatile at more than 85%. Since the 3D gesture contour image (as shown in Figures 9(b) to 9(f)) retains only the outline information of the original gesture, it is not as solid as the 3D gesture in combating noise (eg, 11(b) As shown in Figure 11(f), the depth information advantage of full gestures is maintained.

前述較佳實施例僅舉例說明本發明及其技術特徵,該實施例之技術仍可適當進行各種實質等效修飾及/或替換方式予以實施;因此,本發明之權利範圍須視後附申請專利範圍所界定之範圍為準。本案著作權限制使用於中華民國專利申請用途。 The foregoing preferred embodiments are merely illustrative of the invention and the technical features thereof, and the techniques of the embodiments can be carried out with various substantial equivalent modifications and/or alternatives; therefore, the scope of the invention is subject to the appended claims. The scope defined by the scope shall prevail. The copyright limitation of this case is used for the purpose of patent application in the Republic of China.

Claims (10)

一種三維手勢影像辨識方法,其包含:利用一光場攝影單元攝取一手勢動作,以獲得一3D手勢影像,且該3D手勢影像為一3D光場手勢影像,其包含3D光場資訊,且該3D光場手勢影像為一3D光場手勢輪廓RGB影像或一3D光場手勢實心RGB影像;將該3D光場手勢影像投影至一預定辨識空間,以獲得一3D光場特徵向量;及將該3D光場特徵向量與數個樣本特徵向量以進行比對分類,以分類該3D光場手勢影像,以便辨識該3D光場手勢影像之種類,其中利用投影色彩空間方式將該3D光場手勢輪廓RGB影像或3D光場手勢實心RGB影像進行投影色彩空間轉換,以獲得一R通道影像資訊、一G通道影像資訊及一B通道影像資訊,再將該3D光場手勢影像投影至該預定辨識空間時,採用主成分分析進行投影。 A three-dimensional gesture image recognition method includes: capturing a gesture by using a light field photographing unit to obtain a 3D gesture image, wherein the 3D gesture image is a 3D light field gesture image, and the 3D light field information includes the 3D light field information, and the The 3D light field gesture image is a 3D light field gesture contour RGB image or a 3D light field gesture solid RGB image; the 3D light field gesture image is projected to a predetermined recognition space to obtain a 3D light field feature vector; The 3D light field feature vector and the plurality of sample feature vectors are compared for classification to classify the 3D light field gesture image to identify the type of the 3D light field gesture image, wherein the 3D light field gesture profile is utilized by a projected color space manner RGB image or 3D light field gesture solid RGB image for projection color space conversion to obtain an R channel image information, a G channel image information and a B channel image information, and then project the 3D light field gesture image to the predetermined recognition space Principal component analysis is used for projection. 依申請專利範圍第1項所述之三維手勢影像辨識方法,其中該3D光場手勢影像包含一平面影像資訊及一深度資訊。 The three-dimensional gesture image recognition method according to claim 1, wherein the 3D light field gesture image comprises a plane image information and a depth information. 依申請專利範圍第1項所述之三維手勢影像辨識方法,其中該3D光場特徵向量為一特徵根數。 The three-dimensional gesture image recognition method according to claim 1, wherein the 3D light field feature vector is a feature number. 依申請專利範圍第1項所述之三維手勢影像辨識方法,其中將一RGB三通道的影像資訊投影至同一空間,以獲得一PCS手勢投影影像。 According to the three-dimensional gesture image recognition method described in claim 1, wherein an RGB three-channel image information is projected into the same space to obtain a PCS gesture projection image. 依申請專利範圍第1項所述之三維手勢影像辨識方法,其中在將該3D光場手勢影像投影至該預定辨識空間時,採用主成分分析進行投影,而在將該3D光場特徵向量與數個樣本特徵向量以進行比對分類時,採用最近鄰居法進行比對分類。 The three-dimensional gesture image recognition method according to claim 1, wherein when the 3D light field gesture image is projected to the predetermined recognition space, the principal component analysis is used for projection, and the 3D light field feature vector is When several sample feature vectors are used for comparison classification, the nearest neighbor method is used for comparison classification. 依申請專利範圍第1項所述之三維手勢影像辨識方法,其中在將該3D光場特徵向量與數個樣本特徵向量以 進行比對分類時,採用最近鄰居法進行比對分類。 The three-dimensional gesture image recognition method according to claim 1, wherein the 3D light field feature vector and the plurality of sample feature vectors are When performing the alignment classification, the nearest neighbor method is used for the comparison classification. 一種三維手勢影像辨識系統,其包含:一光場攝影單元,其攝取一手勢動作,以獲得一3D手勢影像,且該3D手勢影像為一3D光場手勢影像,其包含3D光場資訊,且該3D光場手勢影像為一3D光場手勢輪廓RGB影像或一3D光場手勢實心RGB影像;一演算單元,其連接至該光場攝影單元,將該3D光場手勢影像投影至一預定辨識空間,以獲得一3D光場特徵向量,再將該3D光場特徵向量與數個樣本特徵向量以進行比對分類,以分類該3D光場手勢影像,以便辨識該3D光場手勢影像之種類;及一輸出單元,其連接至該演算單元,以便輸出該3D光場手勢影像之種類,其中利用投影色彩空間方式將該3D光場手勢輪廓RGB影像或3D光場手勢實心RGB影像進行投影色彩空間轉換,以獲得一R通道影像資訊、一G通道影像資訊及一B通道影像資訊,再將該3D光場手勢影像投影至該預定辨識空間時,採用主成分分析進行投影。 A three-dimensional gesture image recognition system includes: a light field photographing unit that takes a gesture to obtain a 3D gesture image, and the 3D gesture image is a 3D light field gesture image, which includes 3D light field information, and The 3D light field gesture image is a 3D light field gesture contour RGB image or a 3D light field gesture solid RGB image; a calculation unit connected to the light field photography unit, and projecting the 3D light field gesture image to a predetermined identification Space, to obtain a 3D light field feature vector, and then classify the 3D light field feature vector and the plurality of sample feature vectors to classify the 3D light field gesture image to identify the type of the 3D light field gesture image And an output unit connected to the calculation unit for outputting the type of the 3D light field gesture image, wherein the 3D light field gesture contour RGB image or the 3D light field gesture solid RGB image is projected by the projection color space method Space conversion to obtain an R channel image information, a G channel image information, and a B channel image information, and then project the 3D light field gesture image into the predetermined recognition space. Principal component analysis is projected. 依申請專利範圍第7項所述之三維手勢影像辨識系統,其中該3D光場特徵向量為一特徵根數。 The three-dimensional gesture image recognition system according to claim 7, wherein the 3D light field feature vector is a feature number. 依申請專利範圍第7項所述之三維手勢影像辨識系統,其中將一RGB三通道的影像資訊投影至同一空間,以獲得一PCS手勢投影影像。 According to the three-dimensional gesture image recognition system described in claim 7, wherein an RGB three-channel image information is projected into the same space to obtain a PCS gesture projection image. 依申請專利範圍第7項所述之三維手勢影像辨識系統,其中在將該3D光場手勢影像投影至該預定辨識空間時,採用主成分分析進行投影,而在將該3D光場特徵向量與數個樣本特徵向量以進行比對分類時,採用最近鄰居法進行比對分類。 The three-dimensional gesture image recognition system according to claim 7 , wherein when the 3D light field gesture image is projected to the predetermined recognition space, the principal component analysis is used for projection, and the 3D light field feature vector is When several sample feature vectors are used for comparison classification, the nearest neighbor method is used for comparison classification.
TW104108504A 2015-03-17 2015-03-17 3d hand gesture image recognition method and system thereof TWI623889B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW104108504A TWI623889B (en) 2015-03-17 2015-03-17 3d hand gesture image recognition method and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW104108504A TWI623889B (en) 2015-03-17 2015-03-17 3d hand gesture image recognition method and system thereof

Publications (2)

Publication Number Publication Date
TW201635196A TW201635196A (en) 2016-10-01
TWI623889B true TWI623889B (en) 2018-05-11

Family

ID=57847359

Family Applications (1)

Application Number Title Priority Date Filing Date
TW104108504A TWI623889B (en) 2015-03-17 2015-03-17 3d hand gesture image recognition method and system thereof

Country Status (1)

Country Link
TW (1) TWI623889B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110158508A1 (en) * 2005-10-11 2011-06-30 Primesense Ltd. Depth-varying light fields for three dimensional sensing
TWM455912U (en) * 2013-01-02 2013-06-21 San-Wei Fan Gesture recognition control system
TWI467516B (en) * 2011-04-26 2015-01-01 Univ Nat Cheng Kung Method for color feature extraction

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110158508A1 (en) * 2005-10-11 2011-06-30 Primesense Ltd. Depth-varying light fields for three dimensional sensing
TWI467516B (en) * 2011-04-26 2015-01-01 Univ Nat Cheng Kung Method for color feature extraction
TWM455912U (en) * 2013-01-02 2013-06-21 San-Wei Fan Gesture recognition control system

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
https://www.raytrix.de/technologie/,其中VIDEOS第3、5 video;第3 video:Raytrix 3d Video - R5 Camera - light field technology -- gesture recognition,21. June 2013;第5 video:Raytrix 3D light field technology -- Hand 25FPS R5 video,8. February 2013 *
https://www.raytrix.de/technologie/,其中VIDEOS第3、5 video;第3 video:Raytrix 3d Video - R5 Camera - light field technology -- gesture recognition,21. June 2013;第5 video:Raytrix 3D light field technology -- Hand 25FPS R5 video,8. February 2013。
R5 High Speed Video 3D Light Field Cameras,Raytrix GmbH,2012/12/24,https://web.archive.org/web/20121224154224/http://www.raytrix.de:80/tl_files/downloads/R5.pdf *
R5 High Speed Video 3D Light Field Cameras,Raytrix GmbH,2012/12/24,https://web.archive.org/web/20121224154224/http://www.raytrix.de:80/tl_files/downloads/R5.pdf。

Also Published As

Publication number Publication date
TW201635196A (en) 2016-10-01

Similar Documents

Publication Publication Date Title
CN102981604B (en) Image processing equipment and image processing method
US8442269B2 (en) Method and apparatus for tracking target object
CN102375970B (en) A kind of identity identifying method based on face and authenticate device
US20160300100A1 (en) Image capturing apparatus and method
JP6007682B2 (en) Image processing apparatus, image processing method, and program
US20180088671A1 (en) 3D Hand Gesture Image Recognition Method and System Thereof
JP6071002B2 (en) Reliability acquisition device, reliability acquisition method, and reliability acquisition program
CN106709895A (en) Image generating method and apparatus
CN103530599A (en) Method and system for distinguishing real face and picture face
CN105740775A (en) Three-dimensional face living body recognition method and device
WO2019015344A1 (en) Image saliency object detection method based on center-dark channel priori information
JP5936561B2 (en) Object classification based on appearance and context in images
GB2547760A (en) Method of image processing
CN110751097A (en) A Semi-Supervised 3D Point Cloud Gesture Keypoint Detection Method
JP5656768B2 (en) Image feature extraction device and program thereof
CN108229281B (en) Neural network generation method, face detection device and electronic equipment
Ali et al. An effective face detection and recognition model based on improved YOLO v3 and VGG 16 networks
TW202411949A (en) Cascaded detection of facial attributes
CN118898869B (en) Violence behavior detection method, electronic equipment and storage medium
TWI623889B (en) 3d hand gesture image recognition method and system thereof
CN118230249A (en) Intrusion object detection and identification method, device, equipment and storage medium
CN106662927A (en) Motion recognition method and motion recognition device
JP5865687B2 (en) Image feature extraction device and program thereof
TWI618027B (en) 3d hand gesture image recognition method and system thereof with ga
TWI444909B (en) Hand gesture image recognition method and system using singular value decompostion for light compensation

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees