[go: up one dir, main page]

TW201040847A - Method of identifying objects in image - Google Patents

Method of identifying objects in image Download PDF

Info

Publication number
TW201040847A
TW201040847A TW98116102A TW98116102A TW201040847A TW 201040847 A TW201040847 A TW 201040847A TW 98116102 A TW98116102 A TW 98116102A TW 98116102 A TW98116102 A TW 98116102A TW 201040847 A TW201040847 A TW 201040847A
Authority
TW
Taiwan
Prior art keywords
image
specific point
axis
recognizing
identifiable
Prior art date
Application number
TW98116102A
Other languages
Chinese (zh)
Other versions
TWI417796B (en
Inventor
Gregory C Smith
Xiang-Feng Chen
Original Assignee
Univ Nat Taiwan
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Univ Nat Taiwan filed Critical Univ Nat Taiwan
Priority to TW98116102A priority Critical patent/TWI417796B/en
Publication of TW201040847A publication Critical patent/TW201040847A/en
Application granted granted Critical
Publication of TWI417796B publication Critical patent/TWI417796B/en

Links

Landscapes

  • Image Analysis (AREA)

Abstract

A method of identifying objects in images includes steps of: photographing an object; capturing characteristics of the object to generate histogram of the X-axis or the Y-axis of the captured object; deriving a minimum specific point and a maximum characteristic point from the histogram of the X-axis or the Y-axis; calculating the shape, size, and position information of the object by polynomial regression analysis based on the information of minimum specific point and the maximum characteristic point; if the object is identified to be linear trend, the object is determined to be an triangle; if the object is identified to be straight line trend, the object is determined to be a square; and if the object is identified to comply with a quadratic equation, the object is determined to be a circle.

Description

201040847 六、發明說明: 【發明所屬之技術領域】 本發明係關於一種在影像中辨識物體的方法,特別 係關於一種可準確及迅速辨識出擷取影像之形狀、大小 或位置之在影像中辨識物體的方法。 【先前技術】 習知物體辨識系統,係包含一擷取輸入影像用之照 相機(或類似裝置),並包含切割輸入影像、從切割之影 〇 像中擷取物體、表示輸人之影像物體,及將待辨識擷: 物體分類。 經操取輸入影像一力係由數位格式<色彩或灰階像 素資料組成,並被排列為又與γ方向之二維陣列其中, 該影像可包含數千或數萬個獨立像素。 在最低階層的影像表示部份,物體可被模式化為全 部的影像,並將該影像與—在資料庫中的原始輸入的影 像’-個像素-個像素地做比對。不過多數物體辨識系 統採用不同的切割影像及擷取、表示及分類影像物體的 ϋ 方法’以提升速度與/或準確度(Kmmm, U.S. Patent 7,092,566)。 特別指出的是,色彩長條統計圖已被用於各種不同 的處理步驟巾’其主要制以提升彩色影像中物體辨識 的速度。 在美國7,〇2〇,329號專利中,prempraneerach等人描 ^ 了一種利用色彩長條統計圖切割一彩色影像成複數個 區域的技術,其係藉由將影像轉換為三維色彩空間,並 產生該色彩空間之每一維度的色彩長條統計圖,使用該 201040847 長計圖在該三維色彩空間中產生複數個連接盒,以 計算每-連接盒之一正規化變異值’而形成連ς盒叢 組,對應該等經切割與對應相肖色彩特性之影像區域之 叢組化像素影像,以從該影像中取出經切割區域,並分 類該影像域中之叢組化像素,以辨識該影像中的物體。 對於其中之分類方法,可使用類神經網路(具可適性樣板 匹配技術頻敏競爭學習 '對手處罰競爭學習或統計分 類分級法。 Ο Ο 過去rrrneerach等人說明其所提出之叢組化技術較 =去疊代叢組化技術更具效力,並還指出其所提出之切 d方法更藉每-像素僅處理一次的方式 間’以建立其後叢組化所需之長條統計圖的強度H 及飽和度。 π又巴度 不過,該方法在達成物體辨識處理 串複雜的計算。首先,輸入學傻达須、由:仍需要連 加以h Μ β 像㈣―邊緣預留滤鏡 加以處理,使彩色影像得到平滑化, 長條統計圖中的不連續性,接著該影像= 计圖s長條料圖,料該長條統 2屮 頻雜訊’且每—長條統計圖中的最低處 一長條統計圖的方式為之)。 …斯,慮鏡旋繞母 正連接盒叢組被計算每-連接盒中之像素值之 異的方式形成,且連接盒被根據正規化變異值連 ==構。連接盒之具區域最小正規化變異值= <、’母ί狀結構之根節點,其餘者則被則—最陡梯 201040847 度下降消算法連結為根節點的分支節點。 為使該方法正常卫作,影像中的同f區域必須被叢 組化成該三維長條統計圖中的良好定義間距,不過真實 影像色彩變異可能會難與雜訊或自RGB轉換成ms色彩 空間之非線性轉換所形成之色彩變異相區隔。 Ο201040847 VI. Description of the Invention: [Technical Field] The present invention relates to a method for recognizing an object in an image, in particular to an image that can accurately and quickly recognize the shape, size or position of the captured image. The method of the object. [Prior Art] A conventional object recognition system includes a camera (or the like) for capturing an input image, and includes cutting an input image, extracting an object from the image of the cut image, and representing the image object of the input image. And will be identified 撷: Object classification. The input image is composed of a digital format <color or gray scale pixel data and is arranged in a two-dimensional array of gamma directions, wherein the image may contain thousands or tens of thousands of independent pixels. At the lowest level of the image representation, the object can be modeled as an entire image, and the image is compared to the original input image in the database by - pixels to pixels. However, most object recognition systems use different cut images and methods of capturing, representing, and classifying image objects to improve speed and/or accuracy (Kmmm, U.S. Patent 7,092,566). In particular, the color strip chart has been used for a variety of different processing steps, which are primarily used to increase the speed of object recognition in color images. In U.S. Patent No. 7, 〇 2, 329, Prempraneerach et al. describe a technique for cutting a color image into a plurality of regions using a color strip chart by converting the image into a three-dimensional color space. Generating a color strip chart for each dimension of the color space, using the 201040847 long gauge map to generate a plurality of junction boxes in the three-dimensional color space to calculate a normalized variation value of each of the connected boxes to form a flail box The cluster group, corresponding to the clustered pixel image of the image region that is cut and corresponding to the color characteristic of the phase, to take the cut region from the image, and classify the grouped pixels in the image domain to identify the image The object in it. For the classification method, you can use the neural network (with adaptive template matching technology, frequency-sensitive competition learning, 'opportunity penalty competition learning or statistical classification and classification method. Ο Ο In the past, rrrneerach et al. explained that the proposed clustering technique is better. = De-stacking clustering technique is more effective, and it is also pointed out that the proposed method of cutting d is more intensively per pixel-only method to establish the strength of the long-statistic chart required for post-cluster grouping. H and saturation. π and Bard However, this method is a complex calculation for the object recognition processing. First, the input of the silly must, by: still need to be connected with h Μ β image (4) - edge reservation filter to deal with To smooth the color image, the discontinuity in the long chart, and then the image = the long slab of the chart s, which is expected to be in the long-range data. The lowest point is a long chart. ..., the mirror is wrapped around the mother. The connected box group is formed by calculating the difference in pixel values in each of the junction boxes, and the junction box is constructed according to the normalized variation value. The minimum normalized variation value of the area of the connection box = <, the root node of the 'female structure', and the others are the most steep ladder. The 201040847 degree reduction algorithm is linked to the branch node of the root node. In order for this method to work properly, the same f-region in the image must be clustered into a well-defined spacing in the three-dimensional strip chart, but the true image color variation may be difficult to convert to noise or RGB to ms color space. The color variation phase formed by the nonlinear transformation is separated. Ο

在美國第 7,092,566、6,952,496 & 6,611 622 號專利 中,Krumm描述使用色彩長條統計圖,以表示及分類一 輸入影像的技術。其物_識方法首先建立並儲存待辨 識物體之模型長條統計圖,並接著㈣—輪人影像,以 取出可能對應該等待辨識物體之區域,自該等經切割區 域中求出長純,接著比較經求出長條統計圖與該 經儲存之模型長條統計圖。如果兩長條統計圖的相似程 度超過預定臨界值,代表輸人影像與模型物體相匹配。 相匹配之輸人長條統計圖亦可被加至該固定物體之模型 長條統計圖的資料庫中。 上述方法係藉下列方式建立模型餘入影像長條統 計圖:判定一模型或輸入影像區域中,像素所呈現之真實 RGB色彩,將真實像素色彩之全部範圍㈣成—系列分 立色彩範圍或量化色彩類目’將擷取出之模型或輸入影 像區域之每-像素指定至經量化之色彩類目,及建立被 指定予每'經量化色彩類目之像素數的計數值。在一較 佳實施例中,刪像素值被量化為27個色彩類目,藉 由計算在每一經量化色彩類目中的像素計數值做輸入景; 像及模型長條統計圖的比較。模型長條統計圖必須與輸 入待辨識物體相似之一前序(prefat〇ry)影像t求出,'求出〗 模型長條統計圖之影像區域亦被用於後續輸入影像中, 201040847 以取出待辨識物體。 自相施’模型影像被藉下列方式切割: 不明顯改變之像素值判定—靜態f景影像,藉 影像中抽出該背景影像形成—前景影像 2 改變像素值群的方式切割該前景影像g物體 Ο Ο 然而,該方法主要係用於追蹤時序影像中的物 此外’該方法仍需要大量之建立模型長條統計圖愈比 輸入影像長條統計圖與模型長條統計圖之處理時間。該 方法所用之色彩長條統計圖產生技術並不預留原幾何 狀’係為固定色彩之影像像素數目的總結,故真實物體 形狀、大小及位置不能被判定。最後,不同物體之類似 長條統計圖可能會得到不正確的物體辨識結果。 …在第6,532,301及6,477,272號美國專利中,κ咖 說明了使用共生(co-occurrence)長條統計圖代表及確認 一搜尋影像中一經模型化之物體之位置的技術。 ⑽ 上述方法先建立物體的模型影像,並接著計算每一 模型影像之一共生長條統計圖。其中模型影像之建立, 係藉該物體周環之彼此以等角相隔,但不同間距之視點 擷取該待被確認物體之影像組,共生長條統計圖之計算 則係藉由確認該模型影像中每一種可能,且唯一,之未 排列像素對及產生像素對落於相同色彩範圍之數量。 接著,預定大小之搜尋窗,係自該搜尋影像之覆疊 部份產生,且每一搜尋窗之共生長條統計圖,係利用該 技術及模型影像共生長條統計圖建立之像素色彩及間離 範圍產生。 201040847 最後每模型影像共生長條統計圖與每一搜尋窗 共生長條統計圖被比較,以評估出其相似度。每一模型 影像共生長條統計圖匹配之搜尋窗共生長條統計圖,係 被設定為可能包含該待辨識物體,其中匹配係由相似值 大=一臨界值的方式進行。接著,待辨識物體之位置被 判定為位於可能包含該待辨識物體之所有搜尋窗中之具 最大相似度計量值之單一搜尋區内,並可以重覆往上、 往下、往左或往右移動一像素位置,接著計算該搜尋窗 〇 共生長條統計®,及比較該搜尋窗共生長條統計圖與每 一模型長條統計圖,藉以求出可能較高之相似度計量 值。該系統及方法必須令其搜尋窗大小、色彩範圍及距 離範圍在影像搜尋開始即被選定。In U.S. Patent Nos. 7,092,566, 6,952,496 & 6,611,622, Krumm describes the use of color strip charts to represent and classify an input image technique. The object_method first establishes and stores a model strip chart of the object to be identified, and then (4)-wheel image to take out the area that may be waiting for the object to be identified, and obtain the long purity from the cut area. Then, the long graph and the stored model strip graph are obtained. If the similarity of the two long bars exceeds a predetermined threshold, it means that the input image matches the model object. The matching input strip chart can also be added to the database of the model's long strip chart of the fixed object. The above method establishes a model residual image strip chart by determining the true RGB color represented by the pixel in a model or input image region, and the full range of the true pixel color (four) into a series of discrete color ranges or quantized colors. The category 'assigns the extracted model or each pixel of the input image area to the quantized color category and establishes a count value assigned to the number of pixels per 'quantized color category'. In a preferred embodiment, the decimated pixel values are quantized into 27 color categories by comparing the pixel count values in each quantized color category to the input scene; the comparison of the image and the model strip chart. The model strip chart must be obtained from the prefat〇ry image t similar to the input object to be identified. The image area of the model strip chart is also used in the subsequent input image, 201040847 to be taken out. The object to be identified. The self-phased 'model image is cut by the following method: pixel value determination without significant change - static f scene image, the background image is extracted by the image - foreground image 2 changes the pixel value group to cut the foreground image g object Ο Ο However, this method is mainly used to track the objects in the time series image. In addition, the method still needs a large amount of modeling time. The longer the statistical chart is, the more processing time is than the input image long chart and the model long chart. The color strip chart generation technique used in the method does not reserve the original geometry as a summary of the number of image pixels of a fixed color, so the shape, size and position of the real object cannot be determined. Finally, similar strip graphs of different objects may result in incorrect object recognition results. In U.S. Patent Nos. 6,532,301 and 6,477,272, Kappa describes the use of co-occurrence strip charts to represent and confirm the location of a modeled object in a search image. (10) The above method first establishes a model image of the object, and then calculates a co-growth bar graph for each model image. The image of the model is created by the isometric angle of the circumference of the object, but the image points of the object to be confirmed are captured by the viewpoints of different distances, and the calculation of the total growth bar chart is confirmed by confirming the image of the model. Each of the possible, and unique, unaligned pairs of pixels and the number of pairs of pixels that fall within the same color range. Then, a search window of a predetermined size is generated from the overlay portion of the search image, and a common growth bar graph of each search window is a pixel color and a space established by using the technology and the model image co-growth bar graph. Produced from the range. 201040847 Finally, each model image co-growth bar chart is compared with each search window co-growth bar chart to evaluate its similarity. Each model image co-growth bar graph matching search window co-growth bar graph is set to possibly include the object to be identified, wherein the matching system is performed by a similar value of a large value = a critical value. Then, the position of the object to be identified is determined to be located in a single search area having the largest similarity measure among all the search windows that may include the object to be recognized, and may be repeated upward, downward, left or right. Moving a pixel position, then calculating the search window total growth bar statistic®, and comparing the search window co-growth bar graph with each model strip graph to obtain a possibly higher similarity measure value. The system and method must have its search window size, color range, and range selected at the beginning of the image search.

Knim說明該方法之數個優點,其中特別提到共生長 條統計圖係代表辨識影像中物體的有效方法。藉持續追 蹤具匹配色彩,並在其間具有一固定距離之像素,能夠 將可變之幾何資訊量加至一規則唯色彩長條統計圖中。 接著,藉考慮色彩與幾何資訊,物體辨識方法在背景凌 〇 亂及適量閉塞及物體屈曲的條件下仍可工作。 不過,長條統計圖匹配所需之模型影像資料庫的建 立相當耗時,且藉由計算模型影像及搜尋影像中,針對 每一種可能唯一非排列像素距離進行計量,在計算共生 長條統計圖上亦相當消耗計算成本。 此外,物體的表示,並不包含模型與搜尋影像中同 色彩像素間距以外的其它詳細幾何資訊,真實物體形 狀、大小及位置之資訊並不存在,故最後的物體位置判 定不是準確的,且後續之判定位置連續限定需要大量的 201040847 計算量。 再者,搜尋窗大小等方法參數會影響物體辨識準確 度,且影像必須被調整大小,以令搜尋與模型影像間的 總大小差得以受到處理。 物體辨識系統之一有效用,且定義相當良好的應用 為在移動之車輛上即時辨識交通號誌,且此交通號誌系 統一般必須能夠快速抽出物體及正確對物體加以分類。 ❹ Ο 在美國第6,801,638號專利中,Jassen等人說明一種 辨識交通號tfe之枝與裝置,其能靠記憶體協助顯示該 等號誌給觀看者。在該方法與裝置中,影像係由一影像 感測器擁取,並透過-資訊處理單元中之分類器進行分 析及分類。接著,—交通號該之合成影像被產生,該影 像被存於-記憶單元中,並透過—顯示單元顯示。y 輸入影像先被藉色彩與/或空間位置資訊搜尋, 平均機率之區域,則判定可能包含的交通號諸物體。物 體在,判定區域中被辨識的方式係為,以階層及順序方 式藉交通號諸之各分立已知特性,針對儲存中特性 分類該等影像區域’如為確認外形(圓或正方形)及 符號用之校正程序等。分類器比較輸入物體特性資料; 記憶單元中储存之典型特性資料組,物體在比較距離低 於組臨界值時被辨識出。 ”〜、w 携饭训沐数次,以處 因天氣變化及光條件之影像品f差異。此外,該分 t與,憶單元中儲存形狀之輪人物體形《_ “ 聯性相關。一般而言,盘關_ 上Knim illustrates several advantages of this approach, with particular reference to co-growth bar graphs representing an efficient way to identify objects in an image. By continuously tracking the matching colors and having a fixed distance between them, the variable geometric information can be added to a regular color strip chart. Then, by considering color and geometric information, the object recognition method can still work under the condition that the background is confusing and the amount of occlusion and the object is buckling. However, the establishment of a model image database required for long-status chart matching is quite time consuming, and by calculating the model image and the search image, the distance of each possible unique non-arranged pixel is measured, and the co-growth bar graph is calculated. It also consumes considerable computational costs. In addition, the representation of the object does not include detailed geometric information other than the distance between the model and the searched image, and the information of the shape, size and position of the real object does not exist, so the final object position determination is not accurate, and subsequent The determination of the position continuously requires a large amount of calculation for 201040847. Furthermore, method parameters such as the size of the search window affect the object recognition accuracy, and the image must be resized so that the total size difference between the search and the model image is processed. One of the object recognition systems is effective and well-defined applications for the immediate identification of traffic signs on moving vehicles, and the traffic signal system must generally be able to quickly extract objects and properly classify objects. ❹ Ο In U.S. Patent No. 6,801,638, Jassen et al. describe a branch and device for identifying a traffic number tfe that can assist in displaying the slogan to a viewer by means of a memory. In the method and apparatus, the image is captured by an image sensor and analyzed and classified by a classifier in the information processing unit. Next, the composite image of the traffic number is generated, and the image is stored in the memory unit and displayed through the display unit. y The input image is first searched by color and/or spatial position information, and the area of average probability is used to determine the traffic number objects that may be included. The way in which the object is identified in the decision area is to classify the image areas by the hierarchical characteristics of the traffic signals in a hierarchical and sequential manner, such as to confirm the shape (circle or square) and symbols. Use the calibration procedure, etc. The classifier compares the input object characteristic data; the typical characteristic data set stored in the memory unit, and the object is recognized when the comparison distance is lower than the group threshold. "~, w Carrying rice training for several times, in order to make a difference in the image of the weather due to weather changes and light conditions. In addition, the point t is related to the shape of the person in the unit that stores the shape of the wheel "_". In general, the mark is off _

關聯性相關之分類器可能不夠 準確或速度慢。關聯性盘,丨綠 J /、刺練、觀視環境及/或儲存中形 8 201040847 狀資料之品質相關。 改善儲存中形狀資料需要大量的訓練或大型的儲存 資料庫。其:欠,大型儲存資料庫需要更多的處理時間進 行關聯處理。舉例而言,圓形及正方形物體在不同觀視 角度時會以不同的㈣及長方形出現,如此,可能會降 低物體辨識的準確度。 上述方法亦需要對輸入影像可能包含的色彩值及/ 或空間位置相關之交通號誌之區域加以搜尋。 〇 在美國第6,813,545號專利中,stromme說明了一種 提醒司機至少一特定交通號諸存在的系统,其係由一影 像單元 負料庫、一自動辨識單元、一雙號諸間選擇 機制及一聲音與/或可視指示器組成,其中該影像單元被 連接於車輛前之靠近道路處,該資料庫包含至少一預先 儲存之交通號誌形狀,該自動辨識單元則被用以偵測及 確認連續影像之交通號誌,且該偵測及確認,係藉由搜 尋該資料庫中之形狀的方式達成,該雙訊號間選擇機制 包含於相同影像中,用以判定車輛與號誌間的距離,該 0 聲音及/或可視指示器通知一經確認之交通號誌已出現 於車輛前的路上。 輸入影像被定期根據車速擷取,且每一輸入影像在 一形狀辨識處理器中被分析,以偵測交通號誌形狀與含 於形狀資訊資料中的交通號誌符號形狀。 該系統中的形狀搜尋與辨識單元係使用傳統影像處 理方法’如Canny邊緣偵測,接著再於被處理影像上進 行簡易的逐一像素匹配程序。號誌形狀之數個方向觀看 結果被存於形狀匹配資料庫中。對於號誌中的三角形、 9 201040847 圓形或長方形符號亦被利用圖案或用於經偵測形狀上 辨識演算法確認。色彩_亦被執行以核對 狀確為-交通號言志。 ’ 然而,邊緣偵測與逐一像素匹配方法一般都不準確, 且速度相當慢。此外’同—號諸與號魏符號形狀及 隨車位置之不同的差異相當大,這會使得以邊緣谓測及 逐-像素匹配方式執行之形狀匹配與物體辨識 受制。 又 〇 在美國專利第5,926,564號中,MaSayuki Ki_係 說明一種以長條統計圖模式進行影像辨識,然而,其主 要係針對X軸及Y軸進行掃描,並以”〇”、,,i,,方式進行 影像比對’如此的比對模式…旦影像擁取裝置^拍: 物產生角度差時,即無法正確辨識出物體之形狀及大小 尺寸。 由此可見,上述習用方式仍有諸多缺失,實非一良 善之設計,而亟待加以改良。 本案發明人鑑於上述習用方式所衍生的各項缺點, 〇 乃亟思加以改良創新,並經多年苦心孤詣潛心研究後, 終於成功研發完成本件在影像中辨識物體的方法。 【發明内容】 本發明之目的即在於提供一種藉由x軸或γ軸長條 統計圖上取得之最小特定點及最大特定點,配合多項式 回歸分析,即可快速並簡易判定物體形狀、大小及位置 之在影像中辨識物體的方法。 本發明之次一目的係在於提供一種在影像中辨識物 體的方法,係可擷取不同的可辨識影像特徵,如RGB色 歩驟] 輸入影像; 歩驟2 特徵;例如 以取得一數位Correlation related classifiers may not be accurate or slow. Relevant discs, green J /, spurs, viewing environment and / or storage medium 8 201040847 The quality of the data. Improving the shape data in storage requires a lot of training or a large storage database. It: owed, large storage databases require more processing time for correlation processing. For example, circular and square objects appear in different (four) and rectangular shapes at different viewing angles, which may reduce the accuracy of object recognition. The above method also requires searching for the area of the traffic signal associated with the color value and/or spatial position that the input image may contain. In US Patent No. 6,813,545, stromme describes a system that alerts a driver to the presence of at least one particular traffic number, which consists of an image unit negative library, an automatic identification unit, a double selection mechanism, and a sound. And/or a visual indicator, wherein the image unit is connected to the road in front of the vehicle, the database includes at least one pre-stored traffic symbol shape, and the automatic identification unit is used to detect and confirm the continuous image. The traffic sign, and the detection and confirmation is achieved by searching for the shape in the database. The dual signal selection mechanism is included in the same image to determine the distance between the vehicle and the sign. 0 Sound and / or visual indicator notifications A confirmed traffic sign has appeared on the road in front of the vehicle. The input image is periodically captured based on the vehicle speed, and each input image is analyzed in a shape recognition processor to detect the shape of the traffic signal and the shape of the traffic symbol contained in the shape information. The shape search and recognition unit in the system uses a conventional image processing method such as Canny edge detection, followed by a simple pixel-by-pixel matching procedure on the processed image. The number of directions in the shape of the logo is stored in the shape matching database. For the triangle in the symbol, the 9 201040847 circular or rectangular symbol is also confirmed by the pattern or for the identification algorithm on the detected shape. The color _ was also executed to verify that the traffic number was true. However, edge detection and pixel-by-pixel matching methods are generally inaccurate and relatively slow. In addition, the difference between the shape of the same symbol and the position of the vehicle is quite large, which makes the shape matching and object recognition performed by edge prediction and pixel-by-pixel matching. Also, in U.S. Patent No. 5,926,564, MaSayuki Ki_ describes an image recognition in a long chart mode, however, it mainly scans for the X-axis and the Y-axis, and is "〇",,, i, , the method of image comparison 'such a comparison mode ... Once the image capture device ^ shot: When the object produces an angular difference, the shape and size of the object cannot be correctly identified. It can be seen that there are still many shortcomings in the above-mentioned methods of use, which is not a good design, but needs to be improved. In view of the shortcomings derived from the above-mentioned conventional methods, the inventor of the present invention has improved and innovated, and after years of painstaking research, he finally succeeded in researching and developing the method for identifying objects in the image. SUMMARY OF THE INVENTION The object of the present invention is to provide a minimum specific point and a maximum specific point obtained by the x-axis or γ-axis bar graph, and the polynomial regression analysis can quickly and easily determine the shape and size of the object and The method of identifying an object in an image. A second object of the present invention is to provide a method for recognizing an object in an image, which can capture different recognizable image features, such as RGB color, input image; step 2; for example, to obtain a digit

G Ο 201040847 戈灰“像特徵或視頻影像特徵,並迅速及準確 用於空ϋ料·^狀 的辨識’使其可應 用於父通破5^、或其他領域上。 步驟Ϊ成上述發明目的之在影像中辨識物體的方法,其 ••先在影像資料中進行掃描 梅取該數位輸人影像_其中—種可辨識的 色衫特徵(如RGB或IHS色彩)< 徵或光譜特徵等; ”,、白特 步驟3:同時建立可辨識影像特徵之一維 軸長條統計圖,該Χ抽或γ 5 出影像之幾何資訊,· 轴長條統计圖中保有該擁取 ν、驟4求出χ軸或γ轴長條統 點及最大特定點,以找圆之滅小特疋 + m 找U等長條料®巾的物體; 點及ί大^ Μ軸或丫轴長條統計圖上之最小特定 形狀’·若㈣物體為線性走向即可判斷為三^物= 直線走向則判定為方行;若 ,右為 形; 付0 一-人方程式則判斷為圓 6 ·藉由x軸或γ轴長條統計圖上之兩最小特 疋點間的距離判定物體大小; 取J特 點判=位藉置由x轴或γ轴長條統計圖上之最小特定 覆步二8步影像,並重 娜 以辨識該影像中之其他特徵。 201040847 【實施方式】 法之⑺不’為本發明在影像中辨識物體的方 法之流程步驟圖,主要包括: 像的tr,:係藉錄位影像卿裝置對物體進行數位影 資料亦可藉Γ取付―數位影像資料;另外,該數位影像 枓亦了藉由一類比影像擷取裝置取 ;=r:r,同樣可取-數位影像== Ο Ο 或類比影:擷取數位影像擁取裝置 影像之裝置 機或攝影機或其他可擁取 性,操取某單H 照RGB之色彩特 為-灰或者,當影像 或者,可L7姑说擁 象辨識灰階的特徵; 時,同樣传鮮之數位影像資料為可見光譜頻率範圍 、 健麻該單—光m像可_特徵; 步驟3··將影像可辨識特徵擷取出後,即 2條:t取影像之X軸方向(水平)或γ軸方向(垂直) =圖:。3’·而取得Χ抽或γ抽長條統計圖的方式,) 十算該影像可辨識特徵之每一行或每一列上的像素, 此,即可取得Χ軸或γ軸長條統計圖; 、 軸取得X軸或Μ長條統計圖後,再求出Χ 軸或Υ軸長條統計圖之最小特定點及最大特定點, 上特疋點及取大特定點係為可辨識特徵影像中的零^ =及最大值點像素8G4;另外,最小及最 之 尋方式’ «由線性搜尋模式於Χ軸或= = 12 201040847 中被求出’並同時並註記與記錄該最小特定點及 定點的位置;另外’亦可以不同搜尋演算法或方法+ =轴或Υ軸長條統計圖資料中的最小特定點及最大特定 步驟l再透過多項切歸分析方式根據求 ,小特定點及最大較點;t義出物體形狀8G5;例如. 若最小特定點至最大特定點間係呈線性走向,即可二 Ο Ο 該待辨識物體應為三角形;^最小特定點至最大特= 間係呈直線走向’即可判定該待辨識物體應為四方开/或 矩形;若最小特定點至最大特定點間係符合二次方^ 式,即可判定該待辨識物體應為圓形或橢圓形; *步驟6:再依照影像之χ轴或γ軸長條統計圖 貝料與最小特定點及最大特定點的位置,即可取得影 2轴或Υ轴方向的像素值,透過該像素值即可準確 算出拍攝物體的大小尺寸806 ; ° 步驟7 :再根據f彡像之χ轴及γ軸長條統計 與最小特點的位置判定出物體的正確位置8〇7; 科 =驟8·再揭取上述數位影像資料中不同辨識特徵 像資料(如其他不同色彩或光譜頻率的特徵),並重 2驟2至步驟7之處理流程’如此,將同—影像中不 5 ’辨識特徵作多次個別的辨識處理,以達到更準確及 更^速辨識物體形狀、大小及位置之目的。 明=閱圖二至圖五所示,係本發明之第一實施示意 圖’该影像辨識步驟如不: 步驟1.如圖二所示,該白色背景上具有五種態樣 的數位影像資料,分別為一紅色三角形1、一紅色圓形 13 201040847 2、一紅色四方形3、—綠色三角形4及—藍色四方形5· 該影像可透過數位電腦繪製而成,或透過數 像擷取裝置拍攝而成; 飞頚比〜 Ο Ο 步驟2:如圖三所示’當取得數位影像資料後 可依照影像可賴之特徵,決定先操取具有紅色特徵之 影像’因此’圖二中具有紅色特徵之紅色三角形i、紅 色圓形2及紅色四方形3皆會被擷取出,並開始進行影 像辨識,而非紅色特徵之影像,如綠色三角形4及藍色 四方形5會在影像擷取過程中被移除; 步驟3:計算X軸及Y軸長條統計圖;如圖四所示, 同時計算圖三中所擷取影像内的每一行上的紅色(非黑) 像素,以取得X軸(水平)長條統計圖,該X軸長條統 計圖上會顯示出與紅色三角形1、紅色圓形2及紅色四 方形3相對應之三角長條統計圖1〇1、曲線長條統計圖 2〇1及方形長條統計圖3〇1,而χ轴長條統計圖之水平 方向所載數值係代表行數,而垂直方向則代表像素點 值;該X軸長條統計圖所顯示的資料保有足夠資訊,以 判定在原始輸入影像中之物體形狀與水平物體大小及位 置;如圖五所示’係計算圖三中經擷取影像内每一列上 的紅色(非黑)像素,以取得之γ轴(垂直)長條統計圖;該 γ軸長條統計圖上會顯示出與紅色三角形丨、紅色圓形2 及紅色四方形3相對應之三角長條統計圖102、曲線長 條統計圖202及方形長條統計圖302,而Υ軸長條統計 圖之水平方向所載數值係代表列數,而垂直方向則代表 像素點值;該Υ軸長條統計圖資料保有足夠資訊,以判 定在原始輸入影像中之物體形狀與垂直物體大小及位 14 201040847 —步驟4 .再依照χ軸或γ軸長條統計圖求出最小特 定點及最大特定點’如圖四所示,係以線性搜尋方式求 出二最小特定點及一最大特定點,同時將求出之二^小 特定點及最大特定點的位置(行數)註記與記錄;以乂軸 長條統計圖為例,該二最小特定點之像素值皆為〇點, 故經由像素〇點朝對應行數的位置逐行尋找,即可在第 50行中蒐尋到三角長條統計圖1〇1的第一個最小特定點 © A,並將第50行加以註記及記錄,緊接著會搜 長^统計圖之第二個最小特定點c為第17〇#,及= 特定點B為像素85 _點,並將所搜尋的特定點B、c註記 及記錄;接著搜尋出曲線長條統計囷2〇1及方形長條統 計圖3〇1之最小特定點d'f'g]及最大特定點e h、 I;其中,該最小特定點D為273行,362行,g為 498行,J為563行,而最大特定點E像素為%點,η 及I像素為86點;最後搜尋至又軸長條統計圖的終端; 同理’圖五所示之Υ轴長條統計圖所顯示之三角長條統 ❹ 計圖1G2、曲線長條統計圖202及方形長條統計圖3〇2 的最小特定點及最大特定點同樣以相同方式求出,以此 不在贅述,可得到三角長條統計圖1〇2的最小特定點κ 及Μ分別為50行及136行,而最大特定點L為像素119 點;該曲線長條統計® 202之最小特定點p分別為 160行及245行,最高特定點〇為像素88點;該方形長 條統計圖302之最小特定點(^及τ分別為28〇行及365 行,最咼特定點R及S為像素65點,依序搜尋到γ軸 長條統計圖的終端; 15 201040847 步驟5 ·取得最小特定點及最大特定點後,請參照 圖四所示,係透過X軸長條統計圖中加以回歸(regression) 分析,以判斷出可能之物體形狀;係經由被註記及記錄 為最小特定點及最大特定點之位置,而定義出三角長條 統计圖101之線條1〇11及1012與曲線長條統計圖2〇1 之線條2011與方形長條統計圖3〇1之線條3〇11、3〇12、 3013的邊界形狀; 首先’係針對三角長條統計圖1〇 j之線條1〇 i i的多 〇 項式回歸分析的結果:G Ο 201040847 Ge ash "like features or video image features, and quickly and accurately used for the identification of empty materials · ^ shape" can be applied to the father through the 5 ^, or other areas. Steps into the above purposes The method of recognizing an object in an image, which first scans the image data to capture the digital input image _ among them, an identifiable color shirt feature (such as RGB or IHS color) < sign or spectral feature, etc. ;,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,, Step 4 Find the χ axis or γ axis strip point and the maximum specific point, in order to find the circle of the small special 疋 + m to find the U and other long strips of the material of the towel; point and ί large ^ Μ axis or 丫 axis length The smallest specific shape on the chart '· If (4) the object is linear, it can be judged as three objects = the straight line is judged as a square line; if, the right is a shape; the pay 0 - the human equation is judged as a circle 6 · Determining the size of the object by the distance between the two minimum special points on the x-axis or γ-axis strip chart; Take the J-point judgment = bit to borrow the smallest specific step-by-step 8-step image from the x-axis or γ-axis bar graph, and re-apply to identify other features in the image. 201040847 [Embodiment] The method (7) is not a process step diagram of the method for recognizing an object in an image according to the invention, and mainly includes: tr of the image, which can also be used to perform digital image data on the object. Paying for digital image data; in addition, the digital image is also taken by a class of image capture device; =r:r, the same can be taken - digital image == Ο 或 or analog image: capture digital image capture device image The device or camera or other admissibility, the operation of a single H RGB color is special - gray or, when the image or L7 can be said to recognize the grayscale features; The image data is the visible spectral frequency range, and the single-light m image can be _ feature; Step 3·· After the image identifiable feature is extracted, that is, 2: t take the X-axis direction (horizontal) or γ-axis of the image Direction (vertical) = graph:. 3′······························································································· After the axis obtains the X-axis or Μ long bar chart, the minimum specific point and the maximum specific point of the Χ-axis or Υ-axis bar graph are obtained, and the upper special point and the larger specific point are identified as identifiable features. The zero ^ = and maximum point pixel 8G4; in addition, the minimum and most search mode '« is found by the linear search mode on the Χ axis or = = 12 201040847' and simultaneously notes and records the minimum specific point and fixed point Position; in addition, can also be different search algorithm or method + = minimum or specific step in the chart data of the axis or the axis, and the maximum specific step l and then through multiple cut analysis methods, small specific points and maximum comparison Point; t derives the shape of the object 8G5; for example. If the minimum specific point to the maximum specific point is linear, then the object to be identified should be a triangle; ^ the minimum specific point to the maximum special = the line is a straight line Go to 'can determine that the object to be identified should be Square opening/or rectangle; if the minimum specific point to the maximum specific point is in accordance with the quadratic formula, it can be determined that the object to be identified should be circular or elliptical; *Step 6: According to the axis or γ of the image The position of the axis and the minimum specific point and the maximum specific point of the axis can be obtained as the pixel value in the direction of the 2 axis or the axis. The pixel size can be accurately calculated by the pixel value 806; ° Step 7 : According to the 彡 及 及 及 及 及 及 及 及 及 及 及 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定 判定Other characteristics of different colors or spectral frequencies), and the processing flow from step 2 to step 7 is performed. Thus, multiple identification functions are performed in the same image without multiple 5' identification features to achieve more accurate and faster identification. The purpose of the shape, size and location of the object. The following is a schematic diagram of the first embodiment of the present invention. The image recognition step is as follows: Step 1. As shown in FIG. 2, the white background has five kinds of digital image data. They are a red triangle 1, a red circle 13 201040847 2, a red square 3, a green triangle 4 and a blue square 5 · The image can be drawn by a digital computer, or through a digital image capture device Shooting; Flying 頚 ratio ~ Ο Ο Step 2: As shown in Figure 3, 'When digital image data is acquired, it can be decided to take the image with red features first according to the characteristics of the image. Therefore, there is red in Figure 2. The red triangle i, red circle 2 and red square 3 of the feature will be taken out and image recognition will be started instead of the red feature image, such as green triangle 4 and blue square 5 will be in the image capturing process. Removed from step; Step 3: Calculate the X-axis and Y-axis strip charts; as shown in Figure 4, simultaneously calculate the red (non-black) pixels on each line in the captured image in Figure 3 to obtain X Axis (horizontal) strip statistics , the X-axis strip chart will display the triangle strip chart corresponding to the red triangle 1, the red circle 2 and the red square 3, the curve of the long strip chart 2〇1 and the square strip Chart 3〇1, and the numerical values in the horizontal direction of the x-axis bar graph represent the number of rows, while the vertical direction represents the pixel value; the data displayed in the X-axis bar graph retains enough information to determine The shape of the object in the original input image and the size and position of the horizontal object; as shown in Figure 5, the red (non-black) pixels in each column of the image are captured in Figure 3 to obtain the gamma axis (vertical) Long bar chart; the γ-axis bar graph will display the triangular strip chart 102, the curve strip chart 202 and the square strip corresponding to the red triangle 丨, the red circle 2 and the red square 3 Chart 302, and the numerical values in the horizontal direction of the x-axis bar graph represent the number of columns, while the vertical direction represents the pixel value; the axis bar graph data holds sufficient information to determine the original input image. Object shape and vertical Object size and position 14 201040847 - Step 4. Then find the minimum specific point and the maximum specific point according to the x-axis or γ-axis bar graph. As shown in Figure 4, the two minimum specific points and one are obtained by linear search. The maximum specific point, at the same time will find the two small specific points and the position of the largest specific point (number of rows) annotation and record; take the long axis chart as an example, the pixel values of the two minimum specific points are all defects Therefore, the pixel search point is searched line by line toward the corresponding line number, and the first minimum specific point © A of the triangle strip chart 1〇1 can be searched in the 50th line, and the 50th line is noted. And the record, followed by the search for the second smallest specific point c of the chart is the 17th 〇 #, and = the specific point B is the pixel 85 _ point, and the specific points B, c of the search are noted and recorded; Then search for the curve bar 囷2〇1 and the square bar chart 3〇1 minimum specific point d'f'g] and the maximum specific point eh, I; wherein the minimum specific point D is 273 lines, 362 Line, g is 498 lines, J is 563 lines, and the maximum specific point E pixel is % point, η and I pixel is 86 points; After searching for the terminal of the axis long bar chart; the same as the triangle bar chart shown in Figure 5 shows the triangle strip chart 1G2, the curve bar chart 202 and the square bar chart The minimum specific point and the maximum specific point of 3〇2 are also obtained in the same way. Therefore, the minimum specific points κ and Μ of the triangular strip chart 1〇2 can be obtained, respectively, and 50 lines and 136 lines, respectively. The specific point L is the pixel 119 point; the minimum specific point p of the curve strip statistics о 202 is 160 lines and 245 lines, respectively, the highest specific point 〇 is the pixel 88 points; the minimum specific point of the square strip chart 302 (^ And τ are 28 lines and 365 lines respectively, and finally the specific points R and S are 65 points of pixels, and the terminals of the γ-axis long chart are sequentially searched; 15 201040847 Step 5 · After obtaining the minimum specific point and the maximum specific point , as shown in Figure 4, the regression analysis is performed through the X-axis bar graph to determine the shape of the possible object; it is recorded and recorded as the minimum specific point and the position of the largest specific point. Define the line 1 of the triangle strip chart 101 〇11 and 1012 and curve strip chart 2〇1 line 2011 and square strip chart 3〇1 line 3〇11,3〇12, 3013 boundary shape; first 'system for triangle strip chart 1 The results of the multi-parameter regression analysis of the line 1〇ii of 〇j:

Degree 1: -7〇.28 + 1.398x, a=〇.〇55 p < 〇.〇〇〇!, p<0.0001, R-2 = l.oo ;Degree 1: -7〇.28 + 1.398x, a=〇.〇55 p < 〇.〇〇〇!, p<0.0001, R-2 = l.oo ;

Degree 2: -70.71 + 1.41x - 7.〇424e-5x^2, a=0.05, p<0.0001,PM.OOOl,p = 0.6085, ΙΓ2 = 1.00 ; …果顯示二角長條統計圖报可能與斜率〗·398 互為線性,多項次回歸分析結果顯示,第二級項(Degree 2)在a=0.〇5時在統計上為不顯著。 而針對線條1〇12的多項式回歸分析結果如下: 〇 OQgTeQ U 237 2 - a = 〇〇5j p<〇.〇〇01, p<0.0001,ΪΤ2 = l.ooDegree 2: -70.71 + 1.41x - 7.〇424e-5x^2, a=0.05, p<0.0001, PM.OOOl, p = 0.6085, ΙΓ2 = 1.00; ... the result shows that the two-dimensional strip chart may be Slope 〗 398 is linear with each other. The results of multiple regression analysis show that the second level (Degree 2) is statistically insignificant at a=0.〇5. The result of the polynomial regression analysis for the line 1〇12 is as follows: 〇 OQgTeQ U 237 2 - a = 〇〇5j p<〇.〇〇01, p<0.0001,ΪΤ2 = l.oo

Degree 2: 238.5 - 1.418X + 7.〇424e-5x^2, a=0.05, p<0.0001, p<〇.〇〇〇i, P=0.6085, R^2 = i 〇〇 結果顯示三角長條統計圖101可能與斜率1.398互 為線多項式回歸分析結果顯示,第二級項 2)在a=〇.〇5時在統計上為不顯著。 因此,線條1011與1021之聯合結果顯示,在圖二 中物體為紅色三角形1; 16 201040847 接著,進行曲線長條統計圖201之線條2011的多項 式回歸分析結果如下:Degree 2: 238.5 - 1.418X + 7.〇424e-5x^2, a=0.05, p<0.0001, p<〇.〇〇〇i, P=0.6085, R^2 = i 〇〇Results show triangular strips The statistical graph 101 may be lined with a slope of 1.398. The results of the polynomial regression analysis show that the second level term 2) is statistically insignificant at a=〇.〇5. Therefore, the combined result of lines 1011 and 1021 shows that the object in Figure 2 is a red triangle 1; 16 201040847 Next, the results of the polynomial regression analysis of the line 2011 of the curve strip chart 201 are as follows:

Degree 1: 48.71 + 0.05037x, a=0.05, p = 〇.0784, p = 0.5589 RA2 = 〇.〇〇 ;Degree 1: 48.71 + 0.05037x, a=0.05, p = 〇.0784, p = 0.5589 RA2 = 〇.〇〇 ;

Degree 2: -3331 + 21.48x - 0.03375x^2, a = 0.05, p<0.0001 p<0.0001, P<0.0001,ΙΓ2 = 0.95 ; 結果顯示曲線長條統計圖201之結果很可能為圓形 或卵形’多項式回歸分析結果顯示,第二級項(Degree2) 〇 在a=0.05時在統計上為顯著。 因此,結果顯示圖二中物體為紅色圓形2。 接著’由於方形長條統計圖301之線條3011、3012 及3013為垂直與水平之線條,故無法進多項式回歸分 析;但是,該線條3011係代表零點最小特定值至最大特 定值之直線,而線條3012包含所有相等之最大特定值, 且線條3013代表最大特定值至零值最小特定值之直 線,故該線條3011、3012、3013之組成物體為一紅色四 方形3。 〇 步驟6:得知物體形狀後,再進行物體大小之判定, 係如圖四所示,該三角長條統計圖1〇1之最小特定點A、 C為5〇行及n〇行,最大特定點B為85個像素,因此, 紅色三角形1之實際大小尺寸可由照相機及影像特性與 照相機在實際環境中的位置判定出。 〃 ,該曲線長條統計圖201之最小特定點D、F為273 ^及362行,最大特定點E為86個像素,因此,若欲計 算圖一之紅色圓形2之大小,可由照相機及影像特性 照相機在實際環境中的位置判定出。 、 17 201040847 該方形長條統計圖301之最小特定點G、J為498 行及563行,最大特定點H、工為86個像素,因此,若 欲計算圖二之紅色四方形3之大小,可由照相機及影像 特性與照相機在實際環境中的位置判定出。 步驟7.最後,计算物體的位置,請同時參閱圖四 及圖五所示,係透過X軸長條統計圖及¥軸長條統計圖 所得之最小特定點a、c、d、f、g、j、k、m、n、p、 Ο Ο Q、t進行物體位置的判斷,其中,該三角長條統計圖 101之X軸最小特定點A、(^為50至17〇;而Y軸最小 特定點K、Μ則為52 1 136’透過X軸及γ軸之最小特 定點A、C、K、Μ資料,即可得知三角長條統計圖的位 置, 該曲線長條統計圖2()1之X軸最小特定點D、U 273至362;而Y轴最小特定點N、P則為16〇至245, 透過X轴及Y轴之最小特定點D、F、N、p資料,即可 得知曲線長條統計圖的位置; 該方形長條統計® 3〇kx軸最小特定點㈠為 Γ至563;H軸最小特定點Q、T則為至365, 透過X轴及Y軸之最,j、鞋&扑ρ τ 错“… J、Q、T資料,即可 传知方形長條統計圖的位置; 及〜二必要時’真實物體的大小通常可由照相機 及衫像特性與照相機在實際環境中的位置判定出。 步驟8:可換擷取綠色辨識 影像資料,並重複上述歩驟2 色辨識特徵的 辨識物體之形狀、大小及位置。7的步驟流程,即可取得 因此本發月之影像辨識係在無圖案比對與圖案比 18 201040847 雜二!!庫的條件下完成,較f用物體辨識方法改善了物 利用I速度與準確度。另外’物體形狀資訊被求出,係 开W 2式回歸分析,如此亦根據RA2適合分析提供對 ^進:準確度之計量。此外’辨識物體形狀時不會有 準確與耗時直接輸入影像搜尋之情形。 明同時參閱圖六至圖九所示,係本發明之第二實施 不μ圖,其主要係應用在交通號誌影像辨識上,主要辨 識步驟為·· mDegree 2: -3331 + 21.48x - 0.03375x^2, a = 0.05, p<0.0001 p<0.0001, P<0.0001, ΙΓ2 = 0.95; the result shows that the result of the curve strip chart 201 is likely to be round or egg The shape-polynomial regression analysis showed that the second-order term (Degree2) 在 was statistically significant at a=0.05. Therefore, the result shows that the object in Figure 2 is a red circle 2. Then, since the lines 3011, 3012, and 3013 of the square strip chart 301 are vertical and horizontal lines, the polynomial regression analysis cannot be performed; however, the line 3011 represents a line from the minimum specific value of the zero point to the maximum specific value, and the line 3012 contains all equal maximum specific values, and line 3013 represents a straight line from the maximum specific value to the minimum specific value of zero value, so the constituent objects of the lines 3011, 3012, and 3013 are a red square 3. 〇Step 6: After knowing the shape of the object, determine the size of the object, as shown in Figure 4. The minimum specific points A and C of the triangular strip chart 1〇1 are 5〇 and n〇, the maximum The specific point B is 85 pixels, so the actual size of the red triangle 1 can be determined by the camera and image characteristics and the position of the camera in the actual environment. 〃 , the minimum specific point D, F of the curve bar graph 201 is 273 ^ and 362 lines, and the maximum specific point E is 86 pixels. Therefore, if the size of the red circle 2 of Figure 1 is to be calculated, the camera and Image characteristics The position of the camera in the actual environment is determined. 17 201040847 The minimum specific points G and J of the square strip chart 301 are 498 lines and 563 lines, and the maximum specific point H and the work are 86 pixels. Therefore, if the size of the red square 3 in Fig. 2 is to be calculated, It can be determined by the camera and image characteristics and the position of the camera in the actual environment. Step 7. Finally, calculate the position of the object. Please refer to Figure 4 and Figure 5 at the same time. The minimum specific points a, c, d, f, g obtained through the X-axis strip chart and the ¥-axis strip chart. , j, k, m, n, p, Ο Ο Q, t determine the position of the object, wherein the X-axis minimum specific point A of the triangular strip chart 101, (^ is 50 to 17 〇; and the Y-axis The minimum specific point K and Μ are 52 1 136'. The minimum specific points A, C, K, and X of the X-axis and the γ-axis can be used to know the position of the triangle strip chart. The curve strip chart 2 (1) X-axis minimum specific point D, U 273 to 362; and Y-axis minimum specific point N, P is 16〇 to 245, through the X-axis and Y-axis minimum specific point D, F, N, p data , you can know the position of the curve strip chart; the square strip statistics ® 3〇kx axis minimum specific point (1) is Γ to 563; H axis minimum specific point Q, T is 365, through X axis and Y The most axis, j, shoes & ρ ρ wrong "... J, Q, T data, you can know the position of the square strip chart; and ~ two if necessary, the size of the real object can usually be seen by the camera and shirt Characteristics and photos The position of the camera in the actual environment is determined. Step 8: The green identification image data can be exchanged, and the shape, size and position of the identified object of the color recognition feature can be repeated. The image recognition of this month is completed under the condition of no pattern comparison and pattern ratio, and the object identification method improves the speed and accuracy of object utilization. In addition, the object shape information is requested. Out, the W 2 regression analysis is performed, so the RA2 is suitable for analysis to provide the measurement of accuracy: In addition, there is no accurate and time-consuming direct input image search when identifying the shape of the object. 6 to FIG. 9 is a second embodiment of the present invention, which is mainly applied to traffic image recognition, and the main identification step is ··m

歩驟1:如圖六所示,係透過影像擷取裝置拍攝拍 道路邊之交通號誌,以取得該交通號誌影像6,誃 通號誌影像6具有紅色外環61、白色内環62及黑色字 體63等影像可辨識特徵’同時,該交通號諸影像6令亦 包含其它紅色特徵物體,如:樹上的紅花71與複數朵在 灌木中的紅花7卜該紅花71有些部位被綠葉所遮蔽; …歩驟2:如圖七所示,係抓取圖六中具有紅色特徵 得影像資料,故可取得交通號誌影像6之紅色外環Μ及 樹上紅花71與灌木叢中紅花71等影像特徵; 步驟3 :如圖八及圖九所示,係經由計算而取得X 軸及Υ軸長條統計圖,該χ軸及γ軸長條統計圖上包含 紅色外環長條統計圖501、樹上紅花統計圖5〇2及灌木 叢上複數朵紅花統計圖503,其中,長條統計圖5〇1之 獨特圖形,是原始環狀物體形狀的特徵;且經由圖九之 γ軸長條統計圖可知紅色外環61與樹上紅花71在γ = (垂直)方向上重疊,故將該紅色外環長條統計圖及 樹上紅花長條統計圖5〇2資料彼此相加,以在待辨識 體對應之紅色外環長條統計圖501資料上加入雜訊 另 19 201040847 外,長條統計圖501之獨特圖形,是原始環狀物體形狀 的特徵;上述X軸及γ軸長條統計圖之計算模式與圖四 及圖五相同,於此不在贅述; 歩驟4 .如圖八及圖九所示,係利用零行至行 掃描X軸及Y軸長條統計圖資料,以求出χ軸紅色外環 長條統計圖501之最小特定點a’、D,及最大特定點Β,、 C’及Υ軸紅色外環長條統計圖5〇1之最小特定點Ε,、Η, 及最大特定點F,、G,;該最小特定點八,為1U2行,D, 〇 為2040行、Ε’為1099行、Η,為2078行;該最大特定點 Β’及C’之像素值皆為710點,而F,及G,之像素值為66〇 及650 ;另外’該最小特定點a,、D,、Ε,、Η,及最大特 定點B’、C’、F,、G,之找尋方式皆與圖四及圖五相同, 於此不在贅述; 步驟5 :取得最小特定點及最大特定點後利用圖 八所示之三組線條501卜5012、5013搭配多項式回歸分 析(regression)判斷物體之形狀; 圖八中線條5011之多項式回歸分析結果如下: 〇 Degree 1: -4439 + 4.145x, a=〇.〇5, p < 〇 〇〇〇lj p<0.0001, RA2 = 0.93 5Step 1: As shown in Fig. 6, the traffic sign is taken by the image capturing device to obtain the traffic sign image 6 of the road. The image of the traffic signal 6 has a red outer ring 61 and a white inner ring 62. And black font 63 and other image-recognizable features' At the same time, the traffic number images 6 also contain other red features, such as: safflower 71 on the tree and a plurality of safflowers in the shrub 7 safflower 71 some parts are green leaves Obscured; ... Step 2: As shown in Figure 7, the image data with red features in Figure 6 is captured, so the red outer ring of the traffic sign image 6 and the red flower 71 on the tree and the safflower in the bush can be obtained. 71 and other image features; Step 3: As shown in Figure 8 and Figure 9, the X-axis and the Υ-axis strip graph are obtained by calculation. The χ-axis and γ-axis strip graphs contain red outer loop strip statistics. Figure 501, safflower statistics on the tree 5〇2 and a plurality of safflower statistics 503 on the bush, wherein the unique figure of the long chart 5〇1 is a feature of the shape of the original annular object; The axial strip chart shows that the red outer ring 61 and the safflower 71 on the tree are at γ = In the (vertical) direction, the red outer ring strip graph and the red flower strip graph 5〇2 data are added to each other to be in the red outer loop strip graph 501 corresponding to the object to be identified. Adding the noise to another 19 201040847, the unique figure of the long chart 501 is the feature of the shape of the original ring object; the calculation mode of the above X-axis and γ-axis bar graph is the same as that of Figure 4 and Figure 5. As shown in Figure 8 and Figure 9, the X-axis and Y-axis long chart data are scanned from zero line to line to find the minimum specific point of the 红色 axis red outer ring strip chart 501. a', D, and maximum specific point Β, C' and Υ axis red outer ring strip chart 〇1 of the minimum specific point Ε, Η, and the maximum specific point F,, G,; the minimum specific point Eight, 1U2 lines, D, 〇 is 2040 lines, Ε ' is 1099 lines, Η, is 2078 lines; the maximum specific point Β 'and C' pixel values are 710 points, and F, and G, the pixels The values are 66〇 and 650; in addition, the minimum specific points a, D, Ε, Η, and the maximum specific points B', C', F, G, The search methods are the same as those in Figure 4 and Figure 5, and are not described here. Step 5: After obtaining the minimum specific point and the maximum specific point, use the three sets of lines shown in Figure 501. 5012, 5013 with polynomial regression analysis (regression) Judging the shape of the object; the polynomial regression analysis of line 5011 in Figure 8 is as follows: 〇Degree 1: -4439 + 4.145x, a=〇.〇5, p < 〇〇〇〇lj p<0.0001, RA2 = 0.93 5

Degree 2: -42765 + 68.91x - 0.02733x^2, a=0.05 p<0.0001,p<0.0001,p=0.0001,ΙΓ2 = 0.99 結果顯示該線條5011很可能為圓形或_形且多項 式回歸分析結果顯示’第二級項在a = 0.05時在統計上為 顯者; 圖八中線條5012之多項式回歸分析結果如下:Degree 2: -42765 + 68.91x - 0.02733x^2, a=0.05 p<0.0001,p<0.0001,p=0.0001,ΙΓ2 = 0.99 The result shows that the line 5011 is likely to be circular or _-shaped and polynomial regression analysis results It is shown that the 'second level term is statistically significant at a = 0.05; the polynomial regression analysis of line 5012 in Figure 8 is as follows:

Degree 1: 352.3 + 0.008745x,a = 〇.〇5,p<〇 〇〇〇1 20 201040847 ρ = 0_6111,RA2 = 0.00Degree 1: 352.3 + 0.008745x, a = 〇.〇5,p<〇 〇〇〇1 20 201040847 ρ = 0_6111,RA2 = 0.00

Degree 2:6456 - 7.86U + 0.〇〇25〇3χΛ2, a=〇 05, ρ<0.0001, ρ<〇.〇〇〇1,ρ<〇.〇〇〇1,RA2 = 〇 89 ’ 結果顯示線條5012可能為圓形或卵形,且多項欠回 歸分析結果顯示’第二級項在a=0.05時在統計上為顯7著; 圖八中線條5013多項式回歸分析結果如下:Degree 2:6456 - 7.86U + 0.〇〇25〇3χΛ2, a=〇05, ρ<0.0001, ρ<〇.〇〇〇1,ρ<〇.〇〇〇1,RA2 = 〇89 ' Line 5012 may be round or oval, and multiple under regression analysis results show that 'the second level term is statistically significant at a=0.05; the line 5013 polynomial regression analysis in Figure 8 is as follows:

Degree 1: 9155 - 4.43x, a = 0.05, p<〇.〇〇01> p<〇 〇〇〇1> RA2 = 0.92 ’ ΟDegree 1: 9155 - 4.43x, a = 0.05, p<〇.〇〇01>p<〇〇〇〇1> RA2 = 0.92 ’ Ο

Degree 2: -102241 +109x - 0.02887x^2, a=〇.〇5, p<0.0001, p<0.0001, p=0.000l, R-2 = 0.99 ’ 結果顯示線條5013可能為圓形或卵形,且多項式回 歸分析,第二級項在a=〇.〇5時在統計上為顯著;> 因此,線條5〇1卜5〇12、5〇13之聯合結果顯示,圖 八所顯示之長條統計圖501很可能為環狀體之圓形或卵 形0 歩驟6:得知物體形狀後,再進行物體大小之判定, 係參照圖八所示,該X軸方向之紅色外環長條統計圖5〇1 之最小特定點A’、D,為1112行及2040行,最大特定點 B’、C’為710個像素;而圖九所示之丫軸方向最小特定 點E’、H’為1〇99行及2078行,的最大特定點F,為66〇 個像素、G為650個像素,經由上述條件及可記算出物 體正確的大小尺寸;另外,物體之正確大小尺寸可由照 相機及影像特性與照相機在實際環境中的位置判定出, 於此不在贅述。 /驟7 .敢後,计鼻物體的位置,請同時參閱圖八 及圖九所示,係透過X軸長條統計圖及Y軸長條統計圖 21 201040847 所侍之最小特定點a,、d,、e,、h, ^ u , 進仃物體位置的判斷, 吨w斷方式與上述圖四及圖五 不在贅述; ㈣路之方式相同’於此 並重複 形狀、 並可應 經由多 步趨8:再重新擷取其他影像可辨識特徵 步驟2 7即可重新進行影像之辨識。 因此,透過本發明方法所取得物體之色奢 大小及/或位置等資訊,除可制於交通號魏ί 用於其他用途或領域上。 Ο 〇 另外’圖八所顯示之樹上紅花統計圖5〇2”里” ^ ^分析顯示’該物體不具有良好定義的線性或曲 線’因為第-與第二級分析之ΙΤ2值报小:Degree 2: -102241 +109x - 0.02887x^2, a=〇.〇5, p<0.0001, p<0.0001, p=0.000l, R-2 = 0.99 ' The result shows that the line 5013 may be round or oval And polynomial regression analysis, the second-level term is statistically significant at a=〇.〇5; > Therefore, the combined result of the line 5〇1 Bu 5〇12, 5〇13 shows that Figure 8 shows The long bar graph 501 is likely to be a circular or oval shape of the annular body. Step 6: After knowing the shape of the object, determine the size of the object, as shown in Figure 8, the red outer ring in the X-axis direction. The minimum specific points A' and D of the long bar graph 5〇1 are 1112 lines and 2040 lines, and the maximum specific points B' and C' are 710 pixels; and the minimum specific point E' of the x-axis direction shown in FIG. H' is 1〇99 lines and 2078 lines, the maximum specific point F is 66〇 pixels, G is 650 pixels, and the correct size of the object can be calculated through the above conditions; in addition, the correct size of the object It can be determined by the camera and image characteristics and the position of the camera in the actual environment, and will not be described here. /Step 7. After the dare, the position of the nose object, please also refer to Figure 8 and Figure 9, through the X-axis strip chart and the Y-axis strip chart 21 201040847 The minimum specific point a, d,, e, h, ^ u , the judgment of the position of the entering object, the ton w breaking mode is not repeated with the above figure 4 and Figure 5; (4) The way of the road is the same 'this and repeating the shape, and may be multi-step Step 8: Re-take the other image identifiable features Step 2 7 to re-identify the image. Therefore, information such as the size and/or position of the object obtained by the method of the present invention can be used for other uses or fields in the traffic number Wei. Ο 〇 In addition, the safflower chart on the tree shown in Figure 8 is in the 〇2"" ^ ^ analysis shows that the object does not have a well-defined linear or curved line because the ΙΤ2 value of the first- and second-level analysis is small:

Degree 1: _1653 + Q 5543x,a=()()5 〇 p<0.0001, R^2 = 0.48Degree 1: _1653 + Q 5543x, a=()()5 〇 p<0.0001, R^2 = 0.48

Degree 2: -155477 + l〇2x - 0.01672x^2, a=0.05, P<0.0001, p<〇.〇〇〇is p=〇>0〇olj R^2 = 0.66 因此,該物體明顯非為圓形或線性交通號誌物體。 上述利用多項式回歸分析影像形狀時,並不會受到 雜訊存在影響’如圖九所示,若針對γ轴長條統計圖所 顯示之線條5〇14進行多項式回歸分析結果如下:Degree 2: -155477 + l〇2x - 0.01672x^2, a=0.05, P<0.0001, p<〇.〇〇〇is p=〇>0〇olj R^2 = 0.66 Therefore, the object is obviously not A circular or linear traffic number object. When using the polynomial regression to analyze the image shape, it is not affected by noise. As shown in Figure 9, the polynomial regression analysis results for the line 5〇14 displayed on the γ-axis strip chart are as follows:

Degree 1: 393 _ 〇.〇2945x, a=0.05, p<0.0001, p=0.0370, R^2 = 0.01Degree 1: 393 _ 〇.〇2945x, a=0.05, p<0.0001, p=0.0370, R^2 = 0.01

Degree 2: 5350 - 6.353x + 〇.〇〇1987xa2,a=0.〇5, p<0.0001,p<〇.〇〇〇l,p<〇 〇〇〇1,RA2 = 〇 9 結果顯示長條統計圖501之線條5014可能為圓形或 卵形(非為線性)’且多項式回歸分析結果顯示第二級項 在a=0.05時為統計上顯著,此與圖八X轴長條統計圖之 22 201040847 2 5012相似,故Υ軸長條統計圖之形狀辨識不會受 到樹上紅花之雜訊影響。 & 本發㈣糾之在f彡像巾辨識物㈣料 習用技術相互比較時,更具備下列優點: 、/、 小特大轴或γ軸長條統計圖上取得之最 特疋點及m點’配合多項式回歸分析 判定物體形狀、大小及位置之在影像中辨識物Degree 2: 5350 - 6.353x + 〇.〇〇1987xa2,a=0.〇5, p<0.0001,p<〇.〇〇〇l,p<〇〇〇〇1,RA2 = 〇9 result shows strip The line 5014 of the statistical graph 501 may be circular or oval (not linear)' and the result of the polynomial regression analysis shows that the second level term is statistically significant at a=0.05, which is the same as the graph of the X-axis strip chart of FIG. 22 201040847 2 5012 is similar, so the shape identification of the long axis chart is not affected by the noise of the red flower on the tree. & This (4) Correction has the following advantages when comparing the techniques of the f彡icon identification (4) materials: /, / The most special points and m points obtained on the small extra large axis or γ axis long bar chart 'With polynomial regression analysis to determine the shape, size and position of the object in the image

Ο 2.本發明係可擷取不同的可辨識影像特徵Ο 2. The invention can capture different identifiable image features

…〜令付做,如 RGB 色彩特徵錢階影像特徵或視頻影像特徵,並 確對該操取影像進行形狀、大小及位置的辨識,使其可 應用於交通號諸或其他領域上。 、 上列詳細㈣係針對本發明之—可行實施例之具體 說明,惟該實施例並非用以限制本發明之專利範圍,、凡 未脫離本發明技藝精神所為之等效實施或變更, 含於本案之專利範圍中。 似匕 综上料,本㈣但在技術思想上確屬㈣,並能 較習用物品增進上述多項功效’應已充分符合新穎性及 進步性之法定發明專利要件,爰依法提出申請,懇請貴 局核准本件發明專利申請案,以勵發明,至感抨僮月貝 【圖式簡單說明】 " 圖一為本發明在影像中辨識物體的方法之步驟圖; 圖二為本發明在影像中辨識物體的方法之第一實施 之五種物體色彩圖; 圖三為抓取圖二中可辨識影像特徵示意圖; 圖四為計具圖三中每一行彩色像素所得之X軸彩長 23 201040847 條統計圖; γ軸彩長 圖五為計算圖三中每一列彩色像素所得之 條統計圖; 圖六為本發明在影像中辨識物體的方法之第二實施 彩色交通號諸圖; 圖七為抓取圖六中可辨識影像特徵示意圖; 圖八為計算圖六中每-行彩色像素所得之χ轴彩長 條統計圖;以及... to make a payment, such as RGB color feature money image features or video image features, and to identify the shape, size and position of the captured image, so that it can be applied to traffic numbers or other fields. The detailed description of the above is a detailed description of the possible embodiments of the present invention, which is not intended to limit the scope of the invention, and the equivalents or modifications thereof The patent scope of this case. It seems that the above-mentioned materials (4) are technically (4), and can improve the above-mentioned multiple functions compared with the conventional articles. 'The statutory invention patents that should fully comply with the novelty and the progressiveness, and apply in accordance with the law, please ask your office. Approving the invention patent application for this invention, in order to invent the invention, to the sensation of the child, the moon [a simple description] " Figure 1 is a step diagram of the method for identifying an object in the image; Figure 2 is the identification of the image in the present invention The color of the five objects in the first implementation of the method of the object; FIG. 3 is a schematic diagram of the identifiable image features in the second drawing; FIG. 4 is the X-axis color length obtained from the color pixels in each row of the figure III. Figure γ-axis color length Figure 5 is a bar chart obtained by calculating each column of color pixels in Figure 3. Figure 6 is a second embodiment of the color traffic number of the method for identifying objects in the image; Figure 7 is a screenshot Figure 6 is a schematic diagram of the identifiable image features; Figure 8 is a statistical diagram of the χ-axis color strip obtained by calculating each color pixel in Figure 6;

Ο 圖九為計算圖六中每一列彩色像素所得之γ軸彩長 條統計圖。 【主要元件符號說明】 1紅色三角形 2紅色圓形 3紅色四方形 4綠色三角形 5藍色四方形 6交通標諸 61紅色外環 62白色内環 63黑色字體 71紅花 101三角長條統計圖 1011線條 1012線條 102三角長條統計圖 201曲線長條統計圖 24 201040847 2011線條 202曲線長條統計圖 3 01方形長條統計圖 3011線條 3012線條 3013線條 3 02方形長條統計圖 501外環長條統計圖 0 5011線條 5012線條 5013線條 5014線條 502樹木紅花長條統計圖 503灌木紅花長條統計圖 A、 C、D、F、G、J最小特定點 B、 E、H、I最大特定點 K、Μ、N、P、Q、T最小特定點 Q L、0、R、S最大特定點 A,、D,、E,、H,最小特定點 B,、C,、F,、G,最大特定點 25Ο Figure 9 is a γ-axis color bar graph obtained by calculating the color pixels of each column in Figure 6. [Main component symbol description] 1 red triangle 2 red circle 3 red square 4 green triangle 5 blue square 6 traffic standard 61 red outer ring 62 white inner ring 63 black font 71 safflower 101 triangle strip chart 1011 line 1012 line 102 triangle strip chart 201 curve strip chart 24 201040847 2011 line 202 curve strip chart 3 01 square strip chart 3011 line 3012 line 3013 line 3 02 square strip chart 501 outer ring strip statistics Figure 0 5011 line 5012 line 5013 line 5014 line 502 tree safflower strip chart 503 shrub safflower strip chart A, C, D, F, G, J minimum specific point B, E, H, I maximum specific point K, Μ, N, P, Q, T minimum specific point QL, 0, R, S maximum specific point A, D, E, H, minimum specific point B, C, F, G, maximum specific point 25

Claims (1)

201040847 七、申請專利範圍: 1. 一種在影像中辨識物體的方法 步驟η係先取得-數位影像資料要d ^驟2.再將數位影像可辨識之特徵操取出; ::::同時計算出步驟2所擷取影像之χ轴 長條統計圖; 赤驟4’取得X轴或γ輛長條統計圖後,再求出X轴 Ο Ο :γ軸長條統計圖之最小特定點及最大特定點; 妙驟5, n透過多項式回歸分析方式,根據可辨識物 體之特徵判斷出物體形狀。 2.如申請專利範㈣1類述之在料巾辨識物體的方 =,更包括一步驟6,該步驟6係再擷取步驟ι中的 同一影像但不同可辨識特徵之影像資料,並重複步驟 2至步驟5之處理流程。 3·如申請專利範圍第1項所述之在影像中辨識物體的方 法,其中該步驟1所取得之數位影像資料,係藉由數 位影像擷取裝置對物體進行拍攝取得。 4·如申請專利範圍第3項所述之在影像中辨識物體的方 法,其中該數位影像擷取裝置係為照相機或攝影機。 5. 如申請專利範圍第1項所述之在影像中辨識物體的方 法,其中該步驟1所取得之數位影像資料係藉由類比 影像擷取裝置對物體拍攝取得,再將該類比信號轉換 成數位信號,以取得一數位影像資料。 6. 如申請專利範圍第5項所述之在影像中辨識物體的方 法’其中該類比影像擷取裝置可為照相機或攝影機或 其他可擷取影像之裝置。 26 201040847 7.ηί利範圍第1項所述之在影像中辨識物體的方 翁-肖步驟2所操取數位影像可辨識特徵為單一 顏色之RGB或IHS色彩可辨識特徵。 法申if利範圍第1項所述之在影像中辨識物體的方 階可辨2所操取數位影像可辨識特徵係為灰 9.m利範圍第1項所述之在影像中辨識物體的方 Ο 頻光㈣ί步驟2所㈣數位影像可辨識特徵係為視 頻先谱頻率可辨識特徵。 10二ϋ利範圍帛1項所述之在影像中辨識物體的 方1 2步驟3之χ轴或γ抽長條统計圖的計算 的i去影像可辨識特徵之每-行或每一列上 的像素。 11.如申請專利範圍第 ,Α Φ ^ 項所述之在影像中辨識物體的 或Υ轴i條=驟4之最小特定點之取得,係在χ軸 長條統相上,以線性搜尋模式中被求出之零 Ο 12·Γ=範圍第1項所述之在影像中辨識物體的方 步驟4之最大特定點之取得,係在X軸ΐ 值點像素、/圖上’以線性搜尋模式中被求出之最大 13:申項所述之在影像中辨識物體的方 位置::純註記與2得之最小特定點及最大特定點 14·如申請專利範圍第1項#+ 法,其中該步驟5之= = 辨識物體的方 項式回歸y刀析方式,係根據係 27 201040847 將X軸或Y軸長條統計圖上之線條進行分析,當 之最小特定點至最大特定點間係呈線性走向,即可判 定該待辨識物體應為三角形。 •如申請專利範圍第13項所述之在影像中辨識物體的 方法,其中該線條之最小特定點至最大特定點間係呈 直線走向,即可判定該待辨識物體應為四方形或矩形。 跡申請專利第13項所述之在影像中辨識物體的 方法’其中該線條之最小特定點至最大特定點間係符 〇 纟二次方程式,即可判定該待辨識物體應為圓形或橢 圓形。 17. —種在景> 像中辨識物體的方法,主要包括: 步驟1 :係先取得一數位影像資料; 步驟2.再將數位影像可辨識之特徵擷取出; 步驟3··同時計算出步驟2所擷取影像之χ軸或γ軸 長條統計圓; 步驟4:取得X軸或γ轴長條統計圖後,再求出乂轴 或Υ軸長條統計圖之最小特定點及最大特定點; ❹ 纟驟5.再透過多項式回歸分析方式,根據可辨識物 體之特徵判斷出物體形狀; 步驟6:再依照最小特定點及最大特定點的位置判定 出物體的大小尺寸。 18. 如申吻專利範圍第17項所述之在影像中辨識物體的 方法,更包括一步驟7,該步驟7係再擷取步驟i中 的同一影像但不同可辨識特徵之影像資料,並重複步 驟2至步驟6之處理流程。 19. 如申請專利範圍第17項所述之在影像中辨識物體的 28 201040847 ζ法,其中該步驟1所取得之數位影像資料,係藉由 數位影像擷取裝置對物體進行拍攝取得。 〇.=申凊專㈣圍第19項所述之在影像中辨識物體的 法#中該數位影像擷取裝置係、為照相機或攝影 機0 21. =申晴專利範圍第17項所述之在影像中辨識物體的 Ο Ο 法’其中該步驟1所取得之數位影像資料係藉由類 〜像操取裝置對物體拍攝取得,再將該類比信號轉 換成數位信號,以取得一數位影像資料。 22. ^申睛專利範圍第21項所述之在影像中辨識物體的 :,其中該類比影像擷取裝置可為照相機或攝影機 或其他可掏取影像之裝置。 认如申請專利範圍第17項所述之在影像中辨識物體的 方法,其中該步驟2所操取數位影像可辨識特徵為單 一顏色之RGB或IHS色彩可辨識特徵。 申叫專利範®第17項所述之在景彡像巾辨識物體的 =法’其中該步驟2所擷取數位影像可辨識特徵係為 灰階可辨識特徵。 25.如申請專利範圍第17項所述之在影像中辨識物體的 方法,其中該步驟2所操取數位影像可辨識特徵係為 視頻光譜頻率可辨識特徵。 如申明專利範圍第17項所述之在影像中辨識物體的 方法’其中該㈣3之X轴或γ軸長條統計圖的計 算方式,係計算該影像可辨識特徵之每一行 上的像素。 / A如申請專利範圍第17項所述之在影像中辨識物體的 29 201040847 _ Λ 4之最小特定點之取得,係在χ 之零點像素長條統計圓上,以線性搜尋模式中被求出 28·=請Γ範圍第17項所述之在影像中辨識物體的 站赤中4步驟4之最大特定點之取得,係在X 二純條統計圖上,以線性搜尋模式中被求出 之豉大值點像素。201040847 VII. Patent application scope: 1. A method for recognizing an object in an image. The step η is first obtained - the digital image data is required to be d ^ 2. The digital image can be recognized by the function; :::: simultaneously calculated Step 2: Take the image of the long axis of the image; after obtaining the X-axis or γ-length bar graph in red 4', find the X-axis Ο Ο: the minimum specific point and maximum of the γ-axis long bar chart The specific point; Wonder 5, n through the polynomial regression analysis method, according to the characteristics of the identifiable object to determine the shape of the object. 2. If the application of the patent specification (4) is described in the category of the towel identification object, it further includes a step 6, which is to capture the image data of the same image but different identifiable features in step ι, and repeat the steps. Process flow from 2 to 5. 3. The method for recognizing an object in an image as described in the first aspect of the patent application, wherein the digital image data obtained in the step 1 is obtained by photographing the object by a digital image capturing device. 4. A method of recognizing an object in an image as described in claim 3, wherein the digital image capturing device is a camera or a camera. 5. The method for recognizing an object in an image according to the first aspect of the patent application, wherein the digital image data obtained in the step 1 is obtained by capturing an object by an analog image capturing device, and converting the analog signal into Digital signal to obtain a digital image. 6. The method of recognizing an object in an image as described in claim 5, wherein the analog image capturing device can be a camera or a camera or other device capable of capturing images. 26 201040847 7. The range of objects identified in the image as described in item 1 of the η 利 范围 肖 肖 步骤 Step 2 The digital image captured in step 2 can be identified as a single color RGB or IHS color identifiable feature. The method of recognizing an object in the image as described in the first item of the Fashen range is identifiable. 2 The digital image is identifiable. The identifiable feature is ash 9.m. The range is identified in the image. Ο Ο 频 四 (4) 步骤 Step 2 (4) Digital image identifiable features are identifiable features of the video pre-spectrum frequency. 10 ϋ 帛 帛 在 在 在 在 在 在 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识 辨识Pixel. 11. If the scope of the patent application is specified, the minimum specific point of the object identified in the image or the axis i of the axis = the fourth step is obtained in the linear search mode. The obtained zero Ο 12·Γ=range The first specific point of the step 4 of recognizing the object in the image as described in the first item is obtained by linear search on the X-axis 点 pixel, / on the graph The maximum value obtained in the model 13: the square position of the object identified in the image as stated in the application: the minimum specific point and the maximum specific point of the pure note and 2 are as follows: Where the step 5 = = the equation of the object is determined by the regression method, and the line on the X-axis or Y-axis strip chart is analyzed according to the system 27 201040847, from the minimum specific point to the maximum specific point The system is linear, and it can be determined that the object to be identified should be a triangle. • A method for recognizing an object in an image as described in claim 13 wherein the line from the smallest specific point to the maximum specific point is linear, and the object to be identified should be square or rectangular. It can be determined that the object to be recognized should be a circle or an ellipse in the method of recognizing an object in the image described in the thirteenth application patent, wherein the minimum specific point of the line to the maximum specific point is a quadratic equation. shape. 17. The method for recognizing an object in the scene includes: Step 1: acquiring a digital image first; Step 2. extracting the identifiable feature of the digital image; Step 3·· simultaneously calculating Step 2: Take the x-axis or γ-axis long circle of the image; Step 4: Obtain the X-axis or γ-axis bar graph, and then find the minimum specific point and maximum of the x-axis or x-axis bar graph. Specific point; ❹ Step 5. Then through the polynomial regression analysis method, determine the shape of the object according to the characteristics of the identifiable object; Step 6: Determine the size and size of the object according to the minimum specific point and the position of the largest specific point. 18. The method for recognizing an object in an image as described in claim 17 of the patent application, further comprising a step 7 of extracting image data of the same image but different identifiable features in step i, and Repeat the process flow from step 2 to step 6. 19. The method of claim 2010, wherein the digital image obtained by the step 1 is obtained by photographing the object by a digital image capturing device. 〇.=申凊专(四), in the method of identifying objects in the image as described in Item 19 of the 19th, the digital image capturing device is a camera or a camera. 21. = Shen Qing Patent Range No. 17 The Ο Ο method for recognizing an object in the image is obtained by capturing the object image by the class-like operation device, and converting the analog signal into a digital signal to obtain a digital image data. 22. The object of recognizing an object in an image as described in claim 21 of the claim is wherein the analog image capturing device can be a camera or a camera or other device capable of capturing images. A method for recognizing an object in an image as described in claim 17 wherein the digital image captured in step 2 is identifiable as a single color RGB or IHS color identifiable feature. The method of claiming the patent image is as follows: in the image method, the image recognition object of the image is the grayscale identifiable feature. 25. The method of identifying an object in an image as described in claim 17, wherein the digital image recognizable feature of the step 2 is a video spectral frequency identifiable feature. The method of recognizing an object in an image as described in claim 17 of the patent scope, wherein the calculation of the X-axis or γ-axis bar graph of the (4) 3 is to calculate a pixel on each line of the image recognizable feature. /A The acquisition of the minimum specific point of the 2010 201084847 _ Λ 4 of the object identified in the image as described in claim 17 is obtained in the linear search mode on the zero-pixel stripe statistical circle of χ 28·=Please take the maximum specific point of the 4th step 4 of the station red in the image as described in item 17 of the scope, which is obtained in the linear search mode on the X-two pure bar chart.豉 Large value points pixels. Ο 29·=\專利範圍第17項所述之在影像中辨識物體的 法,其巾該㈣4所取得之最小特定點及最大特定 點位置必須被註記與記錄。 3〇.=請專利範圍第Μ所述之在影像中辨識物體的 其中該步驟5之多項式回歸分析方式,係根據 2將X軸或γ軸長條統計圖上之線條進行分析,當 線條之最小特定點至最大特定點間係呈線性走向,即 可判定該待辨識物體應為三角形。 31.^申請專利範圍第29項所述之在影像中辨識物體的 法’其中該線條之最小特定點至最大特定點間係呈 直線走向,即可判定該待辨識物體應為四方形或矩 形〇 申叫專利範圍第29項所述之在影像中辨識物體的 2法’其中該線條之最小特定點至最大特定點間係符 =次方程式’即可判定該待辨識物體應為圓形或橢 圓形。 33·~種在影像中辨識物體的方法,主要包括: V驟1 .係先取得一數位影像資料; 步驟2 :再將數位影像可辨識之特徵擷取出; 30 201040847 步驟3 :同時計算出步驟2所擷取影像之χ軸或γ 軸長條統計圖; 步驟4 :取得X轴或Υ轴長條統計圖後,再求出χ 軸或Υ轴長條統計圖之最小特定點及最大特定點; 步驟5:再透過多項式回歸分析方式,根據可辨識物 體之特徵判斷出物體形狀; 步驟6:再依照最小特定點及最大特定點的位置判定 出物體的大小尺寸; Ο 34 35. Ο 36. 37. 步驟7 .再根據最最小特定點及最大特定點的位置判 定出物體的正確位置。 ‘如申請專㈣圍第33項所述之在料巾辨識物體的 方法,更包括一步驟8,該步驟8係再擁取步驟i中 的同一影像但不同可辨識特徵之影像資料,並重 驟2至步驟7之處理流程。 ^申請專利範圍第33項所述之在影像中辨識物體的 數2/、中該步驟1所取得之數位影像資料,係藉由 數位f像擷取裝置對物體進行拍攝取得。 利範圍第35項所述之在影像中辨識物體的 機。’、中該數位影像操取裝置係為照相機或攝影 方法利㈣第33項所述之在影像中辨識物體的 比會像H步驟1所取得之數位影像f料係藉由類 換成置對物體拍攝取得’再將該類比信號轉 如申L 以取得-數位影像資料。 方範圍第37項所述之在影像中辨識物體的 、類比影像擷取裝置可為照相機或攝影機 31 38. 201040847 或其他可擷取影像之裝置。 39. 如申請專利範圍第33項所述之在影像中辨識物體的 方法,其中該步驟2所擷取數位影像可辨識特徵為單 一顏色之RGB或IHS色彩可辨識特徵。 40. 如申請專利範圍第33項所述之在影像中辨識物體的 方法,其中該步驟2所擷取數位影像可辨識特徵係為 灰階可辨識特徵。 41. 如申請專利範圍第33項所述之在影像中辨識物體的 方法,其中該步驟2所擷取數位影像可辨識特徵係為 視頻光譜頻率可辨識特徵。 42. 如申請專利範圍第33項所述之在影像中辨識物體的 :法,其中該步驟3之X軸或γ軸長條統計圖的計 异方式,係計算該影像可辨識特徵之每一行或每一列 上的像素。The method of recognizing an object in an image as described in Item 17 of the patent scope, the smallest specific point and the maximum specific point position obtained by the towel (4) 4 must be noted and recorded. 3〇.=Please refer to the paradigm in the patent scope to identify the object in the image. The polynomial regression analysis method of step 5 is based on 2 to analyze the lines on the X-axis or γ-axis strip chart, when the line From the minimum specific point to the maximum specific point, the linear trend is obtained, and it can be determined that the object to be recognized should be a triangle. 31. The method for recognizing an object in an image as described in claim 29, wherein the minimum specific point to the maximum specific point of the line is a straight line, and the object to be identified should be square or rectangular. It is determined that the object to be identified should be circular or by the method of identifying the object in the image as described in item 29 of the patent scope, wherein the minimum specific point to the maximum specific point of the line = the sub-equation Oval. 33·~ The method of identifying objects in the image mainly includes: V1 is to obtain a digital image first; Step 2: to extract the identifiable features of the digital image; 30 201040847 Step 3: Calculate the steps at the same time 2 The axis or γ-axis bar graph of the captured image; Step 4: After obtaining the X-axis or Υ-axis bar graph, the minimum specific point and maximum specificity of the χ-axis or Υ-axis bar graph are obtained. Point 5: Determine the shape of the object according to the characteristics of the identifiable object through the polynomial regression analysis method. Step 6: Determine the size of the object according to the minimum specific point and the position of the largest specific point; Ο 34 35. Ο 36 37. Step 7. Determine the correct position of the object based on the position of the smallest specific point and the maximum specific point. The method for identifying an object in the towel as described in Item 33 of the application (4) further includes a step 8 of re-acquiring the image data of the same image but different identifiable features in step i, and repeating Process flow from 2 to 7. ^ The number of image recognition objects in the image as described in item 33 of the patent application scope, and the digital image data obtained in the step 1 are obtained by photographing the object by the digital f image capturing device. A machine for identifying an object in an image as described in item 35 of the benefit range. ', the digital image manipulation device is the camera or the photography method (4), the ratio of identifying the object in the image as described in item 33, the digital image f obtained in step H is replaced by the class The object is captured and the analog signal is converted to obtain the digital image data. The analog image capturing device for recognizing an object in an image as described in item 37 of the scope can be a camera or a camera 31 38. 201040847 or other device capable of capturing images. 39. The method of identifying an object in an image as described in claim 33, wherein the digital image captured in step 2 is identifiable as a single color RGB or IHS color identifiable feature. 40. The method for recognizing an object in an image as described in claim 33, wherein the digital image recognizable feature captured in the step 2 is a gray-scale identifiable feature. 41. The method for identifying an object in an image as described in claim 33, wherein the digital image identifiable feature captured in the step 2 is a video spectral frequency identifiable feature. 42. The method for recognizing an object in an image as described in claim 33, wherein the step of calculating the X-axis or γ-axis strip chart of the step 3 is to calculate each line of the image recognizable feature. Or pixels on each column. 之最大值點像素。 45.如申請專利範圍第% 方法’其中該步驟4所 點位置必須被註記與記錄 46.如申請專利範圍第33 33項所述之在影像中辨識物體的 所取得之最小特定點及最大特定 33項所述之在影像中辨識物體的 32 201040847 方法,其中該步驟5之多項式回歸分柄大a ^ v 4 啊万式,係根據 係將X袖或Y轴長條統計圖上之線條進行八析 ^ 線條之最小特定點至最大特定點間係呈線性走向,^ 可判定該待辨識物體應為三角形。 47.如申請專利範圍第45項所述之在影像中辨識物體的 方法,其中該線條之最小特定點至最大特定點間係呈 直線走向,即可判定該待辨識物體應為四方形或矩 形。The maximum point pixel. 45. If the patent application scope % method 'where the position of the step 4 must be noted and recorded 46. The minimum specific point and maximum specificity of identifying the object in the image as described in claim 33, paragraph 33 The 32 201040847 method for recognizing an object in an image, wherein the polynomial regression of the step 5 is a large a ^ v 4 ah, according to the line on the X-sleeve or Y-axis strip chart. Eight analysis ^ The minimum specific point of the line to the maximum specific point is linear, ^ can be determined that the object to be identified should be a triangle. 47. The method for recognizing an object in an image according to claim 45, wherein a line from a minimum specific point to a maximum specific point of the line is determined to be a square or a rectangle. . 48·如申請專利範圍第45項所述之在影像中辨識物體的 方法,其中該線條之最小特定點至最大特定點間係符 合—-Λ> -fc- 一\万程式,即可判定該待辨識物體應為圓形或橢 圓形。48. The method for recognizing an object in an image as described in claim 45, wherein the minimum specific point to the maximum specific point of the line conforms to -Λ>-fc- The object to be identified should be circular or elliptical. 3333
TW98116102A 2009-05-15 2009-05-15 Method of recognizing objects in images TWI417796B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW98116102A TWI417796B (en) 2009-05-15 2009-05-15 Method of recognizing objects in images

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW98116102A TWI417796B (en) 2009-05-15 2009-05-15 Method of recognizing objects in images

Publications (2)

Publication Number Publication Date
TW201040847A true TW201040847A (en) 2010-11-16
TWI417796B TWI417796B (en) 2013-12-01

Family

ID=44996112

Family Applications (1)

Application Number Title Priority Date Filing Date
TW98116102A TWI417796B (en) 2009-05-15 2009-05-15 Method of recognizing objects in images

Country Status (1)

Country Link
TW (1) TWI417796B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI489395B (en) * 2011-11-28 2015-06-21 Ind Tech Res Inst Apparatus and method for foreground detection
TWI718442B (en) * 2018-11-21 2021-02-11 晶睿通訊股份有限公司 Convolutional neutral networks identification efficiency increasing method and related convolutional neutral networks identification efficiency increasing device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI489395B (en) * 2011-11-28 2015-06-21 Ind Tech Res Inst Apparatus and method for foreground detection
TWI718442B (en) * 2018-11-21 2021-02-11 晶睿通訊股份有限公司 Convolutional neutral networks identification efficiency increasing method and related convolutional neutral networks identification efficiency increasing device

Also Published As

Publication number Publication date
TWI417796B (en) 2013-12-01

Similar Documents

Publication Publication Date Title
US11922615B2 (en) Information processing device, information processing method, and storage medium
JP7113657B2 (en) Information processing device, information processing method, and program
US8340420B2 (en) Method for recognizing objects in images
CN105144239B (en) Image processing apparatus, image processing method
Sirmacek et al. Urban-area and building detection using SIFT keypoints and graph theory
Pun et al. Multi-scale noise estimation for image splicing forgery detection
Li et al. An overlapping-free leaf segmentation method for plant point clouds
CN113313149A (en) Dish identification method based on attention mechanism and metric learning
CN106169080A (en) A kind of combustion gas index automatic identifying method based on image
CN116740758A (en) A bird image recognition method and system to prevent misjudgment
CN103186790A (en) Object detecting system and object detecting method
Abujayyab et al. Integrating object-based and pixel-based segmentation for building footprint extraction from satellite images
WO2020022329A1 (en) Object detection/recognition device, method, and program
CN118968362A (en) Small target detection method, system and storage medium based on drone images
CN113505784A (en) Automatic nail annotation analysis method and device, electronic equipment and storage medium
CN107704864B (en) Salient object detection method based on image object semantic detection
JP7057324B2 (en) Recognition device, program and construction device
Liu et al. Automatic grape bunch detection in vineyards for precise yield estimation
CN114373144B (en) Automatic identification method for circular identification points in high-speed video
CN120707568A (en) Intelligent detection method and system for cover film defects based on multi-feature fusion
CN114863464A (en) Second-order identification method for PID drawing picture information
TW201040847A (en) Method of identifying objects in image
KR100606404B1 (en) Color Code Image Detection Method and Apparatus
CN118918503B (en) Comprehensive safety supervision system and method for postal service
CN121280797A (en) Tobacco standard detection method based on improved YOLOv network

Legal Events

Date Code Title Description
MM4A Annulment or lapse of patent due to non-payment of fees