TWI444923B

TWI444923B - Automatic pseudo-object-based image region of interest detection method and system

Info

Publication number: TWI444923B
Application number: TW99100144A
Authority: TW
Original assignee: Chunghwa Telecom Co Ltd
Priority date: 2010-01-06
Filing date: 2010-01-06
Publication date: 2014-07-11
Also published as: TW201124945A

Description

Method and system for detecting representative area of automatic analog object image

本發明係關於一種自動擬似物件式圖像代表性區域偵測方法與系統，特別係一種利用特徵點產生擬似物件的方法，進行自動偵測圖像之代表性區域的方法與系統。The invention relates to a method and a system for detecting a representative region of an automatic analog object image, in particular to a method and system for automatically detecting a representative region of an image by using a feature point to generate a pseudo object.

因近年來數位相機及網際網路的普及，使得數位圖像數量成指數增加，在應用層面上，對於圖像內容的分析愈來愈顯示其重要性；當前對於圖像物件偵測的研究中，大多藉由圖像的低階資訊，如利用色彩變化特性，偵測圖像中的邊緣或邊界，進行圖像分割產生區塊，再對區塊抽取視覺特徵並用以代表該區塊。然而，由於圖像本身是屬於2D平面，對於各區塊而言，如何判斷哪些區塊屬於圖像中的同一物件便是一個很困難的問題，且用此種方式常常會產生很多破碎、不具意義的小區塊，有些研究試著在抽取區塊特徵時取出該區塊的紋理特徵，再將紋理相似且位置相近的區塊組成物件，但此種方式仍有許多缺點，像是同一物件在不同部位上的紋理可能會非常不相似，或是同一種物件會因為圖像拍攝時，拍攝角度與燈光明亮度不同而導致差異非常大，當應用於物件偵測時，效果往往不盡理想。Due to the popularity of digital cameras and the Internet in recent years, the number of digital images has increased exponentially. At the application level, the analysis of image content has become more and more important; the current research on image object detection Most of the image is segmented to generate a block by using low-level information of the image, such as using color change characteristics to detect edges or boundaries in the image, and then extracting visual features for the block and representing the block. However, since the image itself belongs to the 2D plane, it is a very difficult problem for each block to determine which blocks belong to the same object in the image, and in this way, often many broken and not In the meaning of the block, some studies try to extract the texture features of the block when extracting the features of the block, and then form the blocks with similar textures and similar positions, but there are still many disadvantages in this way, such as the same object. The textures on different parts may be very dissimilar, or the same kind of object will be very different due to the difference in shooting angle and brightness when shooting images. When applied to object detection, the effect is often not ideal.

另有一些研究讓使用者先框選出可能的物件區域，再對該區域像素點進行侵蝕、擴散或是利用邊緣資訊來產生圖像區塊，進而在該區域中組成物件並抽取特徵以作進一步使用，然而此種方式需使用者介入，在使用上比較沒有那麼方便。Other studies allow the user to first select the possible object areas, then etch or diffuse the pixels in the area or use the edge information to generate image blocks, and then form objects in the area and extract features for further Use, however, this method requires user intervention and is not so convenient in use.

由此可見，上述習用方式仍有諸多不足，實非一良善之設計，而亟待加以改良。It can be seen that there are still many shortcomings in the above-mentioned methods of use. It is not a good design and needs to be improved.

本案發明人鑑於上述習用方式所衍生的各項缺點，乃亟思加以改良創新，並經多年苦心孤詣潛心研究後，終於成功研發完成本件一種自動擬似物件式圖像代表性區域偵測方法與系統。In view of the shortcomings derived from the above-mentioned conventional methods, the inventor of the present invention has improved and innovated, and after years of painstaking research, he finally succeeded in research and development of an automatic pseudo-object-like image representative area detection method and system.

本發明之目的即在於提供一種對於圖像可快速且自動產生擬似物件與偵測代表性或感興趣區域的方法及系統，可使用在網際網路上大量圖像資料的影像檢索與搜尋、影像壓縮、影像縮圖、影像標註等應用。The object of the present invention is to provide a method and a system for quickly and automatically generating a pseudo object and detecting a representative or region of interest for an image, and can use image retrieval and searching and image compression of a large amount of image data on the Internet. , image thumbnails, image annotation and other applications.

達成上述發明目的之一種自動擬似物件式圖像代表性區域偵測方法與系統，係利用分析圖片中訊號特性而得的特徵點，將其根據平面座標關係分群而自動估計出擬似物件的數量、位置與範圍，並藉由進一步探勘擬似物件的資訊而發展出自動偵測全圖中最具代表性區域(或稱為感興趣區域)的系統，該系統可依據擬似物件之位置、大小以及涵蓋特徵點數量等資訊，自動偵測出圖像中最具代表性的區域；若是結合存在既有資料庫中其他圖像擬似物件之視覺文字頻率(Term Frequency)與反向文件頻率(Inverse Document Frequency)等統計特性，更可以偵測出有別於存在既有資料庫中的代表性物件或區域。An automatic pseudo-object-like image representative region detecting method and system for achieving the above object aims to automatically estimate the number of pseudo-objects by grouping the feature points obtained by analyzing the signal characteristics in the image according to the plane coordinate relationship. Position and scope, and by further exploring the information of the simulated object, develop a system that automatically detects the most representative area (or area of interest) in the full picture, which can be based on the location, size, and coverage of the simulated object. Information such as the number of feature points automatically detects the most representative area of the image; if combined with the presence of other image mimic objects in the existing database, the term text frequency (Term Frequency) and reverse file frequency (Inverse Document Frequency) And other statistical characteristics, it can detect different representative objects or regions that exist in the existing database.

本發明主要係以區域性視覺特徵點作為擬似物件生成的重要依據，首先以黑塞仿射(Hessian-Affine)或最穩定極值區域(Maximally Stable Extremal Regions，MSER)等特徵點偵測方式來取得特徵點座標，並以上述特徵點座標進行分群，產生擬似物件的資訊，來代表一張圖片中多個物件；其中，分群方法則可使用如G平均(G-means)分群演算法，其係以K平均(K-means)為基礎，加入以高斯(Gaussian)分佈檢測自動判定適當群數的機制，或是利用高斯混合模型(Gaussian mixture model)搭配貝氏訊息準則(Bayesian information criterion)亦可達成自動化分群並決定出群數。The invention mainly uses regional visual feature points as an important basis for the generation of pseudo-objects, firstly using feature point detection methods such as Hessian-Affine or Maximally Stable Extremal Regions (MSER). The feature point coordinates are obtained, and the feature point coordinates are used to group, and the information of the pseudo object is generated to represent multiple objects in a picture; wherein the grouping method can use a G-means grouping algorithm, Based on K-means, a mechanism for automatically determining the appropriate number of groups by Gaussian distribution detection, or a Gaussian mixture model with Bayesian information criterion is also used. Automated grouping can be achieved and the number of groups can be determined.

上述產生擬似物件之方法，其步驟流程圖，請參考圖一所示，包含：步驟一：利用Hessian-Affine特徵點偵測方法，從圖像中選取出具特色(例如：紋理、邊界或色彩等)的區域性視覺特徵點；步驟二：利用加速強健特徵點(Speeded Up Robust Features，SURF)特徵點描述方法，根據步驟一所找出的區域性視覺特徵點，計算並記錄所有區域性視覺特徵點相對應的特徵參數以及其平面座標資訊；步驟三：根據步驟二所找出所有區域性視覺特徵點的平面座標資訊，利用高斯混合模型搭配貝氏訊息準則進行自動化分群，自動分成N群，其中N>=1，每一群內特徵點所涵蓋的區域即構成一個擬似物件；其中擬似物件的位置為每一群內特徵點的重心，而大小則用此群內特徵點的變異數來表示。The method for generating a pseudo-object, the flow chart of the steps, please refer to FIG. 1 , including: Step 1: Using the Hessian-Affine feature point detection method to select features from the image (for example: texture, border or color, etc.) Regional visual feature points; Step 2: Using the Speeded Up Robust Features (SURF) feature point description method, calculate and record all regional visual features based on the regional visual feature points found in Step 1. The corresponding feature parameters of the point and its plane coordinate information; Step 3: According to the plane coordinate information of all the regional visual feature points found in step 2, the Gaussian mixture model is matched with the Bayesian message criterion for automatic grouping, and is automatically divided into N groups. Where N>=1, the area covered by the feature points in each group constitutes a pseudo-object; wherein the position of the pseudo-object is the center of gravity of the feature points in each group, and the size is represented by the variation of the feature points in the group.

其中，當僅產生兩個擬似物件時，擬似物件位置較靠近整張圖像中心點座標者，可視為圖像之前景，另一則視為圖像之背景，進而達到自動區分圖像前景、背景之功能。Wherein, when only two pseudo-objects are generated, the position of the pseudo-object is closer to the coordinate point of the entire image, which can be regarded as the foreground of the image, and the other is regarded as the background of the image, thereby automatically distinguishing the foreground and background of the image. The function.

請參閱圖二所示，為本發明一種自動擬似物件式圖像代表性區域偵測方法之第一實施例示意圖：係以貝氏訊息準則估計出適當數量的多組高斯分佈建立混合模型，以此模型將特徵點作分群之圖例，首先嘗試以多組多個高斯分佈所建立的高斯混合模型來模擬特徵點分佈情形，例如分別以2個、3個、4個以及5個高斯分佈來模擬特徵點分佈，其後則以貝氏訊息準則評估每組模型的複雜度，並選擇複雜度最低之高斯混合模型作為最佳化分群數；選定高斯混合模型之後，圖片中所有特徵點可分別歸類至其最可能屬於之高斯分佈之中，最後將每一高斯分佈視為一群，便得到分群結果。Please refer to FIG. 2 , which is a schematic diagram of a first embodiment of a method for detecting a representative region of an automatic analog object image according to the present invention: estimating a suitable number of Gaussian distributions by using a Bayesian message criterion to establish a hybrid model, This model uses the feature points as a grouping legend. Firstly, the Gaussian mixture model established by multiple sets of Gaussian distributions is used to simulate the distribution of feature points, for example, by two, three, four and five Gaussian distributions. The feature point distribution is followed by the Bayesian message criterion to evaluate the complexity of each group of models, and the Gaussian mixture model with the lowest complexity is selected as the optimal grouping number; after selecting the Gaussian mixture model, all the feature points in the picture can be returned separately. From the class to the Gaussian distribution that it most likely belongs to, and finally treating each Gaussian distribution as a group, the clustering result is obtained.

在本實施例中，特徵點被分為4個群，分屬4個擬似物件201、202、203以及204，其範圍以橢圓表示之，將每個擬似物件範圍內的區域性視覺特徵點取出，此範例圖片在資料庫中便以4個擬似物件特徵代表之。In this embodiment, the feature points are divided into four groups, belonging to four pseudo objects 201, 202, 203, and 204, the range of which is represented by an ellipse, and the regional visual feature points within each pseudo object are taken out. This sample image is represented in the database by four pseudo-object features.

請參閱圖三所示，為本發明一種自動擬似物件式圖像代表性區域偵測方法之第二實施例示意圖：其中，擬似物件個數可預先設定分群個數，並將圖像分割為可重疊的N個固定形狀區域之圖例，其固定分割方式將圖片切割為若干格狀區域，而每一區域即視為一擬似物件。Please refer to FIG. 3 , which is a schematic diagram of a second embodiment of a method for detecting a representative region of an automatic analog object image according to the present invention. The number of pseudo-objects can be preset into groups and the image is segmented into The legend of the overlapping N fixed shape regions, the fixed segmentation method cuts the picture into a plurality of lattice regions, and each region is regarded as a pseudo object.

在本實施例中，係將圖片從水平和垂直方向各均分為2份，得到左上、右上、左下及右下4個均分區域，並於圖片正中央設定另一相等大小區域；因此，本實施例固定將圖像分成5個格狀擬似物件301、302、303、304以及305，故此範例圖片在資料庫中便以5個擬似物件特徵代表之。In this embodiment, the picture is divided into two parts from the horizontal and vertical directions, and four equalized areas of upper left, upper right, lower left, and lower right are obtained, and another equal-sized area is set in the center of the picture; therefore, In this embodiment, the image is divided into five grid-like pseudo-objects 301, 302, 303, 304, and 305. Therefore, the sample image is represented by five pseudo-object features in the database.

在實際系統中，擬似物件位置可以是每一群內特徵點的重心，或是涵蓋區域的中心點，擬似物件範圍可以圓形、橢圓形或任意多邊形等形狀表示，範圍大小判定也可以最大或最小座標值、或是座標平均值及標準差等不同方式來決定；此外，每張圖的全圖亦可視為單一擬似物件(即N=1)；另外，為了避免自動分群演算法將圖像分為過多而不具意義的小面積擬似物件，通常我們會設定一上限值，例如N<=5，來避免此狀況之發生。In an actual system, the position of the pseudo object can be the center of gravity of the feature points in each group, or the center point of the coverage area. The range of the pseudo object can be represented by a shape such as a circle, an ellipse or an arbitrary polygon, and the range size can also be determined to be the largest or the smallest. Coordinate values, or coordinate mean and standard deviation are determined in different ways; in addition, the full map of each graph can also be regarded as a single pseudo-object (ie, N=1); in addition, to avoid automatic clustering algorithm, the image is divided. For too many unimportant small-area pseudo-objects, we usually set an upper limit, such as N <= 5, to avoid this situation.

請參閱圖四所示，為本發明一種自動擬似物件式圖像代表性區域偵測系統之架構圖，包含：一擬似物件產生模組11，係利用如圖一所述自動擬似物件式圖像代表性區域偵測方法之步驟流程，將輸入之圖像產生擬似物件及其相關特徵資訊，並記錄在資料庫13中。Please refer to FIG. 4, which is an architectural diagram of an automatic analog object image representative area detecting system according to the present invention, comprising: a pseudo object generating module 11 for automatically simulating an object image as shown in FIG. The step flow of the representative area detecting method generates the pseudo-object and related feature information of the input image and records it in the database 13.

一代表性區域偵測模組12，其係利用儲存於資料庫13中擬似物件的位置、大小、涵蓋特徵點數量或反轉檔案關聯索引等資訊，自動偵測出圖像中最具代表性的區域。A representative area detecting module 12 automatically detects the most representative image in the image by using information such as the position, size, number of feature points or reverse file association index stored in the database 13 Area.

其中，該代表性區域偵測模組12可利用資料庫中擬似物件的中心位置與整張圖像中心的距離反比，計算出各擬似物件之位置比例參數值；其中，該代表性區域偵測模組12可利用資料庫中擬似物件的大小，計算出各擬似物件之大小比例參數值；其中，該代表性區域偵測模組12可利用資料庫中擬似物件所含的區域性視覺特徵點數量，計算出各擬似物件之數量比例參數值；其中，該代表性區域偵測模組12可利用上述位置比例參數值、大小比例參數值以及數量比例參數值，依加權線性組合方式計算出各擬似物件之重要性參數；並將同一張圖像內的所有擬似物件重要性參數值由大至小排序，累計前k個排序後的值之總和至大於一預設臨界值時，則此k個值對應到的擬似物件所涵蓋的範圍即為該圖像之代表性區域。The representative area detecting module 12 can calculate the position ratio parameter values of the pseudo-objects by using the distance between the center position of the pseudo-object in the database and the distance of the entire image center; wherein the representative area detection The module 12 can calculate the size ratio parameter values of the pseudo-objects by using the size of the pseudo-objects in the database; wherein the representative area detecting module 12 can utilize the regional visual feature points contained in the pseudo-objects in the database. The quantity, the quantity proportional parameter value of each pseudo object is calculated; wherein the representative area detecting module 12 can calculate each of the position proportional parameter values, the size proportional parameter values, and the quantity proportional parameter values according to the weighted linear combination manner. The importance parameter of the pseudo object; and sorting all the imitation object importance parameter values in the same image from large to small, and accumulating the sum of the first k sorted values to be greater than a predetermined threshold, then k The range covered by the pseudo-object corresponding to the value is the representative area of the image.

一資料庫13，其係儲存各模組產生或使用之資料，例如：所有圖像產生區域性視覺特徵點之特徵參數、平面座標資訊、相對應之視覺文字、擬似物件編號，以及所有圖像其代表性區域所對應到的擬似物件編號。a database 13 for storing data generated or used by each module, for example, characteristic parameters of regional visual feature points generated by all images, plane coordinate information, corresponding visual characters, pseudo object numbers, and all images The pseudo-object number corresponding to its representative area.

一視覺文字產生模組14，其係將資料庫13記錄的所有擬似物件區域性視覺特徵點之特徵參數，產生視覺文字碼簿，並將其對應至視覺文字碼簿最相似之視覺文字。A visual character generating module 14 is configured to generate a visual text codebook for the characteristic parameters of all the pseudo-object regional visual feature points recorded by the database 13, and corresponding to the visual text of the visual text code book.

一反轉檔案關聯索引產生模組15，其係將資料庫13中所記錄的所有擬似物件與視覺文字建立反轉檔案(Inverted File)關聯索引，並計算每個視覺文字之反向文件頻率(Inverse Document Frequency)。A reverse file association index generation module 15 is configured to associate an inverted file (Inverted File) associated index with all the visual objects recorded in the database 13 and calculate the reverse file frequency of each visual text ( Inverse Document Frequency).

其中，擬似物件的位置如果愈靠近整張圖像的中心點，通常表示這個擬似物件愈為重要，因為一般人在照相時會把焦點擺在圖像的正中央，因此可依據第m個擬似物件的中心位置與整張圖像中心的距離反比計算出第m個擬似物件之位置比例參數值L_m ，m=1,2,...,N，L₁ +L₂ +...+L_N =1；此外，擬似物件的範圍大小如果愈大，通常也表示這個擬似物件愈為重要，愈容易吸引到人的注意力，因此可依據第m個擬似物件的大小也計算出第m個擬似物件之大小比例參數值S_m ，m=1,2,...,N，S₁ +S₂ +...+S_N =1；當然，也不是擬似物件愈大就一定愈重要，例如翱翔於天際中的飛機，背景的藍天白雲極可能會被歸類成一個大面積的擬似物件，因此我們也把特徵點數量列入考量，如果一個擬似物件所包含的區域性視覺特徵點數量愈多，通常表示這個擬似物件愈為重要，因為該擬似物件具有較多重要的視覺特徵，如邊界、紋路等等，因此我們可依據第m個擬似物件所含的區域性視覺特徵點數量計算出第m個擬似物件之數量比例參數值P_m ，m=1,2,...,N，P₁ +P₂ +...+P_N =1；將上述三個重要性比例參數值，依加權線性組合方式計算出第m個擬似物件之重要性參數O_m ，其中O_m =a*L_m +b*S_m +c*P_m ，m=1,2,...,N，a,b,c>=0且a+b+c=1；最後，將同一張圖像內的所有擬似物件重要性參數O_m 值由大至小排序，累計前k個排序後的O_m 值之總和至大於一預設臨界值時，則此k個O_m 值對應到的擬似物件所涵蓋的範圍即為該圖像之代表性區域，此偵測機制於資料庫13中存在或不存在既有圖像的情況下皆可適用。Among them, if the position of the pseudo object is closer to the center point of the whole image, it usually indicates that the imitation object is more important, because the average person will focus on the center of the image when taking photos, so it can be based on the mth mimetic object. The position of the m-th pseudo-object is proportional to the distance between the center position and the center of the entire image. L _m , m = 1, 2, ..., N, L ₁ + L ₂ +... + L _N =1; In addition, the larger the range of the pseudo-object, the more often it means that the pseudo-object is more important, and the easier it is to attract people's attention, so the m-th can be calculated according to the size of the m-th mimic object. The size ratio parameter values of the pseudo-objects S _m , m = 1, 2, ..., N, S ₁ + S ₂ + ... + S _N =1; of course, it is not the more important the object is, the more important it is, For example, an airplane flying in the sky, the background of the blue sky and white clouds is likely to be classified into a large area of the pseudo-object, so we also consider the number of feature points, if a pseudo-object contains the number of regional visual feature points The more it is, the more important it is to represent this mimic object, because the mimic With more important visual features, such as borders, lines, etc., we can calculate the amount based on the m-th quasi feature points regional visual objects contained in the m-th pseudo-object number ratio of values of the parameter P _{m, m} = 1,2,...,N,P ₁ +P ₂ +...+P _N =1; calculate the importance of the mth pseudo-object based on the weighted linear combination of the above three importance proportional parameter values The parameter O _m , where O _m =a*L _m +b*S _m +c*P _m ,m=1,2,...,N,a,b,c>=0 and a+b+c= 1; Finally, all of quasi importance parameter object O within an image with the value of _m by sorting large to small, the cumulative sum of the values _m k O before and after the sort to be greater than a predetermined threshold, then the k O _m a range of values corresponding to pseudo object contemplated is the representative of the image of the region, this detection scheme where both the presence or absence of an image can be useful in database 13.

請參閱圖二及圖五所示，為本發明一種自動擬似物件式圖像代表性區域偵測方法之第一及第三實施例示意圖：在本實施例中代表性區域偵測模組即將圖二中之擬似物件201、202、203組成該範例圖之最具代表性區域501，其中a=0.5,b=0.25,c=0.25，預設臨界值為0.75，擬似物件之範圍以橢圓表示，而最具代表性區域則以長方形表示。Please refer to FIG. 2 and FIG. 5 , which are schematic diagrams of the first and third embodiments of the method for detecting the representative area of the automatic analog object image according to the present invention: in this embodiment, the representative area detecting module is about to be shown. The pseudo-objects 201, 202, and 203 of the second form constitute the most representative region 501 of the example graph, wherein a=0.5, b=0.25, c=0.25, the preset threshold value is 0.75, and the range of the pseudo-object is represented by an ellipse. The most representative areas are represented by rectangles.

另外，可藉由進一步引入文字檢索中TF-IDF的概念，當資料庫存在既有圖像時，本發明之代表性區域偵測模組可有另一種偵測機制(請同時參閱圖四、圖六及圖七所示)：其中，該視覺文字產生模組14，係將資料庫13中記錄的所有擬似物件區域性視覺特徵點143利用其特徵參數，以K-means演算法進行分群(共Q群)，並產生視覺特徵參數分群142，再依所有群重心產生視覺特徵參數碼簿，即所謂視覺文字碼簿141，並將資料庫13中記錄的所有擬似物件區域性視覺特徵點143的視覺特徵參數，根據視覺文字碼簿141，將其分別對應至最相似之視覺文字141q；再利用該反轉檔案關聯索引產生模組15，接著將資料庫13中所記錄的每個擬似物件110(PO_i,j ，表示第i張圖的第j個擬似物件，共R個)與每個視覺文字141q建立反轉檔案(Inverted File)關聯索引151，並計算每個視覺文字(VW)的反向文件頻率(Inverse Document Frequency)IDF_VW ，IDF_VW =(total number of pseudo-objects/number of pseudo-objects where the VW appears)，對於每個視覺文字而言，IDF_VW 值與該視覺文字的重要性成正比。In addition, by further introducing the concept of TF-IDF in text search, when the data inventory is in the existing image, the representative area detecting module of the present invention can have another detection mechanism (please refer to FIG. 4 at the same time). FIG. 6 and FIG. 7): wherein the visual character generating module 14 uses the K-means algorithm to group all the pseudo-object regional visual feature points 143 recorded in the database 13 by using the characteristic parameters thereof ( A total of Q groups), and a visual feature parameter grouping 142 is generated, and a visual feature parameter codebook, that is, a so-called visual character codebook 141, is generated according to the center of gravity of all the groups, and all the pseudo-objective regional visual feature points recorded in the database 13 are 143. The visual feature parameters are respectively corresponding to the most similar visual characters 141q according to the visual text codebook 141; and the inverted file association index generating module 15 is used, and then each of the pseudo-objects recorded in the database 13 is recorded. 110 (PO _i,j , representing the jth mimetic object of the i-th picture, a total of R) establishes an inverted file association index 151 with each visual character 141q, and calculates each visual text (VW) Reverse file frequency Inverse Document Frequency IDF _VW , IDF _VW = (total number of pseudo-objects / number of pseudo-objects where the VW appears), for each visual text, the IDF _VW value and the importance of the visual text become Just proportional.

相對於不存在於資料庫13中的圖像，代表性區域偵測模組12先利用擬似物件產生模組11產生擬似物件相關資訊，再將該圖所有擬似物件區域性視覺特徵點的視覺特徵參數根據視覺文字產生模組14之視覺文字碼簿141對應至最相似之視覺文字141q，並計算每個擬似物件所含視覺文字頻率(Term Frequency，TF_VW )，其中：TF_VW =(number of occurrences of the VW in certain pseudo-object/sum of number of occurrences of all VWs in certain pseudo-object)，TF_VW 值與該視覺文字141q相對於擬似物件之重要性成正比；其中，TF-IDF的意義在於，某個視覺文字141q如果在某個擬似物件中出現次數愈多，表示這個視覺文字141q愈重要；但是，如果這個視覺文字141q出現在不同擬似物件愈頻繁，也表示這個視覺文字141q愈不具代表性；因此，代表性區域偵測模組12對於新圖像中所有出現在第m個擬似物件的視覺文字141q(共T_m 個)，可利用上述之視覺文字141q的反向文件頻率與擬似物件的視覺文字頻率，將兩者以相乘並加總的方式計算出第m個擬似物件110之重要性參數O_m ，，m=1,2,...,N；最後，將該圖像內的所有擬似物件重要性參數O_m 值由大至小排序，累計前k個排序後的O_m 值之總和至大於一預設臨界值時，則此k個Om值對應到的擬似物件所涵蓋的範圍即為該圖像之最具代表性區域。The representative area detecting module 12 first generates the pseudo-object related information by using the pseudo object generating module 11 with respect to the image not existing in the database 13, and then visually features the regional visual feature points of all the pseudo-objects of the figure. The parameter corresponds to the most similar visual character 141q according to the visual text codebook 141 of the visual text generating module 14, and calculates the visual character frequency (Term Frequency, TF _VW ) contained in each pseudo-object, wherein: TF _VW = (number of Occurrences of the VW in certain pseudo-object/sum of number of occurrences of all VWs in certain pseudo-object), the TF _VW value is proportional to the importance of the visual text 141q relative to the pseudo-object; wherein the meaning of TF-IDF Therefore, if a visual character 141q appears more frequently in a certain object, the more important the visual character 141q is; however, if the visual character 141q appears more frequently in different pseudo-objects, it means that the visual character 141q is less Representative; therefore, the representative area detecting module 12 for all the visual characters 141q appearing in the mth mimic object in the new image (total T _m )), the reverse file frequency of the visual character 141q and the visual character frequency of the pseudo object are used, and the importance parameter O _m of the mth pseudo object 110 is calculated by multiplying and summing the two. , M = 1,2, ..., N ; Finally, the pseudo-all objects O importance parameter values from large to small _m ordering within the image, the cumulative value sum _m O k before and after the sort greater than When a threshold value is preset, the range covered by the pseudo object corresponding to the k Om values is the most representative region of the image.

請參閱圖二及圖八所示，為本發明一種自動擬似物件式圖像代表性區域偵測方法與系統之第一以及第四實施例示意圖：在本實施例中代表性區域偵測模組即於資料庫中已存在既有圖像的情況下，將圖二中之擬似物件204判定為該範例圖之最具代表性區域801，其中Q=10,000，預設臨界值為0.5，擬似物件之範圍以橢圓表示，而最具代表性區域則以長方形表示；此機制適用於資料庫中已存在既有圖像的情況下，讓圖像代表性區域偵測系統可自動偵測出有別於既有資料庫中的代表性物件或區域，尤其適合用在資料庫量大時，將使資料庫自然具有學習與調適性；例如一開始資料庫中並沒有台北101大樓的圖像，隨著此圖像漸漸開始增加時，偵測出以台北101大樓為代表性區域的圖像也會愈來愈多，但隨著台北101大樓圖像出現的次數愈來愈頻繁時，系統也會漸漸將台北101大樓視為背景物件，而自然開始嘗試尋找出圖中有別於台北101大樓之外的代表性區域，因此也讓資料庫更富多樣化並更具代表性。Please refer to FIG. 2 and FIG. 8 , which are schematic diagrams of a first and fourth embodiment of a method and system for detecting a representative area of an automatic analog object image according to the present invention: a representative area detecting module in this embodiment That is, in the case where the existing image already exists in the database, the pseudo object 204 in FIG. 2 is determined as the most representative region 801 of the example image, where Q=10,000, the preset threshold is 0.5, and the pseudo object is The range is represented by an ellipse, and the most representative area is represented by a rectangle. This mechanism is suitable for the image representative area detection system to automatically detect the presence of existing images in the database. Representative objects or areas in the existing database, especially suitable for large database, will make the database naturally have learning and adaptability; for example, there is no image of Taipei 101 building in the initial database. As the image gradually began to increase, more and more images were detected in the Taipei 101 building. However, as the number of images in the Taipei 101 building became more frequent, the system would Gradually will be Taipei 10 The 1 building is considered a background object, and naturally it tries to find a representative area that is different from the Taipei 101 building, thus making the database more diverse and representative.

本發明所提供之一種自動擬似物件式圖像代表性區域偵測方法與系統，與其他習用技術相互比較時，更具備下列優點：The method and system for detecting the representative area of the automatic analog object image provided by the invention have the following advantages when compared with other conventional technologies:

1.本發明可跳脫傳統習以影像處理方法嘗試切割出精確的物件區域，以及相鄰物件邊緣的特徵點到底歸屬於何者的難題，以特徵點座標與分群演算法即可快速找出圖像中的擬似物件範圍與數量。1. The present invention can jump off the traditional image processing method to try to cut out the precise object area, and the problem of which the feature points of the adjacent object edge belong to, and quickly find the figure by feature point coordinates and grouping algorithm. The range and number of pseudo-objects in the image.

2.本發明可有效代表任一圖像中多個物件，增加特徵資料的代表性。2. The invention can effectively represent multiple objects in any image and increase the representativeness of the feature data.

3.本發明可自動產生一圖像中所有可能包含物件的位置與特徵資訊，有利於使用在各種以物件為基礎的影像應用之中。3. The present invention automatically generates position and feature information for all possible objects in an image, facilitating use in a variety of object-based imaging applications.

4.本發明透過擬似物件的資訊，可進一步自動偵測出圖像代表性(感興趣)區域，不需藉由人工介入指定或畫出區域，尤其適合處理網際網路上大量的圖像資料。4. The invention can further automatically detect the representative (interesting) region of the image through the information of the pseudo object, and does not need to manually designate or draw the region, and is particularly suitable for processing a large amount of image data on the Internet.

5.本發明將文字檢索中TF-IDF的概念，轉化到圖像視覺文字特徵為基礎之擬似物件重要性判斷的應用層面。5. The invention transforms the concept of TF-IDF in text retrieval into the application level of the importance judgment of the object based on the visual character of the image.

6.本發明對於一張新輸入的圖像，可自動探勘出既有資料庫中不存在的新物件圖像，讓資料庫更富多樣化並更具代表性。6. The present invention automatically extracts images of new objects that do not exist in the existing database for a newly input image, so that the database is more diverse and representative.

上列詳細說明係針對本發明之一可行實施例之具體說明，惟該實施例並非用以限制本發明之專利範圍，凡未脫離本發明技藝精神所為之等效實施或變更，均應包含於本案之專利範圍中。The detailed description of the preferred embodiments of the present invention is intended to be limited to the scope of the invention, and is not intended to limit the scope of the invention. The patent scope of this case.

綜上所述，本案不但在技術思想上確屬創新，並能較習用物品增進上述多項功效，應已充分符合新穎性及進步性之法定發明專利要件，爰依法提出申請，懇請　貴局核准本件發明專利申請案，以勵發明，至感德便。To sum up, this case is not only innovative in terms of technical thinking, but also able to enhance the above-mentioned multiple functions compared with conventional articles. It should fully comply with the statutory invention patent requirements of novelty and progressiveness, and apply in accordance with the law. I urge you to approve this article. Invention patent application, in order to invent invention, to the sense of virtue.

11．．．擬似物件產生模組11. . . Analog object generation module

110．．．擬似物件110. . . Quasi-like object

12．．．代表性區域偵測模組12. . . Representative area detection module

13．．．資料庫13. . . database

14．．．視覺文字產生模組14. . . Visual text generation module

141．．．視覺文字碼簿141. . . Visual textbook

141q．．．視覺文字141q. . . Visual text

15．．．反轉檔案關聯索引產生模組15. . . Reverse file association index generation module

151．．．反轉檔案關聯索引151. . . Reverse file association index

143．．．區域性視覺特徵點143. . . Regional visual feature point

142．．．視覺特徵參數分群142. . . Visual feature parameter grouping

201^~ 204．．．擬似物件201 ^~ 204. . . Quasi-like object

301^~ 305．．．擬似物件301 ^~ 305. . . Quasi-like object

501．．．最具代表性區域501. . . Most representative area

801．．．最具代表性區域801. . . Most representative area

圖一為本發明一種自動擬似物件式圖像代表性區域偵測方法之產生擬似物件之流程圖；1 is a flow chart of generating a pseudo-object of an automatic pseudo-object-like image representative region detecting method according to the present invention;

圖二為本發明一種自動擬似物件式圖像代表性區域偵測方法之第一實施例示意圖；2 is a schematic diagram of a first embodiment of a method for detecting a representative area of an automatic analog object image according to the present invention;

圖三為本發明一種自動擬似物件式圖像代表性區域偵測方法之第二實施例示意圖；FIG. 3 is a schematic diagram of a second embodiment of a method for detecting a representative area of an automatic analog object image according to the present invention; FIG.

圖四為本發明一種自動擬似物件式圖像代表性區域偵測系統之架構圖；4 is an architectural diagram of a representative area detection system for an automatic analog object image according to the present invention;

圖五為本發明一種自動擬似物件式圖像代表性區域偵測方法之第三實施例示意圖；FIG. 5 is a schematic diagram of a third embodiment of a method for detecting a representative area of an automatic analog object image according to the present invention; FIG.

圖六為本發明一種自動擬似物件式圖像代表性區域偵測方法之視覺文字產生模組產生視覺文字示意圖；6 is a schematic diagram of visual text generated by a visual text generating module of an automatic analog object image representative region detecting method according to the present invention;

圖七為本發明一種自動擬似物件式圖像代表性區域偵測方法之反轉檔案關聯索引產生模組之反轉檔案關聯索引示意圖；FIG. 7 is a schematic diagram of a reverse file association index of a reverse file association index generation module of an automatic analog object image representative area detection method according to the present invention; FIG.

圖八為本發明一種自動擬似物件式圖像代表性區域偵測方法與系統之第四實施例示意圖。FIG. 8 is a schematic diagram of a fourth embodiment of a method and system for detecting a representative area of an automatic analog object image according to the present invention.

11．．．擬似物件產生模組11. . . Analog object generation module

13．．．資料庫13. . . database

14．．．視覺文字產生模組14. . . Visual text generation module

Claims

An automatic pseudo-object-like image representative region detecting method is a method for generating a pseudo-like object by using a regional visual feature point, and the method for generating a pseudo-object, comprising: Step 1: using a regional feature point detecting method from the image Select regional characteristic feature points; Step 2: Calculate and record the corresponding feature parameters of all regional visual feature points according to the regional visual feature points found in Step 1 using the regional feature point description method and its Plane coordinate information; Step 3: According to step 2, find the plane coordinate information of all regional visual feature points, use clustering algorithm to automate grouping, and automatically divide into N groups, where N>=1, covered by feature points in each group The area constitutes a pseudo-object.

An automatic analog object image representative region detecting method according to claim 1, wherein the number of the pseudo objects can be specified in advance, and the image is divided into N A superimposable area of a fixed shape.

An automatic pseudo-object image representative region detecting method according to claim 1, wherein the number of the pseudo-objects can be automatically determined by a grouping algorithm, and the image is divided into N pieces. An area similar to an elliptical arbitrary shape and which may not be continuous.

An automatic pseudo-object image representative area detecting method according to claim 1, wherein when only two pseudo-objects are generated, wherein the pseudo-objects are located closer to the center point coordinates of the entire image. The image is the foreground of the image, and the other is regarded as the background of the image, thereby achieving the function of automatically distinguishing the foreground and the background of the image.

An automatic pseudo-object-like image representative area detecting system comprises: a pseudo-object generating module, which uses the generating method of the input object to generate the pseudo-object and related feature information, and records the data in the database. A representative area detection module that automatically detects the most representative image in the image by using information such as the position, size, number of feature points, or reverse file association index stored in the database. a database; a database that stores data generated or used by each module; a visual text generation module that records characteristic parameters of regional visual feature points of all pseudo-objects recorded in the database to generate a visual text book And corresponding to the visual text code most similar to the visual text; a reverse file association index generation module, which associates all the pseudo-objects recorded in the database with the visual text to create an inverted file (Inverted File) Index and calculate the Inverse Document Frequency of each visual text.

An automatic analog object image representative area detecting system according to claim 5, wherein the database records characteristic parameters, plane coordinate information, and corresponding vision of regional image feature points of all images. The text, the imitation object number, and the pseudo object number corresponding to the representative area of all images.

For example, an automatic analog object image representative area detecting system according to item 5 of the patent application scope, wherein the representative area detecting module can utilize the position of each mimic object in the database to be inversely proportional to the distance of the entire image center. Calculate the position proportional parameter values of each mimic object.

For example, an automatic pseudo-object image representative area detecting system according to item 5 of the patent application scope, wherein the representative area detecting module can calculate the size proportional parameter of each quasi-object by using the size of each mimic object in the database. value.

An example of an automatic analog object image as in item 5 of the patent application scope The area detecting system, wherein the representative area detecting module can calculate the quantity proportional parameter values of each of the pseudo-objects by using the number of regional visual feature points included in each of the quasi-objects in the database.

An automatic analog object image representative area detecting system according to claim 5, wherein the representative area detecting module generates the image that is not present in the database by using the pseudo object. The module generates the information related to the object, and the visual character generating module generates visual characters corresponding to the visual characteristic parameters of all the regional visual feature points of the pseudo object, and calculates the visual frequency of each pseudo object (Term Frequency). .

An automatic analog object image representative area detecting system according to claim 5, wherein the representative area detecting image generates a representative area of the image by one or more pseudo-objects Composition, this area can be a continuous or discontinuous block and recorded in the database.

An automatic pseudo-image-like image representative region detecting method is applicable to the case where an existing image exists in a database, and the steps thereof include: a. generating an image of the pseudo-object and related features of the input image, and recording In the database; b. Calculate the position proportional parameter values of the pseudo-objects according to the positions of the pseudo-objects and the distance from the center of the whole image; c. Calculate each according to the size of each mimic object in the figure The size and proportion parameter values of the pseudo-objects; d. According to the number of regional visual feature points contained in the pseudo-objects of the figure, calculate the quantity proportional parameter values of the pseudo-objects; e. The three proportional parameter values of the step ac are The weighted linear combination method calculates the importance parameter values of each mimic object; f. sorts all the imitation object importance parameter values in the image from large to small, and accumulates the sum of the first k sorted importance parameter values to More than one pre When the threshold is set, the range covered by the pseudo object corresponding to the k importance parameter values is the representative region of the image.

An automatic pseudo-object-like image representative region detecting method is applicable to the case where there is no existing image in the database, and the steps include: a. generating an image of the pseudo-object and its related features, and recording the input image In the database; b. Calculate the position proportional parameter values of the pseudo-objects according to the position of the pseudo-objects and the distance from the center of the whole image; c. Calculate according to the size of each mimic object in the figure The size and proportion parameter values of each of the pseudo-objects; d. Calculate the number-of-proportional parameter values of each mimic object according to the number of regional visual feature points contained in the pseudo-objects of the figure; e. The three proportional parameter values of step ac, Calculate the importance parameter values of each mimic object according to the weighted linear combination method; f. Sort all the imitation object importance parameter values in the image from large to small, and accumulate the sum of the first k sorted importance parameter values. When the value is greater than a predetermined threshold, the range covered by the pseudo object corresponding to the k importance parameter values is a representative region of the image.

An automatic pseudo-image-like image representative region detecting method is applicable to the case where an existing image exists in a database, and the steps thereof include: a. generating an image of the pseudo-object and related features of the input image, and recording In the database; b. grouping all the visual features of the pseudo-objects recorded in the database with their visual feature parameters to generate a visual text code book; c. Recording all the visual features of the pseudo-objects in the database The visual characteristic parameters are corresponding to the most similar visual text according to the visual text code book; d. an inverted file association index is established for all the pseudo-objects recorded in the database and the visual characters, and each visual text is calculated. Reverse File frequency; e. For the new image that does not exist in the database, the pseudo-object related information is generated, and the visual feature parameters of the regional visual feature points of all the pseudo-objects of the figure are corresponding to the most similar visual text according to the visual text code book. And calculating the frequency of the visual characters contained in each of the pseudo-objects; f. multiplying and adding the reverse file frequency of the visual text of step d to the visual text frequency of the pseudo-object of step e, and calculating the importance parameters of each pseudo-object Value; g. sort all the imitation object importance parameter values in the image from large to small, and accumulate the sum of the first k sorted values to be greater than a predetermined threshold, then the k values correspond to The range covered by the pseudo-object is the most representative area of the image.