US20200050885A1 - Image processing system - Google Patents
Image processing system Download PDFInfo
- Publication number
- US20200050885A1 US20200050885A1 US16/660,559 US201916660559A US2020050885A1 US 20200050885 A1 US20200050885 A1 US 20200050885A1 US 201916660559 A US201916660559 A US 201916660559A US 2020050885 A1 US2020050885 A1 US 2020050885A1
- Authority
- US
- United States
- Prior art keywords
- window
- classifier
- stage
- detector
- image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/74—Image or video pattern matching; Proximity measures in feature spaces
- G06V10/75—Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
- G06V10/751—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G06K9/6202—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
- G06F18/2148—Generating training patterns; Bootstrap methods, e.g. bagging or boosting characterised by the process organisation or structure, e.g. boosting cascade
-
- G06K9/00228—
-
- G06K9/00986—
-
- G06K9/6257—
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
- G06V10/7747—Organisation of the process, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/94—Hardware or software architectures specially adapted for image or video understanding
- G06V10/955—Hardware or software architectures specially adapted for image or video understanding using specific electronic processors
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
Definitions
- the present invention relates to an image processing system.
- Face detection and tracking for example, as described in European Patent No. EP2052347 (Ref: FN-143) is a well-known example of feature detection in image processing. These techniques enable one or more face regions within a scene being imaged to be readily delineated and to allow for subsequent image processing based on this information. Such subsequent image processing can include face recognition which attempts to identify individuals being imaged, for example, for tagging or authentication purposes; auto-focusing by bringing a detected and/or selected face region into focus; or defect detection and/or correction of the face region(s).
- FIG. 1 there is shown a block diagram for a conventional type template matching engine (TME) 10 for identifying features within an image or portion of an image.
- TME type template matching engine
- the feature calculation module 18 it is possible for the feature calculation module 18 to provide the required features buffer 20 for a new window at each clock cycle.
- the classifier 22 typically processes one detector cascade stage per clock cycle and typically, this happens only after the processing pipeline is filled at the start of each new window—this can again involve a number of clock cycles.
- the classifier 22 needs to stall the whole pipeline before it (using a backpressure mechanism indicated by the upwards arrows connecting elements 22 - 14 ).
- the classifier 22 is the bottleneck of the process, due to the fact that the detector cascade stages must be applied in a sequence.
- a Prefilter module is added to a template matching engine (TME) in order to improve performance by accelerating processing.
- TEE template matching engine
- the TME can be accelerated of the order of up to 20 times because the Prefilter can process one window per clock cycle, while an exemplary classifier would need 20 clock cycles to apply the same first 12 stages of the detector (8 cycle pipeline latency+12 cycles for the 12 stages).
- a classifier is trained to base each decision on separate feature maps so that features can be read in a single clock cycle and each stage can be executed in a single clock cycle.
- a programmable controller allows a plurality of reduced stage detectors to be run on a window before deciding on their progress and then determining which, if any, longer stage detectors should be applied to the windows.
- FIG. 1 shows a conventional TME module
- FIG. 2 shows TME module including a Prefilter in accordance with a first embodiment of the present invention
- FIG. 3 illustrates a feature map, window, detector nodes and stages employed within embodiments of the present invention
- FIG. 4 shows the processing for the first stage of an RTC detector cascade employed within an exemplary Prefilter of FIG. 2 ;
- FIG. 5 illustrates the data employed within an RTC detector cascade stage employed within an exemplary Prefilter of FIG. 2 ;
- FIG. 6 shows the architecture of the Prefilter of FIG. 2 in more detail
- FIG. 7 shows detector stage data being collected within FIFO memories within a Prefilter according to an embodiment of the invention
- FIG. 8 illustrates pixels from successive windows of a frame being processed
- FIG. 9 illustrates a template matching engine including a programmable classifier according to a second embodiment of the present invention.
- FIG. 2 there is shown a TME 10 ′ including a Prefilter 24 according to a first embodiment of the present invention.
- the function of the remaining elements of FIG. 2 is basically as explained for the TME of FIG. 1 except where indicated.
- the processing flow is as follows:
- the task of the Prefilter 24 is to reject as many windows as possible before they are analyzed by the classifier 22 .
- the Prefilter 24 performs its task on the fly as window information is being read from the system bus, while running the classifier 22 may take many more clock cycles—for example, a full detector cascade applied by the classifier 22 could have up to 4000 stages or more.
- each stage of the Prefilter 24 comprises a decision tree of a Random Tree Classifier (RTC).
- RTC Random Tree Classifier
- each decision tree comprises 3 nodes, a root node 0 and sub-branch nodes 1 and 2 .
- Each node of a decision tree corresponds to a pixel location within a window to be examined i.e. a relative x,y displacement within a window.
- the values tested at each decision node can come from the corresponding location of a selected feature map.
- a value for a root node, Node 0 is compared with a threshold for that node, Threshold 0 , to determine which sub-branch of the tree should be taken.
- the value for either Node 1 or Node 2 is tested against a respective threshold and depending on the result, a score for the detector cascade is either incremented or decremented by a given amount.
- the thresholds, feature maps and score for each decision tree are determined through training against a test data set.
- each of the 3 nodes for a stage are associated with a relative x,y location within a window and a specific feature map (channel) as well as a threshold value; and for each stage, there will be a stage threshold and a resulting stage score.
- the Prefilter 24 is interposed between the feature calculation/buffer modules 18 / 20 and the classifier 22 so that as feature maps are generated cycle-by-cycle as the image is being scanned, knowing the x,y locations of the nodes of interest, the Prefilter 24 can read the required values from the relevant feature maps (channels 0 . . . 15 ) to apply the decision trees for each of the stages of the Prefilter 24 . Then, according to the accumulated score for the stages applied by the Prefilter 24 , the Prefilter 24 can provide its decision to the classifier 22 to indicate whether or not the classifier 22 should apply full detector cascade(s) to the window as soon as the last relevant node location in a window is reached.
- the Prefilter will have made its decision before a complete window is read from memory.
- the classifier 22 can signal, if required, that a window does not contain an object to be detected, or know immediately if it might need or not to apply any further detector cascades to the window.
- Configuration information 70 provided by the detectors buffer 12 prior to image processing is fed to each of a number of selectors 72 , 74 and comparators 76 —one per node 0 . . . 35 .
- Channel information for each of nodes 0 . . . 35 is written to selectors 72 and location information for nodes 0 . . . 35 is written to selectors 74 . Finally, threshold information for each of nodes 0 . . . 35 is written to the set of comparators 76 .
- Selectors 72 direct channel information for each image pixel location as it is generated to a corresponding selector 74 . When each selector 74 detects that its programmed x,y location within a window has been reached, it provides the selected channel value to a corresponding threshold comparator from the set of comparators 76 .
- a comparator When a comparator detects a channel value provided at its input from a selector 74 , it performs its comparison and writes its decision to a corresponding FIFO 78 .
- a FIFO is provided for every node that is used in any of the detector stages of the Prefilter detector cascade.
- the Prefilter In order to be able to calculate a window score, the Prefilter needs all node decisions to be available in FIFO memories for that window. When all FIFO for all nodes have at least 1 location written, the Prefilter pops-out data from all FIFO memories and calculates 1 window score according to a threshold algorithm 80 .
- the value for node 0 will determine which of the values from nodes 1 or 2 are to be employed to contribute to the final value for the decision stages applied to the window.
- the accumulated score from the detector stages can be compared against a configured window threshold to provide a final score value for a window and this can indicate the level of confidence of the Prefilter 24 in relation to whether a given window contains or does not contain an object of interest.
- This characteristic also enables data to be read in bursts of pixels for example 4 or 8 pixels.
- multiplying and multiplexing the architecture of FIG. 7 it is possible to perform calculations for more than 1 window per clock cycle and so to eliminate or identify windows as candidates for full classification at an even faster rate.
- RTC classifier cascade allows the Prefilter 24 to not alone provide a yes/no decision in relation to any given window, but also a score indicative of the confidence from a detector that a window either includes or does not include an object to be detected. This can be useful for other applications, performing subsequent image processing on a given image, but the information can also be used with the TME 10 ′ especially if multiple windows are being processed in parallel or if multiple detector cascades are being applied by the classifier 22 as explained in more detail below.
- the classifier 22 can apply one or more detector cascades.
- the Prefilter 24 is based on number of RTC stages. Each of the channel values generated as a pixel is read from the system bus are made available to each of the selectors 72 and so each of these can be freely programmed based on the training data set to choose from whichever channel enables the Prefilter 24 to best discriminate between windows which should be rejected before full classification and those which should be subjected to full classification.
- the classifier 22 can also be based on such RTC stages. However, within the classifier 22 each stage is applied in sequence, building a score for a window. At each stage of the detector a stage score is added or subtracted to/from the window score, depending on the stage evaluation result. A window score after each stage is compared with a threshold for a stage. While the window score is above the stage threshold, the next detector stage is applied, whereas if the window score is below the stage threshold the detector is abandoned. If the last stage of the detector cascade is reached, the window score is compared with the global threshold of the detector cascade and if the window score is above the global threshold, a match is signaled.
- Each stage of the classifier is based on channel values corresponding to three nodes within a window. If no assumptions were made about which channels each node of a decision tree for a stage were to be associated with, then at least 2 successive reads from the same channel might be required before a decision could be taken for a stage (assuming that one 1 sub-branch decision for either node 1 or 2 needs to be taken). However, in order to speed up decision making within the classifier 22 , in embodiments of the classifier 22 based on RTC decision trees, each stage is restricted to nodes based on different channels.
- Node 0 for a stage might be based on a HOG value for at a pixel location; Node 1 for a stage might be based on an intensity value for a pixel; and Node 2 for a stage might be based on an II value for a pixel.
- TME might be required to apply a number of different detectors to any given window.
- a biometric recognition application running on the same device as the TME 10 ′ where the application might be required to attempt to recognize a user in one of a number of different poses, for example, front, tilted, left or right side profile.
- the detectors buffer 12 could be provided with a plurality of detector cascades, each for a different detector.
- the classifier 22 might still be required to run a number of full length detector cascades on every window passed by the Prefilter 24 .
- a programmable prefilter (PPF) 26 is provided in order to control the detectors applied by a modified classifier 22 ′.
- PPF programmable prefilter
- the PPF 26 is provided with a rules engine (not shown) which enables the PPF to determine which detector cascades from detectors buffer 12 will be applied or which detectors will be applied in full to any given window.
- the rules engine is either pre-programmed according to application requirements i.e. hardcoded, or the rules engine can be configured by an application (for example, the biometric recognition application referred to above) by providing the required configuration information across the system bus.
- the detectors buffer stores 4 full detector cascades.
- the PPF can apply a first limited number of stages from each cascade, say 12, to a current window. It does this by providing the detector configuration to the classifier 22 ′ via a bus 27 in a similar fashion to the manner in which the classifier 22 of FIGS. 1 and 2 is provided with a detector cascade from the detectors buffer 12 .
- the PPF however is also able to communicate with the classifier 22 ′ via a window control interface (Win_Ctrl) 30 .
- This interface 30 provides the PPF 26 with a score once each detector cascade is complete. Using the scores from each limited stage detector cascade, the PPF can now decide which further detector cascade might be applied to the current window. This could mean that rather than applying 4 full detector cascades to every window not rejected by a Prefilter 24 (where provided), the classifier might only need to apply 1 full detector cascade following a number of limited stage cascades.
- the rules engine could also control whether all of the limited stage detector cascades are indeed applied to a given window—so for example, if a first limited stage detector cascade returned a very high score for a window, the PPF 26 might decide to proceed directly to applying the corresponding full length detector cascade on that window.
- the PPF approach becomes even more useful when applied in conjunction with a classifier 22 ′ based on RTC stages.
- a Prefilter 24 were being employed with the PPF 26 and classifier 22 ′, it would be beneficial if each employed the same burst read size.
- Using a burst read means that detector stages for the classifier 22 ′ can be applied for a plurality of successive windows in parallel.
- the Win_Ctrl interface 30 enables to PPF to obtain scores from multiple windows in a single clock cycle.
- the PPF 26 could then decide to indicate to the classifier 22 ′ via the Win_Ctrl interface that it should only apply a full stage detector corresponding to the second limited stage detector to the windows.
- the full stage detector is applied to all of windows 0 . . . 7 or just to one of windows 0 . . . 7 as the classifier 22 ′ will not be able to advance to the sequence of windows following windows 0 . . . 7 until the full stage detector has completed processing any of windows 0 . . . 7 .
- the information garnered from applying the full stage detector to all of the windows can be used by the PPF to determine the processing to be applied to subsequent windows.
- the approach of applying a number of limited stage detectors before using their results to determine which of any of a number of full stage detectors is to be applied to a window provides a significant reduction in the time required to check an image for the presence of a number of different types of object—or an object such as a face having a number of potential different appearances.
- the TME of the embodiments may only be concerned with processing a portion of an image.
- an application running within the system may determine that only a region of interest (ROI) from a complete image might need to be scanned for the presence of objects and so only this portion might be supplied to the TME 10 ′, 10 ′′ or else the TME might be signaled to only apply the classifier 22 , 22 ′ to a subset of received image data.
- ROI region of interest
- the TME might be signaled to only apply the classifier 22 , 22 ′ to a subset of received image data.
- biometric recognition based on iris patterns
- only areas of an image surrounded by skin portions might be examined by the classifier 22 , 22 ′.
- an image might be provided to the TME in stripes to limit the amount of memory required by the TME 10 ′, 10 ′′.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Computing Systems (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- General Engineering & Computer Science (AREA)
- Image Analysis (AREA)
Abstract
An image processing system comprises a template matching engine (TME). The TME reads an image from the memory; and as each pixel of the image is being read, calculates a respective feature value of a plurality of feature maps as a function of the pixel value. A pre-filter is responsive to a current pixel location comprising a node within a limited detector cascade to be applied to a window within the image to: compare a feature value from a selected one of the plurality of feature maps corresponding to the pixel location to a threshold value; and responsive to pixels for all nodes within a limited detector cascade to be applied to the window having been read, determine a score for the window. A classifier, responsive to the pre-filter indicating that a score for a window is below a window threshold, does not apply a longer detector cascade to the window before indicating that the window does not comprise an object to be detected.
Description
- This application is a continuation of U.S. patent application Ser. No. 15/380,906 filed Dec. 15, 2016 which claims the benefit of U.S. Provisional Patent Application No. 62/387,247 filed on Dec. 23, 2015, the contents of which are expressly incorporated by reference herein in their entirety.
- The present invention relates to an image processing system.
- Feature detection within images and streams of images is becoming an increasingly important function in image acquisition and processing devices.
- Face detection and tracking, for example, as described in European Patent No. EP2052347 (Ref: FN-143) is a well-known example of feature detection in image processing. These techniques enable one or more face regions within a scene being imaged to be readily delineated and to allow for subsequent image processing based on this information. Such subsequent image processing can include face recognition which attempts to identify individuals being imaged, for example, for tagging or authentication purposes; auto-focusing by bringing a detected and/or selected face region into focus; or defect detection and/or correction of the face region(s).
- Referring now to
FIG. 1 , there is shown a block diagram for a conventional type template matching engine (TME) 10 for identifying features within an image or portion of an image. The processing steps of the TME are: -
- 1. A detector cascade is loaded into a
detectors buffer 12 from system memory (not shown) across a system bus. A detector cascade comprises information for a sequence of stages which are applied to a window within an image to determine if the window contains an object to be detected. The detector cascade, use of which is explained in more detail below, can be arranged to be applied by aclassifier 22 to one or more different forms of features extracted from an image. As well as the image intensity (Y) value itself, examples of features which can be employed by a detector cascade include: Integral Image or II2 image typically employed by a HAAR classifier, Histogram of Gradients (HoG), Census or Linear Binary Patterns (LBP). Details of methods for producing HoG maps are disclosed in PCT Application No. PCT/EP2015/073058 (Ref: FN-398) and U.S. Application No. 62/235,065 filed 30 Sep. 2015 (Ref: FN-0471) and techniques for providing multiple feature maps for a region of interest within an image are disclosed in U.S. Patent Application No. 62/210,243 filed 26 Aug. 2015 (Ref: FN-469); - 2. Intensity plane information, for example, a luminance channel, for the input image or image portion is loaded into a
Y cache 14 from the system memory across the system bus. (Other image planes could also be used if required); - 3. The image in the Y cache is scanned with a sliding window on various scales, one scale at a time as follows:
- a. A
resampler module 16 resampler the input image to the desired scale (usually processing begins with the most downsampled version of an image to detect the largest features). - b. The window size employed after the
resampler 16 is typically fixed and, depending on the application and implementation, may be 22×22, 32×32 or 32×64 pixels. (Thus the size of object being detected within a given image depends on the degree of downsampling of the image prior to application of a detector cascade.) - c. The sliding window step between adjacent windows is typically 1 or 2 pixels.
- a. A
- 4. For each pixel location of the sliding window, the values for the corresponding locations of the feature maps (channels), such as those referred to above, are calculated by a
feature calculator 18. Note that the feature calculator can take into account the fact that consecutive windows overlap so it does not re-calculate feature map values that have already calculated for an image. - 5. The feature map values from the
feature calculator 18 can be buffered in afeatures buffer 20. - 6. The
classifier 22 applies the detector cascade from thedetectors buffer 12 to the feature maps for the current window in thefeatures buffer 20 to determine if the window features match or not an object of interest (e.g. a face). Within theclassifier 22, a detector cascade is typically applied stage-by-stage, building a score for a window. A complete detector cascade can have any number stages, for example, up to 4096 stages is a common maximum length. (Note that most windows fail after a few detector stages. For example, with a well-trained classifier, 95% of the windows tested fail after 12 stages.) - 7.
Steps 2 to 6 of the above process can then be repeated from scratch for the next window in the image.
- 1. A detector cascade is loaded into a
- As disclosed in PCT Application No. PCT/EP2015/073058 (Ref: FN-398), it is possible for the
feature calculation module 18 to provide the requiredfeatures buffer 20 for a new window at each clock cycle. Theclassifier 22 typically processes one detector cascade stage per clock cycle and typically, this happens only after the processing pipeline is filled at the start of each new window—this can again involve a number of clock cycles. - Thus, it will be seen that while processing one window, the
classifier 22 needs to stall the whole pipeline before it (using a backpressure mechanism indicated by the upwards arrows connecting elements 22-14). Thus, theclassifier 22 is the bottleneck of the process, due to the fact that the detector cascade stages must be applied in a sequence. - According to the present invention there is provided a system for processing images as claimed in
claim 1. - In embodiments, a Prefilter module is added to a template matching engine (TME) in order to improve performance by accelerating processing. The Prefilter has the following role and features:
-
- 1. The Prefilter applies a limited number of stages of a detector as an image is being read with a view to rejecting a high proportion of windows with as few stages as possible. In one embodiment, the Prefilter comprises enough stages to reject 95% of windows from needing to be analysed subsequently within a full detector cascade.
- 2. The Prefilter can process one window per clock cycle, meaning that it can process windows without causing backpressure in an image processing pipeline.
- 3. Only if the first limited number of stages of the Prefilter indicate that a window should not be rejected, will the Prefilter indicate to the classifier that it should apply a full detector cascade.
- Using for example, a 12 stage Prefilter, the TME can be accelerated of the order of up to 20 times because the Prefilter can process one window per clock cycle, while an exemplary classifier would need 20 clock cycles to apply the same first 12 stages of the detector (8 cycle pipeline latency+12 cycles for the 12 stages).
- In a second aspect, there is provided an image processing system as claimed in
claim 12. - According to this aspect a classifier is trained to base each decision on separate feature maps so that features can be read in a single clock cycle and each stage can be executed in a single clock cycle.
- In a third aspect, there is provide an image processing system as claimed in
claim 18. - In this aspect, a programmable controller allows a plurality of reduced stage detectors to be run on a window before deciding on their progress and then determining which, if any, longer stage detectors should be applied to the windows.
- Various embodiments of the invention will now be described, by way of example, with reference to the accompanying drawings, in which:
-
FIG. 1 shows a conventional TME module; -
FIG. 2 shows TME module including a Prefilter in accordance with a first embodiment of the present invention; -
FIG. 3 illustrates a feature map, window, detector nodes and stages employed within embodiments of the present invention; -
FIG. 4 shows the processing for the first stage of an RTC detector cascade employed within an exemplary Prefilter ofFIG. 2 ; -
FIG. 5 illustrates the data employed within an RTC detector cascade stage employed within an exemplary Prefilter ofFIG. 2 ; -
FIG. 6 shows the architecture of the Prefilter ofFIG. 2 in more detail; -
FIG. 7 shows detector stage data being collected within FIFO memories within a Prefilter according to an embodiment of the invention; -
FIG. 8 illustrates pixels from successive windows of a frame being processed; and -
FIG. 9 illustrates a template matching engine including a programmable classifier according to a second embodiment of the present invention. - Referring now to
FIG. 2 , there is shown aTME 10′ including aPrefilter 24 according to a first embodiment of the present invention. The function of the remaining elements ofFIG. 2 is basically as explained for the TME ofFIG. 1 except where indicated. In general, the processing flow is as follows: -
- 1) The
detector buffer 12 receives detector cascade configuration for both thePrefilter 24 and theclassifier 22. - 2) The
feature calculation module 18 receives an image pixel-by-pixel as before. - 3) The
Prefilter 24 receives its configuration information (node positions, channel information, thresholds) for the classifier stages it is to apply to each window of the image at once. Typically the number of detector stages applied by the Prefilter is between around 10-20 and in the illustrated example is 12 stages. - 4) The
Prefilter 24 receives all feature maps from thefeatures buffer 20, in raster order. - 5) The
classifier module 22 also receives the features maps fromfeatures buffer 20 as well as an initial decision fromPrefilter 24 signaling which candidate windows should be subjected to full classification by theclassifier 22. - 6) The
classifier 22 only applies its detectors to non-rejected windows from thePrefilter 24 and provides its final decision in relation to which windows of an image contain detected objects to other modules via the system bus.
- 1) The
- Thus, in the
TME 10′, the task of thePrefilter 24 is to reject as many windows as possible before they are analyzed by theclassifier 22. ThePrefilter 24 performs its task on the fly as window information is being read from the system bus, while running theclassifier 22 may take many more clock cycles—for example, a full detector cascade applied by theclassifier 22 could have up to 4000 stages or more. - In one embodiment of the present invention, each stage of the
Prefilter 24 comprises a decision tree of a Random Tree Classifier (RTC). A useful tutorial explaining RTC can be found at http://www.r2d3.us, “A Visual Introduction to Machine Learning”. - Referring now to
FIG. 3 , in such a case, each decision tree comprises 3 nodes, aroot node 0 and 1 and 2. Each node of a decision tree corresponds to a pixel location within a window to be examined i.e. a relative x,y displacement within a window. In the present example, the values tested at each decision node can come from the corresponding location of a selected feature map.sub-branch nodes - Referring now to
FIG. 4 , in a three node decision tree (D3) comprising a stage of a detector cascade, a value for a root node, Node0 is compared with a threshold for that node, Threshold0, to determine which sub-branch of the tree should be taken. At the sub-branch level, the value for either Node1 or Node2, according to the decision taken at Node0, is tested against a respective threshold and depending on the result, a score for the detector cascade is either incremented or decremented by a given amount. The thresholds, feature maps and score for each decision tree are determined through training against a test data set. - Referring to
FIG. 5 , each of the 3 nodes for a stage are associated with a relative x,y location within a window and a specific feature map (channel) as well as a threshold value; and for each stage, there will be a stage threshold and a resulting stage score. - Again, all of these values can be determined through training against a test data set including image windows classified as to be accepted or to be rejected i.e. that they include the kind of features which the
classifier 22 is to detect or not. - For a 12 stage D3 detector cascade being applied by the
Prefilter 24, 36 nodes will be of interest, each testing a feature map value at a corresponding window location against a threshold to determine either which other node of the decision tree is to be tested or the final score for a stage of detector cascade. - Referring to
FIG. 6 , thePrefilter 24 is interposed between the feature calculation/buffer modules 18/20 and theclassifier 22 so that as feature maps are generated cycle-by-cycle as the image is being scanned, knowing the x,y locations of the nodes of interest, thePrefilter 24 can read the required values from the relevant feature maps (channels 0 . . . 15) to apply the decision trees for each of the stages of thePrefilter 24. Then, according to the accumulated score for the stages applied by thePrefilter 24, thePrefilter 24 can provide its decision to theclassifier 22 to indicate whether or not theclassifier 22 should apply full detector cascade(s) to the window as soon as the last relevant node location in a window is reached. This means that a decision whether or not to process a window can in fact be made even before the complete window has been read from the system bus i.e. as long as a last read pixel of a window (typically the bottom right corner) is not required as a node within a detector stage, the Prefilter will have made its decision before a complete window is read from memory. Thus, by the time the complete window is read or even beforehand, theclassifier 22 can signal, if required, that a window does not contain an object to be detected, or know immediately if it might need or not to apply any further detector cascades to the window. - Referring now to
FIG. 7 , which shows an exemplary configuration for thePrefilter 24.Configuration information 70 provided by thedetectors buffer 12 prior to image processing is fed to each of a number of 72, 74 andselectors comparators 76—one pernode 0 . . . 35. - Channel information for each of
nodes 0 . . . 35 is written toselectors 72 and location information fornodes 0 . . . 35 is written toselectors 74. Finally, threshold information for each ofnodes 0 . . . 35 is written to the set ofcomparators 76.Selectors 72 direct channel information for each image pixel location as it is generated to acorresponding selector 74. When eachselector 74 detects that its programmed x,y location within a window has been reached, it provides the selected channel value to a corresponding threshold comparator from the set ofcomparators 76. When a comparator detects a channel value provided at its input from aselector 74, it performs its comparison and writes its decision to acorresponding FIFO 78. A FIFO is provided for every node that is used in any of the detector stages of the Prefilter detector cascade. In order to be able to calculate a window score, the Prefilter needs all node decisions to be available in FIFO memories for that window. When all FIFO for all nodes have at least 1 location written, the Prefilter pops-out data from all FIFO memories and calculates 1 window score according to athreshold algorithm 80. - So for example, the value for
node 0 will determine which of the values from 1 or 2 are to be employed to contribute to the final value for the decision stages applied to the window. The accumulated score from the detector stages can be compared against a configured window threshold to provide a final score value for a window and this can indicate the level of confidence of thenodes Prefilter 24 in relation to whether a given window contains or does not contain an object of interest. - Referring to
FIG. 8 , it will be appreciated that as nodes for windows are in the same relative x,y locations within every window, as theTME 10′ scans across an image, the FIFOs will fill for successive windows so that decisions can be provided at a rate of 1 per clock cycle. - This characteristic also enables data to be read in bursts of pixels for example 4 or 8 pixels. Thus by multiplying and multiplexing the architecture of
FIG. 7 it is possible to perform calculations for more than 1 window per clock cycle and so to eliminate or identify windows as candidates for full classification at an even faster rate. - It will be appreciated that using an RTC classifier cascade allows the
Prefilter 24 to not alone provide a yes/no decision in relation to any given window, but also a score indicative of the confidence from a detector that a window either includes or does not include an object to be detected. This can be useful for other applications, performing subsequent image processing on a given image, but the information can also be used with theTME 10′ especially if multiple windows are being processed in parallel or if multiple detector cascades are being applied by theclassifier 22 as explained in more detail below. - In any case, for any windows which the
Prefilter 24 does not reject, theclassifier 22 can apply one or more detector cascades. As explained in the above described embodiment, thePrefilter 24 is based on number of RTC stages. Each of the channel values generated as a pixel is read from the system bus are made available to each of theselectors 72 and so each of these can be freely programmed based on the training data set to choose from whichever channel enables thePrefilter 24 to best discriminate between windows which should be rejected before full classification and those which should be subjected to full classification. - In some embodiments, the
classifier 22 can also be based on such RTC stages. However, within theclassifier 22 each stage is applied in sequence, building a score for a window. At each stage of the detector a stage score is added or subtracted to/from the window score, depending on the stage evaluation result. A window score after each stage is compared with a threshold for a stage. While the window score is above the stage threshold, the next detector stage is applied, whereas if the window score is below the stage threshold the detector is abandoned. If the last stage of the detector cascade is reached, the window score is compared with the global threshold of the detector cascade and if the window score is above the global threshold, a match is signaled. - Each stage of the classifier is based on channel values corresponding to three nodes within a window. If no assumptions were made about which channels each node of a decision tree for a stage were to be associated with, then at least 2 successive reads from the same channel might be required before a decision could be taken for a stage (assuming that one 1 sub-branch decision for either
1 or 2 needs to be taken). However, in order to speed up decision making within thenode classifier 22, in embodiments of theclassifier 22 based on RTC decision trees, each stage is restricted to nodes based on different channels. So for example, Node0 for a stage might be based on a HOG value for at a pixel location; Node1 for a stage might be based on an intensity value for a pixel; andNode 2 for a stage might be based on an II value for a pixel. This means that the separate feature memories (channels) for each node can be read in the same clock cycle and compared against their threshold values, as required, and the final score for a stage generated in the minimum of clock cycles—potentially speeding up the performance of theclassifier 22 twofold. - It will also be seen that there are applications where the TME might be required to apply a number of different detectors to any given window. Take for example, a biometric recognition application running on the same device as the
TME 10′ where the application might be required to attempt to recognize a user in one of a number of different poses, for example, front, tilted, left or right side profile. - In such a case, the
detectors buffer 12 could be provided with a plurality of detector cascades, each for a different detector. - Even if a
Prefilter 24 trained to reject windows for which no such detector cascades would be successful were employed i.e. a common rejector, theclassifier 22 might still be required to run a number of full length detector cascades on every window passed by thePrefilter 24. - Referring now to
FIG. 9 , in a further variant ofTME 10″, a programmable prefilter (PPF) 26 is provided in order to control the detectors applied by a modifiedclassifier 22′. Again elements ofFIG. 9 having the same reference numerals as inFIGS. 1 and 2 perform substantially the same function. - The
PPF 26 is provided with a rules engine (not shown) which enables the PPF to determine which detector cascades fromdetectors buffer 12 will be applied or which detectors will be applied in full to any given window. The rules engine is either pre-programmed according to application requirements i.e. hardcoded, or the rules engine can be configured by an application (for example, the biometric recognition application referred to above) by providing the required configuration information across the system bus. - In a first example, the detectors buffer stores 4 full detector cascades. The PPF can apply a first limited number of stages from each cascade, say 12, to a current window. It does this by providing the detector configuration to the
classifier 22′ via abus 27 in a similar fashion to the manner in which theclassifier 22 ofFIGS. 1 and 2 is provided with a detector cascade from thedetectors buffer 12. - The PPF however is also able to communicate with the
classifier 22′ via a window control interface (Win_Ctrl) 30. Thisinterface 30 provides thePPF 26 with a score once each detector cascade is complete. Using the scores from each limited stage detector cascade, the PPF can now decide which further detector cascade might be applied to the current window. This could mean that rather than applying 4 full detector cascades to every window not rejected by a Prefilter 24 (where provided), the classifier might only need to apply 1 full detector cascade following a number of limited stage cascades. It will also be seen that the rules engine could also control whether all of the limited stage detector cascades are indeed applied to a given window—so for example, if a first limited stage detector cascade returned a very high score for a window, thePPF 26 might decide to proceed directly to applying the corresponding full length detector cascade on that window. - The PPF approach becomes even more useful when applied in conjunction with a
classifier 22′ based on RTC stages. Again, using the fact that nodes for each RTC stage have the same relative displacement within windows, means that image pixel information can be read in bursts of say 4 or 8 pixels—similar to the manner described above for thePrefilter 24. Indeed if aPrefilter 24 were being employed with thePPF 26 andclassifier 22′, it would be beneficial if each employed the same burst read size. - Using a burst read, means that detector stages for the
classifier 22′ can be applied for a plurality of successive windows in parallel. In this case, theWin_Ctrl interface 30 enables to PPF to obtain scores from multiple windows in a single clock cycle. - Now, by running a first limited stage detector across a number of windows in parallel, followed by second and subsequent limited stage detectors across the same windows, the results can be used by the PPF to determine to which if any of those parallel windows a full detector cascade should be applied.
- So for example, if from a set of
windows 0 . . . 7 being processed in parallel,windows 1 and 5 returned positive scores for a first limited stage detector, while window 3 returned a very positive score for a second limited stage detector, thePPF 26 could then decide to indicate to theclassifier 22′ via the Win_Ctrl interface that it should only apply a full stage detector corresponding to the second limited stage detector to the windows. - Note that in this case, it makes little difference whether the full stage detector is applied to all of
windows 0 . . . 7 or just to one ofwindows 0 . . . 7 as theclassifier 22′ will not be able to advance to the sequence ofwindows following windows 0 . . . 7 until the full stage detector has completed processing any ofwindows 0 . . . 7. Thus, the information garnered from applying the full stage detector to all of the windows can be used by the PPF to determine the processing to be applied to subsequent windows. - Regardless, the approach of applying a number of limited stage detectors before using their results to determine which of any of a number of full stage detectors is to be applied to a window provides a significant reduction in the time required to check an image for the presence of a number of different types of object—or an object such as a face having a number of potential different appearances.
- Note that while the above embodiments have been described in terms of processing an image, it will be appreciated that the TME of the embodiments may only be concerned with processing a portion of an image. For example, an application running within the system may determine that only a region of interest (ROI) from a complete image might need to be scanned for the presence of objects and so only this portion might be supplied to the
TME 10′, 10″ or else the TME might be signaled to only apply the 22,22′ to a subset of received image data. For example, for biometric recognition based on iris patterns, only areas of an image surrounded by skin portions might be examined by theclassifier 22, 22′.classifier - Alternatively, an image might be provided to the TME in stripes to limit the amount of memory required by the
TME 10′, 10″.
Claims (20)
1. An image processing system comprising a template matching engine (TME) operatively connected to a memory storing image information, the TME being configured to:
read at least a portion of an image from said memory using a raster scan; and
as each pixel of said image portion is being read, calculate a respective feature value of a plurality of feature maps as a function of said pixel value;
the TME further comprising:
a pre-filter responsive to a current pixel location comprising a node within a limited detector cascade to be applied to a window within said portion of an image to:
compare a feature value from a selected one of said plurality of feature maps corresponding to said pixel location to a threshold value; and
responsive to pixels for all nodes within a limited detector cascade to be applied to said window having been read, determine a score for said window based on the comparisons of said feature values and said threshold values for said nodes; and
a classifier, responsive to said pre-filter indicating that a score for a window is below a window threshold, not applying a longer detector cascade to said window before indicating that said window does not comprise an object to be detected.
2. An image processing system as claimed in claim 1 wherein said TME is arranged to sub-sample said image prior to calculating said feature values.
3. An image processing system as claimed in claim 1 wherein said TME is arranged to simultaneously read a plurality of pixels from said memory, each pixel corresponding to given relative pixel location within a sequence of windows, said pre-filter being responsive to said given relative pixel location corresponding to a node within a limited detector cascade to be applied to said sequence of windows, to simultaneously compare respective feature values from a selected one of said plurality of feature maps corresponding to said given relative pixel location to a threshold value; and responsive to pixels for all nodes within said limited detector cascade to be applied to said sequence of windows having been read, to determine respective scores for said sequence of windows based on the comparisons of said feature values and said threshold values for said nodes.
4. An image processing system as claimed in claim 3 wherein said TME is arranged to simultaneously read either 4 or 8 pixels.
5. An image processing system as claimed in claim 1 wherein the pre-filter is configured to reject approximately 95% of windows from needing to be analysed subsequently by said classifier with said longer detector cascade.
6. An image processing system as claimed in claim 1 wherein said classifier is configured to apply a plurality of detector cascades to any window not rejected by said pre-filter.
7. An image processing system as claimed in claim 6 wherein said classifier is configured to apply said plurality of detector cascades successively to one window at a time.
8. An image processing system as claimed in claim 1 wherein said classifier is arranged to simultaneously apply a given detector cascade to a sequence windows.
9. An image processing system as claimed in claim 1 wherein said nodes comprise nodes within successive stages of a multi-stage random tree classifier (RTC).
10. An image processing system as claimed in claim 9 wherein said RTC comprises 12 stages, each stage comprising a 3 node decision tree.
11. An image processing apparatus according to claim 1 wherein said TME is further arranged to provide values for a plurality of the following feature maps based on a pixel value:
an Intensity Image;
an Integral Image (II);
an II2 map;
a Census map;
a LBP (linear binary pattern) map; and
a HOG (Histogram of Gradients) map.
12. An image processing system comprising a template matching engine (TME) operatively connected to a memory storing image information, the TME being configured to:
read at least a portion of an image from said memory using a raster scan; and
as each pixel of said image portion is being read, calculate a respective feature value of at least three feature maps as a function of said pixel value;
the TME comprising:
a classifier arranged to apply at least one detector cascade to a window within a portion of an image in order to indicate if said window comprises an object to be detected, one of said at least one detector cascades comprising a multi-stage random tree classifier (RTC), each stage comprising a decision tree having at least three nodes corresponding to respective pixel locations within said window, said classifier being arranged to compare a feature value from a selected one of said plurality of feature maps corresponding to a pixel location to a threshold value for each of said nodes of said detector cascade, wherein training of said detector cascade is restricted so that selected feature maps for each node of a stage are each different, said classifier being arranged to simultaneously read feature values for each node of a stage from said feature maps and to simultaneously compare each of said feature map values to respective thresholds to determine a score for a stage, and said classifier being arranged to compare an accumulated score for a window after each stage with a stage threshold to determine whether or not to continue with a next stage of said detector cascade.
13. An image processing system as claimed in claim 12 wherein said TME is arranged to simultaneously read a plurality of pixels from said memory, each pixel corresponding to given relative pixel location within a sequence of windows, said classifier being arranged to simultaneously apply a given stage from said detector cascade to each of said sequence of windows.
14. An image processing system as claimed in claim 12 wherein said TME is arranged to sub-sample said image prior to calculating said feature values.
15. An image processing system as claimed in claim 12 wherein said TME is arranged to simultaneously read a plurality of pixels from said memory, each pixel corresponding to given relative pixel location within a sequence of windows, said pre-filter being responsive to said given relative pixel location corresponding to a node within a limited detector cascade to be applied to said sequence of windows, to simultaneously compare respective feature values from a selected one of said plurality of feature maps corresponding to said given relative pixel location to a threshold value; and responsive to pixels for all nodes within said limited detector cascade to be applied to said sequence of windows having been read, to determine respective scores for said sequence of windows based on the comparisons of said feature values and said threshold values for said nodes.
16. An image processing system as claimed in claim 12 wherein said classifier is configured to apply a plurality of detector cascades to any window not rejected by said pre-filter.
17. An image processing apparatus according to claim 12 wherein said TME is further arranged to provide values for a plurality of the following feature maps based on a pixel value:
an Intensity Image;
an Integral Image (II);
an II2 map;
a Census map;
a LBP (linear binary pattern) map; and
a HOG (Histogram of Gradients) map.
18. An image processing system comprising a template matching engine (TME) operatively connected to a memory storing image information, the TME being configured to:
read at least a portion of an image from said memory using a raster scan; and
as each pixel of said image portion is being read, calculate a respective feature value of a plurality of feature maps as a function of said pixel value;
the TME comprising:
a classifier arranged to apply at least one multi-stage detector cascade to a window within a portion of an image in order to indicate if said window comprises an object to be detected, said classifier being arranged to compare an accumulated score for a window after each stage with a stage threshold to determine whether or not to continue with a next stage of said detector cascade; and
a programmable controller, said controller being arranged to provide said classifier with a plurality of limited stage detector cascades to be successively applied to a window, the programmable controller being arranged to receive a respective accumulated score for each limited stage detector cascade from said classifier and to apply rules from a rules engine to determine which of a plurality of longer detector cascades corresponding to said limited stage detector cascades are to be applied to said window to enable said classifier to indicate if said window comprises an object to be detected.
19. An image processing system as claimed in claim 18 wherein said TME is arranged to simultaneously read a plurality of pixels from said memory, each pixel corresponding to given relative pixel location within a sequence of windows, said classifier being arranged to simultaneously apply a given stage from a limited stage detector cascade to said sequence of windows, said controller being arranged to simultaneously receive from said classifier respective accumulated scores for said sequence of windows before determining which longer detector cascades corresponding to said limited stage detectors should be applied to said sequence of windows.
20. An image processing system as claimed in claim 19 wherein at least one of said plurality of detector cascades comprises a multi-stage random tree classifier (RTC), each stage comprising a decision tree having at least three nodes corresponding to respective pixel locations within said window, said classifier being arranged to compare a feature value from a selected one of said plurality of feature maps corresponding to said pixel location to a threshold value for each of said nodes.
Priority Applications (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/660,559 US20200050885A1 (en) | 2015-12-23 | 2019-10-22 | Image processing system |
| US16/995,451 US11532148B2 (en) | 2015-12-23 | 2020-08-17 | Image processing system |
Applications Claiming Priority (5)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201562387247P | 2015-12-23 | 2015-12-23 | |
| EPPCT/EP2016/074519 | 2016-10-12 | ||
| PCT/EP2016/074519 WO2017108222A1 (en) | 2015-12-23 | 2016-10-12 | Image processing system |
| US15/380,906 US10460198B2 (en) | 2015-12-23 | 2016-12-15 | Image processing system |
| US16/660,559 US20200050885A1 (en) | 2015-12-23 | 2019-10-22 | Image processing system |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US15/380,906 Continuation US10460198B2 (en) | 2015-12-23 | 2016-12-15 | Image processing system |
Related Child Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US16/995,451 Continuation US11532148B2 (en) | 2015-12-23 | 2020-08-17 | Image processing system |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20200050885A1 true US20200050885A1 (en) | 2020-02-13 |
Family
ID=59088527
Family Applications (3)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US15/380,906 Active 2037-10-26 US10460198B2 (en) | 2015-12-23 | 2016-12-15 | Image processing system |
| US16/660,559 Abandoned US20200050885A1 (en) | 2015-12-23 | 2019-10-22 | Image processing system |
| US16/995,451 Active 2037-08-10 US11532148B2 (en) | 2015-12-23 | 2020-08-17 | Image processing system |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US15/380,906 Active 2037-10-26 US10460198B2 (en) | 2015-12-23 | 2016-12-15 | Image processing system |
Family Applications After (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US16/995,451 Active 2037-08-10 US11532148B2 (en) | 2015-12-23 | 2020-08-17 | Image processing system |
Country Status (1)
| Country | Link |
|---|---|
| US (3) | US10460198B2 (en) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112232215A (en) * | 2020-10-16 | 2021-01-15 | 哈尔滨市科佳通用机电股份有限公司 | Railway wagon coupler yoke key joist falling fault detection method |
| US11532148B2 (en) | 2015-12-23 | 2022-12-20 | Fotonation Limited | Image processing system |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10528840B2 (en) | 2015-06-24 | 2020-01-07 | Stryker Corporation | Method and system for surgical instrumentation setup and user preferences |
| WO2019227294A1 (en) * | 2018-05-28 | 2019-12-05 | 华为技术有限公司 | Image processing method, related device and computer storage medium |
| CN113744268B (en) * | 2021-11-04 | 2022-04-22 | 深圳市城市交通规划设计研究中心股份有限公司 | Crack detection method, electronic device and readable storage medium |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090018985A1 (en) * | 2007-07-13 | 2009-01-15 | Microsoft Corporation | Histogram-based classifiers having variable bin sizes |
Family Cites Families (22)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US7263220B2 (en) | 2003-02-28 | 2007-08-28 | Eastman Kodak Company | Method for detecting color objects in digital images |
| US8503800B2 (en) | 2007-03-05 | 2013-08-06 | DigitalOptics Corporation Europe Limited | Illumination detection using classifier chains |
| US7315631B1 (en) | 2006-08-11 | 2008-01-01 | Fotonation Vision Limited | Real-time face tracking in a digital image acquisition device |
| US7590267B2 (en) | 2005-05-31 | 2009-09-15 | Microsoft Corporation | Accelerated face detection based on prior probability of a view |
| JP4556891B2 (en) * | 2006-03-17 | 2010-10-06 | ソニー株式会社 | Information processing apparatus and method, recording medium, and program |
| US7997496B2 (en) * | 2007-01-16 | 2011-08-16 | Scheir Peter L | Laminated printable multi-layer card with entrapped security element |
| US8010471B2 (en) | 2007-07-13 | 2011-08-30 | Microsoft Corporation | Multiple-instance pruning for learning efficient cascade detectors |
| CN100560025C (en) | 2007-07-26 | 2009-11-18 | 上海交通大学 | Face Detection Method with Combined Coefficients of Weak Classifiers |
| JP4948379B2 (en) * | 2007-12-18 | 2012-06-06 | キヤノン株式会社 | Pattern discriminator generation method, information processing apparatus, program, and storage medium |
| JP4710979B2 (en) | 2009-01-09 | 2011-06-29 | ソニー株式会社 | Object detection device, learning device, object detection method and program |
| JP5340088B2 (en) | 2009-09-07 | 2013-11-13 | キヤノン株式会社 | Information processing method and apparatus |
| US8860715B2 (en) * | 2010-09-22 | 2014-10-14 | Siemens Corporation | Method and system for evaluation using probabilistic boosting trees |
| JP5950441B2 (en) | 2012-02-01 | 2016-07-13 | 日本電産エレシス株式会社 | Image recognition apparatus, image recognition method, and image recognition program |
| JP2013161126A (en) * | 2012-02-01 | 2013-08-19 | Honda Elesys Co Ltd | Image recognition device, image recognition method, and image recognition program |
| US8995772B2 (en) | 2012-11-09 | 2015-03-31 | Microsoft Technology Licensing, Llc | Real-time face detection using pixel pairs |
| CN104200236B (en) | 2014-08-22 | 2018-10-26 | 浙江生辉照明有限公司 | Fast target detection method based on DPM |
| EP3058510B1 (en) | 2014-11-28 | 2017-12-20 | FotoNation Limited | A method for producing a histogram of oriented gradients |
| US9697443B2 (en) * | 2014-12-11 | 2017-07-04 | Intel Corporation | Model compression in binary coded image based object detection |
| CN104715023B (en) | 2015-03-02 | 2018-08-03 | 北京奇艺世纪科技有限公司 | Method of Commodity Recommendation based on video content and system |
| US10460198B2 (en) | 2015-12-23 | 2019-10-29 | Fotonation Limited | Image processing system |
| EP3213257B1 (en) | 2015-12-23 | 2018-08-01 | FotoNation Limited | Image processing system |
| WO2017156772A1 (en) * | 2016-03-18 | 2017-09-21 | 深圳大学 | Method of computing passenger crowdedness and system applying same |
-
2016
- 2016-12-15 US US15/380,906 patent/US10460198B2/en active Active
-
2019
- 2019-10-22 US US16/660,559 patent/US20200050885A1/en not_active Abandoned
-
2020
- 2020-08-17 US US16/995,451 patent/US11532148B2/en active Active
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20090018985A1 (en) * | 2007-07-13 | 2009-01-15 | Microsoft Corporation | Histogram-based classifiers having variable bin sizes |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US11532148B2 (en) | 2015-12-23 | 2022-12-20 | Fotonation Limited | Image processing system |
| CN112232215A (en) * | 2020-10-16 | 2021-01-15 | 哈尔滨市科佳通用机电股份有限公司 | Railway wagon coupler yoke key joist falling fault detection method |
Also Published As
| Publication number | Publication date |
|---|---|
| US20200380291A1 (en) | 2020-12-03 |
| US20170185864A1 (en) | 2017-06-29 |
| US11532148B2 (en) | 2022-12-20 |
| US10460198B2 (en) | 2019-10-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11532148B2 (en) | Image processing system | |
| US12394036B2 (en) | Information processing device, information processing method, and storage medium | |
| EP3213257B1 (en) | Image processing system | |
| US8548201B2 (en) | Apparatus and method for recognizing identifier of vehicle | |
| CN112348787B (en) | Training method of object defect detection model, object defect detection method and device | |
| US9245198B2 (en) | Object recognition by comparison of patterns against map of image | |
| US10679358B2 (en) | Learning image automatic sorting device, learning image automatic sorting method, and learning image automatic sorting program | |
| CN101339609B (en) | Image processing apparatus and image processing method | |
| KR102195940B1 (en) | System and Method for Detecting Deep Learning based Human Object using Adaptive Thresholding Method of Non Maximum Suppression | |
| KR101244220B1 (en) | Apparatus and method for recognizing fingerprint | |
| US9965695B1 (en) | Document image binarization method based on content type separation | |
| US8363983B2 (en) | Real-time face detection apparatus | |
| KR20130072073A (en) | Apparatus and method for extracting edge in image | |
| CN111656397B (en) | Reading system, reading method and storage medium | |
| US20100226578A1 (en) | Object detecting apparatus, and object detecting method | |
| KR20200010658A (en) | Method for identifing person, computing system and program using the same | |
| US7643674B2 (en) | Classification methods, classifier determination methods, classifiers, classifier determination devices, and articles of manufacture | |
| CN108664853B (en) | Face detection method and device | |
| CN117474916A (en) | Image detection method, electronic equipment and storage medium | |
| KR102430120B1 (en) | Apparatus and method for improving the performance of stereo-based ROI detection algorithm | |
| JP2009059047A (en) | Device, method and program for detecting object | |
| KR20220067818A (en) | System for diagnosing skin using non-contact type rgb camera and skin diagnosis method thereof | |
| CN117746477B (en) | Outdoor face recognition method and device, electronic equipment and storage medium | |
| EP4428819A1 (en) | Device and method for enhancing tracking of objects in a scene captured in a video sequence | |
| JP6211473B2 (en) | Object recognition device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: FOTONATION LIMITED, IRELAND Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NICOARA, NICOLAE;RACEALA, CRISTINA;ZAHARIA, CORNELIU;AND OTHERS;SIGNING DATES FROM 20161214 TO 20161215;REEL/FRAME:050795/0218 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |