US20090106699A1 - Image processing device and image processing method - Google Patents
Image processing device and image processing method Download PDFInfo
- Publication number
- US20090106699A1 US20090106699A1 US12/208,486 US20848608A US2009106699A1 US 20090106699 A1 US20090106699 A1 US 20090106699A1 US 20848608 A US20848608 A US 20848608A US 2009106699 A1 US2009106699 A1 US 2009106699A1
- Authority
- US
- United States
- Prior art keywords
- display
- image
- thumbnail
- unit
- preference
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0481—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
- G06F3/04817—Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance using icons
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/5846—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
- G06F16/5854—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using shape and object relationship
Definitions
- the present invention relates to displaying of thumbnail images and searching of images from images in an image database.
- a thumbnail image, or a thumbnail is a shrunk image of a still image, obtained by thinning out pixels from the image. Displaying the thumbnails in the form of a list on a screen enables a user to conveniently browse through a plurality of thumbnails at a time, and to efficiently search for a desired image.
- Some technologies enable the user to easily obtain information about the original images from displayed thumbnails.
- a method is disclosed in Japanese Patent Application Laid-open No. 2001-337994 whereby information pertaining to an original image, such as the file name, creation date, modification date, and security level are stored as tag data in an associated form with the thumbnail of the image.
- tag data associated with the thumbnail is retrieved and displayed alongside.
- an object display mode is provided to enable a thumbnail display of an object in an image when a particular part of the image such as a person or a number plate is specified. This technology thus helps to reduce the labor and time involved in searching for objects in photographic images.
- a display method is preferable that not only enables the user to obtain the attributes of the original image but also the relation between the displayed thumbnails.
- image map a map based on the attributes of the thumbnails.
- image map a group of thumbnails that have similar attributes are grouped together and displayed on the screen.
- the advantage of an image map is that the user can visually identify the thumbnail that he/she is seeking.
- Technologies disclosed in Japanese Patent No. 3614235, Japanese Patent Application Laid-open No. 2005-55743, and Japanese Patent Application Laid-open No. 2005-235041 relate to image map display methods. For example, in the technology disclosed in Japanese Patent No. 3614235, feature quantity of features such as color, form, size, type, and keyword are extracted from a display target image, and feature quantity vectors are created.
- the feature quantity vectors are then projected on two-dimensional coordinate axes using a self-organizing map.
- the density of data is changed and a plurality of screens is lined up in a depth direction to change the view to a three-dimensional view, thus enabling easy search of the required image.
- the attributes of the all the display target images are retrieved, and a center is set on the screen for each attribute value.
- the attributes of the display target images are retrieved from the concerned images, and the thumbnails of the images are arranged in the vicinity of the center of the attribute values of the images.
- the thumbnails of the images having similar attribute values are displayed together.
- N-dimensional feature quantity is extracted from the image data, and a new two-dimensional feature quantity is extracted by multivariate statistical analysis. Further, display position and display size are determined based on clustering data.
- attributes are used as the key when searching for a photographic image or a graphical image displayed as thumbnails.
- search can be performed effectively by looking for objects included in the image and filtering out those images that do not include the objects from the search.
- Search keywords input by the user and read by an optical character reader (OCR) are generally used when searching for an article or a business document.
- OCR optical character reader
- the article or business document can still be searched from the thumbnails of drawings, tables, and captions by activating the visual feature “include drawings, tables, and captions”.
- the method of searching through the thumbnails of a specific object, if the document concerned includes drawings, etc. is more effective than searching the text.
- a search method that uses a specific object contained in the image is often more effective.
- the display position is determined image-by-image in the image map display methods disclosed in the technologies described above. Therefore, if the specified object is not one of the principal objects of the image, the object cannot be easily extracted as an attribute of the image.
- an image processing device including a classifying unit that analyzes a display target image and classifies an object included in the display target image based on an attribute of the object; a feature-data retrieving unit that retrieves feature data that indicates a feature of the object; a display-method determining unit that sets, based on the feature data, at least one of a display position and a display size of a thumbnail of the display target image that includes the object; and a thumbnail generating unit that generates the thumbnail of the display target image.
- the thumbnail generating unit creates a thumbnail list image by at least one of generating the thumbnail in the display size set by the display-method determining unit and arranging the thumbnail in the display position set by the display-method determining unit.
- an image processing device including a classifying unit that analyzes a display target image and classifies an object included in the display target image based on an attribute of the object; a preference setting unit that sets an order of preference of the attributes; a display-method determining unit that sets, based on the order of preference, at least one of a display position and a display size of a thumbnail of the display target image that includes the object; and a thumbnail generating unit that generates the thumbnail of the display target image.
- the thumbnail generating unit creates a thumbnail list image by at least one of generating the thumbnail in the display size set by the display-method determining unit and arranging the thumbnail in the display position set by the display-method determining unit.
- FIG. 1 is a block diagram of an image processing device according to a first embodiment of the present invention
- FIG. 2 is a listing of object attributes classified by an object analyzing unit shown in FIG. 1 ;
- FIG. 3 is a flowchart of a thumbnail list image display process performed by the image processing device shown in FIG. 1 ;
- FIG. 4 is a schematic diagram of a thumbnail list image of regular textual images
- FIG. 5 is a magnified view of an object attribute “drawing” denoted by the reference symbol ZM 1 and the thumbnails around it;
- FIG. 6 is a flowchart of a thumbnail list image display process performed by an image processing device according to a second embodiment of the present invention.
- FIG. 7 is a schematic diagram of a thumbnail list image of regular textual images input as display target images
- FIG. 8 is a magnified view of an object attribute “drawing” denoted by the reference symbol ZM 1 ′ and the thumbnails around it;
- FIG. 9 is a flowchart of a thumbnail list image display process performed by an image processing device according to a third embodiment of the present invention.
- FIG. 10 is a schematic diagram of a thumbnail list image in which the display sizes of the thumbnails have been varied depending on an order of preference.
- FIG. 11 is a flowchart of a thumbnail list image display process performed by an image processing device according to a fourth embodiment of the present invention.
- FIG. 1 is a block diagram of an image processing device 100 according to a first embodiment of the present invention.
- the image processing device 100 includes an input unit 101 , a display unit 102 , a control unit 103 , and a storage unit 104 .
- the input unit 101 can be a keyboard and/or a pointing device such as a mouse, and is used for inputting search condition specification, adding new search conditions, and modifying existing search conditions.
- the display unit 102 can be a liquid crystal display or a cathode ray tube (CRT), and displays thumbnails of images based on the search condition as well as instruction request or instruction result of the input unit 101 .
- CTR cathode ray tube
- the storage unit 104 can be a hard disk, and stores therein images captured by an image capturing device 110 such as a camera, or images of documents such as conference material scanned by a scanner.
- the storage unit 104 also stores therein the thumbnails and object data of the images as folders F 1 to Fn.
- Object data is data pertaining to captions in an image, objects such as drawings, tables, and graphs in a text document, and objects such as portions of an image in a photographic image. Each piece of object data has a corresponding attribute data. Attribute data can be position data that indicates the position of the object, or image attribute quantity obtained by quantification of image attributes of the object, or the pixel size of the object.
- the storage unit 104 also stores therein, corresponding to each image, image identification data such as the pixel size and image ID or image name.
- the control unit 103 includes a central processing unit (CPU), a read-only memory (ROM), a random access memory (RAM), etc.
- the control unit 103 causes the image processing device 100 to realize various functions by executing various programs stored in the ROM.
- the functions realized by the image processing device 100 include those of the object analyzing unit 103 A, a display-method determining unit 103 B, and a thumbnail generating unit 103 C shown in FIG. 1 .
- the object analyzing unit 103 A reads the display target image from the storage unit 104 , analyzes the image, extracts the attribute of the object (object attribute) in the image, classifies the objects by object attribute, associates the object attribute to the object, and adds it to the object data.
- the object analyzing unit 103 A can use the method disclosed in Japanese Patent Application Laid-open No. 2005-78149 to perform image analysis and object extraction. According to this method, the object analyzing unit 103 A divides an image into progressively smaller and smaller areas, and optimizes area boundaries pixel-by-pixel by deploying the Markov random field model that is applicable in spatial frequency domain. Specifically, the object analyzing unit 103 A extracts texture attributes of various scales, and divides the image into areas based on the texture attributes. The object analyzing unit 103 A then limits its scanning to designated areas and carves out the area of the object with predominant texture attribute. Applying the Markov random field model to the carved out area and the areas surrounding it, the object analyzing unit 103 A optimizes the boundary shape between the areas.
- the object analyzing unit 103 A can use the method disclosed in Japanese Patent Application Laid-open No. 2006-39658 for determining object attributes. Though the method disclosed in the patent application is for determining image attributes, it can be adapted for determining object attributes.
- FIG. 2 is a listing of the object attributes classified by the object analyzing unit 103 A.
- Drawings, tables, graphs, captions, etc. are examples of object attributes in a textual image.
- People, plants, animals, articles, etc. are examples of object attributes in a photographic or graphical image.
- the display-method determining unit 103 B determines, for each object attribute classified by the object analyzing unit 103 A, the display positions of the thumbnails of the images that contain the object.
- the thumbnail generating unit 103 C generates the thumbnails of the images that contain the object, and outputs to the display unit 102 a process result in the form of a thumbnail list image in which each thumbnail is arranged in the display position determined by the display-method determining unit 103 B.
- FIG. 3 is a flowchart of a thumbnail list image display process performed by the image processing device 100 .
- the object analyzing unit 103 A retrieves a display target image from the storage unit 104 (Step S 1 ), analyzes the image to extract objects from the image, determines object attributes of the objects, and classifies the objects based on their object attributes (Step S 2 ).
- the object analyzing unit 103 A then retrieves the position data that indicates the position of each object in the original display target image (Step S 3 ).
- the object analyzing unit 103 A determines a center of gravity of each object from the coordinates of the upper left and lower right corners of the original display target image, and retrieves coordinates of the center of gravity of an object as the position data of the object.
- the display-method determining unit 103 B determines the display positions of thumbnails of the display target image that contain the objects (Step S 4 ). Assuming that the size of an object is m ⁇ n pixels, the upper left corner of the object is the origin, the size of the display target image is M ⁇ N pixels, and the coordinates of the center of gravity of the object in the display target image is (fx,fy), then a display position (x,y) of a thumbnail that is to contain the object can be determined from Equation (1):
- the display-method determining unit 103 B checks whether the process has been completed for all the objects in the display target image (Step S 5 ). If there is an unprocessed object (No at Step S 5 ), Steps S 3 to S 5 are repeated for the unprocessed object. If there is no unprocessed object (Yes at Step S 5 ), the thumbnail generating unit 103 C generates a thumbnail of the display target image (Step S 6 ), and checks whether the process has been completed for all the display target images (Step S 7 ).
- Step S 7 If there is no unprocessed display target image (Yes at Step S 7 ), the thumbnail generating unit 103 C creates a thumbnail list image in which the thumbnails generated at step S 6 are placed at the display positions determined at step S 4 , and outputs the thumbnail list image to the display unit 102 (Step S 8 ). If there is an unprocessed display target image (No at Step S 7 ), Step S 1 to S 7 are repeated for the unprocessed display target image.
- the thumbnail generating unit 103 C generates the thumbnail list image by arranging the thumbnails on the image map.
- the display areas of the object attributes in the image map can be preset or can be made changeable with each image according to the number of images, image attribute, or object attribute.
- FIG. 4 is a schematic diagram of a thumbnail list image of a typical textual image. Thumbnails are grouped according to the object attribute, and thumbnails that contain objects of a common object attribute are arranged around each attribute name indicating the object attribute. For example, in FIG. 4 , thumbnails SM 1 - 1 to SM 1 - 7 of images that contain objects belonging to an object attribute “drawing” are arranged around the object attribute “drawing” denoted by the reference symbol ZM 1 .
- FIG. 5 is a magnified view of a portion around the object attribute “drawing” in FIG. 4 . The display position of each of the thumbnails SM 1 - 1 to SM 1 - 7 is determined by the position the object belonging to the object attribute occupies in the original image.
- the display target image includes a plurality of objects of different object attributes, as many thumbnails of the display target image as the object it includes are displayed on the display unit 102 .
- the display target image includes the object of the object attribute “drawing” as well as the object of the object attribute “text”, one thumbnail of the image will be displayed in the vicinity of the attribute name “drawing” and one thumbnail of the image will be displayed in the vicinity of the attribute name “text” on the display unit 102 (for example, the thumbnail SM 1 - 6 in FIG. 4 ).
- the thumbnail list image enables the user to refine the search by using an object as the search key.
- the user can easily guess the display position of the thumbnail of the target image by recalling the general position of the object in the image. Thus, the user can perform search operations more effectively.
- the display method of the thumbnail list image can be a simple thumbnail display.
- the target object can be displayed in high resolution or in highlight.
- Another display method can be that only the target object is displayed.
- Yet another display method can be to display the object within an outline of the image to give a general idea as to the position of the object in the image.
- a self-organizing map can also be output by the process described above.
- the image processing device 100 retrieves the object attributes and the position data of the objects of the display target image and, based on these, creates feature quantity vectors.
- the image processing device 100 can project from the feature quantity space on to a two-dimensional plane, thus automating determination of the display positions of the objects.
- the display target image is not limited to a still image; it can be a three-dimensional image or a moving image.
- the image processing device 100 determines the display position of the thumbnail of the image that contains the object based on the center of gravity of the object and the pixel size of the original image, and arranges the thumbnail in the display area three-dimensionally.
- a time axis (t) is added to the position axes to obtain the coordinate values (fx,fy,t).
- the thumbnails of the moving image can be displayed and made to play at the same positions where the thumbnails of the two-dimensional image would have been displayed or the thumbnails of the moving image can be displayed in the display area three-dimensionally.
- the object analyzing unit 103 A can identify and retrieve the display target images at Step S 1 based on a search condition input via the input unit 101 .
- An image processing device according to a second embodiment of the present invention is described below.
- the parts of the image processing device according to the second embodiment that are similar to those of the first embodiment have been assigned the same reference numerals and symbols, and the explanation thereof is repeated in some cases and not repeated in others.
- the display-method determining unit 103 B determines the display position of the thumbnail based on the position of the object in the original image. In the second embodiment, the display-method determining unit 103 B determines the display position and display size of the thumbnail of the image that contains the object based on the image feature quantity of the object.
- Image feature quantity is obtained by quantification of image features such as texture, color, and edge distribution. For image feature quantity, common object attribute is not essential but it is preferable to use the image feature quantity that allows wide distribution of the objects.
- FIG. 6 is a flowchart of a thumbnail list image display process performed by the image processing device 100 according to the second embodiment.
- the steps in FIG. 6 that are identical to those in FIG. 3 are not described again. Therefore, Steps S 1 , S 2 , and S 5 to S 8 are not described here.
- the display-method determining unit 103 B retrieves, for each object classified by object attribute by the object analyzing unit 103 A, the image feature quantity included in the object data stored in the storage unit 104 (Step S 3 A). Based on the image feature quantity, the display-method determining unit 103 B determines the display position and the display size of the thumbnail of the image that contains the object (Step S 4 A).
- a self-organizing map can be used whereby feature vectors are created from object attributes and image feature quantity and projection is made from the feature quantity space on to the two-dimensional plane.
- the image feature quantity of the image that contains the object belonging to the object attribute is reflected in the display position of the thumbnail of the image.
- a method disclosed in Japanese Patent Application Laid-open No. 2006-303707 can be used. According to the method, even if an image includes an object that cannot be easily identified unless the display size is large, an appropriate display size can be determined so as to enable the user to recognize the object.
- FIG. 7 is a schematic diagram of a thumbnail list image of a typical textual image input as display target images. Thumbnails are grouped according to object attribute, and thumbnails that contain objects of a common object attribute are arranged around each attribute name indicating the object attribute. For example, in FIG. 7 , thumbnails SM 1 - 1 ′ to SM 1 - 7 ′ of the images that contain the object belonging to the object attribute “drawing” are arranged around the object attribute “drawing” denoted by the reference symbol ZM 1 ′.
- FIG. 8 is a magnified view of the object attribute “drawing” denoted by the reference symbol ZM 1 ′ and the thumbnails SM 1 - 1 ′ to SM 1 - 7 ′ around it.
- the display position of each of the thumbnails SM 1 - 1 ′ to SM 1 - 7 ′ is determined by the color density of the target object.
- the display size of each thumbnail is set based on the image feature quantity of the object so that the user can easily identify the object. Therefore, as against the display sizes of the thumbnails SM 1 - 1 to SM 1 - 7 shown in FIG. 5 being uniform, the display sizes of the thumbnails SM 1 - 1 ′ to SM 1 - 7 ′ shown in FIG. 7 are all different.
- the configuration described above allows the object serving as the search key and the objects with similar image feature as that of the search key to be placed close together, with their display sizes adjusted according to the image feature such as frequency data, thus enabling the user to effectively find the desired image data from the thumbnail list image based on the object attribute and image feature quantity.
- data such as layout data that indicates the layout of objects and configuration data, which is data obtained by quantification of object configuration when an object is further broken down, can be used for determining the thumbnail display position.
- An image processing device according to a third embodiment of the present invention is described below.
- the parts of the image processing device according to the third embodiment that are similar to those of the first and second embodiments have been assigned the same reference numerals and symbols, and the explanation thereof is repeated in some cases and not repeated in others.
- an order of preference is set for object attributes, and the display-method determining unit 103 B displays or hides the thumbnails of the images that contain the object according to the order of preference.
- the order of preference of the object attributes is input by the user via the input unit 101 and stored as part of object data in the storage unit 104 .
- FIG. 9 is a flowchart of a thumbnail list image display process performed by the image processing device 100 according to the third embodiment. Steps S 1 , S 2 , and S 3 A are identical to those described with reference to the second embodiment.
- the display-method determining unit 103 B retrieves the order of preference of the object attributes included in the object data stored in the storage unit 104 . The display-method determining unit 103 B then determines whether the retrieved order of preference is greater than a preset integer I of one or greater, and is stored in advance in the storage unit 104 .
- the display-method determining unit 103 B determines the display positions and the display sizes of the thumbnails of the images that contain the object based on the order of preference (Step S 4 A).
- the display-method determining unit 103 B sets more centrally the display position or sets higher in the display order the thumbnail of the image containing the object of the object attribute that is higher in the order of preference.
- the display-method determining unit 103 B sets the display size of the thumbnails such that the higher the object attribute is in the order of preference, the larger the thumbnail of the image containing the object.
- the subsequent steps S 5 to S 8 are identical to those described with reference to the second embodiment. If the order of preference is not greater than I (No at Step SB 2 ), the display-method determining unit 103 B repeats Steps 3 A to SB 2 for the next object.
- FIG. 10 is a schematic diagram of a thumbnail list image in which the thumbnails are displayed in different sizes according to the order of preference.
- the thumbnails are grouped according to the object attribute, and the thumbnails of the images that contain the object of an object attribute are arranged around each attribute name indicating the object attribute.
- the higher the object attribute is in the order of preference the larger the display size is of the thumbnails of the images around the object attribute. If two thumbnails overlap, a thumbnail of higher object attribute can be displayed on top of the other thumbnail.
- thumbnails to be displayed according to the order of preference. Further, the thumbnails of the images that do not include an object of high preference are not displayed, making the search that much finer.
- Preferred thumbnail list image can be obtained as the thumbnails images containing the object of object attribute that is higher in the order of preference are displayed large.
- the configuration enables the thumbnails of the images containing an object of object attribute that is higher in the order of preference to be displayed more centrally in the area of the thumbnail list image that falls in the field of vision of the user, making it more convenient for the user to pick out the desired image more quickly.
- the configuration enables adjustment of the display size of the thumbnails according to the order of preference.
- the thumbnails of the images containing the object of object attribute that is high in the order of preference is displayed in highlight, enabling the user to easily identify the desired image.
- the configuration enables determination of display order of the thumbnails according to the order of preference.
- the thumbnails of the images are displayed beginning with the object attribute of highest preference, shortening the search time and enabling the thumbnails to be displayed in the display order that is suited to the user's intent.
- the display-method determining unit 103 B can be configured to do away with step SB 2 , and directly determine, according to the order of preference, the display positions and the display sizes for all the objects in the display target image.
- the display-method determining unit 103 B can be configured to do away with step S 3 A and directly go to step SB 1 after step S 2 . In this case, at least the display position or the display size can be determined.
- An image processing device according to a fourth embodiment is described below.
- the parts of the image processing device according to the fourth embodiment that are similar to those of the second and third embodiments have been assigned the same reference numerals and symbols, and the explanation thereof is repeated in some cases and not repeated in others.
- the display-method determining unit 103 B sets the order of preference of an object attribute based on a query image specified as a search key for searching similar images, and determines the display position and the display size of the thumbnail based on the order of preference. It is supposed here that only one query image is specified as a search key. Specifically, the display-method determining unit 103 B sets the order of preference of the object attribute of the objects contained in the query image by placing the object having the largest pixel size highest in the order of preference.
- the query images are stored in advance in the storage unit 104 , and the user specifies a query image via the input unit 101 .
- the object analyzing unit 103 A identifies the target image based on the image identification data the user inputs via the input unit 101 , and retrieves the image as the query image.
- FIG. 11 is a flowchart of a thumbnail list image display process performed by the image processing device 100 according to the fourth embodiment.
- the object analyzing unit 103 A of the image processing device 100 retrieves the query image (Step SC 1 ), analyzes the query image, extracts the objects, and classifies the objects by object attribute (Step SC 2 ).
- the object analyzing unit 103 A then retrieves the image feature quantity of each object (Step SC 3 ).
- image feature quantity the object analyzing unit 103 A retrieves the pixel size of each object.
- the object analyzing unit 103 A checks whether the process has been completed for all the objects (Step SC 4 ).
- the object analyzing unit 103 A repeats steps SC 2 and SC 3 . If the process has been completed for all the objects (Yes at Step SC 4 ), the object analyzing unit 103 A checks whether the process has been completed for all the query images (Step SC 5 ). If the process has been completed for all the query images (Yes at Step SC 5 ), the display-method determining unit 103 B sets the order of preference of the object attribute, placing the object that has the largest pixel size highest in the order of preference (Step SC 6 ). If there are several objects of the same attribute in the query image, the object that has the largest pixel size is set as the representative object of the object attribute.
- Step S 4 A the display-method determining unit 103 B determines the display position and the display size of the thumbnail, placing more centrally or setting higher in the display order, and displaying in larger size the thumbnail of the image containing the object belonging to the object attribute that is higher in the order of preference.
- the order of preference can be set based on a query image, enabling the user to perform search based on any vague image feature.
- the thumbnails of the images containing the object of object attribute that is high in the order of preference is displayed in highlight, enabling the user to easily identify the desired image.
- the order of preference can also be set based on the number of object attributes in the query image, setting highest preference to the object attribute of the object that has highest count.
- the display-method determining unit 103 B sets the order of preference of the object attributes by one of the methods described below:
- the display-method determining unit 103 B calculates the distribution of the image feature quantity of each object contained in the query images, and sets the object attribute of the object that has sparser distribution higher in the order of preference.
- the thumbnails sought by the user can be displayed more effectively.
- the image processing device 100 can be configured such that the computer programs executed by it can be downloaded over a network, such as the Internet, from a computer connected to the network.
- the programs can be made available as installable files or executable files on a computer-readable recording medium such as compact disk-read-only memory (CD-ROM), flexible disk (FD), compact disk-rewritable (CD-R), and digital versatile disk (DVD).
- CD-ROM compact disk-read-only memory
- FD flexible disk
- CD-R compact disk-rewritable
- DVD digital versatile disk
- the display target image and the query image are read from the storage unit 104 .
- the image processing device 100 can be configured to retrieve the images from a computer connected to the Internet or from a computer-readable recording medium such as CD-ROM, FD, CD-R, and DVD.
- the image processing device 100 can be a computer, copier, printer, facsimile machine, or a multifunction peripheral equipped with functions such as copying, printing function, and facsimile.
- the image processing device 100 in the embodiments described above includes the input unit 101 and the display unit 102 .
- the input unit 101 and the display unit 102 can be done away with, and the image processing device can be connected to the outside by a cable or wirelessly.
- image search can be performed effectively by focusing on specific objects of an image such as captions, drawings, tables, and graphs accompanying text, partial images of a photographic image, etc.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Library & Information Science (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Human Computer Interaction (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Processing Or Creating Images (AREA)
- Editing Of Facsimile Originals (AREA)
- Controls And Circuits For Display Device (AREA)
Abstract
An object analyzing unit analyzes images read from a storage unit and classifies objects contained in the images by object attribute. A display-method determining unit determines display positions of thumbnails of the images containing each object for every object attribute classified by the object analyzing unit. A thumbnail generating unit generates the thumbnails of the images containing each object, and outputs a thumbnail list image by placing the thumbnails at the display positions determined by the display-method determining unit.
Description
- The present application claims priority to and incorporates by reference the entire contents of Japanese priority document 2007-274699 filed in Japan on Oct. 23, 2007.
- 1. Field of the Invention
- The present invention relates to displaying of thumbnail images and searching of images from images in an image database.
- 2. Description of the Related Art
- There exist technologies that enable displaying of thumbnail images to make it convenient to browse through a large number of images stored in a storage device and search for a desired image. A thumbnail image, or a thumbnail, is a shrunk image of a still image, obtained by thinning out pixels from the image. Displaying the thumbnails in the form of a list on a screen enables a user to conveniently browse through a plurality of thumbnails at a time, and to efficiently search for a desired image.
- Some technologies enable the user to easily obtain information about the original images from displayed thumbnails. For example, a method is disclosed in Japanese Patent Application Laid-open No. 2001-337994 whereby information pertaining to an original image, such as the file name, creation date, modification date, and security level are stored as tag data in an associated form with the thumbnail of the image. When the thumbnail is displayed, the tag data associated with the thumbnail is retrieved and displayed alongside. Similarly, in a technology disclosed in Japanese Patent Application Laid-open No. 2006-277409, an object display mode is provided to enable a thumbnail display of an object in an image when a particular part of the image such as a person or a number plate is specified. This technology thus helps to reduce the labor and time involved in searching for objects in photographic images.
- When searching for an image, users often give an attribute of the image they are searching for in the search query. In this case, a display method is preferable that not only enables the user to obtain the attributes of the original image but also the relation between the displayed thumbnails.
- There exist technologies that improve search efficiency by displaying thumbnails in the form of a map (hereinafter, “image map”) based on the attributes of the thumbnails. In an image map, a group of thumbnails that have similar attributes are grouped together and displayed on the screen. The advantage of an image map is that the user can visually identify the thumbnail that he/she is seeking. Technologies disclosed in Japanese Patent No. 3614235, Japanese Patent Application Laid-open No. 2005-55743, and Japanese Patent Application Laid-open No. 2005-235041 relate to image map display methods. For example, in the technology disclosed in Japanese Patent No. 3614235, feature quantity of features such as color, form, size, type, and keyword are extracted from a display target image, and feature quantity vectors are created. The feature quantity vectors are then projected on two-dimensional coordinate axes using a self-organizing map. The density of data is changed and a plurality of screens is lined up in a depth direction to change the view to a three-dimensional view, thus enabling easy search of the required image.
- In the method disclosed in Japanese Patent Application Laid-open No. 2005-55743, the attributes of the all the display target images are retrieved, and a center is set on the screen for each attribute value. The attributes of the display target images are retrieved from the concerned images, and the thumbnails of the images are arranged in the vicinity of the center of the attribute values of the images. Thus, the thumbnails of the images having similar attribute values are displayed together. In the method disclosed in Japanese Patent Application Laid-open No. 2005-235041, N-dimensional feature quantity is extracted from the image data, and a new two-dimensional feature quantity is extracted by multivariate statistical analysis. Further, display position and display size are determined based on clustering data.
- Often, attributes are used as the key when searching for a photographic image or a graphical image displayed as thumbnails. However, when large numbers of images are involved, search can be performed effectively by looking for objects included in the image and filtering out those images that do not include the objects from the search. Search keywords input by the user and read by an optical character reader (OCR) are generally used when searching for an article or a business document. However, if the user fails to enter appropriate search keyword or if the OCR fails, the article or business document can still be searched from the thumbnails of drawings, tables, and captions by activating the visual feature “include drawings, tables, and captions”. The method of searching through the thumbnails of a specific object, if the document concerned includes drawings, etc. is more effective than searching the text. Thus, a search method that uses a specific object contained in the image is often more effective.
- However, the display position is determined image-by-image in the image map display methods disclosed in the technologies described above. Therefore, if the specified object is not one of the principal objects of the image, the object cannot be easily extracted as an attribute of the image.
- It is an object of the present invention to at least partially solve the problems in the conventional technology.
- According to an aspect of the present invention, there is provided an image processing device including a classifying unit that analyzes a display target image and classifies an object included in the display target image based on an attribute of the object; a feature-data retrieving unit that retrieves feature data that indicates a feature of the object; a display-method determining unit that sets, based on the feature data, at least one of a display position and a display size of a thumbnail of the display target image that includes the object; and a thumbnail generating unit that generates the thumbnail of the display target image. The thumbnail generating unit creates a thumbnail list image by at least one of generating the thumbnail in the display size set by the display-method determining unit and arranging the thumbnail in the display position set by the display-method determining unit.
- According to another aspect of the present invention, there is provided an image processing device including a classifying unit that analyzes a display target image and classifies an object included in the display target image based on an attribute of the object; a preference setting unit that sets an order of preference of the attributes; a display-method determining unit that sets, based on the order of preference, at least one of a display position and a display size of a thumbnail of the display target image that includes the object; and a thumbnail generating unit that generates the thumbnail of the display target image. The thumbnail generating unit creates a thumbnail list image by at least one of generating the thumbnail in the display size set by the display-method determining unit and arranging the thumbnail in the display position set by the display-method determining unit.
- The above and other objects, features, advantages and technical and industrial significance of this invention will be better understood by reading the following detailed description of presently preferred embodiments of the invention, when considered in connection with the accompanying drawings.
-
FIG. 1 is a block diagram of an image processing device according to a first embodiment of the present invention; -
FIG. 2 is a listing of object attributes classified by an object analyzing unit shown inFIG. 1 ; -
FIG. 3 is a flowchart of a thumbnail list image display process performed by the image processing device shown inFIG. 1 ; -
FIG. 4 is a schematic diagram of a thumbnail list image of regular textual images; -
FIG. 5 is a magnified view of an object attribute “drawing” denoted by the reference symbol ZM1 and the thumbnails around it; -
FIG. 6 is a flowchart of a thumbnail list image display process performed by an image processing device according to a second embodiment of the present invention; -
FIG. 7 is a schematic diagram of a thumbnail list image of regular textual images input as display target images; -
FIG. 8 is a magnified view of an object attribute “drawing” denoted by the reference symbol ZM1′ and the thumbnails around it; -
FIG. 9 is a flowchart of a thumbnail list image display process performed by an image processing device according to a third embodiment of the present invention; -
FIG. 10 is a schematic diagram of a thumbnail list image in which the display sizes of the thumbnails have been varied depending on an order of preference; and -
FIG. 11 is a flowchart of a thumbnail list image display process performed by an image processing device according to a fourth embodiment of the present invention. - Exemplary embodiments of the present invention are described below with reference to the accompanying drawings.
-
FIG. 1 is a block diagram of animage processing device 100 according to a first embodiment of the present invention. Theimage processing device 100 includes aninput unit 101, a display unit 102, acontrol unit 103, and astorage unit 104. Theinput unit 101 can be a keyboard and/or a pointing device such as a mouse, and is used for inputting search condition specification, adding new search conditions, and modifying existing search conditions. The display unit 102 can be a liquid crystal display or a cathode ray tube (CRT), and displays thumbnails of images based on the search condition as well as instruction request or instruction result of theinput unit 101. - The
storage unit 104 can be a hard disk, and stores therein images captured by an image capturingdevice 110 such as a camera, or images of documents such as conference material scanned by a scanner. Thestorage unit 104 also stores therein the thumbnails and object data of the images as folders F1 to Fn. Object data is data pertaining to captions in an image, objects such as drawings, tables, and graphs in a text document, and objects such as portions of an image in a photographic image. Each piece of object data has a corresponding attribute data. Attribute data can be position data that indicates the position of the object, or image attribute quantity obtained by quantification of image attributes of the object, or the pixel size of the object. Thestorage unit 104 also stores therein, corresponding to each image, image identification data such as the pixel size and image ID or image name. - The
control unit 103 includes a central processing unit (CPU), a read-only memory (ROM), a random access memory (RAM), etc. Thecontrol unit 103 causes theimage processing device 100 to realize various functions by executing various programs stored in the ROM. The functions realized by theimage processing device 100 include those of theobject analyzing unit 103A, a display-method determining unit 103B, and athumbnail generating unit 103C shown inFIG. 1 . Theobject analyzing unit 103A reads the display target image from thestorage unit 104, analyzes the image, extracts the attribute of the object (object attribute) in the image, classifies the objects by object attribute, associates the object attribute to the object, and adds it to the object data. - The
object analyzing unit 103A can use the method disclosed in Japanese Patent Application Laid-open No. 2005-78149 to perform image analysis and object extraction. According to this method, theobject analyzing unit 103A divides an image into progressively smaller and smaller areas, and optimizes area boundaries pixel-by-pixel by deploying the Markov random field model that is applicable in spatial frequency domain. Specifically, theobject analyzing unit 103A extracts texture attributes of various scales, and divides the image into areas based on the texture attributes. Theobject analyzing unit 103A then limits its scanning to designated areas and carves out the area of the object with predominant texture attribute. Applying the Markov random field model to the carved out area and the areas surrounding it, theobject analyzing unit 103A optimizes the boundary shape between the areas. - The
object analyzing unit 103A can use the method disclosed in Japanese Patent Application Laid-open No. 2006-39658 for determining object attributes. Though the method disclosed in the patent application is for determining image attributes, it can be adapted for determining object attributes. -
FIG. 2 is a listing of the object attributes classified by theobject analyzing unit 103A. Drawings, tables, graphs, captions, etc., are examples of object attributes in a textual image. People, plants, animals, articles, etc., are examples of object attributes in a photographic or graphical image. - The display-
method determining unit 103B determines, for each object attribute classified by theobject analyzing unit 103A, the display positions of the thumbnails of the images that contain the object. Thethumbnail generating unit 103C generates the thumbnails of the images that contain the object, and outputs to the display unit 102 a process result in the form of a thumbnail list image in which each thumbnail is arranged in the display position determined by the display-method determining unit 103B. -
FIG. 3 is a flowchart of a thumbnail list image display process performed by theimage processing device 100. Theobject analyzing unit 103A retrieves a display target image from the storage unit 104 (Step S1), analyzes the image to extract objects from the image, determines object attributes of the objects, and classifies the objects based on their object attributes (Step S2). Theobject analyzing unit 103A then retrieves the position data that indicates the position of each object in the original display target image (Step S3). For example, if the original display target image is rectangular, at Step S3 theobject analyzing unit 103A determines a center of gravity of each object from the coordinates of the upper left and lower right corners of the original display target image, and retrieves coordinates of the center of gravity of an object as the position data of the object. - The display-
method determining unit 103B determines the display positions of thumbnails of the display target image that contain the objects (Step S4). Assuming that the size of an object is m×n pixels, the upper left corner of the object is the origin, the size of the display target image is M×N pixels, and the coordinates of the center of gravity of the object in the display target image is (fx,fy), then a display position (x,y) of a thumbnail that is to contain the object can be determined from Equation (1): -
- The display-
method determining unit 103B checks whether the process has been completed for all the objects in the display target image (Step S5). If there is an unprocessed object (No at Step S5), Steps S3 to S5 are repeated for the unprocessed object. If there is no unprocessed object (Yes at Step S5), thethumbnail generating unit 103C generates a thumbnail of the display target image (Step S6), and checks whether the process has been completed for all the display target images (Step S7). If there is no unprocessed display target image (Yes at Step S7), thethumbnail generating unit 103C creates a thumbnail list image in which the thumbnails generated at step S6 are placed at the display positions determined at step S4, and outputs the thumbnail list image to the display unit 102 (Step S8). If there is an unprocessed display target image (No at Step S7), Step S1 to S7 are repeated for the unprocessed display target image. - Suppose that in an image map, a display area is secured for each object attribute and each object attribute is displayed at the designated area. At step S8, the
thumbnail generating unit 103C generates the thumbnail list image by arranging the thumbnails on the image map. The display areas of the object attributes in the image map can be preset or can be made changeable with each image according to the number of images, image attribute, or object attribute. -
FIG. 4 is a schematic diagram of a thumbnail list image of a typical textual image. Thumbnails are grouped according to the object attribute, and thumbnails that contain objects of a common object attribute are arranged around each attribute name indicating the object attribute. For example, inFIG. 4 , thumbnails SM1-1 to SM1-7 of images that contain objects belonging to an object attribute “drawing” are arranged around the object attribute “drawing” denoted by the reference symbol ZM1.FIG. 5 is a magnified view of a portion around the object attribute “drawing” inFIG. 4 . The display position of each of the thumbnails SM1-1 to SM1-7 is determined by the position the object belonging to the object attribute occupies in the original image. - If the display target image includes a plurality of objects of different object attributes, as many thumbnails of the display target image as the object it includes are displayed on the display unit 102. For example, if the display target image includes the object of the object attribute “drawing” as well as the object of the object attribute “text”, one thumbnail of the image will be displayed in the vicinity of the attribute name “drawing” and one thumbnail of the image will be displayed in the vicinity of the attribute name “text” on the display unit 102 (for example, the thumbnail SM1-6 in
FIG. 4 ). - The thumbnail list image enables the user to refine the search by using an object as the search key. The user can easily guess the display position of the thumbnail of the target image by recalling the general position of the object in the image. Thus, the user can perform search operations more effectively.
- The display method of the thumbnail list image can be a simple thumbnail display. Alternatively, the target object can be displayed in high resolution or in highlight. Another display method can be that only the target object is displayed. Yet another display method can be to display the object within an outline of the image to give a general idea as to the position of the object in the image.
- A self-organizing map can also be output by the process described above. For outputting the self-organizing map, the
image processing device 100 retrieves the object attributes and the position data of the objects of the display target image and, based on these, creates feature quantity vectors. By using the self-organizing map, theimage processing device 100 can project from the feature quantity space on to a two-dimensional plane, thus automating determination of the display positions of the objects. - The display target image is not limited to a still image; it can be a three-dimensional image or a moving image. In the case of a three-dimensional image, the
image processing device 100 determines the display position of the thumbnail of the image that contains the object based on the center of gravity of the object and the pixel size of the original image, and arranges the thumbnail in the display area three-dimensionally. In the case of a moving image, a time axis (t) is added to the position axes to obtain the coordinate values (fx,fy,t). When displaying the thumbnail list image, the thumbnails of the moving image can be displayed and made to play at the same positions where the thumbnails of the two-dimensional image would have been displayed or the thumbnails of the moving image can be displayed in the display area three-dimensionally. - The
object analyzing unit 103A can identify and retrieve the display target images at Step S1 based on a search condition input via theinput unit 101. - An image processing device according to a second embodiment of the present invention is described below. The parts of the image processing device according to the second embodiment that are similar to those of the first embodiment have been assigned the same reference numerals and symbols, and the explanation thereof is repeated in some cases and not repeated in others.
- The display-
method determining unit 103B according to the first embodiment determines the display position of the thumbnail based on the position of the object in the original image. In the second embodiment, the display-method determining unit 103B determines the display position and display size of the thumbnail of the image that contains the object based on the image feature quantity of the object. Image feature quantity is obtained by quantification of image features such as texture, color, and edge distribution. For image feature quantity, common object attribute is not essential but it is preferable to use the image feature quantity that allows wide distribution of the objects. -
FIG. 6 is a flowchart of a thumbnail list image display process performed by theimage processing device 100 according to the second embodiment. The steps inFIG. 6 that are identical to those inFIG. 3 are not described again. Therefore, Steps S1, S2, and S5 to S8 are not described here. The display-method determining unit 103B retrieves, for each object classified by object attribute by theobject analyzing unit 103A, the image feature quantity included in the object data stored in the storage unit 104 (Step S3A). Based on the image feature quantity, the display-method determining unit 103B determines the display position and the display size of the thumbnail of the image that contains the object (Step S4A). - For determining the display position, a self-organizing map can be used whereby feature vectors are created from object attributes and image feature quantity and projection is made from the feature quantity space on to the two-dimensional plane. Alternatively, after determining the display position of each object attribute, the image feature quantity of the image that contains the object belonging to the object attribute is reflected in the display position of the thumbnail of the image. For determining the display size of the thumbnail based on the image feature quantity, a method disclosed in Japanese Patent Application Laid-open No. 2006-303707 can be used. According to the method, even if an image includes an object that cannot be easily identified unless the display size is large, an appropriate display size can be determined so as to enable the user to recognize the object.
-
FIG. 7 is a schematic diagram of a thumbnail list image of a typical textual image input as display target images. Thumbnails are grouped according to object attribute, and thumbnails that contain objects of a common object attribute are arranged around each attribute name indicating the object attribute. For example, inFIG. 7 , thumbnails SM1-1′ to SM1-7′ of the images that contain the object belonging to the object attribute “drawing” are arranged around the object attribute “drawing” denoted by the reference symbol ZM1′.FIG. 8 is a magnified view of the object attribute “drawing” denoted by the reference symbol ZM1′ and the thumbnails SM1-1′ to SM1-7′ around it. The display position of each of the thumbnails SM1-1′ to SM1-7′ is determined by the color density of the target object. The display size of each thumbnail is set based on the image feature quantity of the object so that the user can easily identify the object. Therefore, as against the display sizes of the thumbnails SM1-1 to SM1-7 shown inFIG. 5 being uniform, the display sizes of the thumbnails SM1-1′ to SM1-7′ shown inFIG. 7 are all different. - The configuration described above allows the object serving as the search key and the objects with similar image feature as that of the search key to be placed close together, with their display sizes adjusted according to the image feature such as frequency data, thus enabling the user to effectively find the desired image data from the thumbnail list image based on the object attribute and image feature quantity.
- Other than the image feature quantity, data such as layout data that indicates the layout of objects and configuration data, which is data obtained by quantification of object configuration when an object is further broken down, can be used for determining the thumbnail display position.
- An image processing device according to a third embodiment of the present invention is described below. The parts of the image processing device according to the third embodiment that are similar to those of the first and second embodiments have been assigned the same reference numerals and symbols, and the explanation thereof is repeated in some cases and not repeated in others.
- In the third embodiment, an order of preference is set for object attributes, and the display-
method determining unit 103B displays or hides the thumbnails of the images that contain the object according to the order of preference. The order of preference of the object attributes is input by the user via theinput unit 101 and stored as part of object data in thestorage unit 104. -
FIG. 9 is a flowchart of a thumbnail list image display process performed by theimage processing device 100 according to the third embodiment. Steps S1, S2, and S3A are identical to those described with reference to the second embodiment. Next, at Step SB1, the display-method determining unit 103B retrieves the order of preference of the object attributes included in the object data stored in thestorage unit 104. The display-method determining unit 103B then determines whether the retrieved order of preference is greater than a preset integer I of one or greater, and is stored in advance in thestorage unit 104. If the order of preference is greater than I (Yes at Step SB2), the display-method determining unit 103B determines the display positions and the display sizes of the thumbnails of the images that contain the object based on the order of preference (Step S4A). The display-method determining unit 103B, for instance, sets more centrally the display position or sets higher in the display order the thumbnail of the image containing the object of the object attribute that is higher in the order of preference. The display-method determining unit 103B sets the display size of the thumbnails such that the higher the object attribute is in the order of preference, the larger the thumbnail of the image containing the object. The subsequent steps S5 to S8 are identical to those described with reference to the second embodiment. If the order of preference is not greater than I (No at Step SB2), the display-method determining unit 103B repeats Steps 3A to SB2 for the next object. -
FIG. 10 is a schematic diagram of a thumbnail list image in which the thumbnails are displayed in different sizes according to the order of preference. The thumbnails are grouped according to the object attribute, and the thumbnails of the images that contain the object of an object attribute are arranged around each attribute name indicating the object attribute. InFIG. 10 , the higher the object attribute is in the order of preference, the larger the display size is of the thumbnails of the images around the object attribute. If two thumbnails overlap, a thumbnail of higher object attribute can be displayed on top of the other thumbnail. - The configuration described above allows the thumbnails to be displayed according to the order of preference. Further, the thumbnails of the images that do not include an object of high preference are not displayed, making the search that much finer. Preferred thumbnail list image can be obtained as the thumbnails images containing the object of object attribute that is higher in the order of preference are displayed large.
- Further, the configuration enables the thumbnails of the images containing an object of object attribute that is higher in the order of preference to be displayed more centrally in the area of the thumbnail list image that falls in the field of vision of the user, making it more convenient for the user to pick out the desired image more quickly.
- The configuration enables adjustment of the display size of the thumbnails according to the order of preference. The thumbnails of the images containing the object of object attribute that is high in the order of preference is displayed in highlight, enabling the user to easily identify the desired image.
- Further, the configuration enables determination of display order of the thumbnails according to the order of preference. Thus, the thumbnails of the images are displayed beginning with the object attribute of highest preference, shortening the search time and enabling the thumbnails to be displayed in the display order that is suited to the user's intent.
- The display-
method determining unit 103B can be configured to do away with step SB2, and directly determine, according to the order of preference, the display positions and the display sizes for all the objects in the display target image. - The display-
method determining unit 103B can be configured to do away with step S3A and directly go to step SB1 after step S2. In this case, at least the display position or the display size can be determined. - An image processing device according to a fourth embodiment is described below. The parts of the image processing device according to the fourth embodiment that are similar to those of the second and third embodiments have been assigned the same reference numerals and symbols, and the explanation thereof is repeated in some cases and not repeated in others.
- In the fourth embodiment, the display-
method determining unit 103B sets the order of preference of an object attribute based on a query image specified as a search key for searching similar images, and determines the display position and the display size of the thumbnail based on the order of preference. It is supposed here that only one query image is specified as a search key. Specifically, the display-method determining unit 103B sets the order of preference of the object attribute of the objects contained in the query image by placing the object having the largest pixel size highest in the order of preference. - It is supposed here that the query images are stored in advance in the
storage unit 104, and the user specifies a query image via theinput unit 101. Specifically, theobject analyzing unit 103A identifies the target image based on the image identification data the user inputs via theinput unit 101, and retrieves the image as the query image. -
FIG. 11 is a flowchart of a thumbnail list image display process performed by theimage processing device 100 according to the fourth embodiment. Theobject analyzing unit 103A of theimage processing device 100 retrieves the query image (Step SC1), analyzes the query image, extracts the objects, and classifies the objects by object attribute (Step SC2). Theobject analyzing unit 103A then retrieves the image feature quantity of each object (Step SC3). As image feature quantity, theobject analyzing unit 103A retrieves the pixel size of each object. Theobject analyzing unit 103A then checks whether the process has been completed for all the objects (Step SC4). If there are objects for which the process has not been completed (No at Step SC4), theobject analyzing unit 103A repeats steps SC2 and SC3. If the process has been completed for all the objects (Yes at Step SC4), theobject analyzing unit 103A checks whether the process has been completed for all the query images (Step SC5). If the process has been completed for all the query images (Yes at Step SC5), the display-method determining unit 103B sets the order of preference of the object attribute, placing the object that has the largest pixel size highest in the order of preference (Step SC6). If there are several objects of the same attribute in the query image, the object that has the largest pixel size is set as the representative object of the object attribute. - The steps subsequent to Step SC6 are not shown in
FIG. 11 . However, Steps S4A to S8 shown inFIG. 9 follow Step SC6. At Step S4A, the display-method determining unit 103B determines the display position and the display size of the thumbnail, placing more centrally or setting higher in the display order, and displaying in larger size the thumbnail of the image containing the object belonging to the object attribute that is higher in the order of preference. - Thus, in the configuration described above, the order of preference can be set based on a query image, enabling the user to perform search based on any vague image feature. The thumbnails of the images containing the object of object attribute that is high in the order of preference is displayed in highlight, enabling the user to easily identify the desired image.
- The order of preference can also be set based on the number of object attributes in the query image, setting highest preference to the object attribute of the object that has highest count.
- When there is a plurality of query images, the order of preference needs to be set taking into account all the objects in all the query images. In this case, at Step SC6, the display-
method determining unit 103B sets the order of preference of the object attributes by one of the methods described below: - (a) Setting the object attribute of the object that has a higher count in the query images higher in the order of preference
- (b) Setting the object attribute of the object whose distribution is sparser in the query images (in other words, from among the query images, the object having most similar image features) higher in the order of preference
- Specifically, at Step SC6, the display-
method determining unit 103B calculates the distribution of the image feature quantity of each object contained in the query images, and sets the object attribute of the object that has sparser distribution higher in the order of preference. - Thus, by setting the order of preference of the object attributes based on the similarity of the objects in the query images, the thumbnails sought by the user can be displayed more effectively.
- Although the invention has been described with respect to specific embodiments, the constituent elements can be modified within a scope not deviating from the purport of the invention. In addition, various inventions can be formed by combining a plurality components disclosed in the embodiments appropriately. For example, some components from among the components described in the above embodiments can be omitted. Further, components from different embodiments can be combined.
- The
image processing device 100 can be configured such that the computer programs executed by it can be downloaded over a network, such as the Internet, from a computer connected to the network. Alternatively, the programs can be made available as installable files or executable files on a computer-readable recording medium such as compact disk-read-only memory (CD-ROM), flexible disk (FD), compact disk-rewritable (CD-R), and digital versatile disk (DVD). - In the embodiments described above, the display target image and the query image are read from the
storage unit 104. However, theimage processing device 100 can be configured to retrieve the images from a computer connected to the Internet or from a computer-readable recording medium such as CD-ROM, FD, CD-R, and DVD. - The
image processing device 100 can be a computer, copier, printer, facsimile machine, or a multifunction peripheral equipped with functions such as copying, printing function, and facsimile. - The
image processing device 100 in the embodiments described above includes theinput unit 101 and the display unit 102. However, theinput unit 101 and the display unit 102 can be done away with, and the image processing device can be connected to the outside by a cable or wirelessly. - According to the present invention, image search can be performed effectively by focusing on specific objects of an image such as captions, drawings, tables, and graphs accompanying text, partial images of a photographic image, etc.
- Although the invention has been described with respect to specific embodiments for a complete and clear disclosure, the appended claims are not to be thus limited but are to be construed as embodying all modifications and alternative constructions that may occur to one skilled in the art that fairly fall within the basic teaching herein set forth.
Claims (14)
1. An image processing device comprising:
a classifying unit that analyzes a display target image and classifies an object included in the display target image based on an attribute of the object;
a feature-data retrieving unit that retrieves feature data that indicates a feature of the object;
a display-method determining unit that sets, based on the feature data, at least one of a display position and a display size of a thumbnail of the display target image that includes the object; and
a thumbnail generating unit that generates the thumbnail of the display target image,
wherein the thumbnail generating unit creates a thumbnail list image by at least one of generating the thumbnail in the display size set by the display-method determining unit and arranging the thumbnail in the display position set by the display-method determining unit.
2. The image processing device according to claim 1 , wherein the feature data is at least one of position data that indicates a position, image feature data that indicates an image feature, layout data that indicates a layout, and configuration data that indicates a configuration of the object in the display target image that includes the object.
3. The image processing device according to claim 1 , further comprising a first image-retrieving unit that retrieves an image that matches with a search condition as the display target image.
4. The image processing device according to claim 1 , further comprising a display unit that displays the thumbnail list image created by the thumbnail generating unit.
5. An image processing device comprising:
a classifying unit that analyzes a display target image and classifies an object included in the display target image based on an attribute of the object;
a preference setting unit that sets an order of preference of the attributes;
a display-method determining unit that sets, based on the order of preference, at least one of a display position and a display size of a thumbnail of the display target image that includes the object; and
a thumbnail generating unit that generates the thumbnail of the display target image,
wherein the thumbnail generating unit creates a thumbnail list image by at least one of generating the thumbnail in the display size set by the display-method determining unit and arranging the thumbnail in the display position set by the display-method determining unit.
6. The image processing device according to claim 5 , wherein the display-method determining unit does not display the thumbnail of the image that does not include the object classified by attribute having an order of preference greater than a predetermined value.
7. The image processing device according to claim 5 , wherein the display-method determining unit sets the display position more centrally for the attribute that is higher in the order of preference.
8. The image processing device according to claim 5 , wherein the display-method determining unit sets the display size larger for the attribute that is higher in the order of preference.
9. The image processing device according to claim 5 , wherein the display-method determining unit sets higher in a display order the thumbnail of the display target image containing the object of the object attribute that is higher in the order of preference.
10. The image processing device according to claim 5 , further comprising an input receiving unit that receives information indicative of the order of preference of the attributes from a user,
wherein the preference setting unit sets the order of preference based on the information received by the input receiving unit.
11. The image processing device according to claim 5 , further comprising a second image-retrieving unit that retrieves a query image as a search key for searching a similar image,
wherein the preference setting unit sets the order of preference based on the objects included in the query image.
12. The image processing device according to claim 11 , wherein the preference setting unit sets the order of preference based on the number of objects included in the query image for each attribute.
13. The image processing device according to claim 11 , further comprising a feature-data retrieving unit that retrieves feature data that indicates features related to each object classified by the classifying unit,
wherein the preference setting unit sets higher in the order of preference the attribute, from among the attributes of the objects included in the query image, that has a sparser distribution based on the feature data retrieved by the feature-data retrieving unit.
14. The image processing device according to claim 5 , further comprising a display unit that displays the thumbnail list image created by the thumbnail generating unit.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2007-274699 | 2007-10-23 | ||
| JP2007274699A JP5059545B2 (en) | 2007-10-23 | 2007-10-23 | Image processing apparatus and image processing method |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20090106699A1 true US20090106699A1 (en) | 2009-04-23 |
Family
ID=40564769
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US12/208,486 Abandoned US20090106699A1 (en) | 2007-10-23 | 2008-09-11 | Image processing device and image processing method |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US20090106699A1 (en) |
| JP (1) | JP5059545B2 (en) |
| CN (1) | CN101419612B (en) |
Cited By (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20100188369A1 (en) * | 2009-01-26 | 2010-07-29 | Canon Kabushiki Kaisha | Image displaying apparatus and image displaying method |
| US20100333005A1 (en) * | 2009-06-29 | 2010-12-30 | Sayoko Tanaka | Display Processing Apparatus, Display Processing Method, and Computer Program Product |
| US20120047233A1 (en) * | 2010-08-20 | 2012-02-23 | Pantech Co., Ltd. | Apparatus and method for providing object information |
| US9100678B2 (en) | 2011-03-30 | 2015-08-04 | Casio Computer Co., Ltd. | Image display method, server, and image display system |
| US9098534B2 (en) | 2011-07-25 | 2015-08-04 | Casio Computer Co., Ltd. | Image display system, image display method, social network service system, and computer-readable medium |
| CN104915417A (en) * | 2015-06-08 | 2015-09-16 | 上海如书文化传播有限公司 | Method and device for shooting and processing images into film effect by using mobile terminal |
| US20150286896A1 (en) * | 2012-05-24 | 2015-10-08 | Hitachi, Ltd. | Image Analysis Device, Image Analysis System, and Image Analysis Method |
| US9298246B2 (en) | 2013-01-18 | 2016-03-29 | Ricoh Company, Limited | Information processing device, system, and information processing method |
| US20160104052A1 (en) * | 2014-10-10 | 2016-04-14 | Qualcomm Incorporated | Text-based thumbnail generation |
| DE102015212953A1 (en) * | 2015-07-10 | 2017-01-12 | Siemens Healthcare Gmbh | Artificial neural networks for the classification of medical image datasets |
| US9591359B2 (en) * | 2015-06-26 | 2017-03-07 | Rovi Guides, Inc. | Systems and methods for automatic formatting of images for media assets based on prevalence |
| US9613272B2 (en) | 2013-11-27 | 2017-04-04 | Ricoh Company, Ltd. | Image analyzing device, image analyzing method, and recording medium storing image analyzing program |
| EP3245646A1 (en) * | 2015-01-15 | 2017-11-22 | Qualcomm Incorporated | Text-based image resizing |
| US20170351712A1 (en) * | 2016-06-02 | 2017-12-07 | Naver Corporation | Method and system for map image search using context of image |
| US10162507B2 (en) | 2013-09-27 | 2018-12-25 | Ricoh Company, Ltd. | Display control apparatus, display control system, a method of controlling display, and program |
| US10341739B2 (en) | 2016-05-16 | 2019-07-02 | Rovi Guides, Inc. | Methods and systems for recommending providers of media content to users viewing over-the-top content based on quality of service |
| US10628009B2 (en) | 2015-06-26 | 2020-04-21 | Rovi Guides, Inc. | Systems and methods for automatic formatting of images for media assets based on user profile |
| US10812851B2 (en) | 2016-05-16 | 2020-10-20 | Rovi Guides, Inc. | Methods and systems for presenting media listings based on quality of service at a user device |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP5862671B2 (en) * | 2011-08-23 | 2016-02-16 | 日本電気株式会社 | Information processing system, information processing method, information processing apparatus, control method thereof, and control program |
| WO2013118218A1 (en) * | 2012-02-09 | 2013-08-15 | パナソニック株式会社 | Image recognition device, image recognition method, program and integrated circuit |
| CN103092818B (en) * | 2013-02-21 | 2016-05-04 | 用友网络科技股份有限公司 | Thumbnail generation system and the reduced graph generating method of form |
| KR20160015838A (en) * | 2014-07-31 | 2016-02-15 | 삼성전자주식회사 | Method and device for classifying contents |
| CN104270678B (en) * | 2014-09-05 | 2018-05-08 | 广州金山网络科技有限公司 | A kind of humbnail display methods, device and system |
| JP6419560B2 (en) * | 2014-12-05 | 2018-11-07 | 株式会社東芝 | Search device, method and program |
| CN111753181A (en) * | 2019-03-28 | 2020-10-09 | 北京京东尚科信息技术有限公司 | Image-based search method, apparatus, server, client and medium |
Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6160553A (en) * | 1998-09-14 | 2000-12-12 | Microsoft Corporation | Methods, apparatus and data structures for providing a user interface, which exploits spatial memory in three-dimensions, to objects and in which object occlusion is avoided |
| US20010030662A1 (en) * | 1999-12-20 | 2001-10-18 | Toshihiko Ohkawa | System and method for displaying index information on a computer screen |
| US6751620B2 (en) * | 2000-02-14 | 2004-06-15 | Geophoenix, Inc. | Apparatus for viewing information in virtual space using multiple templates |
| US20040233219A1 (en) * | 2003-03-05 | 2004-11-25 | Aguera Y Arcas Blaise | System and method for exact rendering in a zooming user interface |
| US20050010599A1 (en) * | 2003-06-16 | 2005-01-13 | Tomokazu Kake | Method and apparatus for presenting information |
| US6919890B2 (en) * | 2000-09-28 | 2005-07-19 | Curl Corporation | Grid and table layout using elastics |
| US20050246336A1 (en) * | 2002-05-30 | 2005-11-03 | Jurgen Stauder | Method and device for creating semantic browsing options |
| US20080249645A1 (en) * | 2007-04-06 | 2008-10-09 | Denso Corporation | Sound data retrieval support device, sound data playback device, and program |
| US7689525B2 (en) * | 2003-04-22 | 2010-03-30 | Microsoft Corporation | Relationship view |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2003256097A (en) * | 2002-02-27 | 2003-09-10 | Fuji Photo Film Co Ltd | Image display control device |
| JP4341408B2 (en) * | 2004-01-15 | 2009-10-07 | パナソニック株式会社 | Image display method and apparatus |
| JP4332461B2 (en) * | 2004-04-16 | 2009-09-16 | キヤノン株式会社 | Image processing apparatus and method |
| JP4376145B2 (en) * | 2004-07-22 | 2009-12-02 | 日立ソフトウエアエンジニアリング株式会社 | Image classification learning processing system and image identification processing system |
| CN1797578A (en) * | 2004-12-30 | 2006-07-05 | 上海乐金广电电子有限公司 | Method for storing and displaying classified bookmarkers based on different attributes |
| JP2006303707A (en) * | 2005-04-18 | 2006-11-02 | Canon Inc | Image processing apparatus and image processing method |
| JP4565187B2 (en) * | 2005-06-10 | 2010-10-20 | 国立大学法人北海道大学 | Image display position determination device, image display position determination method, image display position determination program, and computer-readable recording medium recording the same |
| JP4950074B2 (en) * | 2006-01-30 | 2012-06-13 | パナソニック株式会社 | Imaging device, display control device, image display system, and imaging system |
-
2007
- 2007-10-23 JP JP2007274699A patent/JP5059545B2/en not_active Expired - Fee Related
-
2008
- 2008-09-11 US US12/208,486 patent/US20090106699A1/en not_active Abandoned
- 2008-10-23 CN CN2008101713852A patent/CN101419612B/en not_active Expired - Fee Related
Patent Citations (9)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6160553A (en) * | 1998-09-14 | 2000-12-12 | Microsoft Corporation | Methods, apparatus and data structures for providing a user interface, which exploits spatial memory in three-dimensions, to objects and in which object occlusion is avoided |
| US20010030662A1 (en) * | 1999-12-20 | 2001-10-18 | Toshihiko Ohkawa | System and method for displaying index information on a computer screen |
| US6751620B2 (en) * | 2000-02-14 | 2004-06-15 | Geophoenix, Inc. | Apparatus for viewing information in virtual space using multiple templates |
| US6919890B2 (en) * | 2000-09-28 | 2005-07-19 | Curl Corporation | Grid and table layout using elastics |
| US20050246336A1 (en) * | 2002-05-30 | 2005-11-03 | Jurgen Stauder | Method and device for creating semantic browsing options |
| US20040233219A1 (en) * | 2003-03-05 | 2004-11-25 | Aguera Y Arcas Blaise | System and method for exact rendering in a zooming user interface |
| US7689525B2 (en) * | 2003-04-22 | 2010-03-30 | Microsoft Corporation | Relationship view |
| US20050010599A1 (en) * | 2003-06-16 | 2005-01-13 | Tomokazu Kake | Method and apparatus for presenting information |
| US20080249645A1 (en) * | 2007-04-06 | 2008-10-09 | Denso Corporation | Sound data retrieval support device, sound data playback device, and program |
Cited By (28)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20100188369A1 (en) * | 2009-01-26 | 2010-07-29 | Canon Kabushiki Kaisha | Image displaying apparatus and image displaying method |
| US20100333005A1 (en) * | 2009-06-29 | 2010-12-30 | Sayoko Tanaka | Display Processing Apparatus, Display Processing Method, and Computer Program Product |
| US20120047233A1 (en) * | 2010-08-20 | 2012-02-23 | Pantech Co., Ltd. | Apparatus and method for providing object information |
| US9100678B2 (en) | 2011-03-30 | 2015-08-04 | Casio Computer Co., Ltd. | Image display method, server, and image display system |
| US9098534B2 (en) | 2011-07-25 | 2015-08-04 | Casio Computer Co., Ltd. | Image display system, image display method, social network service system, and computer-readable medium |
| US20150286896A1 (en) * | 2012-05-24 | 2015-10-08 | Hitachi, Ltd. | Image Analysis Device, Image Analysis System, and Image Analysis Method |
| US9665798B2 (en) * | 2012-05-24 | 2017-05-30 | Hitachi, Ltd. | Device and method for detecting specified objects in images using metadata |
| US9298246B2 (en) | 2013-01-18 | 2016-03-29 | Ricoh Company, Limited | Information processing device, system, and information processing method |
| US10162507B2 (en) | 2013-09-27 | 2018-12-25 | Ricoh Company, Ltd. | Display control apparatus, display control system, a method of controlling display, and program |
| US9613272B2 (en) | 2013-11-27 | 2017-04-04 | Ricoh Company, Ltd. | Image analyzing device, image analyzing method, and recording medium storing image analyzing program |
| US20160104052A1 (en) * | 2014-10-10 | 2016-04-14 | Qualcomm Incorporated | Text-based thumbnail generation |
| WO2016057161A1 (en) * | 2014-10-10 | 2016-04-14 | Qualcomm Incorporated | Text-based thumbnail generation |
| EP3245646A1 (en) * | 2015-01-15 | 2017-11-22 | Qualcomm Incorporated | Text-based image resizing |
| CN104915417A (en) * | 2015-06-08 | 2015-09-16 | 上海如书文化传播有限公司 | Method and device for shooting and processing images into film effect by using mobile terminal |
| US11481095B2 (en) | 2015-06-26 | 2022-10-25 | ROVl GUIDES, INC. | Systems and methods for automatic formatting of images for media assets based on user profile |
| US9591359B2 (en) * | 2015-06-26 | 2017-03-07 | Rovi Guides, Inc. | Systems and methods for automatic formatting of images for media assets based on prevalence |
| US12405716B2 (en) | 2015-06-26 | 2025-09-02 | Adeia Guides Inc. | Systems and methods for automatic formatting of images for media assets based on user profile |
| US10628009B2 (en) | 2015-06-26 | 2020-04-21 | Rovi Guides, Inc. | Systems and methods for automatic formatting of images for media assets based on user profile |
| US11842040B2 (en) | 2015-06-26 | 2023-12-12 | Rovi Guides, Inc. | Systems and methods for automatic formatting of images for media assets based on user profile |
| DE102015212953A1 (en) * | 2015-07-10 | 2017-01-12 | Siemens Healthcare Gmbh | Artificial neural networks for the classification of medical image datasets |
| DE102015212953B4 (en) | 2015-07-10 | 2024-08-22 | Siemens Healthineers Ag | Artificial neural networks for the classification of medical image data sets |
| US11962864B2 (en) | 2016-05-16 | 2024-04-16 | Rovi Guides, Inc. | Methods and systems for recommending providers of media content to users viewing over-the-top content based on quality of service |
| US10827228B2 (en) | 2016-05-16 | 2020-11-03 | Rovi Guides, Inc. | Methods and systems for recommending providers of media content to users viewing over-the-top content based on quality of service |
| US10812851B2 (en) | 2016-05-16 | 2020-10-20 | Rovi Guides, Inc. | Methods and systems for presenting media listings based on quality of service at a user device |
| US12244905B2 (en) | 2016-05-16 | 2025-03-04 | Adeia Guides Inc. | Methods and systems for recommending providers of media content to users viewing over-the-top content based on quality of service |
| US10341739B2 (en) | 2016-05-16 | 2019-07-02 | Rovi Guides, Inc. | Methods and systems for recommending providers of media content to users viewing over-the-top content based on quality of service |
| US11023518B2 (en) * | 2016-06-02 | 2021-06-01 | Naver Corporation | Method and system for map image search using context of image |
| US20170351712A1 (en) * | 2016-06-02 | 2017-12-07 | Naver Corporation | Method and system for map image search using context of image |
Also Published As
| Publication number | Publication date |
|---|---|
| CN101419612B (en) | 2012-02-15 |
| JP5059545B2 (en) | 2012-10-24 |
| JP2009105594A (en) | 2009-05-14 |
| CN101419612A (en) | 2009-04-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20090106699A1 (en) | Image processing device and image processing method | |
| JP4859025B2 (en) | Similar image search device, similar image search processing method, program, and information recording medium | |
| JP5033580B2 (en) | Image processing apparatus and image processing method | |
| US20190266397A1 (en) | Image processing apparatus, control method for image processing apparatus, and non-transitory storage medium | |
| US7610274B2 (en) | Method, apparatus, and program for retrieving data | |
| JP2005004724A (en) | Image processing apparatus, control method therefor, and program | |
| JP2008140377A (en) | Information retrieval apparatus, method and program | |
| JP2007286864A (en) | Image processing apparatus, image processing method, program, and recording medium | |
| US12223261B2 (en) | Image processing apparatus, image processing method, and storage medium | |
| JP4920928B2 (en) | Image processing apparatus, control method therefor, and program | |
| EP1752895A1 (en) | Image processing apparatus for image retrieval and control method therefor | |
| US20230077608A1 (en) | Information processing apparatus, information processing method, and storage medium | |
| JP4261988B2 (en) | Image processing apparatus and method | |
| JP2007317034A (en) | Image processing apparatus, image processing method, program, and recording medium | |
| JP2021056796A (en) | Structure recognition system, structure recognition device, structure recognition method, and program | |
| JPH10254901A (en) | Image search method and apparatus | |
| JP2005352777A (en) | Image processing apparatus, control method therefor, and program | |
| US20160162752A1 (en) | Retrieval apparatus, retrieval method, and computer program product | |
| JP2010073194A (en) | Image processing device, image processing method, and program | |
| CN113438379A (en) | Information processing apparatus and recording medium | |
| JP2005208977A (en) | Document filing device and method | |
| CN117707398A (en) | Data processing method and device | |
| JP2021027557A (en) | Image processing device, information processing method and program | |
| JP4152927B2 (en) | Image processing apparatus, image forming apparatus, processed document search method, processed document search program, and recording medium | |
| JPH11238072A (en) | Document storage device |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: RICOH COMPANY, LIMITED, JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KIHARA, YUKA;KOBAYASHI, KOJI;INAMOTO, HIROHISA;REEL/FRAME:021527/0064 Effective date: 20080828 |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |