HK1170833B - Defining a border for an image - Google Patents
- Publication number
- HK1170833B HK12108546.8A
- Authority
- HK
- Hong Kong
- Prior art keywords
- image
- border
- pixels
- region
- tunnel
Description
The invention is directed towards image editing. Specifically, the invention is directed towards defining a border for an image.
Image editing applications (as well as video and other media editing applications) provide users with the ability to modify digital images from their original state. Often a user will want to modify the color properties of an entire image, or more commonly, a selection of an image. For example, a user might want to increase the saturation in a selection to make the color more intense and thereby cause that selection to stand out more in the image. Other color properties a user might want to change include hue, luminosity, etc. Modification of the color properties of a selection will be referred to as color correction.
In order to modify a selection of an image, a user must first be provided with a tool for defining the selection they want to modify. Some prior art selection tools base the selection on a color selected by a user. A user can specify (by selecting from a color palette or by clicking on a point in an image) what color they want to select, and the selection tool will define a selection as all pixels in the image within a threshold of the selected color. However, in some cases a user will only want to select some of the pixels of the selected color (e.g., if there are multiple faces in an image and a user wants to highlight one of the faces). Further, sometimes a desired selection will include multiple colors (e.g., a head with skin, hair, eyes, etc.).
Other prior art selection tools allow a user to draw a border around the area of the image the user wants to select for color correction. However, doing so is often a very difficult process as the border of the selection is defined by the exact movement of the cursor. This requires a user to move very slowly and carefully through the image. Therefore, there is a need for a selection tool that allows a user to move more quickly through the image yet still defines a border in the image in the appropriate location.
A further shortcoming of such prior art selection tools is the inability to correct a mistake. A user of such selection tools must be able to start at the beginning of a desired border and move a cursor all the way to the desired endpoint without making a mistake. If a mistake is made, the user must start the selection process over. This can be a very frustrating process for the user, especially if the border the user attempts to draw is long, and the user has to make multiple attempts to draw the border. Therefore, there is a need for a selection tool that allows a user to correct mistakes when attempting to define a border in an image.
A third shortcoming of the above prior art selection tools is that they define a border that does not allow for a natural transition from foreground to background. Some tools do not create a hard edge between a selection and the rest of the image, but apply a simple softening of the edge of a selection. However, these tools do not create the softening effect based on an intelligent algorithm that accounts for the actual nature of the border. When attempting to select an area such as a head with hair, it is nearly impossible to trace out every hair, but the ability to keep the hairs in the foreground is a useful feature. Furthermore, even at borders that are easier to select, an intelligent transition from the foreground to background that is specific to the border may be desirable. Therefore, there is a need for a user to be able to define an area as a transition section, and to determine the size and shape of the transition section. In addition, there is a need to be able to define an intelligent transition from foreground to background for a selection.
In US 2006/0039611 techniques are disclosed to provide more efficient and improved border matting for extracted foreground images, e.g., without requiring excessive user interaction. Border matting techniques described herein generate relatively continuous transparency (or alpha values) along the boundary of the extracted object (e.g., limiting color bleeding and/or artifacts).
The paper "Soft Scissors: An Interactive Tool for Realtime High Quality Matting" by Wang et al presents an interactive tool for extracting alpha mattes of foreground objects in realtime. This system efficiently estimates foreground color thereby allowing both the matte and the final composite to be revealed instantly as the user roughly paints along the edge of the foreground object. In addition, the system can dynamically adjust the width and boundary conditions of the scissoring paint brush to approximately capture the boundary of the foreground object that lies ahead on the scissor's path.
The paper "Intelligent Scissors for Image Composition" by Mortensen and Barrett discloses a tool called Intelligent Scissors which can be used for image segmentation and composition. The tool allows objects within digital images to be extracted quickly and accurately using simple gesture motions with a mouse. When the gestured mouse position comes in proximity to an object edge, a live-wire boundary "snaps" to, and wraps around the object of interest. Live-wire boundary detection formulates discrete dynamic programming (DP) as a two-dimensional graph searching problem. DP provides mathematically optimal boundaries while greatly reducing sensitivity to local noise or other intervening structures. Robustness is further enhanced with on-the-fly training which causes the boundary to adhere to the specific type of edge currently being followed, rather than simply the strongest edge in the neighborhood. Boundary cooling automatically freezes unchanging segments and automates input of additional seed points. Cooling also allows the user to be much more free with the gesture path, thereby increasing the efficiency and finesse with which boundaries can be extracted. Extracted objects can be scaled, rotated, and composited using live-wire masks and spatial frequency equivalencing. Frequency equivalencing is performed by applying a Butterworth filter which matches the lowest frequency spectra to all other image components. Intelligent Scissors allow creation of convincing compositions from existing images while dramatically increasing the speed and precision with which objects can be extracted.
The user guide for the Adobe Photoshop 5 software package describes a "magnetic pen" tool that can be used to trace an object by moving a pointer along the object's edges. As the pointer moves, an active segment snaps to the strongest edge of the image to generate a border. "Fastening points" are periodically added to the border to anchor previous sections of the border.
The invention uses a defined border to perform image-editing operations, such as color correction of a portion of an image or cutting out (i.e., cropping) a portion of an image, as well as other image-editing operations (e.g., adding textures and other effects, etc.). To assist in such operations, the invention generates a tunnel about the defined border. The tunnel has a specified width, with either side of the tunnel at an equal distance from the border. The tunnel's width is varied to avoid self-intersection. In some embodiments, the method receives modifications to the tunnel after the tunnel is generated. The modifications of some embodiments include changes to the width of the tunnel and modifications to the shape of one or both sides of the tunnel.
As mentioned above, some embodiments use the tunnel to perform image-editing operations. For instance, some embodiments use the tunnel to generate a foreground to background transition in an image. To implement this transition, some embodiments sample pixels on the exterior of the tunnel, and determine an alpha value for pixels inside the tunnel based on the sample pixels. The alpha values are determined based on an algorithm that compares image values of the sampled pixels to the image values of the pixels on the interior of the tunnel.
The novel features of the invention are set forth in the appended claims.
However, for purpose of explanation, several embodiments of the invention are set forth in the following figures.
In the following description, numerous details are set forth for purpose of explanation. However, one of ordinary skill in the art will realize that the invention may be practiced without the use of these specific details.
Some embodiments use the defined border to perform image-editing operations, such as color correction of a portion of an image or cutting out a portion of an image. To assist in such operations, some embodiments generate a tunnel about the defined border. In some embodiments, the tunnel has a constant set width, with either side of the tunnel an equal distance from the border. In other embodiments, the tunnel's width is varied to avoid self-intersection. The method receives modifications to the tunnel after the tunnel is generated, in some embodiments. The modifications of some embodiments include changes to the width of the tunnel and modifications to the shape of one or both sides of the tunnel.
As mentioned above, some embodiments use the tunnel to perform image-editing operations. For instance, some embodiments use the tunnel to generate a foreground to background transition in an image. To implement this transition, some embodiments sample pixels on the exterior of the tunnel, and determine an alpha value for pixels inside the tunnel based on the sample pixels. Some embodiments determine the alpha value based on an algorithm that compares image values of the sampled pixels to the image values of the pixels on the interior of the tunnel.
After identifying the edges, the process uses the edges to define (at 110) a border of the image 200 as a cursor traverses over an image. Figure 4 illustrates image 200 with defined border 405. The border 405 is defined along some of the edges 305, and includes segments 410 with points 415. Note that the border does not necessarily need to be selected along an entire continuous edge or be completely enclosing an object.
From the border defined at 110, the process generates (at 115) a tunnel based on the defined border. Figure 5 illustrates tunnel 505 on image 200. The tunnel 505 is generated such that both sides follow the curve of defined border 405. The tunnel 505 encloses the transition area 215. Some embodiments generate a tunnel such that one or both sides do not exactly follow the curve of the selected border. Some embodiments allow the tunnel to be modified after it is generated. The tunnel can be modified in some embodiments by changing the width of the tunnel (i.e., the distance between the two sides) or by altering the shape of one or both sides of the tunnel.
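The constant-width case described above can be illustrated with a minimal sketch, assuming the border is available as a list of (x, y) points. The function name tunnel_sides and the use of a central-difference tangent are assumptions made for illustration, not the method of any particular embodiment, and the sketch does not vary the width to avoid self-intersection.

```python
import math

def tunnel_sides(border, width):
    """Return (left, right) polylines offset width/2 on either side of border.

    border is a list of (x, y) points; width is the full tunnel width.
    Hypothetical helper: the source does not specify how the sides are computed.
    """
    if len(border) < 2:
        raise ValueError("border needs at least two points")
    half = width / 2.0
    left, right = [], []
    last = len(border) - 1
    for i, (x, y) in enumerate(border):
        # Estimate the tangent from the neighbouring points (one-sided at the ends).
        x0, y0 = border[max(i - 1, 0)]
        x1, y1 = border[min(i + 1, last)]
        tx, ty = x1 - x0, y1 - y0
        length = math.hypot(tx, ty)
        if length == 0:
            raise ValueError("cannot estimate a tangent at point %d" % i)
        # The unit normal is the unit tangent rotated by 90 degrees.
        nx, ny = -ty / length, tx / length
        left.append((x + half * nx, y + half * ny))
        right.append((x - half * nx, y - half * ny))
    return left, right
```

Changing the width argument and regenerating the sides corresponds to the width modification mentioned above; reshaping one side independently would require editing the returned polylines directly.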
Based on the tunnel generated at 115, the process 100 generates a foreground to background transition. Some embodiments generate the foreground to background transition inside the tunnel based on the pixels outside the tunnel. Some embodiments define an alpha value for each pixel. The alpha values of some embodiments represent the extent to which a pixel is in the foreground. Figure 6 illustrates alpha values for the pixels in image 200. The darker the pixel, the lower the alpha value. In Figure 6, alpha values have been generated for the entire image in addition to the transition area inside the tunnel. Some embodiments use other methods of generating alpha values in some areas to complement the generation of alpha values from tunnels. Figure 6 illustrates foreground area 205 which is entirely white, background area 210, which is entirely black, and transition area 215, which includes a gradation from white to black. The lightness of a particular pixel indicates the extent to which the particular pixel is in the foreground. Part of the area that was within the tunnel in Figure 5 is entirely white because the pixels are similar in nature to those in the rest of the foreground. After generating the foreground to background transition, the process 100 ends.
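As a rough illustration of deriving an alpha value for a pixel inside the tunnel from pixels sampled outside it, the following sketch compares a pixel's color with the mean color of foreground-side samples and the mean color of background-side samples. The distance-ratio formula and the names mean_color and alpha_for_pixel are assumptions chosen for simplicity; the comparison algorithm itself is not specified in this passage.

```python
import math

def mean_color(samples):
    """Average a non-empty list of (r, g, b) tuples."""
    n = len(samples)
    return tuple(sum(c[i] for c in samples) / n for i in range(3))

def alpha_for_pixel(pixel, fg_samples, bg_samples):
    """Return an alpha in [0, 1]; 1 means fully foreground.

    Assumed rule (not from the source): alpha is the pixel's color distance
    to the background mean divided by the sum of its distances to both means.
    """
    fg = mean_color(fg_samples)
    bg = mean_color(bg_samples)
    d_fg = math.dist(pixel, fg)
    d_bg = math.dist(pixel, bg)
    if d_fg + d_bg == 0:
        # Foreground and background samples are identical to the pixel;
        # treating it as foreground is an arbitrary choice for this sketch.
        return 1.0
    return d_bg / (d_fg + d_bg)
```

With white foreground samples and black background samples, a white pixel receives an alpha of 1, a black pixel 0, and a mid-gray pixel 0.5, which matches the white-to-black gradation described for the transition area.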
Several more detailed embodiments of the invention are described in the sections below. Section II describes an image-editing application of some embodiments. Section III describes several embodiments for performing edge detection of an image. Next, Section IV describes several detailed embodiments for defining a border for an image. Section V describes several embodiments by which a tunnel is generated from a border. Section VI describes several embodiments for generating a foreground to background transition within an image. Next, Section VII describes some embodiments that perform color correction on an image. Finally, Section VIII describes a computer system with which some embodiments of the invention are implemented.
The main display window 705 displays an image that can be edited using the image-editing tools 710-725. The set of edge detection tools 710 provides users with options for identifying edges in the image. The identified edges can be used for the definition of a border for the image. Some embodiments of the invention identify edges of an image as the image is loading and store the edges in memory for use in the subsequent definition of borders. The set of edge detection tools 710 of some embodiments includes a checkbox 730, labeled "Edge", that allows a user to re-identify the edges of a displayed image. In some embodiments, selection of the checkbox 730 causes the image-editing application to re-identify the edges and then display the image in an edge view. In the edge view, only the edges of the image are displayed in display window 705, rather than the actual image. Some embodiments also provide a user interface tool (e.g., a button, or a checkbox) that re-identifies the edges but does not cause the image-editing application to display the image in the edge view. Some embodiments do not allow a user to view the image in the edge view (but still are able to identify the edges of the image). Some embodiments also provide checkbox 735, labeled "Degrain prior Edge Detect", that when selected causes the image-editing application to apply a de-noise algorithm to the image before re-identifying edges. Other edge identification tools provided by some embodiments include sliders 740 that allow for users to alter parameters of the edge identification and de-noise algorithms. In some embodiments, the sliders 740 affect the sensitivity of the edge identification and de-noise algorithms. In other words, the sliders affect the likelihood that a potential edge will be identified. Some embodiments set the sensitivity of the edge identification and de-noise algorithms as defaults and do not provide sliders 740. Edge identification is described in detail in Section III.
The set of border definition tools 715 provides users with various options to use in order to define a border for the image by moving over the image in the display window 705 with a cursor. For example, some embodiments provide a checkbox 745, labeled "Tunnel", that allows a user to determine whether a tunnel will be generated around a border after the border is defined. Some embodiments provide a checkbox 750, labeled "Moving Avg", which allows a user to determine whether a search window will be displayed around the cursor while the user defines a border for the image. Definition of borders, including the definition and use of search windows is described in detail in Section IV.
The set of tunnel and alpha generation tools 720 allows users to (1) generate tunnels around defined borders, and (2) define, for each pixel in an image, the extent to which the pixel is in the foreground or background of the image. In some embodiments, the tunnel and alpha generation tools 720 include "Spline" button 755, which allows users of the image-editing application to define a border of an image in the display area 705 as a spline curve. From the defined border, some embodiments generate a tunnel as two sides around the defined border. The "Move" button 760 provided by some embodiments enables a user to view the sides of a generated tunnel as modifiable curves, and modify the sides. In some embodiments, the modifiable curves are Bézier splines. The slider 765 provided by some embodiments enables a user to modify the width of a tunnel, either before or after the tunnel is generated. Tunnel generation is also described in further detail in Section V.
Some embodiments generate alpha values based on a tunnel. Some embodiments sample pixels around the tunnel in order to generate the alpha values. Some embodiments of the image-editing application provide a slider 767 that enables a user to modify the width of the area from which pixels are sampled. Once the alpha values are generated, the image-editing application of some embodiments allows a user to toggle through different views that illustrate the alpha values in different ways. Some embodiments provide a drop-down menu 770 for selecting between the different views. Some embodiments provide a first view that illustrates the image in grayscale with the alpha value of a pixel denoted by the luminance of the pixel. Some embodiments provide a second view that overlays the alpha value of a pixel on the image. The overlay can be gray, red, blue, or green in some embodiments. Some embodiments provide a view that shows the image and any color correction based on the alpha. Alpha generation from a tunnel is discussed in detail below in Section VI. The image-editing application of some embodiments also provides a "Brush" button 775 that allows use of an alpha brush in conjunction with the alpha values. In some embodiments, the alpha brush can define pixels as either foreground, background, or somewhere in between, while in other embodiments the alpha brush can only define pixels as foreground pixels.
The set of color correction tools 725 provides a user with the ability to edit digital images. Color correction tools allow users to edit a selected section of a digital image. In some embodiments, the selected section for editing is selected using the other sets of tools such as border definition tools 715 and alpha generation tools 720. Color correction tools 725 allow users to modify features of a selected section such as the hue and saturation of the pixels. Some embodiments include color wheel 780 that enables a user to shift the color of a selection towards one color or another.
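One way a saturation adjustment could be limited to a selection is to blend the corrected color with the original color according to each pixel's alpha value. The sketch below assumes 8-bit RGB pixels and an alpha in [0, 1]; the function name boost_saturation, the gain parameter, and the linear blend are illustrative assumptions rather than the behavior of the color correction tools 725.

```python
import colorsys

def boost_saturation(rgb, alpha, gain=1.5):
    """Scale a pixel's saturation by gain, weighted by its alpha value.

    rgb is an (r, g, b) tuple of 0-255 values. alpha = 0 leaves the pixel
    unchanged and alpha = 1 applies the full correction (assumed blend rule).
    """
    original = tuple(c / 255.0 for c in rgb)
    h, l, s = colorsys.rgb_to_hls(*original)
    # Clamp so the boosted saturation stays in the valid [0, 1] range.
    corrected = colorsys.hls_to_rgb(h, l, min(1.0, s * gain))
    return tuple(
        round(255 * ((1.0 - alpha) * o + alpha * c))
        for o, c in zip(original, corrected)
    )
```

A pixel in the transition area, with an alpha between 0 and 1, receives a proportionally weaker correction, so the adjustment fades out across the foreground to background transition instead of stopping at a hard edge.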
It is to be understood by one skilled in the art that some embodiments of the image-editing application may include the functionalities as described but associated with different UI tools than those described. For example, one skilled in the art would recognize that a UI tool allowing a user to determine whether to graphically display a search window need not be a checkbox labeled "Moving Avg", but could be a checkbox with a different label, a menu option, a selectable button, etc. Similarly, UI tools shown as selectable buttons could be checkboxes, menu options, or other types of UI tools.
The image-editing tools and processes that are described above and below can be incorporated into any image-editing application by way of a plug-in, applet, or direct function incorporated within the application itself. Accordingly, different image-editing applications, such as Apple Aperture®, Apple iPhoto®, Adobe Photoshop®, and Adobe Lightroom® may each implement one or more of the image-editing tools described herein. Additionally, the image-editing tools and processes described above and below can be incorporated within the functionality of any other application (e.g., video-editing applications, digital photo albums, etc.), or within an operating system (e.g., Microsoft Windows®, Apple Mac OS®, etc.).
After receiving the selection of an image to display, the process 800 then computes (at 810) the edges of the image. In some embodiments, the process computes the edges using the Canny edge detection algorithm or a variant thereof. In other embodiments, the process uses different algorithms to compute the edges. Parameters for the edge computation can be set in some embodiments by the use of sliders 915, which a user can modify before selecting to open the image. In some embodiments, the parameters modify the sensitivity of the edge detection algorithm. After computing the edges, the process stores (at 815) the edges in a memory of the computer on which the process is running. The edges can thus be accessed for other processes such as defining a border for the image.
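The idea of computing edges once, with an adjustable sensitivity, and keeping them in memory for later border definition can be sketched as follows. This is not the Canny algorithm named above; it is a simpler Sobel gradient-magnitude threshold used as a stand-in, and the function name sobel_edges and the threshold parameter (playing the role of the sensitivity sliders) are assumptions.

```python
def sobel_edges(gray, threshold):
    """Return the set of (x, y) pixels whose gradient magnitude >= threshold.

    gray is a list of equal-length rows of intensities. Border pixels are
    skipped because the 3x3 Sobel kernels need a full neighbourhood.
    """
    height, width = len(gray), len(gray[0])
    edges = set()
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Horizontal and vertical Sobel responses.
            gx = (gray[y - 1][x + 1] + 2 * gray[y][x + 1] + gray[y + 1][x + 1]
                  - gray[y - 1][x - 1] - 2 * gray[y][x - 1] - gray[y + 1][x - 1])
            gy = (gray[y + 1][x - 1] + 2 * gray[y + 1][x] + gray[y + 1][x + 1]
                  - gray[y - 1][x - 1] - 2 * gray[y - 1][x] - gray[y - 1][x + 1])
            if (gx * gx + gy * gy) ** 0.5 >= threshold:
                edges.add((x, y))
    return edges
```

The returned set can be stored and queried later while a border is being drawn, so no edge computation is needed as the cursor moves. A lower threshold marks more pixels as edges, and a higher one marks fewer.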
At 820, after the edges are computed, the process displays the image in the display window. Figure 10 illustrates the result of image file 905 being selected by a user. Figure 10 illustrates image 1005 displayed in display window 1010. The edges of image 1005 are already computed before the image is displayed. At 825, the pre-computed edges are used in the definition of a border of the displayed image. In some embodiments, the border is snapped to the edges as the border is defined by a cursor moving over the image. Border definition using the pre-computed edges is described in detail in Section IV.
If the process 1100 has not received a request to recompute edges, the process determines (at 1115) whether the image is still open. If the image is still open, the process returns to 1105 and continues displaying the image. If the image is not still open, process 1100 ends. If a user has closed the image, or opened a different image for editing, then the process would determine that the image is not still open. If, at 1110, the process determines that a request to recompute edges has been received, the process then determines (at 1120) whether a de-noise functionality is selected. In some embodiments, a user selects the de-noise functionality by selecting a UI tool such as checkbox 735. If the de-noise functionality is not selected, the process 1100 computes (at 1130) the edges of the image using the edge detection algorithm. The process then stores (at 1135) the edges and, in some embodiments, displays (at 1140) the edges of the image. Figure 12 illustrates the edges of image 1205. In Figure 12, checkbox 1210 is selected in order to recompute and display the edges of image 1205. Checkbox 1215 is not selected, indicating that the de-noise functionality is not selected.
If the process determines (at 1120) that the de-noise functionality is selected, the process then applies (at 1125) a de-noise algorithm to the image. A de-noise algorithm is applied to an image before performing edge identification such that the edge identification algorithm will only compute the most relevant edges of the image. In some embodiments, the de-noise algorithm is a bilateral or anisotropic filtering algorithm. Such an algorithm blurs the image only where a low frequency texture exists. In other words, areas in which a sharp edge exists will be left alone, while areas of the image with no sharp edges will be blurred. This will result in the relevant (i.e. real) edges being maintained while the areas without relevant edges are blurred. Some embodiments allow a user to modify at least one parameter of the de-noise algorithm. For example, some embodiments provide a slider such as one of the sliders 740 of Figure 7 for adjusting the sensitivity of the de-noise algorithm. After applying the de-noise algorithm to the image, the process then computes (at 1130) the edges of the image. Because the areas of the image not containing relevant edges are blurred with the de-noise algorithm, the edge detection algorithm is less likely to compute irrelevant (i.e. false) edges. The process 1100 then stores (at 1135) the edges in a memory of a computer on which the process is running.
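The edge-preserving behavior of a bilateral filter can be seen in a small pure-Python sketch operating on a grayscale image stored as a list of rows. The parameter names radius, sigma_space, and sigma_range are assumptions for illustration; sigma_range plays a role comparable to the sensitivity slider, since it controls how large an intensity difference must be before a neighbor stops contributing.

```python
import math

def bilateral_filter(gray, radius=1, sigma_space=1.0, sigma_range=10.0):
    """Smooth gray while leaving strong intensity steps largely intact."""
    height, width = len(gray), len(gray[0])
    out = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            centre = gray[y][x]
            total = 0.0
            weight_sum = 0.0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    yy, xx = y + dy, x + dx
                    if not (0 <= yy < height and 0 <= xx < width):
                        continue
                    value = gray[yy][xx]
                    # Spatial weight: nearer neighbours count more.
                    w_space = math.exp(-(dx * dx + dy * dy) / (2 * sigma_space ** 2))
                    # Range weight: neighbours with similar intensity count more,
                    # so pixels across a sharp edge contribute almost nothing.
                    w_range = math.exp(-((value - centre) ** 2) / (2 * sigma_range ** 2))
                    total += w_space * w_range * value
                    weight_sum += w_space * w_range
            # weight_sum is at least 1 because the centre pixel has weight 1.
            out[y][x] = total / weight_sum
    return out
```

Running an edge detector on the filtered image instead of the original tends to suppress small texture edges while keeping the step between regions, which is the effect described above.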
After computing and storing the edges of the image, some embodiments display (at 1140) the edges of the image. Figure 13 illustrates the edges of image 1305. In Figure 13, checkbox 1310 is selected in order to recompute and display the edges of image 1305. Checkbox 1315 is also selected, indicating that the de-noise functionality is selected and that the de-noise algorithm was applied before the shown edges were computed. Comparing Figure 12 and Figure 13, it can be seen that area 1220 includes many small computed edges, whereas the corresponding area 1320 does not include any computed edges as a result of the de-noise algorithm having been applied. Overall, the edges of image 1305 are sharper and more defined than the edges of image 1205. Some embodiments do not display the edges of the image, but instead only compute the edges and store them in memory. Some embodiments include an option as to whether to display the edges when the edges are re-computed.
Some embodiments of the invention allow a user to define a border for an image. In some embodiments, the border automatically snaps to the edges of the image. In some embodiments, by snapping the border to the edges it is meant that a segment of the border is automatically drawn along an edge of the image in the vicinity of a cursor moving over the image, as opposed to following the exact path of the cursor. In some embodiments, the edges are computed as described above in Section III. When the edges are pre-computed, the border snaps to the pre-computed edges, as opposed to computing edges as the cursor moves over the image to define the border.
Some embodiments of the invention search for identified edges while defining a border for an image. Some embodiments define a search area around a cursor in which to search for identified edges. Some embodiments base at least one property of the search area on an attribute of the cursor. The property of the search area is the size of the search area in some embodiments. Other embodiments base the orientation or shape of the search area on the attribute of the cursor. In some embodiments the attribute is the speed of the cursor. Other embodiments base the property of the search area on other attributes of the cursor, such as the acceleration.
At 1415, the process 1400 defines a search window for edges of the image based on the speed of the cursor. In some embodiments, the edges are pre-computed, with or without applying a de-noise algorithm, as described in Section III. The speed of the cursor is determined by how quickly the cursor moves over the image. In some embodiments, the process determines the speed based on pixels per unit of time, inches per unit of time, or other appropriate units for speed. The search window is used to search for edges to which the process automatically snaps the selected border. In some embodiments, snapping the border to an identified edge entails (1) identifying a second point along the edge in the vicinity of a first point (different than the second) over which the cursor is moving, and (2) drawing the border over the second point rather than having the border follow the movement of the cursor.
In some embodiments, the size of the search window is proportional to the speed of the cursor, such that if the cursor moves faster the search window is defined to be larger. This is a linear proportionality in some embodiments, while other embodiments use other relationships between the cursor speed and the search window size. Some embodiments define a proportional relationship over a range of search window sizes, but have a minimum and/or maximum search window size such that the search window is never too small (i.e., very close to zero) or too large (i.e., the entire image).
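The linear, clamped relationship between cursor speed and search window size can be sketched as below. The function names, the scale factor, and the minimum and maximum sizes are illustrative assumptions; speed is measured here in pixels per second from two timestamped cursor samples.

```python
import math

def cursor_speed(p0, t0, p1, t1):
    """Speed in pixels per second between two timestamped cursor positions."""
    if t1 <= t0:
        raise ValueError("timestamps must be increasing")
    return math.hypot(p1[0] - p0[0], p1[1] - p0[1]) / (t1 - t0)

def search_window_size(speed, scale=0.25, min_size=8.0, max_size=200.0):
    """Window size proportional to speed, clamped to [min_size, max_size].

    scale, min_size and max_size are hypothetical tuning values.
    """
    return max(min_size, min(max_size, scale * speed))
```

For example, a cursor that moves from (0, 0) to (30, 40) in half a second has a speed of 100 pixels per second, which maps to a window size of 25 with these values. A stationary cursor still gets the minimum size, and a very fast one is capped at the maximum.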
The shape of the search window is different in different embodiments. Some embodiments define a circular search window in which the radius of the circle is proportional to the speed of the cursor. Some embodiments, on the other hand, define a square search window in which the length of the sides of the square is proportional to the speed of the cursor. Some embodiments center the search window, whether circular, square, or other shape, at the point of the cursor. In some embodiments that center the search window at the cursor, the search window is not centered when the cursor is at the edge of the image. This allows for full utilization of the entire search window. Some embodiments use a search window that is not centered at the point of the cursor.
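For the square case, placement of the window, including the shift away from center near the image boundary, might look like the following sketch. The function name place_window and the (left, top, side) return convention are assumptions made for illustration.

```python
def place_window(cx, cy, side, image_w, image_h):
    """Return (left, top, side) for a square window around the cursor.

    The window is centred on (cx, cy) when it fits. Near the image boundary
    it is shifted so that the whole window stays inside the image.
    """
    # A window can never be larger than the image itself.
    side = min(side, image_w, image_h)
    left = min(max(cx - side / 2, 0), image_w - side)
    top = min(max(cy - side / 2, 0), image_h - side)
    return left, top, side
```

With a 100 by 100 image and a side of 20, a cursor at (50, 50) yields a window starting at (40, 40), while a cursor at (2, 50) yields a window starting at (0, 40), so none of the search area falls outside the image.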
Some embodiments provide a graphical illustration of the search window for the user that allows the user to visualize the area in which the process is searching for edges. In some embodiments, an image-editing application provides an option to the user as to whether or not to display the graphical illustration. Figure 15 illustrates an image-editing application 1500, an image 1505, a border 1510 that is being selected, a cursor 1515, and a search window 1520 with side length 1525. The search window 1520 indicates the search window in which the image-editing application searches for edges within the image 1505. The side length 1525 of box 1520 is related to the speed of the cursor 1515. In the embodiment illustrated in Figure 15, the search window 1520 is square and centered about the cursor. As mentioned above, in other embodiments the search window can be shaped and centered differently. In Figure 15, side length 1525 (and thus search window 1520) is relatively small because the cursor is moving slowly.
After defining a search window at 1415, the process 1400 determines whether any identified edges of the image are within the search window. If at least one identified edge is within the search window, the process automatically snaps (at 1425) the border to an identified edge within the search window. If more than one identified edge is found within the search window, some embodiments snap the border to the edge closest to the cursor. Some embodiments determine whether a previously defined segment of the border is along an edge, and then snap the border to that edge if the edge continues within the search window.
If no identified edges are found within the search window, the process 1400 draws (at 1430) the border along the movement of the cursor. Figure 20 illustrates image 1505 and border sections 2010 and 2015. Border section 2010 is snapped to identified edges; specifically, the edges formed by the baby's face and shoulder against the background. Border section 2015 is not snapped to any identified edges, but instead follows the movement of the cursor through the background of the image.
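The snapping decision described in the two preceding paragraphs can be sketched as a single function, assuming the identified edges are stored as a collection of (x, y) pixels and the search window is a square given as (left, top, side). The name next_border_point is hypothetical, and the sketch implements only the closest-edge rule, not the variant that prefers the edge the border is already following.

```python
import math

def next_border_point(cursor, edge_pixels, window):
    """Return the point at which the border should be drawn.

    If any edge pixel lies inside the window, snap to the one closest to the
    cursor; otherwise the border simply follows the cursor.
    """
    left, top, side = window
    cx, cy = cursor
    candidates = [
        (x, y) for (x, y) in edge_pixels
        if left <= x <= left + side and top <= y <= top + side
    ]
    if not candidates:
        return cursor
    return min(candidates, key=lambda p: math.hypot(p[0] - cx, p[1] - cy))
```

Calling this for each cursor sample and appending the result to the border produces snapped sections where edges are nearby and free-drawn sections elsewhere.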
After drawing the border at either 1425 or 1430, the process proceeds to 1435 and determines whether the current cursor speed is changed from the speed used to initially define the search window at 1415. If the cursor speed is changed, the process redefines (at 1440) the search window based on the cursor speed. In embodiments for which the search window size is proportional to the cursor speed, the process enlarges the search window if the cursor speed is increased and reduces the search window if the cursor speed is decreased. After redefining the search window based on the new cursor speed, the process proceeds to 1445 and determines if input to define the border is still being received. In some embodiments, if a mouse button is still held down then input to define the border is still being received. Other embodiments begin and end the definition of a border with a click and release of a mouse button, such that the mouse button is not held down during the definition process. If at 1435 the process 1400 determines that the cursor speed is not changed, the process proceeds directly to 1445. If border definition input is still being received, the process proceeds to 1420 to continue drawing the border. If border definition input is no longer being received, the process ends.
Some embodiments of the invention delete at least one segment of a previously defined border. In some embodiments, segments are deleted during the process of defining the border. Some embodiments delete segments of the border when the cursor used to define the border moves back over the previously defined border. Other embodiments delete the border when the cursor moves to within a threshold distance of the previously defined border. Other embodiments use other mechanisms to delete segments of the border, such as a combination of the direction of the cursor and the proximity of the cursor to the border.
In some embodiments, the border includes both snapped points (i.e., points that are drawn along identified edges) and non-snapped points (i.e., points that are drawn along the cursor movement). The reference points 2225 are drawn at each of the snapped points in some embodiments, while in other embodiments the reference points 2225 are drawn at a subset of the snapped points. In yet other embodiments, the reference points 2225 are drawn equidistant from each other. Still other embodiments draw the reference points 2225 at varying distances along the border based on how fast the cursor moves to draw the border. Some embodiments define the border as a set of coordinates, as described below.
At 2125, process 2100 determines whether input to define the border is still being received. In some embodiments, if the mouse button is still held down then input to define the border is still being received. Other embodiments begin and end definition of the border with a click and release of a mouse button, such that the mouse button is not held down during the definition process. If input to define the border is no longer being received, the process 2100 ends, as the border definition process is finished. If border definition input is still being received, the process determines (at 2130) whether the cursor is moving away from the last reference point. The last reference point is the most recently drawn reference point on the border. In Figure 22 , the last reference point is at 2235. If the cursor is moving away from the last reference point, the process continues drawing (at 2135) the border, displaying new segments and reference points in between the segments. Figure 23 illustrates new segments 2335 with reference points that have been added to border 2215 as a user continues to draw the border by moving the cursor away from the previously defined border. After 2135, the process returns to 2125 to determine whether border definition input is still being received.
If, at 2130, the process determines that the cursor is not moving away from the last reference point, the process proceeds to 2140 and determines whether the cursor has moved back over the previously defined border. The previously defined border includes all of the border that has been drawn and not yet deleted. In doing so, in some instances a user would essentially be retracing the already-drawn border in the direction opposite which the border was drawn. A user might also loop back to a point along the previously defined border. If the cursor has not moved back over the previously defined border, the process returns to 2125 to determine whether border definition input is still being received. If the cursor has moved back over the border, the process deletes (at 2145) at least one segment of the border. When segments are deleted, this sets a new reference point as the last reference point for the purposes of 2130.
Some embodiments do not require a user to actually retrace the border opposite the direction in which the border was drawn, but will delete at least one segment if the cursor is moved back onto the previously defined border, even if the cursor is moved in a more roundabout way than directly retracing the border. If a user places the cursor over the previously defined border more than one segment back from the end of the border, some embodiments delete all segments of the border beyond the location of the cursor. After 2145, the process returns to 2125 to determine whether border definition input is still being received. The process also returns to 2125 from 2140 if it determines that the cursor has not moved back over the previously defined border. For example, if a user holds down the mouse button but does not move the cursor, then in some embodiments no new segments of the border would be drawn, and no segments would be deleted, but border definition input would still be being received.
In the process described immediately above, some embodiments define the border as an ordered set of coordinates while the border is being defined. For example, each pixel along the defined border is assigned an (x, y) pair. When the cursor moves over a pixel with a particular (x, y) pair that is already in the set of coordinates, the process deletes all the coordinates in the ordered set that come after the particular coordinate pair. Some embodiments delete all the coordinates when the cursor moves within a threshold distance of the particular coordinate pair.
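The ordered-coordinate approach described above can be illustrated with a short sketch. The function name `extend_border` and the use of a Python list of (x, y) tuples are assumptions made for the example; the threshold-distance variant is not shown.

```python
def extend_border(border, point):
    """Add a cursor position to an ordered border, deleting on retrace.

    border is an ordered list of (x, y) pixel coordinates. If point is
    already in the list, every coordinate after it is deleted; otherwise
    point is appended as the newest border coordinate.
    """
    if point in border:
        idx = border.index(point)
        del border[idx + 1:]  # drop everything drawn after this pixel
    else:
        border.append(point)
    return border
```

For example, after drawing (0, 0), (1, 0), (2, 0), (3, 0) and then moving the cursor back over (1, 0), the border would contain only (0, 0) and (1, 0), and drawing could continue from there in a new direction.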
In Figure 23 , some of the new segments 2335 are snapped to the baby's bib. However, it is more likely the case that a user is attempting to define a border around the entire baby and would prefer to have the border run down the baby's arm. Figures 24 and 25 illustrate the use of process 2100 to delete segments of a border and then define new segments. Figure 24 illustrates the border 2215 after a user has moved the cursor back over the previously defined border to point 2435, thereby deleting a number of the segments 2335 that were incorrectly drawn. Figure 25 illustrates the border after the user has continued the selection of the border 2215 down the baby's arm. Figure 25 illustrates new segments 2535. From the position of the cursor 2210 in Figure 25 , the user can either finish selection of the border or continue down the baby's arm.
Some embodiments allow a user to modify a border after the border is defined. In some embodiments, the border will attempt to automatically snap to edges as the border is modified. If a user determines that it would be preferable for a border defined along edges to not run along an edge for at least one point, the user can move the border off of the edge. Some embodiments use such a border to define a selection with no transition, and thus do not generate a tunnel or a foreground to background transition from the defined border.
In some embodiments, the border is treated as a parametrizable curve with several modifiable points. A parametrizable curve is a curve that is defined about certain definition points by a particular equation or set of equations. This is in contrast to a raster curve, which is defined by the set of all pixels that make up the curve. In some embodiments, the parametrizable curve can be modified by moving the definition points. In some embodiments (e.g., embodiments using bezier splines), the definition points for modifying the curve lie on the curve. In other embodiments (e.g., embodiments using b-splines), the definition points for modifying the curve lie off of the curve. In addition to the definition points, some embodiments (e.g., embodiments using bezier splines) define the parametrizable curve based on tangents to the curve at the specific points as well. Users can add definition points to the curve in some embodiments, and then modify the curve based on the new point.
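As an illustration of a parametrizable curve, the sketch below evaluates one cubic bezier segment. It assumes the common cubic form, in which the end points `p0` and `p3` lie on the curve and the handle points `p1` and `p2` set the tangents at those ends; the description does not state which degree of bezier spline is used, so this is one possible reading.

```python
def cubic_bezier(p0, p1, p2, p3, t):
    """Point on a cubic bezier segment at parameter t in [0, 1].

    p0 and p3 are definition points on the curve; p1 and p2 are the
    tangent handles (assumed cubic form, not confirmed by the source).
    """
    u = 1.0 - t
    # Bernstein basis weights for a cubic segment.
    b0, b1, b2, b3 = u ** 3, 3 * u * u * t, 3 * u * t * t, t ** 3
    x = b0 * p0[0] + b1 * p1[0] + b2 * p2[0] + b3 * p3[0]
    y = b0 * p0[1] + b1 * p1[1] + b2 * p2[1] + b3 * p3[1]
    return (x, y)
```

Moving a definition point or a handle changes only these four inputs, and the whole segment is redrawn from the equation, which is what distinguishes such a curve from a raster curve stored pixel by pixel.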
Some embodiments of the invention generate a two-dimensional tunnel about a portion of the image based on movement of a cursor through the portion of the image. In some embodiments, the tunnel defines a boundary within the image. Rather than a hard edge, the tunnel is a transitional edge or region that can be multiple pixels wide. The tunnel of some embodiments is generated as a pair of deformable curves. Some embodiments allow both curves to be modified either together or separately.
After receiving input to define the border, the process 2900 determines (at 2915) an initial width for the tunnel that will be generated. Some embodiments determine the initial width based on a UI tool that can be modified by a user. For example, in some embodiments the initial width is based on the setting of a linear slider tool. Referring to Figure 30, the slider 3015 is the linear slider tool of some embodiments that is used to determine the initial width of the tunnel. Some embodiments use other ways to define the initial width of the tunnel, such as a numeric input from a user.
After determining the initial width for the tunnel, the process determines (at 2920) whether the tunnel will intersect with itself if the two sides of the tunnel have the same shape (that of the defined border). If a border is defined such that it is traced in one direction and then continues back in the opposite direction near the previously defined section, then depending on the initial width it might be impossible to generate a tunnel with both sides having the same shape. If generating the tunnel with both sides having the same shape will not cause the tunnel to intersect itself, the process generates (at 2925) the tunnel at the width determined at 2915. The tunnel is generated such that the sides of the tunnel run parallel to the defined border and are equidistant from the defined border. Figure 31 illustrates image 3005 with tunnel 3110 generated at a constant width from border 3010. Slider 3015 is set at a first distance from the left side that is one-fourth of the distance to the right side of the slider. As the defined border 3010 is mostly straight, generating tunnel 3110 with the two sides having the same shape (and therefore the tunnel having a constant width) does not result in the tunnel intersecting itself.
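The constant-width case (generation at 2925) can be sketched by offsetting each border point by half the tunnel width along the local normal, once to each side. The sketch assumes the border is available as a list of (x, y) points and estimates the local direction from neighboring points; the function name `tunnel_sides` is hypothetical, and the sketch neither detects nor repairs self-intersection.

```python
import math

def tunnel_sides(border, width):
    """Return two polylines parallel to border, each width / 2 away from it.

    Assumption: border is an ordered list of at least two (x, y) points.
    This covers only the constant-width case; it does not check whether
    the resulting sides intersect themselves.
    """
    half = width / 2.0
    side_a, side_b = [], []
    n = len(border)
    for i, (x, y) in enumerate(border):
        # Estimate the local direction from the neighboring points.
        x0, y0 = border[max(i - 1, 0)]
        x1, y1 = border[min(i + 1, n - 1)]
        dx, dy = x1 - x0, y1 - y0
        length = math.hypot(dx, dy) or 1.0  # guard against repeated points
        nx, ny = -dy / length, dx / length  # unit normal to the direction
        side_a.append((x + half * nx, y + half * ny))
        side_b.append((x - half * nx, y - half * ny))
    return side_a, side_b
```

For a straight horizontal border and a width of 4, the two sides are simply copies of the border shifted by 2 pixels in opposite vertical directions, so both sides have the same shape as the border.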
If the process 2900 determines at 2920 that the tunnel will intersect itself when generated with the two sides having the same shape, then the process generates (at 2930) the tunnel, varying the shape of one or both sides where necessary to avoid self-intersection. The process attempts to draw the tunnel at as constant a width as possible, and will modify the shape of one or both sides of the tunnel in order to keep the width as close to constant as possible without having the tunnel self-intersect. In some embodiments, the modification is kept as minimal as possible. Figure 32 illustrates a defined border 3210 displayed on image 3205, along with slider 3215. Slider 3215 is set near the halfway point to determine the width of a tunnel to be generated from border 3210. Figure 33 illustrates the tunnel 3310 with non-constant width generated from the defined border 3210. The primary tunnel width is set by slider 3215. Tunnel 3310 has sides 3320 and 3325. Because of the shape of border 3210, tunnel 3310 is generated with the two sides 3320 and 3325 shaped differently. Specifically, within area 3330, side 3325 is pinched down as compared to side 3320. The width of the tunnel is kept as close to constant as possible, although the width does decrease slightly close to where side 3325 comes to a point.
Once the tunnel is generated, the process determines (at 2935) whether any modifications to the tunnel are received. Some embodiments allow modifications to the width of the tunnel. The modifications of some embodiments can also be modifications to the shape of one or both sides of the tunnel.
Some embodiments allow modifications directly to the shape of the sides of the tunnel. The sides of the tunnel are defined as parametrizable curves in some embodiments. In some embodiments, the parametrizable curves are bezier splines that are made up of a set of modifiable points. Figure 36 illustrates image 3605, tunnel 3610, cursor 3615, and UI item 3620 (in the embodiment shown, a button labeled "Move"). Tunnel 3610 encircles the baby's hand. Figure 36 illustrates a user utilizing the cursor 3615 to select UI item 3620. Figure 37 illustrates a close-up of tunnel 3610 after the user has clicked on UI item 3620, which displays the two sides 3705 and 3710 of the tunnel as bezier spline curves. The spline curves include control points and tangents, including corresponding inner and outer control points 3715 and 3720 and tangents 3725 and 3730. The outer tangent 3725 is longer than the inner tangent 3730 because tangent 3725 is on the exterior of a curve while corresponding tangent 3730 is on the interior of the curve. In some embodiments, a user can manipulate the spline curve in order to modify the tunnel. Modifications to one spline do not affect the other spline in some embodiments. The control points, such as points 3715 and 3720, can be used to move that point on the spline in any direction in the image. In some embodiments, moving a control point on one spline causes both splines to move according to the movement of the control point. When a control point is moved, in some embodiments nearby points are moved as well according to the mathematics of bezier spline curves, which is well known to one of ordinary skill in the art.
Returning to process 2900, if the process receives (at 2935) a modification to the tunnel, the process modifies (at 2940) the tunnel in accordance with the received modifications. In some embodiments, the modifications include those described above such as modifications to the width of the tunnel or modifications to one or both of the bezier splines. If no modifications are received at 2935, or after the tunnel is modified (at 2940) in accordance with any received modifications, the process determines at 2945 whether to continue allowing modifications to the tunnel. In some embodiments, if a user has generated alpha values for pixels inside the tunnel, then the tunnel can no longer be modified. If a user has started to define a new border in order to generate a new tunnel, then in some embodiments the previous tunnel cannot receive any more modifications. If the process determines to continue receiving modifications, the process returns to 2935. If not, the process 2900 ends.
At 4015, the process 4000 generates a transition tunnel region about the section of interest from the defined border. Some embodiments generate the tunnel region as described in Section V above. Figure 43 illustrates a tunnel region 4310 generated about the section of interest 4105. Unlike border 4210, the interior of the tunnel 4310 includes all of the pixels from the transition region 4110. Finally, at 4020, the process 4000 analyzes image data to determine which pixels in the tunnel region are part of the section of interest. Some embodiments compare pixels on either side of the tunnel (some of which are defined as being in the section of interest and some of which are defined as not being in the section of interest) to pixels on the interior of the tunnel, and classify the interior pixels based on the comparison. Some embodiments generate alpha values for each of the interior pixels. Figure 44 illustrates pixels in black 4405 that are defined as being in the section of interest and pixels in white that are defined as not being in the section of interest.
At 4515, the process 4500 generates a tunnel around the border. Some embodiments generate the tunnel in the manner described above in Section V. Figure 46 illustrates a portion of an image 4605. Figure 46 illustrates tunnel 4610, which divides the portion of the image into three sections: foreground 4615, background 4620, and transition section 4625. In some embodiments, the process determines which side is the background and which side is the foreground based on the direction in which a cursor is traced over the image to define the border. For example, if one imagines walking along the border as the cursor is traced over the image, some embodiments refer to the right side as the foreground and the left side as the background, or vice versa. Some embodiments allow a user to toggle this function such that the foreground and background switch sides. The toggle function can be used before or after drawing the border in some embodiments. Some embodiments enable a user to generate a foreground to background transition for one portion of an image, then generate a foreground to background transition for a second portion of the image. For example, Figure 46 illustrates one section of a baby's head; it would be possible in some embodiments to generate a foreground to background transition within tunnel 4610, then select a new border starting where 4610 ends. In some such embodiments, the foreground and background are defined based on the definition for the previously selected border.
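One way to decide which side of a directed border a pixel lies on is the sign of a two-dimensional cross product, as in the sketch below. The sketch tests a pixel against a single border segment only and assumes image coordinates in which y increases downward; the function name and the `foreground_on_right` flag (standing in for the user toggle) are hypothetical.

```python
def side_of_border(a, b, pixel, foreground_on_right=True):
    """Classify pixel relative to the border segment traced from a to b.

    Assumption: image coordinates, with x growing rightward and y growing
    downward, so a positive cross product means the pixel is on the right
    when walking from a to b.
    """
    cross = (b[0] - a[0]) * (pixel[1] - a[1]) - (b[1] - a[1]) * (pixel[0] - a[0])
    if cross == 0:
        return "on_border"
    on_right = cross > 0
    # The toggle swaps which side is treated as foreground.
    return "foreground" if on_right == foreground_on_right else "background"
```

Tracing the same border in the opposite direction swaps the roles of a and b, which flips the sign of the cross product and therefore swaps foreground and background, matching the direction-dependent behavior described above.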
After generating the tunnel, the process 4500 samples (at 4520) foreground and background pixels from the outside of the tunnel. In some embodiments, sampling a pixel involves determining chromatic properties of the pixel, such as the pixel's RGB values or luminance and chrominance values. Some embodiments define a rectangle that includes the tunnel and then sample pixels within the rectangle. In other embodiments, only a narrow strip of pixels just outside the tunnel is used for sample pixels. Some embodiments provide a user interface tool that allows a user to view the regions from which pixels are sampled. This allows a user in some embodiments to manipulate the sides of the tunnel, as described above in Section V, so as to sample pixels from the correct regions.
Some embodiments sample every pixel within the sampled region. Other embodiments only sample a fraction of the pixels within the region, such as every other pixel or every third pixel, for faster computation. Some embodiments also allow a user to directly modify the section of the image from which pixels are sampled (either for foreground or background, or both) without modifying the tunnel.
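Sampling only a fraction of the pixels in a region can be sketched as below. The sketch assumes the image is a row-major nested list of RGB tuples indexed as `image[y][x]` and that the sample region is given as an ordered list of (x, y) coordinates; both representations, and the name `sample_region`, are assumptions for illustration.

```python
def sample_region(image, coords, step=1):
    """Collect pixel values for every step-th coordinate of a sample region.

    step=1 samples every pixel, step=2 every other pixel, step=3 every
    third pixel, and so on.
    """
    return [image[y][x] for (x, y) in coords[::step]]
```

A larger step reduces the number of samples that later comparisons must consider, at the cost of a coarser picture of the foreground or background colors.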
After sampling the pixels, the process 4500 determines (at 4525) an alpha generation algorithm to generate the foreground to background transition inside the tunnel. In some embodiments, the process always uses the same algorithm. The algorithm is a random walks algorithm in some embodiments. In other embodiments, the process selects between more than one algorithm. Some embodiments select an algorithm based on a user decision as to which algorithm to use. Some embodiments prompt the user to select an algorithm, while other embodiments rely on a predefined selection by the user. Other embodiments select an algorithm based on the sampled pixels. For example, if the background is a bluescreen, an algorithm that creates a sharper foreground to background transition might be advantageously selected.
After the alpha generation algorithm is determined, the process 4500 proceeds to 4530 to start generating alpha values. At 4530, the process selects a pixel inside the tunnel. The process then calculates (at 4535) an alpha value for the selected pixel. The alpha value for the selected pixel is calculated using the alpha generation algorithm determined at 4525. The alpha value is calculated by comparing the pixel values of the samples to the pixel values of the selected pixel. As mentioned above, the pixel values of some embodiments are the chromatic properties of the pixel, such as RGB values or luminance and chrominance values. The calculated alpha value of some embodiments is a value from 0 to 1, inclusive. The alpha value gives the extent to which the pixel is part of the foreground and part of the background. In some embodiments, a value of 0 indicates the pixel is entirely in the background and a value of 1 indicates the pixel is entirely in the foreground. An alpha generation algorithm that would be used for a bluescreen background to create a sharper foreground to background transition would thus be more likely to calculate alpha values close to 0 and 1, as opposed to in the 0.25-0.75 range.
After calculating the alpha value for the selected pixel, the process 4500 determines (at 4540) whether any pixels remain inside the tunnel. If no more pixels remain, the process has finished alpha generation, and thus ends. If more pixels remain, the process returns to 4530 to select a new pixel inside the tunnel. The process continues calculating alpha values for pixels until all the pixels inside the tunnel have an alpha value. Some embodiments calculate the alpha value for each pixel based on the pixel values for the selected pixel and the sampled pixels. Other embodiments use the pixel values of the previously selected pixels and the alpha values calculated for those pixels to modify the algorithm for the calculation of alpha values for the remaining pixels.
Some embodiments of the image-editing application illustrate the calculated alpha values in at least one way. Some embodiments illustrate the calculated alpha values in multiple ways, such as red, green, blue, or gray overlays, showing only the alpha values in grayscale, or not displaying the alpha values at all. Figure 50 illustrates calculated alpha values overlaid on the image 4605. In foreground section 4615, all pixels are colored gray as this entire section has an alpha value of 1. In background section 4620, the overlay has no change because the alpha values are all 0. The alpha values for the foreground section 4615 and background section 4620 are defined as 1 and 0, respectively, during the sampling process. In the transition section 4625, the overlay illustrates that some of the alpha values are the same as the foreground (i.e., a value of 1) and some are the same as the background (i.e., a value of 0), while some are somewhere in between 0 and 1 (e.g., the pixels in the small encircled area 5030). Some embodiments define all pixels with an initial alpha value of zero (i.e., purely background), and only those areas defined otherwise as being at least partly in the foreground are given a nonzero alpha value.
Some embodiments of the image-editing application can also provide a view that only shows the alpha values. Figure 51 illustrates such a view for the alpha values from Figure 50. In Figure 51, alpha values of 1 are shown as white and alpha values of 0 are shown as black. Areas for which alpha values have not been calculated are designated as having an alpha value of 0, and are thus also black. The pixels within transition section 4625 have alpha values ranging from 0 to 1, and thus the section has pixels with varying shades of gray in addition to black and white. Some embodiments show alpha differently, for example with the foreground (alpha = 1) black and the background (alpha = 0) white.
Some embodiments allow a user to define alpha values for sections of an image with an alpha brush in addition to generating alpha values with the use of tunnels. Figure 52 illustrates the image-editing application of some embodiments with the alpha brush tool selected using button 5205. In some embodiments, selecting the alpha brush tool causes the image-editing application to provide the user interface items 5210. UI items 5210 allow a user to select whether the alpha brush will add alpha (i.e., define areas as foreground with alpha value equal to 1) or remove alpha (i.e., define areas as background with alpha value equal to 0).
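The per-pixel loop of 4530 through 4540 can be sketched with a deliberately simple stand-in for the alpha generation algorithm. The sketch below is not the random walks algorithm mentioned above; it is a nearest-sample color-distance ratio chosen only to show how sampled foreground and background pixel values might be compared with each pixel inside the tunnel. All function names and the dictionary representation of tunnel pixels are assumptions.

```python
import math

def color_distance(c1, c2):
    """Euclidean distance between two RGB tuples."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(c1, c2)))

def alpha_for_pixel(color, fg_samples, bg_samples):
    """Alpha in [0, 1]: 1 is entirely foreground, 0 is entirely background.

    Stand-in rule (an assumption, not the source's algorithm): compare the
    pixel with its nearest foreground and nearest background sample.
    """
    d_fg = min(color_distance(color, s) for s in fg_samples)
    d_bg = min(color_distance(color, s) for s in bg_samples)
    if d_fg + d_bg == 0:
        return 0.5  # pixel matches both sample sets; treat as ambiguous
    return d_bg / (d_fg + d_bg)

def generate_alpha(tunnel_pixels, fg_samples, bg_samples):
    """Compute an alpha value for every pixel inside the tunnel.

    tunnel_pixels maps (x, y) positions to RGB tuples.
    """
    return {
        pos: alpha_for_pixel(color, fg_samples, bg_samples)
        for pos, color in tunnel_pixels.items()
    }
```

Under this stand-in rule, a tunnel pixel whose color equals a foreground sample receives an alpha of 1, one that equals a background sample receives 0, and colors in between receive intermediate values.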
In some embodiments, the alpha brush is circular, while in other embodiments the alpha brush is a different shape (e.g., a square). The alpha brush of some embodiments can have a varying size. In some embodiments, a user clicks on a point in the image to start the creation of an alpha brush, and drags outward from the point to determine the radius of the alpha brush. Figure 53 illustrates an alpha brush 5305 with a radius 5310. Alpha brush 5305 also includes control point 5315. In some embodiments, after creating an alpha brush, a user clicks inside the control point to move the alpha brush around the image, thereby defining areas covered by the brush as either foreground or background. The alpha brush 5305 also includes the transition area 5320. The transition area of some embodiments transitions from an alpha value that is defined by most of the brush to the alpha value of zero outside the radius of the brush. In some embodiments, areas of the image that are only covered by the transition area will be defined to have an alpha between 0 and 1. Figure 53 also illustrates indicator 5325 that indicates whether the alpha brush is being used to add or remove alpha. Indicator 5325 is a plus sign, indicating that the brush is currently being used to add alpha.
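For a circular brush used to add alpha, the value assigned at a given pixel might be computed as in the sketch below. The linear falloff across the transition area and the parameter `transition_width` are assumptions; the description states only that the transition area moves from the brush's alpha value to zero at the radius.

```python
import math

def add_brush_alpha(pixel, center, radius, transition_width):
    """Alpha that a circular 'add' brush contributes at pixel.

    Inside the core of the brush the alpha is 1; across the transition
    area it falls off linearly (an assumed profile) to 0 at the radius.
    """
    d = math.hypot(pixel[0] - center[0], pixel[1] - center[1])
    inner = radius - transition_width  # edge of the full-strength core
    if d <= inner:
        return 1.0
    if d >= radius:
        return 0.0
    return (radius - d) / transition_width
```

With a radius of 10 and a transition width of 4, a pixel at the brush center gets an alpha of 1, a pixel 8 pixels from the center gets 0.5, and pixels at or beyond the radius are unaffected.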
As mentioned above, in some embodiments the alpha brush can also be used to remove alpha. Figure 56 illustrates alpha brush 5605 having radius 5610. Alpha brush 5605 defines areas covered by the brush as having an alpha of 0 (i.e., as background). Like brushes 5305 and 5405, alpha brush 5605 includes a control point 5615 in the center for moving the brush around the image. Figure 56 also illustrates indicator 5625, which is a minus sign in this figure to indicate that the alpha brush 5605 is for removing alpha. Figure 57 illustrates the use of alpha brush 5605 to remove alpha. Before the use of brush 5605, the entire area of an image shown in Figure 57 had an alpha value of 1. However, due to the use of cursor 5720 to move the brush 5605 around the image, area 5715 (the area covered by the brush 5605) has an alpha value of 0 while area 5710 remains with an alpha value of 1.
The image-editing application of some embodiments uses the calculated alpha values to perform color correction on a selected area of an image. The selection is defined by the calculated alpha values. In some embodiments, the extent to which color correction applies to a particular pixel is defined by the alpha value for that pixel. In some embodiments, the extent to which color correction applies to a particular pixel is given by the equation:
$$\text{Result} = \alpha F + (1 - \alpha) B$$
where F is an image where a color correction function has been applied, B is the original image without any color correction function applied, and α is the alpha value for the pixel. Thus, if a pixel has an alpha value of 1, color correction will be fully applied to the pixel. If a pixel has an alpha value of 0.75, then color correction will affect the pixel only three-fourths as much as it affects a pixel with an alpha value of 1. Of course, pixels with an alpha value of 0 will not be affected at all by color correction.
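The alpha-weighted application of color correction can be sketched per pixel as a linear blend between the corrected image F and the original image B. The linear form is an assumption consistent with the statements above (full effect at an alpha of 1, three-fourths effect at 0.75, no effect at 0); the function name `blend_pixel` and the RGB-tuple representation are illustrative.

```python
def blend_pixel(corrected, original, alpha):
    """Blend a color-corrected pixel (F) with the original pixel (B).

    alpha = 1 returns the corrected pixel, alpha = 0 returns the original,
    and intermediate values mix the two linearly (assumed form).
    """
    return tuple(
        alpha * f + (1.0 - alpha) * b
        for f, b in zip(corrected, original)
    )
```

For example, with a corrected pixel of (200, 100, 0), an original pixel of (100, 100, 100), and an alpha of 0.75, the result is (175.0, 100.0, 25.0): each channel moves three-fourths of the way from the original value to the corrected value.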
Some embodiments also use the selection for cutouts. For example, a selection defined by alpha values can be pasted into a new image. The cutouts of some embodiments are defined similarly to color correction. The alpha value defines the extent to which the pixel is part of the selection that is pasted into the new image.
As mentioned above, some embodiments allow a user to generate alpha values from multiple selected borders. This can be advantageous in that it allows a selection of an entire object in multiple pieces. Figures 58-62 illustrate the selection, in multiple pieces, of a baby's head in image 5805. Figure 58 illustrates a first tunnel 5810. Figure 59 illustrates alpha overlay 5910 generated from tunnel 5810 as well as alpha overlay 5915 generated from a second tunnel (not shown). Figure 60 illustrates the alpha overlay after the entire border of the head has been selected, in numerous sections. As can be seen in Figure 60 , a substantial region 6010 in the middle of the head does not have an assigned alpha value (i.e., would be treated as having an alpha value of zero). In such a situation, an alpha brush may be used in some embodiments to fill in the middle section. Some embodiments use the alpha brush described above in Section VI. Figure 61 illustrates the use of alpha brush 6110 to fill in area 6010. Figure 62 illustrates the result of using the alpha brush with the entire head selected. Figure 63 shows the alpha values for the head and surrounding area that were generated by use of the multiple selections and the alpha brush.
Once a section of an image is entirely selected, color correction can be applied to the section if the user so desires. Figure 64 illustrates the entirety of image 5805 including head 6410, along with color correction tools 6415. Color correction tools 6415 include color wheel 6420 and sliders 6425. The various color correction tools can be used to affect a selected portion of an image. In Figure 64, the baby's head 6410 is selected, although the alpha overlay is not shown because the image is in color correction mode (i.e., showing only the image with any color corrections, and not any alpha values). Figure 65 illustrates that color correction tools have been used to alter the color of the baby's head. The setting for the color wheel 6420 has been moved to the blue region, and saturation has been greatly increased using the saturation slider, one of the sliders 6425. As can be seen, the head 6410 is a different shade in Figure 65 than in Figure 64, due to the use of color correction tools.
Many of the above-described tools and applications are implemented as software processes that are specified as a set of instructions recorded on a machine readable medium (also referred to as computer readable medium). When these instructions are executed by one or more computational element(s) (such as processors or other computational elements like ASICs and FPGAs), they cause the computational element(s) to perform the actions indicated in the instructions. Computer is meant in its broadest sense, and can include any electronic device with a processor. Examples of computer readable media include, but are not limited to, CD-ROMs, flash drives, RAM chips, hard drives, EPROMs, etc.
In this specification, the term "software" is meant in its broadest sense. It can include firmware residing in read-only memory or applications stored in magnetic storage which can be read into memory for processing by a processor. Also, in some embodiments, multiple software inventions can be implemented as sub-parts of a larger program while remaining distinct software inventions. In some embodiments, multiple software inventions can also be implemented as separate programs. Finally, any combination of separate programs that together implement a software invention described here is within the scope of the invention.
The edge detector 6625 uses the original image data 6610 and identifies edges in the original image to produce edge data 6620. The edge identification process is described in detail in Section III above. The edge data 6620 is passed to the border drawing module 6615, which combines the edge data 6620 with input from the cursor driver 6635 to define a border 6630. The processes performed by the border drawing module 6615 for defining the border, including the generation of varying size search windows, are described in detail in Section IV above.
The border drawing module 6615 passes the defined border 6630 to the image-editing engine, which also receives the original image 6610 and the image-editing operation data 6640 as input. The image-editing operation data is color correction operations, such as hue or saturation adjustments, in some embodiments. From the border 6630, the image-editing engine 6605 of some embodiments determines alpha values for at least a section of the image. The image-editing engine 6605 applies the image-editing operations 6640 to the original image 6610 to produce a modified image 6650. How the image-editing operations 6640 are applied is based on the border 6630 (and alpha values generated based on the border). The processes performed by the image-editing engine are described in detail above in Sections V-VII.
The bus 6705 collectively represents all system, peripheral, and chipset buses that communicatively connect the numerous internal devices of the computer system 6700. For instance, the bus 6705 communicatively connects the processor 6710 with the read-only memory 6730, the GPU 6720, the system memory 6725, and the permanent storage device 6735.
From these various memory units, the processor 6710 retrieves instructions to execute and data to process in order to execute the processes of the invention. Some instructions are passed to and executed by the GPU 6720. The GPU 6720 can offload various computations or complement the image processing provided by the processor 6710. Such functionality can be provided using CoreImage's kernel shading language.
The read-only-memory (ROM) 6730 stores static data and instructions that are needed by the processor 6710 and other modules of the computer system. The permanent storage device 6735, on the other hand, is a read-and-write memory device. This device is a non-volatile memory unit that stores instructions and data even when the computer system 6700 is off. Some embodiments of the invention use a mass-storage device (such as a magnetic or optical disk and its corresponding disk drive) as the permanent storage device 6735.
Other embodiments use a removable storage device (such as a floppy disk or ZIP® disk, and its corresponding disk drive) as the permanent storage device. Like the permanent storage device 6735, the system memory 6725 is a read-and-write memory device. However, unlike storage device 6735, the system memory is a volatile read-and-write memory, such as a random access memory. The system memory stores some of the instructions and data that the processor needs at runtime. In some embodiments, the invention's processes are stored in the system memory 6725, the permanent storage device 6735, and/or the read-only memory 6730.
The bus 6705 also connects to the input and output devices 6740 and 6745. The input devices enable the user to communicate information and select commands to the computer system. The input devices 6740 include alphanumeric keyboards and pointing devices. The output devices 6745 display images generated by the computer system. For instance, these devices display a graphical user interface. The output devices include printers and display devices, such as cathode ray tubes (CRT) or liquid crystal displays (LCD).
Finally, as shown in Figure 67, bus 6705 also couples computer 6700 to a network 6765 through a network adapter (not shown). In this manner, the computer can be a part of a network of computers (such as a local area network ("LAN"), a wide area network ("WAN"), or an intranet) or a network of networks, such as the Internet. For example, the computer 6700 may be coupled to a web server (network 6765) so that a web browser executing on the computer 6700 can interact with the web server as a user interacts with a graphical user interface that operates in the web browser.
Any or all components of computer system 6700 may be used in conjunction with the invention. For instance, in some embodiments the execution of the image-editing functions is performed by the GPU 6720 instead of the CPU 6710. However, a common limitation of the GPU 6720 is the number of instructions that the GPU 6720 is able to store and process at any given time. Therefore, some embodiments adapt the instructions for implementing the image-editing processes so that these processes fit onto the instruction buffer of the GPU 6720 for execution locally on the GPU 6720. Additionally, some GPUs do not contain sufficient processing resources to execute the processes of some embodiments, and therefore the processor executes the instructions. One of ordinary skill in the art would appreciate that any other system configuration may also be used in conjunction with the present invention.
While the invention has been described with reference to numerous specific details, one of ordinary skill in the art will recognize that the invention can be embodied in other specific forms. For instance, while the Apple Mac OS® environment is used to create some of these examples, a person of ordinary skill in the art would realize that the invention may be practiced in other operating environments such as Microsoft Windows®, UNIX, Linux, etc., and applications such as Adobe Photoshop®, Adobe Lightroom®, Apple iPhoto®, etc., without the use of these specific details. Also, some of the examples may be executed on a GPU or CPU of a computer system depending on the computing resources available on the computer system, or alternatively on any electronic device that is able to view images. The examples have discussed application of the various image editing functions to images, but each of the above examples is extensible to apply to other forms of visual media such as video. Thus, one of ordinary skill in the art would understand that the invention is not to be limited by the foregoing illustrative details, but rather is to be defined by the appended claims.
Claims (15)
- A computer readable medium storing a computer program which when executed by at least one processor selects a section of interest in an image comprising a plurality of pixels, the computer program comprising sets of instructions for: drawing a curvilinear boundary (405) over the image about the section of interest, the curvilinear boundary having a particular shape; determining whether a two-dimensional transition tunnel region (505) can be generated without intersecting with itself about the section of interest as a pair of curves with one curve on either side of the drawn curvilinear boundary at a specified width apart, both curves having the same particular shape as the drawn curvilinear boundary; when the transition tunnel region (505) cannot be generated without intersecting with itself at the specified width with both curves having the same particular shape, generating (2930) the transition tunnel region (505) with a width as close to the specified width as possible with at least one of the curves modified from the particular shape to avoid the self-intersection; when the transition tunnel region (505) can be generated at the specified width with both curves having the same particular shape, generating (2930) the transition tunnel region at the specified width; and analyzing image data based on the transition tunnel region (505) to identify a subset of pixels in the transition tunnel region (505) that should be associated with the section of interest.
- The computer readable medium of claim 1, wherein the set of instructions for analyzing image data comprises a set of instructions for comparing pixels inside the transition tunnel region to pixels outside the transition tunnel region.
- The computer readable medium of claim 1, wherein the two-dimensional transition tunnel region about the section of interest divides the image into (i) a foreground region on one side of the transition tunnel comprising pixels that are part of the section of interest, (ii) a background region on the other side of the transition tunnel region comprising pixels that are excluded from the region of interest, and (iii) a transition region between the pair of curves comprising pixels for analysis.
- The computer readable medium of claim 3, wherein the set of instructions for analyzing image data comprises a set of instructions for comparing a pixel in the transition tunnel region to pixels in the foreground region and the background region to determine a value for the pixel.
- The computer readable medium of claim 4, wherein the value is for use in performing editing operations on the image.
- The computer readable medium of claim 4, wherein the value identifies an extent to which the pixel is associated with the section of interest.
- The computer readable medium of claim 3, wherein (i) the curvilinear boundary is drawn along the movement of a cursor over the image in a particular direction and (ii) the foreground and background regions are assigned based on the particular direction.
- A method for selecting a section of interest in an image comprising a plurality of pixels, the method comprising: drawing a curvilinear boundary (405) over the image about the section of interest, the curvilinear boundary having a particular shape; determining whether a two-dimensional transition tunnel region (505) can be generated without intersecting with itself about the section of interest as a pair of curves with one curve on either side of the drawn curvilinear boundary at a specified width apart, both curves having the same particular shape as the drawn curvilinear boundary; when the transition tunnel region (505) cannot be generated without intersecting with itself at the specified width with both curves having the same particular shape, generating (2930) the transition tunnel region (505) with a width as close to the specified width as possible with at least one of the curves modified from the particular shape to avoid the self-intersection; when the transition tunnel region (505) can be generated at the specified width with both curves having the same particular shape, generating (2930) the transition tunnel region (505) at the specified width; and analyzing image data based on the transition tunnel region (505) to identify a subset of pixels in the transition tunnel region (505) that should be associated with the section of interest.
- The method of claim 8, wherein the curvilinear boundary is a parametrizable curve.
- The method of claim 8, wherein analyzing the image data comprises comparing pixels inside the transition tunnel region to pixels outside the transition tunnel region.
- The method of claim 8, wherein the two-dimensional transition tunnel region about the section of interest divides the image into (i) a foreground region on one side of the transition tunnel comprising pixels that are part of the section of interest, (ii) a background region on the other side of the transition tunnel region comprising pixels that are excluded from the region of interest, and (iii) a transition region between the pair of curves comprising pixels for analysis.
- The method of claim 11, wherein analyzing the image data comprises comparing a pixel in the transition tunnel region to pixels in the foreground region and the background region to determine a value for the pixel, the value identifying an extent to which the pixel is associated with the section of interest.
- The method of claim 11, wherein (i) the curvilinear boundary is drawn along the movement of a cursor over the image in a particular direction and (ii) the foreground and background regions are assigned based on the particular direction.
- A system comprising: a set of processing units for executing sets of instructions; and a computer readable medium according to claim 1.
- The system of claim 14, wherein the program further comprises a set of instructions for identifying a set of edges of the image before drawing the curvilinear boundary, wherein the curvilinear boundary is drawn based on the identified edges.
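The tunnel-generation step recited in the independent claims can be loosely sketched in Python as follows. This is a simplified illustration under stated assumptions, not the claimed method: the boundary is treated as a polyline, each side curve is produced by shifting vertices along an estimated normal, and when a side curve crosses itself the sketch simply narrows the whole tunnel in fixed steps rather than modifying the shape of one curve as the claims describe. The names `offset_curve`, `self_intersects`, `build_tunnel`, and the `step` parameter are hypothetical.

```python
import math

def offset_curve(points, distance):
    """Shift each vertex of a polyline along its local left normal by `distance`."""
    shifted = []
    n = len(points)
    for i, (x, y) in enumerate(points):
        # Tangent estimated from the neighbouring vertices (one-sided at the ends).
        x0, y0 = points[max(i - 1, 0)]
        x1, y1 = points[min(i + 1, n - 1)]
        tx, ty = x1 - x0, y1 - y0
        length = math.hypot(tx, ty) or 1.0
        shifted.append((x - distance * ty / length, y + distance * tx / length))
    return shifted

def _orient(a, b, c):
    """Signed area test: positive if a, b, c turn counter-clockwise."""
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])

def _segments_cross(p, q, r, s):
    """True if segment pq properly crosses segment rs."""
    return (_orient(p, q, r) * _orient(p, q, s) < 0
            and _orient(r, s, p) * _orient(r, s, q) < 0)

def self_intersects(points):
    """Check every pair of non-adjacent segments of a polyline for a crossing."""
    segs = list(zip(points, points[1:]))
    for i in range(len(segs)):
        for j in range(i + 2, len(segs)):
            if _segments_cross(*segs[i], *segs[j]):
                return True
    return False

def build_tunnel(boundary, width, step=0.5):
    """Return (left_curve, right_curve, used_width) for a drawn boundary.

    Tries the requested width first; if either side curve crosses itself,
    the width is reduced by `step` until both curves are free of crossings
    (an assumed fallback that keeps the width close to the requested one).
    """
    w = width
    while w > 0:
        left = offset_curve(boundary, w / 2)
        right = offset_curve(boundary, -w / 2)
        if not self_intersects(left) and not self_intersects(right):
            return left, right, w
        w -= step
    return list(boundary), list(boundary), 0.0

# A gentle boundary keeps the requested width; a sharp "V" forces a narrower tunnel.
straight = [(0, 0), (10, 0), (20, 0)]
sharp_v = [(0, 0), (4, 1), (8, 2), (12, 3), (8, 4), (4, 5), (0, 6)]
_, _, straight_width = build_tunnel(straight, 4)
_, _, v_width = build_tunnel(sharp_v, 5)
```

For the sharp boundary, the inner side curve crosses itself at the requested width, so the sketch settles on a narrower tunnel. The pixels lying between the two returned curves would then be the ones passed on for analysis.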
Applications Claiming Priority (8)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12/154,989 | 2008-05-28 | | |
| US12/154,990 US8280171B2 (en) | 2008-05-28 | 2008-05-28 | Tools for selecting a section of interest within an image |
| US12/154,991 US8548251B2 (en) | 2008-05-28 | 2008-05-28 | Defining a border for an image |
| US12/154,987 US8331685B2 (en) | 2008-05-28 | 2008-05-28 | Defining a border for an image |
| US12/154,987 | 2008-05-28 | | |
| US12/154,991 | 2008-05-28 | | |
| US12/154,989 US8452105B2 (en) | 2008-05-28 | 2008-05-28 | Selecting a section of interest within an image |
| US12/154,990 | 2008-05-28 | | |
Related Parent Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| HK11108249.9A Addition HK1154103B (en) | 2008-05-28 | 2009-05-22 | Defining a border for an image |
Related Child Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| HK11108249.9A Division HK1154103B (en) | 2008-05-28 | 2009-05-22 | Defining a border for an image |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| HK1170833A1 (en) | 2013-03-08 |
| HK1170833B (en) | 2013-12-13 |