[go: up one dir, main page]

WO2022227393A1 - Image photographing method and apparatus, electronic device, and computer readable storage medium - Google Patents

Image photographing method and apparatus, electronic device, and computer readable storage medium Download PDF

Info

Publication number
WO2022227393A1
WO2022227393A1 PCT/CN2021/120722 CN2021120722W WO2022227393A1 WO 2022227393 A1 WO2022227393 A1 WO 2022227393A1 CN 2021120722 W CN2021120722 W CN 2021120722W WO 2022227393 A1 WO2022227393 A1 WO 2022227393A1
Authority
WO
WIPO (PCT)
Prior art keywords
human body
posture
recommended
scene
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2021/120722
Other languages
French (fr)
Chinese (zh)
Inventor
徐伟伟
黄岗桂
谢东明
王家园
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Sensetime Intelligent Technology Co Ltd
Original Assignee
Shanghai Sensetime Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Sensetime Intelligent Technology Co Ltd filed Critical Shanghai Sensetime Intelligent Technology Co Ltd
Priority to KR1020227018878A priority Critical patent/KR20220149503A/en
Publication of WO2022227393A1 publication Critical patent/WO2022227393A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/63Control of cameras or camera modules by using electronic viewfinders
    • H04N23/631Graphical user interfaces [GUI] specially adapted for controlling image capture or setting capture parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/61Control of cameras or camera modules based on recognised objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/61Control of cameras or camera modules based on recognised objects
    • H04N23/611Control of cameras or camera modules based on recognised objects where the recognised objects include parts of the human body
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/62Control of parameters via user interfaces
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/63Control of cameras or camera modules by using electronic viewfinders
    • H04N23/631Graphical user interfaces [GUI] specially adapted for controlling image capture or setting capture parameters
    • H04N23/632Graphical user interfaces [GUI] specially adapted for controlling image capture or setting capture parameters for displaying or modifying preview images prior to image capturing, e.g. variety of image resolutions or capturing parameters
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/64Computer-aided capture of images, e.g. transfer from script file into camera, check of taken image quality, advice or proposal for image composition or decision on when to take image

Definitions

  • the present disclosure relates to the field of computer technologies, and in particular, to an image capturing method and apparatus, an electronic device, and a computer-readable storage medium.
  • the embodiments of the present disclosure provide an image capturing technical solution.
  • an image capturing method including: acquiring a scene image corresponding to a shooting interface; recognizing the scene image, determining a scene category corresponding to the scene image and a scene category corresponding to the scene category the recommended human body posture; perform human key point detection on the scene image to determine the object in the scene image and the real human body posture of the object; by displaying the real human body posture and the recommended body posture in the shooting interface Human body posture, guide the object to perform posture adjustment according to the recommended human body posture; and obtain a photographed image of the object when the photographing conditions are met.
  • the image shooting effect is effectively improved, the shooting experience of the user is improved, and the shooting image that satisfies the user can be obtained.
  • the recommended human posture includes a single-person posture
  • the displaying the real human posture and the recommended human posture in the shooting interface includes: according to the object in the scene The position in the image, the real human body posture and the single-person posture are displayed in the shooting interface; or, according to the position of the object in the scene image, the real human body is displayed in the shooting interface.
  • gesture, and the single-person gesture is displayed at the first designated area of the photographing interface.
  • the recommended human body posture includes a multi-person combined posture
  • the method further includes: according to relative positions between multiple objects in the scene image, and each of the multi-person combined postures The relative position between the gestures, and the corresponding gestures of each object are determined from the combined gestures of the multiple people; wherein, displaying the real human body gesture and the recommended human body gesture in the shooting interface includes: for any object , according to the position of the object in the scene image, display the real human body posture of the object and the corresponding posture of the object in the shooting interface; or, according to the position of the object in the scene image , the real human body posture of the object is displayed in the shooting interface, and the corresponding posture of the object is displayed in the second designated area of the shooting interface.
  • the method further includes: determining a key point pair between the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; the first similarity of the key point pair, and from the first human body key points, determine a third human body key point whose first similarity is less than a first preset threshold; wherein, described in the shooting interface Displaying the real human body posture and the recommended human body posture includes: highlighting the region where the third human body key point is located, wherein the highlighting method includes at least one of highlighting, bolding, and changing color. In this way, the parts of the human body with low similarity among the displayed real human body postures can be highlighted, so as to more effectively guide the user to adjust the posture, so as to improve the shooting experience.
  • the method further includes: determining the real human body posture and the recommended human body according to a first similarity of key point pairs between the real human body posture and the recommended human body posture
  • the second similarity between postures, the key point pair is determined according to the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; the second similarity.
  • the method further includes: inputting the scene image to an area recommendation network to obtain a recommended photographing area of the scene image, where the recommended photographing area is used to represent the scene image in the scene image A recommended photographing area, the area recommendation network is a neural network obtained by training the first sample set marking the photographing area; wherein, after obtaining the recommended photographing area of the scene image, the method further includes at least one of the following: In the shooting interface, an identifier is used to indicate the recommended photographing area; the recommended human posture is displayed at the recommended photographing area of the photographing interface, and the recommended photographing area is used to guide the object according to the It is recommended to adjust the position of the photo area. In this way, a photographing area with better visual effect can be recommended to the subject, and the subject can be effectively guided to adjust the position to improve the visual effect of the photographed image, thereby improving the user's photographing experience.
  • recognizing the scene image, determining a scene category corresponding to the scene image and a recommended human pose corresponding to the scene category includes: performing a pose recommendation network on the scene image. Identify and determine the scene category corresponding to the scene image and the recommended human posture corresponding to the scene category, wherein the posture recommendation network is a neural network obtained by training the second sample set marked with the sample scene category and the sample human posture. network, the second sample set includes sample images uploaded by the object. In this way, the pose recommendation network can learn the human pose from the high-quality images, thereby effectively outputting high-quality recommended human poses based on the scene images.
  • the recommended human body postures include multiple ones
  • the method further includes: displaying in the shooting interface a plurality of posture diagrams corresponding to the recommended human body postures; in response to the selection of the posture diagrams Operation, according to the selected posture map, determine and display the recommended human body posture corresponding to the selected posture map.
  • the shooting conditions include: a second degree of similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold; or, in the shooting interface Shooting controls are triggered.
  • the method further includes: processing the captured image to obtain a processed captured image, wherein the processing includes at least one of the following: performing processing on an object in the captured image Beauty processing, adding a filter to the captured image; the adding a filter to the captured image includes: adjusting the saturation, color temperature, color temperature, etc. of the captured image according to the recommended filter corresponding to the recommended human posture. at least one of brightness.
  • the visual effect of the captured image can be improved, which is conducive to obtaining a captured image that satisfies the user, and improves the image capturing experience.
  • the method is applied to an artificial intelligence education platform, and the artificial intelligence education platform is connected with intelligent hardware; wherein, the artificial intelligence education platform is used to edit the project code for realizing the image capturing method , and send the project code to the intelligent hardware; the intelligent hardware is used to collect scene images and run the project code to obtain the operation result, and send the operation result to the artificial intelligence education platform to The operation result is displayed on the display interface of the artificial intelligence education platform.
  • the artificial intelligence education platform is used to edit the project code for realizing the image capturing method , and send the project code to the intelligent hardware; the intelligent hardware is used to collect scene images and run the project code to obtain the operation result, and send the operation result to the artificial intelligence education platform to The operation result is displayed on the display interface of the artificial intelligence education platform.
  • an image capturing apparatus comprising: an acquisition part configured to acquire a scene image corresponding to a shooting interface; a recognition part configured to recognize the scene image and determine the scene The scene category corresponding to the image and the recommended human body posture corresponding to the scene category; the detection part is configured to perform human key point detection on the scene image, and determine the object in the scene image and the real human body posture of the object
  • the display part is configured to guide the object to adjust the posture according to the recommended human body posture by displaying the real human body posture and the recommended human body posture in the shooting interface; the shooting part is configured to meet the requirements of shooting condition, a captured image of the object is obtained.
  • the recommended human posture includes a single-person posture
  • the display part includes: a first display sub-part, configured to display the position of the object in the scene image according to the position of the object in the scene image.
  • the real human body posture and the single-person posture are displayed in the shooting interface; or, the second display subsection is configured to display the object in the shooting interface according to the position of the object in the scene image.
  • the real human body posture is displayed, and the single-person posture is displayed in the first designated area of the shooting interface.
  • the recommended human body posture includes a multi-person combined posture
  • the apparatus further includes: a position determination part configured to The relative position between each posture in the multi-person combined posture, and the corresponding posture of each object is respectively determined from the multi-person combined posture
  • the display part includes: a third display sub-section, which is configured for any an object, according to the position of the object in the scene image, display the real human body posture of the object and the corresponding posture of the object in the shooting interface; or, the fourth display sub-section is configured to be based on The position of the object in the scene image, the real human body posture of the object is displayed in the shooting interface, and the corresponding posture of the object is displayed in the second designated area of the shooting interface.
  • the apparatus further includes: a first determining part configured to determine the distance between the first human body key point of the real human body posture and the second human body key point of the recommended human body posture The key point pair; the second determination part is configured to determine from the first human body key points that the first similarity is less than the first preset threshold by calculating the first similarity of the key point pair.
  • the third human body key point wherein, the display part, including: a highlight sub-section, is configured to highlight the region where the third human body key point is located, wherein the highlighting method includes highlighting, bolding, changing at least one of the colors.
  • the apparatus further includes: a third determination part, configured to determine the the second similarity between the real human body posture and the recommended human body posture, and the key point pair is determined according to the first human body key point of the real human body posture and the second human body key point of the recommended human body posture;
  • the similarity display part is configured to display the second similarity in the shooting interface.
  • the apparatus further includes: an area determination part, configured to input the scene image into an area recommendation network to obtain a recommended photographing area of the scene image, where the recommended photographing area is used for Representing the area recommended to be photographed in the scene image, and the area recommendation network is a neural network trained by marking the first sample set of the photographing area; wherein, the device further includes: an area display part, configured to In the shooting interface, an identifier is used to indicate the recommended photographing area; or, the recommended human posture is displayed in the recommended photographing area of the photographing interface; the recommended photographing area is used to guide the object according to the recommended photographing area. Adjust the position of the recommended photo area as described above.
  • the area display part is further configured to use an identifier to indicate the recommended photographing area in the photographing interface; and display the recommended photographing area on the photographing interface.
  • the recommended human posture is described.
  • the identifying part is further configured to: identify the scene image through a gesture recommendation network, and determine a scene category corresponding to the scene image and a recommended human body corresponding to the scene category pose, wherein the pose recommendation network is a neural network obtained by training a second sample set marked with sample scene categories and sample human poses, and the second sample set includes sample images uploaded by objects.
  • the pose recommendation network is a neural network obtained by training a second sample set marked with sample scene categories and sample human poses, and the second sample set includes sample images uploaded by objects.
  • the recommended human body postures include a plurality of postures
  • the apparatus further includes: a posture map display part configured to display the posture maps corresponding to the plurality of recommended human body postures in the shooting interface; selecting The part is configured to, in response to a selection operation on the posture map, determine and display a recommended human body posture corresponding to the selected posture map according to the selected posture map.
  • the shooting conditions include: a second degree of similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold; or, in the shooting interface Shooting controls are triggered.
  • the apparatus further includes: a processing part configured to process the captured image to obtain a processed captured image, wherein the processing includes at least one of the following: Performing beautification processing on the object in the captured image, and adding a filter to the captured image; the adding a filter to the captured image includes: adjusting the captured image according to the recommended filter corresponding to the recommended human posture at least one of saturation, color temperature, and brightness.
  • the device is applied to an artificial intelligence education platform, and the artificial intelligence education platform is connected to intelligent hardware; wherein, the artificial intelligence education platform is used to edit and implement the project code of the image capturing device , and send the project code to the intelligent hardware; the intelligent hardware is used to collect scene images and run the project code to obtain the operation result, and send the operation result to the artificial intelligence education platform to The operation result is displayed on the display interface of the artificial intelligence education platform.
  • an electronic device comprising: a processor; a memory configured to store instructions executable by the processor; wherein the processor is configured to invoke the instructions stored in the memory to execute the above method.
  • a computer-readable storage medium having computer program instructions stored thereon, the computer program instructions implementing the above method when executed by a processor.
  • a computer program comprising computer-readable code, which, when the computer-readable code is executed in an electronic device, implements the above method when executed by a processor in the electronic device.
  • the recommended human body pose corresponding to the scene category can be recommended to the user according to the scene category of the scene image, so that the recommended human body pose matches the scene image, and by displaying the real human body pose and The recommended human posture can guide the user to adjust the posture according to the displayed recommended human posture or facilitate the user to guide the subject to adjust the posture according to the displayed recommended human posture. Satisfied taking pictures.
  • FIG. 1 shows a flowchart of an image capturing method according to an embodiment of the present disclosure.
  • FIG. 2 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.
  • FIG. 3 shows a schematic diagram of a multi-person combined gesture according to an embodiment of the present disclosure.
  • FIG. 4 shows a schematic diagram of a real human body pose according to an embodiment of the present disclosure.
  • FIG. 5 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.
  • FIG. 6 shows a schematic diagram of a scene image according to an embodiment of the present disclosure.
  • FIG. 7 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.
  • FIG. 8 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.
  • FIG. 9 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.
  • FIG. 10 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.
  • FIG. 11a and 11b illustrate schematic diagrams of a photographing interface according to an embodiment of the present disclosure.
  • FIG. 12 shows a schematic diagram of an application of an image capturing method according to an embodiment of the present disclosure.
  • FIG. 13 shows a block diagram of an image capturing apparatus according to an embodiment of the present disclosure.
  • FIG. 14 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
  • FIG. 1 shows a flowchart of an image capturing method according to an embodiment of the present disclosure. As shown in FIG. 1 , the image capturing method includes:
  • step S11 obtain a scene image corresponding to the shooting interface
  • step S12 the scene image is identified, and the scene category corresponding to the scene image and the recommended human posture corresponding to the scene category are determined;
  • step S13 the human body key point detection is performed on the scene image, and the object in the scene image and the real human body posture of the object are determined;
  • step S14 the real human body posture and the recommended human body posture are displayed in the shooting interface, so as to guide the object to adjust the posture according to the recommended human body posture;
  • step S15 when the photographing conditions are satisfied, a photographed image of the object is obtained.
  • the image capturing method may be performed by a terminal device, and the terminal device may be a user equipment (User Equipment, UE), a mobile device, a user terminal, a terminal, a cellular phone, a cordless phone, or a personal digital assistant (Personal Digital Assistant, PDA), handheld device, computing device, vehicle-mounted device, wearable device, etc., the method can be implemented by the processor calling the computer-readable instructions stored in the memory.
  • UE User Equipment
  • PDA Personal Digital Assistant
  • an application (Application, APP) implementing the image capturing method may be deployed in the terminal device, so that the user can directly use the application to capture images;
  • a shooting mode it is integrated into the shooting function of the terminal device, so that when the user uses the shooting function of the terminal device, the user can select the shooting mode that realizes the image shooting method to shoot the image, which is not made in this embodiment of the present disclosure. limit.
  • an image capturing device such as a camera mounted on the terminal device may be used, or an image capturing device (such as a camera) assembled on the terminal device or a
  • the image acquisition device connected to the device collects the scene image of the actual scene in real time and displays it on the shooting interface of the terminal device.
  • a scene image corresponding to the shooting interface is acquired, that is, a scene image acquired by an image acquisition device is acquired.
  • FIG. 2 shows a schematic diagram of a shooting interface according to an embodiment of the present disclosure.
  • a scene image can be displayed in the shooting interface, and a shooting control 22 for triggering shooting can be provided in the shooting interface, which is used to check the shooting interface.
  • the shooting interface shown in FIG. 2 is an implementation provided by the embodiment of the present disclosure. It should be understood that the present disclosure should not be limited to this, and those skilled in the art can set the functions in the shooting interface according to actual needs. control, which is not limited by this embodiment of the present disclosure.
  • the image capturing method may also be performed by a server, and the server may be an independent physical server, a server cluster or a distributed system composed of multiple physical servers, or a cloud-provided server.
  • the disclosed embodiments do not limit the type of server.
  • step S11 a scene image corresponding to the shooting interface is acquired, that is, the server receives the scene image sent by the terminal, wherein the scene image is collected by the terminal through the image capturing device image.
  • step S12 the recognition of the scene image can be triggered by triggering the trigger control 23 as shown in FIG. After the shooting mode of the image shooting method, the scene image recognition is automatically triggered, which is not limited in this embodiment of the present disclosure.
  • recognizing the scene image, determining the scene category corresponding to the scene image and the recommended human pose corresponding to the scene category may include: recognizing the scene image, and obtaining the scene image in the scene image. at least one of the scene feature and the object feature of the image; according to at least one of the identified scene feature and the object feature, determine the scene type corresponding to the scene image; according to the correspondence between the preset scene type and the recommended human posture , and determine the recommended human pose corresponding to the scene type.
  • the scene features can be used to indicate the scene type of the actual scene where the object is located.
  • the scene features of the grassland can include grass, cattle and sheep, etc.
  • the scene features of the indoor can include walls, windows, etc.
  • the identified scene features include Grassland and cattle and sheep, the scene type corresponding to the obtained scene image is grassland.
  • the object features may include at least one of the following: clothing features, physical features, gender features, and the like.
  • the clothing feature can be used to indicate the clothing of the object
  • the body feature can be used to indicate the body shape of the object (for example, tall and small)
  • the gender feature can be used to indicate the gender of the object.
  • the scene type can be determined in combination with the scene feature and the object feature, so as to obtain a more accurate recommended human pose.
  • the scene category can be "indoor-small-girl-wearing sweater, trousers” , handbag”, and then determine the corresponding recommended human pose according to this scene category.
  • the scene image can be recognized by a pre-trained scene recognition network to obtain scene features and/or object features in the scene image, and the embodiments of the present disclosure do not limit the network type, network result, and training method of the scene recognition network.
  • the identified scene features and object features can also be displayed in the shooting interface, for example, the scene features and object features can be marked by means of tags, so as to improve the interest of image shooting.
  • scene types may correspond to different recommended human postures, and the corresponding relationship between scene types and recommended human postures can be preset.
  • the recommended human pose corresponding to the scene category.
  • a pre-trained gesture recommendation network may also be used to identify the scene image, and determine the scene category corresponding to the scene image and the recommended human pose corresponding to the scene category.
  • the embodiments of the present disclosure do not limit the network type, network structure, training method, etc. of the gesture recommendation network.
  • a self-supervised training method can be used to train the pose recommendation network to learn the recommended human poses under different scene types, or, in other words, to learn the correspondence between the scene types and the recommended human poses, so as to use
  • the pose recommendation network directly obtains the recommended human pose based on the scene image. In this way, compared with the above-mentioned method of determining the recommended human posture according to the corresponding relationship between the preset scene type and the recommended human posture, more abundant scene types and recommended human postures can be obtained intelligently, and more accurate It recommends the recommended human pose corresponding to the scene image to the user.
  • the recommended human body posture may include the human body posture learned from the sample images by the above-mentioned posture recommendation network, for example, may also include the human body posture manually drawn by a user (eg, a technician), to which the embodiments of the present disclosure No restrictions apply.
  • the recommended human posture may be represented in the form of a human skeleton (eg, a skeleton obtained by connecting human joint points), or in the form of a human body outline, which is not limited to this embodiment of the present disclosure.
  • the sample image is a high-quality image with better shooting effect.
  • Learning the human body posture from the high-quality image and recommending it to the user can guide the object to approach the human body posture in the high-quality image, so as to obtain a better shooting effect and improve the user's shooting experience.
  • the recommended human posture may include a single-person posture (for example, a single-person comparison), and may also include a multi-person combined posture (for example, a two-person combination). It should be understood that the recommended human gesture corresponding to the scene category may include at least one single-person gesture and/or at least one multi-person combined gesture.
  • FIG. 3 shows a schematic diagram of a multi-person combined posture according to an embodiment of the present disclosure. As shown in FIG. 3 , the posture n and the posture m together form a posture of “comparing hearts”.
  • the scene types may, for example, at least include: landscape, indoor, outdoor, etc.; wherein, the scenery can be divided into beach, landscape, grassland, lake, etc., and the indoor can be divided into For office, conference room, home, etc., outdoor category can be divided into: street view category, playground category, park category, etc.
  • classification of scene types may not be limited to this.
  • those skilled in the art can preset various scene types according to actual needs, or can also self-learn scene types through the above gesture recommendation network. Examples are not limited.
  • any known human body key point detection method may be used, such as using a human body key point detection network to perform human key point detection on the scene image, which is not made in this embodiment of the present disclosure. limit.
  • the embodiments of the present disclosure do not limit the network type, network structure, and training method of the human body key point detection network.
  • the detection of human body key points on the scene image may include: extracting the key points of human body joint parts of objects in the scene image (for example, the human body key points of 20 joint parts), wherein the key points of the human body joint parts are extracted.
  • the number and position of the points may be determined according to actual requirements, which are not limited in this embodiment of the present disclosure.
  • the object in the scene image and the coordinate values of the key points of multiple human joint parts of the object in the scene image can be determined according to the detected key points of the human body joints; value, and connect the key points of multiple human joint parts according to the structure of the human body to obtain the human skeleton of the object, that is, to obtain the real human body posture of the object.
  • FIG. 4 shows a schematic diagram of a real human body posture according to an embodiment of the present disclosure. As shown in FIG. 4 , 20 key points of joint parts of the human body are connected to obtain the real human body posture of the object.
  • performing human body key point detection on the scene image may also include: extracting contour key points on the human body contour of the object in the scene image, obtaining the human body contour of the object, and then using the human body
  • the contour represents the real human pose of the object.
  • the manner in which the posture of the human body is represented may be determined according to actual needs, which is not limited by the embodiment of the present disclosure.
  • steps S12 and S13 may be performed simultaneously; or step S12 may be performed first, and then step S13 may be performed; or step S13 may be performed first, and then step S12 may be performed. Specifically, it may be set according to factors such as the processing capability of the terminal device, the resource occupancy of the terminal device, and the time delay limit in the application process, which is not limited in this embodiment of the present disclosure.
  • displaying the real human body posture and the recommended human body posture in the shooting interface may include: displaying the human body skeleton corresponding to the recommended human body posture and the human body skeleton corresponding to the real human body posture in the shooting interface ; or, displaying the human body contour corresponding to the recommended human body posture and the human body contour corresponding to the real human body posture in the shooting interface.
  • the form of human skeleton or human outline can be used in the shooting interface to display the real human body posture and the recommended human body posture. In this way, the user can be guided to adjust the posture according to the displayed recommended human body posture, so as to achieve a state with a high similarity to the recommended human body posture, thereby obtaining a better shooting effect.
  • FIG. 5 shows a schematic diagram of a shooting interface according to an embodiment of the present disclosure.
  • the real human body posture and the recommended human body posture of the object can be displayed based on the form of the human skeleton; in a possible implementation manner , the real human body posture and the recommended human body posture are displayed in different display forms; for example, different colors can be used to represent the real human body posture and the recommended human body posture of the object, for example, the recommended human body posture is represented by yellow, and the recommended human body posture is represented by green.
  • Real human pose In a possible implementation, as shown in FIG.
  • guide information can also be displayed in the shooting interface, such as displaying the similarity between the real human body posture and the recommended human posture: 30%, and displaying the information that guides the user to adjust the posture.
  • the prompt "Try another action? Try to be consistent with the yellow line”.
  • the real human body posture and the representation of the recommended human body posture shown in FIG. 5 are an implementation manner provided by the embodiments of the present disclosure, and the present disclosure should not be limited to this. It is required to set the display form of the real human body posture and the recommended human body posture, such as color, line thickness, line type, area transparency, etc., as well as setting the content of the guidance information, which is not limited by this embodiment of the present disclosure.
  • step S15 when the image capturing method is executed by the terminal device, the capturing conditions may include that the capturing control in the capturing interface is triggered. It can be understood that the photographer clicks the 2.
  • the shooting controls in the shooting interface shown in FIG. 4 and FIG. 5 trigger the shooting operation, obtain the shooting image of the object and save the local album for viewing.
  • the recommended human body pose corresponding to the scene category can be recommended to the user according to the scene category of the scene image, so that the recommended human body pose matches the scene image, and by displaying the real human body pose and The recommended human posture can guide the user to adjust the posture according to the displayed recommended human posture or facilitate the user to guide the subject to adjust the posture according to the displayed recommended human posture. Satisfied taking pictures.
  • the recommended human posture may include a single-person posture.
  • the real human posture and the recommended human posture are displayed on the shooting interface, including:
  • the real human pose and the single-person pose are displayed in the shooting interface; or,
  • the real human body pose is displayed in the shooting interface, and the single-person pose is displayed at the first designated area of the shooting interface.
  • the above-mentioned human body key point detection on the scene image may include: performing human body detection on the scene image to obtain the object in the scene image and the human body area of the object; and then performing human body key point detection on the object in the human body area. It should be understood that the human body detection can obtain information such as whether an object is included in the scene image, the number of objects included, and the position of the object in the scene image (ie, the position of the human body region in the scene image).
  • the recommended human posture is a single-person posture
  • the real human posture of the object can be displayed directly according to the position of the object;
  • the preset The selection rule is to select an object from multiple objects, and display the real human body posture of the selected object according to the position of the selected object.
  • the preset selection rule may include, for example, selecting an object in the middle of a scene image, or selecting an object closest to the terminal device, etc., which are not limited in this embodiment of the present disclosure.
  • the distance between the object and the terminal device may be obtained by using a technique known in the art, for example, a time of flight (TOF) method may be used, which is not limited in this embodiment of the present disclosure.
  • TOF time of flight
  • the real human pose can be represented and the human pose can be recommended based on the form of human skeleton or human outline.
  • the position of the object in the scene image may include: the position coordinates of the key points of the human body on the human skeleton of the object in the scene image or the human body contour; may also include the human body region of the object in the scene image The position of the object, wherein the human body area of the object can be obtained through the above-mentioned human body detection.
  • the position coordinates of the human body key points can be obtained, and then the real human body posture can be displayed in the shooting interface according to the position coordinates of the human body key points. In this way, the real human body posture in the photographing interface can be displayed following the position of the object.
  • the recommended human posture it can be set to follow the position of the object to display, that is, to display the recommended human posture according to the position of the object in the scene image; it can also be set to the first specified in the shooting interface.
  • the area displays the recommended human body posture, that is, the recommended human body posture is fixedly displayed in a preset area of the shooting interface.
  • displaying the single-person posture in the shooting interface may include: according to the position of the human body region of the object in the scene image, displaying on the human body region of the shooting interface Solo pose.
  • displaying the single-person posture in the shooting interface may include: according to the position of the human body region of the object in the scene image, displaying on the human body region of the shooting interface Solo pose.
  • the single-person posture is displayed in the shooting interface, and may further include: according to the position coordinates of any key point of the human body (such as the key point of the neck) on the real human body posture to display the single-person pose in the shooting interface.
  • This method can be understood as making any key point of the human body in the single-person posture coincide with the key point of the human body corresponding to the real human body posture.
  • the key points of the neck on the two postures can be set to overlap, and the single-person posture can be displayed on this basis. , so as to realize the position display of the single-person posture following the object. In this way, the single-person posture can be displayed more accurately following the position of the object, which is convenient for the user to compare the single-person posture and the real human body posture.
  • the first designated area in the shooting interface can be set according to actual needs, for example, it can be the middle area, the left area, the right area, the four vertex areas, etc. of the shooting interface.
  • the embodiments of the present disclosure are not limited.
  • the size of the range of the first designated area and the display size of the single-person posture in the first designated area may be set according to actual requirements, which are not limited in this embodiment of the present disclosure.
  • the object can be effectively guided to complete the shooting of the single-person posture, and the shooting experience can be improved, and the recommended human posture can be displayed fixedly and can be displayed with the object, which can meet different posture display requirements.
  • the recommended human body posture includes the combined posture of multiple people.
  • the method further includes:
  • the corresponding gestures of each object are respectively determined from the multiple-person combined gestures.
  • the corresponding postures of each object in the scene image in the multi-person combined posture can be effectively determined, so that it is convenient to guide each object to complete a plurality of combined postures.
  • Performing human body key point detection on the scene image may include: performing human body detection on the scene image to obtain the object in the scene image and the human body area of the object; and then performing human key point detection on the object in the human body area. It should be understood that the human body detection can obtain information such as whether an object is included in the scene image, the number of objects included, and the position of the object in the scene image (ie, the position of the human body region in the scene image).
  • the recommended human pose is a multi-person combined pose
  • the number of objects in the scene image is less than the number of objects required to realize the multi-person combined pose, for example, through voice or text, etc. Prompt the user that the number of people is not enough to achieve a multi-person combined posture, thereby guiding the increase of the number of objects to be photographed; or select a human posture that matches the number of objects in the scene image from the multi-person combined posture, and determine the corresponding posture of the object based on the selected human posture .
  • the human body posture that matches the number of objects in the scene image is selected from the combined postures of multiple people.
  • the human body posture that matches the number of objects can be randomly selected, or the human body posture that matches the number of objects can be selected according to a preset selection strategy.
  • the selection strategy may include, for example, selection in order from left to right and from top to bottom, which is not limited to this embodiment of the present disclosure.
  • the selected human body posture is one, the selected human body posture can be directly used as the corresponding posture of the object; if there are multiple selected human body postures, according to the relative positions between the multiple selected human body postures, Determine the corresponding pose of each object.
  • the relative positions of the multiple objects in the scene image and the relative positions of the various gestures in the multi-person combined gesture can be determined.
  • the corresponding poses of each object are determined from the combined poses of multiple people.
  • the relative position may include, for example, a front-rear position, a left-right position, an up-down position, and the like, which is not limited by the embodiment of the present disclosure.
  • FIG. 6 shows a schematic diagram of a scene image according to an embodiment of the present disclosure.
  • the scene image includes object A, object B, and object C
  • the current recommended human pose is, for example, the pose of a group of people shown in FIG. 3 (ie, the "comparison" pose).
  • object A and object B correspond to pose m and pose n respectively, and also That is, it is determined that the corresponding posture of object A is "pose m", and the corresponding posture of object B is "pose n".
  • the relative position between object A and object B is more suitable for completing the multi-person shown in FIG. Combining gestures.
  • step S14 a real human body is displayed in the shooting interface Posture and recommended human posture, including:
  • the real human body posture of the object and the corresponding posture of the object are displayed in the shooting interface; or, according to the position of the object in the scene image, the real human body of the object is displayed in the shooting interface.
  • gesture, and the corresponding gesture of the object is displayed in the second designated area of the shooting interface.
  • the real human body posture of the object is displayed in the shooting interface, and reference may be made to the content disclosed in the above embodiments of the present disclosure.
  • the corresponding posture of the object is displayed in the shooting interface, which can be the same as the method according to the object in the above-mentioned embodiment of the present disclosure.
  • the implementation manner of displaying the single-person posture in the shooting interface at the position in the scene image is the same, which is not limited by the embodiment of the present disclosure.
  • the corresponding posture of each object can be displayed following the position of each object, so as to facilitate guiding each object to compare the corresponding posture with the real human body posture for posture adjustment.
  • the second designated area in the shooting interface can be set according to actual needs, for example, it can be the middle area, left area, right area, four vertex areas, etc. of the shooting interface.
  • the embodiments of the present disclosure are not limited.
  • FIG. 7 shows a schematic diagram of a shooting interface according to an embodiment of the present disclosure. As shown in FIG. 7 , a combined pose of multiple people can be displayed at a second designated area, and according to the position of each object in the scene image, it can be displayed in the shooting interface. The real human pose of each object.
  • the size of the range of the second designated area and the display size of the corresponding posture in the second designated area may be set according to actual needs, which is not limited by the embodiment of the present disclosure.
  • multiple objects can be effectively guided to jointly complete a multi-person combined pose, improving the shooting experience, and the recommended human pose can be displayed in a fixed manner and can be displayed with the object, which can meet different pose display requirements.
  • the object adjusts the posture
  • there may be some human body postures that are close to the recommended human posture while the postures of some human body parts are not close to the recommended human posture, for example, the posture of the arms is close to the recommended human posture (also That is, the similarity is high), and the posture of the legs is not close to the recommended human posture (that is, the similarity is low).
  • the method further includes:
  • step S13 the real human body posture and the recommended human body posture are displayed in the shooting interface, including:
  • the region where the third human body key point is located is highlighted, and the highlighting method includes at least one of highlighting, bolding, and changing color.
  • the first human body key point of the real human body posture and the second human body key point of the recommended human body posture may correspond.
  • the first human body key point of the real human body posture One human body key point includes 20 human body joint key points
  • the second human body key point for the recommended human posture also includes 20 human body joint key points correspondingly.
  • key point pairs can be determined from the first human body key point of the real human body posture and the second human body key point of the recommended human body posture, for example, the left shoulder key point in the first human body key point, and the second human body key point.
  • the left shoulder key point in the points is a pair of key point pairs, 20 key points of human body joints, and 20 key point pairs can be determined.
  • the first human body key point of the real human body posture may carry a mark
  • the second human body key point of the recommended human body posture may also carry a mark.
  • the marks can be used to indicate key points on different joint parts or different contour positions of the human body, and through the same/similar marks on the two postures, it is easy to determine the first human key point of the real human body posture, which is the same as that of the human body. Keypoint pairs between the second human keypoints for the recommended human pose.
  • the above method of determining key point pairs through identification is an implementation method disclosed in the embodiments of the present disclosure.
  • those skilled in the art can use any known method to determine the first human body key point and the first human body key point.
  • the key point pair between two human body key points is not limited in this embodiment of the present disclosure.
  • the first similarity of a keypoint pair can be represented by the distance (such as Euclidean distance, cosine distance, etc.) between two keypoints in the keypoint pair, or by the deviation of the distance, This embodiment of the present disclosure is not limited.
  • the first similarity may be set to be negatively correlated with the distance or the deviation from the distance, that is, the smaller the distance or the smaller the deviation of the distance, the higher the first similarity. It should be understood that the number of key point pairs is consistent with the number of first degrees of similarity, that is, each key point pair corresponds to its respective first degree of similarity.
  • the deviation of the distance can be understood as the difference between the distance of any key point pair and the average value of the distances of all key point pairs.
  • the average value of the distances of 20 key point pairs is X
  • the key point The distance to a is x
  • the deviation of the distance of the key point to a is "x-X", that is, "x-X” can be used to represent the first similarity of the key point to a.
  • the display manner of the recommended human body posture following the object display or fixed display, choose to use the distance or the deviation of the distance to represent the first similarity.
  • the distance can be used to represent the first similarity
  • the deviation of the distance can be used to represent the first similarity
  • the position of the object in the scene image will change, and the distance may not be able to accurately characterize the first similarity of the key point pair. This is because The long distance of any key point pair does not mean that the key point pair is not similar, it may be that the overall distance between the real human body posture and the recommended human body posture is far away. first similarity.
  • the first preset threshold may be determined according to an actual requirement, a calculation method of the first similarity, and the like, which is not limited in this embodiment of the present disclosure.
  • the first similarity is less than the first preset threshold, which means that the similarity of the key point pair corresponding to the first similarity is low, that is, the similarity between the partial posture of the human body part represented by the key point pair and the recommended human posture
  • the first similarity degree is greater than or equal to the first preset threshold, it may mean that the similarity degree of the key point pair corresponding to the first similarity degree is high, that is, the partial posture of the human body part represented by the key point pair is the same as It is recommended that the human pose has a high similarity.
  • determining a third human body key point whose first similarity is less than a first preset threshold from the first human body key points by calculating the first similarity of the key point pair may include: by Calculate the first similarity of the key point pair, determine the target key point pair whose first similarity is less than the first preset threshold from the key point pair, and use the first human body key point in the target key point pair as the third human body key point , that is, the third human body key point whose first similarity is less than the first preset threshold is determined from the first human body key point.
  • highlighting the area where the third human body key point is located may be understood as highlighting the area occupied by the third human body key point in the shooting interface.
  • a solid circle is used to represent the joint parts of the human body, then the area occupied by the solid circle may be the area where the third human body key point is located.
  • the highlighting manner includes at least one of highlighting, bolding, and changing color, which are some implementation manners provided by the embodiments of the present disclosure. In fact, those skilled in the art can design different highlighting according to actual needs.
  • the manner of display is not limited to this embodiment of the present disclosure.
  • the color is changed, for example, the third human body key point in the real human body posture displayed in green is changed to red, which is not limited in this embodiment of the present disclosure.
  • FIG. 8 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.
  • the key points 80 of the knee joint parts with lower similarity may be represented in a bold manner.
  • the connection lines 81 and 82 corresponding to the key points 80 of the joint parts are bolded, which is not limited by the embodiment of the present disclosure.
  • the parts of the human body with low similarity among the displayed real human body postures can be highlighted, so as to more effectively guide the user to adjust the posture, so as to improve the shooting experience.
  • the similarity between the real human body posture and the recommended human body posture can be displayed in the shooting interface to guide the subject to adjust the posture.
  • the method further includes:
  • the second similarity is displayed in the shooting interface.
  • the first similarity of the keypoint pair can be characterized by the distance of the keypoint pair or the deviation of the distance.
  • determining the second similarity between the real human body posture and the recommended human posture according to the first similarity of the key point pairs may include: according to the accumulated value of the distances of all key point pairs, the average value, variance, or standard deviation to determine the second similarity, which is not limited in this embodiment of the present disclosure.
  • the accumulated value, average value, variance or standard deviation of the distance may be negatively correlated with the second similarity, that is, the smaller the accumulated value, mean, variance or standard deviation of the distance, etc., the higher the second similarity is. On the contrary, the larger the cumulative value, mean, variance or standard deviation of the distance, the lower the second similarity. It should be understood that the second similarity may reflect the overall similarity between the real human body posture and the recommended human body posture.
  • the second similarity in order for the object to understand the level of the second similarity, or to understand the level of the similarity, the second similarity may be displayed in the shooting interface in the form of a percentage. Based on this, the cumulative value, average value, variance or standard deviation of the distance can be set, and there is a mapping relationship with the second similarity, so that according to the mapping relationship, the cumulative value, average value, variance or standard deviation of the distance can be mapped to The second similarity in percentage form.
  • mapping relationship between the accumulated value, average value, variance or standard deviation of the distance and the second similarity may be used, which is not limited in this embodiment of the present disclosure.
  • the second degree of similarity is displayed in the photographing interface, for example, the second degree of similarity may be displayed in the manner shown in FIG. 5 and FIG. 8 . It should be understood that a person skilled in the art can design a display manner of the second similarity according to actual requirements, which is not limited by this embodiment of the present disclosure.
  • the second similarity between the real human body posture and the recommended human body posture can also be obtained through a neural network, that is, the first human body key point corresponding to the real human body posture and the recommended human body posture can be obtained.
  • the second human body key point is input into the neural network, and the second similarity is output.
  • the first human body key point and the second human body key point can be input into the neural network in the form of a vector or a matrix.
  • the embodiments of the present disclosure do not limit the network type, network structure, and training method of the neural network.
  • the overall similarity between the real human body posture and the recommended human posture can be displayed, so as to effectively guide the user to adjust the posture according to the overall similarity, and improve the shooting experience.
  • the shooting conditions may include that the second similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold. In this way, the shooting operation can be automatically triggered when the real human body posture is highly similar to the recommended human body posture, and the shooting image can be obtained and saved in a local album for the object to view.
  • the server may determine that the second similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold.
  • a shooting instruction is sent to the terminal device, and the terminal device is triggered to perform a shooting operation through the shooting instruction to obtain a shot image.
  • the terminal device may automatically trigger a capturing operation when the second similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold , to get the captured image.
  • the second preset threshold may be set according to actual requirements, for example, may be set to 80%, which is not limited in this embodiment of the present disclosure. It should be understood that the shooting conditions may also include that the shooting controls in the shooting interface are triggered, that is, the photographer manually performs the shooting operation.
  • the method further includes:
  • the area recommendation network is a neural network obtained by training the first sample set of marking the photographing area.
  • the method further includes at least one of the following:
  • an identifier is used to indicate the recommended shooting area
  • the recommended human posture is displayed in the recommended photographing area of the photographing interface, and the recommended photographing area is used to guide the object to adjust the position according to the recommended photographing area.
  • the region recommendation network may adopt a known neural network, such as a convolutional neural network, etc.
  • the embodiment of the present disclosure does not limit the network type, network structure, and training method of the region recommendation network.
  • the sample images of the first sample set may be images with better visual effects.
  • the object can be guided to adjust the position to improve the visual effect of the photographed image, thereby improving the user's photographing experience.
  • the region where the human body is located in the sample images of the first sample set may be the marked photographing region. It should be understood that, a known labeling technology may be used to realize labeling of the human body region in the sample image of the first sample set, which is not limited in this embodiment of the present disclosure.
  • the identifier used to indicate the recommended photographing area may be displayed in the photographing interface in any form such as characters and/or graphics, so as to guide the user object to adjust the position according to the recommended photographing area.
  • the embodiments of the present disclosure are not limited.
  • FIG. 9 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure. As shown in FIG. 9 , a recommended photographing area may be indicated in the form of graphics, prompt words "recommended station", and the like.
  • the recommended human posture may be displayed in a fixed area (the first designated area or the second designated area) of the photographing interface.
  • the recommended human body posture may be displayed in the recommended photographing area of the photographing interface, in other words, the recommended photographing area may be indicated by the recommended human body posture.
  • the above-mentioned first designated area or second designated area may include a recommended photographing area. In this way, the object can be effectively guided to adjust the position according to the recommended photographing area, and it is also convenient for the object to adjust the posture according to the recommended human body posture.
  • the recommended photographing area may be indicated by both the identifier and the recommended human posture, or the recommended photographing area may be indicated by only the identifier or the recommended human posture, which is not limited by the embodiment of the present disclosure.
  • determining whether the object is in the recommended photographing area may be determined according to the position of the object's human body area in the scene image, or may be determined according to the position coordinates of the key points of the human body, which is not limited in this embodiment of the present disclosure.
  • a photographing area with better visual effect can be recommended to the object, and the object can be effectively guided to adjust the position to improve the visual effect of the photographed image, thereby improving the user's photographing experience.
  • the scene image can be recognized by the gesture recommendation network, and the scene category corresponding to the scene image and the recommended human pose corresponding to the scene category can be determined.
  • the scene image is identified, and the scene category corresponding to the scene image and the recommended human posture corresponding to the scene category are determined, including:
  • the scene image is recognized by the gesture recommendation network, and the scene category corresponding to the scene image and the recommended human pose corresponding to the scene category are determined.
  • the gesture recommendation network is obtained by training the second sample set marked with the sample scene category and the sample human pose.
  • the neural network of the second sample set includes sample images uploaded by the object.
  • the pose recommendation network may adopt a known neural network, for example, a convolutional neural network, a residual neural network, and the like.
  • a convolutional neural network for example, a convolutional neural network, a residual neural network, and the like.
  • the embodiments of the present disclosure do not limit the network type, network structure, training method, etc. of the gesture recommendation network.
  • the self-supervised training method can be used to make the posture recommendation network learn the recommended human posture under different scene types, or, in other words, learn the correspondence between the scene type and the recommended human posture, so that the posture recommendation network can be used based on Scene image output recommends human pose.
  • first sample set and the second sample set may be the same sample set or different sample sets, and the source of the sample images in the first sample set and the second sample set is not limited in the embodiment of the present disclosure. .
  • a known labeling technology can be used to realize labeling of the sample scene category and the sample human body posture in the sample images of the second sample set, which is not limited by the embodiment of the present disclosure.
  • an application program for realizing the image capturing method or a capturing mode for realizing the image capturing method may also provide an upload function for uploading a sample image, and the sample image may be an image that has been captured by the user with a relatively low effect. good image.
  • the second sample set used for training the pose recommendation network can be enriched, so that the trained pose recommendation network can output the recommended human pose based on the scene image more accurately.
  • the sample image uploaded by the user is the image that the user authorizes to enter the background training posture recommendation network, so the sample image uploaded by the user is only visible to the background, and what other users see are the processed recommended human posture and the corresponding posture. image (for example, the human face is hidden from the gesture image, or the human face is replaced or the character is replaced, etc.), thereby ensuring the privacy and security of the user, wherein the gesture image can be used to select different recommended human gestures.
  • the image uploading function may be implemented in an application program and/or a shooting mode by using a known technology in the art, which is not limited by the embodiment of the present disclosure.
  • the training process of the gesture recommendation network may be performed on the server.
  • the sample images can be uploaded to the server to expand the second sample set, and the posture recommendation network can be periodically retrained based on the second sample set to obtain a new version of the posture recommendation network, and then update the deployed posture recommendation network in the terminal device.
  • the pose recommendation network In this way, the posture recommendation network can learn rich human body postures, so that the recommended human postures output by the trained posture recommendation network are more comprehensive and richer.
  • the retraining process of the posture recommendation network deployed in the terminal device can also be performed on the terminal device, that is, the sample image uploaded by the object can be directly used to retrain the deployed gesture recommendation network on the terminal device.
  • the pose recommendation network is incrementally trained. In this way, the recommended human posture output by the deployed posture recommendation network can be closer to the object's preference and targeted.
  • the second sample set may be high-quality images with better shooting effects.
  • the posture recommendation network can learn the human body posture from the high-quality images, thereby effectively outputting high-quality recommended human postures based on scene images. , to guide the subject to approach the high-quality recommended human posture, so as to obtain a better shooting effect and improve the user's shooting experience.
  • the method further includes:
  • a recommended human body posture corresponding to the selected posture map is determined and displayed.
  • FIG. 10 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.
  • the photographing interface can display a plurality of posture diagrams, and can also display the recognized scene features “window frame, wall” and objects Features "Sweater, Pants, Hand Raise” for more fun.
  • the identified scene features and object features can be obtained by referring to the method of identifying the scene image in the above-mentioned embodiments of the present disclosure.
  • a sliding control for viewing multiple posture diagrams may also be provided in the photographing interface, so as to view other posture graphs that are not currently displayed in the photographing interface through a sliding operation.
  • the displayed posture map may be a thumbnail, which can be understood as a preview map, a schematic diagram, etc. corresponding to the recommended human body posture. It should be understood that the posture map corresponds to the recommended human body posture, and according to any selected posture map, the corresponding recommended human body posture can be obtained.
  • the selection operation for the posture map may include, for example, operations such as swiping to view the posture map, clicking to select any posture map, and the like. Among them, the selected posture map can be highlighted to achieve friendly human-computer interaction.
  • the displayed posture map can be a thumbnail image.
  • the zoomed-in corresponding to the posture map can be displayed in the shooting interface. map, so that the object can view the pose map more clearly.
  • FIG. 11a and 11b illustrate schematic diagrams of a photographing interface according to an embodiment of the present disclosure.
  • the enlarged image corresponding to the selected posture image can be displayed in the middle area, wherein, in response to operations such as long-pressing the enlarged image, double-clicking the enlarged image, etc., the image can be previewed in the enlarged image as shown in Fig. 11b.
  • the recommended human pose corresponding to the pose map can be displayed in the middle area, wherein, in response to operations such as long-pressing the enlarged image, double-clicking the enlarged image, etc.
  • the recommended human pose corresponding to the pose map.
  • the selected posture diagram and the corresponding recommended human posture can be determined by clicking the "shoot the same style" button shown in Figure 11a and Figure 11b, and shown in Figure 5, Figure 7,
  • the recommended human posture corresponding to the selected posture map is displayed in the photographing interface shown in FIG. 8 .
  • the manner of displaying the recommended human body posture reference may be made to the above-mentioned embodiments of the present disclosure.
  • the manner of displaying the attitude diagram and selecting the attitude diagram in FIG. 10 , FIG. 11 a , and FIG. 11 b above is an implementation method provided by the embodiment of the present disclosure.
  • those skilled in the art can design the The display manner of the attitude map in the shooting interface, the selection method for the attitude map, etc., are not limited in this embodiment of the present disclosure.
  • the method further includes:
  • the captured image is processed to obtain a processed captured image, wherein the processing includes at least one of the following: performing beautification processing on an object in the captured image, and adding a filter to the captured image;
  • the adding a filter to the captured image includes: adjusting at least one of saturation, color temperature, and brightness of the captured image according to a recommended filter corresponding to the recommended human posture.
  • any known image beautifying technology can be used to realize the beautifying processing of the object in the captured image, which is not limited by the embodiment of the present disclosure.
  • the process of beauty processing may include: determining the position of the face in the captured image, then determining the position of the facial defect, and performing processing such as filling, repairing or filtering according to the location of the facial defect.
  • the recommended human pose can be learned from the sample image.
  • the recommended filter can be obtained by analyzing the filter of the sample image. Based on this, the recommended human pose can correspond to the recommended filter. .
  • the recommended filter may include recommended filter parameters such as saturation, color temperature, and brightness.
  • adjusting at least one of the saturation, color temperature, and brightness of the captured image may include: adjusting at least one of the saturation, color temperature, and brightness of the captured image to match the recommended filter The filter parameters are the same.
  • the visual effect of the captured image can be improved, which is beneficial to obtain the captured image that satisfies the user, and improves the image capturing experience.
  • the image capturing method of the embodiment of the present disclosure can be applied to artificial intelligence education platforms, social sharing platforms, video image capturing software, etc., and can intelligently recommend recommended human bodies corresponding to the scene images according to the scene images.
  • the posture is used for self-portrait and camera composition guidance for users, which improves the fun of image shooting, improves the shooting experience, and is conducive to obtaining a shooting image that satisfies the user.
  • intelligent hardware connected to the artificial intelligence education platform can be used, such as Jetson Nano (a microcomputer developed by NVIDIA), Raspberry Pi (a microcomputer) ) etc. to implement the above image capturing method, through this method, based on the form of visualization, it is easier for the user to inspire learning interest.
  • Jetson Nano a microcomputer developed by NVIDIA
  • Raspberry Pi a microcomputer
  • FIG. 12 shows a schematic diagram of an application of an image capturing method according to an embodiment of the present disclosure.
  • the artificial intelligence education platform is connected to the intelligent hardware, wherein the artificial intelligence education platform is used to edit the project code to realize the image capturing method, and send the project code to the intelligent hardware, and the intelligent hardware is used to collect scene images And run the project code, get the running results, and send the running results to the artificial intelligence education platform to display the running results in the display interface of the artificial intelligence education platform.
  • the scene image, the captured image and the operation result of the image capturing method can be displayed on the display interface of the web page of the artificial intelligence education platform.
  • the running results may include recommended human body postures, real human body postures, captured images after adding filters to the captured images, captured images after beautifying the objects in the captured images, and the like.
  • the artificial intelligence education platform can support students to edit and implement the project code of the above-mentioned image shooting method on the platform, such as training code such as posture recommendation network, and connect with intelligent hardware, which will be used to realize the above-mentioned image shooting.
  • the project code of the method is sent to the intelligent hardware for execution, so that the scene image can be collected in real time by the intelligent hardware, and the above-mentioned image capturing method can be realized based on the collected scene image.
  • students can also edit the project code on the artificial intelligence education platform to update and optimize the above-mentioned image shooting methods, such as optimizing each neural network.
  • the present disclosure also provides image capturing devices, electronic devices, computer-readable storage media, and programs, all of which can be used to implement any image capturing method provided by the present disclosure.
  • image capturing devices electronic devices, computer-readable storage media, and programs, all of which can be used to implement any image capturing method provided by the present disclosure.
  • FIG. 13 shows a block diagram of an image capturing apparatus according to an embodiment of the present disclosure. As shown in FIG. 13 , the apparatus includes:
  • the acquiring part 101 is configured to acquire the scene image corresponding to the shooting interface
  • the identification part 102 is configured to identify the scene image, and determine the scene category corresponding to the scene image and the recommended human posture corresponding to the scene category;
  • the detection part 103 is configured to perform human body key point detection on the scene image, and determine the object in the scene image and the real human body posture of the object;
  • the display part 104 is configured to guide the object to perform posture adjustment according to the recommended human body posture by displaying the real human body posture and the recommended human body posture in the shooting interface;
  • the photographing part 105 is configured to obtain a photographed image of the object when photographing conditions are satisfied.
  • the recommended human posture includes a single-person posture
  • the display part 104 includes: a first display sub-part, configured to, according to the position of the object in the scene image, display The real human body posture and the single-person posture are displayed in the shooting interface; or, the second display sub-section is configured to display all the objects in the shooting interface according to the position of the object in the scene image.
  • the real human body posture is displayed, and the single-person posture is displayed in the first designated area of the shooting interface.
  • the recommended human body posture includes a multi-person combined posture
  • the apparatus further includes: a position determination part configured to The relative position between each gesture in the multi-person combined gesture, and the corresponding gesture of each object is respectively determined from the multi-person combined gesture
  • the display part 104 includes: a third display sub-section, which is configured for For any object, according to the position of the object in the scene image, the real human body posture of the object and the corresponding posture of the object are displayed in the shooting interface; or, the fourth display subsection is configured as According to the position of the object in the scene image, the real human body posture of the object is displayed in the shooting interface, and the corresponding posture of the object is displayed in the second designated area of the shooting interface.
  • the apparatus further includes: a first determining part configured to determine the distance between the first human body key point of the real human body posture and the second human body key point of the recommended human body posture The key point pair; the second determination part is configured to determine from the first human body key points that the first similarity is less than the first preset threshold by calculating the first similarity of the key point pair.
  • the apparatus further includes: a third determination part, configured to determine the the second similarity between the real human body posture and the recommended human body posture, and the key point pair is determined according to the first human body key point of the real human body posture and the second human body key point of the recommended human body posture;
  • the similarity display part is configured to display the second similarity in the shooting interface.
  • the apparatus further includes: an area determination part, configured to input the scene image into an area recommendation network to obtain a recommended photographing area of the scene image, where the recommended photographing area is used for Representing the area recommended to be photographed in the scene image, and the area recommendation network is a neural network trained by marking the first sample set of the photographing area; wherein, the device further includes: an area display part, configured to In the shooting interface, an identifier is used to indicate the recommended photographing area; or, the recommended human posture is displayed in the recommended photographing area of the photographing interface; the recommended photographing area is used to guide the object according to the recommended photographing area. Adjust the position of the recommended photo area as described above.
  • the area display part is further configured to use an identifier to indicate the recommended photographing area in the photographing interface; and display the recommended photographing area on the photographing interface.
  • the recommended human posture is described.
  • the identifying part 102 is further configured to identify the scene image through a gesture recommendation network, and determine a scene category corresponding to the scene image and a recommended human body corresponding to the scene category pose, wherein the pose recommendation network is a neural network obtained by training a second sample set marked with sample scene categories and sample human poses, and the second sample set includes sample images uploaded by objects.
  • the pose recommendation network is a neural network obtained by training a second sample set marked with sample scene categories and sample human poses, and the second sample set includes sample images uploaded by objects.
  • the recommended human body postures include a plurality of postures
  • the apparatus further includes: a posture map display part configured to display the posture maps corresponding to the plurality of recommended human body postures in the shooting interface; selecting The part is configured to, in response to a selection operation on the posture map, determine and display a recommended human body posture corresponding to the selected posture map according to the selected posture map.
  • the shooting conditions include: a second degree of similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold; or, in the shooting interface Shooting controls are triggered.
  • the apparatus further includes: a processing part configured to process the captured image to obtain a processed captured image, wherein the processing includes at least one of the following: Performing beautification processing on the object in the captured image, and adding a filter to the captured image; the adding a filter to the captured image includes: adjusting the captured image according to the recommended filter corresponding to the recommended human posture at least one of saturation, color temperature, and brightness.
  • the device is applied to an artificial intelligence education platform, and the artificial intelligence education platform is connected to intelligent hardware; wherein, the artificial intelligence education platform is used to edit and implement the project code of the image capturing device , and send the project code to the intelligent hardware; the intelligent hardware is used to collect scene images and run the project code to obtain the operation result, and send the operation result to the artificial intelligence education platform to The operation result is displayed on the display interface of the artificial intelligence education platform.
  • the recommended human body pose corresponding to the scene category can be recommended to the user according to the scene category of the scene image, so that the recommended human body pose matches the scene image, and by displaying the real human body pose and The recommended human posture can guide the user to adjust the posture according to the displayed recommended human posture or facilitate the user to guide the subject to adjust the posture according to the displayed recommended human posture. Satisfied taking pictures.
  • the functions or included parts of the apparatus provided in the embodiments of the present disclosure may be configured to execute the methods described in the above method embodiments, and the specific implementation may refer to the above method embodiments. For brevity, I won't go into details here.
  • Embodiments of the present disclosure further provide a computer-readable storage medium, on which computer program instructions are stored, and when the computer program instructions are executed by a processor, the foregoing method is implemented.
  • Computer-readable storage media can be volatile or non-volatile computer-readable storage media.
  • An embodiment of the present disclosure further provides an electronic device, including: a processor; a memory configured to store instructions executable by the processor; wherein the processor is configured to invoke the instructions stored in the memory to execute the above method.
  • Embodiments of the present disclosure also provide a computer program product, including computer-readable codes.
  • a processor in the device executes the image capturing method for implementing the image capturing method provided by any of the above embodiments. instruction.
  • Embodiments of the present disclosure further provide another computer program product for storing computer-readable instructions, which, when executed, cause the computer to perform the operations of the image capturing method provided by any of the foregoing embodiments.
  • the electronic device in the embodiment of the present disclosure may be provided as a terminal device or a server.
  • FIG. 14 shows a block diagram of an electronic device 800 according to an embodiment of the present disclosure.
  • electronic device 800 may be a mobile phone, computer, digital broadcast terminal, messaging device, game console, tablet device, medical device, fitness device, personal digital assistant, etc. terminal.
  • an electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power supply component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, a sensor component 814 , and the communication component 816 .
  • the processing component 802 generally controls the overall operation of the electronic device 800, such as operations associated with display, phone calls, data communications, camera operations, and recording operations.
  • the processing component 802 can include one or more processors 820 to execute instructions to perform all or some of the steps of the methods described above.
  • processing component 802 may include one or more modules that facilitate interaction between processing component 802 and other components.
  • processing component 802 may include a multimedia module to facilitate interaction between multimedia component 808 and processing component 802.
  • Memory 804 is configured to store various types of data to support operation at electronic device 800 . Examples of such data include instructions for any application or method operating on electronic device 800, contact data, phonebook data, messages, pictures, videos, and the like. Memory 804 may be implemented by any type of volatile or nonvolatile storage device or combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic or Optical Disk.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read only memory
  • EPROM erasable Programmable Read Only Memory
  • PROM Programmable Read Only Memory
  • ROM Read Only Memory
  • Magnetic Memory Flash Memory
  • Magnetic or Optical Disk Magnetic Disk
  • Power supply assembly 806 provides power to various components of electronic device 800 .
  • Power supply components 806 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power to electronic device 800 .
  • Multimedia component 808 includes a screen that provides an output interface between the electronic device 800 and the user.
  • the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from a user.
  • the touch panel includes one or more touch sensors to sense touch, swipe, and gestures on the touch panel. The touch sensor may not only sense the boundaries of a touch or swipe action, but also detect the duration and pressure associated with the touch or swipe action.
  • the multimedia component 808 includes a front-facing camera and/or a rear-facing camera. When the electronic device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera may receive external multimedia data. Each of the front and rear cameras can be a fixed optical lens system or have focal length and optical zoom capability.
  • Audio component 810 is configured to output and/or input audio signals.
  • audio component 810 includes a microphone (MIC) that is configured to receive external audio signals when electronic device 800 is in operating modes, such as calling mode, recording mode, and voice recognition mode.
  • the received audio signal may be further stored in memory 804 or transmitted via communication component 816 .
  • audio component 810 also includes a speaker for outputting audio signals.
  • the I/O interface 812 provides an interface between the processing component 802 and a peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to: home button, volume buttons, start button, and lock button.
  • Sensor assembly 814 includes one or more sensors for providing status assessment of various aspects of electronic device 800 .
  • the sensor assembly 814 can detect the on/off state of the electronic device 800, the relative positioning of the components, such as the display and the keypad of the electronic device 800, the sensor assembly 814 can also detect the electronic device 800 or one of the electronic device 800 Changes in the position of components, presence or absence of user contact with the electronic device 800 , orientation or acceleration/deceleration of the electronic device 800 and changes in the temperature of the electronic device 800 .
  • Sensor assembly 814 may include a proximity sensor configured to detect the presence of nearby objects in the absence of any physical contact.
  • Sensor assembly 814 may also include a light sensor, such as a complementary metal oxide semiconductor (CMOS) or charge coupled device (CCD) image sensor, for use in imaging applications.
  • CMOS complementary metal oxide semiconductor
  • CCD charge coupled device
  • the sensor assembly 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
  • Communication component 816 is configured to facilitate wired or wireless communication between electronic device 800 and other devices.
  • the electronic device 800 may access a wireless network based on a communication standard, such as wireless network (WiFi), second generation mobile communication technology (2G) or third generation mobile communication technology (3G), or a combination thereof.
  • the communication component 816 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel.
  • the communication component 816 also includes a near field communication (NFC) module to facilitate short-range communication.
  • the NFC module may be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
  • RFID radio frequency identification
  • IrDA infrared data association
  • UWB ultra-wideband
  • Bluetooth Bluetooth
  • electronic device 800 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A programmed gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation is used to perform the above method.
  • ASICs application specific integrated circuits
  • DSPs digital signal processors
  • DSPDs digital signal processing devices
  • PLDs programmable logic devices
  • FPGA field programmable A programmed gate array
  • controller microcontroller, microprocessor or other electronic component implementation is used to perform the above method.
  • a non-volatile computer-readable storage medium such as a memory 804 comprising computer program instructions executable by the processor 820 of the electronic device 800 to perform the above method is also provided.
  • the present disclosure may be a system, method and/or computer program product.
  • the computer program product may include a computer-readable storage medium having computer-readable program instructions loaded thereon for causing a processor to implement various aspects of the present disclosure.
  • a computer-readable storage medium may be a tangible device that can hold and store instructions for use by the instruction execution device.
  • the computer-readable storage medium may be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
  • Non-exhaustive list of computer readable storage media include: portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM) or flash memory), static random access memory (SRAM), portable compact disk read only memory (CD-ROM), digital versatile disk (DVD), memory sticks, floppy disks, mechanically coded devices, such as printers with instructions stored thereon Hole cards or raised structures in grooves, and any suitable combination of the above.
  • RAM random access memory
  • ROM read only memory
  • EPROM erasable programmable read only memory
  • flash memory static random access memory
  • SRAM static random access memory
  • CD-ROM compact disk read only memory
  • DVD digital versatile disk
  • memory sticks floppy disks
  • mechanically coded devices such as printers with instructions stored thereon Hole cards or raised structures in grooves, and any suitable combination of the above.
  • Computer-readable storage media are not to be construed as transient signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (eg, light pulses through fiber optic cables), or through electrical wires transmitted electrical signals.
  • the computer readable program instructions described herein may be downloaded to various computing/processing devices from a computer readable storage medium, or to an external computer or external storage device over a network such as the Internet, a local area network, a wide area network, and/or a wireless network.
  • the network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers.
  • a network adapter card or network interface in each computing/processing device receives computer-readable program instructions from a network and forwards the computer-readable program instructions for storage in a computer-readable storage medium in each computing/processing device .
  • Computer program instructions for carrying out operations of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state setting data, or instructions in one or more programming languages.
  • Source or object code written in any combination, including object-oriented programming languages, such as Smalltalk, C++, etc., and conventional procedural programming languages, such as the "C" language or similar programming languages.
  • the computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server implement.
  • the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider through the Internet connect).
  • LAN local area network
  • WAN wide area network
  • custom electronic circuits such as programmable logic circuits, field programmable gate arrays (FPGAs), or programmable logic arrays (PLAs) can be personalized by utilizing state information of computer readable program instructions.
  • Computer readable program instructions are executed to implement various aspects of the present disclosure.
  • These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer or other programmable data processing apparatus to produce a machine that causes the instructions when executed by the processor of the computer or other programmable data processing apparatus , resulting in means for implementing the functions/acts specified in one or more blocks of the flowchart and/or block diagrams.
  • These computer readable program instructions can also be stored in a computer readable storage medium, these instructions cause a computer, programmable data processing apparatus and/or other equipment to operate in a specific manner, so that the computer readable medium storing the instructions includes An article of manufacture comprising instructions for implementing various aspects of the functions/acts specified in one or more blocks of the flowchart and/or block diagrams.
  • Computer readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other equipment to cause a series of operational steps to be performed on the computer, other programmable data processing apparatus, or other equipment to produce a computer-implemented process , thereby causing instructions executing on a computer, other programmable data processing apparatus, or other device to implement the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more functions for implementing the specified logical function(s) executable instructions.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in dedicated hardware-based systems that perform the specified functions or actions , or can be implemented in a combination of dedicated hardware and computer instructions.
  • the computer program product can be specifically implemented by hardware, software or a combination thereof.
  • the computer program product is embodied as a computer storage medium, and in another optional embodiment, the computer program product is embodied as a software product, such as a software development kit (Software Development Kit, SDK), etc. Wait.
  • a software development kit Software Development Kit, SDK
  • the recommended human body pose corresponding to the scene category can be recommended to the user according to the scene category of the scene image, so that the recommended human body pose matches the scene image, and by displaying the real human body pose and The recommended human posture can guide the user to adjust the posture according to the displayed recommended human posture or facilitate the user to guide the subject to adjust the posture according to the displayed recommended human posture. Satisfied taking pictures. And, by displaying the single-person posture on the shooting interface, the subject can be effectively guided to complete the shooting of the single-person posture, improving the shooting experience, and it is recommended that the human body posture can be displayed fixedly and can be displayed with the object, which can meet different posture display needs.
  • the human body pose can be displayed fixedly and can be displayed with the object, which can meet different pose display needs.
  • the parts of the human body with low similarity among the displayed real human body postures can be highlighted, so as to more effectively guide the user to adjust the posture, so as to improve the shooting experience.
  • the overall similarity between the real human body posture and the recommended human posture can be displayed, so as to effectively guide the user to adjust the posture according to the overall similarity, and improve the shooting experience.
  • the pose recommendation network can be used to learn human poses from high-quality images, so as to effectively output high-quality recommended human poses based on scene images. And it is convenient for the user to select different recommended human body postures, to satisfy different posture preferences, to improve the shooting experience, and to be beneficial to obtain a shooting image that satisfies the user. And, applying the methods in the embodiments of the present disclosure in an artificial intelligence education platform enables users to learn through interesting cases, and learn artificial intelligence algorithms such as human body gesture recognition and human body key point detection.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • User Interface Of Digital Computer (AREA)
  • Processing Or Creating Images (AREA)

Abstract

The present disclosure relates to an image photographing method and apparatus, an electronic device, and a computer readable storage medium. The method comprises: obtaining a scene image corresponding to a photographing interface; recognizing the scene image, and determining a scene category corresponding to the scene image and a recommended human body posture corresponding to the scene category; performing human body key point detection on the scene image, and determining an object in the scene image and a real human body posture of the object; displaying the real human body posture and the recommended human body posture in the photographing interface to guide the object to perform posture adjustment according to the recommended human body posture; and when a photographing condition is satisfied, obtaining a photographing image of the object. According to the embodiments of the present disclosure, the image photographing effect can be effectively improved, such that a photographing image that a user is satisfied with can be obtained.

Description

图像拍摄方法及装置、电子设备和计算机可读存储介质Image capturing method and apparatus, electronic device and computer-readable storage medium

相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS

本公开基于申请号为202110467998.6、申请日为2021年04月28日、申请名称为“图像拍摄方法及装置、电子设备和存储介质”的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本公开作为参考。The present disclosure is based on the Chinese patent application with the application number of 202110467998.6, the application date of April 28, 2021, and the application title of "image capturing method and device, electronic equipment and storage medium", and claims the priority of the Chinese patent application, The entire contents of this Chinese patent application are hereby incorporated by reference into the present disclosure.

技术领域technical field

本公开涉及计算机技术领域,尤其涉及一种图像拍摄方法及装置、电子设备和计算机可读存储介质。The present disclosure relates to the field of computer technologies, and in particular, to an image capturing method and apparatus, an electronic device, and a computer-readable storage medium.

背景技术Background technique

随着摄像技术的发展,用户可利用手机、平板电脑等移动终端上的拍摄功能进行拍照。当用户缺乏拍照技巧,或用户在镜头前会肢体不够自然且拍照姿势单一,拍摄得到的图像可能无法令用户满意,使得拍摄体验下降。With the development of camera technology, users can take pictures by using the shooting function on mobile terminals such as mobile phones and tablet computers. When the user lacks photographing skills, or the user's limbs are not natural enough in front of the camera and the photographing posture is single, the photographed images may not satisfy the user, which reduces the photographing experience.

发明内容SUMMARY OF THE INVENTION

本公开实施例提出了一种图像拍摄技术方案。The embodiments of the present disclosure provide an image capturing technical solution.

根据本公开的一方面,提供了一种图像拍摄方法,包括:获取与拍摄界面对应的场景图像;对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态;对所述场景图像进行人体关键点检测,确定所述场景图像中的对象以及所述对象的真实人体姿态;通过在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,引导所述对象根据所述推荐人体姿态进行姿态调整;在满足拍摄条件的情况下,得到所述对象的拍摄图像。通过该方式,有效提高图像拍摄效果,提升用户的拍摄体验,有利于得到令用户满足的拍摄图像。According to an aspect of the present disclosure, an image capturing method is provided, including: acquiring a scene image corresponding to a shooting interface; recognizing the scene image, determining a scene category corresponding to the scene image and a scene category corresponding to the scene category the recommended human body posture; perform human key point detection on the scene image to determine the object in the scene image and the real human body posture of the object; by displaying the real human body posture and the recommended body posture in the shooting interface Human body posture, guide the object to perform posture adjustment according to the recommended human body posture; and obtain a photographed image of the object when the photographing conditions are met. In this way, the image shooting effect is effectively improved, the shooting experience of the user is improved, and the shooting image that satisfies the user can be obtained.

在一种可能的实现方式中,所述推荐人体姿态包括单人姿态,所述在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,包括:根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态以及所述单人姿态;或,根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态,并在所述拍摄界面的第一指定区域处显示所述单人姿态。通过该方式,能够有效引导对象完成单人姿态的拍摄,提升拍摄体验,并且推荐人体姿态可固定显示、可跟随对象显示,能够满足不同的姿态显示需求。In a possible implementation manner, the recommended human posture includes a single-person posture, and the displaying the real human posture and the recommended human posture in the shooting interface includes: according to the object in the scene The position in the image, the real human body posture and the single-person posture are displayed in the shooting interface; or, according to the position of the object in the scene image, the real human body is displayed in the shooting interface. gesture, and the single-person gesture is displayed at the first designated area of the photographing interface. In this way, the object can be effectively guided to complete the shooting of the single-person posture, the shooting experience can be improved, and the recommended human posture can be displayed in a fixed manner and can be displayed with the object, which can meet different posture display requirements.

在一种可能的实现方式中,所述推荐人体姿态包括多人组合姿态,所述方法还包括:根据所述场景图像中多个对象之间的相对位置,以及所述多人组合姿态中各个姿态之间的相对位置,从所述多人组合姿态中分别确定各对象的对应姿态;其中,在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,包括:针对任一对象,根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述对象的真实人体姿态以及所述对象的对应姿态;或,根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述对象的真实人体姿态,并在所述拍摄界面的第二指定区域处显示所述对象的对应姿态。通过该方式,能够有效引导多个对象共同完成多人组合姿态,提升拍摄体验,并且推荐人体姿态可固定显示、可跟随对象显示,能够满足不同的姿态显示需求。In a possible implementation manner, the recommended human body posture includes a multi-person combined posture, and the method further includes: according to relative positions between multiple objects in the scene image, and each of the multi-person combined postures The relative position between the gestures, and the corresponding gestures of each object are determined from the combined gestures of the multiple people; wherein, displaying the real human body gesture and the recommended human body gesture in the shooting interface includes: for any object , according to the position of the object in the scene image, display the real human body posture of the object and the corresponding posture of the object in the shooting interface; or, according to the position of the object in the scene image , the real human body posture of the object is displayed in the shooting interface, and the corresponding posture of the object is displayed in the second designated area of the shooting interface. In this way, multiple objects can be effectively guided to complete the combined pose of multiple people, and the shooting experience can be improved, and the recommended human pose can be displayed in a fixed manner and can be displayed with the object, which can meet different pose display requirements.

在一种可能的实现方式中,所述方法还包括:确定所述真实人体姿态的第一人体关键点,与所述推荐人体姿态的第二人体关键点之间的关键点对;通过计算所述关键点对的第一相似度,从所述第一人体关键点中确定出所述第一相似度小于第一预设阈值的第三人体关键点;其中,所述在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,包括:突出显示所述第三人体关键点所在的区域,其中,突出显示的方式包括高亮、加粗、变更颜色中的至少一种。通过该方式,能够对 显示的真实人体姿态中相似度低的人体部位,进行突出显示,从而更有效地引导用户调整姿态,以提升拍摄体验。In a possible implementation manner, the method further includes: determining a key point pair between the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; the first similarity of the key point pair, and from the first human body key points, determine a third human body key point whose first similarity is less than a first preset threshold; wherein, described in the shooting interface Displaying the real human body posture and the recommended human body posture includes: highlighting the region where the third human body key point is located, wherein the highlighting method includes at least one of highlighting, bolding, and changing color. In this way, the parts of the human body with low similarity among the displayed real human body postures can be highlighted, so as to more effectively guide the user to adjust the posture, so as to improve the shooting experience.

在一种可能的实现方式中,所述方法还包括:根据所述真实人体姿态与所述推荐人体姿态之间的关键点对的第一相似度,确定所述真实人体姿态与所述推荐人体姿态之间的第二相似度,所述关键点对是根据所述真实人体姿态的第一人体关键点与所述推荐人体姿态的第二人体关键点确定的;在所述拍摄界面中显示所述第二相似度。通过该方式,能够显示真实人体姿态与推荐人体姿态之间整体的相似程度,以有效引导用户根据整体的相似度调整姿态,提升拍摄体验。In a possible implementation manner, the method further includes: determining the real human body posture and the recommended human body according to a first similarity of key point pairs between the real human body posture and the recommended human body posture The second similarity between postures, the key point pair is determined according to the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; the second similarity. In this way, the overall similarity between the real human body posture and the recommended human posture can be displayed, so as to effectively guide the user to adjust the posture according to the overall similarity, and improve the shooting experience.

在一种可能的实现方式中,所述方法还包括:将所述场景图像输入至区域推荐网络,得到所述场景图像的推荐拍照区域,所述推荐拍照区域用于表征在所述场景图像中推荐拍照的区域,所述区域推荐网络是通过标注拍照区域的第一样本集训练得到的神经网络;其中,所述得到场景图像的推荐拍照区域之后,所述方法还包括以下至少一种:在所述拍摄界面中,采用标识符指示所述推荐拍照区域;在所述拍摄界面的所述推荐拍照区域处显示所述推荐人体姿态,所述推荐拍照区域用于引导所述对象根据所述推荐拍照区域进行位置调整。通过该方式,能够向对象推荐视觉效果较好的拍照区域,有效引导对象调整位置来提升拍摄图像的视觉效果,从而提升用户拍摄体验。In a possible implementation manner, the method further includes: inputting the scene image to an area recommendation network to obtain a recommended photographing area of the scene image, where the recommended photographing area is used to represent the scene image in the scene image A recommended photographing area, the area recommendation network is a neural network obtained by training the first sample set marking the photographing area; wherein, after obtaining the recommended photographing area of the scene image, the method further includes at least one of the following: In the shooting interface, an identifier is used to indicate the recommended photographing area; the recommended human posture is displayed at the recommended photographing area of the photographing interface, and the recommended photographing area is used to guide the object according to the It is recommended to adjust the position of the photo area. In this way, a photographing area with better visual effect can be recommended to the subject, and the subject can be effectively guided to adjust the position to improve the visual effect of the photographed image, thereby improving the user's photographing experience.

在一种可能的实现方式中,对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态,包括:通过姿态推荐网络对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态,其中,所述姿态推荐网络是通过标注有样本场景类别以及样本人体姿态的第二样本集训练得到的神经网络,所述第二样本集包括对象上传的样本图像。通过该方式,能够使姿态推荐网络从优质图像中学习人体姿态,从而有效地基于场景图像输出优质的推荐人体姿态。In a possible implementation manner, recognizing the scene image, determining a scene category corresponding to the scene image and a recommended human pose corresponding to the scene category, includes: performing a pose recommendation network on the scene image. Identify and determine the scene category corresponding to the scene image and the recommended human posture corresponding to the scene category, wherein the posture recommendation network is a neural network obtained by training the second sample set marked with the sample scene category and the sample human posture. network, the second sample set includes sample images uploaded by the object. In this way, the pose recommendation network can learn the human pose from the high-quality images, thereby effectively outputting high-quality recommended human poses based on the scene images.

在一种可能的实现方式中,所述推荐人体姿态包括多个,所述方法还包括:在所述拍摄界面中显示多个推荐人体姿态对应的姿态图;响应于针对所述姿态图的选择操作,根据选中的姿态图,确定与所述选中的姿态图对应的推荐人体姿态并进行显示。通过该方式,能够便于用户选择不同的推荐人体姿态,满足不同的姿态喜好,提升拍摄体验,有利于得到令用户满意的拍摄图像。In a possible implementation manner, the recommended human body postures include multiple ones, and the method further includes: displaying in the shooting interface a plurality of posture diagrams corresponding to the recommended human body postures; in response to the selection of the posture diagrams Operation, according to the selected posture map, determine and display the recommended human body posture corresponding to the selected posture map. In this way, it is convenient for the user to select different recommended human body postures, to satisfy different posture preferences, to improve the shooting experience, and to obtain a photographed image that satisfies the user.

在一种可能的实现方式中,所述拍摄条件包括:所述真实人体姿态与所述推荐人体姿态之间的第二相似度大于或等于第二预设阈值;或,所述拍摄界面中的拍摄控件被触发。In a possible implementation manner, the shooting conditions include: a second degree of similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold; or, in the shooting interface Shooting controls are triggered.

在一种可能的实现方式中,所述方法还包括:对所述拍摄图像进行处理,得到处理后的拍摄图像,其中,所述处理包括以下至少一种:对所述拍摄图像中的对象进行美颜处理、对所述拍摄图像添加滤镜;所述对所述拍摄图像添加滤镜,包括:根据与所述推荐人体姿态对应的推荐滤镜,调整所述拍摄图像的饱和度、色温、亮度中的至少一种。通过该方式,能够使拍摄图像的视觉效果更优质,有利于得到令用户满意的拍摄图像,提升图像拍摄体验。In a possible implementation manner, the method further includes: processing the captured image to obtain a processed captured image, wherein the processing includes at least one of the following: performing processing on an object in the captured image Beauty processing, adding a filter to the captured image; the adding a filter to the captured image includes: adjusting the saturation, color temperature, color temperature, etc. of the captured image according to the recommended filter corresponding to the recommended human posture. at least one of brightness. In this way, the visual effect of the captured image can be improved, which is conducive to obtaining a captured image that satisfies the user, and improves the image capturing experience.

在一种可能的实现方式中,所述方法应用于人工智能教育平台,所述人工智能教育平台与智能硬件连接;其中,所述人工智能教育平台用于编辑实现所述图像拍摄方法的项目代码,并将所述项目代码发送至所述智能硬件;所述智能硬件用于采集场景图像及运行所述项目代码,得到运行结果,并将所述运行结果发送至所述人工智能教育平台,以在所述人工智能教育平台的显示界面中显示所述运行结果。通过该方式,能够使用户通过有趣的案例进行学习,学习人体姿态识别、人体关键点检测等人工智能算法。In a possible implementation manner, the method is applied to an artificial intelligence education platform, and the artificial intelligence education platform is connected with intelligent hardware; wherein, the artificial intelligence education platform is used to edit the project code for realizing the image capturing method , and send the project code to the intelligent hardware; the intelligent hardware is used to collect scene images and run the project code to obtain the operation result, and send the operation result to the artificial intelligence education platform to The operation result is displayed on the display interface of the artificial intelligence education platform. In this way, users can learn through interesting cases, and learn artificial intelligence algorithms such as human body gesture recognition and human body key point detection.

根据本公开的一方面,提供了一种图像拍摄装置,包括:获取部分,被配置为获取与拍摄界面对应的场景图像;识别部分,被配置为对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态;检测部分,被配置为对所述场景图像进行人体关键点检测,确定所述场景图像中的对象以及所述对象的真实人体姿态;显示部分,被配置为通过在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,引导所述对象根据所述推荐人体姿态进行姿态调整;拍摄部分,被配置为在满足拍摄条件的情况下,得到所述对象的拍摄图像。According to an aspect of the present disclosure, there is provided an image capturing apparatus, comprising: an acquisition part configured to acquire a scene image corresponding to a shooting interface; a recognition part configured to recognize the scene image and determine the scene The scene category corresponding to the image and the recommended human body posture corresponding to the scene category; the detection part is configured to perform human key point detection on the scene image, and determine the object in the scene image and the real human body posture of the object The display part is configured to guide the object to adjust the posture according to the recommended human body posture by displaying the real human body posture and the recommended human body posture in the shooting interface; the shooting part is configured to meet the requirements of shooting condition, a captured image of the object is obtained.

在一种可能的实现方式中,所述推荐人体姿态包括单人姿态,所述显示部分,包括:第一显示子部分,被配置为根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态以及所述单人姿态;或,第二显示子部分,被配置为根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态,并在所述拍摄界面的第一指定区域处显示所述单人姿态。In a possible implementation manner, the recommended human posture includes a single-person posture, and the display part includes: a first display sub-part, configured to display the position of the object in the scene image according to the position of the object in the scene image. The real human body posture and the single-person posture are displayed in the shooting interface; or, the second display subsection is configured to display the object in the shooting interface according to the position of the object in the scene image. The real human body posture is displayed, and the single-person posture is displayed in the first designated area of the shooting interface.

在一种可能的实现方式中,所述推荐人体姿态包括多人组合姿态,所述装置还包括:位置确定部分,被配置为根据所述场景图像中多个对象之间的相对位置,以及所述多人组合姿态中各个姿态之间的相对位置,从所述多人组合姿态中分别确定各对象的对应姿态;其中,所述显示部分,包括:第三显示子部分,被配置为针对任一对象,根据所述对象在所述场景图像中的位置,在所述拍摄界 面中显示所述对象的真实人体姿态以及所述对象的对应姿态;或,第四显示子部分,被配置为根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述对象的真实人体姿态,并在所述拍摄界面的第二指定区域处显示所述对象的对应姿态。In a possible implementation manner, the recommended human body posture includes a multi-person combined posture, and the apparatus further includes: a position determination part configured to The relative position between each posture in the multi-person combined posture, and the corresponding posture of each object is respectively determined from the multi-person combined posture; wherein, the display part includes: a third display sub-section, which is configured for any an object, according to the position of the object in the scene image, display the real human body posture of the object and the corresponding posture of the object in the shooting interface; or, the fourth display sub-section is configured to be based on The position of the object in the scene image, the real human body posture of the object is displayed in the shooting interface, and the corresponding posture of the object is displayed in the second designated area of the shooting interface.

在一种可能的实现方式中,所述装置还包括:第一确定部分,被配置为确定所述真实人体姿态的第一人体关键点,与所述推荐人体姿态的第二人体关键点之间的关键点对;第二确定部分,被配置为通过计算所述关键点对的第一相似度,从所述第一人体关键点中确定出所述第一相似度小于第一预设阈值的第三人体关键点;其中,所述显示部分,包括:突出显示子部分,被配置为突出显示所述第三人体关键点所在的区域,其中,突出显示的方式包括高亮、加粗、变更颜色中的至少一种。In a possible implementation manner, the apparatus further includes: a first determining part configured to determine the distance between the first human body key point of the real human body posture and the second human body key point of the recommended human body posture The key point pair; the second determination part is configured to determine from the first human body key points that the first similarity is less than the first preset threshold by calculating the first similarity of the key point pair. The third human body key point; wherein, the display part, including: a highlight sub-section, is configured to highlight the region where the third human body key point is located, wherein the highlighting method includes highlighting, bolding, changing at least one of the colors.

在一种可能的实现方式中,所述装置还包括:第三确定部分,被配置为根据所述真实人体姿态与所述推荐人体姿态之间的关键点对的第一相似度,确定所述真实人体姿态与所述推荐人体姿态之间的第二相似度,所述关键点对是根据所述真实人体姿态的第一人体关键点与所述推荐人体姿态的第二人体关键点确定的;相似度显示部分,被配置为在所述拍摄界面中显示所述第二相似度。In a possible implementation manner, the apparatus further includes: a third determination part, configured to determine the the second similarity between the real human body posture and the recommended human body posture, and the key point pair is determined according to the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; The similarity display part is configured to display the second similarity in the shooting interface.

在一种可能的实现方式中,所述装置还包括:区域确定部分,被配置为将所述场景图像输入至区域推荐网络,得到所述场景图像的推荐拍照区域,所述推荐拍照区域用于表征在所述场景图像中推荐拍照的区域,所述区域推荐网络是通过标注拍照区域的第一样本集训练得到的神经网络;其中,所述装置还包括:区域显示部分,被配置为在所述拍摄界面中,采用标识符指示所述推荐拍照区域;或者,在所述拍摄界面的所述推荐拍照区域处显示所述推荐人体姿态;所述推荐拍照区域用于引导所述对象根据所述推荐拍照区域进行位置调整。In a possible implementation manner, the apparatus further includes: an area determination part, configured to input the scene image into an area recommendation network to obtain a recommended photographing area of the scene image, where the recommended photographing area is used for Representing the area recommended to be photographed in the scene image, and the area recommendation network is a neural network trained by marking the first sample set of the photographing area; wherein, the device further includes: an area display part, configured to In the shooting interface, an identifier is used to indicate the recommended photographing area; or, the recommended human posture is displayed in the recommended photographing area of the photographing interface; the recommended photographing area is used to guide the object according to the recommended photographing area. Adjust the position of the recommended photo area as described above.

在一种可能的实现方式中,所述区域显示部分,还被配置为在所述拍摄界面中,采用标识符指示所述推荐拍照区域;在所述拍摄界面的所述推荐拍照区域处显示所述推荐人体姿态。In a possible implementation manner, the area display part is further configured to use an identifier to indicate the recommended photographing area in the photographing interface; and display the recommended photographing area on the photographing interface. The recommended human posture is described.

在一种可能的实现方式中,所述识别部分,还被配置为:通过姿态推荐网络对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态,其中,所述姿态推荐网络是通过标注有样本场景类别以及样本人体姿态的第二样本集训练得到的神经网络,所述第二样本集包括对象上传的样本图像。In a possible implementation manner, the identifying part is further configured to: identify the scene image through a gesture recommendation network, and determine a scene category corresponding to the scene image and a recommended human body corresponding to the scene category pose, wherein the pose recommendation network is a neural network obtained by training a second sample set marked with sample scene categories and sample human poses, and the second sample set includes sample images uploaded by objects.

在一种可能的实现方式中,所述推荐人体姿态包括多个,所述装置还包括:姿态图显示部分,被配置为在所述拍摄界面中显示多个推荐人体姿态对应的姿态图;选择部分,被配置为响应于针对所述姿态图的选择操作,根据选中的姿态图,确定与所述选中的姿态图对应的推荐人体姿态并进行显示。In a possible implementation manner, the recommended human body postures include a plurality of postures, and the apparatus further includes: a posture map display part configured to display the posture maps corresponding to the plurality of recommended human body postures in the shooting interface; selecting The part is configured to, in response to a selection operation on the posture map, determine and display a recommended human body posture corresponding to the selected posture map according to the selected posture map.

在一种可能的实现方式中,所述拍摄条件包括:所述真实人体姿态与所述推荐人体姿态之间的第二相似度大于或等于第二预设阈值;或,所述拍摄界面中的拍摄控件被触发。In a possible implementation manner, the shooting conditions include: a second degree of similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold; or, in the shooting interface Shooting controls are triggered.

在一种可能的实现方式中,所述装置还包括:处理部分,被配置为对所述拍摄图像进行处理,得到处理后的拍摄图像,其中,所述处理包括以下至少一种:对所述拍摄图像中的对象进行美颜处理、对所述拍摄图像添加滤镜;所述对所述拍摄图像添加滤镜,包括:根据与所述推荐人体姿态对应的推荐滤镜,调整所述拍摄图像的饱和度、色温、亮度中的至少一种。In a possible implementation manner, the apparatus further includes: a processing part configured to process the captured image to obtain a processed captured image, wherein the processing includes at least one of the following: Performing beautification processing on the object in the captured image, and adding a filter to the captured image; the adding a filter to the captured image includes: adjusting the captured image according to the recommended filter corresponding to the recommended human posture at least one of saturation, color temperature, and brightness.

在一种可能的实现方式中,所述装置应用于人工智能教育平台,所述人工智能教育平台与智能硬件连接;其中,所述人工智能教育平台用于编辑实现所述图像拍摄装置的项目代码,并将所述项目代码发送至所述智能硬件;所述智能硬件用于采集场景图像及运行所述项目代码,得到运行结果,并将所述运行结果发送至所述人工智能教育平台,以在所述人工智能教育平台的显示界面中显示所述运行结果。In a possible implementation, the device is applied to an artificial intelligence education platform, and the artificial intelligence education platform is connected to intelligent hardware; wherein, the artificial intelligence education platform is used to edit and implement the project code of the image capturing device , and send the project code to the intelligent hardware; the intelligent hardware is used to collect scene images and run the project code to obtain the operation result, and send the operation result to the artificial intelligence education platform to The operation result is displayed on the display interface of the artificial intelligence education platform.

根据本公开的一方面,提供了一种电子设备,包括:处理器;被配置为存储处理器可执行指令的存储器;其中,所述处理器被配置为调用所述存储器存储的指令,以执行上述方法。According to an aspect of the present disclosure, there is provided an electronic device, comprising: a processor; a memory configured to store instructions executable by the processor; wherein the processor is configured to invoke the instructions stored in the memory to execute the above method.

根据本公开的一方面,提供了一种计算机可读存储介质,其上存储有计算机程序指令,所述计算机程序指令被处理器执行时实现上述方法。According to an aspect of the present disclosure, there is provided a computer-readable storage medium having computer program instructions stored thereon, the computer program instructions implementing the above method when executed by a processor.

根据本公开的一方面,提供了一种计算机程序,包括计算机可读代码,在所述计算机可读代码在电子设备中运行的情况下,所述电子设备中的处理器执行时实现上述方法。According to an aspect of the present disclosure, there is provided a computer program comprising computer-readable code, which, when the computer-readable code is executed in an electronic device, implements the above method when executed by a processor in the electronic device.

在本公开实施例中,能够根据场景图像的场景类别,向用户推荐与场景类别对应的推荐人体姿态,这样使得推荐人体姿态是与场景图像匹配的,并且通过在拍摄界面中显示真实人体姿态以及推荐人体姿态,可引导用户根据显示的推荐人体姿态进行姿态调整或便于用户根据显示的推荐人体姿态来指导拍摄对象进行姿态调整,有效提高图像拍摄效果,提升用户的拍摄体验,有利于得到令用户满足的拍摄图像。In the embodiment of the present disclosure, the recommended human body pose corresponding to the scene category can be recommended to the user according to the scene category of the scene image, so that the recommended human body pose matches the scene image, and by displaying the real human body pose and The recommended human posture can guide the user to adjust the posture according to the displayed recommended human posture or facilitate the user to guide the subject to adjust the posture according to the displayed recommended human posture. Satisfied taking pictures.

应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,而非限制本公开。 根据下面参考附图对示例性实施例的详细说明,本公开的其它特征及方面将变得清楚。It is to be understood that the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the present disclosure. Other features and aspects of the present disclosure will become apparent from the following detailed description of exemplary embodiments with reference to the accompanying drawings.

附图说明Description of drawings

此处的附图被并入说明书中并构成本说明书的一部分,这些附图示出了符合本公开的实施例,并与说明书一起用于说明本公开的技术方案。The accompanying drawings, which are incorporated into and constitute a part of this specification, illustrate embodiments consistent with the present disclosure, and together with the description, serve to explain the technical solutions of the present disclosure.

图1示出根据本公开实施例的图像拍摄方法的流程图。FIG. 1 shows a flowchart of an image capturing method according to an embodiment of the present disclosure.

图2示出根据本公开实施例的一种拍摄界面的示意图。FIG. 2 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.

图3示出根据本公开实施例的一种多人组合姿态的示意图。FIG. 3 shows a schematic diagram of a multi-person combined gesture according to an embodiment of the present disclosure.

图4示出根据本公开实施例的一种真实人体姿态的示意图。FIG. 4 shows a schematic diagram of a real human body pose according to an embodiment of the present disclosure.

图5示出根据本公开实施例的一种拍摄界面的示意图。FIG. 5 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.

图6示出根据本公开实施例的场景图像的示意图。FIG. 6 shows a schematic diagram of a scene image according to an embodiment of the present disclosure.

图7示出根据本公开实施例的拍摄界面的示意图。FIG. 7 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.

图8示出根据本公开实施例的一种拍摄界面的示意图。FIG. 8 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.

图9示出根据本公开实施例的一种拍摄界面的示意图。FIG. 9 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.

图10示出根据本公开实施例的一种拍摄界面的示意图。FIG. 10 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure.

图11a、图11b示出根据本公开实施例的拍摄界面的示意图。11a and 11b illustrate schematic diagrams of a photographing interface according to an embodiment of the present disclosure.

图12示出根据本公开实施例的图像拍摄方法的应用示意图。FIG. 12 shows a schematic diagram of an application of an image capturing method according to an embodiment of the present disclosure.

图13示出根据本公开实施例的图像拍摄装置的框图。FIG. 13 shows a block diagram of an image capturing apparatus according to an embodiment of the present disclosure.

图14示出根据本公开实施例的一种电子设备的框图。FIG. 14 shows a block diagram of an electronic device according to an embodiment of the present disclosure.

具体实施方式Detailed ways

以下将参考附图详细说明本公开的各种示例性实施例、特征和方面。附图中相同的附图标记表示功能相同或相似的元件。尽管在附图中示出了实施例的各种方面,但是除非特别指出,不必按比例绘制附图。Various exemplary embodiments, features and aspects of the present disclosure will be described in detail below with reference to the accompanying drawings. The same reference numbers in the figures denote elements that have the same or similar functions. While various aspects of the embodiments are shown in the drawings, the drawings are not necessarily drawn to scale unless otherwise indicated.

在这里专用的词“示例性”意为“用作例子、实施例或说明性”。这里作为“示例性”所说明的任何实施例不必解释为优于或好于其它实施例。The word "exemplary" is used exclusively herein to mean "serving as an example, embodiment, or illustration." Any embodiment described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other embodiments.

本文中术语“和/或”,仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,本文中术语“至少一种”表示多种中的任意一种或多种中的至少两种的任意组合,例如,包括A、B、C中的至少一种,可以表示包括从A、B和C构成的集合中选择的任意一个或多个元素。The term "and/or" in this article is only an association relationship to describe the associated objects, indicating that there can be three kinds of relationships, for example, A and/or B, it can mean that A exists alone, A and B exist at the same time, and A and B exist independently B these three cases. In addition, the term "at least one" herein refers to any combination of any one of the plurality or at least two of the plurality, for example, including at least one of A, B, and C, and may mean including from A, B, and C. Any one or more elements selected from the set of B and C.

应当理解,本公开的权利要求、说明书及附图中的术语“第一”、“第二”及“第三”等是用于区别描述,而不是用于描述特定顺序,也不能理解为指示或暗示相对重要性。本公开的说明书和权利要求书中使用的术语“包括”和“包含”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。It should be understood that the terms "first", "second" and "third" in the claims, description and drawings of the present disclosure are used for distinguishing descriptions, rather than for describing a specific order, nor can they be construed as indications Or imply relative importance. The terms "comprising" and "comprising" as used in the specification and claims of the present disclosure indicate the presence of the described feature, integer, step, operation, element and/or component, but do not exclude one or more other features, integers , step, operation, element, component and/or the presence or addition of a collection thereof.

另外,为了更好地说明本公开,在下文的具体实施方式中给出了众多的具体细节。本领域技术人员应当理解,没有某些具体细节,本公开同样可以实施。在一些实例中,对于本领域技术人员熟知的方法、手段、元件和电路未作详细描述,以便于凸显本公开的主旨。In addition, in order to better illustrate the present disclosure, numerous specific details are set forth in the following detailed description. It will be understood by those skilled in the art that the present disclosure may be practiced without certain specific details. In some instances, methods, means, components and circuits well known to those skilled in the art have not been described in detail so as not to obscure the subject matter of the present disclosure.

图1示出根据本公开实施例的图像拍摄方法的流程图,如图1所示,所述图像拍摄方法包括:FIG. 1 shows a flowchart of an image capturing method according to an embodiment of the present disclosure. As shown in FIG. 1 , the image capturing method includes:

在步骤S11中,获取与拍摄界面对应的场景图像;In step S11, obtain a scene image corresponding to the shooting interface;

在步骤S12中,对场景图像进行识别,确定场景图像对应的场景类别以及与场景类别对应的推荐人体姿态;In step S12, the scene image is identified, and the scene category corresponding to the scene image and the recommended human posture corresponding to the scene category are determined;

在步骤S13中,对场景图像进行人体关键点检测,确定场景图像中的对象以及对象的真实人体姿态;In step S13, the human body key point detection is performed on the scene image, and the object in the scene image and the real human body posture of the object are determined;

在步骤S14中,在拍摄界面中显示真实人体姿态以及推荐人体姿态,以引导对象根据推荐人体姿态进行姿态调整;In step S14, the real human body posture and the recommended human body posture are displayed in the shooting interface, so as to guide the object to adjust the posture according to the recommended human body posture;

在步骤S15中,在满足拍摄条件的情况下,得到对象的拍摄图像。In step S15, when the photographing conditions are satisfied, a photographed image of the object is obtained.

在一种可能的实现方式中,所述图像拍摄方法可以由终端设备执行,终端设备可以为用户设备(User Equipment,UE)、移动设备、用户终端、终端、蜂窝电话、无绳电话、个人数字助理(Personal  Digital Assistant,PDA)、手持设备、计算设备、车载设备、可穿戴设备等,所述方法可以通过处理器调用存储器中存储的计算机可读指令的方式来实现。本公开实施例对于终端设备的类型不作限制。In a possible implementation manner, the image capturing method may be performed by a terminal device, and the terminal device may be a user equipment (User Equipment, UE), a mobile device, a user terminal, a terminal, a cellular phone, a cordless phone, or a personal digital assistant (Personal Digital Assistant, PDA), handheld device, computing device, vehicle-mounted device, wearable device, etc., the method can be implemented by the processor calling the computer-readable instructions stored in the memory. This embodiment of the present disclosure does not limit the type of the terminal device.

在一种可能的实现方式中,终端设备中可部署有实现该图像拍摄方法的应用程序(Application,APP),这样用户可通过直接使用该应用程序,进行图像拍摄;也可以将该图像拍摄方法作为一种拍摄模式,集成在终端设备自带的拍摄功能中,这样用户在使用终端设备的拍摄功能时,可选择实现该图像拍摄方法的拍摄模式,进行图像拍摄,对此本公开实施例不作限制。In a possible implementation manner, an application (Application, APP) implementing the image capturing method may be deployed in the terminal device, so that the user can directly use the application to capture images; As a shooting mode, it is integrated into the shooting function of the terminal device, so that when the user uses the shooting function of the terminal device, the user can select the shooting mode that realizes the image shooting method to shoot the image, which is not made in this embodiment of the present disclosure. limit.

应理解的是,在启动实现本公开中图像拍摄方法的应用程序,或选择实现本公开中图像拍摄方法的拍摄模式后,可通过终端设备上装配的图像采集设备(如摄像头),或与终端设备连接的图像采集设备,实时采集实际场景的场景图像,并显示在终端设备的拍摄界面中。在步骤S11中,获取与拍摄界面对应的场景图像,也即,获取图像采集设备采集的场景图像。It should be understood that, after starting an application for implementing the image capturing method in the present disclosure, or selecting a capturing mode for implementing the image capturing method in the present disclosure, an image capturing device (such as a camera) mounted on the terminal device may be used, or an image capturing device (such as a camera) assembled on the terminal device or a The image acquisition device connected to the device collects the scene image of the actual scene in real time and displays it on the shooting interface of the terminal device. In step S11, a scene image corresponding to the shooting interface is acquired, that is, a scene image acquired by an image acquisition device is acquired.

图2示出根据本公开实施例的一种拍摄界面的示意图,如图2所示,拍摄界面中可显示场景图像,拍摄界面中可提供用于触发拍摄的拍摄控件22、用于查看已拍摄图像的查看控件21、用于触发对场景图像进行识别的触发控件23等。FIG. 2 shows a schematic diagram of a shooting interface according to an embodiment of the present disclosure. As shown in FIG. 2 , a scene image can be displayed in the shooting interface, and a shooting control 22 for triggering shooting can be provided in the shooting interface, which is used to check the shooting interface. A viewing control 21 for an image, a triggering control 23 for triggering recognition of a scene image, and the like.

需要说明的是,图2示出的拍摄界面是本公开实施例提供的一种实现方式,应理解的是,本公开应不限于此,本领域技术人员可根据实际需求设置拍摄界面中的功能控件,对此本公开实施例不作限制。It should be noted that the shooting interface shown in FIG. 2 is an implementation provided by the embodiment of the present disclosure. It should be understood that the present disclosure should not be limited to this, and those skilled in the art can set the functions in the shooting interface according to actual needs. control, which is not limited by this embodiment of the present disclosure.

在一种可能的实现方式中,所述图像拍摄方法可以也可以由服务器执行,服务器可以是独立的物理服务器,也可以是多个物理服务器构成的服务器集群或者分布式系统,还可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、CDN、以及大数据和人工智能平台等基础云计算服务的云服务器等,本公开实施例对于服务器的类型不作限制。In a possible implementation manner, the image capturing method may also be performed by a server, and the server may be an independent physical server, a server cluster or a distributed system composed of multiple physical servers, or a cloud-provided server. Services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN, and cloud servers for basic cloud computing services such as big data and artificial intelligence platforms, etc. The disclosed embodiments do not limit the type of server.

在所述图像拍摄方法由服务器执行的情况下,在步骤S11中,获取与拍摄界面对应的场景图像,也即,服务器接收终端发送的场景图像,其中,场景图像为终端通过图像采集设备采集的图像。In the case where the image capturing method is executed by the server, in step S11, a scene image corresponding to the shooting interface is acquired, that is, the server receives the scene image sent by the terminal, wherein the scene image is collected by the terminal through the image capturing device image.

在一种可能的实现方式中,在步骤S12中,可通过触发如图2所示的触发控件23,触发对场景图像进行识别;也可在用户启动实现图像拍摄方法的应用程序,或选择实现图像拍摄方法的拍摄模式后,自动触发对场景图像进行识别,对此本公开实施例不作限制。In a possible implementation manner, in step S12, the recognition of the scene image can be triggered by triggering the trigger control 23 as shown in FIG. After the shooting mode of the image shooting method, the scene image recognition is automatically triggered, which is not limited in this embodiment of the present disclosure.

在一种可能的实现方式中,在步骤S12中,对场景图像进行识别,确定场景图像对应的场景类别以及与场景类别对应的推荐人体姿态,可包括:对场景图像进行识别,得到场景图像中的场景特征与对象特征中的至少之一;根据识别出的场景特征与对象特征中的至少之一,确定场景图像对应的场景类型;根据预设的场景类型与推荐人体姿态之间的对应关系,确定与场景类型对应的推荐人体姿态。In a possible implementation manner, in step S12, recognizing the scene image, determining the scene category corresponding to the scene image and the recommended human pose corresponding to the scene category, may include: recognizing the scene image, and obtaining the scene image in the scene image. at least one of the scene feature and the object feature of the image; according to at least one of the identified scene feature and the object feature, determine the scene type corresponding to the scene image; according to the correspondence between the preset scene type and the recommended human posture , and determine the recommended human pose corresponding to the scene type.

其中,场景特征可用于指示对象所处的实际场景的场景类型,例如,草原的场景特征可包括草地、牛羊等;室内的场景特征可包括墙面、窗户等;若识别出的场景特征包括草地和牛羊,可得到场景图像对应的场景类型为草原。The scene features can be used to indicate the scene type of the actual scene where the object is located. For example, the scene features of the grassland can include grass, cattle and sheep, etc.; the scene features of the indoor can include walls, windows, etc.; if the identified scene features include Grassland and cattle and sheep, the scene type corresponding to the obtained scene image is grassland.

在一种可能的实现方式中,对象特征可包括以下至少一种:服饰特征、形体特征、性别特征等。其中,服饰特征可用于指示对象的穿着,形体特征可用于指示对象的体型(例如体型高大、体型小巧),性别特征可用于指示对象的性别。通过该方式,能够结合场景特征与对象特征确定场景类型,从而获得更精准的推荐人体姿态。In a possible implementation manner, the object features may include at least one of the following: clothing features, physical features, gender features, and the like. Among them, the clothing feature can be used to indicate the clothing of the object, the body feature can be used to indicate the body shape of the object (for example, tall and small), and the gender feature can be used to indicate the gender of the object. In this way, the scene type can be determined in combination with the scene feature and the object feature, so as to obtain a more accurate recommended human pose.

举例来说,若场景特征指示实际场景为室内,对象特征指示对象穿着“毛衣、长裤、手提包”、对象为小巧女生,则场景类别可为“室内-小巧-女生-穿着毛衣、长裤、手提包”,进而依此场景类别确定对应的推荐人体姿态。For example, if the scene feature indicates that the actual scene is indoor, the object feature indicates that the object is wearing "sweater, trousers, handbag", and the object is a small girl, the scene category can be "indoor-small-girl-wearing sweater, trousers" , handbag”, and then determine the corresponding recommended human pose according to this scene category.

其中,可通过预训练的场景识别网络对场景图像进行识别,得到场景图像中的场景特征和/或对象特征,对于场景识别网络的网络类型、网络结果以及训练方式,本公开实施例不作限制。The scene image can be recognized by a pre-trained scene recognition network to obtain scene features and/or object features in the scene image, and the embodiments of the present disclosure do not limit the network type, network result, and training method of the scene recognition network.

在一种可能的实现方式中,还可将识别出的场景特征与对象特征显示在拍摄界面中,例如,可通过标签的方式标注场景特征与对象特征,以提高图像拍摄的趣味性。In a possible implementation manner, the identified scene features and object features can also be displayed in the shooting interface, for example, the scene features and object features can be marked by means of tags, so as to improve the interest of image shooting.

应理解的是,不同的场景类型可对应不同的推荐人体姿态,可预设场景类型与推荐人体姿态的对应关系,在根据识别出的场景特征确定出场景类型后,可根据该对应关系确定与场景类别对应的推荐人体姿态。It should be understood that different scene types may correspond to different recommended human postures, and the corresponding relationship between scene types and recommended human postures can be preset. The recommended human pose corresponding to the scene category.

在一种可能的实现方式中,在步骤S12中,也可通过预训练的姿态推荐网络,对场景图像进行识别,确定场景图像对应的场景类别以及与场景类别对应的推荐人体姿态。其中,对于姿态推荐网络的网络类型、网络结构以及训练方式等,本公开实施例不作限制。In a possible implementation manner, in step S12, a pre-trained gesture recommendation network may also be used to identify the scene image, and determine the scene category corresponding to the scene image and the recommended human pose corresponding to the scene category. The embodiments of the present disclosure do not limit the network type, network structure, training method, etc. of the gesture recommendation network.

在一种可能的实现方式中,可采用自监督的训练方式,训练姿态推荐网络学习不同场景类型下的推荐人体姿态,或者说,学习场景类型与推荐人体姿态之间的对应关系,从而可利用该姿态推荐网络基于场景图像直接得到推荐人体姿态。通过该方式,相对于上述根据预设的场景类型与推荐人体姿态之间的对应关系确定推荐人体姿态的方式,可智能化地得到更为丰富的场景类型以及推荐人体姿态,并可更为准确地向用户推荐与场景图像对应的推荐人体姿态。In a possible implementation, a self-supervised training method can be used to train the pose recommendation network to learn the recommended human poses under different scene types, or, in other words, to learn the correspondence between the scene types and the recommended human poses, so as to use The pose recommendation network directly obtains the recommended human pose based on the scene image. In this way, compared with the above-mentioned method of determining the recommended human posture according to the corresponding relationship between the preset scene type and the recommended human posture, more abundant scene types and recommended human postures can be obtained intelligently, and more accurate It recommends the recommended human pose corresponding to the scene image to the user.

在一种可能的实现方式中,推荐人体姿态可包括上述姿态推荐网络从样本图像中学习得到的人体姿态,例如还可包括用户(如技术人员)手动绘制的人体姿态,对此本公开实施例不作限制。应理解的是,推荐人体姿态可通过人体骨架(如人体关节点连接得到的骨架)的形式表征,也可通过人体轮廓的形式表征,对此本公开实施例不作限制。In a possible implementation manner, the recommended human body posture may include the human body posture learned from the sample images by the above-mentioned posture recommendation network, for example, may also include the human body posture manually drawn by a user (eg, a technician), to which the embodiments of the present disclosure No restrictions apply. It should be understood that the recommended human posture may be represented in the form of a human skeleton (eg, a skeleton obtained by connecting human joint points), or in the form of a human body outline, which is not limited to this embodiment of the present disclosure.

其中,样本图像可是拍摄效果较优的优质图像,从优质图像中学习人体姿态并向用户推荐,可引导对象接近优质图像中的人体姿态,从而获得较好的拍摄效果,提升用户拍摄体验。Among them, the sample image is a high-quality image with better shooting effect. Learning the human body posture from the high-quality image and recommending it to the user can guide the object to approach the human body posture in the high-quality image, so as to obtain a better shooting effect and improve the user's shooting experience.

在一种可能的实现方式中,推荐人体姿态可包括单人姿态(例如单人比心),也可包括多人组合姿态(例如双人组合比心)。应理解的是,与场景类别对应的推荐人体姿态可包括至少一种单人姿态和/或至少一种多人组合姿态。图3示出根据本公开实施例的一种多人组合姿态的示意图,如图3所示,姿态n与姿态m共同组成“比心”的姿态。In a possible implementation manner, the recommended human posture may include a single-person posture (for example, a single-person comparison), and may also include a multi-person combined posture (for example, a two-person combination). It should be understood that the recommended human gesture corresponding to the scene category may include at least one single-person gesture and/or at least one multi-person combined gesture. FIG. 3 shows a schematic diagram of a multi-person combined posture according to an embodiment of the present disclosure. As shown in FIG. 3 , the posture n and the posture m together form a posture of “comparing hearts”.

在一种可能的实现方式中,场景类型例如可至少包括:风景类、室内类、户外类等;其中,风景类可分为沙滩类、山水类、草原类、湖泊类等,室内类可分为办公室类、会议室类、居家类等,户外类可分为:街景类、游乐场类、公园类等。In a possible implementation manner, the scene types may, for example, at least include: landscape, indoor, outdoor, etc.; wherein, the scenery can be divided into beach, landscape, grassland, lake, etc., and the indoor can be divided into For office, conference room, home, etc., outdoor category can be divided into: street view category, playground category, park category, etc.

应理解的是,场景类型的分类可不限于此,实际上,本领域技术人员可根据实际需求预先设置各种场景类型,或,也可通过上述姿态推荐网络自学习场景类型,对此本公开实施例不作限制。It should be understood that the classification of scene types may not be limited to this. In fact, those skilled in the art can preset various scene types according to actual needs, or can also self-learn scene types through the above gesture recommendation network. Examples are not limited.

在一种可能的实现方式中,在步骤S13中,可以采用任何已知的人体关键点检测方式,如采用人体关键点检测网络,对场景图像进行人体关键点检测,对此本公开实施例不作限制。其中,对于人体关键点检测网络的网络类型、网络结构以及训练方式,本公开实施例不作限制。In a possible implementation manner, in step S13, any known human body key point detection method may be used, such as using a human body key point detection network to perform human key point detection on the scene image, which is not made in this embodiment of the present disclosure. limit. The embodiments of the present disclosure do not limit the network type, network structure, and training method of the human body key point detection network.

在一种可能的实现方式中,对场景图像进行人体关键点检测,可包括:提取场景图像中对象的人体关节部位关键点(如20个关节部位的人体关键点),其中,人体关节部位关键点的数量及位置可以根据实际需求确定,对此本公开实施例不做限制。In a possible implementation manner, the detection of human body key points on the scene image may include: extracting the key points of human body joint parts of objects in the scene image (for example, the human body key points of 20 joint parts), wherein the key points of the human body joint parts are extracted. The number and position of the points may be determined according to actual requirements, which are not limited in this embodiment of the present disclosure.

在一种可能的实现方式中,可根据检测出的人体关节部位关键点,确定出场景图像中的对象,以及对象的多个人体关节部位关键点在场景图像中的坐标值;进而可以基于坐标值,将多个人体关节部位关键点按照人体的结构连接,得到对象的人体骨架,也即得到对象的真实人体姿态。图4示出根据本公开实施例的一种真实人体姿态的示意图,如图4所示,将20个人体关节部位关键点连接,得到对象的真实人体姿态。In a possible implementation manner, the object in the scene image and the coordinate values of the key points of multiple human joint parts of the object in the scene image can be determined according to the detected key points of the human body joints; value, and connect the key points of multiple human joint parts according to the structure of the human body to obtain the human skeleton of the object, that is, to obtain the real human body posture of the object. FIG. 4 shows a schematic diagram of a real human body posture according to an embodiment of the present disclosure. As shown in FIG. 4 , 20 key points of joint parts of the human body are connected to obtain the real human body posture of the object.

在一种可能的实现方式中,在步骤S13中,对场景图像进行人体关键点检测,也可包括:提取场景图像中对象的人体轮廓上的轮廓关键点,得到对象的人体轮廓,进而用人体轮廓表征对象的真实人体姿态。对于采用何种方式表征人体姿态,可依据实际需求确定,对此本公开实施例不作限制。In a possible implementation manner, in step S13, performing human body key point detection on the scene image may also include: extracting contour key points on the human body contour of the object in the scene image, obtaining the human body contour of the object, and then using the human body The contour represents the real human pose of the object. The manner in which the posture of the human body is represented may be determined according to actual needs, which is not limited by the embodiment of the present disclosure.

需要说明的是,上述步骤S12和步骤S13可以同时执行;也可以是先执行步骤S12,再执行步骤S13;也可先执行步骤S13,再执行步骤S12。具体可依据终端设备的处理能力、该终端设备的资源占用情况、应用过程中对于时延的限制等因素进行设定,对此本公开实施例不做限制。It should be noted that, the above-mentioned steps S12 and S13 may be performed simultaneously; or step S12 may be performed first, and then step S13 may be performed; or step S13 may be performed first, and then step S12 may be performed. Specifically, it may be set according to factors such as the processing capability of the terminal device, the resource occupancy of the terminal device, and the time delay limit in the application process, which is not limited in this embodiment of the present disclosure.

在一种可能的实现方式中,在步骤S14中,在拍摄界面中显示真实人体姿态以及推荐人体姿态,可包括:在拍摄界面中显示推荐人体姿态对应的人体骨架与真实人体姿态对应的人体骨架;或,在拍摄界面中显示推荐人体姿态对应的人体轮廓与真实人体姿态对应的人体轮廓。换句话说,可在拍摄界面中采用人体骨架或人体轮廓的形式,显示真实人体姿态及推荐人体姿态。通过该方式,可引导用户根据显示的推荐人体姿态进行姿态调整,以达到与推荐人体姿态相似度较高的状态,从获得较好的拍摄效果。In a possible implementation manner, in step S14, displaying the real human body posture and the recommended human body posture in the shooting interface may include: displaying the human body skeleton corresponding to the recommended human body posture and the human body skeleton corresponding to the real human body posture in the shooting interface ; or, displaying the human body contour corresponding to the recommended human body posture and the human body contour corresponding to the real human body posture in the shooting interface. In other words, the form of human skeleton or human outline can be used in the shooting interface to display the real human body posture and the recommended human body posture. In this way, the user can be guided to adjust the posture according to the displayed recommended human body posture, so as to achieve a state with a high similarity to the recommended human body posture, thereby obtaining a better shooting effect.

图5示出根据本公开实施例的一种拍摄界面的示意图,如图5所示,可基于人体骨架的形式,显示出对象的真实人体姿态及推荐人体姿态;在一种可能的实现方式中,对真实人体姿态与推荐人体姿态采用不同的显示形式进行显示;示例性地,可用不同的颜色来分别表征对象的真实人体姿态以及推荐人体姿态,如,用黄色表征推荐人体姿态,用绿色表征真实人体姿态。在一种可能的实现方式中,如图5所示,还可在拍摄界面中显示引导信息,如显示真实人体姿态与推荐人体姿态之间的相似度:30%,以及显示引导用户调整姿态的提示语“换个动作试试?尽量跟黄线保持一致”。FIG. 5 shows a schematic diagram of a shooting interface according to an embodiment of the present disclosure. As shown in FIG. 5 , the real human body posture and the recommended human body posture of the object can be displayed based on the form of the human skeleton; in a possible implementation manner , the real human body posture and the recommended human body posture are displayed in different display forms; for example, different colors can be used to represent the real human body posture and the recommended human body posture of the object, for example, the recommended human body posture is represented by yellow, and the recommended human body posture is represented by green. Real human pose. In a possible implementation, as shown in FIG. 5 , guide information can also be displayed in the shooting interface, such as displaying the similarity between the real human body posture and the recommended human posture: 30%, and displaying the information that guides the user to adjust the posture. The prompt "Try another action? Try to be consistent with the yellow line".

应理解的是,图5示出的真实人体姿态及推荐人体姿态的表现形式,是本公开实施例提供的一种实现方式,本公开应不限于此,实际上,本领域技术人员可根据实际需求设置真实人体姿态及推荐 人体姿态的显示形式,如颜色、线条粗细程度,线条类型,区域透明度等等,以及设置引导信息的内容,对此本公开实施例不作限制。It should be understood that the real human body posture and the representation of the recommended human body posture shown in FIG. 5 are an implementation manner provided by the embodiments of the present disclosure, and the present disclosure should not be limited to this. It is required to set the display form of the real human body posture and the recommended human body posture, such as color, line thickness, line type, area transparency, etc., as well as setting the content of the guidance information, which is not limited by this embodiment of the present disclosure.

在一种可能的实现方式中,在步骤S15中,在所述图像拍摄方法由终端设备执行的情况下,拍摄条件可包括拍摄界面中的拍摄控件被触发,可理解为,拍摄者点击如图2、图4、图5示出的拍摄界面中的拍摄控件触发拍照操作,得到对象的拍摄图像并保存本地相册,以供查看。In a possible implementation manner, in step S15, when the image capturing method is executed by the terminal device, the capturing conditions may include that the capturing control in the capturing interface is triggered. It can be understood that the photographer clicks the 2. The shooting controls in the shooting interface shown in FIG. 4 and FIG. 5 trigger the shooting operation, obtain the shooting image of the object and save the local album for viewing.

在本公开实施例中,能够根据场景图像的场景类别,向用户推荐与场景类别对应的推荐人体姿态,这样使得推荐人体姿态是与场景图像匹配的,并且通过在拍摄界面中显示真实人体姿态以及推荐人体姿态,可引导用户根据显示的推荐人体姿态进行姿态调整或便于用户根据显示的推荐人体姿态来指导拍摄对象进行姿态调整,有效提高图像拍摄效果,提升用户的拍摄体验,有利于得到令用户满足的拍摄图像。In the embodiment of the present disclosure, the recommended human body pose corresponding to the scene category can be recommended to the user according to the scene category of the scene image, so that the recommended human body pose matches the scene image, and by displaying the real human body pose and The recommended human posture can guide the user to adjust the posture according to the displayed recommended human posture or facilitate the user to guide the subject to adjust the posture according to the displayed recommended human posture. Satisfied taking pictures.

如上文所述,推荐人体姿态可包括单人姿态,在一种可能的实现方式中,在步骤S14中,在拍摄界面中显示真实人体姿态以及推荐人体姿态,包括:As described above, the recommended human posture may include a single-person posture. In a possible implementation manner, in step S14, the real human posture and the recommended human posture are displayed on the shooting interface, including:

根据对象在场景图像中的位置,在拍摄界面中显示真实人体姿态以及单人姿态;或,According to the position of the object in the scene image, the real human pose and the single-person pose are displayed in the shooting interface; or,

根据对象在场景图像中的位置,在拍摄界面中显示真实人体姿态,并在拍摄界面的第一指定区域处显示单人姿态。According to the position of the object in the scene image, the real human body pose is displayed in the shooting interface, and the single-person pose is displayed at the first designated area of the shooting interface.

应理解的是,场景图像中可包括一个或多个对象。上述对场景图像进行人体关键点检测,可包括:对场景图像进行人体检测,得到场景图像中的对象以及对象的人体区域;进而对人体区域内的对象进行人体关键点检测。应理解的是,人体检测可得到场景图像中是否包含对象、包含的对象的数量以及对象在场景图像中的位置(也即人体区域在场景图像中的位置)等信息。It should be understood that one or more objects may be included in the scene image. The above-mentioned human body key point detection on the scene image may include: performing human body detection on the scene image to obtain the object in the scene image and the human body area of the object; and then performing human body key point detection on the object in the human body area. It should be understood that the human body detection can obtain information such as whether an object is included in the scene image, the number of objects included, and the position of the object in the scene image (ie, the position of the human body region in the scene image).

针对推荐人体姿态为单人姿态的情况,若场景图像中包含一个对象,则可直接根据该一个对象的位置,显示该对象的真实人体姿态;若场景图像中包含多个对象,则按照预设的选取规则,从多个对象中选取对象,根据该选取的对象的位置,显示该选取的对象的真实人体姿态。For the case where the recommended human posture is a single-person posture, if the scene image contains an object, the real human posture of the object can be displayed directly according to the position of the object; if the scene image contains multiple objects, the preset The selection rule is to select an object from multiple objects, and display the real human body posture of the selected object according to the position of the selected object.

在一种可能的实现方式中,预设的选取规则,例如可包括:选取处于场景图像中间位置的对象、或选取距离终端设备最近的对象等,对此本公开实施例不作限制。其中,可采用本领域已知的技术得到对象与终端设备之间的距离,例如,可采用飞行时间法(Time of flight,TOF),对此本公开实施例不作限制。In a possible implementation manner, the preset selection rule may include, for example, selecting an object in the middle of a scene image, or selecting an object closest to the terminal device, etc., which are not limited in this embodiment of the present disclosure. The distance between the object and the terminal device may be obtained by using a technique known in the art, for example, a time of flight (TOF) method may be used, which is not limited in this embodiment of the present disclosure.

如上所述,可基于人体骨架或人体轮廓的形式,表征真实人体姿态以及推荐人体姿态。在一种可能的实现方式中,对象在场景图像中的位置可包括:场景图像中对象的人体骨架上或人体轮廓上的人体关键点的位置坐标;也可包括在场景图像中对象的人体区域的位置,其中,可通过上述人体检测得到该对象的人体区域。As described above, the real human pose can be represented and the human pose can be recommended based on the form of human skeleton or human outline. In a possible implementation manner, the position of the object in the scene image may include: the position coordinates of the key points of the human body on the human skeleton of the object in the scene image or the human body contour; may also include the human body region of the object in the scene image The position of the object, wherein the human body area of the object can be obtained through the above-mentioned human body detection.

应理解的是,通过上述人体关键点检测,可得到人体关键点的位置坐标,进而可依据人体关键点的位置坐标,在拍摄界面中显示真实人体姿态。通过该方式,能够使拍摄界面中的真实人体姿态跟随对象的位置显示。It should be understood that, through the detection of the human body key points, the position coordinates of the human body key points can be obtained, and then the real human body posture can be displayed in the shooting interface according to the position coordinates of the human body key points. In this way, the real human body posture in the photographing interface can be displayed following the position of the object.

在一种可能的实现方式中,对于推荐人体姿态,可设置为跟随对象的位置显示,也即根据对象在场景图像中的位置显示推荐人体姿态;也可设置为在拍摄界面中的第一指定区域显示该推荐人体姿态,也即将推荐人体姿态固定显示在拍摄界面某个预设区域中。In a possible implementation manner, for the recommended human posture, it can be set to follow the position of the object to display, that is, to display the recommended human posture according to the position of the object in the scene image; it can also be set to the first specified in the shooting interface. The area displays the recommended human body posture, that is, the recommended human body posture is fixedly displayed in a preset area of the shooting interface.

在一种可能的实现方式中,根据对象在场景图像中的位置,在拍摄界面中显示单人姿态,可包括:根据场景图像中对象的人体区域的位置,在拍摄界面的该人体区域处显示单人姿态。通过该方式,可较为便捷的实现推荐人体姿态根据对象的位置显示,便于引导对象比较单人姿态与真实人体姿态,以进行姿态调整。In a possible implementation manner, according to the position of the object in the scene image, displaying the single-person posture in the shooting interface may include: according to the position of the human body region of the object in the scene image, displaying on the human body region of the shooting interface Solo pose. In this way, it is relatively convenient to realize that the recommended human body posture is displayed according to the position of the object, and it is convenient to guide the object to compare the posture of a single person with the posture of a real human body for posture adjustment.

在一种可能的实现方式中,根据对象在场景图像中的位置,在拍摄界面中显示单人姿态,还可包括:根据真实人体姿态上任一人体关键点(如颈部关键点)的位置坐标,在拍摄界面中显示单人姿态。该方式可理解为,使单人姿态上任一人体关键点与真实人体姿态上对应的人体关键点重合,例如,可设置使两姿态上颈部关键点重合,并在此基础上显示单人姿态,从而实现单人姿态跟随对象的位置显示。通过该方式,可实现使单人姿态较为精准地跟随对象的位置显示,便于用户对比单人姿态与真实人体姿态。In a possible implementation manner, according to the position of the object in the scene image, the single-person posture is displayed in the shooting interface, and may further include: according to the position coordinates of any key point of the human body (such as the key point of the neck) on the real human body posture to display the single-person pose in the shooting interface. This method can be understood as making any key point of the human body in the single-person posture coincide with the key point of the human body corresponding to the real human body posture. For example, the key points of the neck on the two postures can be set to overlap, and the single-person posture can be displayed on this basis. , so as to realize the position display of the single-person posture following the object. In this way, the single-person posture can be displayed more accurately following the position of the object, which is convenient for the user to compare the single-person posture and the real human body posture.

需要说明的是,以上在拍摄界面中显示单人姿态的方式,是本公开实施例提供的一些实现方式,实际上,本公开应不限于,只要是根据对象在场景图像中的位置显示单人姿态的实现方式,均在本公开的保护范围内。It should be noted that the above manners of displaying the pose of a single person in the shooting interface are some implementation manners provided by the embodiments of the present disclosure. In fact, the present disclosure should not be limited, as long as the single person is displayed according to the position of the object in the scene image The implementation manner of the gesture is all within the protection scope of the present disclosure.

在一种可能的实现方式中,拍摄界面中的第一指定区域,可根据实际需求设定,例如,可是拍摄界面的中间区域、左侧区域、右侧区域、四个顶点区域等,对此本公开实施例不作限制。通过在 拍摄界面的第一指定区域处显示单人姿态,能够将推荐人体姿态固定显示在拍摄界面中,节省绘制推荐人体姿态所需的运算资源。In a possible implementation manner, the first designated area in the shooting interface can be set according to actual needs, for example, it can be the middle area, the left area, the right area, the four vertex areas, etc. of the shooting interface. The embodiments of the present disclosure are not limited. By displaying the single-person posture at the first designated area of the photographing interface, the recommended human body posture can be fixedly displayed in the photographing interface, thereby saving computing resources required for drawing the recommended human body posture.

其中,对于第一指定区域的范围大小,单人姿态在第一指定区域内的显示大小,可依据实际需求设定,对此本公开实施例不作限制。The size of the range of the first designated area and the display size of the single-person posture in the first designated area may be set according to actual requirements, which are not limited in this embodiment of the present disclosure.

在本公开实施例中,能够有效引导对象完成单人姿态的拍摄,提升拍摄体验,并且推荐人体姿态可固定显示、可跟随对象显示,能够满足不同的姿态显示需求。In the embodiment of the present disclosure, the object can be effectively guided to complete the shooting of the single-person posture, and the shooting experience can be improved, and the recommended human posture can be displayed fixedly and can be displayed with the object, which can meet different posture display requirements.

如上所述,推荐人体姿态包括多人组合姿态,在一种可能的实现方式中,所述方法还包括:As mentioned above, the recommended human body posture includes the combined posture of multiple people. In a possible implementation manner, the method further includes:

根据场景图像中多个对象之间的相对位置,以及多人组合姿态中各个姿态之间的相对位置,从多人组合姿态中分别确定各对象的对应姿态。通过该方式,能够有效确定出场景图像中各个对象在多人组合姿态中分别对应的姿态,从而便于引导各个对象完成多个组合姿态。According to the relative positions between the multiple objects in the scene image and the relative positions between the respective gestures in the multiple-person combined gestures, the corresponding gestures of each object are respectively determined from the multiple-person combined gestures. In this way, the corresponding postures of each object in the scene image in the multi-person combined posture can be effectively determined, so that it is convenient to guide each object to complete a plurality of combined postures.

如上所述,场景图像中可包括一个或多个对象。对场景图像进行人体关键点检测,可包括:对场景图像进行人体检测,得到场景图像中的对象以及对象的人体区域;进而对人体区域内的对象进行人体关键点检测。应理解的是,人体检测可得到场景图像中是否包含对象、包含的对象的数量以及对象在场景图像中的位置(也即人体区域在场景图像中的位置)等信息。As mentioned above, one or more objects may be included in the scene image. Performing human body key point detection on the scene image may include: performing human body detection on the scene image to obtain the object in the scene image and the human body area of the object; and then performing human key point detection on the object in the human body area. It should be understood that the human body detection can obtain information such as whether an object is included in the scene image, the number of objects included, and the position of the object in the scene image (ie, the position of the human body region in the scene image).

针对推荐人体姿态为多人组合姿态的情况,在一种可能的实现方式中,若场景图像中对象的数量,小于实现多人组合姿态所需的对象数量,可例如通过语音或文字等形式,提示用户人数不够实现多人组合姿态,从而引导增加被拍摄对象的数量;或者从多人组合姿态中选取与场景图像中对象的数量相符的人体姿态,并基于选取的人体姿态确定对象的对应姿态。For the case where the recommended human pose is a multi-person combined pose, in a possible implementation manner, if the number of objects in the scene image is less than the number of objects required to realize the multi-person combined pose, for example, through voice or text, etc. Prompt the user that the number of people is not enough to achieve a multi-person combined posture, thereby guiding the increase of the number of objects to be photographed; or select a human posture that matches the number of objects in the scene image from the multi-person combined posture, and determine the corresponding posture of the object based on the selected human posture .

其中,从多人组合姿态中选取与场景图像中对象的数量相符的人体姿态,例如,可随机选取与对象的数量相符的人体姿态,也可依据预设的选取策略,选取与对象的数量相符的人体姿态,选取策略例如可包括按照从左到右、从上到下的顺序选取,对此本公开实施例不作限制。Among them, the human body posture that matches the number of objects in the scene image is selected from the combined postures of multiple people. For example, the human body posture that matches the number of objects can be randomly selected, or the human body posture that matches the number of objects can be selected according to a preset selection strategy. The selection strategy may include, for example, selection in order from left to right and from top to bottom, which is not limited to this embodiment of the present disclosure.

应理解的是,若选取的人体姿态为一个,则可直接将选取的人体姿态作为对象的对应姿态;若选取的人体姿态为多个,可按照多个选取的人体姿态之间的相对位置,确定各对象的对应姿态。It should be understood that if the selected human body posture is one, the selected human body posture can be directly used as the corresponding posture of the object; if there are multiple selected human body postures, according to the relative positions between the multiple selected human body postures, Determine the corresponding pose of each object.

若场景图像中对象的数量,大于或等于实现多人组合姿态所需的对象数量,可根据场景图像中多个对象之间的相对位置,以及多人组合姿态中各个姿态之间的相对位置,从多人组合姿态中分别确定各对象的对应姿态。其中,相对位置可例如包括前后位置、左右位置、上下位置等,对此本公开实施例不作限制。If the number of objects in the scene image is greater than or equal to the number of objects required to realize the multi-person combined gesture, the relative positions of the multiple objects in the scene image and the relative positions of the various gestures in the multi-person combined gesture can be determined. The corresponding poses of each object are determined from the combined poses of multiple people. Wherein, the relative position may include, for example, a front-rear position, a left-right position, an up-down position, and the like, which is not limited by the embodiment of the present disclosure.

举例来说,图6示出根据本公开实施例的场景图像的示意图。如图6所示,场景图像中包含对象A、对象B、对象C,当前的推荐人体姿态例如为图3示出的多人组合姿态(即“比心”姿态)。那么根据对象A、对象B、对象C之间的相对位置,以及多人组合姿态中姿态m及姿态n之间的相对位置,可得到对象A与对象B分别与姿态m及姿态n对应,也即确定出对象A的对应姿态为“姿态m”,对象B的对应姿态为“姿态n”,可理解为,对象A与对象B之间的相对位置,更适合完成图3示出的多人组合姿态。For example, FIG. 6 shows a schematic diagram of a scene image according to an embodiment of the present disclosure. As shown in FIG. 6 , the scene image includes object A, object B, and object C, and the current recommended human pose is, for example, the pose of a group of people shown in FIG. 3 (ie, the "comparison" pose). Then, according to the relative positions of object A, object B, and object C, as well as the relative positions of pose m and pose n in the combined pose of multiple people, it can be obtained that object A and object B correspond to pose m and pose n respectively, and also That is, it is determined that the corresponding posture of object A is "pose m", and the corresponding posture of object B is "pose n". It can be understood that the relative position between object A and object B is more suitable for completing the multi-person shown in FIG. Combining gestures.

在推荐人体姿态为多人组合姿态,并从多人组合姿态中分别确定出各对象的对应姿态的情况下,在一种可能的实现方式中,在步骤S14中,在拍摄界面中显示真实人体姿态以及推荐人体姿态,包括:In the case where the recommended human body posture is a multi-person combined posture, and the corresponding posture of each object is determined from the multi-person combined posture, in a possible implementation manner, in step S14, a real human body is displayed in the shooting interface Posture and recommended human posture, including:

针对任一对象,根据对象在场景图像中的位置,在拍摄界面中显示对象的真实人体姿态以及对象的对应姿态;或,根据对象在场景图像中的位置,在拍摄界面中显示对象的真实人体姿态,并在拍摄界面的第二指定区域处显示对象的对应姿态。For any object, according to the position of the object in the scene image, the real human body posture of the object and the corresponding posture of the object are displayed in the shooting interface; or, according to the position of the object in the scene image, the real human body of the object is displayed in the shooting interface. gesture, and the corresponding gesture of the object is displayed in the second designated area of the shooting interface.

其中,对于多个对象中的任一对象,根据该对象在场景图像中的位置,在拍摄界面中显示该对象的真实人体姿态,可参照上述本公开实施例中公开的内容。Wherein, for any object among the multiple objects, according to the position of the object in the scene image, the real human body posture of the object is displayed in the shooting interface, and reference may be made to the content disclosed in the above embodiments of the present disclosure.

在一种可能的实现方式中,对于多个对象中的任一对象,根据该对象在场景图像中的位置,在拍摄界面中显示该对象的对应姿态,可与上述本公开实施例中根据对象在场景图像中的位置在拍摄界面中显示单人姿态的实现方式相同,对此本公开实施例不作限制。通过该方式,能够针对多个对象中的各个对象,实现各个对象的对应姿态跟随各个对象的位置显示,从而便于引导各个对象比较各自的对应姿态与真实人体姿态,以进行姿态调整。In a possible implementation manner, for any object among the multiple objects, according to the position of the object in the scene image, the corresponding posture of the object is displayed in the shooting interface, which can be the same as the method according to the object in the above-mentioned embodiment of the present disclosure. The implementation manner of displaying the single-person posture in the shooting interface at the position in the scene image is the same, which is not limited by the embodiment of the present disclosure. In this way, for each of the multiple objects, the corresponding posture of each object can be displayed following the position of each object, so as to facilitate guiding each object to compare the corresponding posture with the real human body posture for posture adjustment.

在一种可能的实现方式中,拍摄界面中的第二指定区域,可根据实际需求设定,例如,可是拍摄界面的中间区域、左侧区域、右侧区域、四个顶点区域等,对此本公开实施例不作限制。通过在拍摄界面的第二指定区域处显示对象的对应姿态,能够将推荐人体姿态固定显示在拍摄界面中,节省绘制推荐人体姿态所需的运算资源。In a possible implementation manner, the second designated area in the shooting interface can be set according to actual needs, for example, it can be the middle area, left area, right area, four vertex areas, etc. of the shooting interface. The embodiments of the present disclosure are not limited. By displaying the corresponding posture of the object in the second designated area of the photographing interface, the recommended human body posture can be fixedly displayed in the photographing interface, thereby saving computing resources required for drawing the recommended human body posture.

应理解的是,第一指定区域与第二指定区域可不同、可相同;第二指定区域可包括多个区域,可在多个区域分别显示多人组合姿态中各个姿态;第二指定区域也可包括一个区域,在该一个区域 内显示该多人组合姿态,对此本公开实施例不作限制。图7示出根据本公开实施例的拍摄界面的示意图,如图7所示,可在第二指定区域处显示多人组合姿态,并根据各个对象在场景图像中的位置,在拍摄界面中显示各个对象的真实人体姿态。It should be understood that the first designated area and the second designated area may be different or the same; the second designated area may include multiple areas, and each gesture in the combined gesture of multiple people may be displayed in the multiple areas; the second designated area also It may include an area in which the multi-person combined gesture is displayed, which is not limited in this embodiment of the present disclosure. FIG. 7 shows a schematic diagram of a shooting interface according to an embodiment of the present disclosure. As shown in FIG. 7 , a combined pose of multiple people can be displayed at a second designated area, and according to the position of each object in the scene image, it can be displayed in the shooting interface. The real human pose of each object.

其中,对于第二指定区域的范围大小,对应姿态在第二指定区域内的显示大小,可依据实际需求设定,对此本公开实施例不作限制。The size of the range of the second designated area and the display size of the corresponding posture in the second designated area may be set according to actual needs, which is not limited by the embodiment of the present disclosure.

在本公开实施例中,能够有效引导多个对象共同完成多人组合姿态,提升拍摄体验,并且推荐人体姿态可固定显示、可跟随对象显示,能够满足不同的姿态显示需求。In the embodiment of the present disclosure, multiple objects can be effectively guided to jointly complete a multi-person combined pose, improving the shooting experience, and the recommended human pose can be displayed in a fixed manner and can be displayed with the object, which can meet different pose display requirements.

考虑到,对象在调整姿态时,可能存在部分人体部位的姿态,接近推荐人体姿态,而部分人体部位的姿态并不接近推荐人体姿态的情况,例如,双臂的姿态与推荐人体姿态接近(也即相似度高),双腿的姿态与推荐人体姿态不接近(也即相似度低)。Considering that when the object adjusts the posture, there may be some human body postures that are close to the recommended human posture, while the postures of some human body parts are not close to the recommended human posture, for example, the posture of the arms is close to the recommended human posture (also That is, the similarity is high), and the posture of the legs is not close to the recommended human posture (that is, the similarity is low).

为更有效地引导用户调整姿态,可对显示的真实人体姿态中相似度低的人体部位进行突出显示。在一种可能的实现方式中,所述方法还包括:In order to guide the user to adjust the posture more effectively, the body parts with low similarity among the displayed real body postures can be highlighted. In a possible implementation, the method further includes:

确定真实人体姿态的第一人体关键点,与推荐人体姿态的第二人体关键点之间的关键点对;通过计算关键点对的第一相似度,从第一人体关键点中确定出第一相似度小于第一预设阈值的第三人体关键点;Determine the key point pair between the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; determine the first human body key point from the first human body key point by calculating the first similarity of the key point pair a third human body key point whose similarity is less than the first preset threshold;

其中,在步骤S13中,在拍摄界面中显示真实人体姿态以及推荐人体姿态,包括:Wherein, in step S13, the real human body posture and the recommended human body posture are displayed in the shooting interface, including:

突出显示第三人体关键点所在的区域,其中,突出显示的方式包括高亮、加粗、变更颜色中的至少一种。The region where the third human body key point is located is highlighted, and the highlighting method includes at least one of highlighting, bolding, and changing color.

在一种可能的实现方式中,为便于比较真实人体姿态与推荐人体姿态,真实人体姿态的第一人体关键点与推荐人体姿态的第二人体关键点可是对应的,例如,真实人体姿态的第一人体关键点包括20个人体关节部位关键点,推荐人体姿态的第二人体关键点也对应包含20个人体关节部位关键点。基于此,可从真实人体姿态的第一人体关键点与推荐人体姿态的第二人体关键点中,确定出关键点对,例如,第一人体关键点中的左肩关键点,与第二人体关键点中的左肩关键点是一对关键点对,20个人体关节部位关键点,可确定出20个关键点对。In a possible implementation manner, in order to facilitate the comparison between the real human body posture and the recommended human body posture, the first human body key point of the real human body posture and the second human body key point of the recommended human body posture may correspond. For example, the first human body key point of the real human body posture One human body key point includes 20 human body joint key points, and the second human body key point for the recommended human posture also includes 20 human body joint key points correspondingly. Based on this, key point pairs can be determined from the first human body key point of the real human body posture and the second human body key point of the recommended human body posture, for example, the left shoulder key point in the first human body key point, and the second human body key point. The left shoulder key point in the points is a pair of key point pairs, 20 key points of human body joints, and 20 key point pairs can be determined.

在一种可能的实现方式中,真实人体姿态的第一人体关键点可带有标识,推荐人体姿态的第二人体关键点也可带有标识。应理解的是,标识可用于指示人体不同关节部位或不同轮廓位置上的关键点,通过两个姿态上带有的相同/相似标识,可便于确定出真实人体姿态的第一人体关键点,与推荐人体姿态的第二人体关键点之间的关键点对。In a possible implementation manner, the first human body key point of the real human body posture may carry a mark, and the second human body key point of the recommended human body posture may also carry a mark. It should be understood that the marks can be used to indicate key points on different joint parts or different contour positions of the human body, and through the same/similar marks on the two postures, it is easy to determine the first human key point of the real human body posture, which is the same as that of the human body. Keypoint pairs between the second human keypoints for the recommended human pose.

需要说明的是,以上通过标识确定关键点对的方式,本公开实施例公开的一种实现方式,实际上,本领域技术人员可采用任何已知的方式,确定出第一人体关键点与第二人体关键点之间的关键点对,对此本公开实施例不作限制。It should be noted that the above method of determining key point pairs through identification is an implementation method disclosed in the embodiments of the present disclosure. In fact, those skilled in the art can use any known method to determine the first human body key point and the first human body key point. The key point pair between two human body key points is not limited in this embodiment of the present disclosure.

在一种可能的实现方式中,关键点对的第一相似度,可用关键点对中两个关键点之间的距离(如欧氏距离、余弦距离等)表征,也可用距离的偏差表征,对此本公开实施例不作限制。其中,可设置第一相似度与距离或与距离的偏差负相关,也即,距离越小或距离的偏差越小,第一相似度越高。应理解的是,关键点对的数量,与第一相似度的数量一致,也即各关键点对分别对应各自的第一相似度。In a possible implementation manner, the first similarity of a keypoint pair can be represented by the distance (such as Euclidean distance, cosine distance, etc.) between two keypoints in the keypoint pair, or by the deviation of the distance, This embodiment of the present disclosure is not limited. The first similarity may be set to be negatively correlated with the distance or the deviation from the distance, that is, the smaller the distance or the smaller the deviation of the distance, the higher the first similarity. It should be understood that the number of key point pairs is consistent with the number of first degrees of similarity, that is, each key point pair corresponds to its respective first degree of similarity.

其中,距离的偏差,可理解为,任一关键点对的距离,与全部关键对的距离的平均值之间的差值,例如,20个关键点对的距离的平均值为X,关键点对a的距离为x,则关键点对a的距离的偏差为“x-X”,也即可用“x-X”表征关键点对a的第一相似度。Among them, the deviation of the distance can be understood as the difference between the distance of any key point pair and the average value of the distances of all key point pairs. For example, the average value of the distances of 20 key point pairs is X, and the key point The distance to a is x, then the deviation of the distance of the key point to a is "x-X", that is, "x-X" can be used to represent the first similarity of the key point to a.

在一种可能的实现方式中,可根据推荐人体姿态的显示方式:跟随对象显示或固定显示,选择用距离,还是用距离的偏差来表征第一相似度。例如,在推荐人体姿态是跟随对象显示的情况下,可使用距离表征第一相似度;在推荐人体姿态是固定显示在拍摄界面中的情况下,可用距离的偏差表征第一相似度。In a possible implementation manner, according to the display manner of the recommended human body posture: following the object display or fixed display, choose to use the distance or the deviation of the distance to represent the first similarity. For example, when the recommended human posture is displayed following the object, the distance can be used to represent the first similarity; when the recommended human posture is fixedly displayed in the photographing interface, the deviation of the distance can be used to represent the first similarity.

应理解的是,在推荐人体姿态是固定显示在拍摄界面中的情况下,对象在场景图像中的位置是会变化的,用距离可能无法准确表征关键点对的第一相似度,这是因为任一关键点对的距离远,并不意味着该关键点对不相似,可能是真实人体姿态与推荐人体姿态整体距离远,此时采用距离的偏差,能够较准确地反映各关键点对的第一相似度。It should be understood that in the case where the recommended human pose is fixed and displayed in the shooting interface, the position of the object in the scene image will change, and the distance may not be able to accurately characterize the first similarity of the key point pair. This is because The long distance of any key point pair does not mean that the key point pair is not similar, it may be that the overall distance between the real human body posture and the recommended human body posture is far away. first similarity.

在一种可能的实现方式中,第一预设阈值可根据实际需求、第一相似度的计算方式等确定,对此本公开实施例不作限制。第一相似度小于第一预设阈值,可意味着该第一相似度所对应的关键点对的相似程度低,也即,该关键点对表征的人体部位的部分姿态与推荐人体姿态相似度低;反之,第一相似度大于或等于第一预设阈值,可意味着该第一相似度对应的关键点对的相似程度高,也即, 该关键点对表征的人体部位的部分姿态与推荐人体姿态相似度高。In a possible implementation manner, the first preset threshold may be determined according to an actual requirement, a calculation method of the first similarity, and the like, which is not limited in this embodiment of the present disclosure. The first similarity is less than the first preset threshold, which means that the similarity of the key point pair corresponding to the first similarity is low, that is, the similarity between the partial posture of the human body part represented by the key point pair and the recommended human posture On the contrary, if the first similarity degree is greater than or equal to the first preset threshold, it may mean that the similarity degree of the key point pair corresponding to the first similarity degree is high, that is, the partial posture of the human body part represented by the key point pair is the same as It is recommended that the human pose has a high similarity.

在一种可能的实现方式中,通过计算关键点对的第一相似度,从第一人体关键点中确定出第一相似度小于第一预设阈值的第三人体关键点,可包括:通过计算关键点对的第一相似度,从关键点对确定出第一相似度小于第一预设阈值的目标关键点对,将目标关键点对中的第一人体关键点作为第三人体关键点,也即实现从第一人体关键点中确定出第一相似度小于第一预设阈值的第三人体关键点。In a possible implementation manner, determining a third human body key point whose first similarity is less than a first preset threshold from the first human body key points by calculating the first similarity of the key point pair, may include: by Calculate the first similarity of the key point pair, determine the target key point pair whose first similarity is less than the first preset threshold from the key point pair, and use the first human body key point in the target key point pair as the third human body key point , that is, the third human body key point whose first similarity is less than the first preset threshold is determined from the first human body key point.

在一种可能的实现方式中,突出显示第三人体关键点所在的区域,可理解为,突出显示该第三人体关键点在拍摄界面中所占的区域。举例来说,如图4示出的真实人体姿态中用实心圆表征人体关节部位,那么实心圆的所占区域可是第三人体关键点所在的区域。In a possible implementation manner, highlighting the area where the third human body key point is located may be understood as highlighting the area occupied by the third human body key point in the shooting interface. For example, in the real human body pose shown in FIG. 4 , a solid circle is used to represent the joint parts of the human body, then the area occupied by the solid circle may be the area where the third human body key point is located.

应理解的是,突出显示的方式包括高亮、加粗、变更颜色中的至少一种,是本公开实施例提供的一些实现方式,实际上,本领域技术人员可根据实际需求设计不同的突出显示的方式,对此本公开实施例不作限制。其中,变更颜色,例如将用绿色显示的真实人体姿态中的第三人体关键点变为红色,对此本公开实施例不作限制。It should be understood that the highlighting manner includes at least one of highlighting, bolding, and changing color, which are some implementation manners provided by the embodiments of the present disclosure. In fact, those skilled in the art can design different highlighting according to actual needs. The manner of display is not limited to this embodiment of the present disclosure. The color is changed, for example, the third human body key point in the real human body posture displayed in green is changed to red, which is not limited in this embodiment of the present disclosure.

图8示出根据本公开实施例的一种拍摄界面的示意图,如图8所示,可通过加粗的方式表征相似度较低的膝关节部位的关键点80,其中,还可对该膝关节部位的关键点80所对应的连接线81与82(也即表征腿部的连接线)进行加粗,对此本公开实施例不作限制。FIG. 8 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure. As shown in FIG. 8 , the key points 80 of the knee joint parts with lower similarity may be represented in a bold manner. The connection lines 81 and 82 corresponding to the key points 80 of the joint parts (that is, the connection lines representing the legs) are bolded, which is not limited by the embodiment of the present disclosure.

在本公开实施例中,能够对显示的真实人体姿态中相似度低的人体部位,进行突出显示,从而更有效地引导用户调整姿态,以提升拍摄体验。In the embodiment of the present disclosure, the parts of the human body with low similarity among the displayed real human body postures can be highlighted, so as to more effectively guide the user to adjust the posture, so as to improve the shooting experience.

如上文所述,可在拍摄界面中显示真实人体姿态与推荐人体姿态之间的相似度,以引导对象进行姿态调整。在一种可能的实现方式中,所述方法还包括:As described above, the similarity between the real human body posture and the recommended human body posture can be displayed in the shooting interface to guide the subject to adjust the posture. In a possible implementation, the method further includes:

根据真实人体姿态与推荐人体姿态之间的关键点对的第一相似度,确定真实人体姿态与推荐人体姿态之间的第二相似度,关键点对是根据真实人体姿态的第一人体关键点与推荐人体姿态的第二人体关键点确定的;Determine the second similarity between the real human body posture and the recommended human body posture according to the first similarity of the key point pair between the real human body posture and the recommended human body posture, and the key point pair is the first human body key point according to the real human body posture Determined with the second human key point of the recommended human posture;

在拍摄界面中显示第二相似度。The second similarity is displayed in the shooting interface.

其中,关键点对的确定方式,可参照上述本公开实施例。For the way of determining the key point pair, reference may be made to the above embodiments of the present disclosure.

如上所述,关键点对的第一相似度可用关键点对的距离或距离的偏差表征。在一种可能的实现方式中,根据关键点对的第一相似度,确定真实人体姿态与推荐人体姿态之间的第二相似度,可包括:根据全部关键点对的距离的累加值、平均值、方差或标准差,确定第二相似度,对此本公开实施例不作限制。As described above, the first similarity of the keypoint pair can be characterized by the distance of the keypoint pair or the deviation of the distance. In a possible implementation manner, determining the second similarity between the real human body posture and the recommended human posture according to the first similarity of the key point pairs may include: according to the accumulated value of the distances of all key point pairs, the average value, variance, or standard deviation to determine the second similarity, which is not limited in this embodiment of the present disclosure.

其中,距离的累加值、平均值、方差或标准差,可与第二相似度成负相关,也即,距离的累加值、平均值、方差或标准差等越小,代表第二相似度越高;反之,距离的累加值、平均值、方差或标准差等越大,代表第二相似度越低。应理解的是,第二相似度可反映真实人体姿态与推荐人体姿态之间整体的相似程度。Among them, the accumulated value, average value, variance or standard deviation of the distance may be negatively correlated with the second similarity, that is, the smaller the accumulated value, mean, variance or standard deviation of the distance, etc., the higher the second similarity is. On the contrary, the larger the cumulative value, mean, variance or standard deviation of the distance, the lower the second similarity. It should be understood that the second similarity may reflect the overall similarity between the real human body posture and the recommended human body posture.

在一种可能的实现方式中,为便于对象理解第二相似度的高低,或者说理解相似程度的高低,第二相似度可用百分比的形式显示在拍摄界面中。基于此,可设置距离的累加值、平均值、方差或标准差,与第二相似度之间存在映射关系,从而根据该映射关系,将距离的累加值、平均值、方差或标准差,映射成百分比形式下的第二相似度。In a possible implementation manner, in order for the object to understand the level of the second similarity, or to understand the level of the similarity, the second similarity may be displayed in the shooting interface in the form of a percentage. Based on this, the cumulative value, average value, variance or standard deviation of the distance can be set, and there is a mapping relationship with the second similarity, so that according to the mapping relationship, the cumulative value, average value, variance or standard deviation of the distance can be mapped to The second similarity in percentage form.

其中,本领域技术人员可采用任何已知的方式,设置距离的累加值、平均值、方差或标准差,与第二相似度之间的映射关系,对此本公开实施例不作限制。Wherein, those skilled in the art may use any known manner to set the mapping relationship between the accumulated value, average value, variance or standard deviation of the distance and the second similarity, which is not limited in this embodiment of the present disclosure.

在一种可能的实现方式中,在拍摄界面中显示第二相似度,例如可通过图5、图8示出的方式,显示第二相似度。应理解的是,本领域技术人员可根据实际需求设计第二相似度的显示方式,对此本公开实施例不作限制。In a possible implementation manner, the second degree of similarity is displayed in the photographing interface, for example, the second degree of similarity may be displayed in the manner shown in FIG. 5 and FIG. 8 . It should be understood that a person skilled in the art can design a display manner of the second similarity according to actual requirements, which is not limited by this embodiment of the present disclosure.

在一种可能的实现方式中,还可通过神经网络得到真实人体姿态与推荐人体姿态之间的第二相似度,也即可将真实人体姿态对应的第一人体关键点与推荐人体姿态对应的第二人体关键点输入至神经网络中,输出第二相似度。其中,第一人体关键点与第二人体关键点可采用向量或矩阵的形式,输入至神经网络中。对于该神经网络的网络类型、网络结构以及训练方式,本公开实施例不作限制。In a possible implementation manner, the second similarity between the real human body posture and the recommended human body posture can also be obtained through a neural network, that is, the first human body key point corresponding to the real human body posture and the recommended human body posture can be obtained. The second human body key point is input into the neural network, and the second similarity is output. The first human body key point and the second human body key point can be input into the neural network in the form of a vector or a matrix. The embodiments of the present disclosure do not limit the network type, network structure, and training method of the neural network.

在本公开实施例中,能够显示真实人体姿态与推荐人体姿态之间整体的相似程度,以有效引导用户根据整体的相似度调整姿态,提升拍摄体验。In the embodiment of the present disclosure, the overall similarity between the real human body posture and the recommended human posture can be displayed, so as to effectively guide the user to adjust the posture according to the overall similarity, and improve the shooting experience.

如上所述,在步骤S15中,在满足拍摄条件的情况下,得到对象的拍摄图像。在一种可能的实现方式中,拍摄条件可包括真实人体姿态与所述推荐人体姿态之间的第二相似度大于或等于第二预设阈值。通过该方式,可实现在真实人体姿态与推荐人体姿态相似度较高的情况下,自动触发拍摄操 作,得到拍摄图像并保存本地相册,以供对象查看。As described above, in step S15, when the photographing conditions are satisfied, a photographed image of the subject is obtained. In a possible implementation manner, the shooting conditions may include that the second similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold. In this way, the shooting operation can be automatically triggered when the real human body posture is highly similar to the recommended human body posture, and the shooting image can be obtained and saved in a local album for the object to view.

在一种可能的实现方式中,在所述图像拍摄方法由服务器执行的情况下,服务器可在真实人体姿态与所述推荐人体姿态之间的第二相似度大于或等于第二预设阈值的情况下,向终端设备发送拍摄指令,通过拍摄指令触发终端设备进行拍摄操作,得到拍摄图像。在所述图像拍摄方法由终端设备执行的情况下,终端设备可在真实人体姿态与所述推荐人体姿态之间的第二相似度大于或等于第二预设阈值的情况下,自动触发拍摄操作,得到拍摄图像。In a possible implementation manner, when the image capturing method is executed by the server, the server may determine that the second similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold. In this case, a shooting instruction is sent to the terminal device, and the terminal device is triggered to perform a shooting operation through the shooting instruction to obtain a shot image. In the case where the image capturing method is performed by a terminal device, the terminal device may automatically trigger a capturing operation when the second similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold , to get the captured image.

其中,第二预设阈值,可根据实际需求设定,例如,可设置为80%,对此本公开实施例不作限制。应理解的是,拍摄条件也可包括拍摄界面中的拍摄控件被触发,也即,拍摄者手动进行拍照操作。The second preset threshold may be set according to actual requirements, for example, may be set to 80%, which is not limited in this embodiment of the present disclosure. It should be understood that the shooting conditions may also include that the shooting controls in the shooting interface are triggered, that is, the photographer manually performs the shooting operation.

考虑到,若对象在实际场景中有较好的站位,或者说对象在场景图像中所在的位置较为合适,能够使拍摄图像呈现出较好的视觉效果,例如,构图上更加协调、悦目。在一种可能的实现方式中,所述方法还包括:It is considered that if the object has a better position in the actual scene, or the object is in a suitable position in the scene image, the captured image can present a better visual effect, for example, the composition is more harmonious and pleasing to the eye. In a possible implementation, the method further includes:

将场景图像输入至区域推荐网络,得到场景图像的推荐拍照区域,推荐拍照区域用于表征在场景图像中推荐拍照的区域,区域推荐网络是通过标注拍照区域的第一样本集训练得到的神经网络;Input the scene image to the regional recommendation network to obtain the recommended photographing area of the scene image. The recommended photographing area is used to represent the recommended photographing area in the scene image. The area recommendation network is a neural network obtained by training the first sample set of marking the photographing area. network;

其中,得到场景图像的推荐拍照区域之后,所述方法还包括以下至少一种:Wherein, after obtaining the recommended photographing area of the scene image, the method further includes at least one of the following:

在拍摄界面中,采用标识符指示推荐拍照区域;In the shooting interface, an identifier is used to indicate the recommended shooting area;

在拍摄界面的推荐拍照区域处显示推荐人体姿态,推荐拍照区域用于引导对象根据推荐拍照区域进行位置调整。The recommended human posture is displayed in the recommended photographing area of the photographing interface, and the recommended photographing area is used to guide the object to adjust the position according to the recommended photographing area.

其中,区域推荐网络可采用已知的神经网络,例如,卷积神经网络等,对于区域推荐网络的网络类型、网络结构以及训练方式,本公开实施例不作限制。The region recommendation network may adopt a known neural network, such as a convolutional neural network, etc. The embodiment of the present disclosure does not limit the network type, network structure, and training method of the region recommendation network.

在一种可能的实现方式中,第一样本集的样本图像可是视觉效果较优的图像。通过从视觉效果较优的第一样本集中学习推荐拍照区域,可引导对象调整位置来提升拍摄图像的视觉效果,从而提升用户拍摄体验。In a possible implementation manner, the sample images of the first sample set may be images with better visual effects. By learning the recommended photographing area from the first sample set with better visual effect, the object can be guided to adjust the position to improve the visual effect of the photographed image, thereby improving the user's photographing experience.

其中,第一样本集的样本图像中人体的所在区域,可是标注的拍照区域。应理解的是,可采用已知的标注技术,实现对第一样本集的样本图像中人体区域的标注,对此本公开实施例不作限制。The region where the human body is located in the sample images of the first sample set may be the marked photographing region. It should be understood that, a known labeling technology may be used to realize labeling of the human body region in the sample image of the first sample set, which is not limited in this embodiment of the present disclosure.

在一种可能的实现方式中,用于指示推荐拍照区域的标识符,可采用字符和/或图形等任何形式显示在拍摄界面中,从而实现引导用户对象根据推荐拍照区域进行位置调整,对此本公开实施例不作限制。图9示出根据本公开实施例的一种拍摄界面的示意图,如图9所示,可通过图形、提示语“推荐站位”等形式指示推荐拍照区域。In a possible implementation manner, the identifier used to indicate the recommended photographing area may be displayed in the photographing interface in any form such as characters and/or graphics, so as to guide the user object to adjust the position according to the recommended photographing area. The embodiments of the present disclosure are not limited. FIG. 9 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure. As shown in FIG. 9 , a recommended photographing area may be indicated in the form of graphics, prompt words "recommended station", and the like.

如上文所述,推荐人体姿态可固定显示在拍摄界面的指定区域(上述第一指定区域或第二指定区域)中。在一种可能的实现方式中,可在拍摄界面的推荐拍照区域处显示推荐人体姿态,换句话说,可用推荐人体姿态指示推荐拍照区域。应理解的是,上述第一指定区域或第二指定区域可包括推荐拍照区域。通过该方式,能够有效地引导对象根据推荐拍照区域进行位置调整,也便于对象根据推荐人体姿态进行姿态调整。As described above, the recommended human posture may be displayed in a fixed area (the first designated area or the second designated area) of the photographing interface. In a possible implementation manner, the recommended human body posture may be displayed in the recommended photographing area of the photographing interface, in other words, the recommended photographing area may be indicated by the recommended human body posture. It should be understood that the above-mentioned first designated area or second designated area may include a recommended photographing area. In this way, the object can be effectively guided to adjust the position according to the recommended photographing area, and it is also convenient for the object to adjust the posture according to the recommended human body posture.

应理解的是,可同时采用标识符及推荐人体姿态指示推荐拍照区域,也可仅采用标识符或推荐人体姿态指示推荐拍照区域,对此本公开实施例不作限制。It should be understood that the recommended photographing area may be indicated by both the identifier and the recommended human posture, or the recommended photographing area may be indicated by only the identifier or the recommended human posture, which is not limited by the embodiment of the present disclosure.

在一种可能的实现方式中,在采用标识符指示推荐拍照区域,且检测到场景图像中对象处于推荐拍照区域内的情况下,可隐藏该标识符,从而可使拍摄界面更简洁。其中,判断对象是否处于推荐拍照区域内,可根据场景图像中对象的人体区域的位置确定,也可根据人体关键点的位置坐标确定,对此本公开实施例不作限制。In a possible implementation, when an identifier is used to indicate the recommended photographing area, and it is detected that the object in the scene image is within the recommended photographing area, the identifier can be hidden, thereby making the photographing interface more concise. Wherein, determining whether the object is in the recommended photographing area may be determined according to the position of the object's human body area in the scene image, or may be determined according to the position coordinates of the key points of the human body, which is not limited in this embodiment of the present disclosure.

在本公开实施例中,能够向对象推荐视觉效果较好的拍照区域,有效引导对象调整位置来提升拍摄图像的视觉效果,从而提升用户拍摄体验。In the embodiment of the present disclosure, a photographing area with better visual effect can be recommended to the object, and the object can be effectively guided to adjust the position to improve the visual effect of the photographed image, thereby improving the user's photographing experience.

如上所述,可通过姿态推荐网络对场景图像进行识别,确定场景图像对应的场景类别以及与场景类别对应的推荐人体姿态。在一种可能的实现方式中,在步骤S12中,对场景图像进行识别,确定场景图像对应的场景类别以及与场景类别对应的推荐人体姿态,包括:As described above, the scene image can be recognized by the gesture recommendation network, and the scene category corresponding to the scene image and the recommended human pose corresponding to the scene category can be determined. In a possible implementation manner, in step S12, the scene image is identified, and the scene category corresponding to the scene image and the recommended human posture corresponding to the scene category are determined, including:

通过姿态推荐网络对场景图像进行识别,确定场景图像对应的场景类别以及与场景类别对应的推荐人体姿态,其中,姿态推荐网络是通过标注有样本场景类别以及样本人体姿态的第二样本集训练得到的神经网络,第二样本集包括对象上传的样本图像。The scene image is recognized by the gesture recommendation network, and the scene category corresponding to the scene image and the recommended human pose corresponding to the scene category are determined. The gesture recommendation network is obtained by training the second sample set marked with the sample scene category and the sample human pose. The neural network of the second sample set includes sample images uploaded by the object.

其中,姿态推荐网络可采用已知的神经网络,例如,卷积神经网络、残差神经网络等。如上所述,对于姿态推荐网络的网络类型、网络结构以及训练方式等,本公开实施例不作限制。Wherein, the pose recommendation network may adopt a known neural network, for example, a convolutional neural network, a residual neural network, and the like. As described above, the embodiments of the present disclosure do not limit the network type, network structure, training method, etc. of the gesture recommendation network.

如上所述,可采用自监督的训练方式,使姿态推荐网络学习不同场景类型下的推荐人体姿态, 或者说,学习场景类型与推荐人体姿态之间的对应关系,从而可利用该姿态推荐网络基于场景图像输出推荐人体姿态。As mentioned above, the self-supervised training method can be used to make the posture recommendation network learn the recommended human posture under different scene types, or, in other words, learn the correspondence between the scene type and the recommended human posture, so that the posture recommendation network can be used based on Scene image output recommends human pose.

应理解的是,第一样本集与第二样本集可是相同的样本集,也可是不同的样本集,对于第一样本集与第二样本集中样本图像的来源,本公开实施例不作限制。It should be understood that the first sample set and the second sample set may be the same sample set or different sample sets, and the source of the sample images in the first sample set and the second sample set is not limited in the embodiment of the present disclosure. .

其中,可采用已知的标注技术,实现对第二样本集的样本图像中样本场景类别以及样本人体姿态的标注,对此本公开实施例不作限制。Wherein, a known labeling technology can be used to realize labeling of the sample scene category and the sample human body posture in the sample images of the second sample set, which is not limited by the embodiment of the present disclosure.

在一种可能的实现方式中,实现图像拍摄方法的应用程序或实现图像拍摄方法的拍摄模式中,还可提供用于上传样本图像的上传功能,该样本图像可以是用户已拍摄图像中效果较好的图像。通过该方式,能够丰富用于训练姿态推荐网络的第二样本集,使训练得到的姿态推荐网络,可更准确地基于场景图像输出推荐人体姿态。In a possible implementation manner, an application program for realizing the image capturing method or a capturing mode for realizing the image capturing method may also provide an upload function for uploading a sample image, and the sample image may be an image that has been captured by the user with a relatively low effect. good image. In this way, the second sample set used for training the pose recommendation network can be enriched, so that the trained pose recommendation network can output the recommended human pose based on the scene image more accurately.

应理解的是,用户上传的样本图像可是用户授权进入后台训练姿态推荐网络的图像,这样用户上传的样本图像仅对后台可见,其他用户看到的均为处理过的推荐人体姿态及对应的姿态图(姿态图中例如隐去人脸或者替换了人脸或替换人物等),从而保障了用户隐私安全,其中,姿态图可用于选择不同的推荐人体姿态。It should be understood that the sample image uploaded by the user is the image that the user authorizes to enter the background training posture recommendation network, so the sample image uploaded by the user is only visible to the background, and what other users see are the processed recommended human posture and the corresponding posture. image (for example, the human face is hidden from the gesture image, or the human face is replaced or the character is replaced, etc.), thereby ensuring the privacy and security of the user, wherein the gesture image can be used to select different recommended human gestures.

其中,可采用本领域的已知的技术,在应用程序和/或拍摄模式中实现该图像上传功能,对此本公开实施例不作限制。Wherein, the image uploading function may be implemented in an application program and/or a shooting mode by using a known technology in the art, which is not limited by the embodiment of the present disclosure.

应理解的是,姿态推荐网络的训练过程可在服务器进行。在一种可能的实现方式中样本图像可上传至服务器来扩充第二样本集,并可定期基于第二样本集再训练姿态推荐网络,得到新版本的姿态推荐网络,进而更新终端设备中已部署的姿态推荐网络。通过该方式,可使姿态推荐网络学习到丰富的人体姿态,从而使训练得到的姿态推荐网络所输出的推荐人体姿态更全面、更丰富。It should be understood that the training process of the gesture recommendation network may be performed on the server. In a possible implementation, the sample images can be uploaded to the server to expand the second sample set, and the posture recommendation network can be periodically retrained based on the second sample set to obtain a new version of the posture recommendation network, and then update the deployed posture recommendation network in the terminal device. The pose recommendation network. In this way, the posture recommendation network can learn rich human body postures, so that the recommended human postures output by the trained posture recommendation network are more comprehensive and richer.

在一种可能的实现方式中,终端设备中已部署的姿态推荐网络的再训练过程,也可在终端设备上进行,也即,可直接利用对象上传的样本图像,对终端设备上已部署的姿态推荐网络进行增量训练。通过该方式,可使已部署的姿态推荐网络所输出的推荐人体姿态,更贴近对象的喜好,具有针对性。In a possible implementation manner, the retraining process of the posture recommendation network deployed in the terminal device can also be performed on the terminal device, that is, the sample image uploaded by the object can be directly used to retrain the deployed gesture recommendation network on the terminal device. The pose recommendation network is incrementally trained. In this way, the recommended human posture output by the deployed posture recommendation network can be closer to the object's preference and targeted.

应理解的是,第二样本集可是拍摄效果较优的优质图像,在本公开实施例中,能够使姿态推荐网络从优质图像中学习人体姿态,从而有效地基于场景图像输出优质的推荐人体姿态,以引导拍摄对象接近优质的推荐人体姿态,从而获得较好的拍摄效果,提升用户拍摄体验。It should be understood that the second sample set may be high-quality images with better shooting effects. In the embodiment of the present disclosure, the posture recommendation network can learn the human body posture from the high-quality images, thereby effectively outputting high-quality recommended human postures based on scene images. , to guide the subject to approach the high-quality recommended human posture, so as to obtain a better shooting effect and improve the user's shooting experience.

如上所述,推荐人体姿态包括多个,为便于对象选择或切换不同的推荐人体姿态,在一种可能的实现方式中,所述方法还包括:As described above, there are multiple recommended human body poses. In order to facilitate object selection or switching between different recommended human body poses, in a possible implementation manner, the method further includes:

在拍摄界面中显示多个推荐人体姿态对应的姿态图;Displaying a plurality of posture maps corresponding to the recommended human postures in the shooting interface;

响应于针对姿态图的选择操作,根据选中的姿态图,确定与选中的姿态图对应的推荐人体姿态并进行显示。In response to the selection operation on the posture map, according to the selected posture map, a recommended human body posture corresponding to the selected posture map is determined and displayed.

图10示出根据本公开实施例的一种拍摄界面的示意图,如图10所示,拍摄界面中可展示多个姿态图,还可展示识别出的场景特征“窗框、墙面”和对象特征“毛衣、长裤、手提高”,以提高趣味性。其中,可参照上述本公开实施例中对场景图像进行识别的方式,得到识别出的场景特征及对象特征。应理解的是,拍摄界面中还可提供用于查看多个姿态图的滑动控件,以通过滑动操作查看其它当前未显示在拍摄界面中的姿态图。FIG. 10 shows a schematic diagram of a photographing interface according to an embodiment of the present disclosure. As shown in FIG. 10 , the photographing interface can display a plurality of posture diagrams, and can also display the recognized scene features “window frame, wall” and objects Features "Sweater, Pants, Hand Raise" for more fun. Wherein, the identified scene features and object features can be obtained by referring to the method of identifying the scene image in the above-mentioned embodiments of the present disclosure. It should be understood that, a sliding control for viewing multiple posture diagrams may also be provided in the photographing interface, so as to view other posture graphs that are not currently displayed in the photographing interface through a sliding operation.

在一种可能的实现方式中,显示的姿态图可是缩略图,可理解为推荐人体姿态对应的预览图、示意图等。应理解的是,姿态图与推荐人体姿态对应,根据选中的任一姿态图,可得到对应的推荐人体姿态。In a possible implementation manner, the displayed posture map may be a thumbnail, which can be understood as a preview map, a schematic diagram, etc. corresponding to the recommended human body posture. It should be understood that the posture map corresponds to the recommended human body posture, and according to any selected posture map, the corresponding recommended human body posture can be obtained.

在一种可能的实现方式中,针对姿态图的选择操作,例如可包括滑动查看姿态图、点击选中任一姿态图等操作。其中,被选中的姿态图可突出显示,实现友好的人机交互。In a possible implementation manner, the selection operation for the posture map may include, for example, operations such as swiping to view the posture map, clicking to select any posture map, and the like. Among them, the selected posture map can be highlighted to achieve friendly human-computer interaction.

如上所述,显示的姿态图可是缩略图,在一种可能的实现方式中,在如图10示出的拍摄界面中选中任一姿态图后,可在拍摄界面中显示该姿态图对应的放大图,以便于对象更清楚的查看姿态图。As mentioned above, the displayed posture map can be a thumbnail image. In a possible implementation, after selecting any posture map in the shooting interface as shown in FIG. 10 , the zoomed-in corresponding to the posture map can be displayed in the shooting interface. map, so that the object can view the pose map more clearly.

图11a、图11b示出根据本公开实施例的拍摄界面的示意图。如图11a所示,中间区域显示的可是被选中的姿态图对应的放大图,其中,还可响应于长按放大图、双击放大图等操作,在如图11b所示的放大图中预览该姿态图对应的推荐人体姿态。11a and 11b illustrate schematic diagrams of a photographing interface according to an embodiment of the present disclosure. As shown in Fig. 11a, the enlarged image corresponding to the selected posture image can be displayed in the middle area, wherein, in response to operations such as long-pressing the enlarged image, double-clicking the enlarged image, etc., the image can be previewed in the enlarged image as shown in Fig. 11b. The recommended human pose corresponding to the pose map.

在一种可能的实现方式中,可通过点击图11a、图11b中示出的“拍同款”按钮,以确定选中的姿态图及对应的推荐人体姿态,并在如图5、图7、图8示出的拍摄界面中显示与选中的姿态图对应的推荐人体姿态。其中,显示推荐人体姿态的方式可参照上述本公开实施例。In a possible implementation manner, the selected posture diagram and the corresponding recommended human posture can be determined by clicking the "shoot the same style" button shown in Figure 11a and Figure 11b, and shown in Figure 5, Figure 7, The recommended human posture corresponding to the selected posture map is displayed in the photographing interface shown in FIG. 8 . Wherein, for the manner of displaying the recommended human body posture, reference may be made to the above-mentioned embodiments of the present disclosure.

需要说明的是,以上图10、图11a、图11b中显示姿态图以及选择姿态图的方式,是本公开实施 例提供的一种实现方式,实际上,本领域技术人员可根据实际需求,设计姿态图在拍摄界面中的显示方式、以及针对姿态图的选择方式等,对此本公开实施例不作限制。It should be noted that the manner of displaying the attitude diagram and selecting the attitude diagram in FIG. 10 , FIG. 11 a , and FIG. 11 b above is an implementation method provided by the embodiment of the present disclosure. In fact, those skilled in the art can design the The display manner of the attitude map in the shooting interface, the selection method for the attitude map, etc., are not limited in this embodiment of the present disclosure.

在本公开实施例中,能够便于用户选择不同的推荐人体姿态,满足不同的姿态喜好,提升拍摄体验,有利于得到令用户满意的拍摄图像。In the embodiment of the present disclosure, it is convenient for the user to select different recommended human body postures, to satisfy different posture preferences, to improve the shooting experience, and to help obtain a photographed image that satisfies the user.

在一种可能的实现方式中,所述方法还包括:In a possible implementation, the method further includes:

对拍摄图像进行处理,得到处理后的拍摄图像,其中,处理包括以下至少一种:对拍摄图像中的对象进行美颜处理、对拍摄图像添加滤镜;The captured image is processed to obtain a processed captured image, wherein the processing includes at least one of the following: performing beautification processing on an object in the captured image, and adding a filter to the captured image;

所述对拍摄图像添加滤镜,包括:根据与推荐人体姿态对应的推荐滤镜,调整拍摄图像的饱和度、色温、亮度中的至少一种。The adding a filter to the captured image includes: adjusting at least one of saturation, color temperature, and brightness of the captured image according to a recommended filter corresponding to the recommended human posture.

其中,可采用任何已知的图像美颜技术,实现对拍摄图像中的对象进行美颜处理,对此本公开实施例不作限制。例如,美颜处理的过程可包括:确定拍摄图像中人脸面部位置,再确定面部瑕疵位置,根据定位到面部瑕疵位置进行填补、修复或滤除等处理。Wherein, any known image beautifying technology can be used to realize the beautifying processing of the object in the captured image, which is not limited by the embodiment of the present disclosure. For example, the process of beauty processing may include: determining the position of the face in the captured image, then determining the position of the facial defect, and performing processing such as filling, repairing or filtering according to the location of the facial defect.

如上所示,推荐人体姿态可是从样本图像中学习到的,在一种可能的实现方式中,推荐滤镜可通过分析样本图像的滤镜获得,基于此,推荐人体姿态可与推荐滤镜对应。As shown above, the recommended human pose can be learned from the sample image. In a possible implementation, the recommended filter can be obtained by analyzing the filter of the sample image. Based on this, the recommended human pose can correspond to the recommended filter. .

应理解的是,推荐滤镜可包括推荐的饱和度、色温、亮度等滤镜参数。在一种可能的实现方式中,调整拍摄图像的饱和度、色温、亮度中的至少一种,可包括:将拍摄图像的饱和度、色温、亮度中的至少一种,调整至与推荐滤镜的滤镜参数一致。It should be understood that the recommended filter may include recommended filter parameters such as saturation, color temperature, and brightness. In a possible implementation manner, adjusting at least one of the saturation, color temperature, and brightness of the captured image may include: adjusting at least one of the saturation, color temperature, and brightness of the captured image to match the recommended filter The filter parameters are the same.

其中,本领域技术人员可采用本领域已知的图像处理技术,实现调整拍摄图像的饱和度、色温、亮度,对此本公开实施例不作限制。Wherein, those skilled in the art can use image processing technologies known in the art to adjust the saturation, color temperature, and brightness of the captured image, which is not limited to the embodiment of the present disclosure.

在本公开实施例中,能够使拍摄图像的视觉效果更优质,有利于得到令用户满意的拍摄图像,提升图像拍摄体验。In the embodiment of the present disclosure, the visual effect of the captured image can be improved, which is beneficial to obtain the captured image that satisfies the user, and improves the image capturing experience.

在一种可能的实现方式中,本公开实施例的图像拍摄方法,能够应用于人工智能教育平台、社交分享平台、视频图像拍摄软件等,能够根据场景图像智能地推荐与场景图像对应的推荐人体姿态,供用户进行自拍和摄像构图指导,提高图像拍摄的趣味性,提高拍摄体验,有利于得到令用户满足的拍摄图像。In a possible implementation manner, the image capturing method of the embodiment of the present disclosure can be applied to artificial intelligence education platforms, social sharing platforms, video image capturing software, etc., and can intelligently recommend recommended human bodies corresponding to the scene images according to the scene images. The posture is used for self-portrait and camera composition guidance for users, which improves the fun of image shooting, improves the shooting experience, and is conducive to obtaining a shooting image that satisfies the user.

其中,将图像拍摄方法应用于人工智能教育平台时,可借助与该人工智能教育平台连接的智能硬件,如Jetson Nano(一种英伟达开发的微型计算机),树莓派Raspberry Pi(一种微型电脑)等实现上述图像拍摄方法,通过该方式,能够基于可视化的形态,使用户更容易启发学习兴趣。Among them, when the image capturing method is applied to the artificial intelligence education platform, intelligent hardware connected to the artificial intelligence education platform can be used, such as Jetson Nano (a microcomputer developed by NVIDIA), Raspberry Pi (a microcomputer) ) etc. to implement the above image capturing method, through this method, based on the form of visualization, it is easier for the user to inspire learning interest.

图12示出根据本公开实施例的图像拍摄方法的应用示意图。如图12所示,人工智能教育平台与智能硬件连接,其中,人工智能教育平台用于编辑实现所述图像拍摄方法的项目代码,并将项目代码发送至智能硬件,智能硬件用于采集场景图像及运行项目代码,得到运行结果,并将运行结果发送至人工智能教育平台,以在人工智能教育平台的显示界面中显示运行结果。FIG. 12 shows a schematic diagram of an application of an image capturing method according to an embodiment of the present disclosure. As shown in Figure 12, the artificial intelligence education platform is connected to the intelligent hardware, wherein the artificial intelligence education platform is used to edit the project code to realize the image capturing method, and send the project code to the intelligent hardware, and the intelligent hardware is used to collect scene images And run the project code, get the running results, and send the running results to the artificial intelligence education platform to display the running results in the display interface of the artificial intelligence education platform.

其中,可在人工智能教育平台的网页端的显示界面中显示场景图像、拍摄图像以及图像拍摄方法的运行结果。其中,运行结果可包括推荐人体姿态、真实人体姿态、对拍摄图像添加滤镜后的拍摄图像、对拍摄图像中的对象进行美颜处理后的拍摄图像等。Among them, the scene image, the captured image and the operation result of the image capturing method can be displayed on the display interface of the web page of the artificial intelligence education platform. The running results may include recommended human body postures, real human body postures, captured images after adding filters to the captured images, captured images after beautifying the objects in the captured images, and the like.

在一种可能的实现方式中,该人工智能教育平台可以支持学生在平台上编辑实现上述图像拍摄方法的项目代码,例如训练姿态推荐网络等代码,并连接智能硬件,将用于实现上述图像拍摄方法的项目代码发送到智能硬件上执行,从而可通过智能硬件实时采集场景图像,并基于采集的场景图像实现上述图像拍摄方法。In a possible implementation, the artificial intelligence education platform can support students to edit and implement the project code of the above-mentioned image shooting method on the platform, such as training code such as posture recommendation network, and connect with intelligent hardware, which will be used to realize the above-mentioned image shooting. The project code of the method is sent to the intelligent hardware for execution, so that the scene image can be collected in real time by the intelligent hardware, and the above-mentioned image capturing method can be realized based on the collected scene image.

其中,学生还可以在该人工智能教育平台上编辑项目代码,以更新优化上述图像拍摄方法,例如优化各神经网络等。Among them, students can also edit the project code on the artificial intelligence education platform to update and optimize the above-mentioned image shooting methods, such as optimizing each neural network.

在本公开实施例中,可无须在本地安装各类硬件驱动、部署各类人工智能算法的依赖库和依赖环境,能够使用户通过有趣的案例进行学习,学习人体姿态识别、人体关键点检测等人工智能算法。In the embodiment of the present disclosure, it is not necessary to install various hardware drivers locally, deploy the dependent libraries and dependent environments of various artificial intelligence algorithms, and enable users to learn through interesting cases, such as human body gesture recognition, human body key point detection, etc. artificial intelligence algorithms.

可以理解,本公开提及的上述各个方法实施例,在不违背原理逻辑的情况下,均可以彼此相互结合形成结合后的实施例,限于篇幅,本公开不再赘述。本领域技术人员可以理解,在具体实施方式的上述方法中,各步骤的具体执行顺序应当以其功能和可能的内在逻辑确定。It can be understood that the above-mentioned method embodiments mentioned in the present disclosure can be combined with each other to form a combined embodiment without violating the principle and logic. Those skilled in the art can understand that, in the above method of the specific embodiment, the specific execution order of each step should be determined by its function and possible internal logic.

此外,本公开还提供了图像拍摄装置、电子设备、计算机可读存储介质、程序,上述均可用来实现本公开提供的任一种图像拍摄方法,相应技术方案和描述和参见方法部分的相应记载,不再赘述。In addition, the present disclosure also provides image capturing devices, electronic devices, computer-readable storage media, and programs, all of which can be used to implement any image capturing method provided by the present disclosure. For the corresponding technical solutions and descriptions, refer to the corresponding records in the Methods section. ,No longer.

图13示出根据本公开实施例的图像拍摄装置的框图,如图13所示,所述装置包括:FIG. 13 shows a block diagram of an image capturing apparatus according to an embodiment of the present disclosure. As shown in FIG. 13 , the apparatus includes:

获取部分101,被配置为获取与拍摄界面对应的场景图像;The acquiring part 101 is configured to acquire the scene image corresponding to the shooting interface;

识别部分102,被配置为对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态;The identification part 102 is configured to identify the scene image, and determine the scene category corresponding to the scene image and the recommended human posture corresponding to the scene category;

检测部分103,被配置为对所述场景图像进行人体关键点检测,确定所述场景图像中的对象以及所述对象的真实人体姿态;The detection part 103 is configured to perform human body key point detection on the scene image, and determine the object in the scene image and the real human body posture of the object;

显示部分104,被配置为通过在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,引导所述对象根据所述推荐人体姿态进行姿态调整;The display part 104 is configured to guide the object to perform posture adjustment according to the recommended human body posture by displaying the real human body posture and the recommended human body posture in the shooting interface;

拍摄部分105,被配置为在满足拍摄条件的情况下,得到所述对象的拍摄图像。The photographing part 105 is configured to obtain a photographed image of the object when photographing conditions are satisfied.

在一种可能的实现方式中,所述推荐人体姿态包括单人姿态,所述显示部分104,包括:第一显示子部分,被配置为根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态以及所述单人姿态;或,第二显示子部分,被配置为根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态,并在所述拍摄界面的第一指定区域处显示所述单人姿态。In a possible implementation manner, the recommended human posture includes a single-person posture, and the display part 104 includes: a first display sub-part, configured to, according to the position of the object in the scene image, display The real human body posture and the single-person posture are displayed in the shooting interface; or, the second display sub-section is configured to display all the objects in the shooting interface according to the position of the object in the scene image. The real human body posture is displayed, and the single-person posture is displayed in the first designated area of the shooting interface.

在一种可能的实现方式中,所述推荐人体姿态包括多人组合姿态,所述装置还包括:位置确定部分,被配置为根据所述场景图像中多个对象之间的相对位置,以及所述多人组合姿态中各个姿态之间的相对位置,从所述多人组合姿态中分别确定各对象的对应姿态;其中,所述显示部分104,包括:第三显示子部分,被配置为针对任一对象,根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述对象的真实人体姿态以及所述对象的对应姿态;或,第四显示子部分,被配置为根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述对象的真实人体姿态,并在所述拍摄界面的第二指定区域处显示所述对象的对应姿态。In a possible implementation manner, the recommended human body posture includes a multi-person combined posture, and the apparatus further includes: a position determination part configured to The relative position between each gesture in the multi-person combined gesture, and the corresponding gesture of each object is respectively determined from the multi-person combined gesture; wherein, the display part 104 includes: a third display sub-section, which is configured for For any object, according to the position of the object in the scene image, the real human body posture of the object and the corresponding posture of the object are displayed in the shooting interface; or, the fourth display subsection is configured as According to the position of the object in the scene image, the real human body posture of the object is displayed in the shooting interface, and the corresponding posture of the object is displayed in the second designated area of the shooting interface.

在一种可能的实现方式中,所述装置还包括:第一确定部分,被配置为确定所述真实人体姿态的第一人体关键点,与所述推荐人体姿态的第二人体关键点之间的关键点对;第二确定部分,被配置为通过计算所述关键点对的第一相似度,从所述第一人体关键点中确定出所述第一相似度小于第一预设阈值的第三人体关键点;其中,所述显示部分104,包括:突出显示子部分,被配置为突出显示所述第三人体关键点所在的区域,其中,突出显示的方式包括高亮、加粗、变更颜色中的至少一种。In a possible implementation manner, the apparatus further includes: a first determining part configured to determine the distance between the first human body key point of the real human body posture and the second human body key point of the recommended human body posture The key point pair; the second determination part is configured to determine from the first human body key points that the first similarity is less than the first preset threshold by calculating the first similarity of the key point pair. The third human body key point; wherein, the display part 104 includes: a highlighting sub-section configured to highlight the region where the third human body key point is located, wherein the highlighting method includes highlighting, bolding, Change at least one of the colors.

在一种可能的实现方式中,所述装置还包括:第三确定部分,被配置为根据所述真实人体姿态与所述推荐人体姿态之间的关键点对的第一相似度,确定所述真实人体姿态与所述推荐人体姿态之间的第二相似度,所述关键点对是根据所述真实人体姿态的第一人体关键点与所述推荐人体姿态的第二人体关键点确定的;相似度显示部分,被配置为在所述拍摄界面中显示所述第二相似度。In a possible implementation manner, the apparatus further includes: a third determination part, configured to determine the the second similarity between the real human body posture and the recommended human body posture, and the key point pair is determined according to the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; The similarity display part is configured to display the second similarity in the shooting interface.

在一种可能的实现方式中,所述装置还包括:区域确定部分,被配置为将所述场景图像输入至区域推荐网络,得到所述场景图像的推荐拍照区域,所述推荐拍照区域用于表征在所述场景图像中推荐拍照的区域,所述区域推荐网络是通过标注拍照区域的第一样本集训练得到的神经网络;其中,所述装置还包括:区域显示部分,被配置为在所述拍摄界面中,采用标识符指示所述推荐拍照区域;或,在所述拍摄界面的所述推荐拍照区域处显示所述推荐人体姿态;所述推荐拍照区域用于引导所述对象根据所述推荐拍照区域进行位置调整。In a possible implementation manner, the apparatus further includes: an area determination part, configured to input the scene image into an area recommendation network to obtain a recommended photographing area of the scene image, where the recommended photographing area is used for Representing the area recommended to be photographed in the scene image, and the area recommendation network is a neural network trained by marking the first sample set of the photographing area; wherein, the device further includes: an area display part, configured to In the shooting interface, an identifier is used to indicate the recommended photographing area; or, the recommended human posture is displayed in the recommended photographing area of the photographing interface; the recommended photographing area is used to guide the object according to the recommended photographing area. Adjust the position of the recommended photo area as described above.

在一种可能的实现方式中,所述区域显示部分,还被配置为在所述拍摄界面中,采用标识符指示所述推荐拍照区域;在所述拍摄界面的所述推荐拍照区域处显示所述推荐人体姿态。In a possible implementation manner, the area display part is further configured to use an identifier to indicate the recommended photographing area in the photographing interface; and display the recommended photographing area on the photographing interface. The recommended human posture is described.

在一种可能的实现方式中,所述识别部分102,还被配置为通过姿态推荐网络对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态,其中,所述姿态推荐网络是通过标注有样本场景类别以及样本人体姿态的第二样本集训练得到的神经网络,所述第二样本集包括对象上传的样本图像。In a possible implementation manner, the identifying part 102 is further configured to identify the scene image through a gesture recommendation network, and determine a scene category corresponding to the scene image and a recommended human body corresponding to the scene category pose, wherein the pose recommendation network is a neural network obtained by training a second sample set marked with sample scene categories and sample human poses, and the second sample set includes sample images uploaded by objects.

在一种可能的实现方式中,所述推荐人体姿态包括多个,所述装置还包括:姿态图显示部分,被配置为在所述拍摄界面中显示多个推荐人体姿态对应的姿态图;选择部分,被配置为响应于针对所述姿态图的选择操作,根据选中的姿态图,确定与所述选中的姿态图对应的推荐人体姿态并进行显示。In a possible implementation manner, the recommended human body postures include a plurality of postures, and the apparatus further includes: a posture map display part configured to display the posture maps corresponding to the plurality of recommended human body postures in the shooting interface; selecting The part is configured to, in response to a selection operation on the posture map, determine and display a recommended human body posture corresponding to the selected posture map according to the selected posture map.

在一种可能的实现方式中,所述拍摄条件包括:所述真实人体姿态与所述推荐人体姿态之间的第二相似度大于或等于第二预设阈值;或,所述拍摄界面中的拍摄控件被触发。In a possible implementation manner, the shooting conditions include: a second degree of similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold; or, in the shooting interface Shooting controls are triggered.

在一种可能的实现方式中,所述装置还包括:处理部分,被配置为对所述拍摄图像进行处理,得到处理后的拍摄图像,其中,所述处理包括以下至少一种:对所述拍摄图像中的对象进行美颜处理、对所述拍摄图像添加滤镜;所述对所述拍摄图像添加滤镜,包括:根据与所述推荐人体姿态对 应的推荐滤镜,调整所述拍摄图像的饱和度、色温、亮度中的至少一种。In a possible implementation manner, the apparatus further includes: a processing part configured to process the captured image to obtain a processed captured image, wherein the processing includes at least one of the following: Performing beautification processing on the object in the captured image, and adding a filter to the captured image; the adding a filter to the captured image includes: adjusting the captured image according to the recommended filter corresponding to the recommended human posture at least one of saturation, color temperature, and brightness.

在一种可能的实现方式中,所述装置应用于人工智能教育平台,所述人工智能教育平台与智能硬件连接;其中,所述人工智能教育平台用于编辑实现所述图像拍摄装置的项目代码,并将所述项目代码发送至所述智能硬件;所述智能硬件用于采集场景图像及运行所述项目代码,得到运行结果,并将所述运行结果发送至所述人工智能教育平台,以在所述人工智能教育平台的显示界面中显示所述运行结果。In a possible implementation, the device is applied to an artificial intelligence education platform, and the artificial intelligence education platform is connected to intelligent hardware; wherein, the artificial intelligence education platform is used to edit and implement the project code of the image capturing device , and send the project code to the intelligent hardware; the intelligent hardware is used to collect scene images and run the project code to obtain the operation result, and send the operation result to the artificial intelligence education platform to The operation result is displayed on the display interface of the artificial intelligence education platform.

在本公开实施例中,能够根据场景图像的场景类别,向用户推荐与场景类别对应的推荐人体姿态,这样使得推荐人体姿态是与场景图像匹配的,并且通过在拍摄界面中显示真实人体姿态以及推荐人体姿态,可引导用户根据显示的推荐人体姿态进行姿态调整或便于用户根据显示的推荐人体姿态来指导拍摄对象进行姿态调整,有效提高图像拍摄效果,提升用户的拍摄体验,有利于得到令用户满足的拍摄图像。In the embodiment of the present disclosure, the recommended human body pose corresponding to the scene category can be recommended to the user according to the scene category of the scene image, so that the recommended human body pose matches the scene image, and by displaying the real human body pose and The recommended human posture can guide the user to adjust the posture according to the displayed recommended human posture or facilitate the user to guide the subject to adjust the posture according to the displayed recommended human posture. Satisfied taking pictures.

在一些实施例中,本公开实施例提供的装置具有的功能或包含的部分可以被配置为执行上文方法实施例描述的方法,其具体实现可以参照上文方法实施例的描述,为了简洁,这里不再赘述。In some embodiments, the functions or included parts of the apparatus provided in the embodiments of the present disclosure may be configured to execute the methods described in the above method embodiments, and the specific implementation may refer to the above method embodiments. For brevity, I won't go into details here.

本公开实施例还提出一种计算机可读存储介质,其上存储有计算机程序指令,所述计算机程序指令被处理器执行时实现上述方法。计算机可读存储介质可以是易失性或非易失性计算机可读存储介质。Embodiments of the present disclosure further provide a computer-readable storage medium, on which computer program instructions are stored, and when the computer program instructions are executed by a processor, the foregoing method is implemented. Computer-readable storage media can be volatile or non-volatile computer-readable storage media.

本公开实施例还提出一种电子设备,包括:处理器;被配置为存储处理器可执行指令的存储器;其中,所述处理器被配置为调用所述存储器存储的指令,以执行上述方法。An embodiment of the present disclosure further provides an electronic device, including: a processor; a memory configured to store instructions executable by the processor; wherein the processor is configured to invoke the instructions stored in the memory to execute the above method.

本公开实施例还提供了一种计算机程序产品,包括计算机可读代码,当计算机可读代码在设备上运行时,设备中的处理器执行用于实现如上任一实施例提供的图像拍摄方法的指令。Embodiments of the present disclosure also provide a computer program product, including computer-readable codes. When the computer-readable codes are run on a device, a processor in the device executes the image capturing method for implementing the image capturing method provided by any of the above embodiments. instruction.

本公开实施例还提供了另一种计算机程序产品,用于存储计算机可读指令,指令被执行时使得计算机执行上述任一实施例提供的图像拍摄方法的操作。Embodiments of the present disclosure further provide another computer program product for storing computer-readable instructions, which, when executed, cause the computer to perform the operations of the image capturing method provided by any of the foregoing embodiments.

本公开实施例中的电子设备可以被提供为终端设备或服务器。The electronic device in the embodiment of the present disclosure may be provided as a terminal device or a server.

图14示出根据本公开实施例的一种电子设备800的框图。例如,电子设备800可以是移动电话,计算机,数字广播终端,消息收发设备,游戏控制台,平板设备,医疗设备,健身设备,个人数字助理等终端。FIG. 14 shows a block diagram of an electronic device 800 according to an embodiment of the present disclosure. For example, electronic device 800 may be a mobile phone, computer, digital broadcast terminal, messaging device, game console, tablet device, medical device, fitness device, personal digital assistant, etc. terminal.

参照图14,电子设备800可以包括以下一个或多个组件:处理组件802,存储器804,电源组件806,多媒体组件808,音频组件810,输入/输出(I/O)的接口812,传感器组件814,以及通信组件816。14, an electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power supply component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, a sensor component 814 , and the communication component 816 .

处理组件802通常控制电子设备800的整体操作,诸如与显示,电话呼叫,数据通信,相机操作和记录操作相关联的操作。处理组件802可以包括一个或多个处理器820来执行指令,以完成上述的方法的全部或部分步骤。此外,处理组件802可以包括一个或多个模块,便于处理组件802和其他组件之间的交互。例如,处理组件802可以包括多媒体模块,以方便多媒体组件808和处理组件802之间的交互。The processing component 802 generally controls the overall operation of the electronic device 800, such as operations associated with display, phone calls, data communications, camera operations, and recording operations. The processing component 802 can include one or more processors 820 to execute instructions to perform all or some of the steps of the methods described above. Additionally, processing component 802 may include one or more modules that facilitate interaction between processing component 802 and other components. For example, processing component 802 may include a multimedia module to facilitate interaction between multimedia component 808 and processing component 802.

存储器804被配置为存储各种类型的数据以支持在电子设备800的操作。这些数据的示例包括用于在电子设备800上操作的任何应用程序或方法的指令,联系人数据,电话簿数据,消息,图片,视频等。存储器804可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。Memory 804 is configured to store various types of data to support operation at electronic device 800 . Examples of such data include instructions for any application or method operating on electronic device 800, contact data, phonebook data, messages, pictures, videos, and the like. Memory 804 may be implemented by any type of volatile or nonvolatile storage device or combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic or Optical Disk.

电源组件806为电子设备800的各种组件提供电力。电源组件806可以包括电源管理系统,一个或多个电源,及其他与为电子设备800生成、管理和分配电力相关联的组件。Power supply assembly 806 provides power to various components of electronic device 800 . Power supply components 806 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power to electronic device 800 .

多媒体组件808包括在所述电子设备800和用户之间的提供一个输出接口的屏幕。在一些实施例中,屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板,屏幕可以被实现为触摸屏,以接收来自用户的输入信号。触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。所述触摸传感器可以不仅感测触摸或滑动动作的边界,而且还检测与所述触摸或滑动操作相关的持续时间和压力。在一些实施例中,多媒体组件808包括一个前置摄像头和/或后置摄像头。当电子设备800处于操作模式,如拍摄模式或视频模式时,前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。Multimedia component 808 includes a screen that provides an output interface between the electronic device 800 and the user. In some embodiments, the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from a user. The touch panel includes one or more touch sensors to sense touch, swipe, and gestures on the touch panel. The touch sensor may not only sense the boundaries of a touch or swipe action, but also detect the duration and pressure associated with the touch or swipe action. In some embodiments, the multimedia component 808 includes a front-facing camera and/or a rear-facing camera. When the electronic device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera may receive external multimedia data. Each of the front and rear cameras can be a fixed optical lens system or have focal length and optical zoom capability.

音频组件810被配置为输出和/或输入音频信号。例如,音频组件810包括一个麦克风(MIC),当电子设备800处于操作模式,如呼叫模式、记录模式和语音识别模式时,麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器804或经由通信组件816发送。在一些实施例 中,音频组件810还包括一个扬声器,用于输出音频信号。Audio component 810 is configured to output and/or input audio signals. For example, audio component 810 includes a microphone (MIC) that is configured to receive external audio signals when electronic device 800 is in operating modes, such as calling mode, recording mode, and voice recognition mode. The received audio signal may be further stored in memory 804 or transmitted via communication component 816 . In some embodiments, audio component 810 also includes a speaker for outputting audio signals.

I/O接口812为处理组件802和外围接口模块之间提供接口,上述外围接口模块可以是键盘,点击轮,按钮等。这些按钮可包括但不限于:主页按钮、音量按钮、启动按钮和锁定按钮。The I/O interface 812 provides an interface between the processing component 802 and a peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to: home button, volume buttons, start button, and lock button.

传感器组件814包括一个或多个传感器,用于为电子设备800提供各个方面的状态评估。例如,传感器组件814可以检测到电子设备800的打开/关闭状态,组件的相对定位,例如所述组件为电子设备800的显示器和小键盘,传感器组件814还可以检测电子设备800或电子设备800一个组件的位置改变,用户与电子设备800接触的存在或不存在,电子设备800方位或加速/减速和电子设备800的温度变化。传感器组件814可以包括接近传感器,被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件814还可以包括光传感器,如互补金属氧化物半导体(CMOS)或电荷耦合装置(CCD)图像传感器,用于在成像应用中使用。在一些实施例中,该传感器组件814还可以包括加速度传感器,陀螺仪传感器,磁传感器,压力传感器或温度传感器。Sensor assembly 814 includes one or more sensors for providing status assessment of various aspects of electronic device 800 . For example, the sensor assembly 814 can detect the on/off state of the electronic device 800, the relative positioning of the components, such as the display and the keypad of the electronic device 800, the sensor assembly 814 can also detect the electronic device 800 or one of the electronic device 800 Changes in the position of components, presence or absence of user contact with the electronic device 800 , orientation or acceleration/deceleration of the electronic device 800 and changes in the temperature of the electronic device 800 . Sensor assembly 814 may include a proximity sensor configured to detect the presence of nearby objects in the absence of any physical contact. Sensor assembly 814 may also include a light sensor, such as a complementary metal oxide semiconductor (CMOS) or charge coupled device (CCD) image sensor, for use in imaging applications. In some embodiments, the sensor assembly 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.

通信组件816被配置为便于电子设备800和其他设备之间有线或无线方式的通信。电子设备800可以接入基于通信标准的无线网络,如无线网络(WiFi),第二代移动通信技术(2G)或第三代移动通信技术(3G),或它们的组合。在一个示例性实施例中,通信组件816经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中,所述通信组件816还包括近场通信(NFC)模块,以促进短程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。Communication component 816 is configured to facilitate wired or wireless communication between electronic device 800 and other devices. The electronic device 800 may access a wireless network based on a communication standard, such as wireless network (WiFi), second generation mobile communication technology (2G) or third generation mobile communication technology (3G), or a combination thereof. In one exemplary embodiment, the communication component 816 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 816 also includes a near field communication (NFC) module to facilitate short-range communication. For example, the NFC module may be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.

在示例性实施例中,电子设备800可以被一个或多个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述方法。In an exemplary embodiment, electronic device 800 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A programmed gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation is used to perform the above method.

在示例性实施例中,还提供了一种非易失性计算机可读存储介质,例如包括计算机程序指令的存储器804,上述计算机程序指令可由电子设备800的处理器820执行以完成上述方法。In an exemplary embodiment, a non-volatile computer-readable storage medium, such as a memory 804 comprising computer program instructions executable by the processor 820 of the electronic device 800 to perform the above method is also provided.

本公开可以是系统、方法和/或计算机程序产品。计算机程序产品可以包括计算机可读存储介质,其上载有用于使处理器实现本公开的各个方面的计算机可读程序指令。The present disclosure may be a system, method and/or computer program product. The computer program product may include a computer-readable storage medium having computer-readable program instructions loaded thereon for causing a processor to implement various aspects of the present disclosure.

计算机可读存储介质可以是可以保持和存储由指令执行设备使用的指令的有形设备。计算机可读存储介质例如可以是(但不限于)电存储设备、磁存储设备、光存储设备、电磁存储设备、半导体存储设备或者上述的任意合适的组合。计算机可读存储介质的更具体的例子(非穷举的列表)包括:便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、静态随机存取存储器(SRAM)、便携式压缩盘只读存储器(CD-ROM)、数字多功能盘(DVD)、记忆棒、软盘、机械编码设备、例如其上存储有指令的打孔卡或凹槽内凸起结构、以及上述的任意合适的组合。这里所使用的计算机可读存储介质不被解释为瞬时信号本身,诸如无线电波或者其他自由传播的电磁波、通过波导或其他传输媒介传播的电磁波(例如,通过光纤电缆的光脉冲)、或者通过电线传输的电信号。A computer-readable storage medium may be a tangible device that can hold and store instructions for use by the instruction execution device. The computer-readable storage medium may be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. More specific examples (non-exhaustive list) of computer readable storage media include: portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM) or flash memory), static random access memory (SRAM), portable compact disk read only memory (CD-ROM), digital versatile disk (DVD), memory sticks, floppy disks, mechanically coded devices, such as printers with instructions stored thereon Hole cards or raised structures in grooves, and any suitable combination of the above. Computer-readable storage media, as used herein, are not to be construed as transient signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (eg, light pulses through fiber optic cables), or through electrical wires transmitted electrical signals.

这里所描述的计算机可读程序指令可以从计算机可读存储介质下载到各个计算/处理设备,或者通过网络、例如因特网、局域网、广域网和/或无线网下载到外部计算机或外部存储设备。网络可以包括铜传输电缆、光纤传输、无线传输、路由器、防火墙、交换机、网关计算机和/或边缘服务器。每个计算/处理设备中的网络适配卡或者网络接口从网络接收计算机可读程序指令,并转发该计算机可读程序指令,以供存储在各个计算/处理设备中的计算机可读存储介质中。The computer readable program instructions described herein may be downloaded to various computing/processing devices from a computer readable storage medium, or to an external computer or external storage device over a network such as the Internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer-readable program instructions from a network and forwards the computer-readable program instructions for storage in a computer-readable storage medium in each computing/processing device .

用于执行本公开操作的计算机程序指令可以是汇编指令、指令集架构(ISA)指令、机器指令、机器相关指令、微代码、固件指令、状态设置数据、或者以一种或多种编程语言的任意组合编写的源代码或目标代码,所述编程语言包括面向对象的编程语言—诸如Smalltalk、C++等,以及常规的过程式编程语言—诸如“C”语言或类似的编程语言。计算机可读程序指令可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络—包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。在一些实施例中,通过利用计算机可读程序指令的状态信息来个性化定制电子电路,例如可编程逻辑电路、现场可编程门阵列(FPGA)或可编程逻辑阵列(PLA),该电子电路可以执行计算机可读程序指令,从而实现本公开的各个方面。Computer program instructions for carrying out operations of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-dependent instructions, microcode, firmware instructions, state setting data, or instructions in one or more programming languages. Source or object code, written in any combination, including object-oriented programming languages, such as Smalltalk, C++, etc., and conventional procedural programming languages, such as the "C" language or similar programming languages. The computer readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server implement. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider through the Internet connect). In some embodiments, custom electronic circuits, such as programmable logic circuits, field programmable gate arrays (FPGAs), or programmable logic arrays (PLAs), can be personalized by utilizing state information of computer readable program instructions. Computer readable program instructions are executed to implement various aspects of the present disclosure.

这里参照根据本公开实施例的方法、装置(系统)和计算机程序产品的流程图和/或框图描述了本公开的各个方面。应当理解,流程图和/或框图的每个方框以及流程图和/或框图中各方框的组合,都可以由计算机可读程序指令实现。Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions.

这些计算机可读程序指令可以提供给通用计算机、专用计算机或其它可编程数据处理装置的处理器,从而生产出一种机器,使得这些指令在通过计算机或其它可编程数据处理装置的处理器执行时,产生了实现流程图和/或框图中的一个或多个方框中规定的功能/动作的装置。也可以把这些计算机可读程序指令存储在计算机可读存储介质中,这些指令使得计算机、可编程数据处理装置和/或其他设备以特定方式工作,从而,存储有指令的计算机可读介质则包括一个制造品,其包括实现流程图和/或框图中的一个或多个方框中规定的功能/动作的各个方面的指令。These computer readable program instructions may be provided to a processor of a general purpose computer, special purpose computer or other programmable data processing apparatus to produce a machine that causes the instructions when executed by the processor of the computer or other programmable data processing apparatus , resulting in means for implementing the functions/acts specified in one or more blocks of the flowchart and/or block diagrams. These computer readable program instructions can also be stored in a computer readable storage medium, these instructions cause a computer, programmable data processing apparatus and/or other equipment to operate in a specific manner, so that the computer readable medium storing the instructions includes An article of manufacture comprising instructions for implementing various aspects of the functions/acts specified in one or more blocks of the flowchart and/or block diagrams.

也可以把计算机可读程序指令加载到计算机、其它可编程数据处理装置、或其它设备上,使得在计算机、其它可编程数据处理装置或其它设备上执行一系列操作步骤,以产生计算机实现的过程,从而使得在计算机、其它可编程数据处理装置、或其它设备上执行的指令实现流程图和/或框图中的一个或多个方框中规定的功能/动作。Computer readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other equipment to cause a series of operational steps to be performed on the computer, other programmable data processing apparatus, or other equipment to produce a computer-implemented process , thereby causing instructions executing on a computer, other programmable data processing apparatus, or other device to implement the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.

附图中的流程图和框图显示了根据本公开的多个实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或指令的一部分,所述模块、程序段或指令的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more functions for implementing the specified logical function(s) executable instructions. In some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented in dedicated hardware-based systems that perform the specified functions or actions , or can be implemented in a combination of dedicated hardware and computer instructions.

该计算机程序产品可以具体通过硬件、软件或其结合的方式实现。在一个可选实施例中,所述计算机程序产品具体体现为计算机存储介质,在另一个可选实施例中,计算机程序产品具体体现为软件产品,例如软件开发包(Software Development Kit,SDK)等等。The computer program product can be specifically implemented by hardware, software or a combination thereof. In an optional embodiment, the computer program product is embodied as a computer storage medium, and in another optional embodiment, the computer program product is embodied as a software product, such as a software development kit (Software Development Kit, SDK), etc. Wait.

以上已经描述了本公开的各实施例,上述说明是示例性的,并非穷尽性的,并且也不限于所披露的各实施例。在不偏离所说明的各实施例的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。本文中所用术语的选择,旨在最好地解释各实施例的原理、实际应用或对市场中的技术的改进,或者使本技术领域的其它普通技术人员能理解本文披露的各实施例。Various embodiments of the present disclosure have been described above, and the foregoing descriptions are exemplary, not exhaustive, and not limiting of the disclosed embodiments. Numerous modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the various embodiments, the practical application or improvement over the technology in the marketplace, or to enable others of ordinary skill in the art to understand the various embodiments disclosed herein.

工业实用性Industrial Applicability

在本公开实施例中,能够根据场景图像的场景类别,向用户推荐与场景类别对应的推荐人体姿态,这样使得推荐人体姿态是与场景图像匹配的,并且通过在拍摄界面中显示真实人体姿态以及推荐人体姿态,可引导用户根据显示的推荐人体姿态进行姿态调整或便于用户根据显示的推荐人体姿态来指导拍摄对象进行姿态调整,有效提高图像拍摄效果,提升用户的拍摄体验,有利于得到令用户满足的拍摄图像。以及,通过在拍摄界面显示单人姿态,能够有效引导对象完成单人姿态的拍摄,提升拍摄体验,并且推荐人体姿态可固定显示、可跟随对象显示,能够满足不同的姿态显示需求。以及,通过在拍摄界面显示多人组合姿态,能够有效引导多个对象共同完成多人组合姿态,提升拍摄体验,并且推荐人体姿态可固定显示、可跟随对象显示,能够满足不同的姿态显示需求。以及,能够对显示的真实人体姿态中相似度低的人体部位,进行突出显示,从而更有效地引导用户调整姿态,以提升拍摄体验。以及,能够显示真实人体姿态与推荐人体姿态之间整体的相似程度,以有效引导用户根据整体的相似度调整姿态,提升拍摄体验。以及,通过推荐拍照区域,能够向对象推荐视觉效果较好的拍照区域,有效引导对象调整位置来提升拍摄图像的视觉效果,从而提升用户拍摄体验。以及,能够利用姿态推荐网络从优质图像中学习人体姿态,从而有效地基于场景图像输出优质的推荐人体姿态。并便于用户选择不同的推荐人体姿态,满足不同的姿态喜好,提升拍摄体验,有利于得到令用户满意的拍摄图像。以及,在人工智能教育平台中应用本公开实施例中的方法,能够使用户通过有趣的案例进行学习,学习人体姿态识别、人体关键点检测等人工智能算法。In the embodiment of the present disclosure, the recommended human body pose corresponding to the scene category can be recommended to the user according to the scene category of the scene image, so that the recommended human body pose matches the scene image, and by displaying the real human body pose and The recommended human posture can guide the user to adjust the posture according to the displayed recommended human posture or facilitate the user to guide the subject to adjust the posture according to the displayed recommended human posture. Satisfied taking pictures. And, by displaying the single-person posture on the shooting interface, the subject can be effectively guided to complete the shooting of the single-person posture, improving the shooting experience, and it is recommended that the human body posture can be displayed fixedly and can be displayed with the object, which can meet different posture display needs. And, by displaying the combined pose of multiple people on the shooting interface, multiple objects can be effectively guided to complete the combined pose of multiple people, improving the shooting experience, and it is recommended that the human body pose can be displayed fixedly and can be displayed with the object, which can meet different pose display needs. And, the parts of the human body with low similarity among the displayed real human body postures can be highlighted, so as to more effectively guide the user to adjust the posture, so as to improve the shooting experience. And, the overall similarity between the real human body posture and the recommended human posture can be displayed, so as to effectively guide the user to adjust the posture according to the overall similarity, and improve the shooting experience. And, by recommending a photographing area, a photographing area with better visual effect can be recommended to the subject, and the subject can be effectively guided to adjust the position to improve the visual effect of the photographed image, thereby improving the user's photographing experience. And, the pose recommendation network can be used to learn human poses from high-quality images, so as to effectively output high-quality recommended human poses based on scene images. And it is convenient for the user to select different recommended human body postures, to satisfy different posture preferences, to improve the shooting experience, and to be beneficial to obtain a shooting image that satisfies the user. And, applying the methods in the embodiments of the present disclosure in an artificial intelligence education platform enables users to learn through interesting cases, and learn artificial intelligence algorithms such as human body gesture recognition and human body key point detection.

Claims (25)

一种图像拍摄方法,包括:An image capturing method comprising: 获取与拍摄界面对应的场景图像;Obtain the scene image corresponding to the shooting interface; 对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态;Identifying the scene image, and determining a scene category corresponding to the scene image and a recommended human posture corresponding to the scene category; 对所述场景图像进行人体关键点检测,确定所述场景图像中的对象以及所述对象的真实人体姿态;Performing human body key point detection on the scene image, and determining the object in the scene image and the real human body posture of the object; 通过在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,引导所述对象根据所述推荐人体姿态进行姿态调整;By displaying the real human body posture and the recommended human body posture in the shooting interface, the object is guided to perform posture adjustment according to the recommended human body posture; 在满足拍摄条件的情况下,得到所述对象的拍摄图像。When the shooting conditions are satisfied, a shot image of the object is obtained. 根据权利要求1所述的方法,其中,所述推荐人体姿态包括单人姿态,所述在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,包括:The method according to claim 1, wherein the recommended human body posture includes a single person posture, and the displaying the real human body posture and the recommended human body posture in the shooting interface comprises: 根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态以及所述单人姿态;或,According to the position of the object in the scene image, the real human body posture and the single-person posture are displayed in the shooting interface; or, 根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态,并在所述拍摄界面的第一指定区域处显示所述单人姿态。According to the position of the object in the scene image, the real human pose is displayed in the shooting interface, and the single-person pose is displayed in a first designated area of the shooting interface. 根据权利要求1或2所述的方法,其中,所述推荐人体姿态包括多人组合姿态,所述方法还包括:The method according to claim 1 or 2, wherein the recommended human body posture comprises a multi-person combined posture, and the method further comprises: 根据所述场景图像中多个对象之间的相对位置,以及所述多人组合姿态中各个姿态之间的相对位置,从所述多人组合姿态中分别确定各对象的对应姿态;According to the relative positions between the multiple objects in the scene image and the relative positions between the respective gestures in the multiple-person combined gestures, the corresponding gestures of each object are respectively determined from the multiple-person combined gestures; 其中,在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,包括:Wherein, displaying the real human body posture and the recommended human body posture in the shooting interface includes: 针对任一对象,根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述对象的真实人体姿态以及所述对象的对应姿态;或,For any object, display the real human body posture of the object and the corresponding posture of the object in the shooting interface according to the position of the object in the scene image; or, 根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述对象的真实人体姿态,并在所述拍摄界面的第二指定区域处显示所述对象的对应姿态。According to the position of the object in the scene image, the real human body posture of the object is displayed in the shooting interface, and the corresponding posture of the object is displayed in the second designated area of the shooting interface. 根据权利要求1-3任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-3, wherein the method further comprises: 确定所述真实人体姿态的第一人体关键点,与所述推荐人体姿态的第二人体关键点之间的关键点对;Determine the key point pair between the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; 通过计算所述关键点对的第一相似度,从所述第一人体关键点中确定出所述第一相似度小于第一预设阈值的第三人体关键点;By calculating the first similarity of the key point pair, a third human body key point whose first similarity is less than a first preset threshold is determined from the first human body key points; 其中,所述在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,包括:Wherein, the displaying the real human body posture and the recommended human body posture in the shooting interface includes: 突出显示所述第三人体关键点所在的区域,其中,突出显示的方式包括高亮、加粗、变更颜色中的至少一种。The region where the third human body key point is located is highlighted, and the highlighting manner includes at least one of highlighting, bolding, and changing color. 根据权利要求1-4任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-4, wherein the method further comprises: 根据所述真实人体姿态与所述推荐人体姿态之间的关键点对的第一相似度,确定所述真实人体姿态与所述推荐人体姿态之间的第二相似度,所述关键点对是根据所述真实人体姿态的第一人体关键点与所述推荐人体姿态的第二人体关键点确定的;According to the first similarity of the key point pair between the real human body posture and the recommended human body posture, the second similarity between the real human body posture and the recommended human body posture is determined, and the key point pair is Determined according to the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; 在所述拍摄界面中显示所述第二相似度。The second similarity is displayed in the shooting interface. 根据权利要求1-5任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-5, wherein the method further comprises: 将所述场景图像输入至区域推荐网络,得到所述场景图像的推荐拍照区域,所述推荐拍照区域用于表征在所述场景图像中推荐拍照的区域,所述区域推荐网络是通过标注拍照区域的第一样本集训练得到的神经网络;Inputting the scene image into a regional recommendation network to obtain a recommended photographing area of the scene image, where the recommended photographing area is used to represent the recommended photographing area in the scene image, and the area recommending network is to mark the photographing area by marking the photographing area. The neural network obtained by training the first sample set of ; 其中,所述得到场景图像的推荐拍照区域之后,所述方法还包括以下至少一种:Wherein, after obtaining the recommended photographing area of the scene image, the method further includes at least one of the following: 在所述拍摄界面中,采用标识符指示所述推荐拍照区域;In the photographing interface, an identifier is used to indicate the recommended photographing area; 在所述拍摄界面的所述推荐拍照区域处显示所述推荐人体姿态;所述推荐拍照区域用于引导所述对象根据所述推荐拍照区域进行位置调整。The recommended human posture is displayed in the recommended photographing area of the photographing interface; the recommended photographing area is used to guide the object to perform position adjustment according to the recommended photographing area. 根据权利要求1-6任一项所述的方法,其中,对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态,包括:The method according to any one of claims 1-6, wherein identifying the scene image, determining a scene category corresponding to the scene image and a recommended human pose corresponding to the scene category, comprising: 通过姿态推荐网络对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态,其中,所述姿态推荐网络是通过标注有样本场景类别以及样本人体姿态的第二样本集训练得到的神经网络,所述第二样本集包括对象上传的样本图像。Identify the scene image through a gesture recommendation network, and determine the scene category corresponding to the scene image and the recommended human pose corresponding to the scene category, wherein the gesture recommendation network is marked with the sample scene category and the sample human body. A neural network obtained by training a second sample set of poses, where the second sample set includes sample images uploaded by the object. 根据权利要求1所述的方法,其中,所述推荐人体姿态包括多个,所述方法还包括:The method according to claim 1, wherein the recommended human body postures include multiple ones, and the method further comprises: 在所述拍摄界面中显示多个推荐人体姿态对应的姿态图;Displaying a plurality of posture diagrams corresponding to the recommended human postures in the shooting interface; 响应于针对所述姿态图的选择操作,根据选中的姿态图,确定与所述选中的姿态图对应的推荐人体姿态并进行显示。In response to the selection operation on the posture map, according to the selected posture map, a recommended human body posture corresponding to the selected posture map is determined and displayed. 根据权利要求1或5所述的方法,其中,所述拍摄条件包括:The method according to claim 1 or 5, wherein the shooting conditions include: 所述真实人体姿态与所述推荐人体姿态之间的第二相似度大于或等于第二预设阈值;或,The second similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold; or, 所述拍摄界面中的拍摄控件被触发。The shooting controls in the shooting interface are triggered. 根据权利要求1-9任一项所述的方法,其中,所述方法还包括:The method according to any one of claims 1-9, wherein the method further comprises: 对所述拍摄图像进行处理,得到处理后的拍摄图像,其中,所述处理包括以下至少一种:对所述拍摄图像中的对象进行美颜处理、对所述拍摄图像添加滤镜;The captured image is processed to obtain a processed captured image, wherein the processing includes at least one of the following: performing beautification processing on an object in the captured image, and adding a filter to the captured image; 所述对所述拍摄图像添加滤镜,包括:根据与所述推荐人体姿态对应的推荐滤镜,调整所述拍摄图像的饱和度、色温、亮度中的至少一种。The adding a filter to the captured image includes: adjusting at least one of saturation, color temperature, and brightness of the captured image according to a recommended filter corresponding to the recommended human posture. 根据权利要求1-10任一项所述的方法,其中,所述方法应用于人工智能教育平台,所述人工智能教育平台与智能硬件连接;The method according to any one of claims 1-10, wherein the method is applied to an artificial intelligence education platform, and the artificial intelligence education platform is connected with intelligent hardware; 其中,所述人工智能教育平台用于编辑实现所述图像拍摄方法的项目代码,并将所述项目代码发送至所述智能硬件;Wherein, the artificial intelligence education platform is used to edit the project code for realizing the image capturing method, and send the project code to the intelligent hardware; 所述智能硬件用于采集场景图像及运行所述项目代码,得到运行结果,并将所述运行结果发送至所述人工智能教育平台,以在所述人工智能教育平台的显示界面中显示所述运行结果。The intelligent hardware is used to collect scene images and run the project code to obtain the running results, and send the running results to the artificial intelligence education platform to display the artificial intelligence education platform on the display interface. operation result. 一种图像拍摄装置,包括:An image capturing device, comprising: 获取部分,被配置为获取与拍摄界面对应的场景图像;an acquisition part, configured to acquire a scene image corresponding to the shooting interface; 识别部分,被配置为对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态;an identification part, configured to identify the scene image, and determine a scene category corresponding to the scene image and a recommended human posture corresponding to the scene category; 检测部分,被配置为对所述场景图像进行人体关键点检测,确定所述场景图像中的对象以及所述对象的真实人体姿态;a detection part, configured to perform human body key point detection on the scene image, and determine the object in the scene image and the real human body posture of the object; 显示部分,被配置为在所述拍摄界面中显示所述真实人体姿态以及所述推荐人体姿态,以引导所述对象根据所述推荐人体姿态进行姿态调整;a display part, configured to display the real human body posture and the recommended human body posture in the shooting interface, so as to guide the object to perform posture adjustment according to the recommended human body posture; 拍摄部分,被配置为在满足拍摄条件的情况下,得到所述对象的拍摄图像。The photographing part is configured to obtain a photographed image of the object when photographing conditions are satisfied. 根据权利要求12所述的装置,其中,所述推荐人体姿态包括单人姿态,所述显示部分,包括:The apparatus according to claim 12, wherein the recommended human posture comprises a single-person posture, and the display part comprises: 第一显示子部分,被配置为根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态以及所述单人姿态;a first display subsection, configured to display the real human body posture and the single-person posture in the shooting interface according to the position of the object in the scene image; 或,or, 第二显示子部分,被配置为根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述真实人体姿态,并在所述拍摄界面的第一指定区域处显示所述单人姿态。The second display subsection is configured to display the real human body posture in the shooting interface according to the position of the object in the scene image, and display the real human body posture in the first designated area of the shooting interface Solo pose. 根据权利要求12或13所述的装置,其中,所述推荐人体姿态包括多人组合姿态,所述装置还包括:The device according to claim 12 or 13, wherein the recommended human body posture comprises a multi-person combined posture, and the device further comprises: 位置确定部分,被配置为根据所述场景图像中多个对象之间的相对位置,以及所述多人组合姿态中各个姿态之间的相对位置,从所述多人组合姿态中分别确定各对象的对应姿态;其中,a position determination part configured to respectively determine each object from the multi-person combined gesture according to the relative positions between the plurality of objects in the scene image and the relative positions between the various gestures in the multi-person combined gesture The corresponding posture of ; where, 所述显示部分,包括:第三显示子部分,被配置为针对任一对象,根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述对象的真实人体姿态以及所述对象的对应姿态;The display part includes: a third display sub-part, which is configured to, for any object, display the real human body posture of the object and all the objects in the shooting interface according to the position of the object in the scene image. the corresponding pose of the object; 或,or, 第四显示子部分,被配置为根据所述对象在所述场景图像中的位置,在所述拍摄界面中显示所述对象的真实人体姿态,并在所述拍摄界面的第二指定区域处显示所述对象的对应姿态。a fourth display subsection, configured to display the real human body posture of the object in the shooting interface according to the position of the object in the scene image, and display it at the second designated area of the shooting interface the corresponding pose of the object. 根据权利要求12-14任一项所述的装置,其中,所述装置还包括:The apparatus of any one of claims 12-14, wherein the apparatus further comprises: 第一确定部分,被配置为确定所述真实人体姿态的第一人体关键点,与所述推荐人体姿态的第二人体关键点之间的关键点对;a first determining part, configured to determine a key point pair between the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; 第二确定部分,被配置为通过计算所述关键点对的第一相似度,从所述第一人体关键点中确定出所述第一相似度小于第一预设阈值的第三人体关键点;The second determination part is configured to determine, from the first human body key points, a third human body key point whose first similarity is less than a first preset threshold by calculating the first similarity of the key point pair ; 其中,所述显示部分,包括:Wherein, the display part includes: 突出显示子部分,被配置为突出显示所述第三人体关键点所在的区域,其中,突出显示的方式包括高亮、加粗、变更颜色中的至少一种。The highlighting subsection is configured to highlight the region where the third human body key point is located, wherein the highlighting manner includes at least one of highlighting, bolding, and changing color. 根据权利要求12-15任一项所述的装置,其中,所述装置还包括:The apparatus of any one of claims 12-15, wherein the apparatus further comprises: 第三确定部分,被配置为根据所述真实人体姿态与所述推荐人体姿态之间的关键点对的第一相似度,确定所述真实人体姿态与所述推荐人体姿态之间的第二相似度,所述关键点对是根据所述真实人体姿态的第一人体关键点与所述推荐人体姿态的第二人体关键点确定的;A third determining part is configured to determine a second similarity between the real human body posture and the recommended human body posture according to the first similarity of key point pairs between the real human body posture and the recommended human body posture The key point pair is determined according to the first human body key point of the real human body posture and the second human body key point of the recommended human body posture; 相似度显示部分,被配置为在所述拍摄界面中显示所述第二相似度。The similarity display part is configured to display the second similarity in the shooting interface. 根据权利要求12-16任一项所述的装置,其中,所述装置还包括:The apparatus of any one of claims 12-16, wherein the apparatus further comprises: 区域确定部分,被配置为将所述场景图像输入至区域推荐网络,得到所述场景图像的推荐拍照区域,所述推荐拍照区域用于表征在所述场景图像中推荐拍照的区域,所述区域推荐网络是通过标注拍照区域的第一样本集训练得到的神经网络;an area determination part, configured to input the scene image into an area recommendation network to obtain a recommended photographing area of the scene image, where the recommended photographing area is used to represent a recommended photographing area in the scene image, the area The recommendation network is a neural network trained by marking the first sample set of the photo area; 其中,所述装置还包括:Wherein, the device also includes: 区域显示部分,被配置为在所述拍摄界面中,采用标识符指示所述推荐拍照区域;an area display part, configured to use an identifier to indicate the recommended photographing area in the photographing interface; 或者,在所述拍摄界面的所述推荐拍照区域处显示所述推荐人体姿态;所述推荐拍照区域用于引导所述对象根据所述推荐拍照区域进行位置调整;Or, displaying the recommended human posture in the recommended photographing area of the photographing interface; the recommended photographing area is used to guide the object to perform position adjustment according to the recommended photographing area; 或者,所述区域显示部分,还被配置为在所述拍摄界面中,采用标识符指示所述推荐拍照区域;在所述拍摄界面的所述推荐拍照区域处显示所述推荐人体姿态。Alternatively, the area display part is further configured to use an identifier to indicate the recommended photographing area in the photographing interface; and display the recommended human posture at the recommended photographing area on the photographing interface. 根据权利要求12-17任一项所述的装置,其中,The apparatus of any of claims 12-17, wherein, 所述识别部分,还被配置为:通过姿态推荐网络对所述场景图像进行识别,确定所述场景图像对应的场景类别以及与所述场景类别对应的推荐人体姿态,其中,所述姿态推荐网络是通过标注有样本场景类别以及样本人体姿态的第二样本集训练得到的神经网络,所述第二样本集包括对象上传的样本图像。The identification part is further configured to: identify the scene image through a gesture recommendation network, and determine a scene category corresponding to the scene image and a recommended human posture corresponding to the scene category, wherein the gesture recommendation network It is a neural network obtained by training a second sample set marked with sample scene categories and sample body poses, and the second sample set includes sample images uploaded by objects. 根据权利要求12所述的装置,其中,所述推荐人体姿态包括多个,所述装置还包括:The device according to claim 12, wherein the recommended human body posture comprises a plurality of, and the device further comprises: 姿态图显示部分,被配置为在所述拍摄界面中显示多个推荐人体姿态对应的姿态图;选择部分,被配置为响应于针对所述姿态图的选择操作,根据选中的姿态图,确定与所述选中的姿态图对应的推荐人体姿态并进行显示。The posture map display part is configured to display a plurality of posture maps corresponding to the recommended human body postures in the shooting interface; the selection part is configured to respond to the selection operation for the posture map, according to the selected posture map, determine and The recommended human body posture corresponding to the selected posture map is displayed and displayed. 根据权利要求12或16所述的装置,其中,所述拍摄条件包括:所述真实人体姿态与所述推荐人体姿态之间的第二相似度大于或等于第二预设阈值;或,所述拍摄界面中的拍摄控件被触发。The apparatus according to claim 12 or 16, wherein the shooting condition comprises: a second similarity between the real human body posture and the recommended human body posture is greater than or equal to a second preset threshold; or, the The capture controls in the capture interface are triggered. 根据权利要求12-20任一项所述的装置,其中,所述装置还包括:The apparatus of any one of claims 12-20, wherein the apparatus further comprises: 处理部分,被配置为对所述拍摄图像进行处理,得到处理后的拍摄图像,其中,所述处理包括以下至少一种:对所述拍摄图像中的对象进行美颜处理、对所述拍摄图像添加滤镜;所述对所述拍摄图像添加滤镜,包括:根据与所述推荐人体姿态对应的推荐滤镜,调整所述拍摄图像的饱和度、色温、亮度中的至少一种。The processing part is configured to process the captured image to obtain a processed captured image, wherein the processing includes at least one of the following: performing beauty processing on the object in the captured image, performing processing on the captured image Adding a filter; the adding a filter to the captured image includes: adjusting at least one of saturation, color temperature, and brightness of the captured image according to a recommended filter corresponding to the recommended human posture. 根据权利要求12-21任一项所述的装置,其中,所述装置应用于人工智能教育平台,所述人工智能教育平台与智能硬件连接;其中,所述人工智能教育平台用于编辑实现所述图像拍摄装置的项目代码,并将所述项目代码发送至所述智能硬件;所述智能硬件用于采集场景图像及运行所述项目代码,得到运行结果,并将所述运行结果发送至所述人工智能教育平台,以在所述人工智能教育平台的显示界面中显示所述运行结果。The device according to any one of claims 12-21, wherein the device is applied to an artificial intelligence education platform, and the artificial intelligence education platform is connected with intelligent hardware; wherein, the artificial intelligence education platform is used to edit and implement all the project code of the image capturing device, and send the project code to the intelligent hardware; the intelligent hardware is used to collect scene images and run the project code to obtain the operation result, and send the operation result to the intelligent hardware. The artificial intelligence education platform is used to display the operation result in the display interface of the artificial intelligence education platform. 一种电子设备,包括:An electronic device comprising: 处理器;processor; 被配置为存储处理器可执行指令的存储器;a memory configured to store processor-executable instructions; 其中,所述处理器被配置为调用所述存储器存储的指令,以执行权利要求1至11中任意一项所述的方法。wherein the processor is configured to invoke instructions stored in the memory to perform the method of any one of claims 1-11. 一种计算机可读存储介质,其上存储有计算机程序指令,所述计算机程序指令被处理器执行时实现权利要求1至11中任意一项所述的方法。A computer-readable storage medium having computer program instructions stored thereon, the computer program instructions implementing the method of any one of claims 1 to 11 when executed by a processor. 一种计算机程序,包括计算机可读代码,在所述计算机可读代码在电子设备中运行的情况下,所述电子设备中的处理器执行时实现权利要求1至11中任意一项所述的方法。A computer program, comprising computer-readable codes, when the computer-readable codes are executed in an electronic device, a processor in the electronic device implements the method described in any one of claims 1 to 11 when executed. method.
PCT/CN2021/120722 2021-04-28 2021-09-26 Image photographing method and apparatus, electronic device, and computer readable storage medium Ceased WO2022227393A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020227018878A KR20220149503A (en) 2021-04-28 2021-09-26 Image capturing method and apparatus, electronic device and computer readable storage medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110467998.6 2021-04-28
CN202110467998.6A CN113194254A (en) 2021-04-28 2021-04-28 Image shooting method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2022227393A1 true WO2022227393A1 (en) 2022-11-03

Family

ID=76979970

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/120722 Ceased WO2022227393A1 (en) 2021-04-28 2021-09-26 Image photographing method and apparatus, electronic device, and computer readable storage medium

Country Status (3)

Country Link
KR (1) KR20220149503A (en)
CN (1) CN113194254A (en)
WO (1) WO2022227393A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117690184A (en) * 2023-12-12 2024-03-12 深圳市创想数维科技有限公司 Human body gesture recognition and interaction method and system applied to virtual reality
US12094032B2 (en) * 2022-09-16 2024-09-17 Adobe Inc. Pose recommendation and real-time guidance for user-generated content

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111627115A (en) * 2020-05-26 2020-09-04 浙江商汤科技开发有限公司 Interactive group photo method and device, interactive device and computer storage medium
CN113194254A (en) * 2021-04-28 2021-07-30 上海商汤智能科技有限公司 Image shooting method and device, electronic equipment and storage medium
CN113923355B (en) * 2021-09-30 2024-08-13 上海商汤临港智能科技有限公司 Vehicle and image shooting method, device, equipment, and storage medium
CN113888454A (en) * 2021-10-11 2022-01-04 深圳市商汤科技有限公司 Method and device for generating visitor registration photo, electronic equipment and storage medium
CN114285988B (en) * 2021-12-03 2024-04-09 维沃移动通信有限公司 Display method, device, electronic device and storage medium
CN114598817B (en) * 2022-03-04 2024-06-18 无锡乐骐科技股份有限公司 Man-machine interaction judgment method and device based on multi-man interaction judgment
CN116823869A (en) * 2022-03-18 2023-09-29 华为技术有限公司 Background replacement methods and electronic devices
CN116996761A (en) * 2022-04-14 2023-11-03 北京字跳网络技术有限公司 Photography methods, devices, equipment, storage media and program products
CN120495598B (en) * 2025-06-10 2025-09-23 芯知科技(江苏)有限公司 Data intelligent processing method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107734251A (en) * 2017-09-29 2018-02-23 维沃移动通信有限公司 A kind of photographic method and mobile terminal
CN110543578A (en) * 2019-08-09 2019-12-06 华为技术有限公司 object recognition method and device
CN110868538A (en) * 2019-11-11 2020-03-06 三星电子(中国)研发中心 Recommended method and electronic device for photographing poses
US20200106955A1 (en) * 2018-09-29 2020-04-02 Apple Inc. Devices, Methods, and Graphical User Interfaces for Assisted Photo-Taking
CN111277759A (en) * 2020-02-27 2020-06-12 Oppo广东移动通信有限公司 Composition prompting method, device, storage medium and electronic device
CN112351185A (en) * 2019-08-07 2021-02-09 华为技术有限公司 Photographing method and mobile terminal
CN113194254A (en) * 2021-04-28 2021-07-30 上海商汤智能科技有限公司 Image shooting method and device, electronic equipment and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107239728B (en) * 2017-01-04 2021-02-02 赛灵思电子科技(北京)有限公司 UAV interaction device and method based on deep learning attitude estimation
CN109727208A (en) * 2018-12-10 2019-05-07 北京达佳互联信息技术有限公司 Filter recommended method, device, electronic equipment and storage medium
CN111062276A (en) * 2019-12-03 2020-04-24 广州极泽科技有限公司 Human body posture recommendation method and device based on human-computer interaction, machine readable medium and equipment

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107734251A (en) * 2017-09-29 2018-02-23 维沃移动通信有限公司 A kind of photographic method and mobile terminal
US20200106955A1 (en) * 2018-09-29 2020-04-02 Apple Inc. Devices, Methods, and Graphical User Interfaces for Assisted Photo-Taking
CN112351185A (en) * 2019-08-07 2021-02-09 华为技术有限公司 Photographing method and mobile terminal
CN110543578A (en) * 2019-08-09 2019-12-06 华为技术有限公司 object recognition method and device
CN110868538A (en) * 2019-11-11 2020-03-06 三星电子(中国)研发中心 Recommended method and electronic device for photographing poses
CN111277759A (en) * 2020-02-27 2020-06-12 Oppo广东移动通信有限公司 Composition prompting method, device, storage medium and electronic device
CN113194254A (en) * 2021-04-28 2021-07-30 上海商汤智能科技有限公司 Image shooting method and device, electronic equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12094032B2 (en) * 2022-09-16 2024-09-17 Adobe Inc. Pose recommendation and real-time guidance for user-generated content
CN117690184A (en) * 2023-12-12 2024-03-12 深圳市创想数维科技有限公司 Human body gesture recognition and interaction method and system applied to virtual reality

Also Published As

Publication number Publication date
KR20220149503A (en) 2022-11-08
CN113194254A (en) 2021-07-30

Similar Documents

Publication Publication Date Title
WO2022227393A1 (en) Image photographing method and apparatus, electronic device, and computer readable storage medium
JP6574937B2 (en) COMMUNICATION SYSTEM, CONTROL METHOD, AND STORAGE MEDIUM
US11030733B2 (en) Method, electronic device and storage medium for processing image
WO2022179026A1 (en) Image processing method and apparatus, electronic device, and storage medium
WO2021023059A1 (en) Photographing method and mobile terminal
CN108289169A (en) Image pickup method, device, electronic equipment and storage medium
JP2016531362A (en) Skin color adjustment method, skin color adjustment device, program, and recording medium
CN107133576A (en) Age of user recognition methods and device
CN112702521B (en) Image shooting method and device, electronic equipment and computer readable storage medium
CN107018333A (en) Shoot template and recommend method, device and capture apparatus
CN115529378B (en) A video processing method and related device
CN110717399A (en) Face recognition method and electronic terminal equipment
CN114387445A (en) Object key point identification method and device, electronic equipment and storage medium
CN107018330A (en) A kind of guidance method and device of taking pictures in real time
WO2020093798A1 (en) Method and apparatus for displaying target image, terminal, and storage medium
CN115623313B (en) Image processing method, image processing device, electronic device, and storage medium
CN115225756A (en) Method, photographing method and device for determining target object
CN113781408B (en) Intelligent guiding system and method for image shooting
CN114758037B (en) Image generation method, device, terminal equipment and storage medium
CN112099639A (en) Display attribute adjusting method and device, display equipment and storage medium
CN105138763A (en) Method for real scene and reality information superposition in augmented reality
CN113822964A (en) Image optimization rendering method, device, device and storage medium
CN113362434A (en) Image processing method and device, electronic equipment and storage medium
CN119012001A (en) Auxiliary shooting method, auxiliary shooting device, electronic equipment and storage medium
CN112533071B (en) Image processing method and device and electronic equipment

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 2022527185

Country of ref document: JP

Kind code of ref document: A

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21938857

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21938857

Country of ref document: EP

Kind code of ref document: A1

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 17/05/2024)

122 Ep: pct application non-entry in european phase

Ref document number: 21938857

Country of ref document: EP

Kind code of ref document: A1