CN116711303A

CN116711303A - Three-dimensional video call method and electronic device

Info

Publication number: CN116711303A
Application number: CN202180087392.8A
Authority: CN
Inventors: 雷涛; 石洁珂; 李宗岩; 檀珠峰
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2021-01-06
Filing date: 2021-01-06
Publication date: 2023-09-05
Also published as: WO2022147698A1

Abstract

The application provides a three-dimensional video call method and an electronic device that enable three-dimensional video calls while reducing the latency of obtaining the three-dimensional video. The electronic device includes a face image acquisition module, a video encoding module, and a network transmission module. The face image acquisition module is configured to divide a face depth image into a plurality of subunits including a first subunit and a second subunit; divide a two-dimensional face image into a plurality of subunits including a third subunit and a fourth subunit; send the first subunit and the third subunit to the video encoding module; and then send the second subunit and the fourth subunit to the video encoding module. The video encoding module is configured to obtain a first coding unit from the first subunit and the third subunit and send it to the network transmission module, and then obtain a second coding unit from the second subunit and the fourth subunit and send it to the network transmission module. The network transmission module is configured to send the first coding unit to a second electronic device and, after the first coding unit has been sent, send the second coding unit to the second electronic device.

Description

Three-dimensional video call method and electronic device

Technical field

The present application relates to the field of communications, and in particular to a three-dimensional (3D) video call method and an electronic device.

Background

With the development of video codec technology, video calling has become one of the more popular ways of socializing. Existing video call solutions can already implement 3D video calls. Specifically, the video sending device first obtains a two-dimensional (2D) image and a depth image, compresses them, and sends them to a server. The server decodes the received 2D image and depth image, generates a 3D image from the decoded images, compresses the 3D image, and sends it to the video receiving device, thereby implementing a 3D video call between users.

In this solution, the server must decode the received 2D image and depth image and then encode the generated 3D image, which increases the latency with which the video receiving device obtains the 3D video.

Summary of the invention

Embodiments of the present application provide a 3D video call method and an electronic device that can reduce the latency of obtaining 3D video during a video call. To achieve this, the present application adopts the following technical solutions.

In a first aspect, an electronic device is provided. The electronic device includes a face image acquisition module, a video encoding module, and a network transmission module. The face image acquisition module is configured to: obtain a face depth image and a 2D face image; divide the face depth image into a plurality of subunits including a first subunit and a second subunit; divide the 2D face image into a plurality of subunits including a third subunit and a fourth subunit; send the first subunit and the third subunit to the video encoding module; and, after sending the first subunit and the third subunit, send the second subunit and the fourth subunit to the video encoding module. The first subunit corresponds to the third subunit, and the second subunit corresponds to the fourth subunit. The video encoding module is configured to: obtain a first coding unit from the first subunit and the third subunit, and send the first coding unit to the network transmission module; and, after obtaining and sending the first coding unit, obtain a second coding unit from the second subunit and the fourth subunit, and send the second coding unit to the network transmission module. The network transmission module is configured to send the first coding unit to a second electronic device and, after sending the first coding unit, send the second coding unit to the second electronic device.
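The division and pairing described in the first aspect can be sketched as follows. This is a minimal illustration, not the claimed implementation: the text does not say how an image is divided, so splitting into horizontal strips of rows is an assumption, and the names `split_rows` and `paired_subunits` are hypothetical.

```python
from typing import Iterator, List, Sequence, Tuple

Image = List[List[int]]  # rows of pixel values (hypothetical representation)


def split_rows(image: Sequence[Sequence[int]], rows_per_subunit: int) -> List[Image]:
    """Split an image into horizontal strips of at most rows_per_subunit rows.

    Assumption: a "subunit" is a strip of consecutive rows.
    """
    if rows_per_subunit <= 0:
        raise ValueError("rows_per_subunit must be positive")
    return [
        [list(row) for row in image[start:start + rows_per_subunit]]
        for start in range(0, len(image), rows_per_subunit)
    ]


def paired_subunits(
    depth: Sequence[Sequence[int]],
    color: Sequence[Sequence[int]],
    rows_per_subunit: int,
) -> Iterator[Tuple[int, Image, Image]]:
    """Yield (index, depth subunit, corresponding 2D subunit), one pair at a time."""
    if len(depth) != len(color):
        raise ValueError("depth and 2D images must have the same number of rows")
    depth_units = split_rows(depth, rows_per_subunit)
    color_units = split_rows(color, rows_per_subunit)
    for index, (d, c) in enumerate(zip(depth_units, color_units)):
        # A consumer (e.g. an encoder) can start on this pair before the
        # next pair has been produced.
        yield index, d, c


# Toy 4x4 images standing in for the face depth image and the 2D face image.
depth_image = [[10 * r + c for c in range(4)] for r in range(4)]
color_image = [[100 + 10 * r + c for c in range(4)] for r in range(4)]
pairs = list(paired_subunits(depth_image, color_image, rows_per_subunit=2))
```

Using a generator keeps the "send one pair, then the next" ordering explicit: the first pair (first and third subunits) is handed over before the second pair (second and fourth subunits) exists.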

With the electronic device of the first aspect, the face image acquisition module divides the face depth image and the 2D face image each into a plurality of subunits, and sends one pair of subunits to the video encoding module before sending the next pair. Each pair consists of one subunit of the face depth image and the corresponding subunit of the 2D face image. This shortens the time the video encoding module waits to receive image data. The video encoding module obtains one coding unit from a pair of subunits and sends it to the network transmission module before processing the next pair in the same way, which shortens the time the network transmission module waits to receive coding units. The network transmission module receives one coding unit and sends it to the second electronic device before receiving and sending the next, which shortens the time the second electronic device waits to receive coding units and thereby reduces the latency with which the second electronic device obtains the 3D video.
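The latency argument can be made concrete with a small calculation. The sketch below compares handing a whole frame from stage to stage with handing over one subunit at a time. It assumes every stage takes a fixed time per subunit and that stages run concurrently; the stage times and subunit count are hypothetical numbers chosen for illustration, not values from the application.

```python
from typing import Sequence


def sequential_latency(stage_ms: Sequence[float], n_subunits: int) -> float:
    """Time to finish a frame when each stage processes the whole frame
    (all n_subunits) before passing anything to the next stage."""
    return sum(t * n_subunits for t in stage_ms)


def pipelined_latency(stage_ms: Sequence[float], n_subunits: int) -> float:
    """Time to finish a frame when stages pass on one subunit at a time.

    The first subunit traverses every stage once; after that the slowest
    stage paces the remaining n_subunits - 1 subunits.
    """
    return sum(stage_ms) + (n_subunits - 1) * max(stage_ms)


# Hypothetical per-subunit times (ms) for three stages, e.g. splitting,
# encoding, and transmission, with a frame divided into 8 subunits.
stage_ms = [2, 3, 4]
whole_frame = sequential_latency(stage_ms, 8)
per_subunit = pipelined_latency(stage_ms, 8)
print(whole_frame, per_subunit)  # prints "72 37"
```

Under these assumed numbers the frame completes in 37 ms instead of 72 ms, and the first subunit is available downstream after 9 ms rather than after the whole frame.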

In a possible design, the face image acquisition module is specifically configured to: receive face depth information; receive 2D face information; obtain the face depth image from the face depth information; and obtain the 2D face image from the 2D face information. Because the face image acquisition module processes the face depth information into a face depth image, a true 3D video can be displayed and a 3D video call can be implemented.

In a possible design, the video encoding module is specifically configured to: encode the third subunit to obtain a third coding unit; obtain the first coding unit from the first subunit and the third coding unit; encode the fourth subunit to obtain a fourth coding unit; and obtain the second coding unit from the second subunit and the fourth coding unit. In other words, the video encoding module may first encode a subunit of the 2D face image to obtain a coding unit, and then perform hybrid encoding of a subunit of the face depth image with that coding unit. Encoding a subunit of the face depth image and a subunit of the 2D face image into the same coding unit reduces transmission complexity.
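One way to picture this two-step encoding is shown below. It is only a sketch under stated assumptions: `zlib` stands in for the real video encoder, the depth subunit is carried uncompressed, and the 10-byte header layout (`HEADER`) and the function name `encode_pair` are hypothetical. The application does not specify the actual bitstream format at this point.

```python
import struct
import zlib

# Hypothetical header: subunit index (uint16), length of the coded 2D
# subunit (uint32), length of the depth subunit (uint32), network byte order.
HEADER = struct.Struct("!HII")


def encode_pair(index: int, depth_subunit: bytes, color_subunit: bytes) -> bytes:
    """Build one coding unit from a depth subunit and its 2D subunit.

    Step 1: encode the 2D subunit (zlib is a stand-in for a video encoder).
    Step 2: combine the depth subunit with that coded data in one unit.
    """
    coded_color = zlib.compress(color_subunit)
    header = HEADER.pack(index, len(coded_color), len(depth_subunit))
    return header + coded_color + depth_subunit


# Toy data standing in for the first (depth) and third (2D) subunits.
first_subunit = b"\x01\x02\x03\x04"
third_subunit = b"\x10" * 16
first_coding_unit = encode_pair(0, first_subunit, third_subunit)
```

Because both subunits travel in a single unit, the sender transmits one stream rather than separate depth and 2D streams that would need to be kept in step.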

It should be noted that this application does not limit the order in which the video encoding module encodes the subunits of the 2D face image and the subunits of the face depth image. For example, the video encoding module may first encode the first subunit, and then perform hybrid encoding of the encoded first subunit with the third subunit to obtain the first coding unit.

In a possible design, the electronic device of the first aspect may further include a 3D face generation module and a display module. The 3D face generation module is configured to: obtain a first 3D face sub-image from the first subunit and the third subunit, and send the first 3D face sub-image to the display module; and, after obtaining and sending the first 3D face sub-image, obtain a second 3D face sub-image from the second subunit and the fourth subunit, and send the second 3D face sub-image to the display module. The display module is configured to superimpose the first 3D face sub-image on a 2D scene image and, after doing so, superimpose the second 3D face sub-image on the 2D scene image. Because the 3D face generation module obtains one 3D face sub-image from a pair of subunits and sends it to the display module before processing the next pair in the same way, the display module waits less, which reduces the latency with which the electronic device obtains the 3D face image and hence the 3D video.

In a second aspect, an electronic device is provided. The electronic device includes a network transmission module, a video decoding module, a 3D face generation module, and a display module. The network transmission module is configured to: receive a first coding unit from a first electronic device, and send the first coding unit to the video decoding module; and, after receiving and sending the first coding unit, receive a second coding unit from the first electronic device, and send the second coding unit to the video decoding module. The video decoding module is configured to: obtain a first subunit and a third subunit from the first coding unit; and, after obtaining the first subunit and the third subunit, obtain a second subunit and a fourth subunit from the second coding unit. The first subunit and the second subunit are subunits of a face depth image, the third subunit and the fourth subunit are subunits of a 2D face image, the first subunit corresponds to the third subunit, and the second subunit corresponds to the fourth subunit. The 3D face generation module is configured to: obtain a first 3D face sub-image from the first subunit and the third subunit, and send the first 3D face sub-image to the display module; and, after obtaining and sending the first 3D face sub-image, obtain a second 3D face sub-image from the second subunit and the fourth subunit, and send the second 3D face sub-image to the display module. The display module is configured to superimpose the first 3D face sub-image on a 2D scene image and, after doing so, superimpose the second 3D face sub-image on the 2D scene image.
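The generation and superimposition steps can be illustrated with a toy model. The sketch below treats a 3D face sub-image as a list of `(x, y, z, color)` points built from a depth strip and its matching 2D strip, and draws each sub-image onto the scene as soon as it is available. The point representation, the rule that depth 0 means "no face data", and the orthographic drawing are all assumptions made for illustration; the application does not define how the 3D sub-image is constructed or rendered.

```python
from typing import List, Sequence, Tuple

Point = Tuple[int, int, int, int]  # (x, y, depth, color), hypothetical layout


def make_3d_subimage(
    depth_rows: Sequence[Sequence[int]],
    color_rows: Sequence[Sequence[int]],
    first_row: int,
) -> List[Point]:
    """Combine a depth subunit and its 2D subunit into colored 3D points."""
    points: List[Point] = []
    for dy, (drow, crow) in enumerate(zip(depth_rows, color_rows)):
        for x, (z, color) in enumerate(zip(drow, crow)):
            if z > 0:  # assumption: depth 0 marks pixels without face data
                points.append((x, first_row + dy, z, color))
    return points


def superimpose(scene: List[List[int]], points: Sequence[Point]) -> None:
    """Draw points onto the 2D scene in place (depth is ignored here,
    which is a simplification of real 3D rendering)."""
    for x, y, _z, color in points:
        scene[y][x] = color


scene = [[0, 0] for _ in range(4)]  # 4-row, 2-column scene image

# First pair of subunits (rows 0-1) is rendered as soon as it arrives.
sub1 = make_3d_subimage([[0, 5], [6, 0]], [[1, 2], [3, 4]], first_row=0)
superimpose(scene, sub1)
after_first = [row[:] for row in scene]

# Second pair (rows 2-3) is rendered afterwards.
sub2 = make_3d_subimage([[7, 7], [0, 8]], [[5, 6], [7, 8]], first_row=2)
superimpose(scene, sub2)
```

After the first call, only the top half of the scene carries face pixels; the bottom half is filled in when the second sub-image arrives, which mirrors the ordering in the text.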

With the electronic device of the second aspect, the network transmission module receives one coding unit and sends it to the video decoding module before receiving and sending the next, which shortens the time the video decoding module waits. The video decoding module decodes one coding unit to obtain a subunit of the face depth image and a subunit of the 2D face image, sends them to the 3D face generation module, and then processes the next coding unit in the same way, which shortens the time the 3D face generation module waits. The 3D face generation module obtains one 3D face sub-image from a pair of subunits and sends it to the display module before processing the next pair in the same way, which shortens the time the display module waits. Together these reduce the latency with which the electronic device obtains the 3D face image, and hence the 3D video.

In a possible design, the video decoding module is further configured to: parse the first coding unit to obtain the first subunit and a third coding unit; decode the third coding unit to obtain the third subunit; after decoding the third coding unit, parse the second coding unit to obtain the second subunit and a fourth coding unit; and decode the fourth coding unit to obtain the fourth subunit. The video decoding module can thus recover a pair of subunits from a single coding unit, which reduces the complexity of obtaining the subunits of the face depth image and of the 2D face image.
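The parse-then-decode order can be sketched as follows. This is an illustration only: the container layout (`HEADER`, a 10-byte header followed by the coded 2D data and then the raw depth data) is hypothetical, and `zlib` stands in for the real video decoder. The sample unit is built inline so the example is self-contained.

```python
import struct
import zlib
from typing import Tuple

# Hypothetical header: subunit index (uint16), length of the coded 2D
# subunit (uint32), length of the depth subunit (uint32), network byte order.
HEADER = struct.Struct("!HII")


def parse_coding_unit(unit: bytes) -> Tuple[int, bytes, bytes]:
    """Parse a coding unit into (index, depth subunit, coded 2D subunit)."""
    index, coded_len, depth_len = HEADER.unpack_from(unit, 0)
    start = HEADER.size
    coded_color = unit[start:start + coded_len]
    depth = unit[start + coded_len:start + coded_len + depth_len]
    if len(coded_color) != coded_len or len(depth) != depth_len:
        raise ValueError("truncated coding unit")
    return index, depth, coded_color


def decode_pair(unit: bytes) -> Tuple[int, bytes, bytes]:
    """Recover (index, depth subunit, 2D subunit) from one coding unit."""
    index, depth, coded_color = parse_coding_unit(unit)  # parsing step
    color = zlib.decompress(coded_color)                 # decoding step
    return index, depth, color


# Build a sample coding unit in the assumed layout, then recover the pair.
sample_color = bytes(range(8))
sample_depth = b"\x05\x06"
coded = zlib.compress(sample_color)
sample_unit = HEADER.pack(7, len(coded), len(sample_depth)) + coded + sample_depth
result = decode_pair(sample_unit)
```

Parsing yields the depth subunit directly, and only the embedded 2D coding unit needs a decoder pass, which matches the two steps named in the text.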

In a possible design, the electronic device of the second aspect may further include a touch module. The touch module is configured to detect an adjustment action. The display module is configured to adjust, according to the adjustment action, the angle at which the face in the 3D face image is displayed. The electronic device can thus display the face in the 3D video from different angles.
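As an illustration of adjusting the displayed angle, the sketch below maps a horizontal swipe to a yaw angle and rotates 3D face points about the vertical axis. Everything here is assumed for the example: the application does not say what the adjustment action is or how the angle is computed, and `swipe_to_yaw`, `rotate_yaw`, and the 0.25 degrees-per-pixel factor are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Point3D = Tuple[float, float, float]  # (x, y, z)


def swipe_to_yaw(dx_pixels: float, degrees_per_pixel: float = 0.25) -> float:
    """Map a horizontal swipe distance to a yaw angle (hypothetical mapping)."""
    return dx_pixels * degrees_per_pixel


def rotate_yaw(points: Sequence[Point3D], yaw_degrees: float) -> List[Point3D]:
    """Rotate points about the vertical (y) axis by yaw_degrees."""
    a = math.radians(yaw_degrees)
    c, s = math.cos(a), math.sin(a)
    return [(x * c + z * s, y, -x * s + z * c) for x, y, z in points]


face_points = [(1.0, 0.0, 0.0), (0.0, 2.0, 1.0)]
yaw = swipe_to_yaw(360)  # a 360-pixel swipe gives 90 degrees here
rotated = rotate_yaw(face_points, yaw)
```

The rotated points would then be redrawn by the display step; points on the rotation axis keep their x and z components only up to the rotation, while the y coordinate is never changed by a yaw.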

In a third aspect, a 3D video call method is provided. The method includes: obtaining a face depth image and a 2D face image; dividing the face depth image into a plurality of subunits including a first subunit and a second subunit; dividing the 2D face image into a plurality of subunits including a third subunit and a fourth subunit; obtaining a first coding unit from the first subunit and the third subunit; after obtaining the first coding unit, obtaining a second coding unit from the second subunit and the fourth subunit; sending the first coding unit to a second electronic device; and, after sending the first coding unit, sending the second coding unit to the second electronic device. The first subunit corresponds to the third subunit, and the second subunit corresponds to the fourth subunit.

In a possible design, obtaining the face depth image and the 2D face image may include: receiving face depth information; receiving 2D face information; obtaining the face depth image from the face depth information; and obtaining the 2D face image from the 2D face information.

In a possible design, the 3D video call method of the third aspect may further include: encoding the third subunit to obtain a third coding unit; obtaining the first coding unit from the first subunit and the third coding unit; encoding the fourth subunit to obtain a fourth coding unit; and obtaining the second coding unit from the second subunit and the fourth coding unit.

In a possible design, the 3D video call method of the third aspect may further include: obtaining a first 3D face sub-image from the first subunit and the third subunit; after obtaining the first 3D face sub-image, obtaining a second 3D face sub-image from the second subunit and the fourth subunit; superimposing the first 3D face sub-image on a 2D scene image; and, after doing so, superimposing the second 3D face sub-image on the 2D scene image.

For the technical effects of the 3D video call method of the third aspect, refer to the technical effects of the electronic device in any implementation of the first aspect. Details are not repeated here.

In a fourth aspect, a 3D video call method is provided. The method includes: receiving a first coding unit from a first electronic device; after receiving the first coding unit, receiving a second coding unit from the first electronic device; obtaining a first subunit and a third subunit from the first coding unit; after obtaining the first subunit and the third subunit, obtaining a second subunit and a fourth subunit from the second coding unit; obtaining a first 3D face sub-image from the first subunit and the third subunit; after obtaining the first 3D face sub-image, obtaining a second 3D face sub-image from the second subunit and the fourth subunit; superimposing the first 3D face sub-image on a 2D scene image; and, after doing so, superimposing the second 3D face sub-image on the 2D scene image. The first subunit and the second subunit are subunits of a face depth image, the third subunit and the fourth subunit are subunits of a 2D face image, the first subunit corresponds to the third subunit, and the second subunit corresponds to the fourth subunit.

In a possible design, the 3D video call method of the fourth aspect may further include: parsing the first coding unit to obtain the first subunit and a third coding unit; decoding the third coding unit to obtain the third subunit; after decoding the third coding unit, parsing the second coding unit to obtain the second subunit and a fourth coding unit; and decoding the fourth coding unit to obtain the fourth subunit.

In a possible design, the 3D video call method of the fourth aspect may further include: detecting an adjustment action and, in response to the adjustment action, adjusting the angle at which the face in the 3D face image is displayed.

For the technical effects of the 3D video call method of the fourth aspect, refer to the technical effects of the electronic device in any implementation of the second aspect. Details are not repeated here.

In a fifth aspect, an electronic device is provided. The electronic device includes a processor coupled to a memory. The memory is configured to store a computer program. The processor is configured to execute the computer program stored in the memory, so that the electronic device performs the 3D video call method of any possible implementation of the third aspect or the fourth aspect.

In a possible design, the electronic device of the fifth aspect may further include a transceiver. The transceiver may be a transceiver circuit or an input/output port, and may be used by the electronic device to communicate with other devices.

In the present application, the electronic device of the fifth aspect may be an electronic device, or a chip or chip system disposed inside an electronic device.

For the technical effects of the electronic device of the fifth aspect, refer to the technical effects of the 3D video call method in any implementation of the third aspect or the fourth aspect. Details are not repeated here.

In a sixth aspect, a 3D video call system is provided. The system includes the electronic device of any possible implementation of the first aspect and the electronic device of any possible implementation of the second aspect.

In a seventh aspect, a computer-readable storage medium is provided. The computer-readable storage medium stores a computer program or instructions that, when run on a computer, cause the computer to perform the 3D video call method of any possible implementation of the third aspect or the fourth aspect.

In an eighth aspect, a computer program product is provided. The computer program product includes a computer program or instructions that, when run on a computer, cause the computer to perform the 3D video call method of any possible implementation of the third aspect or the fourth aspect.

Brief description of the drawings

FIG. 1 is a schematic architectural diagram of a 3D video call system provided by an embodiment of the present application;

FIG. 2 is a first schematic structural diagram of an electronic device provided by an embodiment of the present application;

FIG. 3 is a block diagram of the software structure of an electronic device provided by an embodiment of the present application;

FIG. 4 is a second schematic structural diagram of an electronic device provided by an embodiment of the present application;

FIG. 5 is a schematic structural diagram of a face image acquisition module provided by an embodiment of the present application;

FIG. 6 is a third schematic structural diagram of an electronic device provided by an embodiment of the present application;

FIG. 7 is a schematic flowchart of a 3D video call method provided by an embodiment of the present application;

FIG. 8 is a schematic diagram of a face depth image and a 2D face image provided by an embodiment of the present application;

FIG. 9 is a first schematic diagram of an application of a first electronic device provided by an embodiment of the present application;

FIG. 10 is a second schematic diagram of an application of the first electronic device provided by an embodiment of the present application;

FIG. 11 is a second schematic flowchart of a 3D video call method provided by an embodiment of the present application;

FIG. 12 is a schematic structural diagram of a bitstream provided by an embodiment of the present application;

FIG. 13 is a schematic diagram of an application of a second electronic device provided by an embodiment of the present application;

FIG. 14 is a schematic diagram of a 3D face image provided by an embodiment of the present application.

Detailed description

The 3D video call method and the electronic device provided in the embodiments of the present application are described in detail below with reference to the accompanying drawings.

The terms "include" and "have", and any variations thereof, as used in the description of the present application are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that includes a series of steps or units is not limited to the listed steps or units; it may optionally include other steps or units that are not listed, or other steps or units inherent to the process, method, product, or device.

It should be noted that, in the embodiments of the present application, words such as "exemplary" or "for example" are used to indicate an example, illustration, or explanation. Any embodiment or design described as "exemplary" or "for example" should not be construed as preferred over, or more advantageous than, other embodiments or designs. Rather, such words are intended to present a related concept in a concrete manner.

In the description of the present application, unless otherwise specified, "a plurality of" means two or more. The term "and/or" describes an association between associated objects and indicates that three relationships are possible. For example, "A and/or B" can mean: A alone, both A and B, or B alone.

The application scenarios described in the embodiments of the present application are intended to explain the technical solutions more clearly and do not limit them. A person of ordinary skill in the art will appreciate that, as network architectures evolve and new service scenarios emerge, the technical solutions provided in the embodiments of the present application are equally applicable to similar technical problems.

FIG. 1 is a schematic architectural diagram of a 3D video call system to which the 3D video call method provided in the embodiments of the present application is applicable. To facilitate understanding, the system shown in FIG. 1 is used as an example to describe in detail a 3D video call system applicable to the embodiments. It should be noted that the solutions in the embodiments of the present application can also be applied to other 3D video call systems, such as video call scenarios between one first electronic device and multiple second electronic devices, or between multiple first electronic devices and multiple second electronic devices, and the corresponding names may be replaced by the names of the corresponding functions in those systems.

As shown in FIG. 1, the 3D video call system includes at least two electronic devices, such as a first electronic device and a second electronic device. The embodiments of the present application are described using an example in which the first electronic device is the sender of the 3D video and the second electronic device is the receiver. An electronic device may be a terminal device with a video call function, such as a mobile phone, a tablet computer, an in-vehicle device, an augmented reality (AR) or virtual reality (VR) device, a notebook computer, an ultra-mobile personal computer (UMPC), a netbook, a personal digital assistant (PDA), an artificial intelligence (AI) device, or a wearable device. A wearable device may be a smart watch, a smart band, smart glasses, a smart helmet, or the like. The embodiments of the present application do not limit the specific type of the electronic device.

FIG. 2 is a first schematic structural diagram of an electronic device provided by an embodiment of the present application. As shown in FIG. 2, the electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a sensor module 190, a button 190, a motor 191, an indicator 192, a camera 193, a display screen 194, a subscriber identification module (SIM) card interface 195, and so on.

The processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU). The different processing units may be independent devices or may be integrated into one or more processors.

The controller may be the nerve center and command center of the electronic device 100. The controller can generate operation control signals according to the instruction opcode and timing signals, and thereby control instruction fetching and instruction execution.

A memory may also be provided in the processor 110 for storing instructions and data. In some embodiments, the memory in the processor 110 is a cache. This memory can hold instructions or data that the processor 110 has just used or uses repeatedly. If the processor 110 needs the instructions or data again, it can fetch them directly from this memory, which avoids repeated accesses, reduces the waiting time of the processor 110, and thus improves system efficiency.

In some embodiments of the present application, the electronic device 100 may use the processor 110 to acquire a face depth image and a 2D face image, divide the face depth image into multiple subunits, and divide the 2D face image into multiple subunits. Optionally, the electronic device 100 may use the processor 110 to obtain 3D face sub-images from the face depth image and the 2D face image. Specifically, the electronic device 100 may use the processor 110 to obtain one 3D face sub-image from one subunit of the face depth image and one subunit of the 2D face image, where the subunit of the face depth image corresponds to the subunit of the 2D face image.

The charging management module 140 is used to receive charging input from a charger. The charger may be a wireless charger or a wired charger.

The power management module 141 is used to connect the battery 142, the charging management module 140, and the processor 110. The power management module 141 receives input from the battery 142 and/or the charging management module 140 and supplies power to the processor 110, the internal memory 121, the external memory, the display screen 194, the camera 193, the wireless communication module 160, and so on.

The wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor, the baseband processor, and so on.

The mobile communication module 150 may provide wireless communication solutions applied to the electronic device 100, including 2G/3G/4G/5G. The mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA), and so on.

In some embodiments of the present application, the electronic device 100 may use the mobile communication module 150 to send the encoded face depth image and 2D face image to other electronic devices, and/or to receive an encoded face depth image and 2D face image from other electronic devices. For example, the electronic device 100 may use the mobile communication module 150 to send encoded subunits of the face depth image and of the 2D face image to other electronic devices, and/or to receive encoded subunits of a face depth image and of a 2D face image from other electronic devices.

The wireless communication module 160 may provide wireless communication solutions applied to the electronic device 100, including wireless local area network (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared (IR) technology.

The electronic device 100 implements the display function through the GPU, the display screen 194, the application processor, and so on. The GPU is a microprocessor for image processing and connects the display screen 194 and the application processor. The GPU performs mathematical and geometric calculations for graphics rendering. The processor 110 may include one or more GPUs that execute program instructions to generate or change display information.

In some embodiments of the present application, the electronic device 100 may use the GPU to superimpose a 3D face sub-image on a 2D scene image.

The display screen 194 is used to display images, videos, and so on. In some embodiments, the electronic device 100 may include 1 or N display screens 194, where N is a positive integer greater than 1. In some embodiments, the electronic device 100 may include 1 or N cameras 193.

The electronic device 100 may implement the shooting function through the ISP, the camera 193, the video codec, the GPU, the display screen 194, the application processor, and so on.

The camera 193 is used to capture still images or video. The camera 193 may include a time of flight (TOF) sensor, a 3D structured light sensor, a color (red green blue, RGB) sensor, and so on.

In some embodiments of the present application, the electronic device 100 may use the camera 193 to capture a face depth image and a 2D face image.

In some embodiments of the present application, the electronic device 100 may use the video codec to encode the face depth image and the 2D face image, and/or to obtain a face depth image and a 2D face image by decoding. For example, the electronic device 100 may use the video codec to encode subunits of the face depth image and subunits of the 2D face image, and/or to obtain subunits of a face depth image and subunits of a 2D face image by decoding.

The external memory interface 120 may be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100.

The internal memory 121 may be used to store computer-executable program code, which includes instructions. The internal memory 121 may include a program storage area and a data storage area. The internal memory 121 may include high-speed random access memory and may also include non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or universal flash storage (UFS). The processor 110 executes the various functional applications and data processing of the electronic device 100 by running instructions stored in the internal memory 121 and/or instructions stored in the memory provided in the processor. In some embodiments of the present application, the internal memory 121 may be used to store an artificial intelligence algorithm model, and/or a 3D face generation algorithm model, and so on.

The audio module 170 includes a speaker, a receiver, a microphone, an earphone jack, and so on.

The audio module 170 is used to convert digital audio data into an analog audio electrical signal for output, and to convert an analog audio electrical signal input into digital audio data. The audio module 170 may include an analog-to-digital converter and a digital-to-analog converter.

In some embodiments, the electronic device 100 may implement audio functions, such as music playback and recording, through the audio module 170, the application processor, and so on.

The sensor module 190 may include a pressure sensor, a gyroscope sensor, an air pressure sensor, a magnetic sensor, an acceleration sensor, a distance sensor, a proximity light sensor, a fingerprint sensor, a temperature sensor, a touch sensor, an ambient light sensor, a bone conduction sensor, and so on.

In some embodiments of the present application, the electronic device 100 may use the touch sensor to detect an adjustment action, so as to adjust the angle of the face in the 3D face image shown on the display screen 194.

It can be understood that the structure illustrated in the embodiments of the present application does not constitute a specific limitation on the electronic device 100. In other embodiments of the present application, the electronic device 100 may include more or fewer components than illustrated, combine certain components, split certain components, or arrange the components differently. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.

The software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a microkernel architecture, a microservice architecture, or a cloud architecture. The embodiments of the present invention use an Android system with a layered architecture as an example to illustrate the software structure of the electronic device 100.

FIG. 3 is a block diagram of the software structure of an electronic device provided by an embodiment of the present application.

A layered architecture divides the software into several layers, each with a clear role and division of labor. The layers communicate with each other through software interfaces. In some embodiments, the Android system is divided into four layers, which are, from top to bottom, the application layer, the application framework layer, the Android runtime and system libraries, and the kernel layer.

The application layer may include a series of application packages.

As shown in FIG. 3, the application packages may include applications such as camera, calendar, map, WLAN, music, short message, gallery, call, and navigation.

The call application may be used to implement 3D video calls.

The application framework layer provides an application programming interface (API) and a programming framework for the applications in the application layer. The application framework layer includes some predefined functions.

As shown in FIG. 3, the application framework layer may include a window manager, a content provider, a view system, a phone manager, a resource manager, a notification manager, and so on.

In some embodiments, the 3D video call may also be implemented as a module in the application framework layer of the electronic device, such as a 3D video call module.

The window manager is used to manage window programs. The window manager can obtain the size of the display screen, determine whether there is a status bar, lock the screen, capture the screen, and so on.

The content provider is used to store and retrieve data and make the data accessible to applications. The data may include videos, images, audio, calls made and received, browsing history and bookmarks, the phone book, and so on.

The view system includes visual controls, such as controls for displaying text and controls for displaying pictures. The view system can be used to build applications. A display interface may consist of one or more views. For example, a display interface that includes a short message notification icon may include a view for displaying text and a view for displaying pictures.

The phone manager is used to provide the communication functions of the electronic device 100, for example management of the call state (including connected, hung up, and so on).

The resource manager provides applications with various resources, such as localized strings, icons, pictures, layout files, and video files.

The notification manager enables an application to display notification information in the status bar. It can be used to convey informational messages that disappear automatically after a short time without user interaction. For example, the notification manager is used to announce that a download is complete or to deliver message reminders. Notifications may also appear in the status bar at the top of the system as a chart or scrolling text, such as notifications from applications running in the background, or appear on the screen as a dialog window. Examples include showing text in the status bar, playing a prompt sound, and flashing the indicator light.

The Android runtime includes a core library and a virtual machine. The Android runtime is responsible for the scheduling and management of the Android system.

The core library consists of two parts: one part is the functions that the Java language needs to call, and the other part is the Android core library.

The application layer and the application framework layer run in the virtual machine. The virtual machine executes the Java files of the application layer and the application framework layer as binary files. The virtual machine performs functions such as object life cycle management, stack management, thread management, security and exception management, and garbage collection.

The system libraries may include multiple functional modules, for example a surface manager, media libraries, a 3D graphics processing library (for example, OpenGL ES), and a 2D graphics engine (for example, SGL).

The surface manager is used to manage the display subsystem and provides the fusion of 2D and 3D layers for multiple applications.

The media libraries support playback and recording of a variety of commonly used audio and video formats, as well as still image files. The media libraries may support a variety of audio, video, and image encoding formats, such as MPEG4, H.264, MP3, AAC, AMR, JPG, and PNG.

The 3D graphics processing library is used to implement 3D graphics drawing, image rendering, compositing, layer processing, and so on.

The 2D graphics engine is a drawing engine for 2D drawing.

The kernel layer is the layer between hardware and software. The kernel layer contains at least a display driver, a camera driver, an audio driver, and a sensor driver.

FIG. 4 is a second schematic structural diagram of the electronic device provided by an embodiment of the present application.

The electronic device 400 shown in FIG. 4 may be the first electronic device, that is, the sending end of the 3D video. As shown in FIG. 4, the first electronic device 400 provided by the embodiments of the present application may include a face image acquisition module 410, a video encoding module 420, and a network transmission module 430. Optionally, the first electronic device 400 may further include a 3D face generation module 440 and a display module 450.

It should be noted that the modules shown in FIG. 4 can be implemented in electronic hardware, in computer software, or in a combination of the two. For example, when implemented in software, the modules shown in FIG. 4 may be implemented as the call application in the application layer shown in FIG. 3, or as the 3D video call module in the application framework layer shown in FIG. 3. When implemented in hardware, the face image acquisition module 410, the video encoding module 420, and the 3D face generation module 440 may be implemented as the processor 110 shown in FIG. 2, the network transmission module 430 may be implemented as the mobile communication module 150 shown in FIG. 2, and the display module 450 may be implemented as the display screen 194 shown in FIG. 2. When implemented in a combination of computer software and electronic hardware, the software and hardware implementations described above can be combined, which is not described again here.

For example, the face image acquisition module 410 may be used to acquire a face depth image and a 2D face image, divide the face depth image into multiple subunits including a first subunit and a second subunit, and divide the 2D face image into multiple subunits including a third subunit and a fourth subunit. The face image acquisition module 410 may be used to send the first subunit and the third subunit to the video encoding module 420 described below and, after sending them, to send the second subunit and the fourth subunit to the video encoding module 420. Here the first subunit corresponds to the third subunit, and the second subunit corresponds to the fourth subunit.
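The division and ordered hand-off described above can be sketched as follows. This is a minimal illustration, not the method of the application: it assumes each image is a list of pixel rows, that both images have the same height, and that subunits are horizontal strips of rows. The names `split_into_subunits` and `paired_subunits` are hypothetical.

```python
from typing import Iterator, List, Tuple

Image = List[List[int]]  # rows of pixel values (assumed in-memory format)


def split_into_subunits(image: Image, num_subunits: int) -> List[Image]:
    """Split an image into horizontal strips of (nearly) equal height."""
    height = len(image)
    if not 1 <= num_subunits <= height:
        raise ValueError("num_subunits must be between 1 and the image height")
    base, extra = divmod(height, num_subunits)
    subunits: List[Image] = []
    start = 0
    for i in range(num_subunits):
        rows = base + (1 if i < extra else 0)  # spread leftover rows over the first strips
        subunits.append(image[start:start + rows])
        start += rows
    return subunits


def paired_subunits(depth: Image, color: Image, num_subunits: int) -> Iterator[Tuple[Image, Image]]:
    """Yield (depth subunit, corresponding 2D subunit) pairs in sending order."""
    if len(depth) != len(color):
        raise ValueError("depth and 2D images must have the same height in this sketch")
    yield from zip(split_into_subunits(depth, num_subunits),
                   split_into_subunits(color, num_subunits))


# Toy 4-row images: the first pair plays the role of (first subunit, third subunit),
# the second pair the role of (second subunit, fourth subunit).
depth_image = [[r] * 4 for r in range(4)]
color_image = [[10 * r] * 4 for r in range(4)]
pairs = list(paired_subunits(depth_image, color_image, 2))
```

Because the same row boundaries are used for both images, the i-th depth subunit and the i-th 2D subunit cover the same region of the face, which is the sense of "corresponds" assumed here.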

Optionally, the face image acquisition module 410 may be used to send the first subunit and the third subunit to the 3D face generation module 440 described below and, after sending them, to send the second subunit and the fourth subunit to the 3D face generation module 440.

Optionally, the face image acquisition module 410 may be used to acquire a 2D scene image and send it to the 3D face generation module 440 and/or the video encoding module 420 described below. For example, the 2D scene image includes the scene image of the current video scene.

In some embodiments, the face image acquisition module 410 may be specifically used to receive face depth information, receive 2D face information, obtain the face depth image from the face depth information, and obtain the 2D face image from the 2D face information. For example, the face depth information may be collected by a high-precision depth camera, such as a TOF sensor or a 3D structured light sensor.

For example, the 2D face information may include both face information and scene information of the current video scene. Alternatively, the 2D face information includes only face information and no scene information, in which case the face image acquisition module 410 may be used to receive 2D scene information and obtain the scene image from it. Optionally, both the 2D face information and the 2D scene information may be collected by a 2D camera, such as an RGB sensor.

In other embodiments, the face image acquisition module 410 may be specifically used to collect face depth information, collect 2D face information, obtain the face depth image from the face depth information, and obtain the 2D face image from the 2D face information. That is, the face image acquisition module 410 may include a module for collecting face depth information and a module for collecting 2D face information. Optionally, the face image acquisition module 410 may be specifically used to collect 2D scene information.

For example, FIG. 5 is a schematic structural diagram of the face image acquisition module provided by an embodiment of the present application. As shown in FIG. 5, the face image acquisition module 410 may include a face depth image acquisition submodule 411, a 2D image acquisition submodule 412, and an image signal processing (ISP) submodule 413.

The face depth image acquisition submodule 411 may be used to collect face depth information in the current video scene and send it to the image signal processing submodule 413 described below. For example, the face depth image acquisition submodule 411 may be a high-precision depth camera, which may include but is not limited to a TOF sensor and a 3D structured light sensor. A TOF sensor continuously sends light pulses toward the target object, receives the light returned from the target object, obtains the distance from itself to the target object by measuring the round-trip flight time of the light pulses, and generates depth information. A 3D structured light sensor obtains depth information about the surface of the target object by projecting structured light onto the surface and receiving the light reflected from it.
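The TOF principle described above reduces to distance = (speed of light × round-trip time) / 2, since the pulse travels to the target and back. The following is a small numerical sketch of that relation only; the function name `tof_distance_m` is hypothetical, and real TOF sensors typically involve additional steps (for example phase measurement and calibration) that are not modeled here.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458  # speed of light in vacuum, metres per second


def tof_distance_m(round_trip_time_s: float) -> float:
    """Return the sensor-to-target distance for a measured round-trip time."""
    if round_trip_time_s < 0:
        raise ValueError("round-trip time cannot be negative")
    # Halve the product because the light covers the distance twice.
    return SPEED_OF_LIGHT_M_PER_S * round_trip_time_s / 2.0


# A round trip of 4 nanoseconds corresponds to roughly 0.6 m.
distance = tof_distance_m(4e-9)
```

The example shows why depth sensing at face distances requires timing resolution on the order of nanoseconds or finer.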

The 2D image acquisition submodule 412 may be used to collect 2D face information in the current video scene and send it to the image signal processing submodule 413 described below. For example, the 2D image acquisition submodule 412 may be an RGB sensor or the like, which is not limited in the present application. Optionally, the 2D image acquisition submodule 412 may be used to collect 2D scene information in the current video scene and send it to the image signal processing submodule 413.

The ISP submodule 413 may be used to receive the face depth information from the face depth image acquisition submodule 411 and the 2D face information from the 2D image acquisition submodule 412, obtain the face depth image from the face depth information, and obtain the 2D face image from the 2D face information. Optionally, the ISP submodule 413 may be used to obtain the 2D scene image from the 2D scene information.

Specifically, the ISP submodule 413 may be used to divide the face depth image into multiple subunits including the first subunit and the second subunit, and to divide the 2D face image into multiple subunits including the third subunit and the fourth subunit. The ISP submodule 413 may be used to send the first subunit and the third subunit to the video encoding module 420 described below and, after sending them, to send the second subunit and the fourth subunit to the video encoding module 420. Optionally, the ISP submodule 413 may be used to send the 2D scene image to the video encoding module 420.

Optionally, the ISP submodule 413 may be used to send the first subunit and the third subunit to the 3D face generation module 440 described below and, after sending them, to send the second subunit and the fourth subunit to the 3D face generation module 440. The ISP submodule 413 may also be used to send the 2D scene image to the 3D face generation module 440.

The video encoding module 420 may be used to obtain a first coding unit from the first subunit and the third subunit and send the first coding unit to the network transmission module 430, and then, after obtaining and sending the first coding unit, to obtain a second coding unit from the second subunit and the fourth subunit and send the second coding unit to the network transmission module 430. In this way, the video encoding module 420 receives and encodes the subunits of the face depth image and the subunits of the 2D face image in a pipelined manner at the granularity of one pair of subunits, where each pair consists of one subunit of the face depth image and the corresponding subunit of the 2D face image. The coding unit for each pair is thus obtained and sent in a pipelined manner, which can reduce the delay with which the second electronic device obtains the 3D face image and improve the efficiency of obtaining the 3D video.
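The pair-granular pipelining described above can be illustrated with Python generators. This is a sketch under stated assumptions rather than the encoder of the application: `encode_pair` is a placeholder that merely concatenates bytes with a length prefix (it is not a real video codec), and `encode_pipeline` and `transmit` are hypothetical names standing in for the video encoding module and the network transmission module.

```python
from typing import Iterable, Iterator, List, Tuple


def encode_pair(depth_subunit: bytes, color_subunit: bytes) -> bytes:
    """Placeholder encoder: 4-byte depth length, then depth bytes, then 2D bytes."""
    return len(depth_subunit).to_bytes(4, "big") + depth_subunit + color_subunit


def encode_pipeline(pairs: Iterable[Tuple[bytes, bytes]], log: List[str]) -> Iterator[bytes]:
    """Encode one pair at a time and hand each coding unit on immediately."""
    for index, (depth_subunit, color_subunit) in enumerate(pairs, start=1):
        log.append(f"encode {index}")
        yield encode_pair(depth_subunit, color_subunit)


def transmit(coding_units: Iterable[bytes], log: List[str]) -> List[bytes]:
    """Stand-in for the network transmission module: 'send' units as they arrive."""
    sent: List[bytes] = []
    for index, unit in enumerate(coding_units, start=1):
        log.append(f"send {index}")
        sent.append(unit)
    return sent


log: List[str] = []
pairs = [(b"D1", b"C1"), (b"D2", b"C2")]
sent = transmit(encode_pipeline(pairs, log), log)
print(log)  # ['encode 1', 'send 1', 'encode 2', 'send 2']
```

The log shows the point of the scheme: the first coding unit is sent before the second pair is encoded, so the receiver does not have to wait for the whole frame to be encoded. In a real device the stages could also run concurrently, which this single-threaded sketch does not model.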

可选地，视频编码模块420，具体用于：对第三子单元进行编码，以获得第三编码单元；根据第一子单元和第三编码单元，获得第一编码单元；以及，对第四子单元进行编码，以获得第四编码单元；根据第二子单元和第四编码单元，获得第二编码单元。也就是说，视频编码模块420可对人脸二维图像的子单元进行编码，获得第三编码单元，再将人脸深度图像的子单元与第三编码单元进行编码，获得第一编码单元。Optionally, the video encoding module 420 is specifically configured to: encode the third subunit to obtain a third coding unit; obtain the first coding unit according to the first subunit and the third coding unit; encode the fourth subunit to obtain a fourth coding unit; and obtain the second coding unit according to the second subunit and the fourth coding unit. That is, the video encoding module 420 can encode a subunit of the two-dimensional face image to obtain the third coding unit, and then encode the corresponding subunit of the face depth image together with the third coding unit to obtain the first coding unit.

需要说明的是，本申请实施例不对电子设备对第一子单元和第三子单元进行编码的顺序进行限定，例如，可以先对第一子单元进行编码，再将编码后的第一子单元与第三子单元进行混合编码，获得第一编码单元。It should be noted that the embodiments of the present application do not limit the order in which the electronic device encodes the first subunit and the third subunit. For example, the first subunit may be encoded first, and the encoded first subunit may then be mixed-coded with the third subunit to obtain the first coding unit.

可选地，视频编码模块420可对场景二维图像进行编码，将编码后的场景二维图像发送给网络传输模块430。Optionally, the video encoding module 420 may encode the two-dimensional image of the scene, and send the encoded two-dimensional image of the scene to the network transmission module 430 .

网络传输模块430，可用于向第二电子设备发送编码后的人脸深度图像和人脸二维图像。具体地，网络传输模块430可用于向第二电子设备发送第一编码单元；在发送第一编码单元后，向第二电子设备发送第二编码单元。如此，网络传输模块430可以流水线式向第二电子设备发送多对子单元分别对应的编码单元，以使第二电子设备流水线式获得人脸三维子图像，可以降低第二电子设备获得人脸三维图像的时延，降低获得三维视频的时延。The network transmission module 430 can be used to send the encoded face depth image and two-dimensional face image to the second electronic device. Specifically, the network transmission module 430 can be used to send the first coding unit to the second electronic device and, after sending the first coding unit, to send the second coding unit to the second electronic device. In this way, the network transmission module 430 can send the coding units corresponding to the multiple pairs of subunits to the second electronic device in a pipelined manner, so that the second electronic device obtains the three-dimensional face sub-images in a pipelined manner. This can reduce the delay before the second electronic device obtains the three-dimensional face image, and therefore the delay in obtaining the three-dimensional video.

可选地，网络传输模块430，可用于向第二电子设备发送编码后的场景二维图像。Optionally, the network transmission module 430 may be configured to send the encoded two-dimensional image of the scene to the second electronic device.

可选地，人脸三维生成模块440，可用于根据人脸深度图像和人脸二维图像获得人脸三维图像。具体地，人脸三维生成模块440，可用于根据第一子单元和第三子单元获得第一人脸三维子图像，向下述显示模块450发送第一人脸三维子图像；在获得并发送第一人脸三维子图像后，根据第二子单元和第四子单元获得第二人脸三维子图像，向下述显示模块450发送第二人脸三维子图像。Optionally, the three-dimensional face generation module 440 can be used to obtain a three-dimensional face image according to the face depth image and the two-dimensional face image. Specifically, the three-dimensional face generation module 440 can be used to obtain the first three-dimensional face sub-image according to the first subunit and the third subunit and send it to the display module 450 described below; after obtaining and sending the first three-dimensional face sub-image, it obtains the second three-dimensional face sub-image according to the second subunit and the fourth subunit and sends it to the display module 450.

可选地，人脸三维生成模块440可用于接收来自人脸图像采集模块410的场景二维图像。或者，人脸三维生成模块440可用于根据人脸二维图像获得的场景二维图像，并发送给下述显示模块450。其中，人脸二维图像包括场景图像和人脸图像。Optionally, the three-dimensional face generation module 440 may be configured to receive the two-dimensional image of the scene from the face image acquisition module 410 . Alternatively, the three-dimensional face generation module 440 may be used to obtain a two-dimensional image of the scene according to the two-dimensional image of the face, and send it to the display module 450 described below. Wherein, the two-dimensional face image includes a scene image and a face image.

可选地，人脸三维生成模块440可用于向显示模块450发送场景二维图像。Optionally, the three-dimensional face generation module 440 may be configured to send the two-dimensional image of the scene to the display module 450 .

可选地，显示模块450，用于将第一人脸三维子图像与场景二维图像进行叠加。在将第一人脸三维子图像与场景二维图像进行叠加后，将第二人脸三维子图像与场景二维图像进行叠加。如此，显示模块450可流水线式将多个人脸三维子图像与场景二维图像进行叠加，以显示人脸三维图像。Optionally, the display module 450 is configured to superimpose the first 3D sub-image of the face with the 2D image of the scene. After superimposing the first three-dimensional face sub-image and the two-dimensional scene image, superimposing the second three-dimensional face sub-image and the two-dimensional scene image. In this way, the display module 450 can superimpose multiple 3D sub-images of the face and the 2D scene image in a pipelined manner to display the 3D image of the face.

图6为本申请实施例提供的电子设备的结构示意图三。FIG. 6 is a third structural schematic diagram of an electronic device provided by an embodiment of the present application.

图6所示的电子设备600可以为第二电子设备，即电子设备600可以为三维视频的接收端。如图6所示，本申请实施例提供的第二电子设备600可以包括网络传输模块610、视频解码模块620、人脸三维生成模块630和显示模块640。可选地，第二电子设备600还可以包括触控模块650。The electronic device 600 shown in FIG. 6 may be a second electronic device, that is, the electronic device 600 may be a receiving end of a 3D video. As shown in FIG. 6 , the second electronic device 600 provided by the embodiment of the present application may include a network transmission module 610 , a video decoding module 620 , a three-dimensional face generation module 630 and a display module 640 . Optionally, the second electronic device 600 may further include a touch module 650 .

需要说明的是，图6所示的模块能够以电子硬件、计算机软件、或者计算机软件和电子硬件的结合来实现。示例性地，当使用软件实现时，图6所示的模块可以实现为图3所示的应用程序层中的通话应用，或者，图6所示的模块也可以实现为图3所示的应用程序框架层中的三维视频通话模块。当使用硬件实现时，视频解码模块620、人脸三维生成模块630可以实现为图2所示的处理器110，网络传输模块610可以实现为图2所示的移动通信模块150，显示模块640可以实现为图2所示的显示屏194，触控模块650可以实现为图2所示的传感器模块190。当以计算机软件和电子硬件的结合来实现时，可将上述使用软件实现的方式和使用硬件实现的方式进行结合，本申请实施例不再赘述。It should be noted that the modules shown in FIG. 6 can be implemented in electronic hardware, computer software, or a combination of the two. Exemplarily, when implemented in software, the modules shown in FIG. 6 can be implemented as the call application in the application layer shown in FIG. 3, or as the three-dimensional video call module in the application framework layer shown in FIG. 3. When implemented in hardware, the video decoding module 620 and the three-dimensional face generation module 630 can be implemented as the processor 110 shown in FIG. 2, the network transmission module 610 as the mobile communication module 150 shown in FIG. 2, the display module 640 as the display screen 194 shown in FIG. 2, and the touch module 650 as the sensor module 190 shown in FIG. 2. When implemented as a combination of computer software and electronic hardware, the software and hardware implementations above can be combined; details are not repeated in the embodiments of the present application.

其中，网络传输模块610，可用于接收编码后的人脸深度图像和人脸二维图像。具体地，网络传输模块610，可用于接收来自第一电子设备的第一编码单元，向视频解码模块620发送第一编码单元；在接收并发送第一编码单元后，接收来自第一电子设备的第二编码单元，向视频解码模块620发送第二编码单元。可选地，网络传输模块610，可用于接收编码后的场景二维图像，并发送给视频解码模块620。The network transmission module 610 can be used to receive the encoded face depth image and two-dimensional face image. Specifically, the network transmission module 610 can be used to receive the first coding unit from the first electronic device and send it to the video decoding module 620; after receiving and sending the first coding unit, it receives the second coding unit from the first electronic device and sends it to the video decoding module 620. Optionally, the network transmission module 610 may be configured to receive the encoded two-dimensional scene image and send it to the video decoding module 620.

视频解码模块620，可用于对编码后的人脸深度图像和人脸二维图像进行解码。具体地，视频解码模块620，可用于根据第一编码单元获得第一子单元和第三子单元，向下述人脸三维生成模块630发送第一子单元和第三子单元；在获得并发送第一子单元和第三子单元后，根据第二编码单元获得第二子单元和第四子单元，向下述人脸三维生成模块630发送第二子单元和第四子单元。其中，第一子单元和第二子单元分别为人脸深度图像中的子单元，第三子单元和第四子单元分别为人脸二维图像中的子单元，第一子单元对应于第三子单元，第二子单元对应于第四子单元。The video decoding module 620 can be used to decode the encoded face depth image and two-dimensional face image. Specifically, the video decoding module 620 can be used to obtain the first subunit and the third subunit according to the first coding unit and send them to the three-dimensional face generation module 630 described below; after obtaining and sending the first subunit and the third subunit, it obtains the second subunit and the fourth subunit according to the second coding unit and sends them to the three-dimensional face generation module 630. The first subunit and the second subunit are subunits of the face depth image, and the third subunit and the fourth subunit are subunits of the two-dimensional face image; the first subunit corresponds to the third subunit, and the second subunit corresponds to the fourth subunit.

可选地，视频解码模块620，可具体用于对第一编码单元进行解析，以获得第一子单元和第三编码单元；对第三编码单元进行解码，以获得第三子单元。在对第三编码单元进行解码后，对第二编码单元进行解析，以获得第二子单元和第四编码单元；对第四编码单元进行解码，以获得第四子单元。需要说明的是，视频解码模块620解码的方式与视频编码模块420编码的方式相对应，本申请不对视频解码模块620的具体解码方式进行限定。Optionally, the video decoding module 620 may be specifically configured to parse the first coding unit to obtain the first sub-unit and the third coding unit; and decode the third coding unit to obtain the third sub-unit. After the third coding unit is decoded, the second coding unit is parsed to obtain the second sub-unit and the fourth coding unit; the fourth coding unit is decoded to obtain the fourth sub-unit. It should be noted that the decoding method of the video decoding module 620 corresponds to the encoding method of the video encoding module 420 , and this application does not limit the specific decoding method of the video decoding module 620 .
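The optional encode and parse steps can be sketched as a round trip in Python. This is only an illustration under stated assumptions: the patent does not define the layout of a coding unit, so the two-length header, the function names `encode_pair` and `decode_pair`, and the use of `zlib` as a stand-in for the video codec are all hypothetical.

```python
import struct
import zlib

HEADER = ">II"  # hypothetical header: depth length, coded 2D length


def encode_pair(depth_subunit: bytes, image_subunit: bytes) -> bytes:
    """Build one coding unit from a paired depth sub-unit and 2D sub-unit."""
    # Stand-in for the video encoder: yields the "third coding unit".
    image_coded = zlib.compress(image_subunit)
    header = struct.pack(HEADER, len(depth_subunit), len(image_coded))
    # The depth sub-unit is carried alongside the coded 2D sub-unit.
    return header + depth_subunit + image_coded


def decode_pair(coding_unit: bytes) -> tuple[bytes, bytes]:
    """Parse a coding unit, then decode the embedded 2D coding unit."""
    depth_len, image_len = struct.unpack_from(HEADER, coding_unit, 0)
    start = struct.calcsize(HEADER)
    depth_subunit = coding_unit[start:start + depth_len]
    image_coded = coding_unit[start + depth_len:start + depth_len + image_len]
    return depth_subunit, zlib.decompress(image_coded)


unit = encode_pair(b"\x00\x10\x00\x20", b"RGBRGBRGB")
depth, image = decode_pair(unit)
```

Because each coding unit is self-describing, the receiver can parse and decode the first unit before the second one arrives, which is what the pipelined flow relies on.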

可选地，视频解码模块620，可用于对编码后的场景二维图像进行解码，以获得场景二维图像，并向人脸三维生成模块630发送场景二维图像。Optionally, the video decoding module 620 may be configured to decode the encoded two-dimensional image of the scene to obtain the two-dimensional image of the scene, and send the two-dimensional image of the scene to the three-dimensional face generating module 630 .

人脸三维生成模块630，可用于根据人脸深度图像和人脸二维图像获得人脸三维图像。具体地，人脸三维生成模块630，可用于根据第一子单元和第三子单元获得第一人脸三维子图像，向显示模块640发送第一人脸三维子图像。在获得并发送第一人脸三维子图像后，根据第二子单元和第四子单元获得第二人脸三维子图像，向显示模块640发送第二人脸三维子图像。The three-dimensional face generation module 630 can be used to obtain a three-dimensional face image according to the depth image of the face and the two-dimensional image of the face. Specifically, the 3D face generation module 630 may be configured to obtain the first 3D sub-image of the face according to the first subunit and the third subunit, and send the first 3D sub-image of the face to the display module 640 . After the first 3D sub-image of human face is obtained and sent, the second 3D sub-image of human face is obtained according to the second subunit and the fourth subunit, and the second 3D sub-image of human face is sent to the display module 640 .

可选地，人脸三维生成模块630，可用于根据人脸二维图像获得的场景二维图像。其中，人脸二维图像包括场景图像和人脸图像。Optionally, the three-dimensional face generating module 630 may be used to obtain a two-dimensional image of a scene based on a two-dimensional image of a human face. Wherein, the two-dimensional face image includes a scene image and a face image.

可选地，人脸三维生成模块630，可用于向下述显示模块640发送场景二维图像。Optionally, the three-dimensional face generation module 630 may be configured to send a two-dimensional scene image to the display module 640 described below.

显示模块640，用于将第一人脸三维子图像与场景二维图像进行叠加。在将第一人脸三维子图像与场景二维图像进行叠加后，将第二人脸三维子图像与场景二维图像进行叠加。如此，显示模块640可流水线式将多个人脸三维子图像与场景二维图像进行叠加，以显示人脸三维图像。The display module 640 is configured to superimpose the first 3D sub-image of the face with the 2D image of the scene. After superimposing the first three-dimensional face sub-image and the two-dimensional scene image, superimposing the second three-dimensional face sub-image and the two-dimensional scene image. In this way, the display module 640 can superimpose multiple 3D sub-images of the face and the 2D scene image in a pipelined manner to display the 3D image of the face.

可选地，触控模块650，用于检测调整动作。Optionally, the touch module 650 is configured to detect adjustment actions.

可选地，显示模块640，可用于根据调整动作，调整显示整体三维图像中人脸的角度。也就是说，电子设备600可以通过调整显示三维视频中人脸的不同角度。Optionally, the display module 640 may be configured to adjust and display the angle of the human face in the overall three-dimensional image according to the adjustment action. That is to say, the electronic device 600 can adjust and display different angles of faces in the 3D video.

下面将结合图7-图14对本申请实施例提供的三维视频通话方法进行具体阐述。The three-dimensional video call method provided by the embodiment of the present application will be described in detail below with reference to FIGS. 7-14 .

图7为本申请实施例提供的三维视频通话方法的流程示意图一。FIG. 7 is a first schematic flowchart of a three-dimensional video call method provided by an embodiment of the present application.

如图7所示，该三维视频通话方法包括如下步骤：As shown in Figure 7, the three-dimensional video call method includes the following steps:

S701，第一电子设备获取人脸深度图像和人脸二维图像，将人脸深度图像划分为包括第一子单元和第二子单元的多个子单元，将人脸二维图像划分为包括第三子单元和第四子单元的多个子单元。S701. The first electronic device acquires a face depth image and a two-dimensional face image, divides the face depth image into multiple subunits including a first subunit and a second subunit, and divides the two-dimensional face image into multiple subunits including a third subunit and a fourth subunit.

具体地，第一子单元对应于第三子单元，第二子单元对应于第四子单元。也就是说，人脸深度图像的多个子单元与人脸二维图像的多个子单元一一对应。Specifically, the first subunit corresponds to the third subunit, and the second subunit corresponds to the fourth subunit. That is to say, the multiple sub-units of the depth image of the face are in one-to-one correspondence with the sub-units of the two-dimensional image of the face.

图8为本申请实施例提供的人脸深度图像和人脸二维图像的示意图。以将人脸深度图像和人脸二维图像分别划分为4个子单元为例。如图8所示，将人脸深度图像划分为子单元1、子单元2、子单元3和子单元4，将人脸二维图像划分为子单元5、子单元6、子单元7和子单元8。其中，子单元1对应于子单元5，子单元2对应于子单元6，子单元3对应于子单元7，子单元4对应于子单元8。FIG. 8 is a schematic diagram of a face depth image and a two-dimensional face image provided by an embodiment of the present application. Take the face depth image and face two-dimensional image divided into four sub-units as an example. As shown in Figure 8, the face depth image is divided into subunit 1, subunit 2, subunit 3 and subunit 4, and the face two-dimensional image is divided into subunit 5, subunit 6, subunit 7 and subunit 8 . Wherein, subunit 1 corresponds to subunit 5 , subunit 2 corresponds to subunit 6 , subunit 3 corresponds to subunit 7 , and subunit 4 corresponds to subunit 8 .

需要说明的是，图8仅为本申请实施例提出的一种将人脸深度图像和人脸二维图像分别划分为多个子单元的方式，例如，还可以将人脸深度图像和人脸二维图像分别沿竖直方向划分为多个子单元，本申请对此不进行限定。It should be noted that FIG. 8 shows only one way, proposed in the embodiments of the present application, of dividing the face depth image and the two-dimensional face image into multiple subunits. For example, the face depth image and the two-dimensional face image may instead each be divided into multiple subunits along the vertical direction; this application does not limit the manner of division.
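As a minimal sketch of the division in S701, the following Python splits both images into the same number of bands and pairs them one-to-one. Images are modelled as lists of pixel rows, the split into horizontal bands is an assumption (the text leaves the direction open), and `split_rows` and `pair_subunits` are hypothetical names.

```python
def split_rows(rows, count):
    """Split a list of pixel rows into `count` contiguous bands."""
    base, extra = divmod(len(rows), count)
    bands, start = [], 0
    for index in range(count):
        # Spread any remainder rows over the first `extra` bands.
        size = base + (1 if index < extra else 0)
        bands.append(rows[start:start + size])
        start += size
    return bands


def pair_subunits(depth_rows, image_rows, count=4):
    """Pair the i-th depth sub-unit with the i-th 2D sub-unit."""
    if len(depth_rows) != len(image_rows):
        raise ValueError("depth image and 2D image must have the same height")
    return list(zip(split_rows(depth_rows, count), split_rows(image_rows, count)))


depth_image = [[row] * 6 for row in range(8)]          # 8 rows of depth values
face_image = [[(row, 0, 0)] * 6 for row in range(8)]   # 8 rows of RGB pixels
pairs = pair_subunits(depth_image, face_image)
```

With four bands, `pairs[0]` plays the role of subunit 1 and subunit 5 in FIG. 8, `pairs[1]` that of subunit 2 and subunit 6, and so on.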

在一种可能的设计方式中，上述S701中，第一电子设备获取人脸深度图像和人脸二维图像，可以包括下述步骤一至步骤三。In a possible design manner, in the above S701, the first electronic device acquires the depth image of the face and the two-dimensional image of the face, which may include the following steps 1 to 3.

步骤一，第一电子设备接收人脸深度信息。Step 1, the first electronic device receives face depth information.

示例性地，人脸深度信息可以是高精度深度摄像头采集的，如TOF传感器、三维结构光传感器等。图9为本申请实施例提供的第一电子设备的应用示意图一。结合图9，人脸图像采集模块410可接收人脸深度信息。Exemplarily, the face depth information may be collected by a high-precision depth camera, such as a TOF sensor, a three-dimensional structured light sensor, and the like. FIG. 9 is a first schematic diagram of the application of the first electronic device provided by the embodiment of the present application. Referring to FIG. 9 , the face image acquisition module 410 may receive face depth information.

步骤二，第一电子设备接收人脸二维信息。Step 2, the first electronic device receives the two-dimensional face information.

示例性地，人脸二维信息可以包括人脸信息。可选地，人脸二维信息还可以包括当前视频场景中的场景信息。Exemplarily, the two-dimensional face information may include face information. Optionally, the two-dimensional face information may also include scene information in the current video scene.

在一些实施例中，第一电子设备可以接收场景二维信息。其中，场景二维信息可以包括场景信息。In some embodiments, the first electronic device may receive scene two-dimensional information. Wherein, the scene two-dimensional information may include scene information.

示例性地，人脸二维信息和场景二维信息均可以是二维摄像头采集的，如RGB传感器等。结合图9，人脸图像采集模块410可接收人脸二维信息。可选地，人脸图像采集模块410还可以接收场景二维信息。Exemplarily, both the two-dimensional face information and the two-dimensional scene information may be collected by a two-dimensional camera, such as an RGB sensor. Referring to FIG. 9 , the face image acquisition module 410 may receive two-dimensional face information. Optionally, the face image acquisition module 410 may also receive scene two-dimensional information.

步骤三，第一电子设备根据人脸深度信息获得人脸深度图像，根据人脸二维信息获得人脸二维图像。Step 3, the first electronic device obtains a face depth image according to the face depth information, and obtains a two-dimensional face image according to the two-dimensional information of the face.

结合图9，人脸图像采集模块410将人脸深度信息转化为人脸深度图像，将人脸二维信息转化为人脸二维图像。当人脸二维信息包括人脸信息，不包括场景信息时，人脸二维图像包括人脸图像；当人脸二维信息包括人脸信息和场景信息时，人脸二维图像中包括人脸图像和场景图像。Referring to FIG. 9 , the face image acquisition module 410 converts the face depth information into a face depth image, and converts the two-dimensional face information into a two-dimensional face image. When the two-dimensional face information includes face information but does not include scene information, the two-dimensional face image includes face images; when the two-dimensional face information includes face information and scene information, the two-dimensional face image includes face images and scene images.

可选地，人脸图像采集模块410可以将场景二维信息转化为场景二维图像。Optionally, the face image acquisition module 410 may convert the two-dimensional information of the scene into a two-dimensional image of the scene.

需要说明的是，本申请实施例不对上述步骤一至步骤三的执行顺序进行限定，以能够获取人脸深度图像和人脸二维图像为准。It should be noted that the embodiments of the present application do not limit the execution order of steps 1 to 3 above, as long as the face depth image and the two-dimensional face image can be acquired.

结合图9，人脸图像采集模块410将人脸深度图像划分为包括第一子单元和第二子单元的多个子单元，将人脸二维图像划分为包括第三子单元和第四子单元的多个子单元。可选地，人脸图像采集模块410通过人脸深度图像缓存，将人脸深度图像的多个子单元传输至视频编码模块420和/或人脸三维生成模块440。人脸图像采集模块410通过人脸二维图像缓存，将人脸二维图像的多个子单元和/或场景二维图像传输至视频编码模块420和/或人脸三维生成模块440。Referring to FIG. 9, the face image acquisition module 410 divides the face depth image into a plurality of subunits including a first subunit and a second subunit, and divides the two-dimensional face image into a plurality of subunits including a third subunit and a fourth subunit. Optionally, the face image acquisition module 410 transmits the multiple subunits of the face depth image to the video encoding module 420 and/or the three-dimensional face generation module 440 through the face depth image cache. The face image acquisition module 410 transmits the multiple subunits of the two-dimensional face image and/or the two-dimensional scene image to the video encoding module 420 and/or the three-dimensional face generation module 440 through the two-dimensional face image cache.

在另一种可能的设计方式中，上述S701中，第一电子设备获取人脸深度图像和人脸二维图像，可以包括下述步骤四至步骤六。In another possible design manner, in the above S701, the first electronic device acquires the depth image of the face and the two-dimensional image of the face, which may include the following steps 4 to 6.

步骤四，第一电子设备采集人脸深度信息。Step 4, the first electronic device collects face depth information.

图10为本申请实施例提供的第一电子设备的应用示意图二。结合图10，人脸图像采集模块410可以包括：人脸深度图像采集子模块411、二维图像采集子模块412和图像信号处理子模块413。其中，人脸深度图像采集子模块411可采集人脸深度信息，并将该人脸深度信息发送给图像信号处理子模块413。FIG. 10 is a second schematic diagram of the application of the first electronic device provided by the embodiment of the present application. Referring to FIG. 10 , the face image acquisition module 410 may include: a face depth image acquisition submodule 411 , a two-dimensional image acquisition submodule 412 and an image signal processing submodule 413 . Wherein, the face depth image collection sub-module 411 can collect face depth information, and send the face depth information to the image signal processing sub-module 413 .

需要说明的是，图10所示的第一电子设备与图9所示的第一电子设备的区别在于，人脸图像采集模块410的结构不相同，其它部分均相同。下述关于视频编码模块420、网络传输模块430、人脸三维生成模块440和显示模块450的具体阐述，对于图9以及图10所示的第一电子设备均适用。It should be noted that the difference between the first electronic device shown in FIG. 10 and the first electronic device shown in FIG. 9 lies in that the structure of the face image acquisition module 410 is different, and other parts are the same. The following detailed explanations about the video encoding module 420 , the network transmission module 430 , the face 3D generation module 440 and the display module 450 are applicable to the first electronic device shown in FIG. 9 and FIG. 10 .

步骤五，第一电子设备采集人脸二维信息。Step five, the first electronic device collects two-dimensional face information.

结合图10，二维图像采集子模块412可采集人脸二维信息和/或场景二维信息，将人脸二维信息和/或场景二维信息发送给图像信号处理子模块413。关于人脸二维信息和场景二维信息的具体阐述可参照上述步骤二，此处不再赘述。Referring to FIG. 10 , the 2D image collection submodule 412 can collect 2D face information and/or scene 2D information, and send the 2D face information and/or scene 2D information to the image signal processing submodule 413 . For the specific elaboration of the two-dimensional information of the face and the two-dimensional information of the scene, please refer to the above step two, which will not be repeated here.

步骤六，第一电子设备根据人脸深度信息获得人脸深度图像，根据人脸二维信息获得人脸二维图像。Step 6, the first electronic device obtains the face depth image according to the face depth information, and obtains the face two-dimensional image according to the two-dimensional information of the face.

结合图10，图像信号处理子模块413可接收来自人脸深度图像采集子模块411的人脸深度信息，并将人脸深度信息转化为人脸深度图像。图像信号处理子模块413可接收来自二维图像采集子模块412的人脸二维信息和/或场景二维信息，将人脸二维信息转化为人脸二维图像，将场景二维信息转化为场景二维图像。Referring to FIG. 10 , the image signal processing sub-module 413 can receive the face depth information from the face depth image acquisition sub-module 411 , and convert the face depth information into a face depth image. The image signal processing sub-module 413 can receive the two-dimensional face information and/or the two-dimensional information of the scene from the two-dimensional image acquisition sub-module 412, convert the two-dimensional face information into a two-dimensional image of the face, and convert the two-dimensional information of the scene into 2D image of the scene.

需要说明的是，关于人脸二维图像和场景二维图像的具体阐述可参照上述步骤三。本申请实施例不对上述步骤四至步骤六的执行顺序进行限定，以能够获取人脸深度图像和人脸二维图像为准。It should be noted that, for the specific elaboration on the two-dimensional image of the face and the two-dimensional image of the scene, reference may be made to the third step above. The embodiment of the present application does not limit the execution order of the above-mentioned steps 4 to 6, as long as the depth image of the face and the two-dimensional image of the face can be acquired.

结合图10，人脸图像采集模块410中的图像信号处理子模块413将人脸深度图像划分为包括第一子单元和第二子单元的多个子单元，将人脸二维图像划分为包括第三子单元和第四子单元的多个子单元。可选地，图像信号处理子模块413通过人脸深度图像缓存，将人脸深度图像的多个子单元传输至视频编码模块420和/或人脸三维生成模块440。图像信号处理子模块413通过人脸二维图像缓存，将人脸二维图像的多个子单元和/或场景二维图像传输至视频编码模块420和/或人脸三维生成模块440。Referring to FIG. 10, the image signal processing submodule 413 in the face image acquisition module 410 divides the face depth image into a plurality of subunits including a first subunit and a second subunit, and divides the two-dimensional face image into a plurality of subunits including a third subunit and a fourth subunit. Optionally, the image signal processing submodule 413 transmits the multiple subunits of the face depth image to the video encoding module 420 and/or the three-dimensional face generation module 440 through the face depth image cache. The image signal processing submodule 413 transmits the multiple subunits of the two-dimensional face image and/or the two-dimensional scene image to the video encoding module 420 and/or the three-dimensional face generation module 440 through the two-dimensional face image cache.

本申请实施例中，第一电子设备以一对子单元为粒度对人脸深度图像和人脸二维图像进行流水线式处理。其中，一对子单元包括人脸深度图像的一个子单元和与该人脸深度图像的一个子单元对应的人脸二维图像的一个子单元。In the embodiment of the present application, the first electronic device performs pipeline processing on the face depth image and the two-dimensional face image at a granularity of a pair of sub-units. Wherein, a pair of sub-units includes a sub-unit of the depth image of the face and a sub-unit of the two-dimensional image of the face corresponding to the sub-unit of the depth image of the face.

图11为本申请实施例提供的三维视频通话方法的流程示意图二。其中，子片可以包括一对子单元，或者与该一对子单元相关的信息。结合图8和图11，子片1可以包括子单元1和子单元5，或者，子片1可以为对子单元1和子单元5进行处理后获得的编码单元1a、或人脸三维子图像1。子片2可以包括子单元2和子单元6，或者，子片2可以为对子单元2和子单元6进行处理后获得的编码单元2a、或人脸三维子图像2。类似地，子片3可以包括子单元3和子单元7，或者，子片3可以为对子单元3和子单元7进行处理后获得的编码单元3a、或人脸三维子图像3。子片4可以包括子单元4和子单元8，或者，子片4可以为对子单元4和子单元8进行处理后获得的编码单元4a、或人脸三维子图像4。FIG. 11 is a second schematic flow diagram of a three-dimensional video calling method provided by an embodiment of the present application. Wherein, a sub-slice may include a pair of sub-units, or information related to the pair of sub-units. Referring to FIG. 8 and FIG. 11 , sub-slice 1 may include sub-unit 1 and sub-unit 5 , or sub-slice 1 may be a coding unit 1 a obtained after processing sub-unit 1 and sub-unit 5 , or a three-dimensional face sub-image 1 . The sub-slice 2 may include a sub-unit 2 and a sub-unit 6, or the sub-slice 2 may be a coding unit 2a obtained after processing the sub-unit 2 and the sub-unit 6, or a three-dimensional face sub-image 2. Similarly, sub-slice 3 may include sub-unit 3 and sub-unit 7 , or sub-slice 3 may be coding unit 3 a obtained after processing sub-unit 3 and sub-unit 7 , or face three-dimensional sub-image 3 . The sub-slice 4 may include a sub-unit 4 and a sub-unit 8 , or the sub-slice 4 may be a coding unit 4 a obtained after processing the sub-unit 4 and the sub-unit 8 , or a three-dimensional face sub-image 4 .

结合图11，处理视频的周期为T，人脸图像采集模块410可以在第一个T时间内获得子单元1和子单元5，并发送给视频编码模块420。接着在第二个T时间内，获得子单元2和子单元6，并发送给视频编码模块420。在第三个T时间内，获得子单元3和子单元7，并发送给视频编码模块420。在第四个T时间内，获得子单元4和子单元8，并发送给视频编码模块420。Referring to FIG. 11, the video processing period is T. The face image acquisition module 410 can obtain subunit 1 and subunit 5 within the first period T and send them to the video encoding module 420. Within the second period T, it obtains subunit 2 and subunit 6 and sends them to the video encoding module 420. Within the third period T, it obtains subunit 3 and subunit 7 and sends them to the video encoding module 420. Within the fourth period T, it obtains subunit 4 and subunit 8 and sends them to the video encoding module 420.

类似地，人脸图像采集模块410可以一对子单元为粒度，将人脸深度图像的多个子单元和人脸二维图像的多个子单元流水线式发送给人脸三维生成模块440，图11中未示出，此处不再赘述。Similarly, the face image acquisition module 410 can send the multiple subunits of the face depth image and the multiple subunits of the two-dimensional face image to the three-dimensional face generation module 440 in a pipelined manner, at the granularity of one pair of subunits. This is not shown in FIG. 11 and is not described further here.

需要说明的是，为了便于描述，图11中所示的时间段T与电子设备的模块处理各子片的时间的最大值相等，在实际应用中，时间段T可大于电子设备的模块处理各子片的时间，本申请实施例不对时间段T的大小进行限定。电子设备的模块包括但不限于图4-图6所示出的模块。It should be noted that, for ease of description, the time period T shown in FIG. 11 equals the maximum time that a module of the electronic device takes to process a sub-slice. In practical applications, T may be greater than the time a module takes to process each sub-slice; the embodiments of the present application do not limit the length of T. The modules of the electronic device include but are not limited to the modules shown in FIGS. 4-6.
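To make the latency benefit of sub-slice pipelining concrete, here is a small worked calculation in Python. It is an idealized model, not a figure from the patent: it assumes every stage takes exactly one period T per sub-slice, and the stage count in the example (acquisition, encoding, transmission) is illustrative. `frame_latency` is a hypothetical helper.

```python
def frame_latency(stages: int, subslices: int, period: float, pipelined: bool) -> float:
    """Time until the last sub-slice of one frame leaves the last stage."""
    if pipelined:
        # A stage starts sub-slice n as soon as the previous stage hands it
        # over, so the stages overlap: fill time plus one period per sub-slice.
        return (stages + subslices - 1) * period
    # Without pipelining, each stage finishes the whole frame before the
    # next stage starts.
    return stages * subslices * period


T = 1.0
whole_frame = frame_latency(stages=3, subslices=4, period=T, pipelined=False)
per_subslice = frame_latency(stages=3, subslices=4, period=T, pipelined=True)
```

Under these assumptions, three stages and four sub-slices take 12 T when each stage waits for the whole frame, and 6 T when the sub-slices are pipelined.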

在一些实施例中，电子设备可以采用流水线式处理方案对各个子片进行处理。示例性地，若时间段T大于电子设备的模块处理各子片的时间，电子设备的模块可以在一个T时间内处理子片1后，接着处理子片2。示例性地，人脸图像采集模块410可以在第一个T时间开始时获取子单元1和子单元5，并发送给视频编码模块420，在第一个T时间的四分之三处完成该过程。紧接着获取子单元2和子单元6，并发送给视频编码模块420，在第二个T时间的二分之一处完成该过程。类似地，电子设备的各个模块完成对各个子片的处理，此处不再一一列举。In some embodiments, the electronic device may process the sub-slices using a pipelined processing scheme. Exemplarily, if the time period T is greater than the time a module of the electronic device takes to process each sub-slice, the module may process sub-slice 1 within one period T and then immediately go on to process sub-slice 2. For example, the face image acquisition module 410 can acquire subunit 1 and subunit 5 at the start of the first period T and send them to the video encoding module 420, completing this at three quarters of the first period T. It then immediately acquires subunit 2 and subunit 6 and sends them to the video encoding module 420, completing this at one half of the second period T. The other modules of the electronic device process the sub-slices in the same way, which is not enumerated here.

在另一些实施例中，电子设备可以采用定时处理方案对各个子片进行处理。示例性地，若时间段T大于电子设备中各个模块处理各子片的时间，电子设备的多个模块中的一个或多个模块可以定时处理各个子片。例如，第一个T时间的四分之三处完成对子片1的处理后，不立即处理子片2，而是等到第二个T时间开始时，才处理子片2。示例性地，人脸图像采集模块410在第一个T时间开始时获取子单元1和子单元5，并发送给视频编码模块420，在第一个T时间的四分之三处完成该过程。等待一段时间，当第二个T时间开始时，才获取子单元2和子单元6，并发送给视频编码模块420，在第二个T时间的四分之三处完成该过程。再等待一段时间，当第三个T时间开始时，开始处理下一个子片。类似地，电子设备的各个模块完成对各个子片的处理，此处不再一一列举。In other embodiments, the electronic device may process the sub-slices using a timed processing scheme. Exemplarily, if the time period T is greater than the time each module of the electronic device takes to process each sub-slice, one or more of the modules may process the sub-slices on a timer. For example, after finishing sub-slice 1 at three quarters of the first period T, a module does not process sub-slice 2 immediately but waits until the second period T begins. Thus the face image acquisition module 410 acquires subunit 1 and subunit 5 at the start of the first period T and sends them to the video encoding module 420, completing this at three quarters of the first period T. It then waits until the second period T begins before acquiring subunit 2 and subunit 6 and sending them to the video encoding module 420, completing this at three quarters of the second period T. It waits again and starts processing the next sub-slice when the third period T begins. The other modules of the electronic device process the sub-slices in the same way, which is not enumerated here.

需要说明的是，本申请实施例不对电子设备的各个子模块是否定时处理各个子片进行限定，也不对电子设备的部分子模块或全部模块定时处理各个子片进行限定，可以是电子设备中的部分模块定时处理各个子片，或者，电子设备中的各个模块均定时处理各个子片，或者，电子设备中的各个模块均不定时处理各个子片。It should be noted that the embodiments of the present application do not limit whether each module of the electronic device processes the sub-slices on a timer, nor whether some or all of the modules do so: some modules may process the sub-slices on a timer, all modules may do so, or none may.
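The difference between the pipelined scheme and the timed scheme can be sketched as a scheduling function for a single module. This is an illustration only; `schedule` and its parameters are hypothetical names, and the three-quarter-period processing time is taken from the example in the text.

```python
def schedule(durations, period, timed):
    """Return the (start, end) time of each sub-slice handled by one module."""
    spans, clock = [], 0.0
    for index, duration in enumerate(durations):
        if timed:
            # Timed scheme: wait for the start of the sub-slice's own period.
            start = max(clock, index * period)
        else:
            # Pipelined scheme: start as soon as the previous sub-slice is done.
            start = clock
        end = start + duration
        spans.append((start, end))
        clock = end
    return spans


T = 1.0
pipelined = schedule([0.75, 0.75], period=T, timed=False)
on_timer = schedule([0.75, 0.75], period=T, timed=True)
```

With T = 1 and each sub-slice taking 0.75 T, the pipelined scheme finishes sub-slice 2 at 1.5 T (one half of the second period), while the timed scheme finishes it at 1.75 T (three quarters of the second period), matching the two examples in the text.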

Optionally, if one or more modules of the electronic device take longer than the time period T to process one or more sub-slices, the electronic device may handle the sub-slices in the following ways, provided that the electronic device continues to operate normally. For example, in the pipelined processing scheme, if the time for processing M sub-slices is less than a first sub-slice threshold, the modules of the electronic device may process the sub-slices in a pipelined manner, where the first sub-slice threshold may be a preset maximum time for processing M sub-slices; otherwise, the modules may process some of the M sub-slices and discard the processing of the others (for example, substituting the processing result of the corresponding sub-slice of the previous frame), where M is an integer greater than 1. As another example, in the timed processing scheme, if the time period T is shorter than the time the face image acquisition module 410 needs to process sub-slice 1, the face image acquisition module 410 may discard the processing of sub-slice 1 (for example, substituting the processing result of sub-slice 1 of the previous frame) and, at the start of the second time period T, process sub-slice 2 on schedule. A processing time longer than the time period T may be caused, for example, by a fault in one or more modules of the electronic device, which is not limited in this application. Discarding the processing of a sub-slice includes aborting processing that has started but has not yet produced a result.
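The timed-processing fallback described above can be sketched as follows. This is only an illustration under stated assumptions, not the implementation of this application: the function name `process_with_deadline`, the `cost` callable, and the use of simulated durations in place of real timers are all hypothetical.

```python
from typing import Callable, Dict, List, Optional


def process_with_deadline(
    sub_slices: List[str],
    cost: Callable[[str], float],
    period_t: float,
    previous_frame: Dict[int, str],
) -> List[Optional[str]]:
    """Process one sub-slice per time period T (simulated, hypothetical).

    If a sub-slice would need longer than T, its processing is discarded and
    the result for the same position in the previous frame is substituted.
    """
    results: List[Optional[str]] = []
    for index, sub_slice in enumerate(sub_slices):
        if cost(sub_slice) <= period_t:
            results.append(f"processed({sub_slice})")
        else:
            # Discard: reuse the previous frame's result, or None if there is none.
            results.append(previous_frame.get(index))
    return results


if __name__ == "__main__":
    previous = {0: "processed(prev-slice-1)", 1: "processed(prev-slice-2)"}
    durations = {"slice-1": 1.5, "slice-2": 0.4}  # made-up processing times
    print(process_with_deadline(["slice-1", "slice-2"], durations.__getitem__, 1.0, previous))
```

In this sketch the next sub-slice always starts in the next period, which mirrors the statement that sub-slice 2 is processed on schedule at the start of the second time period T.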

S702. The first electronic device obtains a first coding unit according to the first subunit and the third subunit; after obtaining the first coding unit, it obtains a second coding unit according to the second subunit and the fourth subunit.

In a possible design, S702 above may include the following steps 7 to 10.

Step 7: The first electronic device encodes the third subunit to obtain a third coding unit.

For example, the third coding unit may be a network abstraction layer (NAL) unit of the video coding layer (VCL) type.

FIG. 12 is a schematic structural diagram of a bitstream provided by an embodiment of this application. As shown in FIG. 12, the bitstream may include a header (NALU header), a sequence parameter set (SPS), a picture parameter set (PPS), and at least one coding unit. The first electronic device may encode subunit 5 into coding unit 1b, which is a VCL-type NAL unit.

With reference to FIG. 9 or FIG. 10, the video encoding module 420 may encode the third subunit to obtain the third coding unit.

Step 8: The first electronic device obtains the first coding unit according to the first subunit and the third coding unit.

For example, the first coding unit may be a NAL unit of the supplemental enhancement information (SEI) type.

With reference to FIG. 12, the first electronic device may fill subunit 1 into a preset field of coding unit 1b to obtain coding unit 1a, which is an SEI-type NAL unit. Optionally, the preset field may be the supplemental enhancement information (SEI) field.

With reference to FIG. 9 or FIG. 10, the video encoding module 420 may encode the first subunit and the third coding unit to obtain the first coding unit.

In the embodiments of this application, the first electronic device encodes a subunit of the face depth image and a subunit of the two-dimensional face image into the same coding unit instead of transmitting them independently. This reduces the complexity that transmitting multiple bitstreams and synchronizing them in time would otherwise add.

Step 9: The first electronic device encodes the fourth subunit to obtain a fourth coding unit.

For example, the fourth coding unit may be a VCL-type NAL unit.

With reference to FIG. 12, the first electronic device may encode subunit 6 into coding unit 2b, which is a VCL-type NAL unit.

With reference to FIG. 9 or FIG. 10, the video encoding module 420 may encode the fourth subunit to obtain the fourth coding unit.

Step 10: The first electronic device obtains the second coding unit according to the second subunit and the fourth coding unit.

For example, the second coding unit may be an SEI-type NAL unit.

With reference to FIG. 12, the first electronic device may fill subunit 2 into a preset field of coding unit 2b to obtain coding unit 2a, which is an SEI-type NAL unit. Optionally, the preset field may be the supplemental enhancement information (SEI) field.

With reference to FIG. 9 or FIG. 10, the video encoding module 420 may encode the second subunit and the fourth coding unit to obtain the second coding unit.

Similarly, with reference to FIG. 12, the first electronic device may obtain coding unit 3b, coding unit 3a, coding unit 4b, and coding unit 4a in the manner described in steps 7 and 8 above. Details are not repeated here.
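Steps 7 to 10 can be sketched as follows. This is a simplified illustration, not real H.264/H.265 bitstream syntax: the length-prefixed layout, the function names, and the use of `zlib` as a stand-in for a video encoder are all assumptions made for the example.

```python
import struct
import zlib
from typing import Iterable, Iterator


def encode_2d_subunit(pixels: bytes) -> bytes:
    """Stand-in for video encoding of a 2D subunit (zlib, NOT a real codec)."""
    return zlib.compress(pixels)


def build_coding_unit(depth_subunit: bytes, encoded_2d: bytes) -> bytes:
    """Place the depth subunit in a preset field ahead of the encoded 2D payload.

    Hypothetical layout: [4-byte big-endian depth length][depth][encoded 2D].
    """
    return struct.pack(">I", len(depth_subunit)) + depth_subunit + encoded_2d


def encode_frame(
    depth_subunits: Iterable[bytes], image_subunits: Iterable[bytes]
) -> Iterator[bytes]:
    """Yield one combined coding unit per (depth, 2D) pair, in order."""
    for depth, image in zip(depth_subunits, image_subunits):
        encoded = encode_2d_subunit(image)       # step 7 / step 9
        yield build_coding_unit(depth, encoded)  # step 8 / step 10
```

Because `encode_frame` is a generator, each combined unit becomes available as soon as its pair has been handled, which is the property the pipelined scheme relies on.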

In the embodiments of this application, the video encoding module of the first electronic device may encode, in a pipelined manner, the subunits of the face depth image and the subunits of the two-dimensional face image that it receives in a pipelined manner, thereby reducing the delay with which the second electronic device obtains the three-dimensional face image.

With reference to FIG. 9 or FIG. 10, the video encoding module 420 may encode the first subunit and the third subunit to obtain the first coding unit; after obtaining the first coding unit, it encodes the second subunit and the fourth subunit to obtain the second coding unit. Optionally, the video encoding module 420 may send the first coding unit and the second coding unit to the network transmission module 430 through a coding unit buffer.

With reference to FIG. 11, in the second time period T, the video encoding module 420 may encode subunit 1 and subunit 5 to obtain coding unit 1a and send it to the network transmission module 430. In the third time period T, the video encoding module 420 encodes subunit 2 and subunit 6 to obtain coding unit 2a and sends it to the network transmission module 430. Similarly, in the fourth time period T, the video encoding module 420 may encode subunit 3 and subunit 7 to obtain coding unit 3a and send it to the network transmission module 430. In the fifth time period T, the video encoding module 420 may encode subunit 4 and subunit 8 to obtain coding unit 4a and send it to the network transmission module 430.

S703. The first electronic device sends the first coding unit to the second electronic device; after sending the first coding unit, it sends the second coding unit to the second electronic device.

With reference to FIG. 9 or FIG. 10, the network transmission module 430 may send the first coding unit to the second electronic device; after sending the first coding unit, it sends the second coding unit to the second electronic device.

In the embodiments of this application, the first electronic device may send the encoded subunits of the face depth image and of the two-dimensional face image to the second electronic device in a pipelined manner, so that the second electronic device obtains the three-dimensional face image in a pipelined manner. This reduces the delay with which the second electronic device obtains the three-dimensional face image and improves the efficiency of obtaining the three-dimensional video.

With reference to FIG. 11, in the third time period T, the network transmission module 430 may send coding unit 1a to the second electronic device. In the fourth time period T, the network transmission module 430 sends coding unit 2a to the second electronic device. In the fifth time period T, the network transmission module 430 sends coding unit 3a to the second electronic device. In the sixth time period T, the network transmission module 430 sends coding unit 4a to the second electronic device.

In some embodiments, the first electronic device may send the first coding unit and the second coding unit to the second electronic device on a timed basis.

With reference to FIG. 9 or FIG. 10, the network transmission module 430 may send the first coding unit and the second coding unit to the second electronic device on a timed basis.

In some embodiments, the first electronic device may display a three-dimensional video of the current video scene of its user. The first electronic device may process the multiple subunits of the face depth image and the multiple subunits of the two-dimensional face image in a pipelined manner, thereby improving the efficiency of obtaining the three-dimensional video.

Optionally, the three-dimensional video call method provided by the embodiments of this application may further include: the first electronic device obtains a first three-dimensional face sub-image according to the first subunit and the third subunit; after obtaining the first three-dimensional face sub-image, it obtains a second three-dimensional face sub-image according to the second subunit and the fourth subunit.

With reference to FIG. 9 or FIG. 10, the three-dimensional face generation module 440 may obtain the first three-dimensional face sub-image according to the first subunit and the third subunit and send it to the display module 450. After obtaining and sending the first three-dimensional face sub-image, it obtains the second three-dimensional face sub-image according to the second subunit and the fourth subunit and sends it to the display module 450.

Optionally, the three-dimensional face generation module 440 may use an artificial intelligence algorithm model or a three-dimensional face generation algorithm model to obtain a three-dimensional face sub-image from a subunit of the face depth image and a subunit of the two-dimensional face image.

With reference to FIG. 8 and FIG. 9, the three-dimensional face generation module 440 may generate three-dimensional face sub-image 1 from subunit 1 and subunit 5 and send it to the display module 450. It then generates three-dimensional face sub-image 2 from subunit 2 and subunit 6 and sends it to the display module 450, and generates three-dimensional face sub-image 3 from subunit 3 and subunit 7 and sends it to the display module 450. Finally, it generates three-dimensional face sub-image 4 from subunit 4 and subunit 8 and sends it to the display module 450. It should be noted that the three-dimensional face generation module 440 shown in FIG. 10 can implement the same function, which is not described again here.

Optionally, the three-dimensional face generation module 440 may send a two-dimensional scene image to the display module 450. For example, when the two-dimensional face image includes both a scene image and a face image, the two-dimensional scene image may be obtained by the three-dimensional face generation module 440 from the two-dimensional face image. Alternatively, the two-dimensional scene image may be sent to the three-dimensional face generation module 440 by the face image acquisition module 410.

Optionally, the three-dimensional video call method provided by the embodiments of this application may further include: the first electronic device superimposes the first three-dimensional face sub-image on the two-dimensional scene image; after doing so, it superimposes the second three-dimensional face sub-image on the two-dimensional scene image.

With reference to FIG. 9 or FIG. 10, the display module 450 may superimpose the first three-dimensional face sub-image on the two-dimensional scene image and, after doing so, superimpose the second three-dimensional face sub-image on the two-dimensional scene image. Optionally, the display subsystem of the display module 450 may receive the multiple three-dimensional face sub-images and the two-dimensional scene image through a preview buffer in a pipelined manner, superimpose the three-dimensional face sub-images on the two-dimensional scene image in a pipelined manner to obtain the three-dimensional face image, and then transfer it to the display screen for display.

For example, the display module 450 may superimpose three-dimensional face sub-image 1 on the two-dimensional scene image, then superimpose three-dimensional face sub-image 2, then three-dimensional face sub-image 3, and finally three-dimensional face sub-image 4, thereby obtaining the three-dimensional face image.
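The incremental superposition can be sketched as follows. This is a heavily simplified illustration: it assumes each rendered face sub-image is a horizontal strip of pixels stacked top to bottom, uses `None` for transparent pixels, and represents images as nested lists. None of these choices come from this application, and the function names are hypothetical.

```python
from typing import Iterable, Iterator, List, Optional

Pixel = Optional[int]  # None marks a transparent pixel in a sub-image


def overlay_strip(scene: List[List[int]], strip: List[List[Pixel]], top: int) -> None:
    """Paste one rendered face sub-image onto the scene in place, starting at row `top`."""
    for dy, row in enumerate(strip):
        for x, value in enumerate(row):
            if value is not None:
                scene[top + dy][x] = value


def compose(
    scene: List[List[int]], strips: Iterable[List[List[Pixel]]]
) -> Iterator[List[List[int]]]:
    """Overlay strips one after another, yielding the partial result each time."""
    canvas = [row[:] for row in scene]  # leave the caller's scene untouched
    top = 0
    for strip in strips:
        overlay_strip(canvas, strip, top)
        top += len(strip)
        yield [row[:] for row in canvas]
```

Yielding after every strip reflects the idea that the display side does not wait for the whole frame before starting to composite.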

In this way, by performing S701 to S704 above multiple times, the first electronic device can obtain multiple frames of three-dimensional face images and thus display a three-dimensional video of its user. Processing the face depth image and the two-dimensional face image in a pipelined manner at the granularity of one pair of subunits reduces the delay with which the first electronic device obtains the three-dimensional face image, thereby improving the efficiency of displaying the three-dimensional video.

S704. The second electronic device receives the first coding unit from the first electronic device; after receiving the first coding unit, it receives the second coding unit from the first electronic device.

FIG. 13 is a schematic application diagram of the second electronic device provided by an embodiment of this application. With reference to FIG. 13, the network transmission module 610 may be configured to receive the first coding unit from the first electronic device and send it to the video decoding module 620. After receiving and forwarding the first coding unit, it receives the second coding unit from the first electronic device and sends it to the video decoding module 620.

With reference to FIG. 11, in the fourth time period T, the network transmission module 610 obtains coding unit 1a and sends it to the video decoding module 620. In the fifth time period T, it obtains coding unit 2a and sends it to the video decoding module 620. In the sixth time period T, it obtains coding unit 3a and sends it to the video decoding module 620. In the seventh time period T, it obtains coding unit 4a and sends it to the video decoding module 620.

S705. The second electronic device obtains the first subunit and the third subunit according to the first coding unit; after obtaining the first subunit and the third subunit, it obtains the second subunit and the fourth subunit according to the second coding unit.

With reference to FIG. 13, the video decoding module 620 may decode the first coding unit to obtain the first subunit and the third subunit. Optionally, the video decoding module 620 may transfer the first subunit to the three-dimensional face generation module 440 through a face depth image buffer and transfer the third subunit to the three-dimensional face generation module 440 through a two-dimensional face image buffer. Similarly, the video decoding module 620 may decode the second coding unit to obtain the second subunit and the fourth subunit. Optionally, the video decoding module 620 may transfer the second subunit to the three-dimensional face generation module 440 through the face depth image buffer and transfer the fourth subunit to the three-dimensional face generation module 440 through the two-dimensional face image buffer.

With reference to FIG. 11, in the fifth time period T, the video decoding module 620 decodes coding unit 1a to obtain subunit 1 and subunit 5 and sends them to the three-dimensional face generation module 630. In the sixth time period T, it decodes coding unit 2a to obtain subunit 2 and subunit 6 and sends them to the three-dimensional face generation module 630. In the seventh time period T, it decodes coding unit 3a to obtain subunit 3 and subunit 7 and sends them to the three-dimensional face generation module 630. In the eighth time period T, it decodes coding unit 4a to obtain subunit 4 and subunit 8 and sends them to the three-dimensional face generation module 630.

In some embodiments, S705 above may include the following steps 11 to 14.

Step 11: The second electronic device parses the first coding unit to obtain the first subunit and the third coding unit.

With reference to FIG. 13, the video decoding module 620 may parse the first coding unit to obtain the first subunit and the third coding unit. For example, the video decoding module 620 may parse coding unit 1a to obtain subunit 1 and coding unit 1b.

Step 12: The second electronic device decodes the third coding unit to obtain the third subunit.

With reference to FIG. 13, the video decoding module 620 may decode the third coding unit to obtain the third subunit. For example, the video decoding module 620 may decode coding unit 1b to obtain subunit 5.

Step 13: After decoding the third coding unit, the second electronic device parses the second coding unit to obtain the second subunit and the fourth coding unit.

With reference to FIG. 13, the video decoding module 620 may parse the second coding unit to obtain the second subunit and the fourth coding unit. For example, the video decoding module 620 may parse coding unit 2a to obtain subunit 2 and coding unit 2b.

Step 14: The second electronic device decodes the fourth coding unit to obtain the fourth subunit.

With reference to FIG. 13, the video decoding module 620 may decode the fourth coding unit to obtain the fourth subunit. For example, the video decoding module 620 may decode coding unit 2b to obtain subunit 6.

Similarly, the second electronic device may obtain subunit 3 and subunit 7, and subunit 4 and subunit 8, in the manner described in steps 11 and 12 above. Details are not repeated here.

With reference to FIG. 11, in the fifth time period T, the video decoding module 620 parses coding unit 1a to obtain subunit 1 and coding unit 1b, decodes coding unit 1b to obtain subunit 5, and then sends subunit 1 and subunit 5 to the three-dimensional face generation module 630. In the sixth time period T, it parses coding unit 2a to obtain subunit 2 and coding unit 2b, decodes coding unit 2b to obtain subunit 6, and then sends subunit 2 and subunit 6 to the three-dimensional face generation module 630. In the seventh time period T, it parses coding unit 3a to obtain subunit 3 and coding unit 3b, decodes coding unit 3b to obtain subunit 7, and then sends subunit 3 and subunit 7 to the three-dimensional face generation module 630. In the eighth time period T, it parses coding unit 4a to obtain subunit 4 and coding unit 4b, decodes coding unit 4b to obtain subunit 8, and then sends subunit 4 and subunit 8 to the three-dimensional face generation module 630.
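The parse-then-decode order of steps 11 to 14 can be sketched as follows. This is an illustration only, not real NAL or SEI parsing: it assumes a hypothetical layout in which a 4-byte big-endian length is followed by the depth subunit and then by the encoded 2D payload, and it uses `zlib` as a stand-in for a video decoder.

```python
import struct
import zlib
from typing import Iterable, Iterator, Tuple


def parse_coding_unit(unit: bytes) -> Tuple[bytes, bytes]:
    """Split a combined unit into (depth subunit, still-encoded 2D payload)."""
    if len(unit) < 4:
        raise ValueError("coding unit too short for the length prefix")
    (depth_len,) = struct.unpack(">I", unit[:4])
    if len(unit) < 4 + depth_len:
        raise ValueError("truncated depth field")
    return unit[4:4 + depth_len], unit[4 + depth_len:]


def decode_2d_payload(payload: bytes) -> bytes:
    """Stand-in for video decoding of the 2D payload (zlib, NOT a real codec)."""
    return zlib.decompress(payload)


def decode_stream(units: Iterable[bytes]) -> Iterator[Tuple[bytes, bytes]]:
    """Yield (depth subunit, 2D subunit) pairs in arrival order."""
    for unit in units:
        depth, payload = parse_coding_unit(unit)  # step 11 / step 13
        yield depth, decode_2d_payload(payload)   # step 12 / step 14
```

Each pair is yielded as soon as its unit is decoded, so a downstream consumer can start generating a three-dimensional sub-image before later units have arrived.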

S706. The second electronic device obtains the first three-dimensional face sub-image according to the first subunit and the third subunit; after obtaining the first three-dimensional face sub-image, it obtains the second three-dimensional face sub-image according to the second subunit and the fourth subunit.

With reference to FIG. 13, the three-dimensional face generation module 630 may obtain the first three-dimensional face sub-image according to the first subunit and the third subunit and send it to the display module 640. After obtaining and sending the first three-dimensional face sub-image, it obtains the second three-dimensional face sub-image according to the second subunit and the fourth subunit and sends it to the display module 640.

With reference to FIG. 11, in the sixth time period T, the three-dimensional face generation module 630 may generate three-dimensional face sub-image 1 from subunit 1 and subunit 5 and send it to the display module 640. Then, in the seventh time period T, it generates three-dimensional face sub-image 2 from subunit 2 and subunit 6 and sends it to the display module 640. In the eighth time period T, it generates three-dimensional face sub-image 3 from subunit 3 and subunit 7 and sends it to the display module 640. In the ninth time period T, it generates three-dimensional face sub-image 4 from subunit 4 and subunit 8 and sends it to the display module 640.

Optionally, the three-dimensional face generation module 630 may send the two-dimensional scene image to the display module 640. When the two-dimensional face image includes a scene image, the two-dimensional scene image may be obtained by the three-dimensional face generation module 630 from the two-dimensional face image. When the two-dimensional face image does not include a scene image, the two-dimensional scene image may be sent to the three-dimensional face generation module 630 by the first electronic device.

S707. The second electronic device superimposes the first three-dimensional face sub-image on the two-dimensional scene image; after doing so, it superimposes the second three-dimensional face sub-image on the two-dimensional scene image.

With reference to FIG. 13, the display module 640 may superimpose the first three-dimensional face sub-image on the two-dimensional scene image and, after doing so, superimpose the second three-dimensional face sub-image on the two-dimensional scene image. Optionally, the display subsystem of the display module 640 may receive the multiple three-dimensional face sub-images and the two-dimensional scene image through a preview buffer in a pipelined manner, superimpose the three-dimensional face sub-images on the two-dimensional scene image in a pipelined manner to obtain the three-dimensional face image, and then transfer it to the display screen for display.

For example, the display module 640 may superimpose the three-dimensional face sub-images on the two-dimensional scene image to obtain the three-dimensional face image. Specifically, with reference to FIG. 11, in the seventh time period T, the display module 640 may superimpose three-dimensional face sub-image 1 on the two-dimensional scene image. In the eighth time period T, it superimposes three-dimensional face sub-image 2 on the two-dimensional scene image. In the ninth time period T, it superimposes three-dimensional face sub-image 3 on the two-dimensional scene image. In the tenth time period T, it superimposes three-dimensional face sub-image 4 on the two-dimensional scene image, thereby obtaining the three-dimensional face image.
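The timing walked through with reference to FIG. 11 follows a simple rule: each stage handles subunit pair k one time period T after the previous stage did. The sketch below restates that rule. The stage list, the function names, and the placement of acquisition in the first time period are assumptions inferred from the text, and the frame-level comparison further assumes that every stage would still need one time period T per pair.

```python
STAGES = [
    "acquire (410)",
    "encode (420)",
    "send (430)",
    "receive (610)",
    "decode (620)",
    "generate 3D (630)",
    "overlay (640)",
]


def slot_for(stage_index: int, pair_index: int) -> int:
    """Return the 1-based time period T in which a stage handles a subunit pair.

    stage_index is 0-based into STAGES; pair_index is 1-based.
    """
    return pair_index + stage_index


def pipelined_latency(num_pairs: int, num_stages: int = len(STAGES)) -> int:
    """Time periods until the last pair leaves the last stage when pipelined."""
    return num_pairs + num_stages - 1


def frame_level_latency(num_pairs: int, num_stages: int = len(STAGES)) -> int:
    """Time periods if every stage waited for all pairs of the frame (assumption)."""
    return num_pairs * num_stages


if __name__ == "__main__":
    for stage_index, name in enumerate(STAGES):
        slots = [slot_for(stage_index, pair) for pair in range(1, 5)]
        print(f"{name:18s} pairs 1-4 in periods {slots}")
    print(pipelined_latency(4), frame_level_latency(4))
```

With four subunit pairs and these seven stages, the last overlay falls in the tenth time period T, matching the description above, whereas the frame-at-a-time comparison under the stated assumption would take 28 time periods.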

In a possible design, the three-dimensional video call method provided by the embodiments of this application may further include: detecting an adjustment action and, in response to the adjustment action, adjusting the angle at which the face is displayed in the three-dimensional face image.

With reference to FIG. 13, the touch module 650 is configured to detect the adjustment action, and the display module 640 may be configured to adjust, in response to the adjustment action, the angle at which the face is displayed in the three-dimensional face image.

For example, the adjustment action may be a left deflection angle or a right deflection angle set by the user. For instance, the electronic device may define the angle at which the three-dimensional face image is displayed from the front as 0 degrees, and the display interface of the electronic device may include a left deflection angle setting area and a right deflection angle setting area. The user can adjust the angle at which the face is displayed by setting the value in either area.

For example, the adjustment action may be a rotation gesture performed by the user on the touchscreen, such as placing two fingers on the touchscreen and turning them clockwise or counterclockwise. As shown in (a) of FIG. 14, the current three-dimensional video displays a frontal three-dimensional image of the face. After the user places two fingers on the touchscreen and rotates them clockwise, the display angle of the face is adjusted as shown in (b) of FIG. 14, and a three-dimensional image of the left side of the face is displayed.
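One way to turn such a two-finger rotation into an angle change is sketched below. It is an assumption-laden illustration: the function names, the one-to-one mapping from gesture angle to display angle, and the 90-degree limit are hypothetical, and how a positive angle maps to a left or right view of the face is left to the display side.

```python
import math
from typing import Tuple

Point = Tuple[float, float]


def rotation_degrees(start: Tuple[Point, Point], end: Tuple[Point, Point]) -> float:
    """Signed rotation of the line joining two fingers, in degrees.

    Uses screen coordinates (x to the right, y downward), so a positive value
    is a clockwise rotation as seen by the user.
    """
    (ax, ay), (bx, by) = start
    (cx, cy), (dx, dy) = end
    before = math.atan2(by - ay, bx - ax)
    after = math.atan2(dy - cy, dx - cx)
    delta = math.degrees(after - before)
    # Wrap into [-180, 180) so a small turn is never reported as a near-full turn.
    return (delta + 180.0) % 360.0 - 180.0


def adjust_face_angle(current: float, gesture: float, limit: float = 90.0) -> float:
    """Add the gesture rotation to the displayed face angle, clamped to +/- limit.

    0 degrees is the frontal view; the limit value is a hypothetical choice.
    """
    return max(-limit, min(limit, current + gesture))
```

For instance, fingers moving from a horizontal line to a vertical one, with the second finger ending below the first, produce a rotation of about 90 degrees clockwise, which `adjust_face_angle` would add to the current display angle.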

It should be noted that the three-dimensional video call method provided by the foregoing embodiments is described using three-dimensional display of the face in the call video as an example. The method can also display both the face and the scene in the call video in three dimensions, and the specific implementation is similar to S701 to S707 above. In that case, the face depth information may include depth information of the face and of the scene, and the face depth image may include a depth image of the face and a depth image of the scene. Details are not repeated here.

Based on the three-dimensional video call method shown in FIG. 7, the first electronic device divides the face depth image and the two-dimensional face image each into multiple subunits, encodes them in a pipelined manner at the granularity of one pair of subunits, and sends them to the second electronic device in a pipelined manner. The second electronic device receives and decodes them in a pipelined manner to obtain the subunits of the face depth image and of the two-dimensional face image, and obtains the three-dimensional face sub-images in a pipelined manner. In this way, the delay with which the second electronic device obtains the three-dimensional face image is reduced, which in turn reduces the delay in obtaining the three-dimensional video.

An embodiment of this application provides a three-dimensional video call system. The system includes one or more of the first electronic devices described above and one or more second electronic devices.

An embodiment of this application provides a computer-readable storage medium. The computer-readable storage medium stores a computer program or instructions. When the computer program or instructions are run on a computer, the computer is caused to perform the three-dimensional video call method described in the foregoing method embodiments.

An embodiment of this application provides a computer program product. The computer program product includes a computer program or instructions. When the computer program or instructions are run on a computer, the computer is caused to perform the three-dimensional video call method described in the foregoing method embodiments.

The foregoing embodiments may be implemented in whole or in part by software, hardware (such as circuits), firmware, or any combination thereof. When implemented in software, the foregoing embodiments may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions or computer programs. When the computer instructions or computer programs are loaded or executed on a computer, the processes or functions described in the embodiments of this application are generated in whole or in part. The computer may be an electronic device, a general-purpose computer, a special-purpose computer, a computer network, or another programmable apparatus. The computer instructions may be stored in a computer-readable storage medium, or transmitted from one computer-readable storage medium to another. For example, the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center in a wired or wireless (for example, infrared, radio, or microwave) manner. The computer-readable storage medium may be any usable medium accessible to a computer, or a data storage device, such as a server or data center, that contains one or more sets of usable media. The usable medium may be a magnetic medium (for example, a floppy disk, hard disk, or magnetic tape), an optical medium (for example, a DVD), or a semiconductor medium. The semiconductor medium may be a solid-state drive.

In this application, "at least one" means one or more, and "a plurality of" means two or more. "At least one of the following items" or a similar expression refers to any combination of these items, including any combination of a single item or plural items. For example, at least one of a, b, or c may represent: a, b, c, a-b, a-c, b-c, or a-b-c, where each of a, b, and c may be single or multiple.
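As an illustration only, the seven combinations listed above are exactly the non-empty subsets of {a, b, c}. The short Python sketch below enumerates them; the function name `at_least_one_of` is hypothetical and is not part of this application.

```python
from itertools import combinations
from typing import List, Sequence


def at_least_one_of(items: Sequence[str]) -> List[str]:
    """Return every non-empty combination of items, smallest combinations first."""
    return ["-".join(combo)
            for size in range(1, len(items) + 1)
            for combo in combinations(items, size)]


combos = at_least_one_of(["a", "b", "c"])
# combos == ['a', 'b', 'c', 'a-b', 'a-c', 'b-c', 'a-b-c']
```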

It should be understood that, in the various embodiments of this application, the sequence numbers of the foregoing processes do not imply an execution order. The execution order of the processes should be determined by their functions and internal logic, and the sequence numbers should not constitute any limitation on the implementation of the embodiments of this application.

A person of ordinary skill in the art will appreciate that the units or modules and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented by electronic hardware, or by a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. A skilled person may use different methods to implement the described functions for each specific application, but such an implementation should not be considered beyond the scope of this application.

A person skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, apparatuses, and units or modules described above may be found in the corresponding processes of the foregoing method embodiments, and are not repeated here.

In the several embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into units or modules is merely a division by logical function, and other divisions are possible in actual implementation: multiple units or modules may be combined or integrated into another system, or some units or modules may be omitted or their corresponding functions not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be indirect coupling or communication connection through some interfaces, apparatuses, or units/modules, and may be electrical, mechanical, or in other forms.

The units/modules described as separate components may or may not be physically separate, and the components shown as units/modules may or may not be physical units/modules; that is, they may be located in one place or distributed across multiple network units/modules. Some or all of the units/modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment.

In addition, the functional units/modules in the embodiments of this application may be integrated into one processing unit/module, each unit/module may exist physically on its own, or two or more units/modules may be integrated into one unit/module.

If the functions are implemented in the form of software functional units/modules and sold or used as an independent product, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of this application in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions for causing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of this application. The foregoing storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.

In the embodiments of this application, provided there is no logical contradiction, the embodiments may refer to one another. For example, methods and/or terms may be cross-referenced between method embodiments, functions and/or terms may be cross-referenced between apparatus embodiments, and functions and/or terms may be cross-referenced between apparatus embodiments and method embodiments.

The foregoing is merely specific implementations of this application, but the protection scope of this application is not limited thereto. Any variation or replacement that a person skilled in the art could readily conceive of within the technical scope disclosed in this application shall fall within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims

An electronic device, comprising: a face image acquisition module, a video coding module, and a network transmission module; wherein,

the face image acquisition module is used for acquiring a face depth image and a face two-dimensional image; dividing the face depth image into a plurality of subunits including a first subunit and a second subunit; dividing the face two-dimensional image into a plurality of subunits including a third subunit and a fourth subunit; sending the first subunit and the third subunit to the video coding module; and after the first subunit and the third subunit are sent, sending the second subunit and the fourth subunit to the video coding module; wherein the first subunit corresponds to the third subunit and the second subunit corresponds to the fourth subunit;

The video coding module is used for obtaining a first coding unit according to the first subunit and the third subunit and sending the first coding unit to the network transmission module; after the first coding unit is obtained and sent, a second coding unit is obtained according to the second subunit and the fourth subunit, and the second coding unit is sent to the network transmission module;

the network transmission module is used for sending the first coding unit to a second electronic device; and after the first coding unit is sent, sending the second coding unit to the second electronic device.
The electronic device of claim 1, wherein

the face image acquisition module is specifically used for receiving face depth information; receiving face two-dimensional information; and obtaining the face depth image according to the face depth information and obtaining the face two-dimensional image according to the face two-dimensional information.
The electronic device according to claim 1 or 2, characterized in that,

the video coding module is specifically configured to:

encode the third subunit to obtain a third coding unit, and obtain the first coding unit according to the first subunit and the third coding unit; and

encode the fourth subunit to obtain a fourth coding unit, and obtain the second coding unit according to the second subunit and the fourth coding unit.
The electronic device of any one of claims 1-3, further comprising: a face three-dimensional generation module and a display module; wherein,

the face three-dimensional generation module is used for obtaining a first face three-dimensional sub-image according to the first sub-unit and the third sub-unit and sending the first face three-dimensional sub-image to the display module; after the first face three-dimensional sub-image is obtained and sent, a second face three-dimensional sub-image is obtained according to the second sub-unit and the fourth sub-unit, and the second face three-dimensional sub-image is sent to the display module;

the display module is used for superimposing the first face three-dimensional sub-image and a scene two-dimensional image; and after the first face three-dimensional sub-image and the scene two-dimensional image are superimposed, superimposing the second face three-dimensional sub-image and the scene two-dimensional image.
An electronic device, comprising: a network transmission module, a video decoding module, a three-dimensional face generation module, and a display module; wherein,

the network transmission module is used for receiving a first coding unit from a first electronic device and sending the first coding unit to the video decoding module; and after the first coding unit is received and sent, receiving a second coding unit from the first electronic device and sending the second coding unit to the video decoding module;

the video decoding module is used for obtaining a first subunit and a third subunit according to the first coding unit; after the first subunit and the third subunit are obtained, obtaining a second subunit and a fourth subunit according to the second coding unit; the first subunit and the second subunit are subunits in the face depth image respectively, the third subunit and the fourth subunit are subunits in the face two-dimensional image respectively, the first subunit corresponds to the third subunit, and the second subunit corresponds to the fourth subunit;

the three-dimensional face generation module is used for obtaining a first face three-dimensional sub-image according to the first sub-unit and the third sub-unit and sending the first face three-dimensional sub-image to the display module; after the first face three-dimensional sub-image is obtained and sent, a second face three-dimensional sub-image is obtained according to the second sub-unit and the fourth sub-unit, and the second face three-dimensional sub-image is sent to the display module;

the display module is used for superimposing the first face three-dimensional sub-image and a scene two-dimensional image; and after the first face three-dimensional sub-image and the scene two-dimensional image are superimposed, superimposing the second face three-dimensional sub-image and the scene two-dimensional image.
The electronic device of claim 5, wherein

the video decoding module is further configured to: parse the first coding unit to obtain the first subunit and a third coding unit; decode the third coding unit to obtain the third subunit; after decoding the third coding unit, parse the second coding unit to obtain the second subunit and a fourth coding unit; and decode the fourth coding unit to obtain the fourth subunit.
A three-dimensional video call method, comprising:

acquiring a face depth image and a face two-dimensional image; dividing the face depth image into a plurality of sub-units including a first sub-unit and a second sub-unit; dividing the face two-dimensional image into a plurality of subunits including a third subunit and a fourth subunit; wherein the first subunit corresponds to the third subunit and the second subunit corresponds to the fourth subunit;

Obtaining a first coding unit according to the first subunit and the third subunit; after the first coding unit is obtained, a second coding unit is obtained according to the second subunit and the fourth subunit;

sending the first coding unit to a second electronic device; and after the first coding unit is sent, sending the second coding unit to the second electronic device.
The three-dimensional video call method according to claim 7, wherein the acquiring the face depth image and the face two-dimensional image comprises:

receiving face depth information; receiving face two-dimensional information; and obtaining the face depth image according to the face depth information and obtaining the face two-dimensional image according to the face two-dimensional information.
The three-dimensional video call method according to claim 7 or 8, further comprising:

encoding the third subunit to obtain a third coding unit, and obtaining the first coding unit according to the first subunit and the third coding unit; and

encoding the fourth subunit to obtain a fourth coding unit, and obtaining the second coding unit according to the second subunit and the fourth coding unit.
The three-dimensional video call method according to any one of claims 7 to 9, further comprising:

obtaining a first face three-dimensional sub-image according to the first sub-unit and the third sub-unit; after the first face three-dimensional sub-image is obtained, a second face three-dimensional sub-image is obtained according to the second sub-unit and the fourth sub-unit;

superimposing the first face three-dimensional sub-image and a scene two-dimensional image; and after the first face three-dimensional sub-image and the scene two-dimensional image are superimposed, superimposing the second face three-dimensional sub-image and the scene two-dimensional image.
A three-dimensional video call method, comprising:

receiving a first coding unit from a first electronic device; and after receiving the first coding unit, receiving a second coding unit from the first electronic device;

obtaining a first subunit and a third subunit according to the first coding unit; after the first subunit and the third subunit are obtained, obtaining a second subunit and a fourth subunit according to the second coding unit; the first subunit and the second subunit are subunits in the face depth image respectively, the third subunit and the fourth subunit are subunits in the face two-dimensional image respectively, the first subunit corresponds to the third subunit, and the second subunit corresponds to the fourth subunit;

Obtaining a first face three-dimensional sub-image according to the first sub-unit and the third sub-unit; after the first face three-dimensional sub-image is obtained, a second face three-dimensional sub-image is obtained according to the second sub-unit and the fourth sub-unit;

superimposing the first face three-dimensional sub-image and a scene two-dimensional image; and after the first face three-dimensional sub-image and the scene two-dimensional image are superimposed, superimposing the second face three-dimensional sub-image and the scene two-dimensional image.
The three-dimensional video call method of claim 11, further comprising:

parsing the first coding unit to obtain the first subunit and a third coding unit; decoding the third coding unit to obtain the third subunit; after decoding the third coding unit, parsing the second coding unit to obtain the second subunit and a fourth coding unit; and decoding the fourth coding unit to obtain the fourth subunit.
A three-dimensional video call system, comprising an electronic device as claimed in any one of claims 1 to 4 and an electronic device as claimed in any one of claims 5 to 6.
A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program or instructions which, when run on a computer, cause the computer to perform the three-dimensional video call method of any one of claims 7 to 12.
A computer program product, comprising a computer program or instructions which, when run on a computer, cause the computer to perform the three-dimensional video call method of any one of claims 7 to 12.