[go: up one dir, main page]

CN111476901B - Three-dimensional human body shape representation method - Google Patents

Three-dimensional human body shape representation method Download PDF

Info

Publication number
CN111476901B
CN111476901B CN202010279674.5A CN202010279674A CN111476901B CN 111476901 B CN111476901 B CN 111476901B CN 202010279674 A CN202010279674 A CN 202010279674A CN 111476901 B CN111476901 B CN 111476901B
Authority
CN
China
Prior art keywords
human body
deformation
mesh
sub
reconstruction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010279674.5A
Other languages
Chinese (zh)
Other versions
CN111476901A (en
Inventor
张举勇
江博艺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Science and Technology of China USTC
Original Assignee
University of Science and Technology of China USTC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology of China USTC filed Critical University of Science and Technology of China USTC
Priority to CN202010279674.5A priority Critical patent/CN111476901B/en
Publication of CN111476901A publication Critical patent/CN111476901A/en
Application granted granted Critical
Publication of CN111476901B publication Critical patent/CN111476901B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • G06T17/20Finite element generation, e.g. wire-frame surface description, tesselation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/088Non-supervised learning, e.g. competitive learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • G06T9/002Image coding using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2210/00Indexing scheme for image generation or computer graphics
    • G06T2210/44Morphing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T90/00Enabling technologies or technologies with a potential or indirect contribution to GHG emissions mitigation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Graphics (AREA)
  • Geometry (AREA)
  • Multimedia (AREA)
  • Processing Or Creating Images (AREA)

Abstract

本发明公开了一种三维人体形状表示方法,一方面,通过处理采集到的人体网格数据集,可以得到大量的有标签的训练数据,增加了模型的鲁棒性;另一方面,通过使用变形表示,并定义了一种基于人体近似刚性块的变形表示,相比于直接使用欧氏距离能有更高的精度和鲁棒性。同时,针对人体的铰链式变形,设计了一种阶层式的重建网络,充分利用了人体的形状先验,提高了模型的精度。

Figure 202010279674

The invention discloses a three-dimensional human body shape representation method. On the one hand, by processing the collected human body grid data set, a large amount of labeled training data can be obtained, which increases the robustness of the model; on the other hand, by using Deformation representation, and defines a deformation representation based on the approximate rigid block of the human body, which can have higher accuracy and robustness than directly using the Euclidean distance. Meanwhile, for the hinged deformation of the human body, a hierarchical reconstruction network is designed, which makes full use of the shape prior of the human body and improves the accuracy of the model.

Figure 202010279674

Description

Three-dimensional human body shape representation method
Technical Field
The invention relates to the technical field of human body three-dimensional reconstruction, in particular to a three-dimensional human body shape representation method.
Background
The parameterized human body model has wide application in the fields of computer graphics and computer vision, including three-dimensional human body tracking, three-dimensional human body reconstruction and attitude estimation. Since the human body has various attributes including gender, race, posture, body type, and the like, which contain abundant geometric deformation, high-precision reconstruction of the human body is a challenging task. In recent years, with the development of deep learning, geometric reconstruction using a neural network has become a trend.
In the past, human reconstruction was primarily based on the skeletal skinning method. The method expresses the identity deformation space of a human body by a group of simple linear basis deformations, and then deforms the human body to a specific posture by using a skeleton. The method is simple and high in efficiency in calculation, decoupling of the identity and the posture is achieved, high-frequency change of a human identity deformation space is ignored, and geometric reconstruction accuracy is limited. In addition, motion has a relative motion representation of skeletal joint points, without encoding a prior distribution of human motion, and thus may produce an abnormal human model. The other main method is based on deformation expression, typically, a three-dimensional point coordinate is replaced by triangular deformation, and decoupling parametric modeling of a human body is realized by decomposing the deformation into parts related to various attributes such as identity, posture and the like. The method has higher geometric reconstruction accuracy, but the calculation process is more complex, the parameter quantity is large, and the speed is slower.
Recently, learning some shapes like human faces using neural networks has been developed. Through end-to-end training, the network maps the shape manifold to a low-dimensional nonlinear space, which shows higher precision. But the human body has significant large-scale deformation caused by different poses compared to the human face. A network architecture which achieves good accuracy in face modeling is directly applied to a human body grid, and a good result cannot be obtained. This is because the european coordinate is not easy to maintain the original deformation mode for large-scale deformation, and is easy to cause distortion. On the other hand, the network architecture is not designed for a special skeleton hinge structure of a human body, and the precision is not improved by using the prior of the shape of the human body.
Disclosure of Invention
The invention aims to provide a three-dimensional human body shape representation method which can improve the reconstruction precision of a three-dimensional human body model.
The purpose of the invention is realized by the following technical scheme:
a three-dimensional human body shape representation method comprising:
preprocessing the collected human body grid data set, deforming based on a standard posture, and calculating ACAP deformation representing and describing human body deformation characteristics of rigid block deformation to form a training data set;
constructing an encoder network and a hierarchical reconstruction network to form an end-to-end network structure, and training the network structure by using a training data set; in the training process, the ACAP deformation expression is coded through a coder network to obtain an identity attribute and an action attribute, a three-dimensional human body model is reconstructed through the reconstruction network by utilizing the identity attribute and the action attribute, and the coder network and a hierarchical reconstruction network are trained by utilizing an error between a reconstruction result and input training data;
and after the training is finished, inputting the identity attribute and the action attribute into a trained hierarchical reconstruction network to obtain a three-dimensional human body model reconstruction result.
According to the technical scheme provided by the invention, 1) a large amount of labeled training data can be obtained by processing the acquired human body grid data set, so that the robustness of the model is improved. 2) By using the deformation representation and defining a deformation representation based on human body approximate rigid blocks, higher precision and robustness can be achieved compared with the mode that Euclidean distance is directly used. 3) Aiming at the hinge type deformation of the human body, a hierarchical reconstruction network is designed, the shape prior of the human body is fully utilized, and the precision of the model is improved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on the drawings without creative efforts.
Fig. 1 is a flowchart of a method for representing a three-dimensional human body shape according to an embodiment of the present invention;
FIG. 2 is a reference body grid for calculating ACAP (consistent deformation) features and rigid blocks defined by calculating large-scale features g of a body according to an embodiment of the present invention;
fig. 3 is a schematic visualization diagram of reconstruction of each phase of a neutral posture and an original posture generated by a reconstruction network according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention are clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
In the field of human body parametric representation, the traditional skeleton skin-based method has high model speed and strong action expression capability, but the reconstruction precision is limited because simple linear dimension reduction is only carried out on the deformation space of the identity; on the other hand, the representation of the motion is directly based on the relative motion of the joint points and does not limit the rationality of the human motion. To this end, an embodiment of the present invention provides a method for representing a three-dimensional human body shape, as shown in fig. 1, the method mainly includes:
and 11, preprocessing the collected human body grid data set, deforming based on the standard posture, and calculating the ACAP deformation expression and the human body deformation characteristics describing the deformation of the rigid blocks to form a training data set.
The preferred embodiment of this step is as follows:
1) And carrying out standardization processing on the collected human body grid data set to obtain human body grid data with unified topology, and deforming to obtain a neutral human body grid corresponding to each human body grid data through the defined standard posture.
In the embodiment of the invention, the original human body grid data can be obtained from the network, including SCAPE, FAUST, dyna, MANO and the like. For collected human mesh data sets of different sources, the mesh representations of the collected human mesh data sets are inconsistent and need to be converted into a standard topology G = { V, E }, where V is a vertex set and E is an edge set. In this embodiment, a source data is used as a standard topology (e.g., SCAPE); calculating corresponding ACAP (consistent deformation) deformation representation by using action grids in standard topology (for example, seventy action grids in SCAPE), obtaining a group of priori deformation representation bases C of human body actions, and recovering the deformation representation of a human body by using a group of parameters w; and (3) using the linear space of Cw as a priori space of human body deformation, then optimizing a group of vertex coordinates p and rigid transformation parameters of standard topology, namely a rotation parameter R and a translation parameter t, and standardizing human body grid data sets from different sources.
The normalization process is to solve the following optimization problem:
Figure BDA0002446089520000031
wherein λ is 1 、λ 2 、λ 3 Are all set weights; | w | charging 1 Is a sparse regularization constraint on parameter w;
E prior is a human body deformation prior term determined by a prior deformation representation base C, so that the optimized grid vertex conforms to the human body shape as much as possible, and is represented as follows:
Figure BDA0002446089520000032
wherein, T i (w) is the prior deformation of a neighborhood of the ith vertex in the standard topology, the prior deformation represents that the base C is multiplied by w to obtain an ACAP characteristic, and then the ith vertex component of the ACAP characteristic is converted to obtain T i (w);q i Is the position of the ith vertex in the deformed reference grid under the standard topology; relative to the deformation reference grid, the position of the ith vertex of the grid to be optimized under the standard topology is p i (ii) a N (i) refers to a neighborhood vertex index set of the ith vertex under the standard topology, j refers to the jth vertex in the N (i), and the positions of the corresponding vertices in the deformation reference grid and the grid to be optimized are respectively represented as q j 、p j ;c ij Is an edge weight value calculated on the deformation reference grid, called cotangent weight, and is specifically q j And q is i The edge weight value of (2). In the embodiment of the invention, the deformation reference grid is a human body grid required for calculating ACAP deformation expression, and can be selected according to actual conditions or experience; as shown in fig. 2, the pre-selected warped reference grid is on the left. Those skilled in the art will appreciate that the deformed reference mesh corresponds one-to-one to all vertices of the standard topology, except for the different locations of the vertices; in the embodiment of the invention, the deformation reference gridFor reference, optimizing a grid under a standard topology; that is, q i Is stationary, p i Is optimized and the value is changed.
E icp Is a point-to-plane registration energy term with the nearest neighbor of the target mesh such that the optimization result is close to the target, expressed as:
Figure BDA0002446089520000041
d is an index set of the corresponding point pair selected by dynamic calculation under the standard topology; v. of l(i) Is and p i The corresponding point on the corresponding target grid,
Figure BDA0002446089520000042
representing point v l(i) Normal direction of (2); the target grid refers to human body grid data sets of different sources to be optimized;
E lan the L2 loss energy terms of a group of manually marked sparse corresponding points of the standard topology and the target grid are expressed as follows to reduce sliding errors and avoid local minimum values:
Figure BDA0002446089520000043
where L is a pre-selected set of corresponding points in a standard topology, the above equation limits the spatial location of the points in the pre-selected set of corresponding points L to be as small as possible.
In the embodiment of the invention, the human body grid data fitted based on the mode has consistent standard topology, and has tolerable registration error with the original data.
The human body model data constructed on the basis also needs to define the corresponding neutral human body grids. Illustratively, a consistent a-gesture in the SPRING dataset may be used as the neutral gesture. And for all the human body data of each identity, selecting a human body with the minimum error with the SPRING average grid as a candidate human body grid, and then converting the candidate human body grid into the posture A by using an ARAP (as rigid as possible) deformation method to be used as a neutral human body grid of the human body of the identity.
2) Calculating ACAP deformation representation of each human body grid data and corresponding neutral human body grid, and recording as f and f s (ii) a And respectively calculating human body deformation characteristics g and g of the human body grid and the corresponding neutral human body grid describing rigid block deformation according to the hinge type deformation characteristics of the human body based on the skeleton s To obtain a set of training data { f, f } s ,g,g s And correspondingly processing the collected data to obtain a training data set.
By the method, the human body grid data with the uniform topology are obtained, and each identity human body has a neutral posture grid. In the embodiment of the invention, the deformation expression of the geometric shape of the ACAP is used for replacing the original Euclidean coordinate shape expression to enhance the modeling precision of large-scale deformation and obtain better performance compared with a general linear model.
The formula for calculating the human body grid data and the ACAP deformation expression of the corresponding neutral human body grid is as follows:
Figure BDA0002446089520000051
similarly, i represents a vertex in the standard topology, and the above is standardized, so the standard topology can be any standard topology after being standardized or a previously selected standard topology. The other parameters have the same meanings as above and are not described in detail.
T in the above formula i Refers to the affine transformation matrix of the ith vertex in the standard topology, the affine transformation matrix T i The method comprises the steps of transforming a local umbrella-shaped structure of a neighborhood of a deformation reference grid into various deformation information of a structure of a computational grid; by polar decomposition, T i Decomposition to rigid R i And a non-rigid part S i (determined by 3 and 6 degrees of freedom, respectively), after disambiguation of the rigid deformation part of each vertex, the ACAP deformation of the computational mesh is obtainedRepresenting; the dimensions of the representation are 9 times the number of vertices, each vertex having 3 and 6 parametric record rigid and non-rigid deformations, respectively.
Taking each human body grid data and corresponding neutral human body grid as calculation grid to be substituted into the formula to obtain corresponding ACAP deformation expressions f and f s
In addition, some approximately rigid parts, such as the lower arm, the head, etc., are defined on the human body in consideration of the skeleton-based hinged deformation characteristics of the human body. On the basis, a deformation characteristic g of a large scale of a human body for describing the deformation of the rigid block is defined, and the calculation formula is as follows:
Figure BDA0002446089520000052
wherein v is k Is the set of vertices of the kth rigid block, q i' 、p i' Respectively representing the positions of the ith' vertex on the kth rigid block in the deformation reference grid and the calculation grid;
Figure BDA0002446089520000053
and &>
Figure BDA0002446089520000054
Respectively averaging the k-th rigid block on the deformation reference grid and the calculation grid; by means of radiation deformation of a rigid block>
Figure BDA0002446089520000055
And performing polar decomposition and parameterization to obtain the human body deformation characteristic, wherein the dimension of the human body deformation characteristic is 9 times of the number of approximate rigid blocks of the human body.
Similarly, each human body grid data and corresponding neutral human body grid are taken as calculation grids and are substituted into the formula to obtain deformation characteristics g and g s
As shown in fig. 2, the rigid blocks defined for calculating the deformation reference grid (left side of fig. 2) and the large-scale features of the human body (right side of fig. 2) are represented for ACAP deformation.
Step 12, constructing an encoder network and a hierarchical reconstruction network to form an end-to-end network structure, and training the network structure by using a training data set; in the training process, the ACAP deformation expression is coded through a coder network to obtain identity attributes and action attributes, the identity attributes and the action attributes are utilized to reconstruct a three-dimensional human body model through a reconstruction network, and errors between a reconstruction result and input training data are utilized to train the coder network and the hierarchical reconstruction network.
In the embodiment of the invention, the encoder network can adopt a standard variational self-encoder structure to learn the identity attribute e from the ACAP deformation expression f of the human body grid data s And attitude attribute e p Then, an end-to-end network structure is formed by combining the reconstructed network for training. Training data { f, f s ,g,g s Only f is used as the input of the network and the rest of the data is used to calculate the reconstruction error.
In the embodiment of the invention, the hierarchical reconstruction network is based on human body geometric prior and comprises two parts, wherein the first part is used for hierarchically reconstructing a main part of a three-dimensional human body model, the second part is used for reconstructing a difference part, and the reconstructed main part and the reconstructed difference part are added to obtain a reconstruction result; the reconstructed network is described as:
Figure BDA0002446089520000061
Figure BDA0002446089520000062
wherein,
Figure BDA0002446089520000063
representation utilization identity attribute e s And attitude attribute e p In conjunction with the reconstruction result of (a), based on the result of (b)>
Figure BDA0002446089520000064
Representation utilization identity attribute e s The reconstruction result of (2);
Figure BDA0002446089520000065
Indicates a reconstruction result->
Figure BDA0002446089520000066
The main part b in (1), W represents a skin layer;
Figure BDA0002446089520000067
Represents the result of a reconstruction>
Figure BDA0002446089520000068
The difference part d in (1);
Figure BDA0002446089520000069
Indicates a reconstruction result->
Figure BDA00024460895200000610
Main part b of (1) s
Figure BDA00024460895200000611
Representing a skin layer;
Figure BDA00024460895200000612
represents the result of a reconstruction>
Figure BDA00024460895200000613
The difference part d in s
Reconstruction of the main parts b and b s The following equation is used to reconstruct the human deformation characteristics
Figure BDA00024460895200000623
And &>
Figure BDA00024460895200000624
Figure BDA00024460895200000614
Figure BDA00024460895200000615
Wherein,
Figure BDA00024460895200000616
are independent mapping transformations, which may be modeled, for example, using a multi-tier perceptron.
Then, the skin layer is utilized
Figure BDA00024460895200000617
Reconstruction of the main parts b and b s
Figure BDA00024460895200000618
Figure BDA00024460895200000619
Wherein the skin layer
Figure BDA00024460895200000620
In the form of a matrix, is selected>
Figure BDA00024460895200000621
Represents->
Figure BDA00024460895200000622
The xth row and the y column; y is the number of rigid blocks, e.g., Y =16; the above principle is that the deformation of each vertex is obtained by linear convex combination of the relative rigid block deformation of the vertex, and the coefficient of the convex combination is determined by learning training and is not set manually.
Difference parts d and d s Expressed as:
Figure BDA0002446089520000071
Figure BDA0002446089520000072
wherein,
Figure BDA0002446089520000073
independent mapping transformations, illustratively, these mapping transformations may be modeled using a multi-tier perceptron.
As shown in fig. 3, an example of the visualization of the various computed portions of the reconstructed network over one reconstructed instance is given. B (e) in FIG. 3 s ,e p )、B(e s 0) i.e. main parts b and b as mentioned herein s
In the embodiment of the invention, the network can be trained end to end through the training data set, and after the training is finished, the decoupled low-dimensional hidden layer can be used for representing e s ,e p The method can be used for reconstructing a human body (or applied to aspects of human body editing, motion migration and the like), can also be applied to aspects of human body editing, motion migration and the like, and has wide application prospects in the fields of video live broadcast, virtual fitting, somatosensory games and the like.
During training, L1 mode loss between reconstruction and input features and distribution regularization loss of hidden layer variables can be adopted as loss functions.
The L1 mode loss can be expressed as:
Figure BDA0002446089520000074
Figure BDA0002446089520000075
in the above-mentioned formula, the compound has the following structure,
Figure BDA0002446089520000076
all are the reconstruction results of the relevant data in the training data, and the specific values 9 (number of deformation features) and 16 (number of rigid blocks) involved in the above formula) Are by way of example only and are not limiting.
The regular partial loss of the hidden layer parameter distribution is:
E sKL =D KL (q(e s |f)||p(e s ))
E pKL =D KL (q(e p |f)||p(e p ))
the two losses are KL divergence losses of which the distribution of the standard constraint hidden layer in the variational self-encoder meets the prior distribution.
And step 13, after training, inputting the identity attribute and the action attribute into a trained hierarchical reconstruction network to obtain a three-dimensional human body model reconstruction result.
After the network training is completed through the step 12, the encoder network can be abandoned, and the reconstructed network is directly used as a decoupled low-dimensional human body parameterized model. The model reconstructs a complete human body grid from two groups of decoupling parameters respectively representing identity and posture. Specifically, the trained hierarchical reconstruction network may reconstruct the identity attribute and the posture attribute of the input data according to the method introduced in the training phase
Figure BDA0002446089520000081
And/or>
Figure BDA0002446089520000082
Both are represented by ACAP deformation, and the corresponding human body grid and the neutral human body grid can be obtained through a simple conversion. The conversion method referred to herein can be referred to in the prior art, and is not described in detail.
Compared with the traditional human body parameterized model representation method, the scheme of the embodiment of the invention mainly has the following advantages:
1) The characteristics of input and output are represented by nonlinear deformation, the traditional Euclidean coordinates are replaced, the precision is higher, and the deformation with large scale is more robust.
2) The reconstruction accuracy of the model is further improved by utilizing the strong fitting capability of the neural network and combining the framework design of human body deformation prior.
3) The obtained posture hidden layer representation has certain semantics by utilizing the learning of a large number of human body models with various postures, namely embedding reasonable actions of the human body into a low-dimensional space. However, the posture parameters of the conventional model often have no semantics and may generate unreasonable human body actions.
Through the description of the above embodiments, it is clear to those skilled in the art that the above embodiments may be implemented by software, or by software plus a necessary general hardware platform. With this understanding, the technical solutions of the embodiments can be embodied in the form of a software product, which can be stored in a non-volatile storage medium (which can be a CD-ROM, a usb disk, a removable hard disk, etc.), and includes several instructions for enabling a computer device (which can be a personal computer, a server, or a network device, etc.) to execute the methods according to the embodiments of the present invention.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (3)

1.一种三维人体形状表示方法,其特征在于,包括:1. A method for representing a three-dimensional human body shape, characterized in that it includes: 对收集的人体网格数据集进行预处理,并基于标准姿态进行变形,再计算ACAP变形表示与描述刚性块变形的人体形变特征,构成的训练数据集;The collected human mesh dataset is preprocessed and deformed based on standard poses. Then, the ACAP deformation representation and the human deformation features describing the deformation of rigid blocks are calculated to form the training dataset. 构建编码器网络和阶层式重建网络构成端到端的网络结构,并利用训练数据集对网络结构进行训练;训练过程中,通过编码器网络对ACAP变形表示进行编码,获得身份属性与动作属性,重建网络则利用身份属性与动作属性重建三维人体模型,并利用重建结果与输入的训练数据之间的误差训练编码器网络和阶层式重建网络;An encoder network and a hierarchical reconstruction network are constructed to form an end-to-end network structure, and the network structure is trained using a training dataset. During the training process, the encoder network encodes the ACAP deformation representation to obtain identity attributes and action attributes. The reconstruction network then uses the identity attributes and action attributes to reconstruct a 3D human body model, and uses the error between the reconstruction result and the input training data to train the encoder network and the hierarchical reconstruction network. 训练完毕后,将身份属性与动作属性输入至训练好的阶层式重建网络中,得到三维人体模型重建结果;After training, the identity attributes and action attributes are input into the trained hierarchical reconstruction network to obtain the 3D human body model reconstruction result. 其中,所述对收集的人体网格数据集进行预处理,获得ACAP变形表示与人体形变特征构成的训练数据集包括:The preprocessing of the collected human mesh dataset to obtain the training dataset consisting of ACAP deformation representation and human deformation features includes: 首先,对收集人体网格数据集进行标准化处理,得到统一拓扑的人体网格数据,并通过定义的标准姿态,变形得到每个人体网格数据对应的中性人体网格;First, the collected human mesh dataset is standardized to obtain human mesh data with a unified topology. Then, the neutral human mesh corresponding to each human mesh data is obtained by deformation through the defined standard pose. 然后,计算每个人体网格数据及对应的中性人体网格的ACAP变形表示,记为f和fs;并且,根据人体基于骨架的铰链式变形特点,分别计算人体网格和其对应的中性人体网格的描述刚性块变形的人体形变特征g与gs,从而得到一组训练数据{f,fs,g,gs},对于收集的数据做相应处理得到训练数据集;Then, calculate the ACAP deformation representation of each human body mesh and the corresponding neutral human body mesh, denoted as f and fs ; and, based on the hinge-like deformation characteristics of the human body based on the skeleton, calculate the human body deformation features g and gs describing the rigid block deformation of the human body mesh and its corresponding neutral human body mesh, thereby obtaining a set of training data {f, fs , g, gs }. The collected data is then processed accordingly to obtain the training dataset. 计算描述刚性块变形的人体形变特征g的公式为:The formula for calculating the human body deformation characteristic g, which describes the deformation of a rigid block, is:
Figure FDA0004059770490000011
Figure FDA0004059770490000011
其中,vk是第k个刚性块的顶点集合,qi'、pi'分别表示变形基准网格和计算网格的第k个刚性块上的第i'个顶点;
Figure FDA0004059770490000012
Figure FDA0004059770490000013
分别是变形基准网格和计算网格上第k个刚性块的平均;通过对刚性块的放射变形
Figure FDA0004059770490000014
进行极分解和参数化,得到人体形变特征;
Where v_k is the set of vertices of the k-th rigid block, and q_i' and p_i' represent the i'-th vertex on the k-th rigid block of the deformable reference mesh and the computational mesh, respectively;
Figure FDA0004059770490000012
and
Figure FDA0004059770490000013
These are the average values of the k-th rigid block on the deformation reference mesh and the computational mesh, respectively; through radial deformation of the rigid block...
Figure FDA0004059770490000014
Polar decomposition and parameterization are performed to obtain the human body deformation characteristics;
将每个人体网格数据及对应的中性人体网格作为计算网格带入上述式子,得到形变特征g和gsSubstituting each human body mesh data and its corresponding neutral human body mesh as the computational mesh into the above formula yields the deformation characteristics g and gs ; 利用训练数据集对网络结构进行训练时,采用重构和输入特征之间的L1模损失、以及对隐层变量的分布正则化损失作为损失函数;When training the network structure using the training dataset, the L1 modulus loss between the reconstruction and input features, as well as the distribution regularization loss of the hidden layer variables, are used as loss functions. 所述阶层式重建网络为基于人体几何先验的阶层式重建网络,包括两部分,第一部分阶层式重建三维人体模型的主要部分,第二部分重建差异部分,将重建主要部分与重建差异部分相加即为重建结果;重建网络描述为:The hierarchical reconstruction network is a hierarchical reconstruction network based on human geometric priors, consisting of two parts: the first part reconstructs the main parts of the 3D human model hierarchically, and the second part reconstructs the differential parts. The reconstruction result is obtained by adding the reconstructed main parts and the reconstructed differential parts. The reconstruction network is described as follows:
Figure FDA0004059770490000021
Figure FDA0004059770490000021
Figure FDA0004059770490000022
Figure FDA0004059770490000022
其中,
Figure FDA0004059770490000023
表示利用身份属性es与姿态属性ep的重建结果,
Figure FDA0004059770490000024
表示利用身份属性es的重建结果;
Figure FDA0004059770490000025
表示重建结果
Figure FDA0004059770490000026
中的主要部分b,
Figure FDA0004059770490000027
表示蒙皮层;
Figure FDA0004059770490000028
表示重建结果
Figure FDA0004059770490000029
中的差异部分d;
Figure FDA00040597704900000210
表示重建结果
Figure FDA00040597704900000211
中的主要部分bs
Figure FDA00040597704900000212
表示蒙皮层;
Figure FDA00040597704900000213
表示重建结果
Figure FDA00040597704900000214
中的差异部分ds
in,
Figure FDA0004059770490000023
This represents the reconstruction result using the identity attribute es and the pose attribute ep .
Figure FDA0004059770490000024
This represents the reconstruction result using the identity attribute e s ;
Figure FDA0004059770490000025
Indicates the reconstruction result
Figure FDA0004059770490000026
The main part b,
Figure FDA0004059770490000027
Indicates the skin layer;
Figure FDA0004059770490000028
Indicates the reconstruction result
Figure FDA0004059770490000029
The difference in part d;
Figure FDA00040597704900000210
Indicates the reconstruction result
Figure FDA00040597704900000211
The main part of it is b s .
Figure FDA00040597704900000212
Indicates the skin layer;
Figure FDA00040597704900000213
Indicates the reconstruction result
Figure FDA00040597704900000214
The difference in ds ;
重建主要部分b与bs时,利用下述式子来重建人体形变特征
Figure FDA00040597704900000215
Figure FDA00040597704900000216
When reconstructing the main parts b and bs , the following formula is used to reconstruct the human body deformation features.
Figure FDA00040597704900000215
and
Figure FDA00040597704900000216
Figure FDA00040597704900000217
Figure FDA00040597704900000217
Figure FDA00040597704900000218
Figure FDA00040597704900000218
其中,
Figure FDA00040597704900000219
是独立的映射变换;
in,
Figure FDA00040597704900000219
It is an independent mapping transformation;
之后,利用蒙皮层
Figure FDA00040597704900000220
重建主要部分b与bs
Then, using the skin layer
Figure FDA00040597704900000220
Reconstructing the main parts b and b s :
Figure FDA00040597704900000221
Figure FDA00040597704900000221
Figure FDA00040597704900000222
Figure FDA00040597704900000222
其中,蒙皮层
Figure FDA00040597704900000223
为矩阵形式,
Figure FDA00040597704900000224
表示
Figure FDA00040597704900000225
中第x行第y列元素;Y为刚性块的数目;
Among them, the skin layer
Figure FDA00040597704900000223
In matrix form,
Figure FDA00040597704900000224
express
Figure FDA00040597704900000225
The element in the x-th row and y-th column; Y is the number of rigid blocks;
建差异部分d与ds表示为:The difference between d and ds is represented as follows:
Figure FDA00040597704900000226
Figure FDA00040597704900000226
Figure FDA00040597704900000227
Figure FDA00040597704900000227
其中,
Figure FDA00040597704900000228
独立的映射变换。
in,
Figure FDA00040597704900000228
Independent mapping transformations.
2.根据权利要求1所述的一种三维人体形状表示方法,其特征在于,所述对收集人体网格数据集进行标准化处理包括:2. The method for representing a three-dimensional human body shape according to claim 1, characterized in that the standardization process of the collected human body mesh dataset includes: 对于收集到的不同来源的人体网格数据集,以某一来源数据为标准拓扑;利用标准拓扑中的动作网格,计算相应的ACAP变形表示,得到一组人体动作的先验变形表示基C,进而使用一组参数w来恢复一个人体的变形表示;使用Cw的线性空间作为人体变形的先验空间,然后优化一组标准拓扑的顶点坐标p和刚性变换参数,即旋转参数R和平移参数t,将不同来源的人体网格数据集进行标准化。For human body mesh datasets collected from different sources, a standard topology is used based on data from one source. Using the action mesh in the standard topology, the corresponding ACAP deformation representation is calculated to obtain a set of prior deformation representation basis C for human actions. Then, a set of parameters w is used to recover the deformation representation of a human body. The linear space of Cw is used as the prior space for human body deformation. Then, the vertex coordinates p and rigid transformation parameters of a set of standard topologies, namely rotation parameters R and translation parameters t, are optimized to standardize human body mesh datasets from different sources. 3.根据权利要求2所述的一种三维人体形状表示方法,其特征在于,标准化处理是求解以下优化问题:3. The method for representing a three-dimensional human body shape according to claim 2, characterized in that the standardization process involves solving the following optimization problem:
Figure FDA0004059770490000031
Figure FDA0004059770490000031
其中,λ1、λ2、λ3均为设定的权重;||w||1是对参数w的稀疏正则化限制;Where λ1 , λ2 , and λ3 are all set weights; ||w|| 1 is a sparse regularization constraint on the parameter w; Eprior是由先验变形表示基C所决定的人体形变先验项,表示为:E prior is the a priori term of human body deformation determined by the a priori deformation representation basis C, expressed as:
Figure FDA0004059770490000032
Figure FDA0004059770490000032
其中,Ti(w)是标准拓扑下第i个顶点的一邻域的先验变形,qi是标准拓扑下第i个顶点在变形基准网格中的位置,所述变形基准网格预先选定;相对于变形基准网格而言,标准拓扑下待优化的网格,第i个顶点的位置为pi;N(i)指的是标准拓扑下第i个顶点的邻域顶点指标集合,j是指N(i)中的第j个顶点,相应顶点在变形基准网格与待优化的网格中的位置分别表示为qj、pj;cij是变形基准网格上计算的一个边权重值;Where Ti (w) is the prior deformation of a neighborhood of the i-th vertex under the standard topology, q <sub>i </sub> is the position of the i-th vertex in the deformed reference grid under the standard topology, and the deformed reference grid is pre-selected; relative to the deformed reference grid, the position of the i-th vertex in the grid to be optimized under the standard topology is p <sub>i </sub>; N(i) refers to the set of neighboring vertex indices of the i-th vertex under the standard topology, j refers to the j-th vertex in N(i), and the positions of the corresponding vertices in the deformed reference grid and the grid to be optimized are represented as q <sub>j </sub> and p<sub>j </sub> , respectively; c <sub>ij</sub> is an edge weight value calculated on the deformed reference grid; Eicp是和目标网格的最近邻的点到平面的注册能量项,表示为:E <sub>icp</sub> is the registered energy term from the nearest neighbor point of the target grid to the plane, expressed as:
Figure FDA0004059770490000033
Figure FDA0004059770490000033
其中,D为动态计算中选出的对应点在标准拓扑下的指标集合;vl(i)是和pi对应的目标网格上的对应点,
Figure FDA0004059770490000034
表示点vl(i)的法向;目标网格是指待优化的不同来源的人体网格数据集;
Where D is the set of indices for the corresponding points selected in the dynamic calculation under the standard topology; v <sub>l</sub>(i) is the corresponding point on the target mesh corresponding to p<sub>i</sub> .
Figure FDA0004059770490000034
The normal vector of point v l(i) is represented; the target mesh refers to the human body mesh dataset from different sources to be optimized.
Elan是标准拓扑和目标网格的一组手动标注稀疏对应点的L2损失能量项,表示为:E <sub>lan</sub> is a set of manually labeled sparse corresponding points in the standard topology and the target mesh, expressed as:
Figure FDA0004059770490000035
Figure FDA0004059770490000035
其中,L为预先在标准拓扑中选定的对应点集合。Where L is the set of corresponding points pre-selected in the standard topology.
CN202010279674.5A 2020-04-10 2020-04-10 Three-dimensional human body shape representation method Active CN111476901B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010279674.5A CN111476901B (en) 2020-04-10 2020-04-10 Three-dimensional human body shape representation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010279674.5A CN111476901B (en) 2020-04-10 2020-04-10 Three-dimensional human body shape representation method

Publications (2)

Publication Number Publication Date
CN111476901A CN111476901A (en) 2020-07-31
CN111476901B true CN111476901B (en) 2023-04-07

Family

ID=71751818

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010279674.5A Active CN111476901B (en) 2020-04-10 2020-04-10 Three-dimensional human body shape representation method

Country Status (1)

Country Link
CN (1) CN111476901B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113743283A (en) * 2021-08-30 2021-12-03 成都数字天空科技有限公司 Mesh topology structure acquisition method and device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108805977A (en) * 2018-06-06 2018-11-13 浙江大学 A kind of face three-dimensional rebuilding method based on end-to-end convolutional neural networks
CN110363833A (en) * 2019-06-11 2019-10-22 华南理工大学 A Parametric Representation Method of Complete Human Motion Based on Local Sparse Representation
CN110428493A (en) * 2019-07-12 2019-11-08 清华大学 Method and system for 3D reconstruction of human body from single image based on mesh deformation
CN110619681A (en) * 2019-07-05 2019-12-27 杭州同绘科技有限公司 Human body geometric reconstruction method based on Euler field deformation constraint
CN110827342A (en) * 2019-10-21 2020-02-21 中国科学院自动化研究所 Three-dimensional human body model reconstruction method, storage device, and control device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101194604B1 (en) * 2008-12-22 2012-10-25 한국전자통신연구원 Method and apparatus for shape deforming surface based 3d human model
WO2016207311A1 (en) * 2015-06-24 2016-12-29 MAX-PLANCK-Gesellschaft zur Förderung der Wissenschaften e.V. Skinned multi-person linear model
US9898858B2 (en) * 2016-05-18 2018-02-20 Siemens Healthcare Gmbh Human body representation with non-rigid parts in an imaging system
EP3579196A1 (en) * 2018-06-05 2019-12-11 Cristian Sminchisescu Human clothing transfer method, system and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108805977A (en) * 2018-06-06 2018-11-13 浙江大学 A kind of face three-dimensional rebuilding method based on end-to-end convolutional neural networks
CN110363833A (en) * 2019-06-11 2019-10-22 华南理工大学 A Parametric Representation Method of Complete Human Motion Based on Local Sparse Representation
CN110619681A (en) * 2019-07-05 2019-12-27 杭州同绘科技有限公司 Human body geometric reconstruction method based on Euler field deformation constraint
CN110428493A (en) * 2019-07-12 2019-11-08 清华大学 Method and system for 3D reconstruction of human body from single image based on mesh deformation
CN110827342A (en) * 2019-10-21 2020-02-21 中国科学院自动化研究所 Three-dimensional human body model reconstruction method, storage device, and control device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
袁仁奇 ; 徐增波 ; .基于Kinect的人体模板化三维模型拟合重建.丝绸.2017,(10),全文. *
高丽伟 ; 张元 ; 韩燮 ; .基于Kinect的三维人体建模研究.中国科技论文.2018,(02),全文. *

Also Published As

Publication number Publication date
CN111476901A (en) 2020-07-31

Similar Documents

Publication Publication Date Title
Deng et al. A survey of non‐rigid 3D registration
Zhu et al. AdaFit: Rethinking learning-based normal estimation on point clouds
CN117974693B (en) Image segmentation method, device, computer equipment and storage medium
CN113111861A (en) Face texture feature extraction method, 3D face reconstruction method, device and storage medium
Fan et al. Dual neural networks coupling data regression with explicit priors for monocular 3D face reconstruction
CN111274525A (en) A Tensor Data Recovery Method Based on Multilinear Augmented Lagrange Multiplier Method
CN118470222B (en) Medical ultrasonic image three-dimensional reconstruction method and system based on SDF diffusion
CN110909778A (en) Image semantic feature matching method based on geometric consistency
CN119130785B (en) Image generation method and device based on thermal kernel theory on Riemann manifold
CN114550292A (en) High-physical-reality human body motion capture method based on neural motion control
CN105427360A (en) Error-controllable CAGE sequence representation algorithm for dynamic grid
Sun et al. Cgof++: Controllable 3d face synthesis with conditional generative occupancy fields
CN115049764A (en) Training method, device, equipment and medium for SMPL parameter prediction model
CA3177593A1 (en) Transformer-based shape models
Chen et al. Neuraltps: Learning signed distance functions without priors from single sparse point clouds
CN111027433A (en) Multiple style face characteristic point detection method based on convolutional neural network
Gan et al. Fine-grained multi-view hand reconstruction using inverse rendering
CN111476901B (en) Three-dimensional human body shape representation method
Chan et al. Creation of novel soft robot designs using generative ai
CN106484511A (en) A kind of spectrum attitude moving method
Zhang et al. Norest-net: normal estimation neural network for 3-d noisy point clouds
Zheng et al. Deformation representation based convolutional mesh autoencoder for 3D hand generation
Zhu et al. CED-Net: contextual encoder–decoder network for 3D face reconstruction
CN110544309A (en) Real-time sparse editing method and system based on large-scale grid model representation
CN113658323A (en) Method for reconstructing network model in three dimensions based on graph convolution and computer equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant