Chairs
Test chairs from ShapeNet. This is a frame-by-frame keypoint prediction on each animation frame. No temporal information is used.We show how the network is able to utilize the same keypoints across object instances and consistently predict keypoints across viewing angles, even when parts are occluded such as the back legs.
Planes
Test planes from ShapeNet. Notice failure cases in the bottom row where the orientation network fails to predict the correct orientation of a few planes with very unusual wing shapes.Cars
Test cars from ShapeNet. Notice failure cases on the bottom row: The second car is mostly black which is the same as the background. The third car looks very symmetric that the predicted orientation is sometimes reversed.Ablation Study
We present an ablation study for the primary losses as well as how their weights affect the results.Baseline (12 keypoints) |
No multiview consistency Notice how points are no longer stationary. |
No relative pose loss Points are concentrated at the center, but are still separated from separation loss |
More noise in relative pose loss This encourages points to be more robust to estimation error by spreading them out further. |
No silhouette loss This causes the points to lie outside the car. |