There is no abstract available for this article.
This work was partly supported by National Natural Science Foundation of China (Grant Nos. U1736217, 61502023).
Videos and other supplemental documents.
[1] Mousavian A, Anguelov D, Flynn J, et al. 3D bounding box estimation using deep learning and geometry. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2017. Google Scholar
[2] Fidler S, Dickinson S J, Urtasun R. 3D object detection and viewpoint estimation with a deformable 3D cuboid model. In: Proceedings of the 26th Annual Conference on Neural Information Processing Systems, 2012. Google Scholar
[3] Chen X Z, Kundu K, Zhang Z Y. Monocular 3D object detection for autonomous driving. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2016. Google Scholar
[4] Tulsiani S, Malik J. Viewpoints and keypoints. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2015. Google Scholar
[5] Chang A X, Funkhouser T A, Guibas L J. Shapenet: an information-rich 3D model repository. 2015,. arXiv Google Scholar
[6] Su H, Qi C R, Li Y Y, et al. Render for CNN: viewpoint estimation in images using cnns trained with rendered 3D model views. In: Proceedings of IEEE International Conference on Computer Vision, 2015. Google Scholar
[7] Xiang Y, Kim W, Chen W. Objectnet3D: a large scale database for 3D object recognition. In: Proceedings of European Conference on Computer Vision, 2016. Google Scholar
[8] Xiang Y, Mottaghi R, Savarese S. Beyond PASCAL: a benchmark for 3D object detection in the wild. In: Proceedings of IEEE Winter Conference on Applications of Computer Vision, 2014. Google Scholar
Figure 1
(Color online) (a) Proposed network architecture; (b) three parts of accuracy on 5 categories.