no code implementations • 5 Jul 2024 • Long Teng, Wei Feng, Menglong Zhu, Xinchao Li
Thus it is difficult to perform supervised learning for motion tracking.
no code implementations • 8 Feb 2022 • Guhong Nie, Lirui Xiao, Menglong Zhu, Dongliang Chu, Yue Shen, Peng Li, Kang Yang, Li Du, Bo Chen
For binary neural networks (BNNs) to become the mainstream on-device computer vision algorithm, they must achieve a superior speed-vs-accuracy tradeoff than 8-bit quantization and establish a similar degree of general applicability in vision tasks.
no code implementations • 4 Jan 2021 • Keren Ye, Adriana Kovashka, Mark Sandler, Menglong Zhu, Andrew Howard, Marco Fornoni
In this paper we address the question: can task-specific detectors be trained and represented as a shared set of weights, plus a very small set of additional weights for each task?
2 code implementations • 15 Jul 2019 • Deng-Ping Fan, Zheng Lin, Jia-Xing Zhao, Yun Liu, Zhao Zhang, Qibin Hou, Menglong Zhu, Ming-Ming Cheng
The use of RGB-D information for salient object detection has been extensively explored in recent years.
Ranked #4 on RGB-D Salient Object Detection on RGBD135
2 code implementations • 25 Mar 2019 • Mason Liu, Menglong Zhu, Marie White, Yinxiao Li, Dmitry Kalenichenko
Models and examples built with TensorFlow
Ranked #32 on Video Object Detection on ImageNet VID (using extra training data)
3 code implementations • CVPR 2019 • Marvin Teichmann, Andre Araujo, Menglong Zhu, Jack Sim
Then, we demonstrate how a trained landmark detector, using our new dataset, can be leveraged to index image regions and improve retrieval accuracy while being much more efficient than existing regional methods.
154 code implementations • CVPR 2018 • Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, Liang-Chieh Chen
In this paper we describe a new mobile architecture, MobileNetV2, that improves the state of the art performance of mobile models on multiple tasks and benchmarks as well as across a spectrum of different model sizes.
Ranked #7 on Retinal OCT Disease Classification on OCT2017
20 code implementations • CVPR 2018 • Benoit Jacob, Skirmantas Kligys, Bo Chen, Menglong Zhu, Matthew Tang, Andrew Howard, Hartwig Adam, Dmitry Kalenichenko
The rising popularity of intelligent mobile devices and the daunting computational cost of deep learning-based models call for efficient and accurate on-device inference schemes.
3 code implementations • CVPR 2018 • Mason Liu, Menglong Zhu
This paper introduces an online model for object detection in videos designed to run in real-time on low-powered mobile and embedded devices.
1 code implementation • ICCV 2017 • Xintong Han, Zuxuan Wu, Phoenix X. Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, Larry S. Davis
This paper proposes an automatic spatially-aware concept discovery approach using weakly labeled image-text data from shopping websites.
156 code implementations • 17 Apr 2017 • Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry Kalenichenko, Weijun Wang, Tobias Weyand, Marco Andreetto, Hartwig Adam
We present a class of efficient models called MobileNets for mobile and embedded vision applications.
Ranked #245 on Object Detection on COCO test-dev
1 code implementation • 9 Jan 2017 • Xiaowei Zhou, Menglong Zhu, Georgios Pavlakos, Spyridon Leonardos, Kostantinos G. Derpanis, Kostas Daniilidis
Recovering 3D full-body human pose is a challenging problem with many applications.
14 code implementations • CVPR 2017 • Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang song, Sergio Guadarrama, Kevin Murphy
On the opposite end in which accuracy is critical, we present a detector that achieves state-of-the-art performance measured on the COCO detection task.
Ranked #224 on Object Detection on COCO test-dev (using extra training data)
no code implementations • ICCV 2015 • Menglong Zhu, Xiaowei Zhou, Kostas Daniilidis
We introduce a new approach for estimating a fine grained 3D shape and continuous pose of an object from a single image.
1 code implementation • CVPR 2016 • Xiaowei Zhou, Menglong Zhu, Spyridon Leonardos, Kosta Derpanis, Kostas Daniilidis
Here, two cases are considered: (i) the image locations of the human joints are provided and (ii) the image locations of joints are unknown.
Ranked #38 on Monocular 3D Human Pose Estimation on Human3.6M
no code implementations • 14 Sep 2015 • Xiaowei Zhou, Menglong Zhu, Spyridon Leonardos, Kostas Daniilidis
We investigate the problem of estimating the 3D shape of an object defined by a set of 3D landmarks, given their 2D correspondences in a single image.
Ranked #133 on 3D Human Pose Estimation on Human3.6M (PA-MPJPE metric)
1 code implementation • ICCV 2015 • Xiaowei Zhou, Menglong Zhu, Kostas Daniilidis
In this paper we propose a global optimization-based approach to jointly matching a set of images.
no code implementations • 1 Feb 2015 • Menglong Zhu, Xiaowei Zhou, Kostas Daniilidis
We introduce a new approach for estimating the 3D pose and the 3D shape of an object from a single image.
no code implementations • 1 Apr 2014 • Menglong Zhu, Nikolay Atanasov, George J. Pappas, Kostas Daniilidis
This paper presents an active approach for part-based object detection, which optimizes the order of part filter evaluations and the time at which to stop and make a prediction.