1 code implementation • 30 Jul 2024 • Qijun Gan, Zijie Zhou, Jianke Zhu
Hand avatars play a pivotal role in a wide array of digital interfaces, enhancing user immersion and facilitating natural interaction within virtual environments.
1 code implementation • 8 Jul 2024 • Qijun Gan, Wentong Li, Jinwei Ren, Jianke Zhu
Reconstructing high-fidelity hand models with intricate textures plays a crucial role in enhancing human-object interaction and advancing real-world applications.
1 code implementation • 2 Jul 2024 • Wentong Li, Yuqian Yuan, Jian Liu, Dongqi Tang, Song Wang, Jie Qin, Jianke Zhu, Lei Zhang
However, the visual tokens are redundant and can be considerably increased when dealing with high-resolution images, impairing the efficiency of MLLMs significantly.
no code implementations • 13 Jun 2024 • Qijun Gan, Song Wang, Shengtao Wu, Jianke Zhu
Although key presses can be directly derived from sheet music, the transitional movements among key presses require more extensive guidance in piano performance.
1 code implementation • 24 May 2024 • Song Wang, Jiawei Yu, Wentong Li, Hao Shi, Kailun Yang, Junbo Chen, Jianke Zhu
Semantic scene completion aims to infer the 3D geometric structures with semantic classes from camera or LiDAR, which provide essential occupancy information in autonomous driving.
no code implementations • 27 Apr 2024 • Shaofan Liu, Junbo Chen, Jianke Zhu
Incremental scene reconstruction is essential to navigation in robotics.
1 code implementation • CVPR 2024 • Song Wang, Jiawei Yu, Wentong Li, Wenyu Liu, Xiaolu Liu, Junbo Chen, Jianke Zhu
Furthermore, the voxels in the boundary region are more challenging to differentiate than those in the interior.
1 code implementation • CVPR 2024 • Xiaolu Liu, Song Wang, Wentong Li, Ruizi Yang, Junbo Chen, Jianke Zhu
Currently, high-definition (HD) map construction is trending towards lightweight online generation, which aims to preserve timely and reliable road scene information.
no code implementations • 29 Mar 2024 • Zhuopeng Li, Yilin Zhang, Chenming Wu, Jianke Zhu, Liangjun Zhang
The rapid growth of 3D Gaussian Splatting (3DGS) has revolutionized neural rendering, enabling real-time production of high-quality renderings.
1 code implementation • 13 Mar 2024 • Hao Shi, Song Wang, Jiaming Zhang, Xiaoting Yin, Zhongdao Wang, Guangming Wang, Jianke Zhu, Kailun Yang, Kaiwei Wang
Vision-based occupancy prediction, also known as 3D Semantic Scene Completion (SSC), presents a significant challenge in computer vision.
1 code implementation • 14 Feb 2024 • Xin Zheng, Jianke Zhu
Nowadays, sensor suites are equipped with redundant LiDARs and IMUs to mitigate the risks associated with sensor failure.
1 code implementation • CVPR 2024 • Yuqian Yuan, Wentong Li, Jian Liu, Dongqi Tang, Xinjie Luo, Chi Qin, Lei Zhang, Jianke Zhu
In this paper, we propose Osprey, a mask-text instruction tuning approach, to extend MLLMs by incorporating fine-grained mask regions into language instruction, aiming at achieving pixel-wise visual understanding.
no code implementations • 28 Nov 2023 • Zhuopeng Li, Chenming Wu, Liangjun Zhang, Jianke Zhu
Despite the recent success of Neural Radiance Field (NeRF), it is still challenging to render large-scale driving scenes with long trajectories, particularly when the rendering quality and efficiency are in high demand.
1 code implementation • 20 Nov 2023 • Lixiang Lin, Jianke Zhu
To enable realistic experiences in AR/VR and digital entertainment, we present the first point-based human avatar model that embodies the entire expressive range of digital humans.
1 code implementation • NeurIPS 2023 • Wentong Li, Yuqian Yuan, Song Wang, Wenyu Liu, Dongqi Tang, Jian Liu, Jianke Zhu, Lei Zhang
In this work, we formulate affinity modeling as an affinity propagation process, and propose local and global pairwise affinity terms to generate accurate soft pseudo labels.
no code implementations • journal 2023 • Hui Wei, Hanxun Yu, Kewei Zhang, Zhixiang Wang, Jianke Zhu, Zheng Wang
Next, we embed these Moiré patterns as a backdoor trigger into digital images and use this dataset to train a backdoored detector.
1 code implementation • 25 Sep 2023 • Xin Zheng, Jianke Zhu
Therefore, our proposed Traj-LO approach tries to recover the spatial-temporal consistent movement of LiDAR by tightly coupling the geometric information from LiDAR points and kinematic constraints from trajectory smoothness.
1 code implementation • ICCV 2023 • Wentong Li, Yuqian Yuan, Song Wang, Jianke Zhu, Jianshu Li, Jian Liu, Lei Zhang
Weakly-supervised image segmentation has recently attracted increasing research attention, aiming to avoid expensive pixel-wise labeling.
1 code implementation • 12 Jul 2023 • Jinwei Ren, Jianke Zhu
Most of the existing methods extract features from color images to estimate the root-aligned hand meshes, which neglect the crucial depth and scale information in the real world.
no code implementations • 29 May 2023 • Yisu Zhang, Jianke Zhu, Lixiang Lin
Despite the promising results of multi-view reconstruction, recent neural rendering-based methods, such as implicit surface rendering (IDR) and volume rendering (NeuS), not only incur a heavy computational burden in training but also have difficulty disentangling geometry and appearance.
no code implementations • 12 May 2023 • Weikun Zhang, Jianke Zhu
With the growing popularity of neural rendering, there has been an increasing number of neural implicit multi-view reconstruction methods.
no code implementations • 26 Apr 2023 • Yisu Zhang, Jianke Zhu, Lixiang Lin
Compared to conventional deep learning-based multi-view stereo methods, our proposed RA-MVSNet approach obtains more complete reconstruction results by taking advantage of signed distance supervision.
Ranked #5 on Point Clouds on Tanks and Temples
1 code implementation • CVPR 2023 • Song Wang, Wentong Li, Wenyu Liu, Xiaolu Liu, Jianke Zhu
To mitigate the defects caused by lacking semantic cues in LiDAR data, we present an online Camera-to-LiDAR distillation scheme to facilitate the semantic learning from image to point cloud.
no code implementations • 3 Feb 2023 • Chuan Zhang, Daoxin Zhang, Ruixiu Zhang, Jiawei Li, Jianke Zhu
To tackle this problem, we propose a multimodal relevance estimation network to capture the relevant semantics among modalities in multimodal emotions.
no code implementations • CVPR 2023 • Yisu Zhang, Jianke Zhu, Lixiang Lin
This is essential for textureless regions and surface boundaries that cannot be properly reconstructed. To address this issue, we suggest taking advantage of point-to-surface distance so that the model is able to perceive a wider range of surfaces.
2 code implementations • 3 Dec 2022 • Wentong Li, Wenyu Liu, Jianke Zhu, Miaomiao Cui, Risheng Yu, Xiansheng Hua, Lei Zhang
In contrast to fully supervised methods using pixel-wise mask labels, box-supervised instance segmentation takes advantage of simple box annotations, which has recently attracted increasing research attention.
no code implementations • 26 Nov 2022 • Lixiang Lin, Songyou Peng, Qijun Gan, Jianke Zhu
We propose an approach for optimizing high-quality clothed human body shapes in minutes, using multi-view posed images.
1 code implementation • 19 Jul 2022 • Wentong Li, Wenyu Liu, Jianke Zhu, Miaomiao Cui, Xiansheng Hua, Lei Zhang
A simple mask supervised SOLOv2 model is adapted to predict the instance-aware mask map as the level set for each instance.
2 code implementations • 4 Jul 2022 • Wenyu Liu, Wentong Li, Jianke Zhu, Miaomiao Cui, Xuansong Xie, Lei Zhang
With DIAL-Filters, we design both unsupervised and supervised frameworks for nighttime driving-scene segmentation, which can be trained in an end-to-end manner.
no code implementations • 23 Jun 2022 • Xinrui Zhan, Yang Li, Wenyu Liu, Jianke Zhu
In this paper, we propose Warped Convolution Networks (WCN) to effectively learn and represent the homography by SL(3) group and sl(3) algebra with group convolution.
1 code implementation • 17 Jun 2022 • Xin Zheng, Jianke Zhu
Prism-based LiDARs, which are more compact and cheaper than conventional mechanical multi-line spinning LiDARs, have recently become increasingly popular in robotics.
1 code implementation • 25 May 2022 • Lixiang Lin, Jianke Zhu, Yisu Zhang
Although they have achieved promising results on shape and color recovery through self-supervision, multi-layer perceptron-based methods usually suffer from heavy computational cost in learning the deep implicit surface representation.
1 code implementation • 11 May 2022 • Zhuopeng Li, Lu Li, Zeyu Ma, Ping Zhang, Junbo Chen, Jianke Zhu
In this paper, a large-scale neural rendering method is proposed to synthesize the autonomous driving scene (READ), which makes it possible to synthesize large-scale driving scenarios on a PC through a variety of sampling schemes.
Ranked #1 on Novel View Synthesis on KITTI
1 code implementation • 18 Apr 2022 • Jinwei Ren, Jianke Zhu, Jialiang Zhang
In this paper, we consider the challenging task of simultaneously locating and recovering multiple hands from a single 2D image.
1 code implementation • 27 Feb 2022 • Song Wang, Jianke Zhu, Ruixiang Zhang
The LiDAR sensor is essential to the perception system in autonomous vehicles and intelligent robots.
Ranked #18 on 3D Semantic Segmentation on SemanticKITTI
1 code implementation • 15 Dec 2021 • Wenyu Liu, Gaofeng Ren, Runsheng Yu, Shi Guo, Jianke Zhu, Lei Zhang
Though deep learning-based object detection methods have achieved promising results on conventional datasets, it is still challenging to locate objects in low-quality images captured in adverse weather conditions.
3 code implementations • 15 Dec 2021 • Xinrui Zhan, Yueran Liu, Jianke Zhu, Yang Li
Planar object tracking plays an important role in AI applications, such as robotics, visual servoing, and visual SLAM.
no code implementations • 7 Dec 2021 • Wentong Li, Yijie Chen, Wenyu Liu, Jianke Zhu
Instead of learning the pairwise affinity, the level set method with the carefully designed energy functions treats the object segmentation as curve evolution, which is able to accurately recover the object's boundaries and prevent the interference from the indistinguishable background and similar objects.
no code implementations • 16 Jun 2021 • Shuyi Qu, Zhenxing Niu, Kaizhu Huang, Jianke Zhu, Matan Protter, Gadi Zimerman, Yinghui Xu
Recent deep generative models have achieved promising performance in image inpainting.
1 code implementation • 11 Jun 2021 • Lixiang Lin, Jianke Zhu
It is challenging to directly estimate the human geometry from a single image due to the high diversity and complexity of body shapes with the various clothing styles.
no code implementations • 2 Jun 2021 • Hantang Liu, Wentong Li, Jianke Zhu
Although they achieve promising results on semantic parsing, deep learning methods cannot directly make use of architectural rules, which play an important role for man-made structures.
2 code implementations • CVPR 2022 • Wentong Li, Yijie Chen, Kaixuan Hu, Jianke Zhu
In contrast to generic objects, aerial targets are often non-axis-aligned, with arbitrary orientations and cluttered surroundings.
Ranked #7 on Oriented Object Detection on DOTA 1.0
no code implementations • 22 Apr 2021 • Xin Zheng, Jianke Zhu
LiDAR odometry plays an important role in self-localization and mapping for autonomous navigation, which is usually treated as a scan registration problem.
1 code implementation • 11 Jan 2021 • Tun Zhu, Daoxin Zhang, Yao Hu, Tianran Wang, XiaoLong Jiang, Jianke Zhu, Jiawei Li
Alongside the prevalence of mobile videos, the general public leans towards consuming vertical videos on hand-held devices.
1 code implementation • 6 Jan 2021 • Jialiang Zhang, Lixiang Lin, Jianke Zhu, Steven C. H. Hoi
3D face reconstruction plays a very important role in many real-world multimedia applications, including digital entertainment, social media, affection analysis, and person identification.
1 code implementation • 16 Aug 2020 • Shengyu Zhang, Tan Jiang, Tan Wang, Kun Kuang, Zhou Zhao, Jianke Zhu, Jin Yu, Hongxia Yang, Fei Wu
In this paper, we propose to investigate the problem of out-of-domain visio-linguistic pretraining, where the pretraining data distribution differs from that of downstream data on which the pretrained model will be fine-tuned.
no code implementations • 8 Jan 2020 • Shu-Ting Shi, Wenhao Zheng, Jun Tang, Qing-Guo Chen, Yao Hu, Jianke Zhu, Ming Li
Click-through rate (CTR) prediction is an essential task in industrial applications such as video recommendation.
1 code implementation • 21 Oct 2019 • Jialiang Zhang, Lixiang Lin, Yang Li, Yun-chen Chen, Jianke Zhu, Yao Hu, Steven C. H. Hoi
To tackle this critical problem, we propose an attribute-aware pedestrian detector to explicitly model people's semantic attributes in a high-level feature detection fashion.
no code implementations • 11 Sep 2019 • Yang Li, Florian Richter, Jingpei Lu, Emily K. Funk, Ryan K. Orosco, Jianke Zhu, Michael C. Yip
In this work, we propose a novel surgical perception framework, SuPer, for surgical robotic control.
no code implementations • 16 Jul 2018 • Shufei Zhang, Kai-Zhu Huang, Jianke Zhu, Yang Liu
All the existing adversarial training methods consider only how the worst perturbed examples (i.e., adversarial examples) could affect the model output.
no code implementations • 22 Mar 2018 • Xiongwei Wu, Daoxin Zhang, Jianke Zhu, Steven C. H. Hoi
Recent years have witnessed many exciting achievements for object detection using deep learning techniques.
2 code implementations • 14 Dec 2017 • Yang Li, Jianke Zhu, Steven C. H. Hoi, Wenjie Song, Zhefeng Wang, Hantang Liu
In order to efficiently search such a large 4-DoF space in real time, we formulate the problem as two 2-DoF sub-problems and apply an efficient Block Coordinate Descent solver to optimize the estimation result.
no code implementations • 3 Dec 2017 • Jialiang Zhang, Xiongwei Wu, Jianke Zhu, Steven C. H. Hoi
In this paper, we propose a novel simple yet effective framework of "Feature Agglomeration Networks" (FANet) to build a new single stage face detector, which not only achieves state-of-the-art performance but also runs efficiently.
no code implementations • 16 Nov 2017 • Dan Ma, Bin Liu, Zhao Kang, Jiayu Zhou, Jianke Zhu, Zenglin Xu
Generating high fidelity identity-preserving faces with different facial attributes has a wide range of applications.
no code implementations • 15 Mar 2016 • Qingqun Ning, Jianke Zhu, Zhiyuan Zhong, Steven C. H. Hoi, Chun Chen
Unlike the existing approaches, in this paper, we propose a novel approach called Sparse Product Quantization (SPQ) to encode high-dimensional feature vectors into a sparse representation.
no code implementations • CVPR 2015 • Yang Li, Jianke Zhu, Steven C. H. Hoi
Most modern trackers typically employ a bounding box given in the first frame to track visual objects, where their tracking results are often sensitive to the initialization.
no code implementations • NeurIPS 2009 • Lei Wu, Rong Jin, Steven C. Hoi, Jianke Zhu, Nenghai Yu
Learning distance functions with side information plays a key role in many machine learning and data mining applications.
no code implementations • NeurIPS 2009 • Zenglin Xu, Rong Jin, Jianke Zhu, Irwin King, Michael Lyu, Zhirong Yang
In this framework, SVM and TSVM can be regarded as a learning machine without regularization and one with full regularization from the unlabeled data, respectively.
no code implementations • NeurIPS 2007 • Zenglin Xu, Rong Jin, Jianke Zhu, Irwin King, Michael Lyu
We consider the problem of Support Vector Machine transduction, which involves a combinatorial problem with exponential computational complexity in the number of unlabeled examples.