1 code implementation • 18 Dec 2024 • Haotong Lin, Sida Peng, Jingxiao Chen, Songyou Peng, Jiaming Sun, Minghuan Liu, Hujun Bao, Jiashi Feng, Xiaowei Zhou, Bingyi Kang
Prompts play a critical role in unleashing the power of language and vision foundation models for specific tasks.
1 code implementation • 12 Dec 2024 • Zhen Xu, Yinghao Xu, Zhiyuan Yu, Sida Peng, Jiaming Sun, Hujun Bao, Xiaowei Zhou
In addition, the tree-like structure of the Gaussian hierarchy allows us to efficiently represent the scene at a particular moment with a subset of Gaussian primitives, leading to nearly constant GPU memory usage during training or rendering regardless of the video length.
no code implementations • 7 Dec 2024 • Tierui Gong, Jiaming Sun, Chau Yuen, Guangwei Hu, Yufei Zhao, Yong Liang Guan, Chong Meng Samson See, Mérouane Debbah, Lajos Hanzo
Our scheme and model provide explicit design guidance to RAQR-aided wireless systems.
1 code implementation • 11 Dec 2023 • Zhen Xu, Tao Xie, Sida Peng, Haotong Lin, Qing Shuai, Zhiyuan Yu, Guangzhao He, Jiaming Sun, Hujun Bao, Xiaowei Zhou
Volumetric video is a technology that digitally records dynamic events such as artistic performances, sporting events, and remote conversations.
no code implementations • CVPR 2024 • Zhen Xu, Sida Peng, Haotong Lin, Guangzhao He, Jiaming Sun, Yujun Shen, Hujun Bao, Xiaowei Zhou
Experiments show that our representation can be rendered at over 400 FPS on the DNA-Rendering dataset at 1080p resolution and 80 FPS on the ENeRF-Outdoor dataset at 4K resolution using an RTX 4090 GPU, which is 30x faster than previous methods, while achieving state-of-the-art rendering quality.
no code implementations • 14 Sep 2023 • Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan
To address this issue, we draw inspiration from the multifaceted capabilities of LLMs and Whisper, and focus on integrating multiple ASR text processing tasks related to speech recognition into the ASR model.
no code implementations • CVPR 2024 • Zhen Xu, Sida Peng, Chen Geng, Linzhan Mou, Zihan Yan, Jiaming Sun, Hujun Bao, Xiaowei Zhou
Based on the HDQ algorithm, we leverage sphere tracing to efficiently estimate the surface intersection and light visibility.
1 code implementation • CVPR 2024 • Xingyi He, Jiaming Sun, Yifan Wang, Sida Peng, QiXing Huang, Hujun Bao, Xiaowei Zhou
We propose a new detector-free SfM framework that draws on the recent success of detector-free matchers to avoid the early determination of keypoints, while solving the multi-view inconsistency issue of detector-free matchers.
no code implementations • 18 Jan 2023 • Xingyi He, Jiaming Sun, Yuang Wang, Di Huang, Hujun Bao, Xiaowei Zhou
We propose a new method for object pose estimation without CAD models.
no code implementations • 30 Nov 2022 • Di Huang, Xiaopeng Ji, Xingyi He, Jiaming Sun, Tong He, Qing Shuai, Wanli Ouyang, Xiaowei Zhou
The key idea is that the hand motion naturally provides multiple views of the object and the motion can be reliably estimated by a hand pose tracker.
1 code implementation • 25 May 2022 • Jiaming Sun, Xi Chen, Qianqian Wang, Zhengqi Li, Hadar Averbuch-Elor, Xiaowei Zhou, Noah Snavely
We are witnessing an explosion of neural implicit representations in computer vision and graphics.
1 code implementation • CVPR 2022 • Jiaming Sun, ZiHao Wang, Siyu Zhang, Xingyi He, Hongcheng Zhao, Guofeng Zhang, Xiaowei Zhou
We propose a new method named OnePose for object pose estimation.
1 code implementation • CVPR 2022 • Yuanqing Zhang, Jiaming Sun, Xingyi He, Huan Fu, Rongfei Jia, Xiaowei Zhou
The key insight is that indirect illumination can be conveniently derived from the neural radiance field learned from input images instead of being estimated jointly with direct illumination and materials.
3 code implementations • CVPR 2021 • Jiaming Sun, Yiming Xie, Linghao Chen, Xiaowei Zhou, Hujun Bao
We present a novel framework named NeuralRecon for real-time 3D scene reconstruction from a monocular video.
4 code implementations • CVPR 2021 • Jiaming Sun, Zehong Shen, Yuang Wang, Hujun Bao, Xiaowei Zhou
We present a novel method for local image feature matching.
no code implementations • ICCV 2021 • Jiaming Sun, Yiming Xie, Siyu Zhang, Linghao Chen, Guofeng Zhang, Hujun Bao, Xiaowei Zhou
In this work, we propose a novel system for integrated 3D object detection and tracking, which uses a dynamic object occupancy map and previous object states as spatial-temporal memory to assist object detection in future frames.
1 code implementation • ECCV 2020 • Jianan Zhen, Qi Fang, Jiaming Sun, Wentao Liu, Wei Jiang, Hujun Bao, Xiaowei Zhou
Recovering multi-person 3D poses with absolute scales from a single RGB image is a challenging problem due to the inherent depth and scale ambiguity from a single view.
Ranked #11 on 3D Multi-Person Pose Estimation (absolute) on MuPoTS-3D
1 code implementation • CVPR 2020 • Jiaming Sun, Linghao Chen, Yiming Xie, Siyu Zhang, Qinhong Jiang, Xiaowei Zhou, Hujun Bao
In this paper, we propose a novel system named Disp R-CNN for 3D object detection from stereo images.
3D Object Detection From Stereo Images
Disparity Estimation