no code implementations • 16 Apr 2024 • Jing Zeng, Yanxu Li, Jiahao Sun, Qi Ye, Yunlong Ran, Jiming Chen
In the paper, we propose to 1) incorporate frontier-based exploration tasks for global coverage with implicit surface uncertainty-based reconstruction tasks to achieve high-quality reconstruction.
no code implementations • 29 Mar 2024 • Yanyan Shao, Shuting He, Qi Ye, Yuchao Feng, Wenhan Luo, Jiming Chen
Tracking by natural language specification (TNL) aims to consistently localize a target in a video sequence given a linguistic description in the initial frame.
1 code implementation • 7 Mar 2024 • Tao Zhou, Wenhan Luo, Qi Ye, Zhiguo Shi, Jiming Chen
Recently, promptable segmentation models, such as the Segment Anything Model (SAM), have demonstrated robust zero-shot generalization capabilities on static images.
no code implementations • 29 Jan 2024 • Yizheng Chen, Rengan Xie, Qi Ye, Sen yang, Zixuan Xie, Tianxiao Chen, Rong Li, Yuchi Huo
Specifically, we first leverage to decouple the shading information from the generated images to reduce the impact of inconsistent lighting; then, we introduce mono prior with view-dependent transient encoding to enhance the reconstructed normal; and finally, we design a view augmentation fusion strategy that minimizes pixel-level loss in generated sparse views and semantic loss in augmented random views, resulting in view-consistent geometry and detailed textures.
no code implementations • 27 Dec 2023 • Shijian Jiang, Qi Ye, Rengan Xie, Yuchi Huo, Xiang Li, Yang Zhou, Jiming Chen
We evaluate our approach on HO3D and HOD datasets and demonstrate that it outperforms the state-of-the-art methods in terms of reconstruction surface quality, with an improvement of $52\%$ on HO3D and $20\%$ on HOD.
no code implementations • 20 Nov 2023 • Zixuan Xie, Rengan Xie, Rong Li, Kai Huang, Pengju Qiao, Jingsen Zhu, Xu Yin, Qi Ye, Wei Hua, Yuchi Huo, Hujun Bao
In this work, we use multi-view aerial images to reconstruct the geometry, lighting, and material of facades using neural signed distance fields (SDFs).
no code implementations • 8 Nov 2023 • Li-Wei Yu, Weikang Li, Qi Ye, Zhide Lu, Zizhao Han, Dong-Ling Deng
In particular, for global loss functions, we rigorously prove that high expressibility of both the global and local quantum encodings can lead to exponential concentration of quantum tangent kernel values to zero.
no code implementations • 6 Aug 2023 • Yanyan Shao, Qi Ye, Wenhan Luo, Kaihao Zhang, Jiming Chen
Understanding human interaction with objects is an important research topic for embodied Artificial Intelligence and identifying the objects that humans are interacting with is a primary problem for interaction understanding.
1 code implementation • ICCV 2023 • Xiangyu Wang, Jingsen Zhu, Qi Ye, Yuchi Huo, Yunlong Ran, Zhihua Zhong, Jiming Chen
With the popularity of implicit neural representations, or neural radiance fields (NeRF), there is a pressing need for editing methods to interact with the implicit 3D models for tasks like post-processing reconstructed scenes and 3D content creation.
1 code implementation • 1 Jun 2023 • Qi Ye, Lihua Guo
As a result, MIMO offers detailed information on the segmentation of each organ in each sample, thereby aiding developers in analyzing and improving the model.
no code implementations • 14 Mar 2023 • Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang
In this work, we present I$^2$-SDF, a new method for intrinsic indoor scene reconstruction and editing using differentiable Monte Carlo raytracing on neural signed distance fields (SDFs).
no code implementations • 1 Mar 2023 • Huixiong Chen, Qi Ye
In this paper, we use composite optimization algorithms to solve sigmoid networks.
no code implementations • CVPR 2023 • Leyi Li, Huijie Qiao, Qi Ye, Qinmin Yang
Many low-level computer vision tasks are desirable to utilize the unprocessed RAW image as input, which remains the linear relationship between pixel values and scene radiance.
no code implementations • CVPR 2023 • Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang
Further, we propose to decompose the neural radiance field into spatially-varying material of the scene as a neural field through surface-based, differentiable Monte Carlo raytracing and emitter semantic segmentations, which enables physically based and photorealistic scene relighting and editing applications.
no code implementations • ICCV 2023 • Tao Zhou, Qi Ye, Wenhan Luo, Kaihao Zhang, Zhiguo Shi, Jiming Chen
Multi-object tracking (MOT) aims to build moving trajectories for number-agnostic objects.
1 code implementation • 21 Nov 2022 • Yongyu Yan, Kui Xue, Xiaoming Shi, Qi Ye, Jingping Liu, Tong Ruan
Continual pretraining is a popular way of building a domain-specific pretrained language model from a general-domain language model.
no code implementations • 17 Oct 2022 • Shijian Jiang, Guwen Han, Danhang Tang, Yang Zhou, Xiang Li, Jiming Chen, Qi Ye
The decoder aggregate both local image features in pixels and geometric features in vertices.
no code implementations • 17 Oct 2022 • Haoming Li, Xinzhuo Lin, Yang Zhou, Xiang Li, Yuchi Huo, Jiming Chen, Qi Ye
To tackle the challenge, we introduce an intermediate variable for grasp contact areas to constrain the grasp generation; in other words, we factorize the mapping into two sequential stages by assuming that grasping poses are fully constrained given contact maps: 1) we first learn contact map distributions to generate the potential contact maps for grasps; 2) then learn a mapping from the contact maps to the grasping poses.
no code implementations • 4 Oct 2022 • Anjun Chen, Xiangyu Wang, Kun Shi, Shaohao Zhu, Bin Fang, Yingfeng Chen, Jiming Chen, Yuchi Huo, Qi Ye
However, combining RGB and mmWave signals for robust all-weather 3D human reconstruction is still an open challenge, given the sparse nature of mmWave and the vulnerability of RGB images.
no code implementations • 12 Sep 2022 • Anjun Chen, Xiangyu Wang, Shaohao Zhu, Yanxu Li, Jiming Chen, Qi Ye
The results demonstrate that 1) despite the noise and sparsity of the generated point clouds, the mmWave radar can achieve better reconstruction accuracy than the RGB camera but worse than the depth camera; 2) the reconstruction from the mmWave radar is affected by adverse weather conditions moderately while the RGB(D) camera is severely affected.
no code implementations • 22 Jul 2022 • Yunlong Ran, Jing Zeng, Shibo He, Lincheng Li, Yingfeng Chen, Gimhee Lee, Jiming Chen, Qi Ye
In this paper, we explore for the first time the possibility of using implicit neural representations for autonomous 3D scene reconstruction by addressing two key challenges: 1) seeking a criterion to measure the quality of the candidate viewpoints for the view planning based on the new representations, and 2) learning the criterion from data that can generalize to different scenes instead of a hand-crafting one.
no code implementations • 1 Nov 2021 • Chao Dong, Qi Ye, Wenchao Meng, Kaixiang Yang
Recent approaches based on metric learning have achieved great progress in few-shot learning.
no code implementations • 7 Sep 2021 • Qi Ye
In the convergence theorems, we show the convergence of the approximate solutions to the exact solutions by the weak* topology of the Banach space.
no code implementations • 19 Jul 2021 • Haoyuan Cai, Qi Ye, Dong-Ling Deng
Quantum computers hold unprecedented potentials for machine learning applications.
1 code implementation • ICCV 2021 • Xuepeng Shi, Qi Ye, Xiaozhi Chen, Chuangrong Chen, Zhixiang Chen, Tae-Kyun Kim
The experimental results show that our method achieves the state-of-the-art performance on the monocular 3D Object Detection and Birds Eye View tasks of the KITTI dataset, and can generalize to images with different camera intrinsics.
Ranked #15 on Monocular 3D Object Detection on KITTI Cars Moderate
no code implementations • 31 Mar 2021 • Lin He, Zongxia Liang, Yilun Song, Qi Ye
In this paper, we study the individual's optimal retirement time and optimal consumption under habitual persistence.
no code implementations • ECCV 2020 • Jingjing Shen, Thomas J. Cashman, Qi Ye, Tim Hutton, Toby Sharp, Federica Bogo, Andrew William Fitzgibbon, Jamie Shotton
Realtime perceptual and interaction capabilities in mixed reality require a range of 3D tracking problems to be solved at low latency on resource-constrained hardware such as head-mounted devices.
no code implementations • ECCV 2018 • Qi Ye, Tae-Kyun Kim
The proposed method leverages the state-of-the-art hand pose estimators based on Convolutional Neural Networks to facilitate feature learning, while it models the multiple modes in a two-level hierarchy to reconcile single-valued and multi-valued mapping in its output.
no code implementations • 7 Jul 2017 • Shanxin Yuan, Qi Ye, Guillermo Garcia-Hernando, Tae-Kyun Kim
We present the 2017 Hands in the Million Challenge, a public competition designed for the evaluation of the task of 3D hand pose estimation.
no code implementations • CVPR 2017 • Shanxin Yuan, Qi Ye, Bjorn Stenger, Siddhant Jain, Tae-Kyun Kim
We also show significant improvements in egocentric hand pose estimation with a CNN trained on the new dataset.
1 code implementation • 12 Apr 2016 • Qi Ye, Shanxin Yuan, Tae-Kyun Kim
In this paper, a hybrid hand pose estimation method is proposed by applying the kinematic hierarchy strategy to the input space (as well as the output space) of the discriminative method by a spatial attention mechanism and to the optimization of the generative method by hierarchical Particle Swarm Optimization (PSO).
no code implementations • 6 Sep 2012 • Gregory E. Fasshauer, Fred J. Hickernell, Qi Ye
In this paper we solve support vector machines in reproducing kernel Banach spaces with reproducing kernels defined on nonsymmetric domains instead of the traditional methods in reproducing kernel Hilbert spaces.