Search Results for author: Qi Ye

Found 32 papers, 6 papers with code

Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration

no code implementations 16 Apr 2024 Jing Zeng, Yanxu Li, Jiahao Sun, Qi Ye, Yunlong Ran, Jiming Chen

In this paper, we propose to combine frontier-based exploration tasks for global coverage with implicit surface uncertainty-based reconstruction tasks to achieve high-quality reconstruction.

3D Scene Reconstruction Indoor Scene Reconstruction

Context-Aware Integration of Language and Visual References for Natural Language Tracking

no code implementations 29 Mar 2024 Yanyan Shao, Shuting He, Qi Ye, Yuchao Feng, Wenhan Luo, Jiming Chen

Tracking by natural language specification (TNL) aims to consistently localize a target in a video sequence given a linguistic description in the initial frame.

SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising

1 code implementation 7 Mar 2024 Tao Zhou, Wenhan Luo, Qi Ye, Zhiguo Shi, Jiming Chen

Recently, promptable segmentation models, such as the Segment Anything Model (SAM), have demonstrated robust zero-shot generalization capabilities on static images.

Denoising Instance Segmentation +4

2L3: Lifting Imperfect Generated 2D Images into Accurate 3D

no code implementations 29 Jan 2024 Yizheng Chen, Rengan Xie, Qi Ye, Sen yang, Zixuan Xie, Tianxiao Chen, Rong Li, Yuchi Huo

Specifically, we first decouple the shading information from the generated images to reduce the impact of inconsistent lighting; then, we introduce a mono prior with view-dependent transient encoding to enhance the reconstructed normal; and finally, we design a view augmentation fusion strategy that minimizes pixel-level loss in generated sparse views and semantic loss in augmented random views, resulting in view-consistent geometry and detailed textures.

3D Object Reconstruction 3D Reconstruction +1

In-Hand 3D Object Reconstruction from a Monocular RGB Video

no code implementations 27 Dec 2023 Shijian Jiang, Qi Ye, Rengan Xie, Yuchi Huo, Xiang Li, Yang Zhou, Jiming Chen

We evaluate our approach on the HO3D and HOD datasets and demonstrate that it outperforms state-of-the-art methods in terms of reconstruction surface quality, with an improvement of $52\%$ on HO3D and $20\%$ on HOD.

3D Object Reconstruction 3D Reconstruction +2

Holistic Inverse Rendering of Complex Facade via Aerial 3D Scanning

no code implementations 20 Nov 2023 Zixuan Xie, Rengan Xie, Rong Li, Kai Huang, Pengju Qiao, Jingsen Zhu, Xu Yin, Qi Ye, Wei Hua, Yuchi Huo, Hujun Bao

In this work, we use multi-view aerial images to reconstruct the geometry, lighting, and material of facades using neural signed distance fields (SDFs).

Benchmarking Inverse Rendering +2

Expressibility-induced Concentration of Quantum Neural Tangent Kernels

no code implementations 8 Nov 2023 Li-Wei Yu, Weikang Li, Qi Ye, Zhide Lu, Zizhao Han, Dong-Ling Deng

In particular, for global loss functions, we rigorously prove that high expressibility of both the global and local quantum encodings can lead to exponential concentration of quantum tangent kernel values to zero.

Quantum Machine Learning

InterTracker: Discovering and Tracking General Objects Interacting with Hands in the Wild

no code implementations 6 Aug 2023 Yanyan Shao, Qi Ye, Wenhan Luo, Kaihao Zhang, Jiming Chen

Understanding human interaction with objects is an important research topic for embodied Artificial Intelligence, and identifying the objects that humans are interacting with is a primary problem for interaction understanding.

Object Object Tracking

Seal-3D: Interactive Pixel-Level Editing for Neural Radiance Fields

1 code implementation ICCV 2023 Xiangyu Wang, Jingsen Zhu, Qi Ye, Yuchi Huo, Yunlong Ran, Zhihua Zhong, Jiming Chen

With the popularity of implicit neural representations, or neural radiance fields (NeRF), there is a pressing need for editing methods to interact with the implicit 3D models for tasks like post-processing reconstructed scenes and 3D content creation.

Evaluation of Multi-indicator And Multi-organ Medical Image Segmentation Models

1 code implementation 1 Jun 2023 Qi Ye, Lihua Guo

As a result, MIMO offers detailed information on the segmentation of each organ in each sample, thereby aiding developers in analyzing and improving the model.

Image Segmentation Medical Image Segmentation +2

I$^2$-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations 14 Mar 2023 Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

In this work, we present I$^2$-SDF, a new method for intrinsic indoor scene reconstruction and editing using differentiable Monte Carlo raytracing on neural signed distance fields (SDFs).

Indoor Scene Reconstruction Novel View Synthesis

Composite Optimization Algorithms for Sigmoid Networks

no code implementations 1 Mar 2023 Huixiong Chen, Qi Ye

In this paper, we use composite optimization algorithms to solve sigmoid networks.

Handwritten Digit Recognition

Metadata-Based RAW Reconstruction via Implicit Neural Functions

no code implementations CVPR 2023 Leyi Li, Huijie Qiao, Qi Ye, Qinmin Yang

Many low-level computer vision tasks benefit from using the unprocessed RAW image as input, since it retains the linear relationship between pixel values and scene radiance.

Raw reconstruction Super-Resolution

I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations CVPR 2023 Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

Further, we propose to decompose the neural radiance field into spatially-varying material of the scene as a neural field through surface-based, differentiable Monte Carlo raytracing and emitter semantic segmentations, which enables physically based and photorealistic scene relighting and editing applications.

Indoor Scene Reconstruction Novel View Synthesis

AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model

1 code implementation 21 Nov 2022 Yongyu Yan, Kui Xue, Xiaoming Shi, Qi Ye, Jingping Liu, Tong Ruan

Continual pretraining is a popular way of building a domain-specific pretrained language model from a general-domain language model.

Continual Pretraining Language Modelling

Pixel-Aligned Non-parametric Hand Mesh Reconstruction

no code implementations 17 Oct 2022 Shijian Jiang, Guwen Han, Danhang Tang, Yang Zhou, Xiang Li, Jiming Chen, Qi Ye

The decoder aggregates both local image features in pixels and geometric features in vertices.

Contact2Grasp: 3D Grasp Synthesis via Hand-Object Contact Constraint

no code implementations 17 Oct 2022 Haoming Li, Xinzhuo Lin, Yang Zhou, Xiang Li, Yuchi Huo, Jiming Chen, Qi Ye

To tackle the challenge, we introduce an intermediate variable for grasp contact areas to constrain the grasp generation; in other words, we factorize the mapping into two sequential stages by assuming that grasping poses are fully constrained given contact maps: 1) we first learn contact map distributions to generate the potential contact maps for grasps; 2) then learn a mapping from the contact maps to the grasping poses.

Grasp Generation Object +2

ImmFusion: Robust mmWave-RGB Fusion for 3D Human Body Reconstruction in All Weather Conditions

no code implementations 4 Oct 2022 Anjun Chen, Xiangyu Wang, Kun Shi, Shaohao Zhu, Bin Fang, Yingfeng Chen, Jiming Chen, Yuchi Huo, Qi Ye

However, combining RGB and mmWave signals for robust all-weather 3D human reconstruction is still an open challenge, given the sparse nature of mmWave and the vulnerability of RGB images.

3D Human Reconstruction

mmBody Benchmark: 3D Body Reconstruction Dataset and Analysis for Millimeter Wave Radar

no code implementations 12 Sep 2022 Anjun Chen, Xiangyu Wang, Shaohao Zhu, Yanxu Li, Jiming Chen, Qi Ye

The results demonstrate that 1) despite the noise and sparsity of the generated point clouds, the mmWave radar can achieve better reconstruction accuracy than the RGB camera but worse than the depth camera; 2) the reconstruction from the mmWave radar is only moderately affected by adverse weather conditions, while the RGB(D) camera is severely affected.

NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction with Implicit Neural Representations

no code implementations 22 Jul 2022 Yunlong Ran, Jing Zeng, Shibo He, Lincheng Li, Yingfeng Chen, Gimhee Lee, Jiming Chen, Qi Ye

In this paper, we explore for the first time the possibility of using implicit neural representations for autonomous 3D scene reconstruction by addressing two key challenges: 1) seeking a criterion to measure the quality of the candidate viewpoints for the view planning based on the new representations, and 2) learning the criterion from data so that it can generalize to different scenes, instead of hand-crafting one.

3D Reconstruction 3D Scene Reconstruction

Analysis of Regularized Learning for Linear-functional Data in Banach Spaces

no code implementations 7 Sep 2021 Qi Ye

In the convergence theorems, we show the convergence of the approximate solutions to the exact solutions by the weak* topology of the Banach space.

Geometry-based Distance Decomposition for Monocular 3D Object Detection

1 code implementation ICCV 2021 Xuepeng Shi, Qi Ye, Xiaozhi Chen, Chuangrong Chen, Zhixiang Chen, Tae-Kyun Kim

The experimental results show that our method achieves the state-of-the-art performance on the monocular 3D Object Detection and Birds Eye View tasks of the KITTI dataset, and can generalize to images with different camera intrinsics.

Autonomous Driving Monocular 3D Object Detection +2

Optimal Retirement Time and Consumption with the Variation in Habitual Persistence

no code implementations 31 Mar 2021 Lin He, Zongxia Liang, Yilun Song, Qi Ye

In this paper, we study the individual's optimal retirement time and optimal consumption under habitual persistence.

The Phong Surface: Efficient 3D Model Fitting using Lifted Optimization

no code implementations ECCV 2020 Jingjing Shen, Thomas J. Cashman, Qi Ye, Tim Hutton, Toby Sharp, Federica Bogo, Andrew William Fitzgibbon, Jamie Shotton

Realtime perceptual and interaction capabilities in mixed reality require a range of 3D tracking problems to be solved at low latency on resource-constrained hardware such as head-mounted devices.

Mixed Reality

Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density Network

no code implementations ECCV 2018 Qi Ye, Tae-Kyun Kim

The proposed method leverages the state-of-the-art hand pose estimators based on Convolutional Neural Networks to facilitate feature learning, while it models the multiple modes in a two-level hierarchy to reconcile single-valued and multi-valued mapping in its output.

Hand Pose Estimation

The 2017 Hands in the Million Challenge on 3D Hand Pose Estimation

no code implementations 7 Jul 2017 Shanxin Yuan, Qi Ye, Guillermo Garcia-Hernando, Tae-Kyun Kim

We present the 2017 Hands in the Million Challenge, a public competition designed for the evaluation of the task of 3D hand pose estimation.

3D Hand Pose Estimation

Spatial Attention Deep Net with Partial PSO for Hierarchical Hybrid Hand Pose Estimation

1 code implementation 12 Apr 2016 Qi Ye, Shanxin Yuan, Tae-Kyun Kim

In this paper, a hybrid hand pose estimation method is proposed by applying the kinematic hierarchy strategy to the input space (as well as the output space) of the discriminative method by a spatial attention mechanism and to the optimization of the generative method by hierarchical Particle Swarm Optimization (PSO).

Hand Pose Estimation

Solving Support Vector Machines in Reproducing Kernel Banach Spaces with Positive Definite Functions

no code implementations 6 Sep 2012 Gregory E. Fasshauer, Fred J. Hickernell, Qi Ye

In this paper we solve support vector machines in reproducing kernel Banach spaces with reproducing kernels defined on nonsymmetric domains instead of the traditional methods in reproducing kernel Hilbert spaces.
