Search Results for author: Qi Ye

Found 32 papers, 6 papers with code

Autonomous Implicit Indoor Scene Reconstruction with Frontier Exploration

no code implementations • 16 Apr 2024 • Jing Zeng, Yanxu Li, Jiahao Sun, Qi Ye, Yunlong Ran, Jiming Chen

In this paper, we propose to combine frontier-based exploration tasks for global coverage with implicit surface uncertainty-based reconstruction tasks to achieve high-quality reconstruction.

3D Scene Reconstruction • Indoor Scene Reconstruction

Context-Aware Integration of Language and Visual References for Natural Language Tracking

no code implementations • 29 Mar 2024 • Yanyan Shao, Shuting He, Qi Ye, Yuchao Feng, Wenhan Luo, Jiming Chen

Tracking by natural language specification (TNL) aims to consistently localize a target in a video sequence given a linguistic description in the initial frame.

SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising

1 code implementation • 7 Mar 2024 • Tao Zhou, Wenhan Luo, Qi Ye, Zhiguo Shi, Jiming Chen

Recently, promptable segmentation models, such as the Segment Anything Model (SAM), have demonstrated robust zero-shot generalization capabilities on static images.

Denoising • Instance Segmentation • +4

2L3: Lifting Imperfect Generated 2D Images into Accurate 3D

no code implementations • 29 Jan 2024 • Yizheng Chen, Rengan Xie, Qi Ye, Sen Yang, Zixuan Xie, Tianxiao Chen, Rong Li, Yuchi Huo

Specifically, we first decouple the shading information from the generated images to reduce the impact of inconsistent lighting; then, we introduce a mono prior with view-dependent transient encoding to enhance the reconstructed normals; and finally, we design a view augmentation fusion strategy that minimizes pixel-level loss in generated sparse views and semantic loss in augmented random views, resulting in view-consistent geometry and detailed textures.

3D Object Reconstruction • 3D Reconstruction • +1

In-Hand 3D Object Reconstruction from a Monocular RGB Video

no code implementations • 27 Dec 2023 • Shijian Jiang, Qi Ye, Rengan Xie, Yuchi Huo, Xiang Li, Yang Zhou, Jiming Chen

We evaluate our approach on HO3D and HOD datasets and demonstrate that it outperforms the state-of-the-art methods in terms of reconstruction surface quality, with an improvement of $52\%$ on HO3D and $20\%$ on HOD.

3D Object Reconstruction • 3D Reconstruction • +2

Holistic Inverse Rendering of Complex Facade via Aerial 3D Scanning

no code implementations • 20 Nov 2023 • Zixuan Xie, Rengan Xie, Rong Li, Kai Huang, Pengju Qiao, Jingsen Zhu, Xu Yin, Qi Ye, Wei Hua, Yuchi Huo, Hujun Bao

In this work, we use multi-view aerial images to reconstruct the geometry, lighting, and material of facades using neural signed distance fields (SDFs).

Benchmarking • Inverse Rendering • +2

Expressibility-induced Concentration of Quantum Neural Tangent Kernels

no code implementations • 8 Nov 2023 • Li-Wei Yu, Weikang Li, Qi Ye, Zhide Lu, Zizhao Han, Dong-Ling Deng

In particular, for global loss functions, we rigorously prove that high expressibility of both the global and local quantum encodings can lead to exponential concentration of quantum tangent kernel values to zero.

Quantum Machine Learning

InterTracker: Discovering and Tracking General Objects Interacting with Hands in the Wild

no code implementations • 6 Aug 2023 • Yanyan Shao, Qi Ye, Wenhan Luo, Kaihao Zhang, Jiming Chen

Understanding human interaction with objects is an important research topic for embodied Artificial Intelligence, and identifying the objects that humans are interacting with is a primary problem for interaction understanding.

Object • Object Tracking

Seal-3D: Interactive Pixel-Level Editing for Neural Radiance Fields

1 code implementation • ICCV 2023 • Xiangyu Wang, Jingsen Zhu, Qi Ye, Yuchi Huo, Yunlong Ran, Zhihua Zhong, Jiming Chen

With the popularity of implicit neural representations, or neural radiance fields (NeRF), there is a pressing need for editing methods to interact with the implicit 3D models for tasks like post-processing reconstructed scenes and 3D content creation.

Evaluation of Multi-indicator And Multi-organ Medical Image Segmentation Models

1 code implementation • 1 Jun 2023 • Qi Ye, Lihua Guo

As a result, MIMO offers detailed information on the segmentation of each organ in each sample, thereby aiding developers in analyzing and improving the model.

Image Segmentation • Medical Image Segmentation • +2

I$^2$-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations • 14 Mar 2023 • Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

In this work, we present I$^2$-SDF, a new method for intrinsic indoor scene reconstruction and editing using differentiable Monte Carlo raytracing on neural signed distance fields (SDFs).

Indoor Scene Reconstruction • Novel View Synthesis

Composite Optimization Algorithms for Sigmoid Networks

no code implementations • 1 Mar 2023 • Huixiong Chen, Qi Ye

In this paper, we use composite optimization algorithms to solve sigmoid networks.

Handwritten Digit Recognition

Metadata-Based RAW Reconstruction via Implicit Neural Functions

no code implementations • CVPR 2023 • Leyi Li, Huijie Qiao, Qi Ye, Qinmin Yang

For many low-level computer vision tasks, it is desirable to use the unprocessed RAW image as input, since it retains the linear relationship between pixel values and scene radiance.

Raw reconstruction • Super-Resolution

I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs

no code implementations • CVPR 2023 • Jingsen Zhu, Yuchi Huo, Qi Ye, Fujun Luan, Jifan Li, Dianbing Xi, Lisha Wang, Rui Tang, Wei Hua, Hujun Bao, Rui Wang

Further, we propose to decompose the neural radiance field into spatially-varying material of the scene as a neural field through surface-based, differentiable Monte Carlo raytracing and emitter semantic segmentations, which enables physically based and photorealistic scene relighting and editing applications.

Indoor Scene Reconstruction • Novel View Synthesis

F&F Attack: Adversarial Attack against Multiple Object Trackers by Inducing False Negatives and False Positives

no code implementations • ICCV 2023 • Tao Zhou, Qi Ye, Wenhan Luo, Kaihao Zhang, Zhiguo Shi, Jiming Chen

Multi-object tracking (MOT) aims to build moving trajectories for number-agnostic objects.

Adversarial Attack • Multi-Object Tracking • +1

AF Adapter: Continual Pretraining for Building Chinese Biomedical Language Model

1 code implementation • 21 Nov 2022 • Yongyu Yan, Kui Xue, Xiaoming Shi, Qi Ye, Jingping Liu, Tong Ruan

Continual pretraining is a popular way of building a domain-specific pretrained language model from a general-domain language model.

Continual Pretraining • Language Modelling

Pixel-Aligned Non-parametric Hand Mesh Reconstruction

no code implementations • 17 Oct 2022 • Shijian Jiang, Guwen Han, Danhang Tang, Yang Zhou, Xiang Li, Jiming Chen, Qi Ye

The decoder aggregates both local image features in pixels and geometric features in vertices.

Contact2Grasp: 3D Grasp Synthesis via Hand-Object Contact Constraint

no code implementations • 17 Oct 2022 • Haoming Li, Xinzhuo Lin, Yang Zhou, Xiang Li, Yuchi Huo, Jiming Chen, Qi Ye

To tackle the challenge, we introduce an intermediate variable for grasp contact areas to constrain the grasp generation; in other words, we factorize the mapping into two sequential stages by assuming that grasping poses are fully constrained given contact maps: 1) we first learn contact map distributions to generate the potential contact maps for grasps; 2) then learn a mapping from the contact maps to the grasping poses.

Grasp Generation • Object • +2

ImmFusion: Robust mmWave-RGB Fusion for 3D Human Body Reconstruction in All Weather Conditions

no code implementations • 4 Oct 2022 • Anjun Chen, Xiangyu Wang, Kun Shi, Shaohao Zhu, Bin Fang, Yingfeng Chen, Jiming Chen, Yuchi Huo, Qi Ye

However, combining RGB and mmWave signals for robust all-weather 3D human reconstruction is still an open challenge, given the sparse nature of mmWave and the vulnerability of RGB images.

3D Human Reconstruction

mmBody Benchmark: 3D Body Reconstruction Dataset and Analysis for Millimeter Wave Radar

no code implementations • 12 Sep 2022 • Anjun Chen, Xiangyu Wang, Shaohao Zhu, Yanxu Li, Jiming Chen, Qi Ye

The results demonstrate that 1) despite the noise and sparsity of the generated point clouds, the mmWave radar can achieve better reconstruction accuracy than the RGB camera but worse than the depth camera; 2) the reconstruction from the mmWave radar is moderately affected by adverse weather conditions, while the RGB(D) camera is severely affected.

NeurAR: Neural Uncertainty for Autonomous 3D Reconstruction with Implicit Neural Representations

no code implementations • 22 Jul 2022 • Yunlong Ran, Jing Zeng, Shibo He, Lincheng Li, Yingfeng Chen, Gimhee Lee, Jiming Chen, Qi Ye

In this paper, we explore for the first time the possibility of using implicit neural representations for autonomous 3D scene reconstruction by addressing two key challenges: 1) seeking a criterion to measure the quality of the candidate viewpoints for the view planning based on the new representations, and 2) learning the criterion from data that can generalize to different scenes instead of hand-crafting one.

3D Reconstruction • 3D Scene Reconstruction

Few-shot learning with improved local representations via bias rectify module

no code implementations • 1 Nov 2021 • Chao Dong, Qi Ye, Wenchao Meng, Kaixiang Yang

Recent approaches based on metric learning have achieved great progress in few-shot learning.

Few-Shot Learning • Metric Learning

Analysis of Regularized Learning for Linear-functional Data in Banach Spaces

no code implementations • 7 Sep 2021 • Qi Ye

In the convergence theorems, we show the convergence of the approximate solutions to the exact solutions by the weak* topology of the Banach space.

Sample Complexity of Learning Parametric Quantum Circuits

no code implementations • 19 Jul 2021 • Haoyuan Cai, Qi Ye, Dong-Ling Deng

Quantum computers hold unprecedented potential for machine learning applications.

BIG-bench Machine Learning • Quantum Machine Learning

Geometry-based Distance Decomposition for Monocular 3D Object Detection

1 code implementation • ICCV 2021 • Xuepeng Shi, Qi Ye, Xiaozhi Chen, Chuangrong Chen, Zhixiang Chen, Tae-Kyun Kim

The experimental results show that our method achieves state-of-the-art performance on the monocular 3D Object Detection and Bird's Eye View tasks of the KITTI dataset, and can generalize to images with different camera intrinsics.

Ranked #15 on Monocular 3D Object Detection on KITTI Cars Moderate

Autonomous Driving • Monocular 3D Object Detection • +2

Optimal Retirement Time and Consumption with the Variation in Habitual Persistence

no code implementations • 31 Mar 2021 • Lin He, Zongxia Liang, Yilun Song, Qi Ye

In this paper, we study the individual's optimal retirement time and optimal consumption under habitual persistence.

The Phong Surface: Efficient 3D Model Fitting using Lifted Optimization

no code implementations • ECCV 2020 • Jingjing Shen, Thomas J. Cashman, Qi Ye, Tim Hutton, Toby Sharp, Federica Bogo, Andrew William Fitzgibbon, Jamie Shotton

Realtime perceptual and interaction capabilities in mixed reality require a range of 3D tracking problems to be solved at low latency on resource-constrained hardware such as head-mounted devices.

Mixed Reality

Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density Network

no code implementations • ECCV 2018 • Qi Ye, Tae-Kyun Kim

The proposed method leverages the state-of-the-art hand pose estimators based on Convolutional Neural Networks to facilitate feature learning, while it models the multiple modes in a two-level hierarchy to reconcile single-valued and multi-valued mapping in its output.

Hand Pose Estimation

The 2017 Hands in the Million Challenge on 3D Hand Pose Estimation

no code implementations • 7 Jul 2017 • Shanxin Yuan, Qi Ye, Guillermo Garcia-Hernando, Tae-Kyun Kim

We present the 2017 Hands in the Million Challenge, a public competition designed for the evaluation of the task of 3D hand pose estimation.

3D Hand Pose Estimation

BigHand2.2M Benchmark: Hand Pose Dataset and State of the Art Analysis

no code implementations • CVPR 2017 • Shanxin Yuan, Qi Ye, Bjorn Stenger, Siddhant Jain, Tae-Kyun Kim

We also show significant improvements in egocentric hand pose estimation with a CNN trained on the new dataset.

Art Analysis • Hand Pose Estimation

Spatial Attention Deep Net with Partial PSO for Hierarchical Hybrid Hand Pose Estimation

1 code implementation • 12 Apr 2016 • Qi Ye, Shanxin Yuan, Tae-Kyun Kim

In this paper, a hybrid hand pose estimation method is proposed by applying the kinematic hierarchy strategy to the input space (as well as the output space) of the discriminative method by a spatial attention mechanism and to the optimization of the generative method by hierarchical Particle Swarm Optimization (PSO).

Hand Pose Estimation

Solving Support Vector Machines in Reproducing Kernel Banach Spaces with Positive Definite Functions

no code implementations • 6 Sep 2012 • Gregory E. Fasshauer, Fred J. Hickernell, Qi Ye

In this paper we solve support vector machines in reproducing kernel Banach spaces with reproducing kernels defined on nonsymmetric domains instead of the traditional methods in reproducing kernel Hilbert spaces.
