Search Results for author: Yueqi Duan

Found 37 papers, 20 papers with code

Vector Neurons: A General Framework for SO(3)-Equivariant Networks

4 code implementations ICCV 2021 Congyue Deng, Or Litany, Yueqi Duan, Adrien Poulenard, Andrea Tagliasacchi, Leonidas Guibas

Invariance and equivariance to the rotation group have been widely discussed in the 3D deep learning community for pointclouds.

OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving

1 code implementation27 Nov 2023 Wenzhao Zheng, Weiliang Chen, Yuanhui Huang, Borui Zhang, Yueqi Duan, Jiwen Lu

In this paper, we explore a new framework of learning a world model, OccWorld, in the 3D Occupancy space to simultaneously predict the movement of the ego car and the evolution of the surrounding scenes.

Autonomous Driving

OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments

1 code implementation14 Dec 2023 Chubin Zhang, Juncheng Yan, Yi Wei, Jiaxin Li, Li Liu, Yansong Tang, Yueqi Duan, Jiwen Lu

As a fundamental task of vision-based perception, 3D occupancy prediction reconstructs 3D structures of surrounding environments.

Autonomous Driving Depth Estimation +1

Diffusion-SDF: Text-to-Shape via Voxelized Diffusion

1 code implementation CVPR 2023 Muheng Li, Yueqi Duan, Jie zhou, Jiwen Lu

With the rising industrial attention to 3D virtual modeling technology, generating novel 3D content based on specified conditions (e. g. text) has become a hot issue.

Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior

1 code implementation11 Dec 2023 Fangfu Liu, Diankun Wu, Yi Wei, Yongming Rao, Yueqi Duan

Instead of retraining a costly viewpoint-aware model, we study how to fully exploit easily accessible coarse 3D knowledge to enhance the prompts and guide 2D lifting optimization for refinement.

3D Generation Text to 3D

CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds

1 code implementation ICCV 2021 Yijia Weng, He Wang, Qiang Zhou, Yuzhe Qin, Yueqi Duan, Qingnan Fan, Baoquan Chen, Hao Su, Leonidas J. Guibas

For the first time, we propose a unified framework that can handle 9DoF pose tracking for novel rigid object instances as well as per-part pose tracking for articulated objects from known categories.

Pose Tracking

Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos

1 code implementation CVPR 2022 Muheng Li, Lei Chen, Yueqi Duan, Zhilan Hu, Jianjiang Feng, Jie zhou, Jiwen Lu

The generated text prompts are paired with corresponding video clips, and together co-train the text encoder and the video encoder via a contrastive approach.

Ranked #4 on Action Segmentation on GTEA (using extra training data)

Action Segmentation Action Understanding +1

Curriculum DeepSDF

1 code implementation ECCV 2020 Yueqi Duan, Haidong Zhu, He Wang, Li Yi, Ram Nevatia, Leonidas J. Guibas

When learning to sketch, beginners start with simple and flexible shapes, and then gradually strive for more complex and accurate ones in the subsequent training sessions.

3D Shape Representation Representation Learning

IF-Defense: 3D Adversarial Point Cloud Defense via Implicit Function based Restoration

2 code implementations11 Oct 2020 Ziyi Wu, Yueqi Duan, He Wang, Qingnan Fan, Leonidas J. Guibas

The former aims to recover the surface of point cloud through implicit function, while the latter encourages evenly-distributed points.

MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos

1 code implementation ICCV 2023 Fengrui Tian, Shaoyi Du, Yueqi Duan

More specifically, we learn an implicit velocity field to estimate point trajectory from temporal features with Neural ODE, which is followed by a flow-based feature aggregation module to obtain spatial features along the point trajectory.

SegGroup: Seg-Level Supervision for 3D Instance and Semantic Segmentation

1 code implementation18 Dec 2020 An Tao, Yueqi Duan, Yi Wei, Jiwen Lu, Jie zhou

Most existing point cloud instance and semantic segmentation methods rely heavily on strong supervision signals, which require point-level labels for every point in the scene.

3D Instance Segmentation 3D Semantic Segmentation +1

Learning Transferable Human-Object Interaction Detector With Natural Language Supervision

1 code implementation CVPR 2022 Suchen Wang, Yueqi Duan, Henghui Ding, Yap-Peng Tan, Kim-Hui Yap, Junsong Yuan

More specifically, we propose a new HOI visual encoder to detect the interacting humans and objects, and map them to a joint feature space to perform interaction recognition.

Human-Object Interaction Detection

Graph-Based Social Relation Reasoning

1 code implementation ECCV 2020 Wanhua Li, Yueqi Duan, Jiwen Lu, Jianjiang Feng, Jie zhou

Human beings are fundamentally sociable -- that we generally organize our social lives in terms of relations with other people.

Relation Relational Reasoning +1

Discovering Dynamic Causal Space for DAG Structure Learning

1 code implementation5 Jun 2023 Fangfu Liu, Wenchang Ma, An Zhang, Xiang Wang, Yueqi Duan, Tat-Seng Chua

Discovering causal structure from purely observational data (i. e., causal discovery), aiming to identify causal relationships among variables, is a fundamental task in machine learning.

Causal Discovery Combinatorial Optimization

Semantic Flow: Learning Semantic Field of Dynamic Scenes from Monocular Videos

1 code implementation8 Apr 2024 Fengrui Tian, Yueqi Duan, Angtian Wang, Jianfei Guo, Shaoyi Du

As there is 2D-to-3D ambiguity problem in the viewing direction when extracting 3D flow features from 2D video frames, we consider the volume densities as opacity priors that describe the contributions of flow features to the semantics on the frames.

GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds

1 code implementation28 Mar 2024 Shengjun Zhang, Xin Fei, Yueqi Duan

In this paper, we propose geometry-to-voxel auxiliary learning to enable voxel representations to access point-level geometric information, which supports better generalisation of the voxel-based backbone with additional interpretations of multi-sensor point clouds.

Auxiliary Learning Representation Learning

Dynamics-aware Adversarial Attack of 3D Sparse Convolution Network

1 code implementation17 Dec 2021 An Tao, Yueqi Duan, He Wang, Ziyi Wu, Pengliang Ji, Haowen Sun, Jie zhou, Jiwen Lu

It results in a serious issue of lagged gradient, making the learned attack at the current step ineffective due to the architecture changes afterward.

3D Classification 3D Semantic Segmentation +2

Dynamics-aware Adversarial Attack of Adaptive Neural Networks

1 code implementation15 Oct 2022 An Tao, Yueqi Duan, Yingqi Wang, Jiwen Lu, Jie zhou

To address this issue, we propose a Leaded Gradient Method (LGM) and show the significant effects of the lagged gradient.

Adversarial Attack Computational Efficiency

Image Set Querying Based Localization

no code implementations20 Sep 2015 Lei Deng, Siyuan Huang, Yueqi Duan, Baohua Chen, Jie zhou

Conventional single image based localization methods usually fail to localize a querying image when there exist large variations between the querying image and the pre-built scene.

Image-Based Localization

Deep Adversarial Metric Learning

no code implementations CVPR 2018 Yueqi Duan, Wenzhao Zheng, Xudong Lin, Jiwen Lu, Jie zhou

Learning an effective distance metric between image pairs plays an important role in visual analysis, where the training procedure largely relies on hard negative samples.

Metric Learning

GraphBit: Bitwise Interaction Mining via Deep Reinforcement Learning

no code implementations CVPR 2018 Yueqi Duan, Ziwei Wang, Jiwen Lu, Xudong Lin, Jie zhou

Specifically, we design a deep reinforcement learning model to learn the structure of the graph for bitwise interaction mining, reducing the uncertainty of binary codes by maximizing the mutual information with inputs and related bits, so that the ambiguous bits receive additional instruction from the graph for confident binarization.

Binarization reinforcement-learning +2

Deep Variational Metric Learning

no code implementations ECCV 2018 Xudong Lin, Yueqi Duan, Qiyuan Dong, Jiwen Lu, Jie zhou

Deep metric learning has been extensively explored recently, which trains a deep neural network to produce discriminative embedding features.

Metric Learning

Learning Deep Binary Descriptor With Multi-Quantization

no code implementations CVPR 2017 Yueqi Duan, Jiwen Lu, Ziwei Wang, Jianjiang Feng, Jie zhou

In this paper, we propose an unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual matching.

Binarization Image Retrieval +2

Structural Relational Reasoning of Point Clouds

no code implementations CVPR 2019 Yueqi Duan, Yu Zheng, Jiwen Lu, Jie Zhou, Qi Tian

The symmetry for the corners of a box, the continuity for the surfaces of a monitor, the linkage between the torso and other body parts --- it suggests that 3D objects may have common and underlying inner relations between local structures, and it is a fundamental ability for intelligent species to reason for them.

3D Part Segmentation 3D Point Cloud Classification +3

UniformFace: Learning Deep Equidistributed Representation for Face Recognition

no code implementations CVPR 2019 Yueqi Duan, Jiwen Lu, Jie Zhou

In this paper, we propose a new supervision objective named uniform loss to learn deep equidistributed representations for face recognition.

Face Recognition

Deep Embedding Learning With Discriminative Sampling Policy

no code implementations CVPR 2019 Yueqi Duan, Lei Chen, Jiwen Lu, Jie Zhou

Deep embedding learning aims to learn a distance metric for effective similarity measurement, which has achieved promising performance in various tasks.

Object Pursuit: Building a Space of Objects via Discriminative Weight Generation

no code implementations ICLR 2022 Chuanyu Pan, Yanchao Yang, Kaichun Mo, Yueqi Duan, Leonidas Guibas

We perform an extensive study of the key features of the proposed framework and analyze the characteristics of the learned representations.

Disentanglement Object

HyperDet3D: Learning a Scene-conditioned 3D Object Detector

no code implementations CVPR 2022 Yu Zheng, Yueqi Duan, Jiwen Lu, Jie zhou, Qi Tian

A bathtub in a library, a sink in an office, a bed in a laundry room -- the counter-intuition suggests that scene provides important prior knowledge for 3D object detection, which instructs to eliminate the ambiguous detection of similar objects.

3D Object Detection Object +1

6D Camera Relocalization in Visually Ambiguous Extreme Environments

no code implementations13 Jul 2022 Yang Zheng, Tolga Birdal, Fei Xia, Yanchao Yang, Yueqi Duan, Leonidas J. Guibas

To this end, we propose: (i) a hierarchical localization system, where we leverage temporal information and (ii) a novel environment-aware image enhancement method to boost the robustness and accuracy.

Camera Relocalization Image Enhancement

SEFormer: Structure Embedding Transformer for 3D Object Detection

no code implementations5 Sep 2022 Xiaoyu Feng, Heming Du, Yueqi Duan, Yongpan Liu, Hehe Fan

Effectively preserving and encoding structure features from objects in irregular and sparse LiDAR points is a key challenge to 3D object detection on point cloud.

3D Object Detection Autonomous Driving +2

Category-Level Multi-Part Multi-Joint 3D Shape Assembly

no code implementations10 Mar 2023 Yichen Li, Kaichun Mo, Yueqi Duan, He Wang, Jiequan Zhang, Lin Shao, Wojciech Matusik, Leonidas Guibas

A successful joint-optimized assembly needs to satisfy the bilateral objectives of shape structure and joint alignment.

Graph Learning Graph Representation Learning

Memory-based Adapters for Online 3D Scene Perception

no code implementations11 Mar 2024 Xiuwei Xu, Chong Xia, Ziwei Wang, Linqing Zhao, Yueqi Duan, Jie zhou, Jiwen Lu

To this end, we propose an adapter-based plug-and-play module for the backbone of 3D scene perception model, which constructs memory to cache and aggregate the extracted RGB-D features to empower offline models with temporal learning ability.

Make-Your-3D: Fast and Consistent Subject-Driven 3D Content Generation

no code implementations14 Mar 2024 Fangfu Liu, HanYang Wang, Weiliang Chen, Haowen Sun, Yueqi Duan

Recent years have witnessed the strong power of 3D generation models, which offer a new level of creative flexibility by allowing users to guide the 3D content generation process through a single image or natural language.

3D Generation

DreamReward: Text-to-3D Generation with Human Preference

no code implementations21 Mar 2024 Junliang Ye, Fangfu Liu, Qixiu Li, Zhengyi Wang, Yikai Wang, Xinzhou Wang, Yueqi Duan, Jun Zhu

Building upon the 3D reward model, we finally perform theoretical analysis and present the Reward3D Feedback Learning (DreamFL), a direct tuning algorithm to optimize the multi-view diffusion models with a redefined scorer.

3D Generation Text to 3D +1

Learning a Category-level Object Pose Estimator without Pose Annotations

no code implementations8 Apr 2024 Fengrui Tian, Yaoyao Liu, Adam Kortylewski, Yueqi Duan, Shaoyi Du, Alan Yuille, Angtian Wang

Instead of using manually annotated images, we leverage diffusion models (e. g., Zero-1-to-3) to generate a set of images under controlled pose differences and propose to learn our object pose estimator with those images.

Object Pose Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.