no code implementations • 5 Jan 2025 • Minglin Chen, Longguang Wang, Sheng Ao, Ye Zhang, Kai Xu, Yulan Guo
To fully leverage 2D diffusion priors in geometry and appearance generation, we introduce a semantic-guided geometry diffusion model and a semantic-geometry guided diffusion model which are finetuned on a scene dataset.
no code implementations • 27 Dec 2024 • Kai Xu, Tze Ho Elden Tse, Jizong Peng, Angela Yao
Our proposed method is termed DAS3R, an abbreviation for Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction.
no code implementations • 17 Dec 2024 • Aldo Pareja, Nikhil Shivakumar Nayak, Hao Wang, KrishnaTeja Killamsetty, Shivchander Sudalairaj, Wenlong Zhao, Seungwook Han, Abhishek Bhandwaldar, Guangxuan Xu, Kai Xu, Ligong Han, Luke Inglis, Akash Srivastava
The rise of large language models (LLMs) has created a significant disparity: industrial research labs with their computational resources, expert teams, and advanced infrastructures, can effectively fine-tune LLMs, while individual developers and small organizations face barriers due to limited resources.
no code implementations • 11 Dec 2024 • Yihan Cao, Jiazhao Zhang, Zhinan Yu, Shuzhen Liu, Zheng Qin, Qin Zou, Bo Du, Kai Xu
Inspired by neuroscientific evidence that humans consistently update their cognitive states while searching for objects in unseen environments, we present CogNav, which attempts to model this cognitive process with the help of large language models.
1 code implementation • 11 Dec 2024 • Yihan Cao, Jiazhao Zhang, Zhinan Yu, Kai Xu
Consequently, most existing methods tackle this challenge by leveraging non-gradient-based optimization methods. In this work, we present a hybrid camera placement optimization approach that incorporates both gradient-based and non-gradient-based optimization methods.
2 code implementations • 10 Dec 2024 • Jun-Peng Zhu, Boyan Niu, Peng Cai, Zheming Ni, Jianwei Wan, Kai Xu, Jiajun Huang, Shengbo Ma, Bing Wang, Xuan Zhou, Guanglei Bao, Donghui Zhang, Liu Tang, Qi Liu
Exploratory data analysis (EDA), coupled with SQL, is essential for data analysts involved in data exploration and analysis.
no code implementations • 26 Nov 2024 • Yukun Wang, Longguang Wang, Zhiyuan Ma, Qibin Hu, Kai Xu, Yulan Guo
Despite the typical inversion-then-editing paradigm using text-to-image (T2I) models has demonstrated promising results, directly extending it to text-to-video (T2V) models still suffers severe artifacts such as color flickering and content distortion.
no code implementations • 15 Nov 2024 • Yanhao Sun, Runze Tian, Xiao Han, Xinyao Liu, Yan Zhang, Kai Xu
With the emergence of large-scale Text-to-Image(T2I) models and implicit 3D representations like Neural Radiance Fields (NeRF), many text-driven generative editing methods based on NeRF have appeared.
no code implementations • 5 Nov 2024 • Xin Wen, Xuening Zhu, Renjiao Yi, Zhifeng Wang, Chenyang Zhu, Kai Xu
Given sparse-view input images, we run a model and pose retrieval from the library, to get a model with similar shapes, serving as the density supervision and pose initializations.
no code implementations • 4 Nov 2024 • Guangxuan Xu, Kai Xu, Shivchander Sudalairaj, Hao Wang, Akash Srivastava
In our experiment using density ratio from a pair of Mistral-7B models, CDR achieves a RewardBench score of 82. 6, outperforming the best trained reward functions from same model class and demonstrating competitive performance against SoTA models in Safety (91. 0) and Reasoning (88. 0) domains.
no code implementations • 25 Oct 2024 • Kristjan Greenewald, Yuancheng Yu, Hao Wang, Kai Xu
Training generative models with differential privacy (DP) typically involves injecting noise into gradient updates or adapting the discriminator's training procedure.
1 code implementation • 7 Oct 2024 • Chuanhao Sun, Thanos Triantafyllou, Anthos Makris, Maja Drmač, Kai Xu, Luo Mai, Mahesh K. Marina
View synthesis using Neural Radiance Fields (NeRF) and Gaussian Splatting (GS) has demonstrated impressive fidelity in rendering real-world scenarios.
no code implementations • 29 Aug 2024 • Zhirui Gao, Renjiao Yi, Chenyang Zhu, Ke Zhuang, Wei Chen, Kai Xu
COLMAP is frequently employed for preprocessing to estimate poses, while it necessitates a large number of feature matches to operate effectively, and it struggles with scenes characterized by sparse features, large baselines between images, or a limited number of input images.
no code implementations • 27 Aug 2024 • Maxwell Schrader, Navish Kumar, Esben Sørig, Soonmyeong Yoon, Akash Srivastava, Kai Xu, Maria Astefanoaei, Nicolas Collignon
We introduce two datasets that allow for in-depth analysis and modelling of service times of cargo bikes and use existing datasets to reason about differences in delivery performance across vehicle types.
no code implementations • 26 Aug 2024 • Jiasong Feng, LiChun Wang, Hongbo Xu, Kai Xu, BaoCai Yin
Scene Graph Generation (SGG) aims to generate a comprehensive graphical representation that accurately captures the semantic information of a given scenario.
no code implementations • 22 Aug 2024 • Wenhao Li, Zhiyuan Yu, Qijin She, Zhinan Yu, Yuqing Lan, Chenyang Zhu, Ruizhen Hu, Kai Xu
The household rearrangement task involves spotting misplaced objects in a scene and accommodate them with proper places.
no code implementations • 20 Aug 2024 • Zhirui Gao, Renjiao Yi, Yuhang Huang, Wei Chen, Chenyang Zhu, Kai Xu
In this paper, we introduce $\textbf{PartGS}$, $\textbf{part}$-aware 3D reconstruction by a hybrid representation of 2D $\textbf{G}$aussians and $\textbf{S}$uperquadrics, which parses objects or scenes into semantic parts, digging 3D structural clues from multi-view image inputs.
1 code implementation • 12 Jul 2024 • Chuanhao Sun, Zhihang Yuan, Kai Xu, Luo Mai, N. Siddharth, Shuo Chen, Mahesh K. Marina
Fourier features based positional encoding (PE) is commonly used in machine learning tasks that involve learning high-frequency features from low-dimensional inputs, such as 3D view synthesis and time series regression with neural tangent kernels.
no code implementations • 29 Jun 2024 • Xiaogang Wang, Liang Wang, Hongyu Wu, GuoQiang Xiao, Kai Xu
This framework consists of a primitive network and a constraint network, transforming the sketch analysis task into a set prediction problem to enhance the effective handling of primitives and constraints.
no code implementations • 28 Jun 2024 • Kexiong Yu, Hang Zhao, Yuhang Huang, Renjiao Yi, Kai Xu, Chenyang Zhu
Combinatorial Optimization (CO) problems are fundamentally important in numerous real-world applications across diverse industries, characterized by entailing enormous solution space and demanding time-sensitive response.
no code implementations • 21 Jun 2024 • Xuening Zhu, Renjiao Yi, Xin Wen, Chenyang Zhu, Kai Xu
The insertion of objects into a scene and relighting are commonly utilized applications in augmented reality (AR).
no code implementations • 18 Jun 2024 • Z. T. Wang, Qiuhao Chen, Yuxuan Du, Z. H. Yang, Xiaoxia Cai, Kaixuan Huang, Jingning Zhang, Kai Xu, Jun Du, Yinan Li, Yuling Jiao, Xingyao Wu, Wu Liu, Xiliang Lu, Huikai Xu, Yirong Jin, Ruixia Wang, Haifeng Yu, S. P. Zhao
To effectively implement quantum algorithms on noisy intermediate-scale quantum (NISQ) processors is a central task in modern quantum technology.
no code implementations • 16 Jun 2024 • Kai Xu, Farid Tajaddodianfar, Ben Allison
Recently proposed reward-conditioned policies (RCPs) offer an appealing alternative in reinforcement learning.
1 code implementation • 31 May 2024 • Amin Heyrani Nobari, Akash Srivastava, Dan Gutfreund, Kai Xu, Faez Ahmed
Moreover, we introduce a significantly more challenging benchmark, named LINK ABC, which involves synthesizing linkages that trace the trajectories of English capital alphabets, an inverse design benchmark task that existing methods struggle with due to large nonlinearities and tiny feasible space.
no code implementations • 2 May 2024 • Yuhang Huang, SHilong Zou, Xinwang Liu, Kai Xu
This paper presents a novel latent 3D diffusion model for the generation of neural voxel fields, aiming to achieve accurate part-aware structures.
no code implementations • 1 May 2024 • Zhinan Yu, Zheng Qin, Yijie Tang, Yongjun Wang, Renjiao Yi, Chenyang Zhu, Kai Xu
In this paper, we present NeRF-UR, a novel frame-to-model optimization framework for unsupervised RGB-D registration.
1 code implementation • CVPR 2024 • Zhiyuan Yu, Zheng Qin, Lintao Zheng, Kai Xu
The superpoint correspondences are then extended to instance candidates at the fine level according to the instance masks.
1 code implementation • 22 Mar 2024 • Sisi Dai, Wenhao Li, Haowen Sun, Haibin Huang, Chongyang Ma, Hui Huang, Kai Xu, Ruizhen Hu
In this study, we tackle the complex task of generating 3D human-object interactions (HOI) from textual descriptions in a zero-shot text-to-3D manner.
no code implementations • 21 Mar 2024 • Hui Tian, Kai Xu
On the other hand, UDF-based methods can effectively represent open surfaces but often introduce noise, leading to artefacts in the mesh.
no code implementations • 2 Mar 2024 • Shivchander Sudalairaj, Abhishek Bhandwaldar, Aldo Pareja, Kai Xu, David D. Cox, Akash Srivastava
This work introduces LAB (Large-scale Alignment for chatBots), a novel methodology designed to overcome the scalability challenges in the instruction-tuning phase of large language model (LLM) training.
no code implementations • 21 Feb 2024 • Shishun Zhang, Qijin She, Wenhao Li, Chenyang Zhu, Yongjun Wang, Ruizhen Hu, Kai Xu
To achieve the goal, the core idea is to develop an effective object-to-arm task assignment strategy for minimizing the cumulative task execution time and maximizing the dual-arm cooperation efficiency.
no code implementations • 4 Feb 2024 • Zhengqiu Zhu, Yong Zhao, Bin Chen, Sihang Qiu, Kai Xu, Quanjun Yin, Jincai Huang, Zhong Liu, Fei-Yue Wang
The transition from CPS-based Industry 4. 0 to CPSS-based Industry 5. 0 brings new requirements and opportunities to current sensing approaches, especially in light of recent progress in Chatbots and Large Language Models (LLMs).
no code implementations • 3 Feb 2024 • Cunxiao Du, Jing Jiang, Xu Yuanchen, Jiawei Wu, Sicheng Yu, Yongqi Li, Shenggui Li, Kai Xu, Liqiang Nie, Zhaopeng Tu, Yang You
Speculative decoding is a relatively new decoding framework that leverages small and efficient draft models to reduce the latency of LLMs.
no code implementations • 25 Jan 2024 • Mengyao Du, Miao Zhang, Yuwen Pu, Kai Xu, Shouling Ji, Quanjun Yin
To tackle the scarcity and privacy issues associated with domain-specific datasets, the integration of federated learning in conjunction with fine-tuning has emerged as a practical solution.
1 code implementation • 4 Jan 2024 • Yunfan Ye, Kai Xu, Yuhang Huang, Renjiao Yi, Zhiping Cai
With the recent success of the diffusion probabilistic model (DPM), we found it is especially suitable for accurate and crisp edge detection since the denoising process is directly applied to the original image size.
no code implementations • 1 Dec 2023 • Bum Chul Kwon, Samuel Friedman, Kai Xu, Steven A Lubitz, Anthony Philippakis, Puneet Batra, Patrick T Ellinor, Kenney Ng
Machine learning models built on training data with multiple modalities can reveal new insights that are not accessible through unimodal datasets.
no code implementations • 29 Nov 2023 • Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin
Then we introduce soft visual prompts in the input space of the visual encoder in a pretrained model.
no code implementations • 23 Oct 2023 • Hui Tian, Kai Xu
This method provides more flexible to represent sharp feature and surface boundary on open surface.
no code implementations • 9 Oct 2023 • Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Anqi Huang, Kai Xu, Zongzhang Zhang, Yang Yu
In this work, we focus on imitator learning based on only one expert demonstration.
1 code implementation • 30 Sep 2023 • Kai Xu, Rongyu Chen, Gianni Franchi, Angela Yao
The capacity of a modern deep learning system to determine if a sample falls within its realm of knowledge is fundamental and important.
Ranked #1 on Out-of-Distribution Detection on Far-OOD
Out-of-Distribution Detection Out of Distribution (OOD) Detection
1 code implementation • ICCV 2023 • Yaojie Shen, Xin Gu, Kai Xu, Heng Fan, Longyin Wen, Libo Zhang
Addressing this, we study video captioning from a different perspective in compressed domain, which brings multi-fold advantages over the existing pipeline: 1) Compared to raw images from the decoded video, the compressed video, consisting of I-frames, motion vectors and residuals, is highly distinguishable, which allows us to leverage the entire video for learning without manual sampling through a specialized model design; 2) The captioning model is more efficient in inference as smaller and less redundant information is processed.
Ranked #8 on Video Captioning on VATEX
no code implementations • 7 Sep 2023 • Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin
In the indoor-aware stage, we apply an efficient tuning paradigm to learn deep visual prompts from an indoor dataset, in order to augment pretrained models with inductive biases towards indoor environments.
1 code implementation • 30 Aug 2023 • Kai Xu, Shiyu Lu, Bin Huang, Weiwen Wu, Qiegen Liu
Diffusion models have emerged as potential tools to tackle the challenge of sparse-view CT reconstruction, displaying superior performance compared to conventional methods.
1 code implementation • 28 Aug 2023 • Hui Tian, Chenyang Zhu, Yifei Shi, Kai Xu
The key insight is that if the UDF is estimated correctly, the 3D points should be locally projected onto the underlying surface following the gradient of the UDF.
no code implementations • 17 Aug 2023 • Yijie Tang, Jiazhao Zhang, Zhinan Yu, He Wang, Kai Xu
For the first time, randomized optimization is made possible in neural tracking with several key designs to the learning process, enabling efficient and robust tracking even under fast camera motions.
1 code implementation • ICCV 2023 • Minhao Li, Zheng Qin, Zhirui Gao, Renjiao Yi, Chenyang Zhu, Yulan Guo, Kai Xu
The commonly adopted detect-then-match approach to registration finds difficulties in the cross-modality cases due to the incompatible keypoint detection and inconsistent feature description.
1 code implementation • ICCV 2023 • Jingjia Shi, Shuaifeng Zhi, Kai Xu
With an initial camera pose and per-frame plane predictions provided from the first stage, exclusively designed modules, potentially relying on extra plane correspondence labelling, are applied to merge multi-view plane entities and produce 6DoF camera pose.
1 code implementation • 25 Jul 2023 • Zheng Qin, Hao Yu, Changjian Wang, Yulan Guo, Yuxing Peng, Slobodan Ilic, Dewen Hu, Kai Xu
They seek correspondences over downsampled superpoints, which are then propagated to dense points.
no code implementations • 16 Jul 2023 • Yifei Shi, Junhua Xi, Dewen Hu, Zhiping Cai, Kai Xu
In particular, we propose RayMVSNet which learns sequential prediction of a 1D implicit field along each camera ray with the zero-crossing point indicating scene depth.
1 code implementation • 28 Jun 2023 • Hui Tian, Zheng Qin, Renjiao Yi, Chenyang Zhu, Kai Xu
Surface reconstruction from raw point clouds has been studied for decades in the computer graphics community, which is highly demanded by modeling and rendering applications nowadays.
1 code implementation • 27 Jun 2023 • Yunfan Ye, Renjiao Yi, Zhirui Gao, Zhiping Cai, Kai Xu
Learning-based edge detection usually suffers from predicting thick edges.
2 code implementations • 23 Jun 2023 • Yuhang Huang, Zheng Qin, Xinwang Liu, Kai Xu
We propose incorporating an analytical image attenuation process into the forward diffusion process for high-quality (un)conditioned image generation with significantly fewer denoising steps compared to the vanilla diffusion model requiring thousands of steps.
no code implementations • 18 May 2023 • Georgi Ganev, Kai Xu, Emiliano De Cristofaro
We believe our work will aid the deployment of DP synthetic data techniques by navigating through the best candidate models vis-a-vis the dataset features, desired privacy levels, and downstream tasks.
no code implementations • 1 May 2023 • Akash Srivastava, Seungwook Han, Kai Xu, Benjamin Rhodes, Michael U. Gutmann
We show that if these auxiliary densities are constructed such that they overlap with $p$ and $q$, then a multi-class logistic regression allows for estimating $\log p/q$ on the domain of any of the $K+2$ distributions and resolves the distribution shift problems of the current state-of-the-art methods.
1 code implementation • CVPR 2024 • Kai Xu, Ziwei Yu, Xin Wang, Michael Bi Mi, Angela Yao
We show that bilinear interpolation inherently attenuates high-frequency information while an MLP-based coordinate network can approximate more frequencies.
Ranked #2 on Video Super-Resolution on REDS4- 4x upscaling
no code implementations • CVPR 2023 • Renjiao Yi, Chenyang Zhu, Kai Xu
For re-rendering, we propose a differentiable specular rendering layer to render low-frequency non-Lambertian materials under various illuminations of spherical harmonics.
no code implementations • ICCV 2023 • Boyan Wan, Yifei Shi, Kai Xu
We propose Semantically-aware Object Coordinate Space (SOCS) built by warping-and-aligning the objects guided by a sparse set of keypoints with semantically meaningful correspondence.
1 code implementation • CVPR 2023 • Zheng Qin, Hao Yu, Changjian Wang, Yuxing Peng, Kai Xu
We first design a local spatial consistency measure over the deformation graph of the point cloud, which evaluates the spatial compatibility only between the correspondences in the vicinity of a graph node.
1 code implementation • 15 Mar 2023 • Zhirui Gao, Renjiao Yi, Zheng Qin, Yunfan Ye, Chenyang Zhu, Kai Xu
To tackle the challenges, we propose an accurate template matching method based on differentiable coarse-to-fine correspondence refinement.
1 code implementation • CVPR 2023 • Yunfan Ye, Renjiao Yi, Zhirui Gao, Chenyang Zhu, Zhiping Cai, Kai Xu
We study the problem of reconstructing 3D feature curves of an object from a set of calibrated multi-view images.
1 code implementation • 4 Mar 2023 • Charlotte Loh, Seungwook Han, Shivchander Sudalairaj, Rumen Dangovski, Kai Xu, Florian Wenzel, Marin Soljacic, Akash Srivastava
In this work, we present Multi-Symmetry Ensembles (MSE), a framework for constructing diverse ensembles by capturing the multiplicity of hypotheses along symmetry axes, which explore the hypothesis space beyond stochastic perturbations of model weights and hyperparameters.
no code implementations • 7 Feb 2023 • Junwen Huang, Alexey Artemov, Yujin Chen, Shuaifeng Zhi, Kai Xu, Matthias Nießner
Most deep learning approaches to comprehensive semantic modeling of 3D indoor spaces require costly dense annotations in the 3D domain.
no code implementations • 30 Jan 2023 • Max Schrader, Navish Kumar, Nicolas Collignon, Esben Sørig, Soonmyeong Yoon, Akash Srivastava, Kai Xu, Maria Astefanoaei
Light goods vehicles (LGV) used extensively in the last mile of delivery are one of the leading polluters in cities.
1 code implementation • CVPR 2024 • Xiaojie Jin, BoWen Zhang, Weibo Gong, Kai Xu, Xueqing Deng, Peng Wang, Zhao Zhang, Xiaohui Shen, Jiashi Feng
The first is a Temporal Adaptation Module that is incorporated in the video branch to introduce global and local temporal contexts.
no code implementations • 12 Jan 2023 • Xiaogang Wang, Yuhang Cheng, Liang Wang, Jiangbo Lu, Kai Xu, GuoQiang Xiao
Among them, the differential Laplican regularizer can effectively alleviate the implicit surface unsmoothness caused by the point cloud quality deteriorates; Meanwhile, in order to reduce the excessive smoothing at the edge regions of implicit suface, we proposed a dynamic edge extract strategy for sampling near the sharp edge of point cloud, which can effectively avoid the Laplacian regularizer from smoothing all regions.
1 code implementation • CVPR 2023 • Sheng Ao, Qingyong Hu, Hanyun Wang, Kai Xu, Yulan Guo
Extensive experiments on real-world scenarios demonstrate that our method achieves the best of both worlds in accuracy, efficiency, and generalization.
1 code implementation • 5 Dec 2022 • Hang Zhao, Zherong Pan, Yang Yu, Kai Xu
We study the problem of learning online packing skills for irregular 3D shapes, which is arguably the most challenging setting of bin packing problems.
1 code implementation • 3 Dec 2022 • Yaqiao Dai, Renjiao Yi, Chenyang Zhu, Hongjun He, Kai Xu
Therefore, we propose a novel depth map fusion module to combine the advantages of estimations with multi-resolution inputs.
no code implementations • CVPR 2023 • Jiazhao Zhang, Liu Dai, Fanpeng Meng, Qingnan Fan, Xuelin Chen, Kai Xu, He Wang
However, leveraging 3D scene representation can be prohibitively unpractical for policy learning in this floor-level task, due to low sample efficiency and expensive computational cost.
no code implementations • 20 Oct 2022 • Zeyu Huang, Juzhan Xu, Sisi Dai, Kai Xu, Hao Zhang, Hui Huang, Ruizhen Hu
Given a few object manipulation demos, NIFT guides the generation of the interaction imitation for a new object instance by matching the Neural Interaction Template (NIT) extracted from the demos in the target Neural Interaction Field (NIF) defined for the new object.
no code implementations • 17 Sep 2022 • Chenyi Liu, Fei Chen, Lu Deng, Renjiao Yi, Lintao Zheng, Chenyang Zhu, Jia Wang, Kai Xu
We introduce a well-targeted down-sampling strategy that focuses more on edge area for efficient feature extraction of complex geometry.
no code implementations • 3 Aug 2022 • Tiankai Gu, Chaokun Wang, Cheng Wu, Jingcao Xu, Yunkai Lou, Changping Wang, Kai Xu, Can Ye, Yang song
One of the most important tasks in recommender systems is to predict the potential connection between two nodes under a specific edge type (i. e., relationship).
1 code implementation • 27 Jul 2022 • Yaojie Shen, Libo Zhang, Kai Xu, Xiaojie Jin
First we learn the embedding of video transitions through a video transition classification task.
1 code implementation • 17 Jul 2022 • Simao Eduardo, Kai Xu, Alfredo Nazabal, Charles Sutton
Seeing as a systematic outlier is a combination of patterns of a clean instance and systematic error patterns, our main insight is that inliers can be modelled by a smaller representation (subspace) in a model than outliers.
1 code implementation • 12 Jul 2022 • Sofiane Mahiou, Kai Xu, Georgi Ganev
We propose a general, flexible, and scalable framework dpart, an open source Python library for differentially private synthetic data generation.
no code implementations • CVPR 2022 • Junhua Xi, Yifei Shi, Yijie Wang, Yulan Guo, Kai Xu
In particular, we propose RayMVSNet which learns sequential prediction of a 1D implicit field along each camera ray with the zero-crossing point indicating scene depth.
no code implementations • 3 Apr 2022 • Qijin She, Ruizhen Hu, Juzhan Xu, Min Liu, Kai Xu, Hui Huang
To resolve the sample efficiency issue in learning the high-dimensional and complex control of dexterous grasping, we propose an effective representation of grasping state characterizing the spatial interaction between the gripper and the target object.
no code implementations • CVPR 2022 • Yao Duan, Chenyang Zhu, Yuqing Lan, Renjiao Yi, Xinwang Liu, Kai Xu
However, adopting relations between all the object or patch proposals for detection is inefficient, and an imbalanced combination of local and global relations brings extra noise that could mislead the training.
1 code implementation • 28 Feb 2022 • Joya Chen, Kai Xu, Yuhui Wang, Yifei Cheng, Angela Yao
A standard hardware bottleneck when training deep neural networks is GPU memory.
1 code implementation • 20 Feb 2022 • Yuqing Lan, Yao Duan, Yifei Shi, Hui Huang, Kai Xu
Context has proven to be one of the most important factors in object layout reasoning for 3D scene understanding.
1 code implementation • 20 Feb 2022 • Yuqing Lan, Yao Duan, Chenyi Liu, Chenyang Zhu, Yueshan Xiong, Hui Huang, Kai Xu
In the field of 3D object detection, previous methods have been taking the advantage of context encoding, graph embedding, or explicit relation reasoning to extract relation context.
2 code implementations • CVPR 2022 • Zheng Qin, Hao Yu, Changjian Wang, Yulan Guo, Yuxing Peng, Kai Xu
Such sparse and loose matching requires contextual features capturing the geometric structure of the point clouds.
1 code implementation • CVPR 2022 • Chengjie Niu, Manyi Li, Kai Xu, Hao Zhang
Each level of the tree corresponds to an assembly of shape parts, represented as implicit functions, to reconstruct the input shape.
1 code implementation • 13 Jan 2022 • Yunfan Ye, Renjiao Yi, Zhiping Cai, Kai Xu
In particular, we impose a consistency regularization which enforces the outputs from each of the multiple layers to be consistent for the input image and its perturbed counterpart.
no code implementations • 9 Jan 2022 • Yan Liu, Qingyong Hu, Yinjie Lei, Kai Xu, Jonathan Li, Yulan Guo
In this paper, we introduce a neural architecture, termed Box2Seg, to learn point-level semantics of 3D point clouds with bounding box-level supervision.
1 code implementation • CVPR 2022 • Kunhong Li, Longguang Wang, Li Liu, Qing Ran, Kai Xu, Yulan Guo
Weakly supervised learning can help local feature methods to overcome the obstacle of acquiring a large-scale dataset with densely labeled correspondences.
Ranked #1 on Camera Localization on Aachen Day-Night benchmark
no code implementations • NeurIPS 2021 • Kai Xu, Akash Srivastava, Dan Gutfreund, Felix Sosa, Tomer Ullman, Josh Tenenbaum, Charles Sutton
In this paper, we propose a Bayesian-symbolic framework (BSP) for physical reasoning and learning that is close to human-level sample-efficiency and accuracy.
3 code implementations • NeurIPS 2021 • Cole Hurwitz, Akash Srivastava, Kai Xu, Justin Jude, Matthew G. Perich, Lee E. Miller, Matthias H. Hennig
These approaches, however, are limited in their ability to capture the underlying neural dynamics (e. g. linear) and in their ability to relate the learned dynamics back to the observed behaviour (e. g. no time lag).
1 code implementation • ICLR 2022 • Hang Zhao, Yang Yu, Kai Xu
PCT is a full-fledged description of the state and action space of bin packing which can support packing policy learning based on deep reinforcement learning (DRL).
no code implementations • 29 Sep 2021 • Akash Srivastava, Seungwook Han, Benjamin Rhodes, Kai Xu, Michael U. Gutmann
As such, estimating density ratios accurately using only samples from $p$ and $q$ is of high significance and has led to a flurry of recent work in this direction.
2 code implementations • 31 Aug 2021 • Hang Zhao, Chenyang Zhu, Xin Xu, Hui Huang, Kai Xu
In this problem, the items are delivered to the agent without informing the full sequence information.
2 code implementations • CVPR 2022 • Kai Xu, Angela Yao
We propose an efficient plug-and-play acceleration framework for semi-supervised video object segmentation by exploiting the temporal redundancies in videos presented by the compressed bitstream.
no code implementations • 9 Jun 2021 • Baoyun Peng, Min Liu, Zhaoning Zhang, Kai Xu, Dongsheng Li
Based on the proposed quality measurement, we propose a deep Tiny Face Quality network (tinyFQnet) to learn a quality prediction function from data.
no code implementations • 20 May 2021 • Guanjie Zheng, Hanyang Liu, Kai Xu, Zhenhui Li
Traffic simulators act as an essential component in the operating and planning of transportation systems.
no code implementations • 12 May 2021 • Jiazhao Zhang, Chenyang Zhu, Lintao Zheng, Kai Xu
We propose to tackle the difficulties of fast-motion camera tracking in the absence of inertial measurements using random optimization, in particular, the Particle Filter Optimization (PFO).
no code implementations • 5 Apr 2021 • Dengsheng Chen, Haowen Deng, Jun Li, Duo Li, Yao Duan, Kai Xu
In this work, rather than defining a continuous or discrete kernel, we directly embed convolutional kernels into the learnable potential fields, giving rise to potential convolution.
no code implementations • CVPR 2021 • Xiaogang Wang, Xun Sun, Xinyu Cao, Kai Xu, Bin Zhou
Learning-based 3D shape segmentation is usually formulated as a semantic labeling problem, assuming that all parts of training shapes are annotated with a given set of tags.
no code implementations • CVPR 2021 • Yifei Shi, Junwen Huang, Xin Xu, Yifan Zhang, Kai Xu
According to the theory of geometric stability analysis, a minimal set of three planar/cylindrical patches are geometrically stable and determine the full 6DoFs of the object pose.
no code implementations • 1 Jan 2021 • Kai Xu, Akash Srivastava, Dan Gutfreund, Felix Sosa, Tomer Ullman, Joshua B. Tenenbaum, Charles Sutton
As such, learning the laws is then reduced to symbolic regression and Bayesian inference methods are used to obtain the distribution of unobserved properties.
no code implementations • 1 Jan 2021 • Zhikang Zhang, Kai Xu, Fengbo Ren
In this paper, we propose a selective sensing framework that adopts the novel concept of data-driven nonuniform subsampling to reduce the dimensionality of acquired signals while retaining the information of interest in a computation-free fashion.
no code implementations • 31 Dec 2020 • Zheng Zhao, Kai Xu, Attaphon Kaewsnod, Xuyang Liu, Ayut Limphirat, Yupeng Yan
The masses of tetraquark states of all $qc\bar q \bar c$ and $cc\bar c \bar c$ quark configurations are evaluated in a constituent quark model, where the Cornell-like potential and one-gluon exchange spin-spin coupling are employed.
High Energy Physics - Phenomenology
no code implementations • 24 Dec 2020 • Pengdi Huang, Liqiang Lin, Fuyou Xue, Kai Xu, Danny Cohen-Or, Hui Huang
We show that HPC constitutes a powerful point feature learning with a rather compact set of only four types of geometric priors as kernels.
no code implementations • 11 Dec 2020 • Liqiang Lin, Pengdi Huang, Chi-Wing Fu, Kai Xu, Hao Zhang, Hui Huang
We present a novel attention-based mechanism to learn enhanced point features for point cloud processing tasks, e. g., classification and segmentation.
no code implementations • 5 Nov 2020 • Kai Xu
The succeed of simulation strongly supports the ellipse packing hypothesis that was proposed to explain the dynamic behaviors of a trivalent 2D structure.
Biological Physics Adaptation and Self-Organizing Systems Cell Behavior
no code implementations • 25 Oct 2020 • Akash Srivastava, Yamini Bansal, Yukun Ding, Cole Lincoln Hurwitz, Kai Xu, Bernhard Egger, Prasanna Sattigeri, Joshua B. Tenenbaum, Agus Sudjianto, Phuong Le, Arun Prakash R, Nengfeng Zhou, Joel Vaughan, Yaqun Wang, Anwesha Bhattacharyya, Kristjan Greenewald, David D. Cox, Dan Gutfreund
Current autoencoder-based disentangled representation learning methods achieve disentanglement by penalizing the (aggregate) posterior to encourage statistical independence of the latent factors.
1 code implementation • 16 Sep 2020 • Daniel Jünger, Robin Kobus, André Müller, Christian Hundt, Kai Xu, Weiguo Liu, Bertil Schmidt
The rapidly growing amount of data emerging in many fields motivated the need for accelerated hash tables designed for modern parallel architectures.
Distributed, Parallel, and Cluster Computing
no code implementations • 2 Aug 2020 • Yifei Shi, Junwen Huang, Hongjia Zhang, Xin Xu, Szymon Rusinkiewicz, Kai Xu
We propose an end-to-end deep neural network which is able to predict both reflectional and rotational symmetries of 3D objects present in the input RGB-D image.
no code implementations • NeurIPS 2020 • Xiaogang Wang, Yuelang Xu, Kai Xu, Andrea Tagliasacchi, Bin Zhou, Ali Mahdavi-Amiri, Hao Zhang
We introduce an end-to-end learnable technique to robustly identify feature edges in 3D point cloud data.
1 code implementation • 26 Jun 2020 • Hang Zhao, Qijin She, Chenyang Zhu, Yin Yang, Kai Xu
We solve a challenging yet practically useful variant of 3D Bin Packing Problem (3D-BPP).
1 code implementation • 24 Jun 2020 • Dengsheng Chen, Jun Li, Kai Xu
Adding the attention module with a rectified linear unit (ReLU) results in an amplification of positive elements and a suppression of negative ones, both with learned, data-adaptive parameters.
1 code implementation • NeurIPS 2020 • Benjamin Rhodes, Kai Xu, Michael U. Gutmann
Density-ratio estimation via classification is a cornerstone of unsupervised learning.
1 code implementation • CVPR 2020 • Qian Xie, Yu-Kun Lai, Jing Wu, Zhoutao Wang, Yiming Zhang, Kai Xu, Jun Wang
We demonstrate these by capturing contextual information at patch, object and scene levels.
Ranked #2 on 3D Object Detection on ARKitScenes
1 code implementation • CVPR 2020 • Jiazhao Zhang, Chenyang Zhu, Lintao Zheng, Kai Xu
Online semantic 3D segmentation in company with real-time RGB-D reconstruction poses special challenges such as how to perform 3D convolution directly over the progressively fused 3D geometric data, and how to smartly fuse information from frame to frame.
1 code implementation • IEEE Transactions on Image Processing 2020 • Qi Bi, Kun Qin, Zhili Li, Han Zhang, Kai Xu, Gui-Song Xia
It regards aerial scene classification as a multiple-instance learning problem so that local semantics can be further investigated.
Ranked #3 on Scene Recognition on AID
4 code implementations • CVPR 2020 • Kai Xu, Minghai Qin, Fei Sun, Yuhao Wang, Yen-Kuang Chen, Fengbo Ren
Experiment results show that learning in the frequency domain with static channel selection can achieve higher accuracy than the conventional spatial downsampling approach and meanwhile further reduce the input data size.
2 code implementations • 7 Feb 2020 • Mohamed Tarek, Kai Xu, Martin Trapp, Hong Ge, Zoubin Ghahramani
Since DynamicPPL is a modular, stand-alone library, any probabilistic programming system written in Julia, such as Turing. jl, can use DynamicPPL to specify models and trace their model parameters.
no code implementations • 4 Feb 2020 • Min Liu, Zherong Pan, Kai Xu, Kanishka Ganguly, Dinesh Manocha
We present an end-to-end algorithm for training deep neural networks to grasp novel objects.
Robotics
no code implementations • CVPR 2020 • Dengsheng Chen, Jun Li, Zheng Wang, Kai Xu
To tackle intra-class shape variations, we learn canonical shape space (CASS), a unified representation for a large variety of instances of a certain object category.
no code implementations • 17 Dec 2019 • Kai Xu, Xiao-Jun Wu, Wen-Bo Hu
Based on further studying the low-rank subspace clustering (LRSC) and L2-graph subspace clustering algorithms, we propose a F-graph subspace clustering algorithm with a symmetric constraint (FSSC), which constructs a new objective function with a symmetric constraint basing on F-norm, whose the most significant advantage is to obtain a closed-form solution of the coefficient matrix.
1 code implementation • 9 Dec 2019 • Qiaoyun Wu, Kai Xu, Jun Wang, Mingliang Xu, Dinesh Manocha
The regularization maximizes the mutual information between navigation actions and visual observation transforms of an agent, thus promoting more informed navigation decisions.
Robotics
no code implementations • 26 Nov 2019 • Siyan Dong, Songyin Wu, Yixin Zhuang, Kai Xu, Shanghang Zhang, Baoquan Chen
To address this issue, we approach camera relocalization with a decoupled solution where feature extraction, coordinate regression, and pose estimation are performed separately.
3 code implementations • CVPR 2020 • Rundi Wu, Yixin Zhuang, Kai Xu, Hao Zhang, Baoquan Chen
We introduce PQ-NET, a deep neural network which represents and generates 3D shapes via sequential part assembly.
1 code implementation • pproximateinference AABI Symposium 2019 • Tor Erlend Fjelde, Kai Xu, Mohamed Tarek, Sharan Yalburgi, Hong Ge
Transforming one probability distribution to another is a powerful tool in Bayesian inference and machine learning.
1 code implementation • pproximateinference AABI Symposium 2019 • Kai Xu, Hong Ge, Will Tebbutt, Mohamed Tarek, Martin Trapp, Zoubin Ghahramani
Stan's Hamilton Monte Carlo (HMC) has demonstrated remarkable sampling robustness and efficiency in a wide range of Bayesian inference problems through carefully crafted adaption schemes to the celebrated No-U-Turn sampler (NUTS) algorithm.
no code implementations • ICCV 2019 • Maciej Halber, Yifei Shi, Kai Xu, Thomas Funkhouser
In depth-sensing applications ranging from home robotics to AR/VR, it will be common to acquire 3D scans of interior spaces repeatedly at sparse time intervals (e. g., as part of regular daily use).
no code implementations • 7 Sep 2019 • Zhen-Biao Yang, Pei-Rong Han, Xin-Jie Huang, Wen Ning, HekangLi, Kai Xu, Dongning Zheng, Heng Fan, Shi-Biao Zheng
No-cloning theorem forbids perfect cloning of an unknown quantum state.
Quantum Physics
no code implementations • 22 Aug 2019 • Qi Bi, Kun Qin, Zhili Li, Han Zhang, Kai Xu
While the current convolution neural network tends to extract global features and global semantic information in a scene, the geo-spatial objects can be located at anywhere in an aerial image scene and their spatial arrangement tends to be more complicated.
no code implementations • 22 Aug 2019 • Qi Bi, Kun Qin, Han Zhang, Wenjun Han, Zhili Li, Kai Xu
Exhaustive experiments indicate that the proposed method can detect building change types directly and outperform the current multi-index learning method.
no code implementations • 18 Jun 2019 • Lintao Zheng, Chenyang Zhu, Jiazhao Zhang, Hang Zhao, Hui Huang, Matthias Niessner, Kai Xu
In our method, the exploratory robot scanning is both driven by and targeting at the recognition and segmentation of semantic objects from the scene.
1 code implementation • 17 Jun 2019 • Qiaoyun Wu, Dinesh Manocha, Jun Wang, Kai Xu
First, the latent distribution is conditioned on current observations and the target view, leading to a model-based, target-driven navigation.
no code implementations • 16 Jun 2019 • Jun Li, Chengjie Niu, Kai Xu
Enlightened by the fact that 3D shape structure is characterized as part composition and placement, we propose to model 3D shape variations with a part-aware deep generative network, coined as PAGENet.
1 code implementation • NeurIPS 2019 • Cole L. Hurwitz, Kai Xu, Akash Srivastava, Alessio P. Buccino, Matthias H. Hennig
Determining the positions of neurons in an extracellular recording is useful for investigating functional properties of the underlying neural circuitry.
1 code implementation • 12 May 2019 • Guanjie Zheng, Yuanhao Xiong, Xinshi Zang, Jie Feng, Hua Wei, Huichu Zhang, Yong Li, Kai Xu, Zhenhui Li
Increasingly available city data and advanced learning techniques have empowered people to improve the efficiency of our city functions.
no code implementations • 12 May 2019 • Guanjie Zheng, Xinshi Zang, Nan Xu, Hua Wei, Zhengyao Yu, Vikash Gayah, Kai Xu, Zhenhui Li
In this paper, we propose to re-examine the RL approaches through the lens of classic transportation theory.
4 code implementations • 11 May 2019 • Hua Wei, Nan Xu, Huichu Zhang, Guanjie Zheng, Xinshi Zang, Chacha Chen, Wei-Nan Zhang, Yanmin Zhu, Kai Xu, Zhenhui Li
To enable cooperation of traffic signals, in this paper, we propose a model, CoLight, which uses graph attentional networks to facilitate communication.
Multi-agent Reinforcement Learning Reinforcement Learning +1
1 code implementation • CVPR 2019 • Kai Xu, Longyin Wen, Guorong Li, Liefeng Bo, Qingming Huang
Specifically, the temporal coherence branch pretrained in an adversarial fashion from unlabeled video data, is designed to capture the dynamic appearance and motion cues of video sequences to guide object segmentation.
Ranked #2 on Semi-Supervised Video Object Segmentation on YouTube
no code implementations • CVPR 2020 • Chenyang Zhu, Kai Xu, Siddhartha Chaudhuri, Li Yi, Leonidas Guibas, Hao Zhang
While the part prior network can be trained with noisy and inconsistently segmented shapes, the final output of AdaCoSeg is a consistent part labeling for the input set, with each shape segmented into up to (a user-specified) K parts.
1 code implementation • CVPR 2019 • Xiaogang Wang, Bin Zhou, Yahao Shi, Xiaowu Chen, Qinping Zhao, Kai Xu
For the task of mobility analysis of 3D shapes, we propose joint analysis for simultaneous motion part segmentation and motion attribute estimation, taking a single 3D model as input.
no code implementations • CVPR 2019 • Yifei Shi, Angel Xuan Chang, Zhelun Wu, Manolis Savva, Kai Xu
Indoor scenes exhibit rich hierarchical structure in 3D object layouts.
no code implementations • CVPR 2019 • Fenggen Yu, Kun Liu, Yan Zhang, Chenyang Zhu, Kai Xu
Meanwhile, to increase the segmentation accuracy at each node, we enhance the recursive contextual feature with the shape feature extracted for the corresponding part.
Ranked #16 on 3D Part Segmentation on ShapeNet-Part (Class Average IoU metric)
no code implementations • 1 Mar 2019 • Min Liu, Zherong Pan, Kai Xu, Kanishka Ganguly, Dinesh Manocha
The quality of the grasp poses is on par with the groundtruth poses in the dataset.
Robotics
no code implementations • 14 Sep 2018 • Chenyang Zhu, Kai Xu, Siddhartha Chaudhuri, Renjiao Yi, Hao Zhang
The network may significantly alter the geometry and structure of the input parts and synthesize a novel shape structure based on the inputs, while adding or removing parts to minimize a structure plausibility loss.
no code implementations • 13 Sep 2018 • Xiaogang Wang, Bin Zhou, Haiyue Fang, Xiaowu Chen, Qinping Zhao, Kai Xu
We propose to generate part hypotheses from the components based on a hierarchical grouping strategy, and perform labeling on those part groups instead of directly on the components.
no code implementations • 20 Aug 2018 • Songle Chen, Lintao Zheng, Yan Zhang, Zhixin Sun, Kai Xu
Multi-view deep neural network is perhaps the most successful approach in 3D shape classification.
no code implementations • 20 Aug 2018 • Yawei Zhao, Kai Xu, Xinwang Liu, En Zhu, Xinzhong Zhu, Jianping Yin
The reason is that it finds the similar instances according to their features directly, which is usually impacted by the imperfect data, and thus returns sub-optimal results.
2 code implementations • 11 Aug 2018 • Biao Leng, Cheng Zhang, Xiaocheng Zhou, Cheng Xu, Kai Xu
In this network, a Score Generation Unit is devised to evaluate the quality of each projected image with score vectors.
1 code implementation • ECCV 2018 • Kai Xu, Zhikang Zhang, Fengbo Ren
We propose a scalable Laplacian pyramid reconstructive adversarial network (LAPRAN) that enables high-fidelity, flexible and fast CS images reconstruction.
no code implementations • 24 Jul 2018 • Manyi Li, Akshay Gadi Patil, Kai Xu, Siddhartha Chaudhuri, Owais Khan, Ariel Shamir, Changhe Tu, Baoquan Chen, Daniel Cohen-Or, Hao Zhang
We present a generative neural network which enables us to generate plausible 3D indoor scenes in large quantities and varieties, easily and highly efficiently.
Graphics
no code implementations • ICLR 2020 • Akash Srivastava, Kai Xu, Michael U. Gutmann, Charles Sutton
In this work, we take their insight of using kernels as fixed adversaries further and present a novel method for training deep generative models that does not involve saddlepoint optimization.