no code implementations • ECCV 2020 • Jin Xie, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Yanwei Pang, Ling Shao, Mubarak Shah
We further introduce a count-and-similarity branch within the two-stage detection framework, which predicts pedestrian count as well as proposal similarity.
1 code implementation • 5 Sep 2024 • Lin Sun, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang
To address this, we propose an iterative refinement framework for training-free segmentation, named iSeg, having an entropy-reduced self-attention module which utilizes a gradient descent scheme to reduce the entropy of self-attention map, thereby suppressing the weak responses corresponding to irrelevant global information.
no code implementations • 29 Jul 2024 • Yang Wu, Kaihua Zhang, Jianjun Qian, Jin Xie, Jian Yang
The complex traffic environment and various weather conditions make the collection of LiDAR data expensive and challenging.
no code implementations • 4 Jul 2024 • Jin Xie, Songze Li
Training large models requires a large amount of data, as well as abundant computation resources.
no code implementations • 19 Apr 2024 • Jin Xie, Chenqing Zhu, Songze Li
We focus on the problem of Personalized Federated Continual Learning (PFCL): a group of distributed clients, each with a sequence of local tasks on arbitrary data distributions, collaborate through a central server to train a personalized model at each client, with the model expected to achieve good performance on all local tasks.
no code implementations • 15 Apr 2024 • Bonan Ding, Jin Xie, Jing Nie, Jiale Cao, Xuelong Li, Yanwei Pang
Therefore, an effective solution involves transforming monocular images into LiDAR-like representations and employing a LiDAR-based 3D object detector to predict the 3D coordinates of objects.
no code implementations • 11 Apr 2024 • Hefeng Wang, Jiale Cao, Jin Xie, Aiping Yang, Yanwei Pang
The explicit branch utilizes the ground-truth labels of corresponding images as text prompts to condition feature extraction of diffusion model.
1 code implementation • 29 Mar 2024 • Qianliang Wu, Haobo Jiang, Lei Luo, Jun Li, Yaqing Ding, Jin Xie, Jian Yang
Establishing reliable correspondences is essential for registration tasks such as 3D and 2D3D registration.
1 code implementation • 19 Mar 2024 • Wenqi Zhu, Jiale Cao, Jin Xie, Shuangming Yang, Yanwei Pang
Open-vocabulary video instance segmentation strives to segment and track instances belonging to an open set of categories in a video.
no code implementations • 25 Jan 2024 • Xinwei Yue, Jin Xie, Chongjun Ouyang, Yuanwei Liu, Xia Shen, Zhiguo Ding
The numerical results are presented and show that: 1) ASTARS-NOMA with pSIC outperforms ASTARS assisted-orthogonal multiple access (ASTARS-OMA) in terms of outage probability and ergodic data rate; 2) The outage probability of ASTARS-NOMA can be further reduced within a certain range by increasing the power amplification factors; 3) The system throughputs of ASTARS-NOMA are superior to that of ASTARS-OMA in both delay-limited and delay-tolerant transmission modes.
1 code implementation • CVPR 2024 • Can Xu, Yuehui Han, Rui Xu, Le Hui, Jin Xie, Jian Yang
3D visual grounding aims to localize 3D objects described by free-form language sentences.
no code implementations • 31 Dec 2023 • Qianliang Wu, Haobo Jiang, Yaqing Ding, Lei Luo, Jin Xie, Jian Yang
They typically compute candidate correspondences based on distances in the point feature space.
1 code implementation • 21 Dec 2023 • Yun Zhu, Le Hui, Yaqi Shen, Jin Xie
To this end, we propose a novel superpoint grouping network for indoor anchor-free one-stage 3D object detection.
Ranked #5 on 3D Object Detection on S3DIS
1 code implementation • CVPR 2024 • Bin Xie, Jiale Cao, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang
In this paper, we propose a simple encoder-decoder, named SED, for open-vocabulary semantic segmentation, which comprises a hierarchical encoder-based cost map generation and a gradual fusion decoder with category early rejection.
no code implementations • 12 Sep 2023 • Qianliang Wu, Yaqing Ding, Lei Luo, Haobo Jiang, Shuo Gu, Chuanwei Zhou, Jin Xie, Jian Yang
These high-order features are then propagated to dense points and utilized by a Sinkhorn matching module to identify key correspondences for successful registration.
1 code implementation • 24 Aug 2023 • Wei Xie, Haobo Jiang, Shuo Gu, Jin Xie
Robust obstacle avoidance is one of the critical steps for successful goal-driven indoor navigation tasks. Due to the obstacle missing in the visual image and the possible missed detection issue, visual image-based obstacle avoidance techniques still suffer from unsatisfactory robustness.
1 code implementation • 6 Jun 2023 • Hefeng Wang, Jiale Cao, Rao Muhammad Anwer, Jin Xie, Fahad Shahbaz Khan, Yanwei Pang
Our DFormer outperforms the recent diffusion-based panoptic segmentation method Pix2Seq-D with a gain of 3. 6% on MS COCO val2017 set.
1 code implementation • CVPR 2023 • Yaqi Shen, Le Hui, Jin Xie, Jian Yang
In our superpoint generation module, we utilize the bidirectional flow information at the previous iteration to obtain the matching points of points and superpoint centers for soft point-to-superpoint association construction, in which the superpoints are generated for pairwise point clouds.
no code implementations • 24 Apr 2023 • Hanqing Sun, Yanwei Pang, Jiale Cao, Jin Xie, Xuelong Li
In this paper, we explore the model design of Transformers in binocular 3D object detection, focusing particularly on extracting and encoding task-specific image correspondence information.
1 code implementation • CVPR 2023 • Kaiyou Song, Jin Xie, Shan Zhang, Zimeng Luo
Different from existing SSL-KD methods that transfer knowledge from a static pre-trained teacher to a student, in MOKD, two different models learn collaboratively in a self-supervised manner.
1 code implementation • CVPR 2023 • Haochen Wang, Kaiyou Song, Junsong Fan, Yuxi Wang, Jin Xie, Zhaoxiang Zhang
We observe that the reconstruction loss can naturally be the metric of the difficulty of the pre-training task.
1 code implementation • CVPR 2023 • Haobo Jiang, Zheng Dang, Zhen Wei, Jin Xie, Jian Yang, Mathieu Salzmann
Embedded with the inlier/outlier label, the posterior feature distribution is label-dependent and discriminative.
no code implementations • 21 Mar 2023 • Zhiqiang Dong, Jiale Cao, Rao Muhammad Anwer, Jin Xie, Fahad Khan, Yanwei Pang
Given a set of sparse and learnable proposals, LEAPS employs a dynamic person search head to directly perform person detection and corresponding re-id feature generation without non-maximum suppression post-processing.
no code implementations • 12 Feb 2023 • Qianliang Wu, Yaqi Shen, Haobo Jiang, Guofeng Mei, Yaqing Ding, Lei Luo, Jin Xie, Jian Yang
Point Cloud Registration is a fundamental and challenging problem in 3D computer vision.
no code implementations • ICCV 2023 • Le Hui, Linghua Tang, Yuchao Dai, Jin Xie, Jian Yang
Then, to generate homogeneous superpoints from the sparse LiDAR point cloud, we propose a LiDAR point grouping algorithm that simultaneously considers the similarity of point embeddings and the Euclidean distance of points in 3D space.
no code implementations • ICCV 2023 • Haobo Jiang, Zheng Dang, Shuo Gu, Jin Xie, Mathieu Salzmann, Jian Yang
Our method decouples the translation from the entire transformation by predicting the object center and estimating the rotation in a center-aware manner.
no code implementations • ICCV 2023 • Kaiyou Song, Shan Zhang, Zihao An, Zimeng Luo, Tong Wang, Jin Xie
In contrastive self-supervised learning, the common way to learn discriminative representation is to pull different augmented "views" of the same image closer while pushing all other images further apart, which has been proven to be effective.
1 code implementation • 11 Oct 2022 • Linghua Tang, Le Hui, Jin Xie
Due to the few annotated labels of 3D point clouds, how to learn discriminative features of point clouds to segment object instances is a challenging problem.
no code implementations • 14 Sep 2022 • Haobo Jiang, Kaihao Lan, Le Hui, Guangyu Li, Jin Xie, Jian Yang
The core of Siamese feature matching is how to assign high feature similarity on the corresponding points between the template and search area for precise object localization.
no code implementations • 9 Aug 2022 • Yikai Bian, Le Hui, Jianjun Qian, Jin Xie
Unsupervised domain adaptation for point cloud semantic segmentation has attracted great attention due to its effectiveness in learning with unlabeled data.
1 code implementation • 25 Jul 2022 • Mu He, Le Hui, Yikai Bian, Jian Ren, Jin Xie, Jian Yang
In this paper, we propose a resolution adaptive self-supervised monocular depth estimation method (RA-Depth) by learning the scale invariance of the scene depth.
1 code implementation • 25 Jul 2022 • Yuehui Han, Le Hui, Haobo Jiang, Jianjun Qian, Jin Xie
To this end, in this paper, we propose a novel adaptive subgraph generation based contrastive learning framework for efficient and robust self-supervised graph representation learning, and the optimal transport distance is utilized as the similarity metric between the subgraphs.
1 code implementation • 25 Jul 2022 • Le Hui, Lingpeng Wang, Linghua Tang, Kaihao Lan, Jin Xie, Jian Yang
Siamese network based trackers formulate 3D single object tracking as cross-correlation learning between point features of a template and a search area.
no code implementations • 30 Apr 2022 • Yubin Guo, Haobo Jiang, Xinlei Qi, Jin Xie, Cheng-Zhong Xu, Hui Kong
Meanwhile, we release a large dual-spectrum depth estimation dataset with visible-light and far-infrared stereo images captured in different scenes to the society.
1 code implementation • CVPR 2022 • Jiale Cao, Yanwei Pang, Rao Muhammad Anwer, Hisham Cholakkal, Jin Xie, Mubarak Shah, Fahad Shahbaz Khan
We propose a novel one-step transformer-based person search framework, PSTR, that jointly performs person detection and re-identification (re-id) in a single architecture.
1 code implementation • 22 Mar 2022 • Haobo Jiang, Jin Xie, Jian Yang
Finally, we use the maximum value in the second set of estimators to clip the action value of the chosen action in the first set of estimators and the clipped value is used for approximating the maximum expected action value.
no code implementations • 14 Mar 2022 • Jin Xie, Teng Zhang, Jose Blanchet, Peter Glynn, Matthew Randolph, David Scheinker
In order for an algorithm to see sustained use, it must be compatible with changes to hospital capacity, patient volumes, and scheduling practices.
no code implementations • 24 Feb 2022 • Rui Xu, Zongyan Han, Le Hui, Jianjun Qian, Jin Xie
Then, we develop a generative adversarial network that combines the domain-specific features of the seen categories with the aligned domain-invariant features to synthesize samples, where the synthesized samples of the unseen categories are generated by using the corresponding word embeddings.
1 code implementation • 23 Feb 2022 • Yaqi Shen, Le Hui, Haobo Jiang, Jin Xie, Jian Yang
In this paper, we propose a neighborhood consensus based reliable inlier evaluation method for robust unsupervised point cloud registration.
1 code implementation • NeurIPS 2021 • Le Hui, Lingpeng Wang, Mingmei Cheng, Jin Xie, Jian Yang
The Siamese shape-aware feature learning network can capture 3D shape information of the object to learn the discriminative features of the object so that the potential target from the background in sparse point clouds can be identified.
1 code implementation • ICCV 2021 • Haobo Jiang, Yaqi Shen, Jin Xie, Jun Li, Jianjun Qian, Jian Yang
Based on the reward function, for each state, we then construct a fused score function to evaluate the sampled transformations, where we weight the current and future rewards of the transformations.
no code implementations • 7 Sep 2021 • Jin Xie, Xinyu Li, Liang Gao, Lin Gui
According to the above finding, this paper proposes a new N8 neighborhood structure considering the movement of critical operations within a critical block and the movement of critical operations outside the critical block.
no code implementations • 3 Sep 2021 • Yiming Tu, Jin Xie
Nonetheless, few efforts are made on the unsupervised deep lidar odometry.
no code implementations • 5 Aug 2021 • Haobo Jiang, Jin Xie, Jianjun Qian, Jian Yang
By modeling the point cloud registration process as a Markov decision process (MDP), we develop a latent dynamic model of point clouds, consisting of a transformation network and evaluation network.
1 code implementation • 1 Aug 2021 • Yifan Zhao, Le Hui, Jin Xie
To achieve this, we exploit the consistency between the input sparse point cloud and generated dense point cloud for the shapes and rendered images.
1 code implementation • 3 May 2021 • Haobo Jiang, Jin Xie, Jian Yang
Finally, we use the maximum value in the second set of estimators to clip the action value of the chosen action in the first set of estimators and the clipped value is used for approximating the maximum expected action value.
1 code implementation • 16 Apr 2021 • Mingmei Cheng, Le Hui, Jin Xie, Jian Yang
In order to reduce the number of annotated labels, we propose a semi-supervised semantic point cloud segmentation network, named SSPC-Net, where we train the semantic segmentation network by inferring the labels of unlabeled points from the few annotated 3D points.
1 code implementation • 29 Mar 2021 • Ziyu Li, Yuncong Yao, Zhibin Quan, Wankou Yang, Jin Xie
Specifically, we design the Spatial Information Enhancement (SIE) module to predict the spatial shapes of the foreground points within proposals, and extract the structure information to learn the representative features for further box refinement.
no code implementations • 24 Mar 2021 • Guangwei Gao, Guoan Xu, Yi Yu, Jin Xie, Jian Yang, Dong Yue
In recent years, how to strike a good trade-off between accuracy and inference speed has become the core issue for real-time semantic segmentation applications, which plays a vital role in real-world scenarios such as autonomous driving systems and drones.
1 code implementation • 7 Jan 2021 • Le Hui, Mingmei Cheng, Jin Xie, Jian Yang
In this paper, we develop an efficient point cloud learning network (EPC-Net) to form a global descriptor for visual place recognition, which can obtain good performance and reduce computation memory and inference time.
1 code implementation • ICCV 2021 • Le Hui, Hang Yang, Mingmei Cheng, Jin Xie, Jian Yang
In order to obtain discriminative global descriptors, we construct a pyramid VLAD module to aggregate the multi-scale feature maps of point clouds into the global descriptors.
Ranked #3 on 3D Place Recognition on Oxford RobotCar Dataset
1 code implementation • ICCV 2021 • Le Hui, Jia Yuan, Mingmei Cheng, Jin Xie, Xiaoya Zhang, Jian Yang
Specifically, in our clustering network, we first jointly learn a soft point-superpoint association map from the coordinate and feature spaces of point clouds, where each point is assigned to the superpoint with a learned weight.
1 code implementation • 18 Nov 2020 • Yanwei Pang, Jiale Cao, Yazhao Li, Jin Xie, Hanqing Sun, Jinfeng Gong
In addition, a new diverse pedestrian dataset is further built.
2 code implementations • 1 Oct 2020 • Jiale Cao, Yanwei Pang, Jin Xie, Fahad Shahbaz Khan, Ling Shao
In addition to single-spectral pedestrian detection, we also review multi-spectral pedestrian detection, which provides more robust features for illumination variance.
no code implementations • 30 Jul 2020 • Mingmei Cheng, Le Hui, Jin Xie, Jian Yang, Hui Kong
In this paper, we propose a cascaded non-local neural network for point cloud segmentation.
1 code implementation • ECCV 2020 • Le Hui, Rui Xu, Jin Xie, Jianjun Qian, Jian Yang
Starting from the low-resolution point clouds, with the bilateral interpolation and max-pooling operations, the deconvolution network can progressively output high-resolution local and global feature maps.
no code implementations • 25 Jan 2020 • Jin Xie, Yanwei Pang, Hisham Cholakkal, Rao Muhammad Anwer, Fahad Shahbaz Khan, Ling Shao
On the heavy occluded (\textbf{HO}) set of CityPerosns test set, our PSC-Net obtains an absolute gain of 4. 0\% in terms of log-average miss rate over the state-of-the-art with same backbone, input scale and without using additional VBB supervision.
no code implementations • 26 Nov 2019 • Jin Xie, Longfei Wang, Paula Webster, Yang Yao, Jiayao Sun, Shuo Wang, Huihui Zhou
In this study, we developed a novel two-stream deep learning network for this recognition based on 700 images and corresponding eye movement patterns of ASD and TD, and obtained an accuracy of 0. 95, which was higher than the previous state-of-the-art.
1 code implementation • ICCV 2019 • Yanwei Pang, Jin Xie, Muhammad Haris Khan, Rao Muhammad Anwer, Fahad Shahbaz Khan, Ling Shao
Our approach obtains an absolute gain of 9. 5% in log-average miss rate, compared to the best reported results on the heavily occluded (HO) pedestrian set of CityPersons test set.
no code implementations • 20 Dec 2018 • Xi Chen, Jin Xie, Qingcong Yuan
Here we present models of deep learning (DL) and apply them to gene expression data for the diagnosis and categorization of cancer.
no code implementations • CVPR 2017 • Jin Xie, Guoxian Dai, Fan Zhu, Yi Fang
For 3D shapes, we then compute the Wasserstein barycenters of deep features of multiple projections to form a barycentric representation.
no code implementations • CVPR 2016 • Jin Xie, Meng Wang, Yi Fang
Different from these real-valued local shape descriptors, in this paper, we propose to learn a novel binary spectral shape descriptor with the deep neural network for 3D shape correspondence.
no code implementations • CVPR 2015 • Yi Fang, Jin Xie, Guoxian Dai, Meng Wang, Fan Zhu, Tiantian Xu, Edward Wong
Shape descriptor is a concise yet informative representation that provides a 3D object with an identification as a member of some category.
no code implementations • CVPR 2015 • Jin Xie, Yi Fang, Fan Zhu, Edward Wong
Then, by imposing the Fisher discrimination criterion on the neurons in the hidden layer, we developed a novel discriminative deep auto-encoder for shape feature learning.