no code implementations • 28 Nov 2024 • Corentin Dumery, Noa Etté, Jingyi Xu, Aoxiang Fan, Ren Li, Hieu Le, Pascal Fua
Visual object counting is a fundamental computer vision task underpinning numerous real-world applications, from cell counting in biomedicine to traffic and wildlife monitoring.
no code implementations • 27 Nov 2024 • Shuangqi Li, Hieu Le, Jingyi Xu, Mathieu Salzmann
To improve the model's compositional ability, we propose a method for mining these reliable cases, resulting in a curated training set of generated images without requiring any manual annotation.
no code implementations • 21 Nov 2024 • Jingyi Xu, Xieyuanli Chen, Junyi Ma, Jiawei Huang, Jintao Xu, Yue Wang, Ling Pei
Existing 3D OCF approaches struggle to predict plausible spatial details for movable objects and suffer from slow inference speeds due to neglecting the bias and uneven distribution of changing occupancy states in both space and time.
no code implementations • 16 Oct 2024 • Jingyi Xu, Yeqi Luo, Weidong Yang, Keyi Liu, Shengnan Wang, Ben Fei, Lei Bai
Arctic sea ice performs a vital role in global climate and has paramount impacts on both polar ecosystems and coastal communities.
no code implementations • 10 Oct 2024 • Jingyi Xu, Siwei Tu, Weidong Yang, Shuhao Li, Keyi Liu, Yeqi Luo, Lipeng Ma, Ben Fei, Lei Bai
Variation of Arctic sea ice has significant impacts on polar ecosystems, transporting routes, coastal communities, and global climate.
1 code implementation • 1 Oct 2024 • Zhangshuo Qi, Junyi Ma, Jingyi Xu, Zijie Zhou, Luqi Cheng, Guangming Xiong
Place recognition is a crucial module to ensure autonomous vehicles obtain usable localization information in GPS-denied environments.
no code implementations • 29 Sep 2024 • Jingyi Xu, Hieu Le, Zhixin Shu, Yang Wang, Yi-Hsuan Tsai, Dimitris Samaras
The training signals for this predictor are obtained through our emotion-agnostic intensity pseudo-labeling method without the need of frame-wise intensity labeling.
no code implementations • 8 Sep 2024 • Keyi Liu, Yeqi Luo, Weidong Yang, Jingyi Xu, Zhijun Li, Wen-Ming Chen, Ben Fei
3DGS utilizes multi-view rendered images as input to generate enhanced point cloud distributions and novel view images, facilitating data augmentation and cross-modal contrastive learning.
no code implementations • 4 Sep 2024 • Junyi Ma, Xieyuanli Chen, Wentao Bao, Jingyi Xu, Hesheng Wang
Understanding human intentions and actions through egocentric videos is important on the path to embodied artificial intelligence.
1 code implementation • 21 Jul 2024 • Jingyi Xu, Hieu Le, Dimitris Samaras
We show that the proposed score correlates highly with the sample quality for various generative models including VAEs, GANs and Latent Diffusion Models.
no code implementations • 9 Jun 2024 • Rui Zhang, Tianyue Luo, Weidong Yang, Ben Fei, Jingyi Xu, Qingyuan Zhou, Keyi Liu, Ying He
3D Gaussian Splatting (3D-GS) has made a notable advancement in the field of neural rendering, 3D scene reconstruction, and novel view synthesis.
no code implementations • 1 Jun 2024 • Ben Fei, Yixuan Li, Weidong Yang, Hengjun Gao, Jingyi Xu, Lipeng Ma, Yatian Yang, Pinghong Zhou
The development of medical imaging techniques has made a significant contribution to clinical decision-making.
1 code implementation • 7 May 2024 • Junyi Ma, Jingyi Xu, Xieyuanli Chen, Hesheng Wang
Understanding how humans would behave during hand-object interaction is vital for applications in service robot manipulation and extended reality.
no code implementations • 7 May 2024 • Yanli Yuan, Bingbing Wang, Chuan Zhang, Jingyi Xu, Ximeng Liu, Liehuang Zhu
Though recent methods based on Fully Convolutional Neural Networks (F-CNNs) have shown success in many segmentation tasks, fusing features from images with different scales is still a challenge: (1) Due to the lack of spatial awareness, F-CNNs share the same weights at different spatial locations.
1 code implementation • 22 Mar 2024 • Shuhao Li, Yue Cui, Jingyi Xu, Libin Li, Lingkai Meng, Weidong Yang, Fan Zhang, Xiaofang Zhou
Traffic prediction has long been a focal and pivotal area in research, witnessing both significant strides from city-level to road-level predictions in recent years.
1 code implementation • 15 Mar 2024 • Jingyi Xu, Weidong Yang, Lingdong Kong, Youquan Liu, Rui Zhang, Qingyuan Zhou, Ben Fei
Then, another VFM trained on fine-grained 2D masks is adopted to guide the generation of semantically augmented images and point clouds to enhance the performance of neural networks, which mix the data from source and target domains like view frustums (FrustumMixing).
2 code implementations • 27 Feb 2024 • Jingyi Xu, Junyi Ma, Qi Wu, Zijie Zhou, Yue Wang, Xieyuanli Chen, Ling Pei
Fusion-based place recognition is an emerging technique jointly utilizing multi-modal perception data, to recognize previously visited places in GPS-denied scenarios for robots and autonomous vehicles.
no code implementations • 11 Feb 2024 • Ben Fei, Jingyi Xu, Rui Zhang, Qingyuan Zhou, Weidong Yang, Ying He
3D Gaussian Splatting (3D-GS) has emerged as a significant advancement in the field of Computer Graphics, offering explicit scene representation and novel view synthesis without the reliance on neural networks, such as Neural Radiance Fields (NeRF).
1 code implementation • CVPR 2024 • Junyi Ma, Xieyuanli Chen, Jiawei Huang, Jingyi Xu, Zhen Luo, Jintao Xu, Weihao Gu, Rui Ai, Hesheng Wang
Furthermore, the standardized evaluation protocol for preset multiple tasks is also provided to compare the performance of all the proposed baselines on present and future occupancy estimation with respect to objects of interest in autonomous driving scenarios.
1 code implementation • 6 Nov 2023 • Zijie Zhou, Jingyi Xu, Guangming Xiong, Junyi Ma
However, most existing multimodal place recognition methods only use limited field-of-view camera images, which leads to an imbalance between features from different modalities and limits the effectiveness of sensor fusion.
no code implementations • 22 Sep 2023 • Jingyi Xu, Hieu Le, Dimitris Samaras
Thus, we propose zero-shot object counting (ZSC), a new setting where only the class name is available during test time.
no code implementations • 16 Jul 2023 • Hieu Le, Jingyi Xu, Nicolas Talabot, Jiancheng Yang, Pascal Fua
Medical applications often require accurate 3D representations of complex organs with multiple parts, such as the heart and spine.
no code implementations • 15 Jul 2023 • Jingyi Xu, Hieu Le, Dimitris Samaras
In this paper, we point out that the task of counting objects of interest when there are multiple object classes in the image (namely, multi-class object counting) is particularly challenging for current object counting models.
no code implementations • CVPR 2023 • Jingyi Xu, Hieu Le, Dimitris Samaras
To mitigate this issue, we propose a novel variational autoencoder (VAE) based data generation model, which is capable of generating data with increased crop-related diversity.
1 code implementation • CVPR 2023 • Jingyi Xu, Tushar Vaidya, Yufei Wu, Saket Chandra, Zhangsheng Lai, Kai Fong Ernest Chong
We introduce algebraic machine reasoning, a new reasoning framework that is well-suited for abstract reasoning.
1 code implementation • CVPR 2023 • Jingyi Xu, Hieu Le, Vu Nguyen, Viresh Ranjan, Dimitris Samaras
By applying this model to all the candidate patches, we can select the most suitable patches as exemplars for counting.
Ranked #7 on Zero-Shot Counting on FSC147
1 code implementation • 3 Feb 2023 • Junyi Ma, Guangming Xiong, Jingyi Xu, Xieyuanli Chen
LiDAR-based place recognition (LPR) is one of the most crucial components of autonomous vehicles to identify previously visited places in GPS-denied environments.
1 code implementation • 16 Sep 2022 • Junyi Ma, Xieyuanli Chen, Jingyi Xu, Guangming Xiong
It uses multi-scale transformers to generate a global descriptor for each sequence of LiDAR range images in an end-to-end fashion.
no code implementations • 31 May 2022 • Ziyuan Xia, Anchen Sun, Jingyi Xu, Yuanzhe Peng, Rui Ma, Minghui Cheng
This survey paper conducts a comprehensive analysis of the evolution and contemporary landscape of recommendation systems, which have been extensively incorporated across a myriad of web applications.
1 code implementation • CVPR 2022 • Jingyi Xu, Hieu Le
To mitigate this issue, we propose to generate visual samples based on semantic embeddings using a conditional variational autoencoder (CVAE) model.
1 code implementation • CVPR 2022 • Jingyi Xu, Zihan Chen, Tony Q. S. Quek, Kai Fong Ernest Chong
Although there exist methods in centralized learning for tackling label noise, such methods do not perform well on heterogeneous label noise in FL settings, due to the typically smaller sizes of client datasets and data privacy requirements in FL.
1 code implementation • 22 Feb 2022 • Jingyi Xu, Zirui Li, Li Gao, Junyi Ma, Qi Liu, Yanan Zhao
Different exploration methods of DRL, including adding action space noise and parameter space noise, are compared against each other in the transfer learning process in this work.
1 code implementation • 27 May 2021 • Jingyi Xu, Tony Q. S. Quek, Kai Fong Ernest Chong
In particular, we shall assume that a small subset of any given noisy dataset is known to have correct labels, which we treat as "positive", while the remaining noisy subset is treated as "unlabeled".
Ranked #7 on Image Classification on Clothing1M (using clean data) (using extra training data)
no code implementations • ICCV 2021 • Jingyi Xu, Hieu Le, Mingzhen Huang, ShahRukh Athar, Dimitris Samaras
We assume that the distribution of intra-class variance generalizes across the base class and the novel class.
Ranked #16 on Few-Shot Image Classification on CUB 200 5-way 5-shot
no code implementations • 7 Oct 2020 • Jingyi Xu, Zhixin Shu, Dimitris Samaras
However, some testing data are considered "hard" as they lie close to the decision boundaries and are prone to misclassification, leading to performance degradation for ZSL.
no code implementations • 5 Mar 2020 • Jeffrey Ichnowski, Michael Danielczuk, Jingyi Xu, Vishal Satish, Ken Goldberg
Rapid and reliable robot bin picking is a critical challenge in automating warehouses, often measured in picks-per-hour (PPH).
Robotics
1 code implementation • ICML 2018 • Jingyi Xu, Zilu Zhang, Tal Friedman, Yitao Liang, Guy Van Den Broeck
This paper develops a novel methodology for using symbolic knowledge in deep learning.