Search Results for author: Jingyi Xu

Found 37 papers, 17 papers with code

Counting Stacked Objects from Multi-View Images

no code implementations28 Nov 2024 Corentin Dumery, Noa Etté, Jingyi Xu, Aoxiang Fan, Ren Li, Hieu Le, Pascal Fua

Visual object counting is a fundamental computer vision task underpinning numerous real-world applications, from cell counting in biomedicine to traffic and wildlife monitoring.

3D geometry Object Counting

Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds

no code implementations27 Nov 2024 Shuangqi Li, Hieu Le, Jingyi Xu, Mathieu Salzmann

To improve the model's compositional ability, we propose a method for mining these reliable cases, resulting in a curated training set of generated images without requiring any manual annotation.

Text-to-Image Generation

Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting

no code implementations21 Nov 2024 Jingyi Xu, Xieyuanli Chen, Junyi Ma, Jiawei Huang, Jintao Xu, Yue Wang, Ling Pei

Existing 3D OCF approaches struggle to predict plausible spatial details for movable objects and suffer from slow inference speeds due to neglecting the bias and uneven distribution of changing occupancy states in both space and time.

SIFM: A Foundation Model for Multi-granularity Arctic Sea Ice Forecasting

no code implementations16 Oct 2024 Jingyi Xu, Yeqi Luo, Weidong Yang, Keyi Liu, Shengnan Wang, Ben Fei, Lei Bai

Arctic sea ice performs a vital role in global climate and has paramount impacts on both polar ecosystems and coastal communities.

IceDiff: High Resolution and High-Quality Sea Ice Forecasting with Generative Diffusion Prior

no code implementations10 Oct 2024 Jingyi Xu, Siwei Tu, Weidong Yang, Shuhao Li, Keyi Liu, Yeqi Luo, Lipeng Ma, Ben Fei, Lei Bai

Variation of Arctic sea ice has significant impacts on polar ecosystems, transporting routes, coastal communities, and global climate.

GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving

1 code implementation1 Oct 2024 Zhangshuo Qi, Junyi Ma, Jingyi Xu, Zijie Zhou, Luqi Cheng, Guangming Xiong

Place recognition is a crucial module to ensure autonomous vehicles obtain usable localization information in GPS-denied environments.

Autonomous Driving

Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation

no code implementations29 Sep 2024 Jingyi Xu, Hieu Le, Zhixin Shu, Yang Wang, Yi-Hsuan Tsai, Dimitris Samaras

The training signals for this predictor are obtained through our emotion-agnostic intensity pseudo-labeling method without the need of frame-wise intensity labeling.

Talking Head Generation

GS-PT: Exploiting 3D Gaussian Splatting for Comprehensive Point Cloud Understanding via Self-supervised Learning

no code implementations8 Sep 2024 Keyi Liu, Yeqi Luo, Weidong Yang, Jingyi Xu, Zhijun Li, Wen-Ming Chen, Ben Fei

3DGS utilizes multi-view rendered images as input to generate enhanced point cloud distributions and novel view images, facilitating data augmentation and cross-modal contrastive learning.

3D Object Classification Contrastive Learning +3

MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos

no code implementations4 Sep 2024 Junyi Ma, Xieyuanli Chen, Wentao Bao, Jingyi Xu, Hesheng Wang

Understanding human intentions and actions through egocentric videos is important on the path to embodied artificial intelligence.

Denoising Mamba +2

Assessing Sample Quality via the Latent Space of Generative Models

1 code implementation21 Jul 2024 Jingyi Xu, Hieu Le, Dimitris Samaras

We show that the proposed score correlates highly with the sample quality for various generative models including VAEs, GANs and Latent Diffusion Models.

Few-Shot Image Classification

RefGaussian: Disentangling Reflections from 3D Gaussian Splatting for Realistic Rendering

no code implementations9 Jun 2024 Rui Zhang, Tianyue Luo, Weidong Yang, Ben Fei, Jingyi Xu, Qingyuan Zhou, Keyi Liu, Ying He

3D Gaussian Splatting (3D-GS) has made a notable advancement in the field of neural rendering, 3D scene reconstruction, and novel view synthesis.

3D Scene Reconstruction Depth Estimation +2

Lightening Anything in Medical Images

no code implementations1 Jun 2024 Ben Fei, Yixuan Li, Weidong Yang, Hengjun Gao, Jingyi Xu, Lipeng Ma, Yatian Yang, Pinghong Zhou

The development of medical imaging techniques has made a significant contribution to clinical decision-making.

Decision Making Image Enhancement +1

Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos

1 code implementation7 May 2024 Junyi Ma, Jingyi Xu, Xieyuanli Chen, Hesheng Wang

Understanding how humans would behave during hand-object interaction is vital for applications in service robot manipulation and extended reality.

Denoising Object +1

MFA-Net: Multi-Scale feature fusion attention network for liver tumor segmentation

no code implementations7 May 2024 Yanli Yuan, Bingbing Wang, Chuan Zhang, Jingyi Xu, Ximeng Liu, Liehuang Zhu

Though recent methods based on Fully Convolutional Neural Networks (F-CNNs) have shown success in many segmentation tasks, fusing features from images with different scales is still a challenge: (1) Due to the lack of spatial awareness, F-CNNs share the same weights at different spatial locations.

Segmentation Tumor Segmentation

Unifying Lane-Level Traffic Prediction from a Graph Structural Perspective: Benchmark and Baseline

1 code implementation22 Mar 2024 Shuhao Li, Yue Cui, Jingyi Xu, Libin Li, Lingkai Meng, Weidong Yang, Fan Zhang, Xiaofang Zhou

Traffic prediction has long been a focal and pivotal area in research, witnessing both significant strides from city-level to road-level predictions in recent years.

Autonomous Driving Traffic Prediction

Visual Foundation Models Boost Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation

1 code implementation15 Mar 2024 Jingyi Xu, Weidong Yang, Lingdong Kong, Youquan Liu, Rui Zhang, Qingyuan Zhou, Ben Fei

Then, another VFM trained on fine-grained 2D masks is adopted to guide the generation of semantically augmented images and point clouds to enhance the performance of neural networks, which mix the data from source and target domains like view frustums (FrustumMixing).

3D Semantic Segmentation Autonomous Driving +2

Explicit Interaction for Fusion-Based Place Recognition

2 code implementations27 Feb 2024 Jingyi Xu, Junyi Ma, Qi Wu, Zijie Zhou, Yue Wang, Xieyuanli Chen, Ling Pei

Fusion-based place recognition is an emerging technique jointly utilizing multi-modal perception data, to recognize previously visited places in GPS-denied scenarios for robots and autonomous vehicles.

Autonomous Vehicles

3D Gaussian as a New Era: A Survey

no code implementations11 Feb 2024 Ben Fei, Jingyi Xu, Rui Zhang, Qingyuan Zhou, Weidong Yang, Ying He

3D Gaussian Splatting (3D-GS) has emerged as a significant advancement in the field of Computer Graphics, offering explicit scene representation and novel view synthesis without the reliance on neural networks, such as Neural Radiance Fields (NeRF).

Autonomous Navigation Novel View Synthesis +1

Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications

1 code implementation CVPR 2024 Junyi Ma, Xieyuanli Chen, Jiawei Huang, Jingyi Xu, Zhen Luo, Jintao Xu, Weihao Gu, Rui Ai, Hesheng Wang

Furthermore, the standardized evaluation protocol for preset multiple tasks is also provided to compare the performance of all the proposed baselines on present and future occupancy estimation with respect to objects of interest in autonomous driving scenarios.

Autonomous Driving

LCPR: A Multi-Scale Attention-Based LiDAR-Camera Fusion Network for Place Recognition

1 code implementation6 Nov 2023 Zijie Zhou, Jingyi Xu, Guangming Xiong, Junyi Ma

However, most existing multimodal place recognition methods only use limited field-of-view camera images, which leads to an imbalance between features from different modalities and limits the effectiveness of sensor fusion.

Autonomous Vehicles Sensor Fusion

Zero-Shot Object Counting with Language-Vision Models

no code implementations22 Sep 2023 Jingyi Xu, Hieu Le, Dimitris Samaras

Thus, we propose zero-shot object counting (ZSC), a new setting where only the class name is available during test time.

Object Object Counting

Enforcing 3D Topological Constraints in Composite Objects via Implicit Functions

no code implementations16 Jul 2023 Hieu Le, Jingyi Xu, Nicolas Talabot, Jiancheng Yang, Pascal Fua

Medical applications often require accurate 3D representations of complex organs with multiple parts, such as the heart and spine.

3D Object Reconstruction 3D Reconstruction +1

Learning from Pseudo-labeled Segmentation for Multi-Class Object Counting

no code implementations15 Jul 2023 Jingyi Xu, Hieu Le, Dimitris Samaras

In this paper, we point out that the task of counting objects of interest when there are multiple object classes in the image (namely, multi-class object counting) is particularly challenging for current object counting models.

Object Object Counting +1

Generating Features with Increased Crop-related Diversity for Few-Shot Object Detection

no code implementations CVPR 2023 Jingyi Xu, Hieu Le, Dimitris Samaras

To mitigate this issue, we propose a novel variational autoencoder (VAE) based data generation model, which is capable of generating data with increased crop-related diversity.

Diversity Few-Shot Object Detection +1

Zero-shot Object Counting

1 code implementation CVPR 2023 Jingyi Xu, Hieu Le, Vu Nguyen, Viresh Ranjan, Dimitris Samaras

By applying this model to all the candidate patches, we can select the most suitable patches as exemplars for counting.

Object Object Counting +1

CVTNet: A Cross-View Transformer Network for Place Recognition Using LiDAR Data

1 code implementation3 Feb 2023 Junyi Ma, Guangming Xiong, Jingyi Xu, Xieyuanli Chen

LiDAR-based place recognition (LPR) is one of the most crucial components of autonomous vehicles to identify previously visited places in GPS-denied environments.

Autonomous Vehicles

SeqOT: A Spatial-Temporal Transformer Network for Place Recognition Using Sequential LiDAR Data

1 code implementation16 Sep 2022 Junyi Ma, Xieyuanli Chen, Jingyi Xu, Guangming Xiong

It uses multi-scale transformers to generate a global descriptor for each sequence of LiDAR range images in an end-to-end fashion.

Autonomous Vehicles

Contemporary Recommendation Systems on Big Data and Their Applications: A Survey

no code implementations31 May 2022 Ziyuan Xia, Anchen Sun, Jingyi Xu, Yuanzhe Peng, Rui Ma, Minghui Cheng

This survey paper conducts a comprehensive analysis of the evolution and contemporary landscape of recommendation systems, which have been extensively incorporated across a myriad of web applications.

Collaborative Filtering Recommendation Systems +1

Generating Representative Samples for Few-Shot Classification

1 code implementation CVPR 2022 Jingyi Xu, Hieu Le

To mitigate this issue, we propose to generate visual samples based on semantic embeddings using a conditional variational autoencoder (CVAE) model.

Few-Shot Learning General Classification

FedCorr: Multi-Stage Federated Learning for Label Noise Correction

1 code implementation CVPR 2022 Jingyi Xu, Zihan Chen, Tony Q. S. Quek, Kai Fong Ernest Chong

Although there exist methods in centralized learning for tackling label noise, such methods do not perform well on heterogeneous label noise in FL settings, due to the typically smaller sizes of client datasets and data privacy requirements in FL.

Federated Learning Privacy Preserving

A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric Vehicles

1 code implementation22 Feb 2022 Jingyi Xu, Zirui Li, Li Gao, Junyi Ma, Qi Liu, Yanan Zhao

Different exploration methods of DRL, including adding action space noise and parameter space noise, are compared against each other in the transfer learning process in this work.

Deep Reinforcement Learning energy management +4

Training Classifiers that are Universally Robust to All Label Noise Levels

1 code implementation27 May 2021 Jingyi Xu, Tony Q. S. Quek, Kai Fong Ernest Chong

In particular, we shall assume that a small subset of any given noisy dataset is known to have correct labels, which we treat as "positive", while the remaining noisy subset is treated as "unlabeled".

Ranked #7 on Image Classification on Clothing1M (using clean data) (using extra training data)

Image Classification

Learning Clusterable Visual Features for Zero-Shot Recognition

no code implementations7 Oct 2020 Jingyi Xu, Zhixin Shu, Dimitris Samaras

However, some testing data are considered "hard" as they lie close to the decision boundaries and are prone to misclassification, leading to performance degradation for ZSL.

Classification Few-Shot Learning +2

GOMP: Grasp-Optimized Motion Planning for Bin Picking

no code implementations5 Mar 2020 Jeffrey Ichnowski, Michael Danielczuk, Jingyi Xu, Vishal Satish, Ken Goldberg

Rapid and reliable robot bin picking is a critical challenge in automating warehouses, often measured in picks-per-hour (PPH).

Robotics

Cannot find the paper you are looking for? You can Submit a new open access paper.