no code implementations • 2 Apr 2025 • Shu Han, Xubo Zhu, Ji Wu, Ximeng Cai, Wen Yang, Huai Yu, Gui-Song Xia
DF-Calib estimates a dense depth map from the camera image and completes the sparse LiDAR projected depth map, using a shared feature encoder to extract consistent depth-to-depth features, effectively bridging the 2D-3D cross-modal gap.
no code implementations • 13 Feb 2025 • Yan Zhang, Wen Yang, Chang Xu, Qian Hu, Fang Xu, Gui-Song Xia
For instance, a slight deviation of a tiny object in the thermal modality will induce it to drift from the main body of itself in the RGB modality.
no code implementations • 16 Dec 2024 • Chang Xu, Ruixiang Zhang, Wen Yang, Haoran Zhu, Fang Xu, Jian Ding, Gui-Song Xia
Detecting oriented tiny objects, which are limited in appearance information yet prevalent in real-world applications, remains an intricate and under-explored problem.
no code implementations • 8 Dec 2024 • Haoran Zhu, Chang Xu, Ruixiang Zhang, Fang Xu, Wen Yang, Haijian Zhang, Gui-Song Xia
To handle label noise from scale ambiguity and location shifts in point annotations, Point Teacher employs the teacher-student architecture and decouples the learning into a two-phase denoising process.
no code implementations • 8 Dec 2024 • Damien de Mijolla, Wen Yang, Philippa Duckett, Christopher Frye, Mark Worrall
Fine-tuning removes the need for task-specific demonstrations of tool usage at runtime; however, this ties new capabilities to a single model, thus making already-heavier setup costs a recurring expense.
no code implementations • 5 Dec 2024 • Haitian Zhang, Xiangyuan Wang, Chang Xu, Xinya Wang, Fang Xu, Huai Yu, Lei Yu, Wen Yang
We further propose a training strategy, Time Shift, which enforces the module to align the prediction from temporally shifted Event-RGB pairs and their original representation, that is, consistent with Event-aligned annotations.
no code implementations • 4 Dec 2024 • Ji Wu, Huai Yu, Shu Han, Xi-Meng Cai, Ming-Feng Wang, Wen Yang, Gui-Song Xia
In the realm of large-scale point cloud registration, designing a compact symbolic representation is crucial for efficiently processing vast amounts of data, ensuring registration robustness against significant viewpoint variations and occlusions.
1 code implementation • 29 Nov 2024 • Wanyue Zhang, Ziyong Li, Wen Yang, Chunlin Leng, Yinan Bai, Qianlong Du, Chengqing Zong, Jiajun Zhang
With this approach, we release the largest, high-quality and fine-grained Chinese text ChineseWebText2. 0, which consists of 3. 8TB and each text is associated with a quality score, domain labels, a toxicity label and a toxicity score, facilitating the LLM researchers to select data based on various types of fine-grained information.
no code implementations • 22 Nov 2024 • Haoyuan Li, Chang Xu, Wen Yang, Li Mi, Huai Yu, Haijian Zhang
As such, our unsupervised paradigm naturally avoids the problem of region-specific overfitting, enabling generic CVGL for UAV images without feature fine-tuning or data-driven training.
no code implementations • 23 Oct 2024 • Wen Yang, Kai Fan, Minpeng Liao
Chain of Thought (CoT) of multi-step benefits from the logical structure of the reasoning steps and task-specific actions, significantly enhancing the mathematical reasoning capabilities of large language models.
1 code implementation • 11 Oct 2024 • Wen Yang, Junhong Wu, Chen Wang, Chengqing Zong, Jiajun Zhang
Large Language Models (LLMs) have achieved state-of-the-art performance across numerous tasks.
no code implementations • 2 Oct 2024 • Jing Luo, Run Luo, Longze Chen, Liang Zhu, Chang Ao, Jiaming Li, Yukun Chen, Xin Cheng, Wen Yang, Jiayuan Su, Chengming Li, Min Yang
To bridge this gap, we propose a data augmentation approach and introduce PersonaMathQA, a dataset derived from MATH and GSM8K, on which we train the PersonaMath models.
1 code implementation • 25 Jul 2024 • Haoran Zhu, Yifan Zhou, Chang Xu, Ruixiang Zhang, Wen Yang
This letter introduces Orthogonal Mapping (OM), a simple yet effective method aimed at addressing the challenge of semantic confusion inherent in FGOD.
1 code implementation • 30 May 2024 • Chong Li, Wen Yang, Jiajun Zhang, Jinliang Lu, Shaonan Wang, Chengqing Zong
In addition, we find that models tuned on cross-lingual instruction following samples can follow the instruction in the output language without further tuning.
no code implementations • 20 May 2024 • Yihan Wu, Tao Chang, Siliang Chen, Xiaodong Niu, Yu Li, Yuan Fang, Lei Yang, Yixuan Zong, Yaoxin Yang, Yuehua Li, Mengsong Wang, Wen Yang, Yixuan Wu, Chen Fu, Xia Fang, Yuxin Quan, Xilin Peng, Qiang Sun, Marc M. Van Hulle, Yanhui Liu, Ning Jiang, Dario Farina, Yuan Yang, Jiayuan He, Qing Mao
Glioma cells can reshape functional neuronal networks by hijacking neuronal synapses, leading to partial or complete neurological dysfunction.
1 code implementation • 8 Apr 2024 • Haitian Zhang, Chang Xu, Xinya Wang, Bingde Liu, Guang Hua, Lei Yu, Wen Yang
Object detection is critical in autonomous driving, and it is more practical yet challenging to localize objects of unknown categories: an endeavour known as Class-Agnostic Object Detection (CAOD).
no code implementations • 25 Mar 2024 • Wenhao Lin, Yuqing Ni, Wen Yang, Chao Yang
Under the given threshold of the control performance loss, a trade-off optimization problem is proposed.
no code implementations • 20 Mar 2024 • Li Mi, Chang Xu, Javiera Castillo-Navarro, Syrielle Montariol, Wen Yang, Antoine Bosselut, Devis Tuia
Cross-view geo-localization aims at localizing a ground-level query image by matching it to its corresponding geo-referenced aerial view.
no code implementations • 19 Mar 2024 • Haoyuan Li, Chang Xu, Wen Yang, Huai Yu, Gui-Song Xia
We observe that training on unlabeled cross-view images presents significant challenges, including the need to establish relationships within unlabeled data and reconcile view discrepancies between uncertain queries and references.
1 code implementation • 16 Jan 2024 • Haoran Zhu, Chang Xu, Wen Yang, Ruixiang Zhang, Yan Zhang, Gui-Song Xia
In this study, we address the intricate issue of tiny object detection under noisy label supervision.
no code implementations • 23 Oct 2023 • Ruixiang Zhang, Chang Xu, Fang Xu, Wen Yang, Guangjun He, Huai Yu, Gui-Song Xia
This paper focuses on the scale imbalance problem of semi-supervised object detection(SSOD) in aerial images.
1 code implementation • 29 Sep 2023 • Chi Zhang, Xiang Zhang, Mingyuan Lin, Cheng Li, Chu He, Wen Yang, Gui-Song Xia, Lei Yu
Even though the collaboration between traditional and neuromorphic event cameras brings prosperity to frame-event based vision applications, the performance is still confined by the resolution gap crossing two modalities in both spatial and temporal domains.
1 code implementation • 25 Sep 2023 • Ji Wu, Huai Yu, Wen Yang, Gui-Song Xia
This paper presents a novel framework to learn a concise geometric primitive representation for 3D point clouds.
1 code implementation • ICCV 2023 • Xiang Zhang, Lei Yu, Wen Yang, Jianzhuang Liu, Gui-Song Xia
Event-based motion deblurring has shown promising results by exploiting low-latency events.
2 code implementations • 29 May 2023 • Wen Yang, Chong Li, Jiajun Zhang, Chengqing Zong
Second, we continue training the model with a large-scale parallel dataset that covers 102 natural languages.
Ranked #4 on
Machine Translation
on FLoRes-200
1 code implementation • CVPR 2023 • Chang Xu, Jian Ding, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, Gui-Song Xia
Despite the exploration of adaptive label assignment in recent oriented object detectors, the extreme geometry shape and limited feature of oriented tiny objects still induce severe mismatch and imbalance issues.
Ranked #4 on
Oriented Object Detection
on DOTA 2.0
no code implementations • 14 Apr 2023 • Yangguang Wang, Xiang Zhang, Mingyuan Lin, Lei Yu, Boxin Shi, Wen Yang, Gui-Song Xia
Scene Dynamic Recovery (SDR) by inverting distorted Rolling Shutter (RS) images to an undistorted high frame-rate Global Shutter (GS) video is a severely ill-posed problem due to the missing temporal dynamic information in both RS intra-frame scanlines and inter-frame exposures, particularly when prior knowledge about camera/object motions is unavailable.
no code implementations • 5 Apr 2023 • Zhangyi Cheng, Xiang Zhang, Lei Yu, Jianzhuang Liu, Wen Yang, Gui-Song Xia
This paper aims at demystifying a single motion-blurred image with events and revealing temporally continuous scene dynamics encrypted behind motion blurs.
1 code implementation • 27 Feb 2023 • Lei Yu, Bishan Wang, Xiang Zhang, Haijian Zhang, Wen Yang, Jianzhuang Liu, Gui-Song Xia
Super-Resolution from a single motion Blurred image (SRB) is a severely ill-posed problem due to the joint degradation of motion blurs and low spatial resolution.
1 code implementation • 9 Jan 2023 • Fang Xu, Yilei Shi, Patrick Ebel, Wen Yang, Xiao Xiang Zhu
With this dataset, we consider the problem of cloud removal in high-resolution optical remote sensing imagery by integrating multi-modal and multi-resolution information.
no code implementations • 6 Dec 2022 • Zhipeng Zhao, Huai Yu, Chenwei Lyv, Wen Yang, Sebastian Scherer
To overcome this limitation, we focus on correlating the information of 360 equirectangular images to point clouds, proposing an end-to-end learnable network to conduct cross-modal visual localization by establishing similarity in high-dimensional feature space.
no code implementations • 5 Dec 2022 • Lei Yu, Xiang Zhang, Wei Liao, Wen Yang, Gui-Song Xia
Although synthetic aperture imaging (SAI) can achieve the seeing-through effect by blurring out off-focus foreground occlusions while recovering in-focus occluded scenes from multi-view images, its performance is often deteriorated by dense occlusions and extreme lighting conditions.
1 code implementation • 14 Nov 2022 • Huai Yu, Hao Li, Wen Yang, Lei Yu, Gui-Song Xia
To robustly detect line segments over motion blurs, we propose to leverage the complementary information of images and events.
no code implementations • 24 Aug 2022 • Wen Yang, Rui Wang, Yanchao Zhang
However, the ND-MLS method has stable performance and obtains 96. 5 top-1 acc in Res-Net on 100 different handwritten character classification tasks; 2) in segmentation, under the premise of only ten original images, DeepLab obtains 93. 5%, 85%, and 73. 3% m_IOU(10) on the bottle, horse, and grass test datasets, respectively, while the cat test dataset obtains 86. 7% m_IOU(10) with the SegNet model; 3) with only 10 original images from each category in object detection, YOLO v4 obtains 100% and 97. 2% bottle and horse detection, respectively, while the cat dataset obtains 93. 6% with YOLO v3.
1 code implementation • 24 Aug 2022 • Bingde Liu, Chang Xu, Wen Yang, Huai Yu, Lei Yu
In this work, we propose a motion robust and high-speed detection pipeline which better leverages the event data.
1 code implementation • 18 Aug 2022 • Chang Xu, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, Gui-Song Xia
Then, instead of assigning samples with IoU or center sampling strategy, a new Receptive Field Distance (RFD) is proposed to directly measure the similarity between the Gaussian receptive field and ground truth.
Ranked #2 on
Object Detection
on AI-TOD
no code implementations • 8 Jul 2022 • Chao Yang, Wen Yang, Hongbo Shi
In this paper, we study the privacy preservation problem in a cooperative networked control system working for the task of LQG control.
1 code implementation • 28 Jun 2022 • Chang Xu, Jinwang Wang, Wen Yang, Huai Yu, Lei Yu, Gui-Song Xia
Tiny object detection (TOD) in aerial images is challenging since a tiny object only contains a few pixels.
1 code implementation • 6 Jun 2022 • Fang Xu, Yilei Shi, Patrick Ebel, Lei Yu, Gui-Song Xia, Wen Yang, Xiao Xiang Zhu
The challenge of the cloud removal task can be alleviated with the aid of Synthetic Aperture Radar (SAR) images that can penetrate cloud cover.
Ranked #3 on
Cloud Removal
on SEN12MS-CR
1 code implementation • 28 Apr 2022 • Jinwang Wang, Lingxuan Meng, Weijia Li, Wen Yang, Lei Yu, Gui-Song Xia
In this paper, we propose an offset vector learning scheme, which turns the building footprint extraction problem in off-nadir images into an instance-level joint prediction problem of the building roof and its corresponding "roof to footprint" offset vector.
1 code implementation • CVPR 2022 • Wei Liao, Xiang Zhang, Lei Yu, ShiJie Lin, Wen Yang, Ning Qiao
This paper addresses this problem by leveraging the merits of both events and frames, leading to a fusion-based SAI (EF-SAI) that performs consistently under the different densities of occlusions.
1 code implementation • 24 Nov 2021 • Yao Lu, Wen Yang, Yunzhe Zhang, Zuohui Chen, Jinyin Chen, Qi Xuan, Zhen Wang, Xiaoniu Yang
Specifically, we model the process of class separation of intermediate representations in pre-trained DNNs as the evolution of communities in dynamic graphs.
1 code implementation • 22 Nov 2021 • Zuohui Chen, Yao Lu, Jinxuan Hu, Wen Yang, Qi Xuan, Zhen Wang, Xiaoniu Yang
Understanding the black-box representations in Deep Neural Networks (DNN) is an essential problem in deep learning.
1 code implementation • 18 Nov 2021 • Wen Yang, Zheng Gong, Baifu Huang, Xiaoping Hong
Lidar point cloud distortion from moving object is an important problem in autonomous driving, and recently becomes even more demanding with the emerging of newer lidars, which feature back-and-forth scanning patterns.
3 code implementations • 26 Oct 2021 • Jinwang Wang, Chang Xu, Wen Yang, Lei Yu
Our key observation is that Intersection over Union (IoU) based metrics such as IoU itself and its extensions are very sensitive to the location deviation of the tiny objects, and drastically deteriorate the detection performance when used in anchor-based detectors.
Ranked #3 on
Object Detection
on AI-TOD
no code implementations • ICCV 2021 • Fang Xu, Lei Yu, Bishan Wang, Wen Yang, Gui-Song Xia, Xu Jia, Zhendong Qiao, Jianzhuang Liu
In this paper, we propose an end-to-end learning framework for event-based motion deblurring in a self-supervised manner, where real-world events are exploited to alleviate the performance degradation caused by data inconsistency.
no code implementations • 7 Jul 2021 • Jakob Gawlikowski, Cedrique Rovile Njieutcheu Tassi, Mohsin Ali, JongSeok Lee, Matthias Humt, Jianxiang Feng, Anna Kruspe, Rudolph Triebel, Peter Jung, Ribana Roscher, Muhammad Shahzad, Wen Yang, Richard Bamler, Xiao Xiang Zhu
Different examples from the wide spectrum of challenges in different fields give an idea of the needs and challenges regarding uncertainties in practical applications.
no code implementations • 11 Mar 2021 • Shu-Hui Zhang, Jin Yang, Ding-Fu Shao, Zhenhua Wu, Wen Yang
Friedel oscillation is a well-known wave phenomenon, which represents the oscillatory response of electron waves to imperfection.
Mesoscale and Nanoscale Physics
1 code implementation • CVPR 2021 • Xiang Zhang, Wei Liao, Lei Yu, Wen Yang, Gui-Song Xia
Synthetic aperture imaging (SAI) is able to achieve the see through effect by blurring out the off-focus foreground occlusions and reconstructing the in-focus occluded targets from multi-view images.
2 code implementations • 24 Feb 2021 • Jian Ding, Nan Xue, Gui-Song Xia, Xiang Bai, Wen Yang, Micheal Ying Yang, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang
In this paper, we present a large-scale Dataset of Object deTection in Aerial images (DOTA) and comprehensive baselines for ODAI.
1 code implementation • International Conference on Pattern Recognition (ICPR) 2021 • Jinwang Wang, Wen Yang, Haowen Guo, Ruixiang Zhang, Gui-Song Xia
To build a benchmark for tiny object detection in aerial images, we evaluate the state-of-the-art object detectors on our AI-TOD dataset.
Ranked #5 on
Object Detection
on AI-TOD
2 code implementations • 6 Nov 2020 • Hao Li, Huai Yu, Wen Yang, Lei Yu, Sebastian Scherer
Targeting at the unified line segment detection (ULSD) for both distorted and undistorted images, we propose to represent line segments with the Bezier curve model.
Ranked #5 on
Line Segment Detection
on wireframe dataset
(sAP10 metric)
1 code implementation • 12 Oct 2020 • Kunping Yang, Gui-Song Xia, Zicheng Liu, Bo Du, Wen Yang, Marcello Pelillo, Liangpei Zhang
Given two multi-temporal aerial images, semantic change detection aims to locate the land-cover variations and identify their change types with pixel-wise boundaries.
no code implementations • 30 Aug 2020 • Aleks Jevnikar, Jun Wang, Wen Yang
In the present paper we derive Liouville type results and existence of periodic solutions for $\chi^{(2)}$ type systems with non-homogeneous nonlinearities.
Analysis of PDEs 35K9, 35J61, 35B45
no code implementations • 14 Aug 2020 • Wensheng Cheng, Hao Luo, Wen Yang, Lei Yu, Wei Li
We then propose a structure-aware network for lane marker extraction in DVS images.
1 code implementation • ECCV 2020 • Bishan Wang, Jingwei He, Lei Yu, Gui-Song Xia, Wen Yang
To recover high-quality intensity images, one should address both denoising and super-resolution problems for event cameras.
no code implementations • 13 Jul 2020 • Jiawei Shen, Zhuoyan Li, Lei Yu, Gui-Song Xia, Wen Yang
Deep convolutional neural networks (CNN) have been applied for image dehazing tasks, where the residual network (ResNet) is often adopted as the basic component to avoid the vanishing gradient problem.
1 code implementation • 28 Jun 2020 • Emanuele Dalsasso, Xiangli Yang, Loïc Denis, Florence Tupin, Wen Yang
Many different schemes have been proposed for the restoration of intensity SAR images.
1 code implementation • 22 Jun 2020 • Yang Long, Gui-Song Xia, Shengyang Li, Wen Yang, Michael Ying Yang, Xiao Xiang Zhu, Liangpei Zhang, Deren Li
After reviewing existing benchmark datasets in the research community of RS image interpretation, this article discusses the problem of how to efficiently prepare a suitable benchmark dataset for RS image interpretation.
no code implementations • 26 Apr 2020 • Rui Peng, David Navarro-Alarcon, Victor Wu, Wen Yang
In this paper, in order to pursue high-efficiency robotic arc welding tasks, we propose a method based on point cloud acquired by an RGB-D sensor.
Robotics
1 code implementation • 1 Apr 2020 • Huai Yu, Weikun Zhen, Wen Yang, Ji Zhang, Sebastian Scherer
With the pose prediction from VIO, we can efficiently obtain coarse 2D-3D line correspondences.
no code implementations • 7 Mar 2020 • Wensheng Cheng, Yan Zhang, Xu Lei, Wen Yang, Gui-Song Xia
Change detection is an important problem in vision field, especially for aerial images.
no code implementations • 2 Mar 2020 • Fang Xu, ShiJie Lin, Wen Yang, Lei Yu, Dengxin Dai, Gui-Song Xia
The event camera has appealing properties: high dynamic range, low latency, low power consumption and low memory usage, and thus provides complementariness to conventional frame-based cameras.
no code implementations • 23 Nov 2019 • Huai Yu, Weikun Zhen, Wen Yang, Sebastian Scherer
In this paper, we propose a new 2D-3D registration method to estimate 2D-3D line feature correspondences and the camera pose in untextured point clouds of structured environments.
no code implementations • 9 Nov 2018 • Shi-Jie Lin, Jinwang Wang, Wen Yang, Guisong Xia
Autonomous Unmanned Aerial Manipulators (UAMs) have shown promising potentials to transform passive sensing missions into active 3-dimension interactive missions, but they still suffer from some difficulties impeding their wide applications, such as target detection and stabilization.
no code implementations • 4 Jun 2018 • Fan Hu, Gui-Song Xia, Wen Yang, Liangpei Zhang
Scene classification is a fundamental task in interpretation of remote sensing images, and has become an active research topic in remote sensing community due to its important role in a wide range of applications.
no code implementations • 17 Jan 2015 • Gui-Song Xia, Gang Liu, Wen Yang
The segmentation of synthetic aperture radar (SAR) images is a longstanding yet challenging task, not only because of the presence of speckle, but also due to the variations of surface backscattering properties in the images.