Search Results for author: Rongtao Xu

Found 12 papers, 9 papers with code

HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection

1 code implementation • 16 Mar 2024 • Shibiao Xu, ShuChen Zheng, Wenhao Xu, Rongtao Xu, Changwei Wang, Jiguang Zhang, Xiaoqiang Teng, Ao Li, Li Guo

Infrared small object detection is an important computer vision task involving the recognition and localization of tiny objects in infrared images, which usually contain only a few pixels.

object-detection Small Object Detection

Paper
Code

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

no code implementations • 24 Feb 2024 • Jiazhao Zhang, Kunyu Wang, Rongtao Xu, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang

Vision-and-Language Navigation (VLN) stands as a key research problem of Embodied AI, aiming at enabling agents to navigate in unseen environments following linguistic instructions.

Decision Making Instruction Following +3

Paper
Add Code

Local Feature Matching Using Deep Learning: A Survey

1 code implementation • 31 Jan 2024 • Shibiao Xu, Shunpeng Chen, Rongtao Xu, Changwei Wang, Peng Lu, Li Guo

The objective of this endeavor is to furnish a comprehensive overview of local feature matching methods.

3D Reconstruction Image Registration +3

Paper
Code

Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic Segmentation

1 code implementation • 20 Dec 2023 • Wenhao Xu, Rongtao Xu, Changwei Wang, Shibiao Xu, Li Guo, Man Zhang, Xiaopeng Zhang

Recently, CLIP has found practical utility in the domain of pixel-level zero-shot segmentation tasks.

Semantic Segmentation Zero Shot Segmentation +1

Paper
Code

The Development of LLMs for Embodied Navigation

1 code implementation • 1 Nov 2023 • Jinzhou Lin, Han Gao, Xuxiang Feng, Rongtao Xu, Changwei Wang, Man Zhang, Li Guo, Shibiao Xu

This article offers an exhaustive summary of the symbiosis between LLMs and embodied intelligence with a focus on navigation.

Decision Making

Paper
Code

Segment Anything Model is a Good Teacher for Local Feature Learning

1 code implementation • 29 Sep 2023 • Jingqian Wu, Rongtao Xu, Zach Wood-Doughty, Changwei Wang, Shibiao Xu, Edmund Lam

To do so, first, we construct an auxiliary task of Pixel Semantic Relational Distillation (PSRD), which distillates feature relations with category-agnostic semantic information learned by the SAM encoder into a local feature learning network, to improve local feature description using semantic discrimination.

Contrastive Learning Visual Localization

Paper
Code

Task Relation Distillation and Prototypical Pseudo Label for Incremental Named Entity Recognition

1 code implementation • 17 Aug 2023 • Duzhen Zhang, Hongliu Li, Wei Cong, Rongtao Xu, Jiahua Dong, Xiuyi Chen

However, INER faces the challenge of catastrophic forgetting specific for incremental learning, further aggravated by background shift (i. e., old and future entity types are labeled as the non-entity type in the current task).

Incremental Learning named-entity-recognition +3

Paper
Code

Attention Weighted Local Descriptors

1 code implementation • journal 2023 • Changwei Wang, Rongtao Xu, Ke Lu, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

Local features detection and description are widely used in many vision applications with high industrial and commercial demands.

3D Reconstruction Homography Estimation +2

Paper
Code

Self Correspondence Distillation for End-to-End Weakly-Supervised Semantic Segmentation

1 code implementation • 27 Feb 2023 • Rongtao Xu, Changwei Wang, Jiaxi Sun, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

In addition, to further improve the segmentation accuracy, we design a Variation-aware Refine Module to enhance the local consistency of pseudo-labels by computing pixel-level variation.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

Paper
Code

Treating Pseudo-labels Generation as Image Matting for Weakly Supervised Semantic Segmentation

no code implementations • ICCV 2023 • Changwei Wang, Rongtao Xu, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

To solve this problem, we develop a Double Decoupled Class Activation Map (D2CAM) for Mat-Label to generate a high-quality trimap.

Image Matting Metric Learning +2

Paper
Add Code

MTLDesc: Looking Wider to Describe Better

1 code implementation • 14 Mar 2022 • Changwei Wang, Rongtao Xu, Yuyang Zhang, Shibiao Xu, Weiliang Meng, Bin Fan, Xiaopeng Zhang

Limited by the locality of convolutional neural networks, most existing local features description methods only learn local descriptors with local information and lack awareness of global and surrounding spatial context.

Indoor Localization

Paper
Code

Accurate Lung Nodules Segmentation with Detailed Representation Transfer and Soft Mask Supervision

no code implementations • 29 Jul 2020 • Changwei Wang, Rongtao Xu, Shibiao Xu, Weiliang Meng, Jun Xiao, Xiaopeng Zhang

Then, a novel Network with detailed representation transfer and Soft Mask supervision (DSNet) is proposed to process the input low-resolution images of lung nodules into high-quality segmentation results.

Computed Tomography (CT) Lesion Segmentation +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.