Search Results for author: Rongtao Xu

Found 12 papers, 9 papers with code

HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection

1 code implementation16 Mar 2024 Shibiao Xu, ShuChen Zheng, Wenhao Xu, Rongtao Xu, Changwei Wang, Jiguang Zhang, Xiaoqiang Teng, Ao Li, Li Guo

Infrared small object detection is an important computer vision task involving the recognition and localization of tiny objects in infrared images, which usually contain only a few pixels.

object-detection Small Object Detection

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

no code implementations24 Feb 2024 Jiazhao Zhang, Kunyu Wang, Rongtao Xu, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang

Vision-and-Language Navigation (VLN) stands as a key research problem of Embodied AI, aiming at enabling agents to navigate in unseen environments following linguistic instructions.

Decision Making Instruction Following +3

Local Feature Matching Using Deep Learning: A Survey

1 code implementation31 Jan 2024 Shibiao Xu, Shunpeng Chen, Rongtao Xu, Changwei Wang, Peng Lu, Li Guo

The objective of this endeavor is to furnish a comprehensive overview of local feature matching methods.

3D Reconstruction Image Registration +3

The Development of LLMs for Embodied Navigation

1 code implementation1 Nov 2023 Jinzhou Lin, Han Gao, Xuxiang Feng, Rongtao Xu, Changwei Wang, Man Zhang, Li Guo, Shibiao Xu

This article offers an exhaustive summary of the symbiosis between LLMs and embodied intelligence with a focus on navigation.

Decision Making

Segment Anything Model is a Good Teacher for Local Feature Learning

1 code implementation29 Sep 2023 Jingqian Wu, Rongtao Xu, Zach Wood-Doughty, Changwei Wang, Shibiao Xu, Edmund Lam

To do so, first, we construct an auxiliary task of Pixel Semantic Relational Distillation (PSRD), which distillates feature relations with category-agnostic semantic information learned by the SAM encoder into a local feature learning network, to improve local feature description using semantic discrimination.

Contrastive Learning Visual Localization

Task Relation Distillation and Prototypical Pseudo Label for Incremental Named Entity Recognition

1 code implementation17 Aug 2023 Duzhen Zhang, Hongliu Li, Wei Cong, Rongtao Xu, Jiahua Dong, Xiuyi Chen

However, INER faces the challenge of catastrophic forgetting specific for incremental learning, further aggravated by background shift (i. e., old and future entity types are labeled as the non-entity type in the current task).

Incremental Learning named-entity-recognition +3

Attention Weighted Local Descriptors

1 code implementation journal 2023 Changwei Wang, Rongtao Xu, Ke Lu, Shibiao Xu, Weiliang Meng, Yuyang Zhang, Bin Fan, Xiaopeng Zhang

Local features detection and description are widely used in many vision applications with high industrial and commercial demands.

3D Reconstruction Homography Estimation +2

Self Correspondence Distillation for End-to-End Weakly-Supervised Semantic Segmentation

1 code implementation27 Feb 2023 Rongtao Xu, Changwei Wang, Jiaxi Sun, Shibiao Xu, Weiliang Meng, Xiaopeng Zhang

In addition, to further improve the segmentation accuracy, we design a Variation-aware Refine Module to enhance the local consistency of pseudo-labels by computing pixel-level variation.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

MTLDesc: Looking Wider to Describe Better

1 code implementation14 Mar 2022 Changwei Wang, Rongtao Xu, Yuyang Zhang, Shibiao Xu, Weiliang Meng, Bin Fan, Xiaopeng Zhang

Limited by the locality of convolutional neural networks, most existing local features description methods only learn local descriptors with local information and lack awareness of global and surrounding spatial context.

Indoor Localization

Accurate Lung Nodules Segmentation with Detailed Representation Transfer and Soft Mask Supervision

no code implementations29 Jul 2020 Changwei Wang, Rongtao Xu, Shibiao Xu, Weiliang Meng, Jun Xiao, Xiaopeng Zhang

Then, a novel Network with detailed representation transfer and Soft Mask supervision (DSNet) is proposed to process the input low-resolution images of lung nodules into high-quality segmentation results.

Computed Tomography (CT) Lesion Segmentation +3

Cannot find the paper you are looking for? You can Submit a new open access paper.