Search Results for author: Shaohua Dong

Found 8 papers, 3 papers with code

Beyond MOT: Semantic Multi-Object Tracking

no code implementations • 8 Mar 2024 • Yunhao Li, Hao Wang, Xue Ma, Jiali Yao, Shaohua Dong, Heng Fan, Libo Zhang

Current multi-object tracking (MOT) aims to predict trajectories of targets (i.e., "where") in videos.

VastTrack: Vast Category Visual Object Tracking

1 code implementation • 6 Mar 2024 • Liang Peng, Junyuan Gao, Xinran Liu, Weihong Li, Shaohua Dong, Zhipeng Zhang, Heng Fan, Libo Zhang

The rich annotations of VastTrack enable the development of both vision-only and vision-language tracking.

Object Visual Object Tracking +1

Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning

1 code implementation • 1 Dec 2023 • Shaohua Dong, Yunhe Feng, Qing Yang, Yan Huang, Dongfang Liu, Heng Fan

Existing approaches often fully fine-tune a dual-branch encoder-decoder framework with a complicated feature fusion strategy for achieving multimodal semantic segmentation, which is training-costly due to the massive parameter updates in feature extraction and fusion.

Ranked #2 on Semantic Segmentation on SUN-RGBD (using extra training data)

Object Detection +6

CACFNet: Cross-Modal Attention Cascaded Fusion Network for RGB-T Urban Scene Parsing

no code implementations • journal 2023 • WuJie Zhou, Shaohua Dong, Meixin Fang, Lu Yu

Color–thermal (RGB-T) urban scene parsing has recently attracted widespread interest.

Ranked #5 on Thermal Image Segmentation on PST900

Scene Parsing Thermal Image Segmentation

EGFNet: Edge-Aware Guidance Fusion Network for RGB–Thermal Urban Scene Parsing

no code implementations • journal 2023 • Shaohua Dong, WuJie Zhou, Caie Xu, Weiqing Yan

To address these problems, an edge-aware guidance fusion network (EGFNet) was developed in this study for RGB–thermal urban scene parsing.

Ranked #6 on Thermal Image Segmentation on PST900

Scene Parsing Semantic Segmentation +1

GEBNet: Graph-Enhancement Branch Network for RGB-T Scene Parsing

no code implementations • journal 2022 • Shaohua Dong, WuJie Zhou, Xiaohong Qian, Lu Yu

RGB-T (red–green–blue and thermal) scene parsing has recently drawn considerable research attention.

Ranked #9 on Thermal Image Segmentation on PST900

Scene Parsing Thermal Image Segmentation

MTANet: Multitask-Aware Network With Hierarchical Multimodal Fusion for RGB-T Urban Scene Understanding

no code implementations • journal 2022 • WuJie Zhou, Shaohua Dong, Jingsheng Lei, Lu Yu

To improve the fusion of multimodal features and the segmentation accuracy, we propose a multitask-aware network (MTANet) with hierarchical multimodal fusion (multiscale fusion strategy) for RGB-T urban scene understanding.

Ranked #10 on Thermal Image Segmentation on PST900

Autonomous Vehicles Scene Understanding +2

Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing

1 code implementation • 9 Dec 2021 • WuJie Zhou, Shaohua Dong, Caie Xu, Yaguan Qian

Considering the importance of high-level semantic information, we propose a global information module and a semantic information module to extract rich semantic information from the high-level features.

Ranked #11 on Thermal Image Segmentation on PST900

Scene Parsing Thermal Image Segmentation

