Search Results for author: Shaohua Dong

Found 8 papers, 3 papers with code

Beyond MOT: Semantic Multi-Object Tracking

no code implementations8 Mar 2024 Yunhao Li, Hao Wang, Xue Ma, Jiali Yao, Shaohua Dong, Heng Fan, Libo Zhang

Current multi-object tracking (MOT) aims to predict trajectories of targets (i. e.,"where") in videos.

Multi-Object Tracking Object +1

VastTrack: Vast Category Visual Object Tracking

1 code implementation6 Mar 2024 Liang Peng, Junyuan Gao, Xinran Liu, Weihong Li, Shaohua Dong, Zhipeng Zhang, Heng Fan, Libo Zhang

The rich annotations of VastTrack enables development of both the vision-only and the vision-language tracking.

Object Visual Object Tracking +1

Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning

1 code implementation1 Dec 2023 Shaohua Dong, Yunhe Feng, Qing Yang, Yan Huang, Dongfang Liu, Heng Fan

Existing approaches often fully fine-tune a dual-branch encoder-decoder framework with a complicated feature fusion strategy for achieving multimodal semantic segmentation, which is training-costly due to the massive parameter updates in feature extraction and fusion.

Ranked #2 on Semantic Segmentation on SUN-RGBD (using extra training data)

object-detection Object Detection +6

MTANet: Multitask-Aware Network With Hierarchical Multimodal Fusion for RGB-T Urban Scene Understanding

no code implementations journal 2022 WuJie Zhou, Shaohua Dong, Jingsheng Lei, Lu Yu

To improve the fusion of multimodal features and the segmentation accuracy, we propose a multitask-aware network (MTANet) with hierarchical multimodal fusion (multiscale fusion strategy) for RGB-T urban scene understanding.

Autonomous Vehicles Scene Understanding +2

Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing

1 code implementation9 Dec 2021 WuJie Zhou, Shaohua Dong, Caie Xu, Yaguan Qian

Considering the importance of high level semantic information, we propose a global information module and a semantic information module to extract rich semantic information from the high-level features.

Scene Parsing Thermal Image Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.