Search Results for author: Shuting He

Found 14 papers, 11 papers with code

MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions

1 code implementation ICCV 2023 Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Chen Change Loy

To investigate the feasibility of using motion expressions to ground and segment objects in videos, we propose a large-scale dataset called MeViS, which contains numerous motion expressions to indicate target objects in complex environments.

Motion Expressions Guided Video Segmentation Object +6

MOSE: A New Dataset for Video Object Segmentation in Complex Scenes

1 code implementation ICCV 2023 Henghui Ding, Chang Liu, Shuting He, Xudong Jiang, Philip H. S. Torr, Song Bai

However, since the target objects in these existing datasets are usually relatively salient, dominant, and isolated, VOS under complex scenes has rarely been studied.

Object Segmentation +3

Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation

1 code implementation CVPR 2023 Shuting He, Henghui Ding, Wei Jiang

The inter-class relationships of semantic-related visual features are then required to be aligned with those in semantic space, thereby transferring semantic knowledge to visual feature learning.

Instance Segmentation Panoptic Segmentation +2

GREC: Generalized Referring Expression Comprehension

1 code implementation30 Aug 2023 Shuting He, Henghui Ding, Chang Liu, Xudong Jiang

This dataset encompasses a range of expressions: those referring to multiple targets, expressions with no specific target, and the single-target expressions.

Generalized Referring Expression Comprehension Referring Expression +1

An Empirical Study of Vehicle Re-Identification on the AI City Challenge

1 code implementation20 May 2021 Hao Luo, Weihua Chen, Xianzhe Xu, Jianyang Gu, Yuqi Zhang, Chong Liu, Yiqi Jiang, Shuting He, Fan Wang, Hao Li

We mainly focus on four points, i. e. training data, unsupervised domain-adaptive (UDA) training, post-processing, model ensembling in this challenge.

Re-Ranking Retrieval +1

1st Place Solution to VisDA-2020: Bias Elimination for Domain Adaptive Pedestrian Re-identification

1 code implementation25 Dec 2020 Jianyang Gu, Hao Luo, Weihua Chen, Yiqi Jiang, Yuqi Zhang, Shuting He, Fan Wang, Hao Li, Wei Jiang

Considering the large gap between the source domain and target domain, we focused on solving two biases that influenced the performance on domain adaptive pedestrian Re-ID and proposed a two-stage training procedure.

Domain Adaptation Pseudo Label

Region Generation and Assessment Network for Occluded Person Re-Identification

no code implementations7 Sep 2023 Shuting He, Weihua Chen, Kai Wang, Hao Luo, Fan Wang, Wei Jiang, Henghui Ding

Then, to measure the importance of each generated region, we introduce a Region Assessment Module (RAM) that assigns confidence scores to different regions and reduces the negative impact of the occlusion regions by lower scores.

Person Re-Identification

VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search

no code implementations13 Nov 2023 Shuting He, Hao Luo, Wei Jiang, Xudong Jiang, Henghui Ding

With the help of relational knowledge transfer, VGKT is capable of aligning semantic-group textual features with corresponding visual features without external tools and complex pairwise interaction.

Ranked #6 on Text based Person Retrieval on CUHK-PEDES (using extra training data)

Person Search Text based Person Retrieval +2

Context-Aware Integration of Language and Visual References for Natural Language Tracking

no code implementations29 Mar 2024 Yanyan Shao, Shuting He, Qi Ye, Yuchao Feng, Wenhan Luo, Jiming Chen

Tracking by natural language specification (TNL) aims to consistently localize a target in a video sequence given a linguistic description in the initial frame.

Cannot find the paper you are looking for? You can Submit a new open access paper.