no code implementations • 31 Oct 2024 • Haiwen Li, Fei Su, Zhicheng Zhao
Composed Image Retrieval (CIR) is a challenging vision-language task, utilizing bi-modal (image+text) queries to retrieve target images.
no code implementations • 13 Sep 2024 • Zhe Cui, Yuwei Jia, Siyang Zheng, Fei Su
Then, a novel 3D graph matching is conducted in 3D space according to the extracted 3D feature.
no code implementations • 19 Jun 2024 • Yunhao Du, Zhicheng Zhao, Fei Su
Multi-Object Tracking (MOT) aims to detect and associate all targets of given classes across frames.
no code implementations • 24 May 2024 • Shuai Jiang, Zhu Meng, Delong Liu, Haiwen Li, Fei Su, Zhicheng Zhao
Brain decoding, which aims at reconstructing visual stimuli from brain signals, primarily utilizing functional magnetic resonance imaging (fMRI), has recently made positive progress.
1 code implementation • 24 May 2024 • Weize Li, Zhicheng Zhao, Haochen Bai, Fei Su
Referring Expression Segmentation (RES) has attracted rising attention, aiming to identify and segment objects based on natural language expressions.
no code implementations • 18 Apr 2024 • Zeliang Ma, Song Yang, Zhe Cui, Zhicheng Zhao, Fei Su, Delong Liu, Jingyu Wang
A new trend in multi-object tracking is to track objects of interest using natural language.
1 code implementation • CVPR 2024 • Zining Chen, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men, Hongying Meng
Domain Generalization (DG) aims to resolve distribution shifts between source and target domains, and current DG methods default to the setting in which data from source and target domains share identical categories.
no code implementations • 2 Apr 2024 • Ayush Arunachalam, Ian Kintz, Suvadeep Banerjee, Arnab Raha, Xiankun Jin, Fei Su, Viswanathan Pillai Prasanth, Rubin A. Parekhji, Suriyaprakash Natarajan, Kanad Basu
Our approach encompasses a systematic analysis of anomaly abstraction at multiple levels pertaining to the automotive domain, from hardware- to block-level, where anomalies are injected to create diverse fault scenarios.
1 code implementation • 7 Mar 2024 • Yunhao Du, Zhicheng Zhao, Fei Su
To this end, we present the Refer-VI-ReID setting, which aims to match target visible images from both infrared images and coarse language descriptions (e.g., "a man with red top and black pants") to complement the missing color information.
no code implementations • IEEE Transactions on Circuits and Systems for Video Technology 2024 • Zining Chen, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men, Yuan Dong
In this paper, we propose an instance paradigm contrastive learning framework, introducing contrast between original features and novel paradigms to alleviate domain-specific distractions.
1 code implementation • CVPR 2024 • Yunhao Du, Cheng Lei, Zhicheng Zhao, Fei Su
Referring multi-object tracking (RMOT) aims to track multiple objects based on input textual descriptions.
1 code implementation • 27 Nov 2023 • Yunhao Du, Cheng Lei, Zhicheng Zhao, Yuan Dong, Fei Su
Previous methods focus on learning from cross-modality person images in different cameras.
1 code implementation • 25 Nov 2023 • Delong Liu, Haiwen Li, Zhicheng Zhao, Fei Su, Yuan Dong
Searching for a specific person has great social benefits and security value, and it often involves a combination of visual and textual information.
Ranked #1 on Zero-shot Composed Person Retrieval on ITCPR dataset (using extra training data)
no code implementations • 16 Nov 2023 • Zhu Meng, Junhao Dong, Limei Guo, Fei Su, Guangxi Wang, Zhicheng Zhao
Since signet ring cells (SRCs) are associated with a high peripheral metastasis rate and dismal survival, they play an important role in determining surgical approaches and prognosis, yet they are easily missed even by experienced pathologists.
1 code implementation • 19 Jul 2023 • Junhao Dong, Zhu Meng, Delong Liu, Jiaxuan Liu, Zhicheng Zhao, Fei Su
In addition, to enhance the classification boundaries, we sample and cluster high- and low-confidence features separately based on confidence estimation, facilitating the generation of prototypes closer to the class boundaries.
1 code implementation • 13 Mar 2023 • Ziqi He, Mengjia Xue, Yunhao Du, Zhicheng Zhao, Fei Su
To address this problem, we propose a dynamic clustering and cluster contrastive learning (DCCC) method.
1 code implementation • 11 Oct 2022 • Yunhao Du, Zihang Liu, Fei Su
Multiple Object Tracking (MOT) has rapidly progressed in recent years.
1 code implementation • 18 Apr 2022 • Yunhao Du, Binyu Zhang, Xiangning Ruan, Fei Su, Zhicheng Zhao, Hong Chen
For the textual representation, one global embedding, three local embeddings and a color-type prompt embedding are extracted to represent various granularities of semantic features.
14 code implementations • 28 Feb 2022 • Yunhao Du, Zhicheng Zhao, Yang Song, Yanyun Zhao, Fei Su, Tao Gong, Hongying Meng
As a result, the construction of a good baseline for a fair comparison is essential.
Ranked #10 on Multi-Object Tracking on MOT20 (using extra training data)
no code implementations • 23 Jun 2019 • Haiqian Gu, Jie Wang, Ziwen Wang, Bojin Zhuang, Wenhao Bian, Fei Su
Structured and unstructured data of the same users, shared by NetEase Music and Sina Weibo, have been collected for cross-platform analysis of correlations between music preference and other user characteristics.
no code implementations • 15 Jun 2019 • Ziwen Wang, Jie Wang, Haiqian Gu, Fei Su, Bojin Zhuang
Automatic text generation has received much attention owing to the rapid development of deep neural networks.
1 code implementation • ECCV 2018 • Xinkun Cao, Zhipeng Wang, Yanyun Zhao, Fei Su
In this paper, we propose a novel encoder-decoder network, called Scale Aggregation Network (SANet), for accurate and efficient crowd counting.
Ranked #6 on Crowd Counting on WorldExpo’10
no code implementations • 1 Jun 2018 • Ce Qi, Zhi-Zhong Liu, Fei Su
Challenge 2 of MS-Celeb-1M is a classification task.
no code implementations • 28 Apr 2018 • Ce Qi, Xiaoping Chen, Pingyu Wang, Fei Su
The proposed training strategy uses the anchors with IoUs between the first and second threshold, which can consistently improve the performance of face detection.
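As a rough illustration of selecting anchors by an IoU band, the minimal sketch below keeps the anchors whose IoU with a ground-truth box lies between two thresholds. The function names, the `(x1, y1, x2, y2)` box format, the half-open interval, and the threshold values in the usage note are all assumptions made for illustration; the listing does not state the paper's actual thresholds or how the selected anchors are used during training.

```python
from typing import List, Tuple

# Assumed box format: (x1, y1, x2, y2) with x1 < x2 and y1 < y2.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def anchors_in_iou_band(
    anchors: List[Box], gt_box: Box, low: float, high: float
) -> List[int]:
    """Return indices of anchors with low <= IoU(anchor, gt_box) < high.

    `low` and `high` stand in for the "first" and "second" thresholds;
    treating the band as half-open is an assumption of this sketch.
    """
    return [i for i, anchor in enumerate(anchors) if low <= iou(anchor, gt_box) < high]
```

For example, with a ground-truth box `(0, 0, 10, 10)` and hypothetical thresholds `low=0.35`, `high=0.6`, an anchor `(0, 0, 10, 5)` has an IoU of 0.5 and is selected, while a perfectly overlapping anchor (IoU 1.0) and a disjoint anchor (IoU 0.0) fall outside the band.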
no code implementations • 5 Aug 2017 • Wenhui Jiang, Thuyen Ngo, B. S. Manjunath, Zhicheng Zhao, Fei Su
This region selection procedure is further integrated into a CNN-based weakly supervised detection (WSD) framework, and can be performed in each stochastic gradient descent mini-batch during training.
2 code implementations • 24 Jul 2017 • Ce Qi, Fei Su
The deep convolutional neural network (CNN) has significantly raised the performance of image classification and face recognition.
no code implementations • 3 May 2016 • Jing Zhou, Xiaopeng Hong, Fei Su, Guoying Zhao
To overcome this problem, we propose a real-time regression framework based on the recurrent convolutional neural network for automatic frame-level pain intensity estimation.