no code implementations • 23 Aug 2023 • Junyi Chen, Longteng Guo, Jia Sun, Shuai Shao, Zehuan Yuan, Liang Lin, Dongyu Zhang
Owing to the combination of the unified architecture and pre-training task, EVE is easy to scale up, enabling better downstream performance with fewer resources and faster training speed.
no code implementations • 25 May 2023 • Zijia Zhao, Longteng Guo, Tongtian Yue, Sihan Chen, Shuai Shao, Xinxin Zhu, Zehuan Yuan, Jing Liu
We show that only language-paired two-modality data is sufficient to connect all modalities.
1 code implementation • 22 May 2023 • Shuai Shao, Yu Guan, Bing Zhai, Paolo Missier, Thomas Ploetz
Specifically, with the introduction of three conceptual layers--Sampling Layer, Data Augmentation Layer, and Resilient Layer -- we develop three "boosters" -- R-Frame, Mix-up, and C-Drop -- to enrich the per-epoch training data by dense-sampling, synthesizing, and simulating, respectively.
no code implementations • 26 Apr 2023 • Yan Wang, Jian Cheng, Yixin Chen, Shuai Shao, Lanyun Zhu, Zhenzhou Wu, Tao Liu, Haogang Zhu
In FVP, the visual prompt is parameterized using only a small amount of low-frequency learnable parameters in the input frequency space, and is learned by minimizing the segmentation loss between the predicted segmentation of the prompted target image and reliable pseudo segmentation label of the target image under the frozen model.
no code implementations • 14 Feb 2023 • Jifan Zhang, Shuai Shao, Saurabh Verma, Robert Nowak
Extensive experiments in multi-class and multi-label applications demonstrate TAILOR's effectiveness in achieving accuracy comparable or better than that of the best of the candidate algorithms.
no code implementations • 23 Dec 2022 • Shuai Shao, Yu Guan, Xin Guan, Paolo Missier, Thomas Ploetz
What remains a major challenge though is the sporadic activity recognition (SAR) problem, where activities of interest tend to be non periodic, and occur less frequently when compared with the often large amount of irrelevant background activities.
no code implementations • 9 Oct 2022 • Zijia Zhao, Longteng Guo, Xingjian He, Shuai Shao, Zehuan Yuan, Jing Liu
Our method performs joint masking on image-text input and integrates both implicit and explicit targets for the masked signals to recover.
no code implementations • 24 Jul 2022 • Shuai Shao, Markus Meister, Julijana Gjorgjieva
Here we derive a general theory of optimal population coding with neuronal activation functions of any shape, different types of noise and heterogeneous firing rates of the neurons by maximizing the Shannon mutual information between a stimulus and the neuronal spiking output subject to a constrain on the maximal firing rate.
no code implementations • 6 May 2022 • Sankaran Panchapagesan, Arun Narayanan, Turaj Zakizadeh Shabestary, Shuai Shao, Nathan Howard, Alex Park, James Walker, Alexander Gruenstein
Acoustic Echo Cancellation (AEC) is essential for accurate recognition of queries spoken to a smart speaker that is playing out audio.
no code implementations • 5 Apr 2022 • Bo Yuan, Danpei Zhao, Shuai Shao, Zehuan Yuan, Changhu Wang
In two typical cross-domain semantic segmentation tasks, i. e., GTA5 to Cityscapes and SYNTHIA to Cityscapes, our method achieves the state-of-the-art segmentation accuracy.
no code implementations • 15 Mar 2022 • Shuai Shao, Lei Xing, Weifeng Liu, Yanjiang Wang, BaoDi Liu
First, we propose a novel label prediction method, Isolated Graph Learning (IGL).
no code implementations • 3 Dec 2021 • Shuai Shao, Lei Xing, Wei Yu, Rui Xu, Yanjiang Wang, BaoDi Liu
Inspired by the concept of self-supervised learning (e. g., setting the pretext task to generate a universal model for the downstream task), we propose a Self-Supervised Dictionary Learning (SSDL) framework to address this challenge.
no code implementations • 1 Dec 2021 • Shuai Shao, Lei Xing, Rui Xu, Weifeng Liu, Yan-Jiang Wang, Bao-Di Liu
Inspired by this assumption, we propose a novel method Multi-Decision Fusing Model (MDFM), which comprehensively considers the decisions based on multiple FEMs to enhance the efficacy and robustness of the model.
no code implementations • 16 Sep 2021 • Shuai Shao, Lei Xing, Yan Wang, Rui Xu, Chunyan Zhao, Yan-Jiang Wang, Bao-Di Liu
Apply the trained FEM to acquire the novel data's features and recognize them.
no code implementations • 7 Sep 2021 • Shuai Shao, Lei Xing, Yixin Chen, Yan-Jiang Wang, Bao-Di Liu, Yicong Zhou
(2) Use the FEM to extract the features of novel data (with few labeled samples and totally different categories from base data), then classify them with the to-be-designed classifier.
no code implementations • 23 Oct 2020 • Shuai Shao, Rui Xu, Yan-Jiang Wang, Weifeng Liu, Bao-Di Liu
In this paper, we propose a hypergraph based sparse attention mechanism to tackle this issue and embed it into dictionary learning.
no code implementations • 23 Oct 2020 • Shuai Shao, Mengke Wang, Rui Xu, Yan-Jiang Wang, Bao-Di Liu
To tackle this issue, we propose a Dynamic Label Dictionary Learning (DLDL) algorithm to generate the soft label matrix for unlabeled data.
no code implementations • 16 May 2020 • Shuai Shao, Jin-Yi Cai
We prove a complexity dichotomy for Holant problems on the boolean domain with arbitrary sets of real-valued constraint functions.
Computational Complexity
no code implementations • ICCV 2019 • Shuai Shao, Zeming Li, Tianyuan Zhang, Chao Peng, Gang Yu, Xiangyu Zhang, Jing Li, Jian Sun
Objects365 can serve as a better feature learning dataset for localization-sensitive tasks like object detection and semantic segmentation.
no code implementations • 17 Apr 2019 • Yan-Jiang Wang, Shuai Shao, Rui Xu, Werifeng Liu, Bao-Di Liu
Dictionary learning methods can be split into: i) class specific dictionary learning ii) class shared dictionary learning.
16 code implementations • CVPR 2019 • Wenhai Wang, Enze Xie, Xiang Li, Wenbo Hou, Tong Lu, Gang Yu, Shuai Shao
Due to the fact that there are large geometrical margins among the minimal scale kernels, our method is effective to split the close text instances, making it easier to use segmentation-based methods to detect arbitrary-shaped text instances.
Ranked #12 on
Scene Text Detection
on SCUT-CTW1500
1 code implementation • 7 Mar 2019 • Shuai Shao, Yan-Jiang Wang, Bao-Di Liu, Weifeng Liu, Rui Xu
Recently, label consistent k-svd (LC-KSVD) algorithm has been successfully applied in image classification.
no code implementations • 19 Feb 2019 • Chen Change Loy, Dahua Lin, Wanli Ouyang, Yuanjun Xiong, Shuo Yang, Qingqiu Huang, Dongzhan Zhou, Wei Xia, Quanquan Li, Ping Luo, Junjie Yan, Jian-Feng Wang, Zuoxin Li, Ye Yuan, Boxun Li, Shuai Shao, Gang Yu, Fangyun Wei, Xiang Ming, Dong Chen, Shifeng Zhang, Cheng Chi, Zhen Lei, Stan Z. Li, Hongkai Zhang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen, Wu Liu, Boyan Zhou, Huaxiong Li, Peng Cheng, Tao Mei, Artem Kukharenko, Artem Vasenin, Nikolay Sergievskiy, Hua Yang, Liangqi Li, Qiling Xu, Yuan Hong, Lin Chen, Mingjun Sun, Yirong Mao, Shiying Luo, Yongjun Li, Ruiping Wang, Qiaokang Xie, Ziyang Wu, Lei Lu, Yiheng Liu, Wengang Zhou
This paper presents a review of the 2018 WIDER Challenge on Face and Pedestrian.
2 code implementations • 21 Nov 2018 • Enze Xie, Yuhang Zang, Shuai Shao, Gang Yu, Cong Yao, Guangyao Li
We propose a supervised pyramid context network (SPCNET) to precisely locate text regions while suppressing false positives.
Ranked #2 on
Scene Text Detection
on ICDAR 2013
1 code implementation • 30 Apr 2018 • Shuai Shao, Zijian Zhao, Boxun Li, Tete Xiao, Gang Yu, Xiangyu Zhang, Jian Sun
There are a total of $470K$ human instances from the train and validation subsets, and $~22. 6$ persons per image, with various kinds of occlusions in the dataset.
Ranked #7 on
Pedestrian Detection
on Caltech
(using extra training data)
2 code implementations • CVPR 2018 • Xinlong Wang, Tete Xiao, Yuning Jiang, Shuai Shao, Jian Sun, Chunhua Shen
In this paper, we first explore how a state-of-the-art pedestrian detector is harmed by crowd occlusion via experimentation, providing insights into the crowd occlusion problem.
Ranked #9 on
Pedestrian Detection
on Caltech
(using extra training data)
no code implementations • 2 Dec 2016 • Bo Li, Tianfu Wu, Shuai Shao, Lun Zhang, Rufeng Chu
This paper presents a method of integrating a mixture of object models and region-based convolutional networks for accurate object detection.