no code implementations • ACL 2022 • Yue Guo, Yi Yang, Ahmed Abbasi
Specifically, we propose a variant of the beam search method to automatically search for biased prompts such that the cloze-style completions are the most different with respect to different demographic groups.
1 code implementation • ECCV 2020 • Guangrui Li, Guoliang Kang, Wu Liu, Yunchao Wei, Yi Yang
The target of CCM is to acquire those synthetic images that share similar distribution with the real ones in the target domain, so that the domain gap can be naturally alleviated by employing the content-consistent synthetic images for training.
Ranked #11 on
Semantic Segmentation
on GTAV-to-Cityscapes Labels
1 code implementation • NAACL 2022 • John Lalor, Yi Yang, Kendall Smith, Nicole Forsgren, Ahmed Abbasi
While much work has highlighted biases embedded in state-of-the-art language models, and more recent efforts have focused on how to debias, research assessing the fairness and performance of biased/debiased models on downstream prediction tasks has been limited.
1 code implementation • Findings (EMNLP) 2021 • Hanyu Duan, Yi Yang, Kar Yan Tam
Numeracy plays a key role in natural language understanding.
1 code implementation • EMNLP 2021 • Ahmed Abbasi, David Dobolyi, John P. Lalor, Richard G. Netemeyer, Kendall Smith, Yi Yang
We also discuss the important implications of our work and resulting testbed for future NLP research on psychometrics and fairness.
no code implementations • ACL 2022 • Chengyu Chuang, Yi Yang
Given the prevalence of NLP models in financial decision making systems, this work raises the awareness of their potential implicit preferences in the stock markets.
1 code implementation • 27 Mar 2023 • Wei Shang, Dongwei Ren, Yi Yang, Hongzhi Zhang, Kede Ma, WangMeng Zuo
Moreover, on the seemingly implausible x16 interpolation task, our method outperforms existing methods by more than 1. 5 dB in terms of PSNR.
no code implementations • 26 Mar 2023 • Dianyi Yang, Jiadong Tang, Yu Gao, Yi Yang, Mengyin Fu
And this fact leads to poor performance on some fisheye vision tasks.
no code implementations • 26 Mar 2023 • Xihan Wang, Xi Xu, Yu Gao, Yi Yang, Yufeng Yue, Mengyin Fu
Compared with the previous work for muti-point representation, the experiments show that CRRS can improve the training performance both in accurate and stability.
1 code implementation • 26 Mar 2023 • Xiaolong Shen, Zongxin Yang, Xiaohan Wang, Jianxin Ma, Chang Zhou, Yi Yang
However, using a single kind of modeling structure is difficult to balance the learning of short-term and long-term temporal correlations, and may bias the network to one of them, leading to undesirable predictions like global location shift, temporal inconsistency, and insufficient local details.
no code implementations • 23 Mar 2023 • Wenqing Wang, Yawei Luo, Zhiqing Chen, Tao Jiang, Lei Chen, Yi Yang, Jun Xiao
Specifically, DLL decouples the predicate labels and adopts separate classifiers to learn actional and spatial patterns respectively.
no code implementations • 20 Mar 2023 • Xingchen Li, Long Chen, Guikun Chen, Yinfu Feng, Yi Yang, Jun Xiao
To this end, we propose a novel Decomposed Prototype Learning (DPL).
1 code implementation • 18 Mar 2023 • Fanglei Xue, Yifan Sun, Yi Yang
This paper explores an expression-related self-supervised learning (SSL) method (ContraWarping) to perform expression classification in the 5th Affective Behavior Analysis in-the-wild (ABAW) competition.
no code implementations • 17 Mar 2023 • Liulei Li, Wenguan Wang, Tianfei Zhou, Jianwu Li, Yi Yang
The objective of this paper is self-supervised learning of video object segmentation.
1 code implementation • 16 Mar 2023 • Fanglei Xue, Yifan Sun, Yi Yang
Therefore, given a facial image, ContraWarping employs some global transformations and local warping to generate its positive and negative samples and sets up a novel contrastive learning framework.
1 code implementation • 15 Mar 2023 • Xiaohan Wang, Wenguan Wang, Jiayi Shao, Yi Yang
Recently, visual-language navigation (VLN) -- entailing robot agents to follow navigation instructions -- has shown great advance.
1 code implementation • 6 Mar 2023 • Wei Li, Linchao Zhu, Longyin Wen, Yi Yang
This decoder is both data-efficient and computation-efficient: 1) it only requires the text data for training, easing the burden on the collection of paired data.
no code implementations • 1 Mar 2023 • Jingli Shi, Weihua Li, Quan Bai, Yi Yang, Jianhua Jiang
Aspect term extraction is a fundamental task in fine-grained sentiment analysis, which aims at detecting customer's opinion targets from reviews on product or service.
no code implementations • 22 Jan 2023 • Juncheng Li, Siliang Tang, Linchao Zhu, Wenqiao Zhang, Yi Yang, Tat-Seng Chua, Fei Wu, Yueting Zhuang
To systematically benchmark the compositional generalizability of temporal grounding models, we introduce a new Compositional Temporal Grounding task and construct two new dataset splits, i. e., Charades-CG and ActivityNet-CG.
no code implementations • 18 Jan 2023 • Fan Ma, Xiaojie Jin, Heng Wang, Jingjia Huang, Linchao Zhu, Jiashi Feng, Yi Yang
Specifically, text-video localization consists of moment retrieval, which predicts start and end boundaries in videos given the text description, and text localization which matches the subset of texts with the video features.
no code implementations • 17 Jan 2023 • Yu Gao, Xi Xu, Tianji Jiang, Siyuan Chen, Yi Yang, Yufeng Yue, Mengyin Fu
For example, 2D object detection usually requires a large amount of 2D annotation data with high cost.
1 code implementation • 3 Jan 2023 • Zhen Yao, Wen Zhang, Mingyang Chen, Yufeng Huang, Yi Yang, Huajun Chen
And in AnKGE, we train an analogy function for each level of analogical inference with the original element embedding from a well-trained KGE model as input, which outputs the analogical object embedding.
1 code implementation • 3 Jan 2023 • Feifei Shao, Yawei Luo, Shengjian Wu, Qiyi Li, Fei Gao, Yi Yang, Jun Xiao
Weakly-supervised object localization aims to indicate the category as well as the scope of an object in an image given only the image-level labels.
Knowledge Distillation
Weakly Supervised Object Localization
+1
1 code implementation • 31 Dec 2022 • Wenhao Wu, Xiaohan Wang, Haipeng Luo, Jingdong Wang, Yi Yang, Wanli Ouyang
In this paper, we propose a novel framework called BIKE, which utilizes the cross-modal bridge to explore bidirectional knowledge: i) We introduce the Video Attribute Association mechanism, which leverages the Video-to-Text knowledge to generate textual auxiliary attributes for complementing video recognition.
Ranked #1 on
Action Recognition
on UCF101
no code implementations • 25 Dec 2022 • Xiaolong Shen, Zhedong Zheng, Yi Yang
As the name implies, StepNet consists of two modules: Part-level Spatial Modeling and Part-level Temporal Modeling.
1 code implementation • 19 Dec 2022 • Difei Gao, Luowei Zhou, Lei Ji, Linchao Zhu, Yi Yang, Mike Zheng Shou
To build Video Question Answering (VideoQA) systems capable of assisting humans in daily activities, seeking answers from long-form videos with diverse and complex events is a must.
1 code implementation • 29 Nov 2022 • Shuangkang Fang, Weixin Xu, Heng Wang, Yi Yang, Yufeng Wang, Shuchang Zhou
In this paper, we propose Progressive Volume Distillation (PVD), a systematic distillation method that allows any-to-any conversions between different architectures, including MLP, sparse or low-rank tensors, hashtables and their compositions.
Ranked #1 on
Novel View Synthesis
on NeRF
(Average PSNR metric)
1 code implementation • IEEE Transactions on Neural Networks and Learning Systems 2022 • Yuanzhi Liang, Linchao Zhu, Xiaohan Wang, Yi Yang
Second, we instantiate the loss function and provide a strong baseline for FGVC, where the performance of a naive backbone can be boosted and be comparable with recent methods.
Ranked #26 on
Fine-Grained Image Classification
on CUB-200-2011
Fine-Grained Image Classification
Fine-Grained Visual Recognition
no code implementations • 19 Nov 2022 • Yi Yang, Zhong-Qiu Zhao, Quan Bai, Qing Liu, Weihua Li
Due to the dynamic nature, the proposed algorithms can also estimate true labels online without re-visiting historical data.
no code implementations • 18 Nov 2022 • Yanyan Wei, Zhao Zhang, ZhongQiu Zhao, Yang Zhao, Richang Hong, Yi Yang
Stereo images, containing left and right view images with disparity, are utilized in solving low-vision tasks recently, e. g., rain removal and super-resolution.
no code implementations • 17 Nov 2022 • Jiayi Shao, Xiaohan Wang, Yi Yang
Moreover, in order to better capture the long-term temporal dependencies in the long videos, we propose a segment-level recurrence mechanism.
no code implementations • 15 Nov 2022 • Leilei Gan, Baokui Li, Kun Kuang, Yi Yang, Fei Wu
Given the fact description text of a legal case, legal judgment prediction (LJP) aims to predict the case's charge, law article and penalty term.
1 code implementation • 14 Nov 2022 • Mu Chen, Zhedong Zheng, Yi Yang, Tat-Seng Chua
In an attempt to fill this gap, we propose a unified pixel- and patch-wise self-supervised learning framework, called PiPa, for domain adaptive semantic segmentation that facilitates intra-image pixel-wise correlations and patch-wise semantic consistency against different contexts.
Ranked #1 on
Semantic Segmentation
on SYNTHIA-to-Cityscapes
no code implementations • 11 Nov 2022 • Yong Hong, Deren Li, Shupei Luo, Xin Chen, Yi Yang, Mi Wang
This study proposes an improved end-to-end multi-target tracking algorithm that adapts to multi-view multi-scale scenes based on the self-attentive mechanism of the transformer's encoder-decoder structure.
no code implementations • 10 Nov 2022 • Tingyu Wang, Zhedong Zheng, Zunjie Zhu, Yuhan Gao, Yi Yang, Chenggang Yan
Cross-view geo-localization aims to spot images of the same location shot from two platforms, e. g., the drone platform and the satellite platform.
no code implementations • IEEE Transactions on Pattern Analysis and Machine Intelligence 2022 • Chuchu Han, Zhedong Zheng, Kai Su, Dongdong Yu, Zehuan Yuan, Changxin Gao, Nong Sang, Yi Yang
Person search aims at localizing and recognizing query persons from raw video frames, which is a combination of two sub-tasks, i. e., pedestrian detection and person re-identification.
Ranked #3 on
Person Search
on PRW
no code implementations • 9 Nov 2022 • Zhao Zhang, Suiyi Zhao, Xiaojie Jin, Mingliang Xu, Yi Yang, Shuicheng Yan
In this paper, we present an embarrassingly simple yet effective solution to a seemingly impossible mission, low-light image enhancement (LLIE) without access to any task-related data.
1 code implementation • 7 Nov 2022 • Carl Doersch, Ankush Gupta, Larisa Markeeva, Adrià Recasens, Lucas Smaira, Yusuf Aytar, João Carreira, Andrew Zisserman, Yi Yang
Generic motion understanding from video involves not only tracking objects, but also perceiving how their surfaces deform and move.
no code implementations • 5 Nov 2022 • Zhe Liu, Yun Li, Lina Yao, Xiaojun Chang, Wei Fang, XiaoJun Wu, Yi Yang
We design Semantic Attention (SA) and generative Knowledge Disentanglement (KD) to learn the dependence of feasibility and contextuality, respectively.
no code implementations • 2 Nov 2022 • Huan Zheng, Zhao Zhang, Jicong Fan, Richang Hong, Yi Yang, Shuicheng Yan
Specifically, we present a decoupled interaction module (DIM) that aims for sufficient dual-view information interaction.
no code implementations • 28 Oct 2022 • Wenguan Wang, Yi Yang, Fei Wu
Neural-symbolic computing (NeSy), which pursues the integration of the symbolic and statistical paradigms of cognition, has been an active research area of Artificial Intelligence (AI) for many years.
1 code implementation • 20 Oct 2022 • Zhuo Chen, Wen Zhang, Yufeng Huang, Mingyang Chen, Yuxia Geng, Hongtao Yu, Zhen Bi, Yichi Zhang, Zhen Yao, Wenting Song, Xinliang Wu, Yi Yang, Mingyi Chen, Zhaoyang Lian, YingYing Li, Lei Cheng, Huajun Chen
In this work, we share our experience on tele-knowledge pre-training for fault analysis, a crucial task in telecommunication applications that requires a wide range of knowledge normally found in both machine log data and product documents.
1 code implementation • Deep Mind 2022 • Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Skanda Koppula, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman and João Carreira
We propose a novel multimodal benchmark – the Perception Test – that aims to extensively evaluate perception and reasoning skills of multimodal models.
no code implementations • 18 Oct 2022 • Ruijun Li, Weihua Li, Yi Yang, Hanyu Wei, Jianhua Jiang, Quan Bai
Recently, diffusion models have been proven to perform remarkably well in text-to-image synthesis tasks in a number of studies, immediately presenting new study opportunities for image generation.
Ranked #1 on
Text-to-Image Generation
on Multi-Modal-CelebA-HQ
2 code implementations • 18 Oct 2022 • Zongxin Yang, Yi Yang
To solve such a problem and further facilitate the learning of visual embeddings, this paper proposes a Decoupling Features in Hierarchical Propagation (DeAOT) approach.
Ranked #1 on
Semi-Supervised Video Object Segmentation
on MOSE
Semantic Segmentation
Semi-Supervised Video Object Segmentation
+1
2 code implementations • 13 Oct 2022 • Jian-Wei Zhang, Yifan Sun, Yi Yang, Wei Chen
With a rethink of recent advances, we find that the current FSS framework has deviated far from the supervised segmentation framework: Given the deep features, FSS methods typically use an intricate decoder to perform sophisticated pixel-wise matching, while the supervised segmentation methods use a simple linear classification head.
1 code implementation • 8 Oct 2022 • Yi Yang, Chen Zhang, Dawei Song
Recent advances in distilling pretrained language models have discovered that, besides the expressiveness of knowledge, the student-friendliness should be taken into consideration to realize a truly knowledgable teacher.
2 code implementations • 5 Oct 2022 • Chen Liang, Wenguan Wang, Jiaxu Miao, Yi Yang
Going beyond this, we propose GMMSeg, a new family of segmentation models that rely on a dense generative classifier for the joint distribution p(pixel feature, class).
no code implementations • 2 Oct 2022 • Jiahuan Ren, Zhao Zhang, Richang Hong, Mingliang Xu, Yi Yang, Shuicheng Yan
Low-light image enhancement (LLIE) aims at improving the illumination and visibility of dark images with lighting noise.
no code implementations • 30 Sep 2022 • Shuai Zhao, Xiaohan Wang, Linchao Zhu, Yi Yang
One evidence of the interference is \emph{gradient imbalance}: a small proportion of parameters produces dominant gradients during backpropagation, and the main parameters may not be fully optimized.
no code implementations • 23 Sep 2022 • Tan Yu, Zhipeng Jin, Jie Liu, Yi Yang, Hongliang Fei, Ping Li
To overcome the limitations of behavior ID features in modeling new ads, we exploit the visual content in ads to boost the performance of CTR prediction models.
no code implementations • 19 Sep 2022 • Tan Yu, Jie Liu, Yi Yang, Yi Li, Hongliang Fei, Ping Li
How to pair the video ads with the user search is the core task of Baidu video advertising.
no code implementations • 7 Aug 2022 • Lin Li, Long Chen, Hanrong Shi, Wenxiao Wang, Jian Shao, Yi Yang, Jun Xiao
To this end, we propose a novel model-agnostic Label Semantic Knowledge Distillation (LS-KD) for unbiased SGG.
1 code implementation • 5 Aug 2022 • Feng Zhu, Zongxin Yang, Xin Yu, Yi Yang, Yunchao Wei
In this work, we propose a new online VIS paradigm named Instance As Identity (IAI), which models temporal information for both detection and tracking in an efficient way.
no code implementations • 3 Aug 2022 • Benyuan Sun, Jin Dai, Zihao Liang, Congying Liu, Yi Yang, Bo Bai
SIMT lays the foundation of pre-training with large-scale multi-task multi-domain datasets and is proved essential for stable training in our GPPF experiments.
1 code implementation • 3 Aug 2022 • Xingchen Li, Long Chen, Wenbo Ma, Yi Yang, Jun Xiao
However, we argue that most existing WSSGG works only focus on object-consistency, which means the grounded regions should have the same object category label as text entities.
no code implementations • 27 Jul 2022 • Lin Li, Long Chen, Hanrong Shi, Hanwang Zhang, Yi Yang, Wei Liu, Jun Xiao
To this end, we propose a novel NoIsy label CorrEction and Sample Training strategy for SGG: NICEST.
1 code implementation • 26 Jul 2022 • Wenhao Wang, Yifan Sun, Zongxin Yang, Yi Yang
While model ensemble is common, we show that combining the vision models and vision-language models brings particular benefits from their complementarity and is a key factor to our superiority.
1 code implementation • 20 Jul 2022 • Yi Yang, Chen Zhang, Benyou Wang, Dawei Song
To uncover the domain-general LM, we propose to identify domain-general parameters by playing lottery tickets (dubbed doge tickets).
1 code implementation • 19 Jul 2022 • Haitian Zeng, Xin Yu, Jiaxu Miao, Yi Yang
We propose MHR-Net, a novel method for recovering Non-Rigid Shapes from Motion (NRSfM).
no code implementations • 8 Jul 2022 • Yucheng Suo, Zhedong Zheng, Xiaohan Wang, Bang Zhang, Yi Yang
We optimize the two losses and keypoint detector network in an end-to-end manner.
1 code implementation • 1 Jul 2022 • Naiyuan Liu, Xiaohan Wang, Xiaobo Li, Yi Yang, Yueting Zhuang
In this report, we present the ReLER@ZJU-Alibaba submission to the Ego4D Natural Language Queries (NLQ) Challenge in CVPR 2022.
Ranked #2 on
Natural Language Queries
on Ego4D
no code implementations • 9 Jun 2022 • Yi Yang, Yanqiao Zhu, Hejie Cui, Xuan Kan, Lifang He, Ying Guo, Carl Yang
Specifically, we propose to meta-train the model on datasets of large sample sizes and transfer the knowledge to small datasets.
1 code implementation • ICLR 2021 • Hehe Fan, Xin Yu, Yuhang Ding, Yi Yang, Mohan Kankanhalli
Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension.
1 code implementation • 24 May 2022 • Wenhao Wang, Yifan Sun, Yi Yang
Moreover, this paper further reveals a unique difficulty for solving the hard negative problem in ICD, i. e., there is a fundamental conflict between current metric learning and ICD.
1 code implementation • IEEE Transactions on Image Processing (TIP) 2022 • Jinliang Lin, Zhedong Zheng, Zhun Zhong, Zhiming Luo, Shaozi Li, Yi Yang, Nicu Sebe
Inspired by the human visual system for mining local patterns, we propose a new framework called RK-Net to jointly learn the discriminative Representation and detect salient Keypoints with a single Network.
Ranked #2 on
Image-Based Localization
on cvusa
no code implementations • 7 May 2022 • Tianle Li, Yi Yang
This research highlights that an adversary can fool a deep NLP model with much less cost.
1 code implementation • 2 May 2022 • Shuai Zhao, Linchao Zhu, Xiaohan Wang, Yi Yang
In this paper, to reduce the number of redundant video tokens, we design a multi-segment token clustering algorithm to find the most representative tokens and drop the non-essential ones.
Ranked #8 on
Video Retrieval
on MSVD
(using extra training data)
1 code implementation • 27 Apr 2022 • Zhedong Zheng, Jiayin Zhu, Wei Ji, Yi Yang, Tat-Seng Chua
This research aims to study a self-supervised 3D clothing reconstruction method, which recovers the geometry shape and texture of human clothing from a single image.
Ranked #1 on
Single-View 3D Reconstruction
on ATR
no code implementations • 25 Apr 2022 • Shaoning Xiao, Long Chen, Kaifeng Gao, Zhao Wang, Yi Yang, Zhimeng Zhang, Jun Xiao
From the view of feature, we break down the video into trajectories and first leverage trajectory feature in VideoQA to enhance the alignment between two modalities.
1 code implementation • 18 Apr 2022 • Tingyu Wang, Zhedong Zheng, Yaoqi Sun, Tat-Seng Chua, Yi Yang, Chenggang Yan
This task is mostly regarded as an image retrieval problem.
no code implementations • 16 Apr 2022 • Suiyi Zhao, Zhao Zhang, Richang Hong, Mingliang Xu, Yi Yang, Meng Wang
Blind image deblurring (BID) remains a challenging and significant task.
1 code implementation • 16 Apr 2022 • Yulei Lu, Yawei Luo, Li Zhang, Zheyang Li, Yi Yang, Jun Xiao
A thriving trend for domain adaptive segmentation endeavors to generate the high-quality pseudo labels for target domain and retrain the segmentor on them.
Ranked #11 on
Domain Adaptation
on GTA5 to Cityscapes
1 code implementation • CVPR 2022 • Xuanmeng Zhang, Zhedong Zheng, Daiheng Gao, Bang Zhang, Pan Pan, Yi Yang
To address this challenge, we propose Multi-View Consistent Generative Adversarial Networks (MVCGAN) for high-quality 3D-aware image synthesis with geometry constraints.
1 code implementation • 29 Mar 2022 • Xiao Pan, Peike Li, Zongxin Yang, Huiling Zhou, Chang Zhou, Hongxia Yang, Jingren Zhou, Yi Yang
By contrast, pixel-level optimization is more explicit, however, it is sensitive to the visual quality of training data and is not robust to object deformation.
1 code implementation • CVPR 2022 • Fan Ma, Mike Zheng Shou, Linchao Zhu, Haoqi Fan, Yilei Xu, Yi Yang, Zhicheng Yan
Although UniTrack \cite{wang2021different} demonstrates that a shared appearance model with multiple heads can be used to tackle individual tracking tasks, it fails to exploit the large-scale tracking datasets for training and performs poorly on single object tracking.
1 code implementation • CVPR 2022 • Changlin Li, Bohan Zhuang, Guangrun Wang, Xiaodan Liang, Xiaojun Chang, Yi Yang
First, we develop a strong manual baseline for progressive learning of ViTs, by introducing momentum growth (MoGrow) to bridge the gap brought by model growth.
1 code implementation • 27 Mar 2022 • Liulei Li, Tianfei Zhou, Wenguan Wang, Lu Yang, Jianwu Li, Yi Yang
Our target is to learn visual correspondence from unlabeled videos.
2 code implementations • CVPR 2022 • Liulei Li, Tianfei Zhou, Wenguan Wang, Jianwu Li, Yi Yang
In this paper, we instead address hierarchical semantic segmentation (HSS), which aims at structured, pixel-wise description of visual observation in terms of a class hierarchy.
1 code implementation • CVPR 2022 • Chen Liang, Wenguan Wang, Tianfei Zhou, Yi Yang
In this paper, we propose a new task and dataset, Visual Abductive Reasoning (VAR), for examining abductive reasoning ability of machine intelligence in everyday visual situations.
1 code implementation • CVPR 2022 • Juncheng Li, Junlin Xie, Long Qian, Linchao Zhu, Siliang Tang, Fei Wu, Yi Yang, Yueting Zhuang, Xin Eric Wang
To systematically measure the compositional generalizability of temporal grounding models, we introduce a new Compositional Temporal Grounding task and construct two new dataset splits, i. e., Charades-CG and ActivityNet-CG.
2 code implementations • 22 Mar 2022 • Zongxin Yang, Jiaxu Miao, Xiaohan Wang, Yunchao Wei, Yi Yang
To match and segment multiple objects as efficiently as processing a single one, AOST employs an IDentification (ID) mechanism to assign objects with unique identities and associate them in a shared high-dimensional embedding space.
Semantic Segmentation
Semi-Supervised Video Object Segmentation
+1
1 code implementation • 18 Mar 2022 • Chen Liang, Wenguan Wang, Tianfei Zhou, Jiaxu Miao, Yawei Luo, Yi Yang
In light of this, we present Locater (local-global context aware Transformer), which augments the Transformer architecture with a finite memory so as to query the entire video with the language expression in an efficient manner.
Ranked #4 on
Referring Expression Segmentation
on A2D Sentences
Referring Expression Segmentation
Referring Video Object Segmentation
+4
1 code implementation • 3 Mar 2022 • Yongxing Dai, Yifan Sun, Jun Liu, Zekun Tong, Yi Yang, Ling-Yu Duan
Instead of directly aligning the source and target domains against each other, we propose to align the source and target domains against their intermediate domains for a smooth knowledge transfer.
no code implementations • 25 Feb 2022 • Feifei Shao, Yawei Luo, Ping Liu, Jie Chen, Yi Yang, Yulei Lu, Jun Xiao
To deploy SSDR-AL in a more practical scenario, we design a noise-aware iterative labeling strategy to confront the "noisy annotation" problem introduced by the previous "dominant labeling" strategy in superpoints.
no code implementations • 8 Feb 2022 • Zoë Papakipos, Giorgos Tolias, Tomas Jenicek, Ed Pizzi, Shuhei Yokoo, Wenhao Wang, Yifan Sun, Weipu Zhang, Yi Yang, Sanjay Addicam, Sergio Manuel Papadakis, Cristian Canton Ferrer, Ondrej Chum, Matthijs Douze
The 2021 Image Similarity Challenge introduced a dataset to serve as a new benchmark to evaluate recent image copy detection methods.
no code implementations • 24 Jan 2022 • Ruyi Qu, Yi Yang, Yuwei Wang
The representational model are Faster RCNN and YOLO series.
no code implementations • 17 Jan 2022 • Xu Chen, Yahong Han, Xiaohan Wang, Yifan Sun, Yi Yang
An effective approach is to select informative content from the holistic video, yielding a popular family of dynamic video recognition methods.
Ranked #36 on
Action Recognition
on Something-Something V1
no code implementations • CVPR 2022 • Liulei Li, Tianfei Zhou, Wenguan Wang, Lu Yang, Jianwu Li, Yi Yang
Our target is to learn visual correspondence from unlabeled videos.
no code implementations • CVPR 2022 • Yadong Ding, Yu Wu, Chengyue Huang, Siliang Tang, Yi Yang, Longhui Wei, Yueting Zhuang, Qi Tian
Existing NAS-based meta-learning methods apply a two-stage strategy, i. e., first searching architectures and then re-training meta-weights on the searched architecture.
1 code implementation • 1 Jan 2022 • Xiaoqiang Wang, Lei Zhu, Siliang Tang, Huazhu Fu, Ping Li, Fei Wu, Yi Yang, Yueting Zhuang
The depth estimation branch is trained with RGB-D images and then used to estimate the pseudo depth maps for all unlabeled RGB images to form the paired data.
1 code implementation • CVPR 2022 • Yunqiu Xu, Yifan Sun, Zongxin Yang, Jiaxu Miao, Yi Yang
How to align the source and target domains is critical to the CDWSOD accuracy.
Ranked #1 on
Weakly Supervised Object Detection
on Clipart1k
no code implementations • CVPR 2022 • Jialun Liu, Yifan Sun, Feng Zhu, Hongbin Pei, Yi Yang, Wenhui Li
These two unidirectional metrics (IR image to RGB proxy and RGB image to IR proxy) jointly alleviate the relay effect and benefit cross-modality association.
1 code implementation • CVPR 2022 • Yuanzhi Liang, Qianyu Feng, Linchao Zhu, Li Hu, Pan Pan, Yi Yang
Talking gesture generation is a practical yet challenging task which aims to synthesize gestures in line with speech.
Ranked #5 on
Gesture Generation
on TED Gesture Dataset
2 code implementations • CVPR 2022 • Yuanzhi Liang, Linchao Zhu, Xiaohan Wang, Yi Yang
In this paper, we propose an episodic linear probing (ELP) classifier to reflect the generalization of visual representations in an online manner.
Ranked #13 on
Fine-Grained Image Classification
on CUB-200-2011
1 code implementation • CVPR 2022 • Jiaxu Miao, Xiaohan Wang, Yu Wu, Wei Li, Xu Zhang, Yunchao Wei, Yi Yang
In contrast, our large-scale VIdeo Panoptic Segmentation in the Wild (VIPSeg) dataset provides 3, 536 videos and 84, 750 frames with pixel-level panoptic annotations, covering a wide range of real-world scenarios and categories.
no code implementations • NeurIPS 2021 • Benyuan Sun, Hongxing Huo, Yi Yang, Bo Bai
The superiority of our algorithm is proved by demonstrating the new state-of-the-art results on cross-domain federated classification and detection.
1 code implementation • NAACL 2022 • Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Li, Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan
To deal with this issue, in this paper, we propose a new strategy to perform textual backdoor attacks which do not require an external trigger, and the poisoned samples are correctly labeled.
1 code implementation • 13 Nov 2021 • Wenhao Wang, Yifan Sun, Weipu Zhang, Yi Yang
In this paper, a data-driven and local-verification (D$^2$LV) approach is proposed to compete for Image Similarity Challenge: Matching Track at NeurIPS'21.
1 code implementation • 13 Nov 2021 • Wenhao Wang, Weipu Zhang, Yifan Sun, Yi Yang
In this paper, a bag of tricks and a strong baseline are proposed for image copy detection.
1 code implementation • 1 Nov 2021 • Weixin Xu, Zipeng Feng, Shuangkang Fang, Song Yuan, Yi Yang, Shuchang Zhou
For example, Transformer Networks do not have native support on many popular chips, and hence are difficult to deploy.
1 code implementation • ICCV 2021 • Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao
In the low-bit quantization field, training Binary Neural Networks (BNNs) is the extreme solution to ease the deployment of deep models on resource-constrained devices, having the lowest storage cost and significantly cheaper bit-wise operations compared to 32-bit floating-point counterparts.
no code implementations • 17 Oct 2021 • Zutao Jiang, Changlin Li, Xiaojun Chang, Jihua Zhu, Yi Yang
Here, we present dynamic slimmable denoising network (DDS-Net), a general method to achieve good denoising quality with less computational complexity, via dynamically adjusting the channel configurations of networks at test time with respect to different noisy images.
no code implementations • 17 Oct 2021 • Di Yuan, Xiaojun Chang, Yi Yang, Qiao Liu, Dehua Wang, Zhenyu He
In this paper, we propose an active learning method for deep visual tracking, which selects and annotates the unlabeled samples to train the deep CNNs model.
no code implementations • 29 Sep 2021 • Chen Liang, Yawei Luo, Yu Wu, Yi Yang
We focus on the problem of segmenting a certain object referred by a natural language sentence in video content, at the core of formulating a pinpoint vision-language relation.
no code implementations • ICLR 2022 • Zhengdong Hu, Yifan Sun, Yi Yang
We hold a hypothesis, i. e., if a deep model is capable to fast generalize itself to different domains (using very few samples) during training, it will maintain such domain generalization capacity for testing.
no code implementations • 21 Sep 2021 • Yi Yang, Ying Wu, Mei Li, Xiangyu Chang, Yong Tan
Then, we transform the social welfare maximization problem into the risk minimization task in machine learning, and derive a fairness-aware scoring system with the help of mixed integer programming.
1 code implementation • 13 Sep 2021 • Yi Yang, Daoye Zhu, Tengteng Qu, Qiangyu Wang, Fuhu Ren, Chengqi Cheng
In the experiments, the proposed method is applied to ResNet and UNet, and the adjusted networks are verified on three very diverse benchmark data sets (i. e., Houston2018 data, Berlin data, and MUUFL data).
no code implementations • ICCV 2021 • Chuchu Han, Kai Su, Dongdong Yu, Zehuan Yuan, Changxin Gao, Nong Sang, Yi Yang, Changhu Wang
Large-scale labeled training data is often difficult to collect, especially for person identities.
no code implementations • 3 Sep 2021 • Kunal Rao, Giuseppe Coviello, Min Feng, Biplob Debnath, Wang-Pin Hsiung, Murugan Sankaradas, Yi Yang, Oliver Po, Utsav Drolia, Srimat Chakradhar
Identification of people with elevated body temperature can reduce or dramatically slow down the spread of infectious diseases like COVID-19.
1 code implementation • 1 Sep 2021 • Chao Sun, Zhedong Zheng, Xiaohan Wang, Mingliang Xu, Yi Yang
Albeit simple, the pre-trained encoder can capture the key points of an unseen point cloud and surpasses the encoder trained from scratch on downstream tasks.
Ranked #33 on
3D Part Segmentation
on ShapeNet-Part
1 code implementation • 26 Aug 2021 • Wuyang Chen, Xinyu Gong, Junru Wu, Yunchao Wei, Humphrey Shi, Zhicheng Yan, Yi Yang, Zhangyang Wang
This work targets designing a principled and unified training-free framework for Neural Architecture Search (NAS), with high performance, low cost, and in-depth interpretation.
no code implementations • 19 Aug 2021 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang
Superpixel segmentation has recently seen important progress benefiting from the advances in differentiable deep learning.
no code implementations • ICCV 2021 • Haitian Zeng, Yuchao Dai, Xin Yu, Xiaohan Wang, Yi Yang
As NRSfM is a highly under-constrained problem, we propose two new pairwise regularization to further regularize the reconstruction.
1 code implementation • ICCV 2021 • Aming Wu, Rui Liu, Yahong Han, Linchao Zhu, Yi Yang
Secondly, domain-specific representations are introduced as the differences between the input and domain-invariant representations.
no code implementations • 27 Jul 2021 • Luyu Qiu, Yi Yang, Caleb Chen Cao, Jing Liu, Yueyuan Zheng, Hilary Hei Ting Ngai, Janet Hsiao, Lei Chen
Besides, our solution also resolves a fundamental problem with the faithfulness indicator, a commonly used evaluation metric of XAI algorithms that appears to be sensitive to the OoD issue.
no code implementations • ICCV 2021 • Juncheng Li, Siliang Tang, Linchao Zhu, Haochen Shi, Xuanwen Huang, Fei Wu, Yi Yang, Yueting Zhuang
Secondly, we introduce semantic coherence learning to explicitly encourage the semantic coherence of the adaptive hierarchical graph network from three hierarchies.
1 code implementation • 26 Jun 2021 • Ye Zhu, Yu Wu, Yi Yang, Yan Yan
Current vision and language tasks usually take complete visual data (e. g., raw images or videos) as input, however, practical scenarios may often consist the situations where part of the visual information becomes inaccessible due to various reasons e. g., restricted view with fixed camera or intentional vision block for security concerns.
1 code implementation • 20 Jun 2021 • Yaxiong Wang, Yunchao Wei, Xueming Qian, Li Zhu, Yi Yang
We aim to tackle the challenging yet practical scenery image outpainting task in this work.
1 code implementation • CVPR 2021 • Ruijie Quan, Xin Yu, Yuanzhi Liang, Yi Yang
First, we propose a complementary cascaded network architecture, namely CCN, to remove rain streaks and raindrops in a unified framework.
no code implementations • CVPR 2021 • Jiaxu Miao, Yunchao Wei, Yu Wu, Chen Liang, Guangrui Li, Yi Yang
To the best of our knowledge, our VSPW is the first attempt to tackle the challenging video scene parsing task in the wild by considering diverse scenarios.
1 code implementation • CVPR 2021 • Hehe Fan, Yi Yang, Mohan Kankanhalli
To capture the dynamics in point cloud videos, point tracking is usually employed.
no code implementations • CVPR 2021 • Yu Wu, Yi Yang
Previous works take the overall event labels to supervise both audio and visual model predictions.
no code implementations • 18 Jun 2021 • Jingli Shi, Weihua Li, Sira Yongchareon, Yi Yang, Quan Bai
However, detecting concerns in time from massive information in social media turns out to be a big challenge, especially when sufficient manually labeled data is in the absence of public health emergencies, e. g., COVID-19.
1 code implementation • CVPR 2021 • Guangrui Li, Guoliang Kang, Yi Zhu, Yunchao Wei, Yi Yang
To better exploit the intrinsic structure of the target domain, we propose Domain Consensus Clustering (DCC), which exploits the domain consensus knowledge to discover discriminative clusters on both common samples and private ones.
Ranked #1 on
Universal Domain Adaptation
on Office-Home
2 code implementations • NeurIPS 2021 • Gengwei Zhang, Guoliang Kang, Yi Yang, Yunchao Wei
Directly performing cross-attention may aggregate these features from support to query and bias the query features.
Ranked #35 on
Few-Shot Semantic Segmentation
on PASCAL-5i (1-Shot)
2 code implementations • NeurIPS 2021 • Zongxin Yang, Yunchao Wei, Yi Yang
The state-of-the-art methods learn to decode features with a single positive object and thus have to match and segment each target separately under multi-object scenarios, consuming multiple times computing resources.
Ranked #3 on
Semi-Supervised Video Object Segmentation
on MOSE
1 code implementation • ACL 2021 • James Mullenbach, Yada Pruksachatkun, Sean Adler, Jennifer Seale, Jordan Swartz, T. Greg McKelvey, Hui Dai, Yi Yang, David Sontag
In this work, we describe our creation of a dataset of clinical action items annotated over MIMIC-III, the largest publicly available dataset of real clinical notes.
no code implementations • 3 Jun 2021 • Kezhou Lin, Xiaohan Wang, Zhedong Zheng, Linchao Zhu, Yi Yang
Obtaining viewer responses from videos can be useful for creators and streaming platforms to analyze the video performance and improve the future user experience.
no code implementations • 2 Jun 2021 • Chen Liang, Yu Wu, Tianfei Zhou, Wenguan Wang, Zongxin Yang, Yunchao Wei, Yi Yang
Referring video object segmentation (RVOS) aims to segment video objects with the guidance of natural language reference.
One-shot visual object segmentation
Referring Video Object Segmentation
+2
no code implementations • 1 Jun 2021 • Qianyu Feng, Bang Zhang, Yi Yang
Differently, our goal is to represent a system with a part-whole hierarchy and discover the implied dependencies among intra-system variables: inferring the interactions that possess causal effects on the sub-system behavior with REcurrent partItioned Network (REIN).
1 code implementation • 1 Jun 2021 • Jian-Wei Zhang, Lei Lv, Yawei Luo, Hao-Zhe Feng, Yi Yang, Wei Chen
The hierarchical features help the model highlight the decision boundary and focus on hard pixels, and the structural information learned from base classes is treated as the prior knowledge for novel classes.
1 code implementation • 31 May 2021 • Shuai Bai, Zhedong Zheng, Xiaohan Wang, Junyang Lin, Zhu Zhang, Chang Zhou, Yi Yang, Hongxia Yang
In this paper, we apply one new modality, i. e., the language description, to search the vehicle of interest and explore the potential of this task in the real-world scenario.
1 code implementation • 31 May 2021 • Yuan Gan, Yawei Luo, Xin Yu, Bang Zhang, Yi Yang
In this paper, we investigate the task of hallucinating an authentic high-resolution (HR) human face from multiple low-resolution (LR) video snapshots.
no code implementations • 2 May 2021 • Qianyu Feng, Linchao Zhu, Bang Zhang, Pan Pan, Yi Yang
Specifically, we expect to approximate the real joint distribution over the partial observation and latent variables, thus infer the unseen targets respectively.
1 code implementation • 30 Apr 2021 • Youjiang Xu, Linchao Zhu, Lu Jiang, Yi Yang
It has been shown that deep neural networks are prone to overfitting on biased training data.
1 code implementation • 21 Apr 2021 • Feifei Shao, Yawei Luo, Li Zhang, Lu Ye, Siliang Tang, Yi Yang, Jun Xiao
The recent emerged weakly supervised object localization (WSOL) methods can learn to localize an object in the image only using image-level labels.
Weakly Supervised Object Localization
Weakly-Supervised Object Localization
1 code implementation • CVPR 2021 • Xiaohan Wang, Linchao Zhu, Yi Yang
Moreover, a global alignment method is proposed to provide a global cross-modal measurement that is complementary to the local perspective.
no code implementations • CVPR 2021 • Zongxin Yang, Xin Yu, Yi Yang
In the first step, the framework learns to segment objects from real and synthetic data in a weakly-supervised fashion, and the segmentation masks will act as a prior for pose estimation.
2 code implementations • NAACL 2021 • Derek Chen, Howard Chen, Yi Yang, Alex Lin, Zhou Yu
Existing goal-oriented dialogue datasets focus mainly on identifying slots and values.
2 code implementations • 29 Mar 2021 • Zhedong Zheng, Yi Yang
Domain adaptation is to transfer the shared knowledge learned from the source domain to a new environment, i. e., target domain.
no code implementations • 19 Mar 2021 • Chen Liang, Yu Wu, Yawei Luo, Yi Yang
Text-based video segmentation is a challenging task that segments out the natural language referred objects in videos.
Ranked #2 on
Referring Expression Segmentation
on A2D Sentences
(Precision@0.9 metric)
no code implementations • 18 Mar 2021 • Qianyu Feng, Yunchao Wei, MingMing Cheng, Yi Yang
Visual grounding is a long-lasting problem in vision-language understanding due to its diversity and complexity.
no code implementations • ICCV 2021 • Peike Li, Xin Yu, Yi Yang
By iteratively updating the latent representations and our decoder, our DAP-FSR will be adapted to the target domain, thus achieving authentic and high-quality upsampled HR faces.
no code implementations • 10 Mar 2021 • Shaowei Wang, Lingling Zhang, Xuan Luo, Yi Yang, Xin Hu, Jun Liu
Another type of diagrams such as from Computer Science is composed of graphics containing complex topologies and relations, and research on this type of diagrams is still blank.
no code implementations • 8 Mar 2021 • Qianyu Feng, Yawei Luo, Keyang Luo, Yi Yang
To generalize the model towards a real scenario, we propose to fulfill several aspects: (1) Look: visually incorporate spatial structure from the single view to enhance the expressiveness of representation; (2) Cast: perceptually align the 2D image features to the 3D shape priors with cross-modal semantic contrastive mapping; (3) Mold: reconstruct stereo-shape of target by transforming embeddings into the desired manifold.
1 code implementation • CVPR 2021 • Tianfei Zhou, Wenguan Wang, Si Liu, Yi Yang, Luc van Gool
To address the challenging task of instance-aware human part parsing, a new bottom-up regime is proposed to learn category-level human semantic segmentation as well as multi-person pose estimation in a joint and end-to-end manner.
1 code implementation • ICCV 2021 • Aming Wu, Yahong Han, Linchao Zhu, Yi Yang
Thus, we develop a new framework of few-shot object detection with universal prototypes ({FSOD}^{up}) that owns the merit of feature generalization towards novel objects.
Ranked #19 on
Few-Shot Object Detection
on MS-COCO (10-shot)
no code implementations • 22 Feb 2021 • Chuchu Han, Zhedong Zheng, Changxin Gao, Nong Sang, Yi Yang
Specifically, to reconcile the conflicts of multiple objectives, we simplify the standard tightly coupled pipelines and establish a deeply decoupled multi-task learning framework.
Ranked #7 on
Person Search
on PRW
no code implementations • 5 Feb 2021 • Ye Zhu, Yu Wu, Hugo Latapie, Yi Yang, Yan Yan
People can easily imagine the potential sound while seeing an event.
no code implementations • 4 Feb 2021 • Yi Yang, Cheng-Wei Lin
We propose a novel way to search for axion(-like) particles in heavy-ion collisions using prompt photons as the probe and the property of conversion between photon and axion(-like) particles under a strong magnetic field generated in the non-central collisions.
High Energy Physics - Phenomenology High Energy Physics - Experiment
1 code implementation • 3 Feb 2021 • Yuhang Ding, Xin Yu, Yi Yang
Thus, it is more desirable to employ only a few labeled data in pursuing high segmentation performance.
no code implementations • 28 Jan 2021 • The LIGO Scientific Collaboration, The Virgo Collaboration, the KAGRA Collaboration, R. Abbott, T. D. Abbott, S. Abraham, F. Acernese, K. Ackley, A. Adams, C. Adams, R. X. Adhikari, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, K. Agatsuma, N. Aggarwal, O. D. Aguiar, L. Aiello, A. Ain, T. Akutsu, K. M. Aleman, G. Allen, A. Allocca, P. A. Altin, A. Amato, S. Anand, A. Ananyeva, S. B. Anderson, W. G. Anderson, M. Ando, S. V. Angelova, S. Ansoldi, J. M. Antelis, S. Antier, S. Appert, Koya Arai, Koji Arai, Y. Arai, S. Araki, A. Araya, M. C. Araya, J. S. Areeda, M. Arène, N. Aritomi, N. Arnaud, S. M. Aronson, H. Asada, Y. Asali, G. Ashton, Y. Aso, S. M. Aston, P. Astone, F. Aubin, P. Aufmuth, K. AultONeal, C. Austin, S. Babak, F. Badaracco, M. K. M. Bader, S. Bae, Y. Bae, A. M. Baer, S. Bagnasco, Y. Bai, L. Baiotti, J. Baird, R. Bajpai, M. Ball, G. Ballardin, S. W. Ballmer, M. Bals, A. Balsamo, G. Baltus, S. Banagiri, D. Bankar, R. S. Bankar, J. C. Barayoga, C. Barbieri, B. C. Barish, D. Barker, P. Barneo, S. Barnum, F. Barone, B. Barr, L. Barsotti, M. Barsuglia, D. Barta, J. Bartlett, M. A. Barton, I. Bartos, R. Bassiri, A. Basti, M. Bawaj, J. C. Bayley, A. C. Baylor, M. Bazzan, B. Bécsy, V. M. Bedakihale, M. Bejger, I. Belahcene, V. Benedetto, D. Beniwal, M. G. Benjamin, T. F. Bennett, J. D. Bentley, M. BenYaala, F. Bergamin, B. K. Berger, S. Bernuzzi, D. Bersanetti, A. Bertolini, J. Betzwieser, R. Bhandare, A. V. Bhandari, D. Bhattacharjee, S. Bhaumik, J. Bidler, I. A. Bilenko, G. Billingsley, R. Birney, O. Birnholtz, S. Biscans, M. Bischi, S. Biscoveanu, A. Bisht, B. Biswas, M. Bitossi, M. -A. Bizouard, J. K. Blackburn, J. Blackman, C. D. Blair, D. G. Blair, R. M. Blair, F. Bobba, N. Bode, M. Boer, G. Bogaert, M. Boldrini, F. Bondu, E. Bonilla, R. Bonnand, P. Booker, B. A. Boom, R. Bork, V. Boschi, N. Bose, S. Bose, V. Bossilkov, V. Boudart, Y. Bouffanais, A. Bozzi, C. Bradaschia, P. R. Brady, A. Bramley, A. Branch, M. Branchesi, J. E. Brau, M. Breschi, T. Briant, J. H. Briggs, A. Brillet, M. Brinkmann, P. Brockill, A. F. Brooks, J. Brooks, D. D. Brown, S. Brunett, G. Bruno, R. Bruntz, J. Bryant, A. Buikema, T. Bulik, H. J. Bulten, A. Buonanno, R. Buscicchio, D. Buskulic, R. L. Byer, L. Cadonati, M. Caesar, G. Cagnoli, C. Cahillane, H. W. Cain III, J. Calderón Bustillo, J. D. Callaghan, T. A. Callister, E. Calloni, J. B. Camp, M. Canepa, M. Cannavacciuolo, K. C. Cannon, H. Cao, J. Cao, Z. Cao, E. Capocasa, E. Capote, G. Carapella, F. Carbognani, J. B. Carlin, M. F. Carney, M. Carpinelli, G. Carullo, T. L. Carver, J. Casanueva Diaz, C. Casentini, G. Castaldi, S. Caudill, M. Cavaglià, F. Cavalier, R. Cavalieri, G. Cella, P. Cerdá-Durán, E. Cesarini, W. Chaibi, K. Chakravarti, B. Champion, C. -H. Chan, C. Chan, C. L. Chan, M. Chan, K. Chandra, P. Chanial, S. Chao,