2 code implementations • 11 May 2022 • Yawei Li, Kai Zhang, Radu Timofte, Luc van Gool, Fangyuan Kong, Mingxi Li, Songwei Liu, Zongcai Du, Ding Liu, Chenhui Zhou, Jingyi Chen, Qingrui Han, Zheyuan Li, Yingqi Liu, Xiangyu Chen, Haoming Cai, Yu Qiao, Chao Dong, Long Sun, Jinshan Pan, Yi Zhu, Zhikai Zong, Xiaoxiao Liu, Zheng Hui, Tao Yang, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Yanbo Wang, Xiaozhong Ji, Chuming Lin, Donghao Luo, Ying Tai, Chengjie Wang, Zhizhong Zhang, Yuan Xie, Shen Cheng, Ziwei Luo, Lei Yu, Zhihong Wen, Qi Wu1, Youwei Li, Haoqiang Fan, Jian Sun, Shuaicheng Liu, Yuanfei Huang, Meiguang Jin, Hua Huang, Jing Liu, Xinjian Zhang, Yan Wang, Lingshun Long, Gen Li, Yuanfan Zhang, Zuowei Cao, Lei Sun, Panaetov Alexander, Yucong Wang, Minjie Cai, Li Wang, Lu Tian, Zheyuan Wang, Hongbing Ma, Jie Liu, Chao Chen, Yidong Cai, Jie Tang, Gangshan Wu, Weiran Wang, Shirui Huang, Honglei Lu, Huan Liu, Keyan Wang, Jun Chen, Shi Chen, Yuchun Miao, Zimo Huang, Lefei Zhang, Mustafa Ayazoğlu, Wei Xiong, Chengyi Xiong, Fei Wang, Hao Li, Ruimian Wen, Zhijing Yang, Wenbin Zou, Weixin Zheng, Tian Ye, Yuncheng Zhang, Xiangzhen Kong, Aditya Arora, Syed Waqas Zamir, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Dandan Gaoand Dengwen Zhouand Qian Ning, Jingzhu Tang, Han Huang, YuFei Wang, Zhangheng Peng, Haobo Li, Wenxue Guan, Shenghua Gong, Xin Li, Jun Liu, Wanjun Wang, Dengwen Zhou, Kun Zeng, Hanjiang Lin, Xinyu Chen, Jinsheng Fang
The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29. 00dB on DIV2K validation set.
no code implementations • 18 Apr 2022 • Jiudong Yang, Peiying Wang, Yi Zhu, Mingchao Feng, Meng Chen, Xiaodong He
Turn-taking, aiming to decide when the next speaker can start talking, is an essential component in building human-robot spoken dialogue systems.
1 code implementation • 15 Apr 2022 • Jipeng Qiang, Yang Li, Chaowei Zhang, Yun Li, Yunhao Yuan, Yi Zhu, Xindong Wu
Idioms, are a kind of idiomatic expression in Chinese, most of which consist of four Chinese characters.
2 code implementations • 12 Apr 2022 • Yi Zhu, Evgueni T. Filipov
Engineering design of origami systems is challenging because comparing different origami patterns requires using categorical features and evaluating multi-physics behavior targets introduces multi-objective problems.
no code implementations • 31 Mar 2022 • Xuelin Qian, Li Wang, Yi Zhu, Li Zhang, Yanwei Fu, xiangyang xue
Conventional 3D object detection approaches concentrate on bounding boxes representation learning with several parameters, i. e., localization, dimension, and orientation.
1 code implementation • 24 Mar 2022 • Likun Cai, Zhi Zhang, Yi Zhu, Li Zhang, Mu Li, xiangyang xue
Multiple datasets and open challenges for object detection have been introduced in recent years.
Ranked #1 on
Object Detection
on BigDetection val
no code implementations • 22 Mar 2022 • Zexun Wang, Yuquan Le, Yi Zhu, Yuming Zhao, Mingchao Feng, Meng Chen, Xiaodong He
Building Spoken Language Understanding (SLU) robust to Automatic Speech Recognition (ASR) errors is an essential issue for various voice-enabled virtual assistants.
Automatic Speech Recognition
Natural Language Understanding
+2
no code implementations • 23 Feb 2022 • Yi Zhu, Xinke Zhou, Jipeng Qiang, Yun Li, Yunhao Yuan, Xindong Wu
In the short text, the extremely short length, feature sparsity, and high ambiguity pose huge challenges to classification tasks.
1 code implementation • 8 Dec 2021 • Xiwen Liang, Fengda Zhu, Yi Zhu, Bingqian Lin, Bing Wang, Xiaodan Liang
The vision-language navigation (VLN) task requires an agent to reach a target with the guidance of natural language instruction.
no code implementations • NeurIPS 2021 • Shengju Qian, Hao Shao, Yi Zhu, Mu Li, Jiaya Jia
In this work, we analyze the uncharted problem of aliasing in vision transformer and explore to incorporate anti-aliasing properties.
no code implementations • ICCV 2021 • Mohammadreza Zolfaghari, Yi Zhu, Peter Gehler, Thomas Brox
Contrastive learning allows us to flexibly define powerful losses by contrasting positive pairs from sets of negative samples.
1 code implementation • Findings (EMNLP) 2021 • Xinyu Lu, Jipeng Qiang, Yun Li, Yunhao Yuan, Yi Zhu
The availability of parallel sentence simplification (SS) is scarce for neural SS modelings.
1 code implementation • NeurIPS 2021 • Li Wang, Li Zhang, Yi Zhu, Zhi Zhang, Tong He, Mu Li, xiangyang xue
Recognizing and localizing objects in the 3D space is a crucial ability for an AI agent to perceive its surrounding environment.
1 code implementation • 5 Aug 2021 • Haofei Kuang, Yi Zhu, Zhi Zhang, Xinyu Li, Joseph Tighe, Sören Schwertfeger, Cyrill Stachniss, Mu Li
Our formulation is able to capture global context in a video, thus robust to temporal content change.
no code implementations • 29 Jul 2021 • Fangrui Zhu, Yi Zhu, Li Zhang, Chongruo wu, Yanwei Fu, Mu Li
Semantic segmentation is a challenging problem due to difficulties in modeling context in complex scenes and class confusions along boundaries.
1 code implementation • 23 Jul 2021 • Bingqian Lin, Yi Zhu, Yanxin Long, Xiaodan Liang, Qixiang Ye, Liang Lin
Specifically, we propose a Dynamic Reinforced Instruction Attacker (DR-Attacker), which learns to mislead the navigator to move to the wrong target by destroying the most instructive information in instructions at different timesteps.
no code implementations • 19 Jul 2021 • Anish Pimpley, Shuo Li, Anubha Srivastava, Vishal Rohra, Yi Zhu, Soundararajan Srinivasan, Alekh Jindal, Hiren Patel, Shi Qiao, Rathijit Sen
We introduce a system for optimal resource allocation that can predict performance with aggressive trade-offs, for both new and past observed queries.
no code implementations • 7 Jul 2021 • Fengda Zhu, Yi Zhu, Vincent CS Lee, Xiaodan Liang, Xiaojun Chang
A navigation agent is supposed to have various intelligent skills, such as visual perceiving, mapping, planning, exploring and reasoning, etc.
no code implementations • 24 Jun 2021 • Xueqing Deng, Yi Zhu, Yuxin Tian, Shawn Newsam
Neural network-based semantic segmentation has achieved remarkable results when large amounts of annotated data are available, that is, in the supervised case.
no code implementations • 18 Jun 2021 • Lina Wang, Xingshu Chen, Yulong Wang, Yawei Yue, Yi Zhu, Xuemei Zeng, Wei Wang
Previous works study the adversarial robustness of image classifiers on image level and use all the pixel information in an image indiscriminately, lacking of exploration of regions with different semantic meanings in the pixel space of an image.
1 code implementation • CVPR 2021 • Guangrui Li, Guoliang Kang, Yi Zhu, Yunchao Wei, Yi Yang
To better exploit the intrinsic structure of the target domain, we propose Domain Consensus Clustering (DCC), which exploits the domain consensus knowledge to discover discriminative clusters on both common samples and private ones.
Ranked #1 on
Universal Domain Adaptation
on Office-Home
no code implementations • ICCV 2021 • Yanyi Zhang, Xinyu Li, Chunhui Liu, Bing Shuai, Yi Zhu, Biagio Brattoli, Hao Chen, Ivan Marsic, Joseph Tighe
We first introduce the vanilla video transformer and show that transformer module is able to perform spatio-temporal modeling from raw pixels, but with heavy memory usage.
Ranked #11 on
Action Classification
on Charades
no code implementations • CVPR 2021 • Fengda Zhu, Xiwen Liang, Yi Zhu, Xiaojun Chang, Xiaodan Liang
In this task, an agent is required to navigate from an arbitrary position in a 3D embodied environment to localize a target following a scene description.
no code implementations • 11 Mar 2021 • Yuzhe Qin, Huaxiong Huang, Yi Zhu, Chun Liu, Shixin Xu
Numerical simulations first illustrate the consistency of theoretical results on the sharp interface limit.
Numerical Analysis Numerical Analysis 76Z99, 92B05, 76R50
1 code implementation • 19 Feb 2021 • Yi Zhu, Evgueni T. Filipov
Electro-thermally actuated origami provides a novel method for creating 3-D systems with advanced morphing and functional capabilities.
Robotics
no code implementations • 17 Feb 2021 • Haimo Guo, Meirong Zhang, Yi Zhu
Weyl points are degenerate points on the spectral bands at which energy bands intersect conically.
Mathematical Physics Mathematical Physics Spectral Theory
1 code implementation • ICCV 2021 • Zhiqiang Tang, Yunhe Gao, Yi Zhu, Zhi Zhang, Mu Li, Dimitris Metaxas
Can we develop new normalization methods to improve generalization robustness under distribution shifts?
1 code implementation • EACL 2021 • Yi Zhu, Ehsan Shareghi, Yingzhen Li, Roi Reichart, Anna Korhonen
Semi-supervised learning through deep generative models and multi-lingual pretraining techniques have orchestrated tremendous success across different areas of NLP.
no code implementations • 1 Jan 2021 • Zhiqiang Tang, Yunhe Gao, Yi Zhu, Zhi Zhang, Mu Li, Dimitris N. Metaxas
CrossNorm exchanges styles between feature channels to perform style augmentation, diversifying the content and style mixtures.
no code implementations • ICCV 2021 • Yi Zhu, Yue Weng, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Yutong Lu, Jianbin Jiao
Vision-Dialog Navigation (VDN) requires an agent to ask questions and navigate following the human responses to find target objects.
no code implementations • ACL 2021 • Mengjie Zhao, Yi Zhu, Ehsan Shareghi, Ivan Vulić, Roi Reichart, Anna Korhonen, Hinrich Schütze
Few-shot crosslingual transfer has been shown to outperform its zero-shot counterpart with pretrained encoders like multilingual BERT.
no code implementations • 15 Dec 2020 • Xinyu Li, Chunhui Liu, Bing Shuai, Yi Zhu, Hao Chen, Joseph Tighe
In the world of action recognition research, one primary focus has been on how to construct and train networks to model the spatial-temporal volume of an input video.
no code implementations • 15 Dec 2020 • Yi Zhu
The global existence of solutions to incompressible viscoelastic flows has been a longstanding open problem, even for the global weak solution.
Analysis of PDEs 76A10, 76D03, 35B65
1 code implementation • 11 Dec 2020 • Yi Zhu, Xinyu Li, Chunhui Liu, Mohammadreza Zolfaghari, Yuanjun Xiong, Chongruo wu, Zhi Zhang, Joseph Tighe, R. Manmatha, Mu Li
Video action recognition is one of the representative tasks for video understanding.
1 code implementation • 8 Dec 2020 • Xueqing Deng, Yi Zhu, Yuxin Tian, Shawn Newsam
Land-cover classification using remote sensing imagery is an important Earth observation task.
no code implementations • 1 Dec 2020 • Srikar Appalaraju, Yi Zhu, Yusheng Xie, István Fehérvári
Self-supervised representation learning has seen remarkable progress in the last few years.
no code implementations • 18 Aug 2020 • Li-Na Wang, Rui Tang, Yawei Yue, Xingshu Chen, Wei Wang, Yi Zhu, Xuemei Zeng
The vulnerability of deep neural networks (DNNs) to adversarial attack, which is an attack that can mislead state-of-the-art classifiers into making an incorrect classification with high confidence by deliberately perturbing the original inputs, raises concerns about the robustness of DNNs to such attacks.
1 code implementation • 25 Jun 2020 • Jipeng Qiang, Yun Li, Yi Zhu, Yunhao Yuan, Xindong Wu
Lexical simplification (LS) aims to replace complex words in a given sentence with their simpler alternatives of equivalent meaning, to simplify the sentence.
no code implementations • 2 Jun 2020 • Wuyue Yang, Liangrong Peng, Yi Zhu, Liu Hong
The onset of hydrodynamic instabilities is of great importance in both industry and daily life, due to the dramatic mechanical and thermodynamic changes for different types of flow motions.
no code implementations • 1 Jun 2020 • Wuyue Yang, Liangrong Peng, Yi Zhu, Liu Hong
Due to the intrinsic complexity and nonlinearity of chemical reactions, direct applications of traditional machine learning algorithms may face with many difficulties.
no code implementations • 26 May 2020 • Yi Zhu, Yiwei Zhou, Menglin Xia
Finally, we demonstrate that adversarial training with SAGE augmented data can improve performance and robustness of TableQA systems.
no code implementations • 11 May 2020 • Pipi Hu, Wuyue Yang, Yi Zhu, Liu Hong
To derive the hidden dynamics from observed data is one of the fundamental but also challenging problems in many different fields.
no code implementations • 30 Apr 2020 • Yi Zhu, Zhongyue Zhang, Chongruo wu, Zhi Zhang, Tong He, Hang Zhang, R. Manmatha, Mu Li, Alexander Smola
In the case of semantic segmentation, this means that large amounts of pixelwise annotations are required to learn accurate models.
27 code implementations • 19 Apr 2020 • Hang Zhang, Chongruo wu, Zhongyue Zhang, Yi Zhu, Haibin Lin, Zhi Zhang, Yue Sun, Tong He, Jonas Mueller, R. Manmatha, Mu Li, Alexander Smola
It is well known that featuremap attention and multi-path representation are important for visual recognition.
Ranked #5 on
Instance Segmentation
on COCO test-dev
(APS metric)
1 code implementation • ECCV 2020 • Hu Zhang, Linchao Zhu, Yi Zhu, Yi Yang
Most of previous work on adversarial attack mainly focus on image models, while the vulnerability of video models is less explored.
1 code implementation • CVPR 2020 • Yi Zhu, Fengda Zhu, Zhaohuan Zhan, Bingqian Lin, Jianbin Jiao, Xiaojun Chang, Xiaodan Liang
Benefiting from the collaborative learning of the L-mem and the V-mem, our CMN is able to explore the memory about the decision making of historical navigation actions which is for the current step.
no code implementations • 23 Dec 2019 • Xueqing Deng, Yi Zhu, Yuxin Tian, Shawn Newsam
Inspired by this, we investigate methods to inform or guide deep learning models for geospatial image analysis to increase their performance when a limited amount of training data is available or when they are applied to scenarios other than which they were trained on.
no code implementations • CVPR 2020 • Fengda Zhu, Yi Zhu, Xiaojun Chang, Xiaodan Liang
In this paper, we introduce Auxiliary Reasoning Navigation (AuxRN), a framework with four self-supervised auxiliary reasoning tasks to take advantage of the additional training signals derived from the semantic information.
Ranked #7 on
Vision and Language Navigation
on VLN Challenge
no code implementations • 4 Nov 2019 • Yi Zhu, Jing Dong
In this paper, we study a simple algorithm to construct asymptotically valid confidence regions for model parameters using the batch means method.
no code implementations • ICLR 2020 • YI Zhu, Jing Dong, Henry Lam
Despite an ever growing literature on reinforcement learning algorithms and applications, much less is known about their statistical inference.
no code implementations • CONLL 2019 • Yi Zhu, Benjamin Heinzerling, Ivan Vulić, Michael Strube, Roi Reichart, Anna Korhonen
Recent work has validated the importance of subword information for word representation learning.
1 code implementation • 4 Aug 2019 • Shuai Yang, Hao Wang, Yuhong Zhang, Pei-Pei Li, Yi Zhu, Xuegang Hu
Domain adaptation aims to exploit the knowledge in source domain to promote the learning tasks in target domain, which plays a critical role in real-world applications.
no code implementations • 24 Jul 2019 • Yi Zhu, Shawn Newsam
Motivated by our observation that motion information is the key to good anomaly detection performance in video, we propose a temporal augmented network to learn a motion-aware feature.
2 code implementations • 14 Jul 2019 • Jipeng Qiang, Yun Li, Yi Zhu, Yunhao Yuan, Xindong Wu
Lexical simplification (LS) aims to replace complex words in a given sentence with their simpler alternatives of equivalent meaning.
4 code implementations • 9 Jul 2019 • Jian Guo, He He, Tong He, Leonard Lausen, Mu Li, Haibin Lin, Xingjian Shi, Chenguang Wang, Junyuan Xie, Sheng Zha, Aston Zhang, Hang Zhang, Zhi Zhang, Zhongyue Zhang, Shuai Zheng, Yi Zhu
We present GluonCV and GluonNLP, the deep learning toolkits for computer vision and natural language processing based on Apache MXNet (incubating).
no code implementations • CVPR 2019 • Yi Zhu, Yanzhao Zhou, Huijuan Xu, Qixiang Ye, David Doermann, Jianbin Jiao
However, learning the full extent of pixel-level instance response in a weakly supervised manner remains unexplored.
Ranked #8 on
Image-level Supervised Instance Segmentation
on PASCAL VOC 2012 val
(using extra training data)
Image-level Supervised Instance Segmentation
RGB Salient Object Detection
+4
no code implementations • NAACL 2019 • Ehsan Shareghi, Yingzhen Li, Yi Zhu, Roi Reichart, Anna Korhonen
While neural dependency parsers provide state-of-the-art accuracy for several languages, they still rely on large amounts of costly labeled training data.
1 code implementation • 25 May 2019 • Yi Zhu
In this dissertation, I present my work towards exploring temporal information for better video understanding.
1 code implementation • NAACL 2019 • Yi Zhu, Ivan Vulić, Anna Korhonen
The use of subword-level information (e. g., characters, character n-grams, morphemes) has become ubiquitous in modern word representation learning.
no code implementations • 19 Feb 2019 • Xueqing Deng, Yi Zhu, Shawn Newsam
This paper develops a deep-learning framework to synthesize a ground-level view of a location given an overhead image.
3 code implementations • CVPR 2019 • Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn Newsam, Andrew Tao, Bryan Catanzaro
In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks.
Ranked #1 on
Semantic Segmentation
on CamVid
(using extra training data)
no code implementations • 30 Oct 2018 • Yi Zhu, Jia Xue, Shawn Newsam
Deep neural networks have led to a series of breakthroughs in computer vision given sufficient annotated training datasets.
no code implementations • 30 Oct 2018 • Yi Zhu, Shawn Newsam
However, this does not work well for multirate videos, in which actions or subactions occur at different speeds.
no code implementations • 13 Jun 2018 • Xueqing Deng, Yi Zhu, Shawn Newsam
More significantly, we show the generated images are representative of the locations and that the representations learned by the cGANs are informative.
no code implementations • 7 May 2018 • Yi Zhu, Shawn Newsam
Despite the significant progress that has been made on estimating optical flow recently, most estimation methods, including classical and deep learning approaches, still have difficulty with multi-scale estimation, real-time computation, and/or occlusion reasoning.
1 code implementation • NAACL 2018 • Yijia Liu, Yi Zhu, Wanxiang Che, Bing Qin, Nathan Schneider, Noah A. Smith
Nonetheless, using the new treebank, we build a pipeline system to parse raw tweets into UD.
1 code implementation • CVPR 2018 • Yanzhao Zhou, Yi Zhu, Qixiang Ye, Qiang Qiu, Jianbin Jiao
Motivated by this, we first design a process to stimulate peaks to emerge from a class response map.
Ranked #9 on
Image-level Supervised Instance Segmentation
on PASCAL VOC 2012 val
(using extra training data)
General Classification
Image-level Supervised Instance Segmentation
+2
no code implementations • CVPR 2018 • Yi Zhu, Yang Long, Yu Guan, Shawn Newsam, Ling Shao
Unseen Action Recognition (UAR) aims to recognise novel action categories without training examples.
Ranked #2 on
Action Recognition
on ActivityNet
no code implementations • 21 Feb 2018 • Xueqing Deng, Yi Zhu, Shawn Newsam
We also show that the spatial morphing kernel improves the results.
no code implementations • 7 Feb 2018 • Yi Zhu, Xueqing Deng, Shawn Newsam
We perform fine-grained land use mapping at the city scale using ground-level images.
1 code implementation • ICCV 2017 • Yi Zhu, Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao
Weakly supervised object localization remains challenging, where only image labels instead of bounding boxes are available during training.
Ranked #2 on
Weakly Supervised Object Detection
on COCO
Weakly Supervised Object Detection
Weakly-Supervised Object Localization
no code implementations • 19 Jul 2017 • Yi Zhu, Shawn Newsam
Classical approaches for estimating optical flow have achieved rapid progress in the last decade.
no code implementations • 24 Jun 2017 • Yi Zhu, Sen Liu, Shawn Newsam
This paper is the first work to perform spatio-temporal mapping of human activity using the visual content of geo-tagged videos.
no code implementations • 11 Apr 2017 • Yi Zhu, Shawn Newsam, Zaikun Xu
This notebook paper describes our system for the untrimmed classification task in the ActivityNet challenge 2016.
3 code implementations • 2 Apr 2017 • Yi Zhu, Zhenzhong Lan, Shawn Newsam, Alexander G. Hauptmann
State-of-the-art action recognition approaches rely on traditional optical flow estimation methods to pre-compute motion information for CNNs.
Ranked #16 on
Action Recognition
on UCF101
no code implementations • 8 Feb 2017 • Yi Zhu, Zhenzhong Lan, Shawn Newsam, Alexander G. Hauptmann
We study the unsupervised learning of CNNs for optical flow estimation using proxy ground truth data.
no code implementations • 25 Jan 2017 • Zhenzhong Lan, Yi Zhu, Alexander G. Hauptmann
We investigate the problem of representing an entire video using CNN features for human action recognition.
no code implementations • 22 Dec 2016 • Yi Zhu, Shawn Newsam
We employ a multi-task learning framework that performs the three highly related steps of action proposal, action recognition, and action localization refinement in parallel instead of the standard sequential pipeline that performs the steps in order.
no code implementations • 21 Sep 2016 • Yi Zhu, Shawn Newsam
We perform spatio-temporal analysis of public sentiment using geotagged photo collections.
no code implementations • 21 Sep 2016 • Yi Zhu, Shawn Newsam
Land use mapping is a fundamental yet challenging task in geographic science.
no code implementations • 15 Aug 2016 • Yi Zhu, Shawn Newsam
This paper performs the first investigation into depth for large-scale human action recognition in video where the depth cues are estimated from the videos themselves.