no code implementations • Findings (ACL) 2022 • Yao Zhao, Jiacheng Huang, Wei Hu, Qijin Chen, Xiaoxia Qiu, Chengfu Huo, Weijun Ren
In this paper, we propose an implicit RL method called ImRL, which links relation phrases in NL to relation paths in KG.
no code implementations • Findings (EMNLP) 2021 • Misha Khalman, Yao Zhao, Mohammad Saleh
We also show that using a conversational corpus for pre-training improves the quality of the chat summarization model.
no code implementations • 4 Dec 2023 • Zhongwei Ren, Zhicheng Huang, Yunchao Wei, Yao Zhao, Dongmei Fu, Jiashi Feng, Xiaojie Jin
PixelLM excels across various pixel-level image reasoning and understanding tasks, outperforming well-established methods in multiple benchmarks, including MUSE, single- and multi-referring segmentation.
no code implementations • 14 Nov 2023 • Jing Nathan Yan, Tianqi Liu, Justin T Chiu, Jiaming Shen, Zhen Qin, Yue Yu, Yao Zhao, Charu Lakshmanan, Yair Kurzion, Alexander M. Rush, Jialu Liu, Michael Bendersky
Comparative reasoning plays a crucial role in text preference prediction; however, large language models (LLMs) often demonstrate inconsistencies in their reasoning.
no code implementations • 1 Nov 2023 • You Zhou, Xiujing Lin, Xiang Zhang, Maolin Wang, Gangwei Jiang, Huakang Lu, Yupeng Wu, Kai Zhang, Zhe Yang, Kehang Wang, Yongduo Sui, Fengwei Jia, Zuoli Tang, Yao Zhao, Hongxuan Zhang, Tiannuo Yang, Weibo Chen, Yunong Mao, Yi Li, De Bao, Yu Li, Hongrui Liao, Ting Liu, Jingwen Liu, Jinchi Guo, Xiangyu Zhao, Ying Wei, Hong Qian, Qi Liu, Xiang Wang, Wai Kin Chan, Chenliang Li, Yusen Li, Shiyu Yang, Jining Yan, Chao Mou, Shuai Han, Wuxia Jin, Guannan Zhang, Xiaodong Zeng
To tackle the challenges of computing resources and environmental impact of AI, Green Computing has become a hot research topic.
no code implementations • 26 Oct 2023 • Shuai Zheng, Zhizhe Liu, Zhenfeng Zhu, Xingxing Zhang, JianXin Li, Yao Zhao
On this basis, BiKT not only allows us to acquire knowledge from both the GNN and its derived model, but also lets the two promote each other by injecting the knowledge of each into the other.
no code implementations • 9 Oct 2023 • Jiyuan Wang, Chunyu Lin, Lang Nie, Shujun Huang, Yao Zhao, Xing Pan, Rui Ai
Concretely, we first present a progressive curriculum learning scheme with three simple-to-complex curricula to gradually adapt the model from clear to relatively adverse, and then to adverse weather scenes.
1 code implementation • NeurIPS 2023 • Siyu Jiao, Yunchao Wei, YaoWei Wang, Yao Zhao, Humphrey Shi
However, in the paper, we reveal that CLIP is insensitive to different mask proposals and tends to produce similar predictions for various mask proposals of the same image.
no code implementations • 25 Sep 2023 • Meiqin Liu, Chenming Xu, Chao Yao, Weisi Lin, Yao Zhao
Learned B-frame video compression aims to adopt bi-directional motion estimation and motion compensation (MEMC) coding for middle frame reconstruction.
no code implementations • 18 Sep 2023 • Huan Liu, Zichang Tan, Qiang Chen, Yunchao Wei, Yao Zhao, Jingdong Wang
Moreover, to address the semantic conflicts between image and frequency domains, the forgery-aware mutual module is developed to further enable the effective interaction of disparate image and frequency features, resulting in aligned and comprehensive visual forgery representations.
no code implementations • 13 Sep 2023 • Tianqi Liu, Yao Zhao, Rishabh Joshi, Misha Khalman, Mohammad Saleh, Peter J. Liu, Jialu Liu
DPO's lack of a reward model constrains its ability to sample preference pairs from the optimal policy, and SLiC is restricted to sampling preference pairs only from the SFT policy.
1 code implementation • 17 Aug 2023 • Runmin Cong, Yuchen Guan, Jinpeng Chen, Wei Zhang, Yao Zhao, Sam Kwong
Despite significant progress in shadow detection, current methods still struggle with the adverse impact of background color, which may lead to errors when shadows are present on complex backgrounds.
2 code implementations • 17 Aug 2023 • Runmin Cong, Mengyao Sun, Sanyi Zhang, Xiaofei Zhou, Wei Zhang, Yao Zhao
Camouflaged object detection (COD) aims to accurately detect objects hidden in the surrounding environment.
2 code implementations • ICCV 2023 • Huan Liu, Qiang Chen, Zichang Tan, Jiang-Jiang Liu, Jian Wang, Xiangbo Su, Xiaolong Li, Kun Yao, Junyu Han, Errui Ding, Yao Zhao, Jingdong Wang
State-of-the-art solutions adopt the DETR-like framework, and mainly develop the complex decoder, e.g., regarding pose estimation as keypoint box detection and combining with human detection in ED-Pose, hierarchically predicting with pose decoder and joint (keypoint) decoder in PETR.
1 code implementation • 14 Aug 2023 • Hongguang Zhu, Yunchao Wei, Xiaodan Liang, Chunjie Zhang, Yao Zhao
Regarding the growing nature of real-world data, such an offline training paradigm on ever-expanding data is unsustainable, because models lack the continual learning ability to accumulate knowledge constantly.
no code implementations • 13 Aug 2023 • Yuyang Yin, Dejia Xu, Chuangchuang Tan, Ping Liu, Yao Zhao, Yunchao Wei
Low light enhancement has gained increasing importance with the rapid development of visual creation and editing.
1 code implementation • 27 Jun 2023 • Anqi Li, Feng Li, Jiaxin Han, Huihui Bai, Runmin Cong, Chunjie Zhang, Meng Wang, Weisi Lin, Yao Zhao
Extensive experiments have demonstrated that our approach outperforms recent state-of-the-art methods in R-D performance, visual quality, and downstream applications, at very low bitrates.
1 code implementation • 15 Jun 2023 • Dongyi Zhang, Feng Li, Man Liu, Runmin Cong, Huihui Bai, Meng Wang, Yao Zhao
In this work, we explore the potential of resolution fields in scalable image compression and propose the reciprocal pyramid network (RPN) that fulfills the need for more adaptable and versatile compression.
no code implementations • 12 Jun 2023 • Yu Chen, Yang Yu, Rongrong Ni, Yao Zhao, Haoliang Li
Next, we design a phoneme-viseme awareness module for cross-modal feature fusion and representation alignment, so that the modality gap can be reduced and the intrinsic complementarity of the two modalities can be better explored.
no code implementations • 2 Jun 2023 • Yao Zhao, Sophine Zhang, Zhiyuan Yao
Anomaly detection is an important task in network management.
no code implementations • 17 May 2023 • Yao Zhao, Rishabh Joshi, Tianqi Liu, Misha Khalman, Mohammad Saleh, Peter J. Liu
Past work has often relied on Reinforcement Learning from Human Feedback (RLHF), which optimizes the language model using reward scores assigned from a reward model trained on human preference data.
no code implementations • 25 Apr 2023 • Haoyu Chu, Shikui Wei, Ting Liu, Yao Zhao
Deep equilibrium (DEQ) models have emerged as a promising class of implicit layer models in deep learning, which abandon traditional depth by solving for the fixed points of a single nonlinear layer.
1 code implementation • CVPR 2023 • Man Liu, Feng Li, Chunjie Zhang, Yunchao Wei, Huihui Bai, Yao Zhao
Generalized Zero-Shot Learning (GZSL) identifies unseen categories by knowledge transferred from the seen domain, relying on the intrinsic interactions between visual and semantic information.
no code implementations • 20 Mar 2023 • Yuwei Wu, Zhe Zhang, Xiaolan Qiu, Yao Zhao, Weidong Yu
repetition frequency (PRF).
1 code implementation • 19 Mar 2023 • Kang Liao, Lang Nie, Shujuan Huang, Chunyu Lin, Jing Zhang, Yao Zhao, Moncef Gabbouj, DaCheng Tao
In this paper, we provide a comprehensive survey of learning-based camera calibration techniques, by analyzing their strengths and limitations.
no code implementations • ICCV 2023 • Kunyang Han, Yong Liu, Jun Hao Liew, Henghui Ding, Yunchao Wei, Jiajun Liu, Yitong Wang, Yansong Tang, Yujiu Yang, Jiashi Feng, Yao Zhao
Recent advancements in pre-trained vision-language models, such as CLIP, have enabled the segmentation of arbitrary concepts solely from textual inputs, a process commonly referred to as open-vocabulary semantic segmentation (OVS).
Knowledge Distillation
Open Vocabulary Semantic Segmentation
no code implementations • 16 Mar 2023 • Jiaming Liang, Meiqin Liu, Chao Yao, Chunyu Lin, Yao Zhao
Variable-rate mechanism has improved the flexibility and efficiency of learning-based image compression that trains multiple models for different rate-distortion tradeoffs.
1 code implementation • CVPR 2023 • Zhijie Shen, Zishuo Zheng, Chunyu Lin, Lang Nie, Kang Liao, Shuai Zheng, Yao Zhao
Based on the Manhattan World assumption, most existing indoor layout estimation schemes focus on recovering layouts from vertically compressed 1D sequences.
no code implementations • 20 Feb 2023 • Zisong Chen, Chunyu Lin, Lang Nie, Kang Liao, Yao Zhao
In this paper, we propose the first unsupervised omnidirectional MVS framework based on multiple fisheye images.
1 code implementation • ICCV 2023 • Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, Yao Zhao
First, we propose a robust and flexible warp to model the image registration from global homography to local thin-plate spline motion.
1 code implementation • 8 Feb 2023 • Shangrong Yang, Chunyu Lin, Kang Liao, Yao Zhao
Subsequently, we observe that the inter-frame optical flow of the video is facilitated to perceive the local spatial deformation of the fisheye video.
no code implementations • 26 Jan 2023 • Shangrong Yang, Chunyu Lin, Kang Liao, Yao Zhao
To this end, we propose a Dual Diffusion Architecture (DDA) for the fisheye rectification with a better generalization ability.
1 code implementation • ICCV 2023 • Kang Liao, Lang Nie, Chunyu Lin, Zishuo Zheng, Yao Zhao
In this work, we explore constructing a win-win representation on both content and boundary by contributing a new learning model, i.e., Rectangling Rectification Network (RecRecNet).
1 code implementation • CVPR 2023 • Chuangchuang Tan, Yao Zhao, Shikui Wei, Guanghua Gu, Yunchao Wei
The key to fake image detection is to develop a generalized representation to describe the artifacts produced by generation models.
no code implementations • ICCV 2023 • Shangrong Yang, Chunyu Lin, Kang Liao, Yao Zhao
Fisheye image rectification is hindered by synthetic models producing poor results for real-world correction.
1 code implementation • CVPR 2023 • Mengxue Qu, Yu Wu, Yunchao Wei, Wu Liu, Xiaodan Liang, Yao Zhao
Extensive experiments show that our model achieves 52.06% in terms of accuracy (versus 58.93% in fully supervised setting) on RefCOCO+@testA, when only using 1% of the mask annotations.
1 code implementation • CVPR 2023 • Weijia Li, Saihui Hou, Chunjie Zhang, Chunshui Cao, Xu Liu, Yongzhen Huang, Yao Zhao
For the cloth-changing problem, video-based ReID is rarely studied due to the lack of a suitable cloth-changing benchmark, and gait recognition is often researched under controlled conditions.
1 code implementation • ICCV 2023 • Hongguang Zhu, Yunchao Wei, Xiaodan Liang, Chunjie Zhang, Yao Zhao
Regarding the growing nature of real-world data, such an offline training paradigm on ever-expanding data is unsustainable, because models lack the continual learning ability to accumulate knowledge constantly.
no code implementations • ICCV 2023 • Yan Fang, Feng Zhu, Bowen Cheng, Luoqi Liu, Yao Zhao, Yunchao Wei
This work shows that locating the patch-wise noisy region is a better way to deal with noise.
no code implementations • 23 Dec 2022 • Runmin Cong, Ke Huang, Jianjun Lei, Yao Zhao, Qingming Huang, Sam Kwong
Salient object detection (SOD) aims to determine the most visually attractive objects in an image.
no code implementations • 20 Dec 2022 • Kundan Krishna, Yao Zhao, Jie Ren, Balaji Lakshminarayanan, Jiaming Luo, Mohammad Saleh, Peter J. Liu
We present a large empirical study quantifying the sometimes severe loss in performance (up to 12 ROUGE-1 points) from different types of input noise for a range of datasets and model sizes.
no code implementations • 17 Dec 2022 • Hui Li, MingJie Sun, Jimin Xiao, Eng Gee Lim, Yao Zhao
To validate our framework on a weakly-supervised setting, we annotated three RES benchmark datasets (RefCOCO, RefCOCO+ and RefCOCOg) with click annotations. Our method is simple but surprisingly effective, outperforming all previous state-of-the-art RES methods on fully- and weakly-supervised settings by a large margin.
no code implementations • 7 Dec 2022 • Shuai Zheng, Zhenfeng Zhu, Zhizhe Liu, Youru Li, Yao Zhao
Graph neural networks (GNNs) have shown remarkable performance on homophilic graph data while being far less impressive when handling non-homophilic graph data due to the inherent low-pass filtering property of GNNs.
1 code implementation • 5 Dec 2022 • Siyu Jiao, Gengwei Zhang, Shant Navasardyan, Ling Chen, Yao Zhao, Yunchao Wei, Humphrey Shi
Typical methods follow the paradigm of first learning prototypical features from support images and then matching query features at the pixel level to obtain segmentation results.
1 code implementation • 3 Dec 2022 • Yixuan Wu, Feng Li, Huihui Bai, Weisi Lin, Runmin Cong, Yao Zhao
In this paper, we analyze the degradation of a high-resolution (HR) image from image intrinsic components according to a degradation-based formulation model.
1 code implementation • 3 Dec 2022 • Feng Li, Yixuan Wu, Huihui Bai, Weisi Lin, Runmin Cong, Yao Zhao
Recent blind SR methods suggest reconstructing SR images by relying on blur kernel estimation.
1 code implementation • 15 Nov 2022 • Youru Li, Zhenfeng Zhu, Xiaobo Guo, Shaoshuai Li, Yuchen Yang, Yao Zhao
Moreover, the hierarchical representations at both instance level and channel level can be coordinated by the heterogeneous information aggregation under the guidance of global view.
no code implementations • 9 Nov 2022 • Yangjun Wu, Kebin Fang, Yao Zhao, Hao Zhang, Lifeng Shi, Mengqi Zhang
To accomplish punctuation restoration, most existing methods focus on introducing extra information (e.g., part-of-speech) or addressing the class imbalance problem.
no code implementations • 8 Nov 2022 • Yiming Wang, Dongxia Chang, Zhiqiang Fu, Jie Wen, Yao Zhao
Multi-view representation learning has developed rapidly over the past decades and has been applied in many fields.
1 code implementation • 3 Nov 2022 • Meiqin Liu, Shuo Jin, Chao Yao, Chunyu Lin, Yao Zhao
A spatio-temporal stability module is designed to learn the self-alignment from inter-frames.
no code implementations • 30 Oct 2022 • Yao Zhao, Connor James Stephens, Csaba Szepesvári, Kwang-Sung Jun
Simple regret is a natural and parameter-free performance criterion for pure exploration in multi-armed bandits yet is less popular than the probability of missing the best arm or an $\epsilon$-good arm, perhaps due to lack of easy ways to characterize it.
no code implementations • 12 Oct 2022 • Runmin Cong, Weiyu Song, Jianjun Lei, Guanghui Yue, Yao Zhao, Sam Kwong
Finally, we use the Importance Perception Fusion (IPF) module to fuse the features from two parallel branches according to their different importance in different scenarios.
2 code implementations • 9 Oct 2022 • Runmin Cong, Kepu Zhang, Chen Zhang, Feng Zheng, Yao Zhao, Qingming Huang, Sam Kwong
In addition, considering the role of thermal modality, we set up different cross-modality interaction mechanisms in the encoding phase and the decoding phase.
3 code implementations • 6 Oct 2022 • Runmin Cong, Qinwei Lin, Chen Zhang, Chongyi Li, Xiaochun Cao, Qingming Huang, Yao Zhao
Focusing on the issue of how to effectively capture and utilize cross-modality information in RGB-D salient object detection (SOD) task, we present a convolutional neural network (CNN) model, named CIR-Net, based on the novel cross-modality interaction and refinement.
no code implementations • 30 Sep 2022 • Jie Ren, Jiaming Luo, Yao Zhao, Kundan Krishna, Mohammad Saleh, Balaji Lakshminarayanan, Peter J. Liu
Furthermore, the space of potential low-quality outputs is larger as arbitrary text can be generated and it is important to know when to trust the generated output.
Abstractive Text Summarization
Out-of-Distribution Detection
no code implementations • 30 Sep 2022 • Yao Zhao, Misha Khalman, Rishabh Joshi, Shashi Narayan, Mohammad Saleh, Peter J. Liu
Conditional language models are predominantly trained with maximum likelihood estimation (MLE), giving probability mass to sparsely observed target sequences.
Ranked #1 on Abstractive Text Summarization on CNN / Daily Mail
abstractive question answering
Abstractive Text Summarization
1 code implementation • 19 Sep 2022 • Zongyu Li, Zhenfeng Zhu, Xiaobo Guo, Shuai Zheng, Zhenyu Guo, Siwei Qiang, Yao Zhao
The concept of causality plays a significant role in human cognition.
3 code implementations • 7 Sep 2022 • Runmin Cong, Qi Qin, Chen Zhang, Qiuping Jiang, Shiqi Wang, Yao Zhao, Sam Kwong
In this paper, we focus on a new weakly-supervised SOD task under hybrid labels, where the supervision labels include a large number of coarse labels generated by the traditional unsupervised method and a small number of real labels.
Ranked #7 on RGB Salient Object Detection on PASCAL-S
1 code implementation • 7 Sep 2022 • Runmin Cong, Yumo Zhang, Ning Yang, Haisheng Li, Xueqi Zhang, Ruochen Li, Zewen Chen, Yao Zhao, Sam Kwong
The coronavirus disease 2019 (COVID-19) continues to have a negative impact on healthcare systems around the world, even though vaccines have been developed and national vaccination coverage rates are steadily increasing.
1 code implementation • 16 Aug 2022 • Jian Jin, Yuan Xue, Xingxing Zhang, Lili Meng, Yao Zhao, Weisi Lin
However, they have a major drawback that the generated JND is assessed in the real-world signal domain instead of in the perceptual domain in the human brain.
1 code implementation • 8 Aug 2022 • Jason Phang, Yao Zhao, Peter J. Liu
While large pretrained Transformer models have proven highly capable at tackling natural language tasks, handling long sequence inputs continues to be a significant challenge.
Ranked #2 on Long-range modeling on SCROLLS (GovRep metric)
no code implementations • 3 Aug 2022 • Zhijie Shen, Chunyu Lin, Lang Nie, Kang Liao, Yao Zhao
For a monocular 360 image, depth estimation is challenging because the distortion increases along the latitude.
Ranked #8 on Depth Estimation on Stanford2D3D Panoramic
no code implementations • 1 Aug 2022 • Reinald Kim Amplayo, Peter J. Liu, Yao Zhao, Shashi Narayan
Specifically, we treat sentences as basic units of matching instead of tokens, and use a sentence matching function to soft-match candidate and reference sentences.
1 code implementation • 27 Jul 2022 • Mengxue Qu, Yu Wu, Wu Liu, Qiqi Gong, Xiaodan Liang, Olga Russakovsky, Yao Zhao, Yunchao Wei
Particularly, SiRi conveys a significant principle to the research of visual grounding, i.e., a better initialized vision-language encoder would help the model converge to a better local minimum, advancing the performance accordingly.
3 code implementations • 17 Jul 2022 • Runmin Cong, Haowei Yang, Qiuping Jiang, Wei Gao, Haisheng Li, Cong Wang, Yao Zhao, Sam Kwong
The spread of COVID-19 has brought a huge disaster to the world, and the automatic segmentation of infection regions can help doctors to make diagnosis quickly and reduce workload.
1 code implementation • 7 Jul 2022 • Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, Yao Zhao
To this end, we leverage a neural network to predict the optical flows that can warp the tilted images to be perceptually horizontal.
no code implementations • 6 Jul 2022 • Zishuo Zheng, Chunyu Lin, Lang Nie, Kang Liao, Zhijie Shen, Yao Zhao
In this paper, we combine the two different representations and propose a novel 360° semantic segmentation solution from a complementary perspective.
no code implementations • 5 Jul 2022 • Shangrong Yang, Chunyu Lin, Kang Liao, Yao Zhao
To leverage these two characteristics, we introduced Fishformer that processes the fisheye image as a sequence to enhance global and local perception.
1 code implementation • 12 Jun 2022 • Kang Liao, Chunyu Lin, Yunchao Wei, Yao Zhao
For the distortion synthesis, we propose a spiral distortion-aware perception module, in which the learning path keeps consistent with the distortion prior of the fisheye image.
1 code implementation • 9 Jun 2022 • Meiqin Liu, Chenming Xu, Chao Yao, Chunyu Lin, Yao Zhao
Video frame interpolation (VFI) aims to generate predictive frames by warping learnable motions from the bidirectional historical references.
no code implementations • 24 May 2022 • Aaron Parisi, Yao Zhao, Noah Fiedel
Transformer based language models (LMs) demonstrate increasing performance with scale across a wide variety of tasks.
2 code implementations • 19 Apr 2022 • Runmin Cong, Ning Yang, Chongyi Li, Huazhu Fu, Yao Zhao, Qingming Huang, Sam Kwong
In this paper, we propose a global-and-local collaborative learning architecture, which includes a global correspondence modeling (GCM) and a local correspondence modeling (LCM) to capture comprehensive inter-image corresponding relationship among different images from the global and local perspectives.
1 code implementation • 18 Apr 2022 • Kang Liao, Xiangyu Xu, Chunyu Lin, Wenqi Ren, Yunchao Wei, Yao Zhao
Motivated by this analysis, we present a Cylin-Painting framework that involves meaningful collaborations between inpainting and outpainting and efficiently fuses the different arrangements, with a view to leveraging their complementary benefits on a consistent and seamless cylinder.
no code implementations • 12 Apr 2022 • Lei Zhang, Kang Liao, Chunyu Lin, Yao Zhao
Concretely, we propose a Depth-Guided Outpainting Network to model different feature representations of two modalities and learn the structure-aware cross-modal fusion.
1 code implementation • ACL 2022 • Shashi Narayan, Gonçalo Simões, Yao Zhao, Joshua Maynez, Dipanjan Das, Michael Collins, Mirella Lapata
We propose Composition Sampling, a simple but effective method to generate diverse outputs for conditional generation of higher quality compared to previous stochastic decoding strategies.
1 code implementation • 23 Mar 2022 • Yangjun Wu, Kebin Fang, Yao Zhao
To accomplish the punctuation restoration task, most existing approaches focused on leveraging extra information (e.g., part-of-speech tags) or addressing the class imbalance problem.
no code implementations • 18 Mar 2022 • Zhijie Shen, Chunyu Lin, Lang Nie, Kang Liao, Yao Zhao
It comprises two modules: Dual-Cubemap Depth Estimation (DCDE) module and Boundary Revision (BR) module.
1 code implementation • 17 Mar 2022 • Zhijie Shen, Chunyu Lin, Kang Liao, Lang Nie, Zishuo Zheng, Yao Zhao
In particular, we divide patches on the spherical tangent domain into tokens to reduce the negative effect of panoramic distortions.
Ranked #4 on Depth Estimation on Stanford2D3D Panoramic
1 code implementation • 11 Mar 2022 • Shuai Zheng, Zhenfeng Zhu, Zhizhe Liu, Zhenyu Guo, Yang Liu, Yuchen Yang, Yao Zhao
For disease prediction tasks, most existing graph-based methods tend to define the graph manually based on a specified modality (e.g., demographic information), and then integrate other modalities to obtain the patient representation by Graph Representation Learning (GRL).
no code implementations • 10 Mar 2022 • Haoyu Chu, Shikui Wei, Qiming Lu, Yao Zhao
We propose a new training method based on knowledge distillation to construct more powerful and robust Neural ODEs for image recognition tasks.
1 code implementation • CVPR 2022 • Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, Yao Zhao
In this paper, we address these issues by proposing the first deep learning solution to image rectangling.
no code implementations • 1 Mar 2022 • Yiming Wang, Dongxia Chang, Zhiqiang Fu, Jie Wen, Yao Zhao
In this paper, we propose an augmentation-free graph contrastive learning framework, namely ACTIVE, to solve the problem of partial multi-view clustering.
no code implementations • 26 Jan 2022 • Peng Li, Arim Park, Soohyun Cho, Yao Zhao
In this paper, we study the effect of compensated reviews on non-compensated reviews by utilizing online reviews on 1,240 auto shipping companies over a ten-year period from a transportation website.
1 code implementation • 21 Jan 2022 • Jiacheng Huang, Yao Zhao, Wei Hu, Zhen Ning, Qijin Chen, Xiaoxia Qiu, Chengfu Huo, Weijun Ren
In this paper, we propose a new trustworthy method that exploits facts for a KG based on multi-sourced noisy data and existing facts in the KG.
no code implementations • 7 Jan 2022 • Jian Jin, Xingxing Zhang, Lili Meng, Weisi Lin, Jie Liang, Huaxiang Zhang, Yao Zhao
Experimental results show that the VSD can be accurately estimated with the weights learnt by the nonlinear mapping function once its associated S-VSDs are available.
no code implementations • 8 Dec 2021 • Xudong Huang, Chunyu Lin, Haojie Liu, Lang Nie, Yao Zhao
LiDAR sensors are widely used in autonomous driving due to their reliable 3D spatial information.
no code implementations • 1 Dec 2021 • Yiming Wang, Dongxia Chang, Zhiqiang Fu, Yao Zhao
In this paper, we consider the problem of multi-view clustering on incomplete views.
2 code implementations • 27 Oct 2021 • Runmin Cong, Yumo Zhang, Leyuan Fang, Jun Li, Yao Zhao, Sam Kwong
Salient object detection (SOD) for optical remote sensing images (RSIs) aims at locating and extracting visually distinctive objects/regions from the optical RSIs.
1 code implementation • 4 Aug 2021 • Chen Zhang, Runmin Cong, Qinwei Lin, Lin Ma, Feng Li, Yao Zhao, Sam Kwong
For the cross-modality interaction in feature encoder, existing methods either indiscriminately treat RGB and depth modalities, or only habitually utilize depth cues as auxiliary information of the RGB branch.
no code implementations • 3 Aug 2021 • Bingfeng Zhang, Jimin Xiao, Yao Zhao
In this paper, we propose a new regularized loss which utilizes both shallow and deep features that are dynamically updated in order to aggregate sufficient information to represent the relationship of different pixels.
Weakly supervised Semantic Segmentation
Weakly-Supervised Semantic Segmentation
no code implementations • 27 Jul 2021 • Qi Tang, Runmin Cong, Ronghui Sheng, Lingzhi He, Dan Zhang, Yao Zhao, Sam Kwong
The other is the content guidance bridge (CGBdg) designed for the depth map reconstruction process, which provides the content guidance learned from DSR task for MDE task.
1 code implementation • 6 Jul 2021 • Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, Yao Zhao
Homography estimation is an important task in computer vision applications, such as image stitching, video stabilization, and camera calibration.
no code implementations • 1 Jul 2021 • Shuai Zheng, Zhenfeng Zhu, Zhizhe Liu, Zhenyu Guo, Yang Liu, Yao Zhao
However, it is not easy for these approaches to generalize to unseen samples.
1 code implementation • 24 Jun 2021 • Lang Nie, Chunyu Lin, Kang Liao, Shuaicheng Liu, Yao Zhao
Even compared with the supervised solutions, our image stitching quality is still preferred by users.
no code implementations • CVPR 2021 • Zhiqiang Fu, Yao Zhao, Dongxia Chang, Xingxing Zhang, Yiming Wang
This paper presents a novel, simple yet robust self-representation method, i.e., Double Low-Rank Representation with Projection Distance penalty (DLRRPD) for clustering.
1 code implementation • 8 Jun 2021 • Bingfeng Zhang, Jimin Xiao, Jianbo Jiao, Yunchao Wei, Yao Zhao
More importantly, our approach can be readily applied to the bounding box supervised instance segmentation task or other weakly supervised semantic segmentation tasks, with state-of-the-art or comparable performance among almost all weakly supervised tasks on the PASCAL VOC or COCO datasets.
no code implementations • 11 May 2021 • Yiming Wang, Dongxia Chang, Zhiqiang Fu, Yao Zhao
Specifically, a multiple graph auto-encoder (M-GAE) is designed to flexibly encode the complementary information of multi-view data using a multi-graph attention fusion encoder.
no code implementations • 30 Apr 2021 • Yiming Wang, Dongxia Chang, Zhiqiang Fu, Yao Zhao
This paper is the first attempt to employ graph pooling technique for node clustering and we propose a novel dual graph embedding network (DGEN), which is designed as a two-step graph encoder connected by a graph pooling layer to learn the graph embedding.
no code implementations • 26 Apr 2021 • Zhiqiang Fu, Yao Zhao, Dongxia Chang, Xingxing Zhang, Yiming Wang
In this paper, a novel unsupervised low-rank representation model, i.e., Auto-weighted Low-Rank Representation (ALRR), is proposed to construct a more favorable similarity graph (SG) for clustering.
no code implementations • 15 Apr 2021 • Shashi Narayan, Yao Zhao, Joshua Maynez, Gonçalo Simões, Vitaly Nikolaev, Ryan McDonald
Moreover, we demonstrate empirically that planning with entity chains provides a mechanism to control hallucinations in abstractive summaries.
1 code implementation • CVPR 2021 • Lingzhi He, Hongguang Zhu, Feng Li, Huihui Bai, Runmin Cong, Chunjie Zhang, Chunyu Lin, Meiqin Liu, Yao Zhao
Depth maps obtained by commercial depth sensors are always low-resolution, making them difficult to use in various computer vision tasks.
1 code implementation • CVPR 2021 • Shangrong Yang, Chunyu Lin, Kang Liao, Chunjie Zhang, Yao Zhao
We embed a correction layer in skip-connection and leverage the appearance flows in different layers to pre-correct the image features.
1 code implementation • 15 Mar 2021 • Zhizhe Liu, Zhenfeng Zhu, Shuai Zheng, Yang Liu, Jiayu Zhou, Yao Zhao
To bridge the gap between the source and target domains in unsupervised domain adaptation (UDA), the most common strategy puts focus on matching the marginal distributions in the feature space through adversarial learning.
no code implementations • 16 Feb 2021 • Jian Jin, Xingxing Zhang, Xin Fu, Huan Zhang, Weisi Lin, Jian Lou, Yao Zhao
Experimental results on image classification demonstrate that we successfully find the JND for deep machine vision.
no code implementations • 2 Feb 2021 • Yakun Niu, Benedetta Tondi, Yao Zhao, Rongrong Ni, Mauro Barni
We assume that both the spliced regions and the background image have undergone a double JPEG compression, and use a local estimate of the primary quantization matrix to distinguish between spliced regions taken from different sources.
no code implementations • 26 Jan 2021 • Pengpeng Yang, Daniele Baracchi, Massimo Iuliani, Dasara Shullani, Rongrong Ni, Yao Zhao, Alessandro Piva
Furthermore, it is capable of correctly identifying the operating system of the source device for most of the tampered videos.
no code implementations • ICCV 2021 • Kang Liao, Chunyu Lin, Lixin Liao, Yao Zhao, Weiyao Lin
In this paper, inspired by curriculum learning, we analyze the barrel distortion rectification task in a progressive and meaningful manner.
no code implementations • ICCV 2021 • Kang Liao, Chunyu Lin, Yunchao Wei, Feng Li, Shangrong Yang, Yao Zhao
To our knowledge, we are the first to tackle the challenging rectification via outpainting, and our curve-aware strategy can reach a rectification construction with complete content and regular shape.
no code implementations • 11 Dec 2020 • Lang Nie, Chunyu Lin, Kang Liao, Yao Zhao
In this paper, we propose an image stitching learning framework, which consists of a large-baseline deep homography module and an edge-preserved deformation module.
no code implementations • 4 Dec 2020 • Haoyu Chu, Shikui Wei, Yao Zhao
Thus, Neural ODEs have natural robustness against adversarial examples.
3 code implementations • 26 Nov 2020 • Qijian Zhang, Runmin Cong, Chongyi Li, Ming-Ming Cheng, Yuming Fang, Xiaochun Cao, Yao Zhao, Sam Kwong
Despite the remarkable advances in visual saliency analysis for natural scene images (NSIs), salient object detection (SOD) for optical remote sensing images (RSIs) still remains an open and challenging problem.
1 code implementation • NeurIPS 2020 • Qijian Zhang, Runmin Cong, Junhui Hou, Chongyi Li, Yao Zhao
In the first stage, we propose a group-attentional semantic aggregation module that models inter-image relationships to generate the group-wise semantic representations.
1 code implementation • 29 Oct 2020 • Feng Li, Runmin Cong, Huihui Bai, Yifan He, Yao Zhao, Ce Zhu
In this paper, we present a deep interleaved network (DIN) that learns how information at different states should be combined for high-quality (HQ) image reconstruction.
no code implementations • 27 Oct 2020 • Yang Yu, Rongrong Ni, Yao Zhao
Recently, AI-manipulated face techniques have developed rapidly and constantly, which has raised new security issues in society.
no code implementations • 17 Oct 2020 • Yunchao Wei, Shuai Zheng, Ming-Ming Cheng, Hang Zhao, LiWei Wang, Errui Ding, Yi Yang, Antonio Torralba, Ting Liu, Guolei Sun, Wenguan Wang, Luc van Gool, Wonho Bae, Junhyug Noh, Jinhwan Seo, Gunhee Kim, Hao Zhao, Ming Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang, Chuangchuang Tan, Tao Ruan, Guanghua Gu, Shikui Wei, Yao Zhao, Mariia Dobko, Ostap Viniavskyi, Oles Dobosevych, Zhendong Wang, Zhenyuan Chen, Chen Gong, Huanqing Yan, Jun He
The purpose of the Learning from Imperfect Data (LID) workshop is to inspire and facilitate the research in developing novel approaches that would harness the imperfect data and improve the data-efficiency during training.
no code implementations • 2 Oct 2020 • Chongyi Li, Runmin Cong, Chunle Guo, Hua Li, Chunjie Zhang, Feng Zheng, Yao Zhao
In this paper, we propose a novel Parallel Down-up Fusion network (PDF-Net) for SOD in optical RSIs, which takes full advantage of the in-path low- and high-level features and cross-path multi-resolution features to distinguish diversely scaled salient objects and suppress the cluttered backgrounds.
no code implementations • 2 Oct 2020 • Zhizhe Liu, Xingxing Zhang, Zhenfeng Zhu, Shuai Zheng, Yao Zhao, Jian Cheng
There have been numerous methods proposed for human identification, such as face identification, person re-identification, and gait identification.
no code implementations • 21 Jul 2020 • Kang Liao, Chunyu Lin, Yao Zhao
Distortion widely exists in images captured by popular wide-angle and fisheye cameras.
no code implementations • 20 Jun 2020 • Haojie Liu, Kang Liao, Chunyu Lin, Yao Zhao, Yulan Guo
Pseudo-LiDAR point cloud interpolation is a novel and challenging task in the field of autonomous driving, which aims to address the frequency mismatching problem between camera and LiDAR.
no code implementations • 18 Jun 2020 • Yao Zhao, Mohammad Saleh, Peter J. Liu
Most prior work in the sequence-to-sequence paradigm focused on datasets with input sequence lengths in the hundreds of tokens due to the computational constraints of common RNN and Transformer architectures.
no code implementations • IEEE 2020 • Shuang Qiu, Yao Zhao, Jianbo Jiao, Yunchao Wei, Shikui Wei
To this end, we propose to train the referring image segmentation model in a generative adversarial fashion, which well addresses the distribution similarity problem.
1 code implementation • CVPR 2020 • Mingjie Sun, Jimin Xiao, Eng Gee Lim, Bingfeng Zhang, Yao Zhao
Specifically, the reinforcement learning agent learns to decide whether to update the target template according to the quality of the predicted result.
no code implementations • 10 Feb 2020 • Fuzhen Li, Zhenfeng Zhu, Xingxing Zhang, Jian Cheng, Yao Zhao
In zero-shot learning (ZSL), the samples to be classified are usually projected into side information templates such as attributes.
2 code implementations • 12 Jan 2020 • Lijun Zhao, Huihui Bai, Anhong Wang, Yao Zhao
In this paper, we introduce a deep multiple description coding (MDC) framework optimized by minimizing multiple description (MD) compressive loss.
1 code implementation • 12 Jan 2020 • Lijun Zhao, Jinjing Zhang, Fan Zhang, Anhong Wang, Huihui Bai, Yao Zhao
Most deep image smoothing operators are trained repeatedly whenever different explicit structure-texture pairs are employed as label images for each algorithm configured with different parameters.
16 code implementations • ICML 2020 • Jingqing Zhang, Yao Zhao, Mohammad Saleh, Peter J. Liu
Recent work pre-training Transformers with self-supervised objectives on large text corpora has shown great success when fine-tuned on downstream NLP tasks including text summarization.
Ranked #1 on Abstractive Text Summarization on AESLC
no code implementations • 12 Dec 2019 • Zhenfeng Zhu, Yingying Meng, Deqiang Kong, Xingxing Zhang, Yandong Guo, Yao Zhao
Due to the deteriorated conditions of insufficient illumination and uneven lighting, nighttime images have lower contrast and higher noise than their daytime counterparts of the same scene, which seriously limits the performance of conventional background modeling methods.
1 code implementation • CVPR 2020 • Shuai Zheng, Zhenfeng Zhu, Xingxing Zhang, Zhizhe Liu, Jian Cheng, Yao Zhao
Graph representation learning aims to encode all nodes of a graph into low-dimensional vectors that will serve as input to many computer vision tasks.
1 code implementation • 2 Nov 2019 • Hui Li, Jimin Xiao, Ming-Jie Sun, Eng Gee Lim, Yao Zhao
To tackle this problem, we propose to iteratively guess pseudo labels for the unlabeled image samples, which are later used to update the re-identification model together with the labelled samples.
no code implementations • 24 Oct 2019 • Xingxing Zhang, Shupeng Gui, Zhenfeng Zhu, Yao Zhao, Ji Liu
In this paper, we make an initial attempt and propose a generic formulation that provides a systematic solution (named ATZSL) for learning a robust ZSL model.
no code implementations • 24 Oct 2019 • Xingxing Zhang, Shupeng Gui, Zhenfeng Zhu, Yao Zhao, Ji Liu
Specifically, HPL is able to obtain discriminability on both seen and unseen class domains by learning visual prototypes respectively under the transductive setting.
1 code implementation • 24 Oct 2019 • Xingxing Zhang, Zhenfeng Zhu, Yao Zhao
Given a set of hand-crafted local features, acquiring a global representation via aggregation is a promising technique to boost computational efficiency and improve task performance.
no code implementations • 22 Oct 2019 • Zhizhe Liu, Xingxing Zhang, Zhenfeng Zhu, Shuai Zheng, Yao Zhao, Jian Cheng
The key to ZSL is to transfer knowledge from the seen to the unseen classes via auxiliary class attribute vectors.
no code implementations • 16 Sep 2019 • Haojie Liu, Kang Liao, Chunyu Lin, Yao Zhao, Yulan Guo
In this paper, we propose a novel Pseudo-LiDAR interpolation network (PLIN) to increase the frequency of LiDAR sensors.
1 code implementation • 27 Aug 2019 • Dingyuan Zheng, Jimin Xiao, Kai-Zhu Huang, Yao Zhao
Person search aims to search for a target person among multiple images recorded by multiple surveillance cameras, which faces various challenges from both pedestrian detection and person re-identification.
1 code implementation • 9 Aug 2019 • Yakun Niu, Benedetta Tondi, Yao Zhao, Mauro Barni
Available model-based techniques for the estimation of the primary quantization matrix in double-compressed JPEG images work only under specific conditions regarding the relationship between the first and second compression quality factors, and the alignment of the first and second JPEG compression grids.
no code implementations • 11 Jul 2019 • Shuai Zheng, Zhenfeng Zhu, Jian Cheng, Yandong Guo, Yao Zhao
Non-uniform blur, mainly caused by camera shake and motions of multiple objects, is one of the most common causes of image quality degradation.
no code implementations • 4 Jun 2019 • Zhun Deng, Cynthia Dwork, Jialiang Wang, Yao Zhao
We provide a general framework for characterizing the trade-off between accuracy and robustness in supervised learning.
no code implementations • 9 Nov 2018 • Youru Li, Zhenfeng Zhu, Deqiang Kong, Hua Han, Yao Zhao
To address this issue, an evolutionary attention-based LSTM training with competitive random search is proposed for multivariate time series prediction.
no code implementations • 8 Nov 2018 • Yanchun Xie, Jimin Xiao, Kai-Zhu Huang, Jeyarajan Thiyagalingam, Yao Zhao
In this paper, we propose a novel approach to address the correlation filter update problem.
1 code implementation • 5 Nov 2018 • Lijun Zhao, Huihui Bai, Anhong Wang, Yao Zhao
Secondly, two entropy estimation networks are learned to estimate the informative amounts of the quantized tensors, which can further supervise the learning of multiple description encoder network to represent the input image delicately.
1 code implementation • EMNLP 2018 • Yao Zhao, Xiaochuan Ni, Yuanyuan Ding, Qifa Ke
Long text has posed challenges for sequence-to-sequence neural models in question generation: worse performance was reported when using the whole paragraph (with multiple sentences) as the input.
2 code implementations • 17 Sep 2018 • Tao Ruan, Ting Liu, Zilong Huang, Yunchao Wei, Shikui Wei, Yao Zhao, Thomas Huang
Human parsing has received considerable interest due to its wide application potential.
Ranked #2 on Person Re-Identification on Market-1501-C
1 code implementation • 22 Jun 2018 • Lijun Zhao, Huihui Bai, Anhong Wang, Yao Zhao
In order to train the RSN and IDN networks together in an end-to-end fashion, our VCN network imitates the projection from the re-sampled vectors to the IDN-decoded image.
1 code implementation • 31 Mar 2018 • Alexey Kurakin, Ian Goodfellow, Samy Bengio, Yinpeng Dong, Fangzhou Liao, Ming Liang, Tianyu Pang, Jun Zhu, Xiaolin Hu, Cihang Xie, Jian-Yu Wang, Zhishuai Zhang, Zhou Ren, Alan Yuille, Sangxia Huang, Yao Zhao, Yuzhe Zhao, Zhonglin Han, Junjiajia Long, Yerkebulan Berdibekov, Takuya Akiba, Seiya Tokui, Motoki Abe
To accelerate research on adversarial examples and robustness of machine learning classifiers, Google Brain organized a NIPS 2017 competition that encouraged researchers to develop new methods to generate adversarial examples as well as to develop new ways to defend against them.
no code implementations • 29 Mar 2018 • Wei Zhao, Pengpeng Yang, Rongrong Ni, Yao Zhao, Haorui Wu
Instead of improving it, in this paper, the safety of deep learning based methods in the field of image forensics is taken into account.
no code implementations • 20 Feb 2018 • Qi Chang, Gene Cheung, Yao Zhao, Xiaolong Li, Rongrong Ni
If sufficiently smooth, we pose a maximum a posteriori (MAP) problem using either a quadratic Laplacian regularizer or a graph total variation (GTV) term as signal prior.
no code implementations • 2 Feb 2018 • Lijun Zhao, Huihui Bai, Feng Li, Anhong Wang, Yao Zhao
Firstly, given one input image, feature description neural network (FDNN) is used to generate a new representation of this image, so that this image representation can be more efficiently compressed by standard codec, as compared to the input image.
no code implementations • 2 Feb 2018 • Zhipeng Chen, Benedetta Tondi, Xiaolong Li, Rongrong Ni, Yao Zhao, Mauro Barni
We address the problem of data-driven image manipulation detection in the presence of an attacker with limited knowledge about the detector.
Cryptography and Security
no code implementations • 20 Jan 2018 • Lijun Zhao, Huihui Bai, Anhong Wang, Yao Zhao
Thirdly, multiple description virtual codec network (MDVCN) is proposed to bridge the gap between MDGN network and MDRN network in order to train an end-to-end MDC framework.
1 code implementation • 16 Dec 2017 • Lijun Zhao, Huihui Bai, Anhong Wang, Yao Zhao
Due to the challenge of directly learning a non-linear function for a standard codec based on convolutional neural network, we propose to learn a virtual codec neural network to approximate the projection from the valid description image to the post-processed compressed image, so that the gradient could be efficiently back-propagated from the post-processing neural network to the feature description neural network during training.
no code implementations • 30 Aug 2017 • Lijun Zhao, Huihui Bai, Jie Liang, Bing Zeng, Anhong Wang, Yao Zhao
Firstly, given the low-resolution depth image and low-resolution color image, a generative network is proposed to leverage mutual information of color image and depth image to enhance each other in consideration of the geometry structural dependency of color-depth image in the same scene.
no code implementations • 9 Jul 2017 • Lijun Zhao, Jie Liang, Huihui Bai, Lili Meng, Anhong Wang, Yao Zhao
Both frameworks employ the division of gradient and the local activity measurement to achieve noise removal.
no code implementations • CVPR 2017 • Yunchao Wei, Jiashi Feng, Xiaodan Liang, Ming-Ming Cheng, Yao Zhao, Shuicheng Yan
We investigate a principled way to progressively mine discriminative object regions using classification networks to address weakly-supervised semantic segmentation problems.
no code implementations • 15 Mar 2017 • Pengpeng Yang, Wei Zhao, Rongrong Ni, Yao Zhao
In this paper, we propose a solution to identify the source camera of the small-size images: content-adaptive fusion network.
no code implementations • 10 Mar 2017 • Ruoyu Liu, Yao Zhao, Liang Zheng, Shikui Wei, Yi Yang
Additionally, a trivial solution, i.e., directly using the predicted class label for cross-media retrieval, is tested.
no code implementations • 25 Oct 2016 • Xiang Jiang, Shikui Wei, Ruizhen Zhao, Yao Zhao, Xindong Wu
The underlying assumption is that multiple accounts belonging to the same person contain the same or similar camera fingerprint information.
1 code implementation • 10 Sep 2015 • Yunchao Wei, Xiaodan Liang, Yunpeng Chen, Xiaohui Shen, Ming-Ming Cheng, Jiashi Feng, Yao Zhao, Shuicheng Yan
Then, a better network called Enhanced-DCNN is learned with supervision from the predicted segmentation masks of simple images based on the Initial-DCNN as well as the image-level annotations.
no code implementations • 2 Aug 2015 • Ruoyu Liu, Yao Zhao, Shikui Wei, Yi Yang
The convolutional neural network (CNN) features can give a good description of image content, which usually represent images with unique global vectors.
no code implementations • 22 Jun 2015 • Yunchao Wei, Yao Zhao, Zhenfeng Zhu, Shikui Wei, Yanhui Xiao, Jiashi Feng, Shuicheng Yan
Specifically, by jointly optimizing the correlation between images and text and the linear regression from one modal space (image or text) to the semantic space, two couples of mappings are learned to project images and text from their original feature spaces into two common latent subspaces (one for I2T and the other for T2I).
no code implementations • 22 Jun 2014 • Yunchao Wei, Wei Xia, Junshi Huang, Bingbing Ni, Jian Dong, Yao Zhao, Shuicheng Yan
Convolutional Neural Network (CNN) has demonstrated promising performance in single-label image classification tasks.
no code implementations • 9 Apr 2013 • Yanhui Xiao, Zhenfeng Zhu, Yao Zhao
However, ICA is not only sensitive to whitening but also has difficulty learning an over-complete basis.