1 code implementation • ECCV 2020 • Yanbo Fan, Baoyuan Wu, Tuanhui Li, Yong Zhang, Mingyang Li, Zhifeng Li, Yujiu Yang
Based on this factorization, we formulate the sparse attack problem as a mixed integer programming (MIP) to jointly optimize the binary selection factors and continuous perturbation magnitudes of all pixels, with a cardinality constraint on selection factors to explicitly control the degree of sparsity.
1 code implementation • 1 Jun 2023 • Ge Yuan, Xiaodong Cun, Yong Zhang, Maomao Li, Chenyang Qi, Xintao Wang, Ying Shan, Huicheng Zheng
Empowered by the proposed celeb basis, the new identity in our customized model showcases a better concept combination ability than previous personalization methods.
no code implementations • 1 Jun 2023 • Jinbo Xing, Menghan Xia, Yuxin Liu, Yuechen Zhang, Yong Zhang, Yingqing He, Hanyuan Liu, Haoxin Chen, Xiaodong Cun, Xintao Wang, Ying Shan, Tien-Tsin Wong
Our method, dubbed Make-Your-Video, involves joint-conditional video generation using a Latent Diffusion Model that is pre-trained for still image synthesis and then promoted for video generation with the introduction of temporal modules.
no code implementations • 1 Jun 2023 • Ruotong Wang, Hongrui Chen, Zihao Zhu, Li Liu, Yong Zhang, Yanbo Fan, Baoyuan Wu
These triggers have demonstrated strong attack performance even under backdoor defense, which aims to eliminate or suppress the backdoor effect in the model.
1 code implementation • 29 May 2023 • Yuan Gong, Youxin Pang, Xiaodong Cun, Menghan Xia, Yingqing He, Haoxin Chen, Longyue Wang, Yong Zhang, Xintao Wang, Ying Shan, Yujiu Yang
Accurate Story visualization requires several necessary elements, such as identity consistency across frames, the alignment between plain text and visual content, and a reasonable layout of objects in images.
no code implementations • 27 Apr 2023 • Zhiyuan Yan, Yong Zhang, Yanbo Fan, Baoyuan Wu
Deepfake detection remains a challenging task due to the difficulty of generalizing to new types of forgeries.
no code implementations • 18 Apr 2023 • Weihua Xu, Feifei Gao, Yong Zhang, Chengkang Pan, Guangyi Liu
Visual perception is an effective way to obtain the spatial characteristics of wireless channels and to reduce the overhead for communications system.
1 code implementation • CVPR 2023 • Liang Chen, Yong Zhang, Yibing Song, Ying Shan, Lingqiao Liu
Generally, a TTT strategy hinges its performance on two main factors: selecting an appropriate auxiliary TTT task for updating and identifying reliable parameters to update during the test phase.
1 code implementation • CVPR 2023 • Tingting Liao, Xiaomei Zhang, Yuliang Xiu, Hongwei Yi, Xudong Liu, Guo-Jun Qi, Yong Zhang, Xuan Wang, Xiangyu Zhu, Zhen Lei
This paper presents a framework for efficient 3D clothed avatar reconstruction.
no code implementations • 1 Apr 2023 • Xiaojun Jia, Yong Zhang, Xingxing Wei, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao
This initialization is generated by using high-quality adversarial perturbations from the historical training process.
1 code implementation • 20 Mar 2023 • Xinhang Li, Xiangyu Zhao, Jiaxing Xu, Yong Zhang, Chunxiao Xing
To this end, we propose a two-stage multimodal fusion framework to preserve modality-specific knowledge as well as take advantage of the complementarity between different modalities.
1 code implementation • 16 Mar 2023 • Chenyang Qi, Xiaodong Cun, Yong Zhang, Chenyang Lei, Xintao Wang, Ying Shan, Qifeng Chen
We also have a better zero-shot shape-aware editing ability based on the text-to-video model.
1 code implementation • 15 Mar 2023 • Weihuang Liu, Xiaodong Cun, Chi-Man Pun, Menghan Xia, Yong Zhang, Jue Wang
Thanks to the proposed structure, we only encode the high-resolution image in a relatively low resolution for larger reception field capturing.
1 code implementation • 14 Mar 2023 • Rindranirina Ramamonjison, Timothy T. Yu, Raymond Li, Haley Li, Giuseppe Carenini, Bissan Ghaddar, Shiqi He, Mahdi Mostajabdaveh, Amin Banitalebi-Dehkordi, Zirui Zhou, Yong Zhang
The Natural Language for Optimization (NL4Opt) Competition was created to investigate methods of extracting the meaning and formulation of an optimization problem based on its text description.
1 code implementation • CVPR 2023 • Youxin Pang, Yong Zhang, Weize Quan, Yanbo Fan, Xiaodong Cun, Ying Shan, Dong-Ming Yan
In this paper, we introduce a novel self-supervised disentanglement framework to decouple pose and expression without 3DMMs and paired data, which consists of a motion editing module, a pose generator, and an expression generator.
1 code implementation • 15 Jan 2023 • Jianrong Zhang, Yangsong Zhang, Xiaodong Cun, Shaoli Huang, Yong Zhang, Hongwei Zhao, Hongtao Lu, Xi Shen
Additionally, we conduct analyses on HumanML3D and observe that the dataset size is a limitation of our approach.
Ranked #1 on
Motion Synthesis
on HumanML3D
no code implementations • CVPR 2023 • Jianrong Zhang, Yangsong Zhang, Xiaodong Cun, Yong Zhang, Hongwei Zhao, Hongtao Lu, Xi Shen, Ying Shan
Additionally, we conduct analyses on HumanML3D and observe that the dataset size is a limitation of our approach.
1 code implementation • 1 Jan 2023 • Fei Yin, Yong Zhang, Baoyuan Wu, Yan Feng, Jingyi Zhang, Yanbo Fan, Yujiu Yang
In the scenario of black-box adversarial attack, the target model's parameters are unknown, and the attacker aims to find a successful adversarial perturbation based on query feedback under a query budget.
no code implementations • CVPR 2023 • Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen
Specifically, cheap scene graph supervision data can be easily obtained by parsing image language descriptions into semantic graphs.
no code implementations • 22 Dec 2022 • Gursimran Singh, Chendi Wang, Ahnaf Tazwar, Lanjun Wang, Yong Zhang
Data trading is essential to accelerate the development of data-driven machine learning pipelines.
no code implementations • CVPR 2023 • Fei Yin, Yong Zhang, Xuan Wang, Tengfei Wang, Xiaoyu Li, Yuan Gong, Yanbo Fan, Xiaodong Cun, Ying Shan, Cengiz Oztireli, Yujiu Yang
It is natural to associate 3D GANs with GAN inversion methods to project a real image into the generator's latent space, allowing free-view consistent synthesis and editing, referred as 3D GAN inversion.
no code implementations • CVPR 2023 • Yunpeng Bai, Yanbo Fan, Xuan Wang, Yong Zhang, Jingxiang Sun, Chun Yuan, Ying Shan
Compared with existing works, we obtain superior novel view synthesis results and faithfully face reenactment performance.
1 code implementation • 27 Nov 2022 • Kun Cheng, Xiaodong Cun, Yong Zhang, Menghan Xia, Fei Yin, Mingrui Zhu, Xuan Wang, Jue Wang, Nannan Wang
Our system disentangles this objective into three sequential tasks: (1) face video generation with a canonical expression; (2) audio-driven lip-sync; and (3) face enhancement for improving photo-realism.
no code implementations • CVPR 2023 • Zhian Liu, Maomao Li, Yong Zhang, Cairong Wang, Qi Zhang, Jue Wang, Yongwei Nie
We rethink face swapping from the perspective of fine-grained face editing, \textit{i. e., ``editing for swapping'' (E4S)}, and propose a framework that is based on the explicit disentanglement of the shape and texture of facial components.
1 code implementation • 23 Nov 2022 • Yingqing He, Tianyu Yang, Yong Zhang, Ying Shan, Qifeng Chen
Diffusion models have shown remarkable results recently but require significant computational resources.
Ranked #2 on
Video Generation
on Sky Time-lapse
1 code implementation • CVPR 2023 • Wenxuan Zhang, Xiaodong Cun, Xuan Wang, Yong Zhang, Xi Shen, Yu Guo, Ying Shan, Fei Wang
We present SadTalker, which generates 3D motion coefficients (head pose, expression) of the 3DMM from audio and implicitly modulates a novel 3D-aware face render for talking head generation.
1 code implementation • CVPR 2023 • Jingxiang Sun, Xuan Wang, Lizhen Wang, Xiaoyu Li, Yong Zhang, Hongwen Zhang, Yebin Liu
We propose a novel 3D GAN framework for unsupervised learning of generative, high-quality and 3D-consistent facial avatars from unstructured 2D images.
1 code implementation • 19 Nov 2022 • Nisha Huang, Yuxin Zhang, Fan Tang, Chongyang Ma, Haibin Huang, Yong Zhang, WeiMing Dong, Changsheng Xu
Despite the impressive results of arbitrary image-guided style transfer methods, text-driven image stylization has recently been proposed for transferring a natural image into the stylized one according to textual descriptions of the target style provided by the user.
2 code implementations • 12 Oct 2022 • Zeyu Qin, Yanbo Fan, Yi Liu, Li Shen, Yong Zhang, Jue Wang, Baoyuan Wu
Furthermore, RAP can be naturally combined with many existing black-box attack techniques, to further boost the transferability.
1 code implementation • COLING 2022 • ZiHao Wang, Jiaheng Dou, Yong Zhang
Measuring Sentence Textual Similarity (STS) is a classic task that can be applied to many downstream NLP applications such as text generation and retrieval.
1 code implementation • 30 Sep 2022 • Rindranirina Ramamonjison, Haley Li, Timothy T. Yu, Shiqi He, Vishnu Rengan, Amin Banitalebi-Dehkordi, Zirui Zhou, Yong Zhang
We describe an augmented intelligence system for simplifying and enhancing the modeling experience for operations research.
no code implementations • 8 Sep 2022 • Zeyu Liu, Yi Wang, Jing Wen, Yong Zhang, Hao Yin, Chao Guo, Zhongyu Wang
In addition, in order to improve the segmentation performance, we adopt multi-view and multi-window level method, at the same time we employ a fine-tune strategy to mitigate the impact of inconsistent labeling.
1 code implementation • 28 Aug 2022 • Mingdeng Cao, Zhihang Zhong, Yanbo Fan, Jiahao Wang, Yong Zhang, Jue Wang, Yujiu Yang, Yinqiang Zheng
We believe the novel realistic synthesis pipeline and the corresponding RAW video dataset can help the community to easily construct customized blur datasets to improve real-world video deblurring performance largely, instead of laboriously collecting real data pairs.
1 code implementation • 16 Aug 2022 • Zhenan Fan, Zirui Zhou, Jian Pei, Michael P. Friedlander, Jiajie Hu, Chengliang Li, Yong Zhang
Federated learning is an emerging technique for training models from decentralized data sets.
no code implementations • 15 Aug 2022 • Morgan Heisler, Amin Banitalebi-Dehkordi, Yong Zhang
Our method of semantically meaningful image augmentation for object detection via language grounding, SemAug, starts by calculating semantically appropriate new objects that can be placed into relevant locations in the image (the what and where problems).
1 code implementation • 3 Aug 2022 • Fabrizio Pedersoli, Dryden Wiebe, Amin Banitalebi, Yong Zhang, George Tzanetakis, Kwang Moo Yi
Therefore, audio-based methods can be useful even for applications in which only visual information is of interest Our framework is based on Manifold Learning and consists of two steps.
1 code implementation • 18 Jul 2022 • Xiaojun Jia, Yong Zhang, Xingxing Wei, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao
Based on the observation, we propose a prior-guided FGSM initialization method to avoid overfitting after investigating several initialization strategies, improving the quality of the AEs during the whole training process.
no code implementations • 12 Jul 2022 • Mohit Bajaj, Lingyang Chu, Vittorio Romaniello, Gursimran Singh, Jian Pei, Zirui Zhou, Lanjun Wang, Yong Zhang
The key idea is to find solid evidence in the form of a group of data instances discriminated most by the model.
no code implementations • 12 Jul 2022 • Bo Lin, Feifei Gao, Yong Zhang, Chengkang Pan, Guangyi Liu
In this paper, we proposed a multi-camera view based proactive BS selection and beam switching that can predict the optimal BS of the user in the future frame and switch the corresponding beam pair.
no code implementations • 14 Jun 2022 • Amin Banitalebi-Dehkordi, Pratik Gujjar, Yong Zhang
Critically, most recent work assume that such unlabeled data is drawn from the same distribution as the labeled data.
no code implementations • 14 Jun 2022 • Tianyu Zhang, Amin Banitalebi-Dehkordi, Yong Zhang
We propose a new approach for solving the data labeling and inference latency issues in combinatorial optimization based on the use of the reinforcement learning (RL) paradigm.
1 code implementation • CVPR 2022 • Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, Chang-Wen Chen
Such design decomposes the process of HOI set prediction into two subsequent phases, i. e., an interaction proposal generation is first performed, and then followed by transforming the non-parametric interaction proposals into HOI predictions via a structure-aware Transformer.
Ranked #2 on
Human-Object Interaction Detection
on V-COCO
no code implementations • 7 Jun 2022 • Mehdi Seyfi, Amin Banitalebi-Dehkordi, Yong Zhang
Contrastive self-supervised representation learning methods maximize the similarity between the positive pairs, and at the same time tend to minimize the similarity between the negative pairs.
no code implementations • 7 Jun 2022 • Mehdi Seyfi, Amin Banitalebi-Dehkordi, Yong Zhang
Unsupervised representation learning methods like SwAV are proved to be effective in learning visual semantics of a target dataset.
no code implementations • 6 Jun 2022 • Zhichao Huang, Yanbo Fan, Chen Liu, Weizhong Zhang, Yong Zhang, Mathieu Salzmann, Sabine Süsstrunk, Jue Wang
While adversarial training and its variants have shown to be the most effective algorithms to defend against adversarial attacks, their extremely slow training process makes it hard to scale to large datasets like ImageNet.
no code implementations • 18 May 2022 • Jiyuan Cao, Zhilei Liu, Yong Zhang
Ablation study and visualization show that our MARL can eliminate identity-caused differences, thus obtaining a robust and generalized AU discriminative embedding representation.
1 code implementation • 17 Apr 2022 • Mingdeng Cao, Yanbo Fan, Yong Zhang, Jue Wang, Yujiu Yang
For multi-frame temporal modeling, we adapt Transformer to fuse multiple spatial features efficiently.
no code implementations • 29 Mar 2022 • Xubo Lyu, Amin Banitalebi-Dehkordi, Mo Chen, Yong Zhang
Multi-agent policy gradient methods have demonstrated success in games and robotics but are often limited to problems with low-level action space.
Hierarchical Reinforcement Learning
Multi-agent Reinforcement Learning
+3
1 code implementation • CVPR 2022 • Liang Chen, Yong Zhang, Yibing Song, Lingqiao Liu, Jue Wang
Following this principle, we propose to enrich the "diversity" of forgeries by synthesizing augmented forgeries with a pool of forgery configurations and strengthen the "sensitivity" to the forgeries by enforcing the model to predict the forgery configurations.
1 code implementation • CVPR 2022 • Xiaojun Jia, Yong Zhang, Baoyuan Wu, Ke Ma, Jue Wang, Xiaochun Cao
In this paper, we propose a novel framework for adversarial training by introducing the concept of "learnable attack strategy", dubbed LAS-AT, which learns to automatically produce attack strategies to improve the model robustness.
no code implementations • 10 Mar 2022 • Saeed Ranjbar Alvar, Lanjun Wang, Jian Pei, Yong Zhang
Image-to-image translation models are shown to be vulnerable to the Membership Inference Attack (MIA), in which the adversary's goal is to identify whether a sample is used to train the model or not.
1 code implementation • 8 Mar 2022 • Fei Yin, Yong Zhang, Xiaodong Cun, Mingdeng Cao, Yanbo Fan, Xuan Wang, Qingyan Bai, Baoyuan Wu, Jue Wang, Yujiu Yang
Our framework elevates the resolution of the synthesized talking face to 1024*1024 for the first time, even though the training dataset has a lower resolution.
no code implementations • ACL 2022 • Mohammad Akbari, Amin Banitalebi-Dehkordi, Yong Zhang
As such, it can be applied to black-box pre-trained models without a need for architectural manipulations, reassembling of modules, or re-training.
no code implementations • 28 Feb 2022 • Sihan Feng, Yong Zhang, Fuming Wang, Hong Zhao
We consider weights in pathways that link neurons longitudinally from input neurons to output neurons, or simply weight pathways, as the basic units for understanding a neural network, and decompose a neural network into a series of subnetworks of such weight pathways.
no code implementations • 24 Feb 2022 • Yong Zhang, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao
In this paper, we propose a novel method by directly extracting the coreference and omission relationship from the self-attention weight matrix of the transformer instead of word embeddings and edit the original text accordingly to generate the complete utterance.
no code implementations • 17 Feb 2022 • Weiming Hu, Chen Li, Xiaoyan Li, Md Mamunur Rahaman, Yong Zhang, HaoYuan Chen, Wanli Liu, YuDong Yao, Hongzan Sun, Ning Xu, Xinyu Huang, Marcin Grzegorze
Traditional machine learning methods achieve maximum accuracy of 76. 02% and deep learning method achieves a maximum accuracy of 95. 37%.
no code implementations • 25 Jan 2022 • Xinhang Li, Yong Zhang, Chunxiao Xing
We adopt GCN-based models to learn the representation of entities by considering the graph structure and incorporating the relation semantic information into GCN via knowledge distillation.
no code implementations • 7 Jan 2022 • Zhenan Fan, Huang Fang, Zirui Zhou, Jian Pei, Michael P. Friedlander, Yong Zhang
We show that VerFedSV not only satisfies many desirable properties for fairness but is also efficient to compute, and can be adapted to both synchronous and asynchronous vertical federated learning algorithms.
no code implementations • 22 Dec 2021 • Amin Banitalebi-Dehkordi, Yong Zhang
The 2021 NeurIPS Machine Learning for Combinatorial Optimization (ML4CO) competition was designed with the goal of improving state-of-the-art combinatorial optimization solvers by replacing key heuristic components with machine learning models.
no code implementations • 15 Dec 2021 • Gursimran Singh, Lingyang Chu, Lanjun Wang, Jian Pei, Qi Tian, Yong Zhang
In the real world, the frequency of occurrence of objects is naturally skewed forming long-tail class distributions, which results in poor performance on the statistically rare classes.
no code implementations • 10 Dec 2021 • Sen Zhao, Yong Zhang, Shang Wang, Beitong Zhou, Cheng Cheng
Data-driven methods for remaining useful life (RUL) prediction normally learn features from a fixed window size of a priori of degradation, which may lead to less accurate prediction results on different datasets because of the variance of local features.
1 code implementation • CVPR 2022 • Jingxiang Sun, Xuan Wang, Yong Zhang, Xiaoyu Li, Qi Zhang, Yebin Liu, Jue Wang
2D GANs can generate high fidelity portraits but with low view consistency.
no code implementations • 1 Nov 2021 • Xing Wang, Juan Zhao, Lin Zhu, Xu Zhou, Zhao Li, Junlan Feng, Chao Deng, Yong Zhang
AMF-STGCN extends GCN by (1) jointly modeling the complex spatial-temporal dependencies in mobile networks, (2) applying attention mechanisms to capture various Receptive Fields of heterogeneous base stations, and (3) introducing an extra decoder based on a fully connected deep network to conquer the error propagation challenge with multi-step forecasting.
1 code implementation • 20 Oct 2021 • Amin Banitalebi-Dehkordi, Xinyu Kang, Yong Zhang
As an attempt to mitigate this dilemma, this paper investigates the idea of combining multiple trained neural networks using unlabeled data.
1 code implementation • 20 Oct 2021 • Mohammad Akbari, Amin Banitalebi-Dehkordi, Yong Zhang
To this end, we propose an Energy-Based Joint Reasoning (EBJR) framework that adaptively distributes the samples between shallow and deep models to achieve an accuracy close to the deep model, but latency close to the shallow one.
1 code implementation • 20 Oct 2021 • Amin Banitalebi-Dehkordi, Yong Zhang
Through an extensive set of experiments, we demonstrate the usefulness of the repainted examples in training, for the tasks of image classification (ImageNet) and object detection (COCO), over several state-of-the-art network architectures at different capacities, and across different data availability regimes.
no code implementations • 11 Oct 2021 • Xiaojun Jia, Yong Zhang, Baoyuan Wu, Jue Wang, Xiaochun Cao
Adversarial training (AT) has been demonstrated to be effective in improving model robustness by leveraging adversarial examples for training.
no code implementations • 20 Sep 2021 • Xin Zheng, Yanbo Fan, Baoyuan Wu, Yong Zhang, Jue Wang, Shirui Pan
Face recognition has been greatly facilitated by the development of deep neural networks (DNNs) and has been widely applied to many safety-critical applications.
no code implementations • 19 Sep 2021 • Zhenan Fan, Huang Fang, Zirui Zhou, Jian Pei, Michael P. Friedlander, Changxin Liu, Yong Zhang
The success of federated learning depends largely on the participation of data owners.
1 code implementation • 17 Sep 2021 • Changxin Liu, Zhenan Fan, Zirui Zhou, Yang Shi, Jian Pei, Lingyang Chu, Yong Zhang
To solve it in a federated and privacy-preserving manner, we consider the equivalent dual form of the problem and develop an asynchronous gradient coordinate-descent ascent algorithm, where some active data parties perform multiple parallelized local updates per communication round to effectively reduce the number of communication rounds.
1 code implementation • CVPR 2022 • Tengfei Wang, Yong Zhang, Yanbo Fan, Jue Wang, Qifeng Chen
With a low bit-rate latent code, previous works have difficulties in preserving high-fidelity details in reconstructed and edited images.
no code implementations • 13 Sep 2021 • Lingyang Chu, Lanjun Wang, Yanjie Dong, Jian Pei, Zirui Zhou, Yong Zhang
In this paper, we first propose a federated estimation method to accurately estimate the fairness of a model without infringing the data privacy of any party.
1 code implementation • EMNLP 2021 • Li Zhou, Kevin Small, Yong Zhang, Sandeep Atluri
Motivated by suggested question generation in conversational news recommendation systems, we propose a model for generating question-answer pairs (QA pairs) with self-contained, summary-centric questions and length-constrained, article-summarizing answers.
no code implementations • 8 Sep 2021 • Liang Hu, Jiangcheng Zhu, Zirui Zhou, Ruiqing Cheng, Xiaolong Bai, Yong Zhang
Cloud training platforms, such as Amazon Web Services and Huawei Cloud provide users with computational resources to train their deep learning jobs.
1 code implementation • 30 Aug 2021 • Amin Banitalebi-Dehkordi, Naveen Vedula, Jian Pei, Fei Xia, Lanjun Wang, Yong Zhang
At the same time, large amounts of input data are collected at the edge of cloud.
1 code implementation • ICCV 2021 • Shulan Ruan, Yong Zhang, Kun Zhang, Yanbo Fan, Fan Tang, Qi Liu, Enhong Chen
Text-to-image synthesis refers to generating an image from a given text description, the key goal of which lies in photo realism and semantic consistency.
no code implementations • 18 Aug 2021 • Zicun Cong, Xuan Luo, Pei Jian, Feida Zhu, Yong Zhang
We also investigate pricing in the step of collaborative training of machine learning models, and overview pricing machine learning models for end users in the step of machine learning deployment.
no code implementations • ICCV 2021 • Peter Cho-Ho Lam, Lingyang Chu, Maxim Torgonskiy, Jian Pei, Yong Zhang, Lanjun Wang
Interpreting the decision logic behind effective deep convolutional neural networks (CNN) on images complements the success of deep learning models.
2 code implementations • 1 Aug 2021 • Xiaojun Jia, Huanqian Yan, Yonglin Wu, Xingxing Wei, Xiaochun Cao, Yong Zhang
Moreover, we have applied the proposed methods to competition ACM MM2021 Robust Logo Detection that is organized by Alibaba on the Tianchi platform and won top 2 in 36489 teams.
2 code implementations • ICCV 2021 • Rindra Ramamonjison, Amin Banitalebi-Dehkordi, Xinyu Kang, Xiaolong Bai, Yong Zhang
This paper presents a Simple and effective unsupervised adaptation method for Robust Object Detection (SimROD).
no code implementations • NeurIPS 2021 • Mohit Bajaj, Lingyang Chu, Zi Yu Xue, Jian Pei, Lanjun Wang, Peter Cho-Ho Lam, Yong Zhang
Massive deployment of Graph Neural Networks (GNNs) in high-stake applications generates a strong demand for explanations that are robust to noise and align well with human intuition.
1 code implementation • 17 Jun 2021 • Yuanen Zhou, Yong Zhang, Zhenzhen Hu, Meng Wang
To tackle this issue, non-autoregressive image captioning models have recently been proposed to significantly accelerate the speed of inference by generating all words in parallel.
1 code implementation • 4 Jun 2021 • Weiming Hu, Chen Li, Xiaoyan Li, Md Mamunur Rahaman, Jiquan Ma, Yong Zhang, HaoYuan Chen, Wanli Liu, Changhao Sun, YuDong Yao, Hongzan Sun, Marcin Grzegorzek
In order to prove that the methods of different periods in the field of image classification have discrepancies on GasHisSDB, we select a variety of classifiers for evaluation.
no code implementations • IEEE Transactions on Intelligent Transportation Systems 2021 • Jingcheng Wang, Yong Zhang, Yun Wei, Yongli Hu, Xinglin Piao, BaoCai Yin
Metro passenger flow prediction is a strategically necessary demand in an intelligent transportation system to alleviate traffic pressure, coordinate operation schedules, and plan future constructions.
no code implementations • CVPR 2021 • Yuchen Luo, Yong Zhang, Junchi Yan, Wei Liu
The second is the residual-guided spatial attention module that guides the low-level RGB feature extractor to concentrate more on forgery traces from a new perspective.
1 code implementation • CVPR 2021 • Gengcong Yang, Jingyi Zhang, Yong Zhang, Baoyuan Wu, Yujiu Yang
The ambiguity naturally leads to the issue of \emph{implicit multi-label}, motivating the need for diverse predictions.
no code implementations • 21 Feb 2021 • Yixin Li, Xinran Wu, Chen Li, Changhao Sun, Md Rahaman, HaoYuan Chen, YuDong Yao, Xiaoyan Li, Yong Zhang, Tao Jiang
The HCRF-AM model consists of an Attention Mechanism (AM) module and an Image Classification (IC) module.
no code implementations • 21 Feb 2021 • Chen Li, Xintong Li, Md Rahaman, Xiaoyan Li, Hongzan Sun, Hong Zhang, Yong Zhang, Xiaoqi Li, Jian Wu, YuDong Yao, Marcin Grzegorzek
This paper reviews the methods of WSI analysis based on machine learning.
2 code implementations • ICLR 2021 • Jiawang Bai, Baoyuan Wu, Yong Zhang, Yiming Li, Zhifeng Li, Shu-Tao Xia
By utilizing the latest technique in integer programming, we equivalently reformulate this BIP problem as a continuous optimization problem, which can be effectively and efficiently solved using the alternating direction method of multipliers (ADMM) method.
no code implementations • 1 Feb 2021 • Yong Zhang, Mao Ye, Lin Guan
The original contributions of this paper are summarized as follows: (1) Model the packets collision probability of broadcast or NACK transmission in VANET with the combination theory and investigate the potential influence of miss my packets (MMP) problem.
Networking and Internet Architecture
1 code implementation • 18 Jan 2021 • Guangyu Huo, Yong Zhang, Junbin Gao, Boyue Wang, Yongli Hu, BaoCai Yin
In this paper, we propose a cross-attention based deep clustering framework, named Cross-Attention Fusion based Enhanced Graph Convolutional Network (CaEGCN), which contains four main modules: the cross-attention fusion module which innovatively concatenates the Content Auto-encoder module (CAE) relating to the individual data and Graph Convolutional Auto-encoder module (GAE) relating to the relationship between the data in a layer-by-layer manner, and the self-supervised model that highlights the discriminative information for clustering tasks.
no code implementations • 1 Jan 2021 • ZiHao Wang, Xu Zhao, Tam Le, Hao Wu, Yong Zhang, Makoto Yamada
In this work, we consider OT over tree metrics, which is more general than the sliced Wasserstein and includes the sliced Wasserstein as a special case, and we propose a fast minimization algorithm in $O(n)$ for the optimal Wasserstein-1 transport plan between two distributions in the tree structure.
no code implementations • ICCV 2021 • Weiwei Feng, Baoyuan Wu, Tianzhu Zhang, Yong Zhang, Yongdong Zhang
To tackle these issues, we propose a class-agnostic and model-agnostic physical adversarial attack model (Meta-Attack), which is able to not only generate robust physical adversarial examples by simulating color and shape distortions, but also generalize to attacking novel images and novel DNN models by accessing a few digital and physical images.
no code implementations • 29 Nov 2020 • Haotian Xie, Yong Zhang, Jun Wang, Jingjing Zhang, Yifan Ma, Zhaogang Yang
The Gleason grading system using histological images is the most powerful diagnostic and prognostic predictor of prostate cancer.
no code implementations • 9 Nov 2020 • Jingyi Zhang, Yong Zhang, Baoyuan Wu, Yanbo Fan, Fumin Shen, Heng Tao Shen
We propose to incorporate the prior about the co-occurrence of relation pairs into the graph to further help alleviate the class imbalance issue.
no code implementations • 17 Oct 2020 • Hanzi Huang, Yetian Huang, Yu He, Haoshuo Chen, Yong Zhang, Qianwu Zhang, Nicolas K. Fontaine, Roland Ryf, Yingxiong Song, Yikai Su
We experimentally demonstrate a record net capacity per wavelength of 1. 23~Tb/s over a single silicon-on-insulator (SOI) multimode waveguide for optical interconnects employing on-chip mode-division multiplexing and 11$\times$11 multiple-in-multiple-out (MIMO) digital signal processing.
1 code implementation • EMNLP 2020 • Xu Zhao, ZiHao Wang, Hao Wu, Yong Zhang
In this paper, we propose a new semi-supervised BLI framework to encourage the interaction between the supervised signal and unsupervised alignment.
no code implementations • ACL 2020 • Xu Zhao, ZiHao Wang, Hao Wu, Yong Zhang
Recently unsupervised Bilingual Lexicon Induction (BLI) without any parallel corpus has attracted much research interest.
no code implementations • 16 Sep 2020 • Yicheng Xu, Vincent Chau, Chenchen Wu, Yong Zhang, Vassilis Zissimopoulos, Yifei Zou
Clustering is one of the most fundamental tools in the artificial intelligence area, particularly in the pattern recognition and learning theory.
1 code implementation • 7 Jul 2020 • Yutao Huang, Lingyang Chu, Zirui Zhou, Lanjun Wang, Jiangchuan Liu, Jian Pei, Yong Zhang
Non-IID data present a tough challenge for federated learning.
no code implementations • 12 May 2020 • Chengcheng Ma, Baoyuan Wu, Shibiao Xu, Yanbo Fan, Yong Zhang, Xiaopeng Zhang, Zhifeng Li
In this work, we study the detection of adversarial examples, based on the assumption that the output and internal responses of one DNN model for both adversarial and benign examples follow the generalized Gaussian distribution (GGD), but with different parameters (i. e., shape factor, mean, and variance).
no code implementations • 26 Feb 2020 • Yong Zhang, Le Li, Zhilei Liu, Baoyuan Wu, Yanbo Fan, Zhifeng Li
Most of the existing methods train models for one-versus-one kin relation, which only consider one parent face and one child face by directly using an auto-encoder without any explicit control over the resemblance of the synthesized face to the parent face.
no code implementations • 28 Jan 2020 • Zihao Wang, Yong Zhang, Hao Wu
Moreover, we further develop Recursive Optimal Similarity (ROTS) for sentences with the valuable semantic insights from the connections between cosine similarity of weighted average of word vectors and optimal transport.
no code implementations • IEEE Access ( Volume: 8 ) 2020 • Yanbo Fan, Shuchen Weng, Yong Zhang, Boxin Shi, Yi Zhang
To facilitate end-to-end training, we further develop a scenario context information extraction branch to extract context information from raw RGB video directly.
Ranked #66 on
Skeleton Based Action Recognition
on NTU RGB+D
no code implementations • 1 Jan 2020 • Jing Zhang, Yong Zhang, Suhua Zhan, Cheng Cheng
Multiple physiological signals fusing models, building the uniform classification model by means of consistent and complementary information from different emotions to improve recognition performance.
no code implementations • 12 Nov 2019 • Li Ning, Yong Zhang
Based on the realtime updated local action cells, we propose the LAC-Nav approach to navigate the agent with the properly selected velocity; and furthermore, we coupled the local action cell with an adaptive learning framework, in which the effect of selections are evaluated and used as the references for making decisions in the following updates.
no code implementations • ICCV 2019 • Yong Zhang, Haiyong Jiang, Baoyuan Wu, Yanbo Fan, Qiang Ji
The latter enables the model to be trained on a partially annotated database.
no code implementations • 4 Jun 2019 • Jinzhi Lin, Shengzhong Feng, Zhile Yang, Yun Zhang, Yong Zhang
Furthermore, by manipulating the mapping vectors, an autoencoder is able to generalize SCMA, thus a dense code multiple access (DCMA) scheme is proposed.
no code implementations • CVPR 2019 • Yong Zhang, Baoyuan Wu, Weiming Dong, Zhifeng Li, Wei Liu, Bao-Gang Hu, Qiang Ji
Accurate AU intensity estimation depends on three major elements: image representation, intensity estimator, and supervisory information.
no code implementations • 15 May 2019 • Jiacheng Wu, Yong Zhang, Jin Wang, Chunbin Lin, Yingjia Fu, Chunxiao Xing
To address the limitation, we propose SP-Join, an end-to-end framework to support distributed similarity join in metric space based on the MapReduce paradigm, which (i) employs an estimation-based stratified sampling method to produce pivots with quality guarantees for any sample size, and (ii) devises an effective cost model as the guideline to split the whole datasets into partition in map and reduce phases according to the sampled pivots.
Databases
1 code implementation • CVPR 2019 • Yan Xu, Baoyuan Wu, Fumin Shen, Yanbo Fan, Yong Zhang, Heng Tao Shen, Wei Liu
Due to the sequential dependencies among words in a caption, we formulate the generation of adversarial noises for targeted partial captions as a structured output learning problem with latent variables.
no code implementations • 23 Apr 2019 • Zihao Wang, Datong Zhou, Yong Zhang, Hao Wu, Chenglong Bao
As a fundamental problem of natural language processing, it is important to measure the distance between different documents.
1 code implementation • 7 Jan 2019 • Baoyuan Wu, Weidong Chen, Yanbo Fan, Yong Zhang, Jinlong Hou, Jie Liu, Tong Zhang
In this work, we propose to train CNNs from images annotated with multiple tags, to enhance the quality of visual representation of the trained CNN model.
1 code implementation • 19 Dec 2018 • Zheng Chen, Yong Zhang, Yue Shang, Xiaohua Hu
TSPRA combines topics (i. e. product aspects), word sentiment and user preference as regression factors, and is able to perform topic clustering, review rating prediction, sentiment analysis and what we invent as "critical aspect" analysis altogether in one framework.
no code implementations • 17 Dec 2018 • Yassine Benajiba, Jin Sun, Yong Zhang, Longquan Jiang, Zhiliang Weng, Or Biran
Semantic Pattern Similarity is an interesting, though not often encountered NLP task where two sentences are compared not by their specific meaning, but by their more abstract semantic pattern (e. g., preposition or frame).
1 code implementation • 8 Dec 2018 • Cheng Cheng, Guijun Ma, Yong Zhang, Mingyang Sun, Fei Teng, Han Ding, Ye Yuan
In industrial applications, nearly half the failures of motors are caused by the degradation of rolling element bearings (REBs).
no code implementations • 20 Sep 2018 • Yong Zhang, Yu Zhang, Zhao Zhang, Jie Bao, Yunpeng Song
Traditional human activity recognition (HAR) based on time series adopts sliding window analysis method.
no code implementations • CVPR 2018 • Yong Zhang, Wei-Ming Dong, Bao-Gang Hu, Qiang Ji
Facial action unit (AU) intensity estimation plays an important role in affective computing and human-computer interaction.
no code implementations • CVPR 2018 • Yong Zhang, Rui Zhao, Wei-Ming Dong, Bao-Gang Hu, Qiang Ji
The majority of methods directly apply supervised learning techniques to AU intensity estimation while few methods exploit unlabeled samples to improve the performance.
no code implementations • CVPR 2018 • Yong Zhang, Wei-Ming Dong, Bao-Gang Hu, Qiang Ji
To alleviate this issue, we propose a knowledge-driven method for jointly learning multiple AU classifiers without any AU annotation by leveraging prior probabilities on AUs, including expression-independent and expression-dependent AU probabilities.
no code implementations • IJCNLP 2017 • Yassine Benajiba, Jin Sun, Yong Zhang, Zhiliang Weng, Or Biran
This paper introduces Mainiway AI Labs submitted system for the IJCNLP 2017 shared task on Dimensional Sentiment Analysis of Chinese Phrases (DSAP), and related experiments.
no code implementations • WS 2017 • Yassine Benajiba, Or Biran, Zhiliang Weng, Yong Zhang, Jin Sun
Sub-character components of Chinese characters carry important semantic information, and recent studies have shown that utilizing this information can improve performance on core semantic tasks.
no code implementations • 13 Nov 2017 • Yong Zhang, Hongming Zhou, Nganmeng Tan, Saeed Bagheri, Meng Joo Er
Audience interest, demography, purchase behavior and other possible classifications are ex- tremely important factors to be carefully studied in a targeting campaign.
no code implementations • 5 Jan 2014 • Zhaosong Lu, Yong Zhang
In particular, we first introduce a class of first-order stationary points for them, and show that the first-order stationary points introduced in [11] for an SPQN regularized $vector$ minimization problem are equivalent to those of an SPQN regularized $matrix$ minimization reformulation.
no code implementations • NeurIPS 2011 • Yong Zhang, Zhaosong Lu
In this paper we consider general rank minimization problems with rank appearing in either objective function or constraint.