no code implementations • 3 Jun 2023 • Qingyang Zhang, Haitao Wu, Changqing Zhang, QinGhua Hu, Huazhu Fu, Joey Tianyi Zhou, Xi Peng
The inherent challenge of multimodal fusion is to precisely capture the cross-modal correlation and flexibly conduct cross-modal interaction.
1 code implementation • 2 Jun 2023 • Jiawei Wu, Changqing Zhang, Zuoyong Li, Huazhu Fu, Xi Peng, Joey Tianyi Zhou
Cutting out an object and estimating its opacity mask, known as image matting, is a key task in image and video editing.
no code implementations • 22 May 2023 • Pengxin Zeng, Mouxing Yang, Yiding Lu, Changqing Zhang, Peng Hu, Xi Peng
To address this problem, we present a novel framework called SeMantic Invariance LEarning (SMILE) for multi-view clustering with incomplete information that does not require any paired samples.
no code implementations • 21 Apr 2023 • Jie Chen, Hua Mao, Wai Lok Woo, Xi Peng
Then, a cluster-level CVCL strategy is presented to explore consistent semantic label information among the multiple views in the fine-tuning stage.
1 code implementation • CVPR 2023 • Tang Li, Fengchun Qiao, Mengmeng Ma, Xi Peng
How to develop robust explanations against out-of-distribution data?
no code implementations • 27 Feb 2023 • Buyu Liu, BaoJun, Jianping Fan, Xi Peng, Kui Ren, Jun Yu
More desired attacks, to this end, should be able to fool defenses with such consistency checks.
no code implementations • 26 Jan 2023 • Haobin Li, Yunfan Li, Mouxing Yang, Peng Hu, Dezhong Peng, Xi Peng
Thanks to our dual-stream model, both cluster- and view-specific information could be captured, and thus the instance commonality and view versatility could be preserved to facilitate IMvC.
no code implementations • CVPR 2023 • Yuanbiao Gou, Peng Hu, Jiancheng Lv, Hongyuan Zhu, Xi Peng
Existing studies have empirically observed that the resolution of the low-frequency region is easier to enhance than that of the high-frequency one.
1 code implementation • CVPR 2023 • Yanglin Feng, Hongyuan Zhu, Dezhong Peng, Xi Peng, Peng Hu
Recently, with the advent of Metaverse and AI Generated Content, cross-modal retrieval becomes popular with a burst of 2D and 3D data.
no code implementations • CVPR 2023 • Haiyu Zhao, Yuanbiao Gou, Boyun Li, Dezhong Peng, Jiancheng Lv, Xi Peng
Vision Transformers have shown promising performance in image restoration, which usually conduct window- or channel-based attention to avoid intensive computations.
no code implementations • 8 Dec 2022 • Wenxin Wang, Boyun Li, Yuanbiao Gou, Peng Hu, Xi Peng
In this paper, we study two challenging but less-touched problems in image restoration, namely, i) how to quantify the relationship between different image degradations and ii) how to improve the performance on a specific degradation using the quantified relationship.
no code implementations • 8 Dec 2022 • Yijie Lin, Mouxing Yang, Jun Yu, Peng Hu, Changqing Zhang, Xi Peng
In this paper, we study a novel and widely existing problem in graph matching (GM), namely, Bi-level Noisy Correspondence (BNC), which refers to node-level noisy correspondence (NNC) and edge-level noisy correspondence (ENC).
Ranked #1 on
Graph Matching
on Willow Object Class
2 code implementations • 21 Oct 2022 • Yunfan Li, Mouxing Yang, Dezhong Peng, Taihao Li, Jiantao Huang, Xi Peng
Specifically, we find that when the data is projected into a feature space with a dimensionality of the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representation, respectively.
Ranked #1 on
Short Text Clustering
on Biomedical
no code implementations • CVPR 2023 • Pengxin Zeng, Yunfan Li, Peng Hu, Dezhong Peng, Jiancheng Lv, Xi Peng
Fair clustering aims to divide data into distinct clusters while preventing sensitive attributes (\textit{e. g.}, gender, race, RNA sequencing technique) from dominating the clustering.
no code implementations • 23 Sep 2022 • Liang Jiang, Zhenyu Huang, Jia Liu, Zujie Wen, Xi Peng
Such a process will inevitably introduce mismatched pairs (i. e., noisy correspondence) due to i) the unavailable QA pairs in target documents, and ii) the domain shift during applying the QA construction model to the target domain.
1 code implementation • 16 Aug 2022 • Zhenwei Tang, Tilman Hinnerichs, Xi Peng, Xiangliang Zhang, Robert Hoehndorf
Neural networks using distributed representations can benefit from computing semantic entailments because they enable finding contradictions, implied knowledge, or computing plans on how to achieve distant goals.
no code implementations • 29 May 2022 • Zhenwei Tang, Shichao Pei, Xi Peng, Fuzhen Zhuang, Xiangliang Zhang, Robert Hoehndorf
Neural logical reasoning (NLR) is a fundamental task to explore such knowledge bases, which aims at answering multi-hop queries with logical operations based on distributed representations of queries and answers.
1 code implementation • 23 May 2022 • Peng Hu, Xi Peng, Hongyuan Zhu, Mohamed M. Sabry Aly, Jie Lin
Numerous network compression methods such as pruning and quantization are proposed to reduce the model size significantly, of which the key is to find suitable compression allocation (e. g., pruning sparsity and quantization codebook) of each layer.
no code implementations • CVPR 2022 • Mengmeng Ma, Jian Ren, Long Zhao, Davide Testuggine, Xi Peng
Based on these findings, we propose a principle method to improve the robustness of Transformer models by automatically searching for an optimal fusion strategy regarding input data.
1 code implementation • 8 Mar 2022 • Yuanbiao Gou, Peng Hu, Jiancheng Lv, Joey Tianyi Zhou, Xi Peng
AFuB devotes to adaptively sampling and transferring the features from one scale to another scale, which fuses the multi-scale features with varying characteristics from coarse to fine.
no code implementations • 28 Feb 2022 • Xi Peng, Zhenwei Tang, Maxat Kulmanov, Kexin Niu, Robert Hoehndorf
Since the intersection of boxes remains as a box, the intersectional closure is satisfied.
1 code implementation • CVPR 2022 • Mouxing Yang, Zhenyu Huang, Peng Hu, Taihao Li, Jiancheng Lv, Xi Peng
To solve the TNL problem, we propose a novel method for robust VI-ReID, termed DuAlly Robust Training (DART).
1 code implementation • CVPR 2022 • Boyun Li, Xiao Liu, Peng Hu, Zhongqin Wu, Jiancheng Lv, Xi Peng
In this paper, we study a challenging problem in image restoration, namely, how to develop an all-in-one method that could recover images from a variety of unknown corruption types and levels.
no code implementations • 17 Dec 2021 • Tang Li, Jing Gao, Xi Peng
Here we explore the capacity of deep spatial learning for the predictive modeling of urbanization.
1 code implementation • NeurIPS 2021 • Zhenyu Huang, guocheng niu, Xiao Liu, Wenbiao Ding, Xinyan Xiao, Hua Wu, Xi Peng
Based on this observation, we reveal and study a latent and challenging direction in cross-modal matching, named noisy correspondence, which could be regarded as a new paradigm of noisy labels.
no code implementations • 29 Sep 2021 • Fengchun Qiao, Xi Peng
The key idea is to estimate the density ratio between the distributions of the two modalities, and use it to calibrate the similarity measurement in the embedding space.
no code implementations • 16 Aug 2021 • Sachin Gavali, Chuming Chen, Julie Cowart, Xi Peng, Shanshan Ding, Cathy Wu, Tammy Anderson
Furthermore, we discovered that, as the epidemic has shifted from legal (i. e., prescription opioids) to illegal (e. g., heroin and fentanyl) drugs in recent years, the correlation of environment, crime and health related variables with the opioid risk has increased significantly while the correlation of economic and socio-demographic variables has decreased.
no code implementations • 5 Aug 2021 • Xi Peng, Fengchun Qiao, Long Zhao
We are concerned with a worst-case scenario in model generalization, in the sense that a model aims to perform well on many unseen domains while there is only one single domain available for training.
no code implementations • 24 Jul 2021 • Xinlin Zhang, Hengfa Lu, Di Guo, Zongying Lai, Huihui Ye, Xi Peng, Bo Zhao, Xiaobo Qu
The combination of the sparse sampling and the low-rank structured matrix reconstruction has shown promising performance, enabling a significant reduction of the magnetic resonance imaging data acquisition time.
no code implementations • 14 Jul 2021 • Boyun Li, Yijie Lin, Xiao Liu, Peng Hu, Jiancheng Lv, Xi Peng
To generate plausible haze, we study two less-touched but challenging problems in hazy image rendering, namely, i) how to estimate the transmission map from a single image without auxiliary information, and ii) how to adaptively learn the airlight from exemplars, i. e., unpaired real hazy images.
no code implementations • 28 Jun 2021 • Yi Zhang, Sheng Huang, Xi Peng, Dan Yang
DCVAE conducts feature synthesis via pairing two Conditional Variational AutoEncoders (CVAEs) with the same seed but different modality conditions in a dizygotic symbiosis manner.
1 code implementation • CVPR 2021 • Peng Hu, Xi Peng, Hongyuan Zhu, Liangli Zhen, Jie Lin
Recently, cross-modal retrieval is emerging with the help of deep multimodal learning.
1 code implementation • ICLR 2021 • Yu Tian, Jian Ren, Menglei Chai, Kyle Olszewski, Xi Peng, Dimitris N. Metaxas, Sergey Tulyakov
We introduce a motion generator that discovers the desired trajectory, in which content and motion are disentangled.
Ranked #21 on
Video Generation
on UCF-101
1 code implementation • CVPR 2021 • Yijie Lin, Yuanbiao Gou, Zitao Liu, Boyun Li, Jiancheng Lv, Xi Peng
In this paper, we study two challenging problems in incomplete multi-view clustering analysis, namely, i) how to learn an informative and consistent representation among different views without the help of labels and ii) how to recover the missing views from data.
Ranked #1 on
Incomplete multi-view clustering
on n-MNIST
no code implementations • CVPR 2021 • Fengchun Qiao, Xi Peng
To the best of our knowledge, this is the first work to (1) access the generalization uncertainty from a single source and (2) leverage it to guide both input and label augmentation for robust generalization.
1 code implementation • 9 Mar 2021 • Mengmeng Ma, Jian Ren, Long Zhao, Sergey Tulyakov, Cathy Wu, Xi Peng
A common assumption in multimodal learning is the completeness of training data, i. e., full modalities are available in all training examples.
1 code implementation • CVPR 2021 • Mouxing Yang, Yunfan Li, Zhenyu Huang, Zitao Liu, Peng Hu, Xi Peng
To solve such a less-touched problem without the help of labels, we propose simultaneously learning representation and aligning data using a noise-robust contrastive loss.
Contrastive Learning
Partially View-aligned Multi-view Learning
+1
no code implementations • 1 Jan 2021 • Fengchun Qiao, Xi Peng
To the best of our knowledge, this is the first work to (1) access the generalization uncertainty from a single source and (2) leverage it to guide both input and label augmentation for robust generalization.
1 code implementation • CVPR 2021 • Long Zhao, Yuxiao Wang, Jiaping Zhao, Liangzhe Yuan, Jennifer J. Sun, Florian Schroff, Hartwig Adam, Xi Peng, Dimitris Metaxas, Ting Liu
To evaluate the power of the learned representations, in addition to the conventional fully-supervised action recognition settings, we introduce a novel task called single-shot cross-view action recognition.
1 code implementation • NeurIPS 2020 • Yuanbiao Gou, Boyun Li, Zitao Liu, Songfan Yang, Xi Peng
Different from the existing labor-intensive handcrafted architecture design paradigms, we present a novel method, termed as multi-sCaLe nEural ARchitecture sEarch for image Restoration (CLEARER), which is a specifically designed neural architecture search (NAS) for image restoration.
no code implementations • NeurIPS 2020 • Zhenyu Huang, Peng Hu, Joey Tianyi Zhou, Jiancheng Lv, Xi Peng
To solve this practical and challenging problem, we propose a novel multi-view clustering method termed partially view-aligned clustering (PVC).
1 code implementation • NeurIPS 2020 • Long Zhao, Ting Liu, Xi Peng, Dimitris Metaxas
In this paper, we propose a novel and effective regularization term for adversarial data augmentation.
1 code implementation • 21 Sep 2020 • Yunfan Li, Peng Hu, Zitao Liu, Dezhong Peng, Joey Tianyi Zhou, Xi Peng
In this paper, we propose a one-stage online clustering method called Contrastive Clustering (CC) which explicitly performs the instance- and cluster-level contrastive learning.
Ranked #3 on
Image Clustering
on STL-10
(using extra training data)
no code implementations • 31 Aug 2020 • Zhao Kang, Chong Peng, Qiang Cheng, Xinwang Liu, Xi Peng, Zenglin Xu, Ling Tian
Furthermore, most existing graph-based methods conduct clustering and semi-supervised classification on the graph learned from the original data matrix, which doesn't have explicit cluster structure, thus they might not achieve the optimal performance.
1 code implementation • 30 Jun 2020 • Boyun Li, Yuanbiao Gou, Shuhang Gu, Jerry Zitao Liu, Joey Tianyi Zhou, Xi Peng
In this paper, we study two challenging and less-touched problems in single image dehazing, namely, how to make deep learning achieve image dehazing without training on the ground-truth clean image (unsupervised) and a image collection (untrained).
no code implementations • 28 Apr 2020 • Joey Tianyi Zhou, Xi Peng, Yew-Soon Ong
The real-world data usually exhibits heterogeneous properties such as modalities, views, or resources, which brings some unique challenges wherein the key is Heterogeneous Representation Learning (HRL) termed in this paper.
no code implementations • CVPR 2020 • Long Zhao, Xi Peng, Yuxiao Chen, Mubbasir Kapadia, Dimitris N. Metaxas
Our key idea is to generalize the distilled cross-modal knowledge learned from a Source dataset, which contains paired examples from both modalities, to the Target dataset by modeling knowledge as priors on parameters of the Student.
1 code implementation • CVPR 2020 • Fengchun Qiao, Long Zhao, Xi Peng
We are concerned with a worst-case scenario in model generalization, in the sense that a model aims to perform well on many unseen domains while there is only one single domain available for training.
no code implementations • 13 Mar 2020 • Xiaoming Liu, Qirui Li, Chao Shen, Xi Peng, Yadong Zhou, Xiaohong Guan
Graph convolution network (GCN) attracts intensive research interest with broad applications.
no code implementations • 25 Nov 2019 • Lingfei Wu, Ian En-Hsu Yen, Zhen Zhang, Kun Xu, Liang Zhao, Xi Peng, Yinglong Xia, Charu Aggarwal
In particular, RGE is shown to achieve \emph{(quasi-)linear scalability} with respect to the number and the size of the graphs.
no code implementations • 15 Nov 2019 • Yanjie Gou, Yinjie Lei, Lingqiao Liu, Pingping Zhang, Xi Peng
To account for this style shift, the model should adjust its parameters in accordance with entity types.
1 code implementation • NeurIPS 2019 • Yu Tian, Long Zhao, Xi Peng, Dimitris N. Metaxas
Graph kernels are kernel methods measuring graph similarity and serve as a standard tool for graph classification.
Ranked #7 on
Link Prediction
on Cora
1 code implementation • 20 Jul 2019 • Yuxiao Chen, Long Zhao, Xi Peng, Jianbo Yuan, Dimitris N. Metaxas
We propose a Dynamic Graph-Based Spatial-Temporal Attention (DG-STA) method for hand gesture recognition.
Ranked #3 on
Hand Gesture Recognition
on DHG-14
1 code implementation • 4 Jun 2019 • Rahil Mehrizi, Xi Peng, Shaoting Zhang, Ruisong Liao, Kang Li
This study presents a starting point toward a powerful tool for automatic classification of gait disorders and can be used as a basis for future applications of Deep Learning in clinical gait analysis.
4 code implementations • CVPR 2019 • Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris N. Metaxas
In this paper, we study the problem of learning Graph Convolutional Networks (GCNs) for regression.
Ranked #21 on
Monocular 3D Human Pose Estimation
on Human3.6M
no code implementations • NeurIPS 2019 • Yizhe Zhu, Jianwen Xie, Zhiqiang Tang, Xi Peng, Ahmed Elgammal
Zero-shot learning extends the conventional object classification to the unseen class recognition by introducing semantic representations of classes.
no code implementations • 22 Aug 2018 • Xi Peng, Yunnan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou
The second is implementing discrete $k$-means with a differentiable neural network that embraces the advantages of parallel computing, online clustering, and clustering-favorable representation learning.
1 code implementation • 20 Aug 2018 • Zhiqiang Tang, Xi Peng, Shijie Geng, Yizhe Zhu, Dimitris N. Metaxas
We design a new connectivity pattern for the U-Net architecture.
Ranked #30 on
Pose Estimation
on MPII Human Pose
1 code implementation • ECCV 2018 • Zhiqiang Tang, Xi Peng, Shijie Geng, Lingfei Wu, Shaoting Zhang, Dimitris Metaxas
Finally, to reduce the memory consumption and high precision operations both in training and testing, we further quantize weights, inputs, and gradients of our localization network to low bit-width numbers.
Ranked #19 on
Pose Estimation
on MPII Human Pose
1 code implementation • ECCV 2018 • Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris Metaxas
We consider the problem of image-to-video translation, where an input image is translated into an output video containing motions of a single object.
1 code implementation • 28 Jun 2018 • Yu Tian, Xi Peng, Long Zhao, Shaoting Zhang, Dimitris N. Metaxas
Generating multi-view images from a single-view input is an essential yet challenging problem.
no code implementations • CVPR 2018 • Xi Peng, Zhiqiang Tang, Fei Yang, Rogerio Feris, Dimitris Metaxas
Random data augmentation is a critical technique to avoid overfitting in training deep neural network models.
Ranked #3 on
Pose Estimation
on Leeds Sports Poses
no code implementations • 6 Feb 2018 • Rahil Mehrizi, Xi Peng, Zhiqiang Tang, Xu Xu, Dimitris Metaxas, Kang Li
The results are also compared with state-of-the-art methods on HumanEva-I dataset, which demonstrates the superior performance of our approach.
no code implementations • 17 Jan 2018 • Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas
We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model.
no code implementations • CVPR 2018 • Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, Ahmed Elgammal
Most existing zero-shot learning methods consider the problem as a visual semantic embedding one.
no code implementations • 25 Sep 2017 • Xi Peng, Jiashi Feng, Shijie Xiao, Jiwen Lu, Zhang Yi, Shuicheng Yan
In this paper, we present a deep extension of Sparse Subspace Clustering, termed Deep Sparse Subspace Clustering (DSSC).
no code implementations • 25 Mar 2017 • Long Zhao, Fangda Han, Xi Peng, Xun Zhang, Mubbasir Kapadia, Vladimir Pavlovic, Dimitris N. Metaxas
We first recover the facial identity and expressions from the video by fitting a face morphable model for each frame.
no code implementations • ICCV 2017 • Xi Peng, Xiang Yu, Kihyuk Sohn, Dimitris Metaxas, Manmohan Chandraker
Finally, we propose a new feature reconstruction metric learning to explicitly disentangle identity and pose, by demanding alignment between the feature reconstructions through various combinations of identity and pose features, which is obtained from two images of the same subject.
no code implementations • 9 Sep 2016 • Xi Peng, Qiong Hu, Junzhou Huang, Dimitris N. Metaxas
Our approach takes advantage of part-based representation and cascade regression for robust and efficient alignment on each frame.
no code implementations • 19 Aug 2016 • Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas
We propose a novel recurrent encoder-decoder network model for real-time video-based face alignment.
no code implementations • ICCV 2015 • Xi Peng, Shaoting Zhang, Yu Yang, Dimitris N. Metaxas
Face alignment, especially on real-time or large-scale sequential images, is a challenging task with broad applications.
no code implementations • 26 Feb 2015 • Xi Peng, Can-Yi Lu, Zhang Yi, Huajin Tang
A lot of works have shown that frobenius-norm based representation (FNR) is competitive to sparse representation and nuclear-norm based representation (NNR) in numerous tasks such as subspace clustering.
no code implementations • 17 Nov 2014 • Xi Peng, Jiwen Lu, Zhang Yi, Rui Yan
In this paper, we address two challenging problems in unsupervised subspace learning: 1) how to automatically identify the feature dimension of the learned subspace (i. e., automatic subspace learning), and 2) how to learn the underlying subspace in the presence of Gaussian noise (i. e., robust subspace learning).
no code implementations • 22 Sep 2014 • Xi Peng, Rui Yan, Bo Zhao, Huajin Tang, Zhang Yi
Although the methods achieve a higher recognition rate than the traditional SPM, they consume more time to encode the local descriptors extracted from the image.
no code implementations • 25 Sep 2013 • Xi Peng, Huajin Tang, Lei Zhang, Zhang Yi, Shijie Xiao
In this paper, we propose a unified framework which makes representation-based subspace clustering algorithms feasible to cluster both out-of-sample and large-scale data.
no code implementations • CVPR 2013 • Xi Peng, Lei Zhang, Zhang Yi
To address the problems, this paper proposes out-of-sample extension of SSC, named as Scalable Sparse Subspace Clustering (SSSC), which makes SSC feasible to cluster large scale data sets.
no code implementations • 24 Apr 2013 • Liangli Zhen, Zhang Yi, Xi Peng, Dezhong Peng
There are two popular schemes to construct a similarity graph, i. e., pairwise distance based scheme and linear representation based scheme.
no code implementations • 4 Oct 2012 • Xi Peng, Lei Zhang, Zhang Yi, Kok Kiong Tan
The model of low-dimensional manifold and sparse representation are two well-known concise models that suggest each data can be described by a few characteristics.
no code implementations • 5 Sep 2012 • Xi Peng, Zhiding Yu, Huajin Tang, Zhang Yi
Under the framework of graph-based learning, the key to robust subspace clustering and subspace learning is to obtain a good similarity graph that eliminates the effects of errors and retains only connections between the data points from the same subspace (i. e., intra-subspace data points).