1 code implementation • 7 Sep 2024 • Jie Chen, Hua Mao, Yuanbiao Gou, Xi Peng
Second, these algorithms are highly sensitive to the noise contained in high-dimensional data streams.
1 code implementation • 19 Jul 2024 • Tang Li, Mengmeng Ma, Xi Peng
Large pre-trained Vision-Language Models (VLMs) have become ubiquitous foundational components of other models and downstream tasks.
no code implementations • 17 Jul 2024 • Kien X. Nguyen, Fengchun Qiao, Xi Peng
In light of these challenges, we propose a cascading paradigm that simultaneously updates the feature extractor and classifier at test time, mitigating the mismatch between them and enabling long-term model adaptation.
no code implementations • 6 Jul 2024 • Mengmeng Ma, Tang Li, Xi Peng
To achieve OOF-resiliency in a scalable manner, we propose Topology-aware Federated Learning (TFL) that leverages client topology - a graph representing client relationships - to effectively train robust models against OOF data.
no code implementations • 28 Jun 2024 • Yiding Lu, Haobin Li, Yunfan Li, Yijie Lin, Xi Peng
Facilitated by the powerful feature extraction ability of neural networks, deep clustering has achieved great success in analyzing high-dimensional and complex real-world data.
no code implementations • 27 Apr 2024 • Qingyang Zhang, Yake Wei, Zongbo Han, Huazhu Fu, Xi Peng, Cheng Deng, QinGhua Hu, Cai Xu, Jie Wen, Di Hu, Changqing Zhang
Multimodal fusion focuses on integrating information from multiple modalities with the goal of more accurate prediction, which has achieved remarkable progress in a wide range of scenarios, including autonomous driving and medical diagnosis.
no code implementations • International Journal of Computer Vision 2024 • Zhenyu Huang, Peng Hu, guocheng niu, Xinyan Xiao, Jiancheng Lv, Xi Peng
This paper studies a new learning paradigm for noisy labels, i. e., noisy correspondence (NC).
Cross-modal retrieval with noisy correspondence Text Retrieval +1
no code implementations • 28 Mar 2024 • Yanglin Feng, Yang Qin, Dezhong Peng, Hongyuan Zhu, Xi Peng, Peng Hu
We observe that the data is challenging and with noisy correspondence due to the sparsity, noise, or disorder of point clouds and the ambiguity, vagueness, or incompleteness of texts, which make existing cross-modal matching methods ineffective for PTM.
1 code implementation • IEEE Transactions on Image Processing 2024 • Xinran Ma, Mouxing Yang, Yunfan Li, Peng Hu, Jiancheng Lv, Xi Peng
Thanks to the consistency refining and mining strategy of CREAM, the overfitting on the false positives could be prevented and the consistency rooted in the false negatives could be exploited, thus leading to a robust CMR method.
Ranked #1 on Graph Matching on SPair-71k
Cross-modal retrieval with noisy correspondence Graph Matching +1
no code implementations • 13 Mar 2024 • Yuxin Tian, Mouxing Yang, Yunfan Li, Dayiheng Liu, Xingzhang Ren, Xi Peng, Jiancheng Lv
A natural expectation for PEFTs is that the performance of various PEFTs is positively related to the data size and fine-tunable parameter size.
1 code implementation • 30 Jan 2024 • Yijie Lin, Jie Zhang, Zhenyu Huang, Jia Liu, Zujie Wen, Xi Peng
Existing video-language studies mainly focus on learning short video clips, leaving long-term temporal dependencies rarely explored due to over-high computational cost of modeling long videos.
Ranked #1 on Zero-Shot Video Retrieval on YouCook2
Action Segmentation Long Video Retrieval (Background Removed) +2
no code implementations • 23 Jan 2024 • Hao Yang, Hua Mao, Wai Lok Woo, Jie Chen, Xi Peng
Furthermore, the representation process for clustering is enhanced through spectral clustering, and the consistency across multiple views is improved.
no code implementations • 2 Dec 2023 • Yuanbiao Gou, Haiyu Zhao, Boyun Li, Xinyan Xiao, Xi Peng
In contrast to close-set scenarios that restore images from a predefined set of degradations, open-set image restoration aims to handle the unknown degradations that were unforeseen during the pretraining phase, which is less-touched as far as we know.
no code implementations • 20 Nov 2023 • Jie Chen, Zhiming Li, Hua Mao, Wai Lok Woo, Xi Peng
In this paper, we propose a cross-view graph consistency learning (CGCL) method that learns invariant graph representations for link prediction.
1 code implementation • NeurIPS 2023 • Yang Qin, Yuan Sun, Dezhong Peng, Joey Tianyi Zhou, Xi Peng, Peng Hu
Recently, image-text matching has attracted more and more attention from academia and industry, which is fundamental to understanding the latent correspondence across visual and textual modalities.
Cross-modal retrieval with noisy correspondence Image-text matching +1
1 code implementation • 18 Oct 2023 • Yunfan Li, Peng Hu, Dezhong Peng, Jiancheng Lv, Jianping Fan, Xi Peng
The core of clustering is incorporating prior knowledge to construct supervision signals.
1 code implementation • ICCV 2023 • Qitong Wang, Long Zhao, Liangzhe Yuan, Ting Liu, Xi Peng
To facilitate the data efficiency of multiview learning, we further perform video-text alignment for first-person and third-person videos, to fully leverage the semantic knowledge to improve video representations.
1 code implementation • 22 Aug 2023 • Yiding Lu, Yijie Lin, Mouxing Yang, Dezhong Peng, Peng Hu, Xi Peng
In recent, some robust contrastive multi-view clustering (MvC) methods have been proposed, which construct data pairs from neighborhoods to alleviate the false negative issue, i. e., some intra-cluster samples are wrongly treated as negative pairs.
1 code implementation • CVPR 2024 • Yang Qin, Yingke Chen, Dezhong Peng, Xi Peng, Joey Tianyi Zhou, Peng Hu
Text-to-image person re-identification (TIReID) is a compelling topic in the cross-modal community, which aims to retrieve the target person based on a textual query.
Ranked #1 on Text-based Person Retrieval with Noisy Correspondence on ICFG-PEDES (using extra training data)
1 code implementation • 26 Jul 2023 • Fengchun Qiao, Xi Peng
Out-of-distribution (OOD) generalization is a challenging machine learning problem yet highly desirable in many high-stake applications.
1 code implementation • 12 Jul 2023 • Kien X. Nguyen, Liying Zheng, Ashley L. Hawke, Robert E. Carey, Scott P. Breloff, Kang Li, Xi Peng
It is necessary to analyze the whole-body kinematics (including joint locations and joint angles) to assess risks of fatal and musculoskeletal injuries in occupational tasks.
no code implementations • 11 Jul 2023 • Amani Kiruga, Xi Peng
We propose a methodology to address the challenge of instance segmentation in autonomous systems, specifically targeting accessibility and inclusivity.
no code implementations • 14 Jun 2023 • Ruiyang Zhao, Xi Peng, Varun A. Kelkar, Mark A. Anastasio, Fan Lam
We present a novel method that integrates subspace modeling with an adaptive generative image prior for high-dimensional MR image reconstruction.
1 code implementation • 3 Jun 2023 • Qingyang Zhang, Haitao Wu, Changqing Zhang, QinGhua Hu, Huazhu Fu, Joey Tianyi Zhou, Xi Peng
The inherent challenge of multimodal fusion is to precisely capture the cross-modal correlation and flexibly conduct cross-modal interaction.
1 code implementation • 2 Jun 2023 • Jiawei Wu, Changqing Zhang, Zuoyong Li, Huazhu Fu, Xi Peng, Joey Tianyi Zhou
Cutting out an object and estimating its opacity mask, known as image matting, is a key task in image and video editing.
1 code implementation • 22 May 2023 • Pengxin Zeng, Mouxing Yang, Yiding Lu, Changqing Zhang, Peng Hu, Xi Peng
To address this problem, we present a novel framework called SeMantic Invariance LEarning (SMILE) for multi-view clustering with incomplete information that does not require any paired samples.
1 code implementation • ICCV 2023 • Jie Chen, Hua Mao, Wai Lok Woo, Xi Peng
Then, a cluster-level CVCL strategy is presented to explore consistent semantic label information among the multiple views in the fine-tuning stage.
1 code implementation • IEEE Transactions on Pattern Analysis and Machine Intelligence 2023 • Yijie Lin, Yuanbiao Gou, Xiaotian Liu, Jinfeng Bai, Jiancheng Lv, Xi Peng
In this article, we propose a unified framework to solve the following two challenging problems in incomplete multi-view representation learning: i) how to learn a consistent representation unifying different views, and ii) how to recover the missing views.
1 code implementation • CVPR 2023 • Tang Li, Fengchun Qiao, Mengmeng Ma, Xi Peng
How to develop robust explanations against out-of-distribution data?
no code implementations • CVPR 2024 • Buyu Liu, BaoJun, Jianping Fan, Xi Peng, Kui Ren, Jun Yu
More desired attacks, to this end, should be able to fool defenses with such consistency checks.
1 code implementation • IEEE Transactions on Pattern Analysis and Machine Intelligence 2023 • Peng Hu, Zhenyu Huang, Dezhong Peng, Xu Wang, Xi Peng
On the one hand, our method only utilizes the negative information which is much less likely false compared with the positive information, thus avoiding the overfitting issue to PMPs.
Contrastive Learning Cross-modal retrieval with noisy correspondence +2
1 code implementation • 26 Jan 2023 • Haobin Li, Yunfan Li, Mouxing Yang, Peng Hu, Dezhong Peng, Xi Peng
Thanks to our dual-stream model, both cluster- and view-specific information could be captured, and thus the instance commonality and view versatility could be preserved to facilitate IMvC.
no code implementations • CVPR 2023 • Yuanbiao Gou, Peng Hu, Jiancheng Lv, Hongyuan Zhu, Xi Peng
Existing studies have empirically observed that the resolution of the low-frequency region is easier to enhance than that of the high-frequency one.
1 code implementation • CVPR 2023 • Yanglin Feng, Hongyuan Zhu, Dezhong Peng, Xi Peng, Peng Hu
Recently, with the advent of Metaverse and AI Generated Content, cross-modal retrieval becomes popular with a burst of 2D and 3D data.
1 code implementation • CVPR 2023 • Haiyu Zhao, Yuanbiao Gou, Boyun Li, Dezhong Peng, Jiancheng Lv, Xi Peng
Vision Transformers have shown promising performance in image restoration, which usually conduct window- or channel-based attention to avoid intensive computations.
3 code implementations • ICCV 2023 • Yijie Lin, Mouxing Yang, Jun Yu, Peng Hu, Changqing Zhang, Xi Peng
In this paper, we study a novel and widely existing problem in graph matching (GM), namely, Bi-level Noisy Correspondence (BNC), which refers to node-level noisy correspondence (NNC) and edge-level noisy correspondence (ENC).
Ranked #1 on Graph Matching on Willow Object Class
no code implementations • 8 Dec 2022 • Wenxin Wang, Boyun Li, Yuanbiao Gou, Peng Hu, WangMeng Zuo, Xi Peng
To tackle the first challenge, we proposed a Degradation Relationship Index (DRI) which is defined as the mean drop rate difference in the validation loss between two models which are respectively trained using the anchor degradation and the mixture of the anchor and the auxiliary degradations.
2 code implementations • 21 Oct 2022 • Yunfan Li, Mouxing Yang, Dezhong Peng, Taihao Li, Jiantao Huang, Xi Peng
Specifically, we find that when the data is projected into a feature space with a dimensionality of the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representation, respectively.
Ranked #1 on Short Text Clustering on Biomedical
1 code implementation • ACM International Conference on Multimedia 2022 • Yang Qin, Dezhong Peng, Xi Peng, Xu Wang, Peng Hu
However, it will unavoidably introduce noise (i. e., mismatched pairs) into training data, dubbed noisy correspondence.
Ranked #2 on Text-based Person Retrieval with Noisy Correspondence on RSTPReid (using extra training data)
Cross-modal retrieval with noisy correspondence Retrieval +1
1 code implementation • CVPR 2023 • Pengxin Zeng, Yunfan Li, Peng Hu, Dezhong Peng, Jiancheng Lv, Xi Peng
Fair clustering aims to divide data into distinct clusters while preventing sensitive attributes (\textit{e. g.}, gender, race, RNA sequencing technique) from dominating the clustering.
Ranked #1 on Image Clustering on HAR
no code implementations • 23 Sep 2022 • Liang Jiang, Zhenyu Huang, Jia Liu, Zujie Wen, Xi Peng
Such a process will inevitably introduce mismatched pairs (i. e., noisy correspondence) due to i) the unavailable QA pairs in target documents, and ii) the domain shift during applying the QA construction model to the target domain.
1 code implementation • 16 Aug 2022 • Tilman Hinnerichs, Zhenwei Tang, Xi Peng, Xiangliang Zhang, Robert Hoehndorf
Ontologies are one of the richest sources of knowledge.
no code implementations • 29 May 2022 • Zhenwei Tang, Shichao Pei, Xi Peng, Fuzhen Zhuang, Xiangliang Zhang, Robert Hoehndorf
Neural logical reasoning (NLR) is a fundamental task to explore such knowledge bases, which aims at answering multi-hop queries with logical operations based on distributed representations of queries and answers.
no code implementations • 23 May 2022 • Peng Hu, Xi Peng, Hongyuan Zhu, Mohamed M. Sabry Aly, Jie Lin
Numerous network compression methods such as pruning and quantization are proposed to reduce the model size significantly, of which the key is to find suitable compression allocation (e. g., pruning sparsity and quantization codebook) of each layer.
no code implementations • CVPR 2022 • Mengmeng Ma, Jian Ren, Long Zhao, Davide Testuggine, Xi Peng
Based on these findings, we propose a principle method to improve the robustness of Transformer models by automatically searching for an optimal fusion strategy regarding input data.
1 code implementation • 8 Mar 2022 • Yuanbiao Gou, Peng Hu, Jiancheng Lv, Joey Tianyi Zhou, Xi Peng
AFuB devotes to adaptively sampling and transferring the features from one scale to another scale, which fuses the multi-scale features with varying characteristics from coarse to fine.
no code implementations • 28 Feb 2022 • Xi Peng, Zhenwei Tang, Maxat Kulmanov, Kexin Niu, Robert Hoehndorf
Since the intersection of boxes remains as a box, the intersectional closure is satisfied.
1 code implementation • CVPR 2022 • Mouxing Yang, Zhenyu Huang, Peng Hu, Taihao Li, Jiancheng Lv, Xi Peng
To solve the TNL problem, we propose a novel method for robust VI-ReID, termed DuAlly Robust Training (DART).
1 code implementation • CVPR 2022 • Boyun Li, Xiao Liu, Peng Hu, Zhongqin Wu, Jiancheng Lv, Xi Peng
In this paper, we study a challenging problem in image restoration, namely, how to develop an all-in-one method that could recover images from a variety of unknown corruption types and levels.
no code implementations • 17 Dec 2021 • Tang Li, Jing Gao, Xi Peng
Here we explore the capacity of deep spatial learning for the predictive modeling of urbanization.
1 code implementation • NeurIPS 2021 • Zhenyu Huang, guocheng niu, Xiao Liu, Wenbiao Ding, Xinyan Xiao, Hua Wu, Xi Peng
Based on this observation, we reveal and study a latent and challenging direction in cross-modal matching, named noisy correspondence, which could be regarded as a new paradigm of noisy labels.
Cross-modal retrieval with noisy correspondence Image-text matching +3
no code implementations • 29 Sep 2021 • Fengchun Qiao, Xi Peng
The key idea is to estimate the density ratio between the distributions of the two modalities, and use it to calibrate the similarity measurement in the embedding space.
no code implementations • 16 Aug 2021 • Sachin Gavali, Chuming Chen, Julie Cowart, Xi Peng, Shanshan Ding, Cathy Wu, Tammy Anderson
Furthermore, we discovered that, as the epidemic has shifted from legal (i. e., prescription opioids) to illegal (e. g., heroin and fentanyl) drugs in recent years, the correlation of environment, crime and health related variables with the opioid risk has increased significantly while the correlation of economic and socio-demographic variables has decreased.
no code implementations • 5 Aug 2021 • Xi Peng, Fengchun Qiao, Long Zhao
We are concerned with a worst-case scenario in model generalization, in the sense that a model aims to perform well on many unseen domains while there is only one single domain available for training.
no code implementations • 24 Jul 2021 • Xinlin Zhang, Hengfa Lu, Di Guo, Zongying Lai, Huihui Ye, Xi Peng, Bo Zhao, Xiaobo Qu
The combination of the sparse sampling and the low-rank structured matrix reconstruction has shown promising performance, enabling a significant reduction of the magnetic resonance imaging data acquisition time.
no code implementations • 14 Jul 2021 • Boyun Li, Yijie Lin, Xiao Liu, Peng Hu, Jiancheng Lv, Xi Peng
To generate plausible haze, we study two less-touched but challenging problems in hazy image rendering, namely, i) how to estimate the transmission map from a single image without auxiliary information, and ii) how to adaptively learn the airlight from exemplars, i. e., unpaired real hazy images.
no code implementations • 28 Jun 2021 • Yi Zhang, Sheng Huang, Xi Peng, Dan Yang
DCVAE conducts feature synthesis via pairing two Conditional Variational AutoEncoders (CVAEs) with the same seed but different modality conditions in a dizygotic symbiosis manner.
1 code implementation • CVPR 2021 • Peng Hu, Xi Peng, Hongyuan Zhu, Liangli Zhen, Jie Lin
Recently, cross-modal retrieval is emerging with the help of deep multimodal learning.
1 code implementation • ICLR 2021 • Yu Tian, Jian Ren, Menglei Chai, Kyle Olszewski, Xi Peng, Dimitris N. Metaxas, Sergey Tulyakov
We introduce a motion generator that discovers the desired trajectory, in which content and motion are disentangled.
Ranked #34 on Video Generation on UCF-101
2 code implementations • CVPR 2021 • Yijie Lin, Yuanbiao Gou, Zitao Liu, Boyun Li, Jiancheng Lv, Xi Peng
In this paper, we study two challenging problems in incomplete multi-view clustering analysis, namely, i) how to learn an informative and consistent representation among different views without the help of labels and ii) how to recover the missing views from data.
Ranked #1 on Incomplete multi-view clustering on n-MNIST
no code implementations • CVPR 2021 • Fengchun Qiao, Xi Peng
To the best of our knowledge, this is the first work to (1) access the generalization uncertainty from a single source and (2) leverage it to guide both input and label augmentation for robust generalization.
1 code implementation • 9 Mar 2021 • Mengmeng Ma, Jian Ren, Long Zhao, Sergey Tulyakov, Cathy Wu, Xi Peng
A common assumption in multimodal learning is the completeness of training data, i. e., full modalities are available in all training examples.
1 code implementation • CVPR 2021 • Mouxing Yang, Yunfan Li, Zhenyu Huang, Zitao Liu, Peng Hu, Xi Peng
To solve such a less-touched problem without the help of labels, we propose simultaneously learning representation and aligning data using a noise-robust contrastive loss.
no code implementations • 1 Jan 2021 • Fengchun Qiao, Xi Peng
To the best of our knowledge, this is the first work to (1) access the generalization uncertainty from a single source and (2) leverage it to guide both input and label augmentation for robust generalization.
1 code implementation • CVPR 2021 • Long Zhao, Yuxiao Wang, Jiaping Zhao, Liangzhe Yuan, Jennifer J. Sun, Florian Schroff, Hartwig Adam, Xi Peng, Dimitris Metaxas, Ting Liu
To evaluate the power of the learned representations, in addition to the conventional fully-supervised action recognition settings, we introduce a novel task called single-shot cross-view action recognition.
no code implementations • NeurIPS 2020 • Zhenyu Huang, Peng Hu, Joey Tianyi Zhou, Jiancheng Lv, Xi Peng
To solve this practical and challenging problem, we propose a novel multi-view clustering method termed partially view-aligned clustering (PVC).
1 code implementation • NeurIPS 2020 • Yuanbiao Gou, Boyun Li, Zitao Liu, Songfan Yang, Xi Peng
Different from the existing labor-intensive handcrafted architecture design paradigms, we present a novel method, termed as multi-sCaLe nEural ARchitecture sEarch for image Restoration (CLEARER), which is a specifically designed neural architecture search (NAS) for image restoration.
1 code implementation • NeurIPS 2020 • Long Zhao, Ting Liu, Xi Peng, Dimitris Metaxas
In this paper, we propose a novel and effective regularization term for adversarial data augmentation.
2 code implementations • 21 Sep 2020 • Yunfan Li, Peng Hu, Zitao Liu, Dezhong Peng, Joey Tianyi Zhou, Xi Peng
In this paper, we propose a one-stage online clustering method called Contrastive Clustering (CC) which explicitly performs the instance- and cluster-level contrastive learning.
Ranked #5 on Image Clustering on Tiny-ImageNet
no code implementations • 31 Aug 2020 • Zhao Kang, Chong Peng, Qiang Cheng, Xinwang Liu, Xi Peng, Zenglin Xu, Ling Tian
Furthermore, most existing graph-based methods conduct clustering and semi-supervised classification on the graph learned from the original data matrix, which doesn't have explicit cluster structure, thus they might not achieve the optimal performance.
1 code implementation • 30 Jun 2020 • Boyun Li, Yuanbiao Gou, Shuhang Gu, Jerry Zitao Liu, Joey Tianyi Zhou, Xi Peng
In this paper, we study two challenging and less-touched problems in single image dehazing, namely, how to make deep learning achieve image dehazing without training on the ground-truth clean image (unsupervised) and a image collection (untrained).
no code implementations • 28 Apr 2020 • Joey Tianyi Zhou, Xi Peng, Yew-Soon Ong
The real-world data usually exhibits heterogeneous properties such as modalities, views, or resources, which brings some unique challenges wherein the key is Heterogeneous Representation Learning (HRL) termed in this paper.
no code implementations • CVPR 2020 • Long Zhao, Xi Peng, Yuxiao Chen, Mubbasir Kapadia, Dimitris N. Metaxas
Our key idea is to generalize the distilled cross-modal knowledge learned from a Source dataset, which contains paired examples from both modalities, to the Target dataset by modeling knowledge as priors on parameters of the Student.
1 code implementation • CVPR 2020 • Fengchun Qiao, Long Zhao, Xi Peng
We are concerned with a worst-case scenario in model generalization, in the sense that a model aims to perform well on many unseen domains while there is only one single domain available for training.
no code implementations • 13 Mar 2020 • Xiaoming Liu, Qirui Li, Chao Shen, Xi Peng, Yadong Zhou, Xiaohong Guan
Graph convolution network (GCN) attracts intensive research interest with broad applications.
no code implementations • 25 Nov 2019 • Lingfei Wu, Ian En-Hsu Yen, Zhen Zhang, Kun Xu, Liang Zhao, Xi Peng, Yinglong Xia, Charu Aggarwal
In particular, RGE is shown to achieve \emph{(quasi-)linear scalability} with respect to the number and the size of the graphs.
no code implementations • 15 Nov 2019 • Yanjie Gou, Yinjie Lei, Lingqiao Liu, Pingping Zhang, Xi Peng
To account for this style shift, the model should adjust its parameters in accordance with entity types.
1 code implementation • NeurIPS 2019 • Yu Tian, Long Zhao, Xi Peng, Dimitris N. Metaxas
Graph kernels are kernel methods measuring graph similarity and serve as a standard tool for graph classification.
Ranked #8 on Link Prediction on Cora
1 code implementation • 20 Jul 2019 • Yuxiao Chen, Long Zhao, Xi Peng, Jianbo Yuan, Dimitris N. Metaxas
We propose a Dynamic Graph-Based Spatial-Temporal Attention (DG-STA) method for hand gesture recognition.
Ranked #4 on Hand Gesture Recognition on SHREC 2017
1 code implementation • 4 Jun 2019 • Rahil Mehrizi, Xi Peng, Shaoting Zhang, Ruisong Liao, Kang Li
This study presents a starting point toward a powerful tool for automatic classification of gait disorders and can be used as a basis for future applications of Deep Learning in clinical gait analysis.
5 code implementations • CVPR 2019 • Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris N. Metaxas
In this paper, we study the problem of learning Graph Convolutional Networks (GCNs) for regression.
Ranked #26 on Monocular 3D Human Pose Estimation on Human3.6M
no code implementations • NeurIPS 2019 • Yizhe Zhu, Jianwen Xie, Zhiqiang Tang, Xi Peng, Ahmed Elgammal
Zero-shot learning extends the conventional object classification to the unseen class recognition by introducing semantic representations of classes.
no code implementations • 22 Aug 2018 • Xi Peng, Yunnan Li, Ivor W. Tsang, Hongyuan Zhu, Jiancheng Lv, Joey Tianyi Zhou
The second is implementing discrete $k$-means with a differentiable neural network that embraces the advantages of parallel computing, online clustering, and clustering-favorable representation learning.
1 code implementation • 20 Aug 2018 • Zhiqiang Tang, Xi Peng, Shijie Geng, Yizhe Zhu, Dimitris N. Metaxas
We design a new connectivity pattern for the U-Net architecture.
Ranked #30 on Pose Estimation on MPII Human Pose
1 code implementation • ECCV 2018 • Zhiqiang Tang, Xi Peng, Shijie Geng, Lingfei Wu, Shaoting Zhang, Dimitris Metaxas
Finally, to reduce the memory consumption and high precision operations both in training and testing, we further quantize weights, inputs, and gradients of our localization network to low bit-width numbers.
Ranked #19 on Pose Estimation on MPII Human Pose
1 code implementation • ECCV 2018 • Long Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris Metaxas
We consider the problem of image-to-video translation, where an input image is translated into an output video containing motions of a single object.
1 code implementation • 28 Jun 2018 • Yu Tian, Xi Peng, Long Zhao, Shaoting Zhang, Dimitris N. Metaxas
Generating multi-view images from a single-view input is an essential yet challenging problem.
no code implementations • CVPR 2018 • Xi Peng, Zhiqiang Tang, Fei Yang, Rogerio Feris, Dimitris Metaxas
Random data augmentation is a critical technique to avoid overfitting in training deep neural network models.
Ranked #3 on Pose Estimation on Leeds Sports Poses
no code implementations • 6 Feb 2018 • Rahil Mehrizi, Xi Peng, Zhiqiang Tang, Xu Xu, Dimitris Metaxas, Kang Li
The results are also compared with state-of-the-art methods on HumanEva-I dataset, which demonstrates the superior performance of our approach.
no code implementations • 17 Jan 2018 • Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas
We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model.
no code implementations • CVPR 2018 • Yizhe Zhu, Mohamed Elhoseiny, Bingchen Liu, Xi Peng, Ahmed Elgammal
Most existing zero-shot learning methods consider the problem as a visual semantic embedding one.
no code implementations • 25 Sep 2017 • Xi Peng, Jiashi Feng, Shijie Xiao, Jiwen Lu, Zhang Yi, Shuicheng Yan
In this paper, we present a deep extension of Sparse Subspace Clustering, termed Deep Sparse Subspace Clustering (DSSC).
no code implementations • 25 Mar 2017 • Long Zhao, Fangda Han, Xi Peng, Xun Zhang, Mubbasir Kapadia, Vladimir Pavlovic, Dimitris N. Metaxas
We first recover the facial identity and expressions from the video by fitting a face morphable model for each frame.
no code implementations • ICCV 2017 • Xi Peng, Xiang Yu, Kihyuk Sohn, Dimitris Metaxas, Manmohan Chandraker
Finally, we propose a new feature reconstruction metric learning to explicitly disentangle identity and pose, by demanding alignment between the feature reconstructions through various combinations of identity and pose features, which is obtained from two images of the same subject.
no code implementations • 9 Sep 2016 • Xi Peng, Qiong Hu, Junzhou Huang, Dimitris N. Metaxas
Our approach takes advantage of part-based representation and cascade regression for robust and efficient alignment on each frame.
no code implementations • 19 Aug 2016 • Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas
We propose a novel recurrent encoder-decoder network model for real-time video-based face alignment.
no code implementations • ICCV 2015 • Xi Peng, Shaoting Zhang, Yu Yang, Dimitris N. Metaxas
Face alignment, especially on real-time or large-scale sequential images, is a challenging task with broad applications.
no code implementations • 26 Feb 2015 • Xi Peng, Can-Yi Lu, Zhang Yi, Huajin Tang
A lot of works have shown that frobenius-norm based representation (FNR) is competitive to sparse representation and nuclear-norm based representation (NNR) in numerous tasks such as subspace clustering.
no code implementations • 17 Nov 2014 • Xi Peng, Jiwen Lu, Zhang Yi, Rui Yan
In this paper, we address two challenging problems in unsupervised subspace learning: 1) how to automatically identify the feature dimension of the learned subspace (i. e., automatic subspace learning), and 2) how to learn the underlying subspace in the presence of Gaussian noise (i. e., robust subspace learning).
no code implementations • 22 Sep 2014 • Xi Peng, Rui Yan, Bo Zhao, Huajin Tang, Zhang Yi
Although the methods achieve a higher recognition rate than the traditional SPM, they consume more time to encode the local descriptors extracted from the image.
no code implementations • 25 Sep 2013 • Xi Peng, Huajin Tang, Lei Zhang, Zhang Yi, Shijie Xiao
In this paper, we propose a unified framework which makes representation-based subspace clustering algorithms feasible to cluster both out-of-sample and large-scale data.
no code implementations • CVPR 2013 • Xi Peng, Lei Zhang, Zhang Yi
To address the problems, this paper proposes out-of-sample extension of SSC, named as Scalable Sparse Subspace Clustering (SSSC), which makes SSC feasible to cluster large scale data sets.
no code implementations • 24 Apr 2013 • Liangli Zhen, Zhang Yi, Xi Peng, Dezhong Peng
There are two popular schemes to construct a similarity graph, i. e., pairwise distance based scheme and linear representation based scheme.
no code implementations • 4 Oct 2012 • Xi Peng, Lei Zhang, Zhang Yi, Kok Kiong Tan
The model of low-dimensional manifold and sparse representation are two well-known concise models that suggest each data can be described by a few characteristics.
no code implementations • 5 Sep 2012 • Xi Peng, Zhiding Yu, Huajin Tang, Zhang Yi
Under the framework of graph-based learning, the key to robust subspace clustering and subspace learning is to obtain a good similarity graph that eliminates the effects of errors and retains only connections between the data points from the same subspace (i. e., intra-subspace data points).