no code implementations • 16 Oct 2024 • Jinqian Chen, Jihua Zhu
Federated learning (FL) enables collaborative model training across distributed clients while preserving data privacy.
no code implementations • 24 Sep 2024 • Naiwen Hu, Haozhe Cheng, Yifan Xie, Shiqi Li, Jihua Zhu
Overall, 3D-JEPA predicts the representation of target blocks from a context block using the encoder and context-aware decoder architecture.
no code implementations • 24 Sep 2024 • Naiwen Hu, Haozhe Cheng, Yifan Xie, Pengcheng Shi, Jihua Zhu
3D contrastive representation learning has exhibited remarkable efficacy across various downstream tasks.
no code implementations • 15 Sep 2024 • Shiqi Li, Jihua Zhu, Yifan Xie, Naiwen Hu, Mingchen Zhu, Zhongyu Li, Di Wang
Our method synthesizes the merit of both optimization-based and learning-based methods.
no code implementations • 10 Jul 2024 • Shiqi Li, Jihua Zhu, Yifan Xie, Mingchen Zhu
In this paper, we propose an incremental multiview point cloud registration method that progressively registers all scans to a growing meta-shape.
1 code implementation • 23 Apr 2024 • Xingyue Zhao, Zhongyu Li, Xiangde Luo, Peiqi Li, Peng Huang, Jianwei Zhu, Yang Liu, Jihua Zhu, Meng Yang, Shi Chang, Jun Dong
Especially, an asymmetric learning framework is developed by extending the aspect ratio annotations with two types of pseudo labels, i. e., conservative labels and radical labels, to train two asymmetric segmentation networks simultaneously.
1 code implementation • 26 Feb 2024 • Jinqian Chen, Jihua Zhu, Qinghai Zheng, Zhongyu Li, Zhiqiang Tian
Inspired by this observation, we propose the "Assembled Projection Heads" (APH) method for enhancing the reliability of federated models.
1 code implementation • 9 Jan 2024 • Yifan Xie, Boyu Wang, Shiqi Li, Jihua Zhu
In this paper, we propose a novel Iterative Feedback Network (IFNet) for unsupervised point cloud registration, in which the representation of low-level features is efficiently enriched by rerouting subsequent high-level features.
1 code implementation • 12 Dec 2023 • Haoyu Tang, Han Jiang, Mingzhu Xu, Yupeng Hu, Jihua Zhu, Liqiang Nie
Thereafter, we design two (constant- and variable- speed) incremental instance learning strategies for easy-to-hard model training, thus ensuring the reliability of these video pseudolabels and further improving overall localization performance.
1 code implementation • 5 Dec 2023 • Jinqian Chen, Jihua Zhu, Qinghai Zheng
Assuming that all clients have a single shared sample for each class, the knowledge anchor is constructed before each local training stage by extracting shared samples for missing classes and randomly selecting one sample per class for non-dominant classes.
1 code implementation • 2 Nov 2023 • Yifan Xie, Jihua Zhu, Shiqi Li, Pengcheng Shi
Specifically, we first incorporate the projected images from the point clouds and fuse the cross-modal features using the attention mechanism.
no code implementations • 18 Oct 2023 • Shiqi Li, Jihua Zhu, Yifan Xie
Point cloud registration plays a crucial role in various computer vision tasks, and usually demands the resolution of partial overlap registration in practice.
1 code implementation • 29 Aug 2023 • Junyang Wang, Yiyang Zhou, Guohai Xu, Pengcheng Shi, Chenlin Zhao, Haiyang Xu, Qinghao Ye, Ming Yan, Ji Zhang, Jihua Zhu, Jitao Sang, Haoyu Tang
In this paper, we propose Hallucination Evaluation based on Large Language Models (HaELM), an LLM-based hallucination evaluation framework.
no code implementations • 18 Aug 2023 • Pengcheng Shi, Jie Zhang, Haozhe Cheng, Junyang Wang, Yiyang Zhou, Chenlin Zhao, Jihua Zhu
Specifically, we propose a plug-and-play Overlap Bias Matching Module (OBMM) comprising two integral components, overlap sampling module and bias prediction module.
no code implementations • 16 May 2023 • Yifei Wang, Yiyang Zhou, Jihua Zhu, Xinyuan Liu, Wenbiao Yan, Zhiqiang Tian
Label distribution learning (LDL) is a new machine learning paradigm for solving label ambiguity.
no code implementations • 16 May 2023 • Pengcheng Shi, Haozhe Cheng, Xu Han, Yiyang Zhou, Jihua Zhu
To tackle these challenges, we propose an information interaction-based generative network for point cloud completion ($\mathbf{DualGenerator}$).
1 code implementation • CVPR 2023 • Qinghai Zheng, Jihua Zhu, Haoyu Tang
In this work, we focus on the challenging problem of Label Enhancement (LE), which aims to exactly recover label distributions from logical labels, and present a novel Label Information Bottleneck (LIB) method for LE.
no code implementations • 8 Mar 2023 • Yiyang Zhou, Qinghai Zheng, Shunshun Bai, Jihua Zhu
In this work, we devote ourselves to the challenging task of Unsupervised Multi-view Representation Learning (UMRL), which requires learning a unified feature representation from multiple views in an unsupervised manner.
no code implementations • 28 Feb 2023 • Wenbiao Yan, Jihua Zhu, Yiyang Zhou, Yifei Wang, Qinghai Zheng
In this way, the learned semantic consistency from multi-view data can improve the information bottleneck to more exactly distinguish the consistent information and learn a unified feature representation with more discriminative consistent information for clustering.
no code implementations • 26 Feb 2023 • Yiyang Zhou, Qinghai Zheng, Wenbiao Yan, Yifei Wang, Pengcheng Shi, Jihua Zhu
Further, we designed a multi-level consistency collaboration strategy, which utilizes the consistent information of semantic space as a self-supervised signal to collaborate with the cluster assignments in feature space.
Ranked #1 on Multiview Clustering on Fashion-MNIST
no code implementations • 2 Dec 2022 • Zutao Jiang, Guansong Lu, Xiaodan Liang, Jihua Zhu, Wei zhang, Xiaojun Chang, Hang Xu
Here, we make the first attempt to achieve generic text-guided cross-category 3D object generation via a new 3D-TOGO model, which integrates a text-to-views generation module and a views-to-3D generation module.
no code implementations • CVPR 2022 • Zhang Chen, Zhiqiang Tian, Jihua Zhu, Ce Li, Shaoyi Du
Our method is motivated by two cause-effect chains including category-causality chain and anatomy-causality chain.
no code implementations • 17 Oct 2021 • Zutao Jiang, Changlin Li, Xiaojun Chang, Jihua Zhu, Yi Yang
Here, we present dynamic slimmable denoising network (DDS-Net), a general method to achieve good denoising quality with less computational complexity, via dynamically adjusting the channel configurations of networks at test time with respect to different noisy images.
no code implementations • 9 Aug 2021 • Xinsheng Wang, Qicong Xie, Jihua Zhu, Lei Xie, Scharenborg
In this paper, we present an automatic method to generate synchronized speech and talking-head videos on the basis of text and a single face image of an arbitrary person as input.
no code implementations • 20 Mar 2021 • Jihua Zhu, Di Wang, Jiaxi Mu, Huimin Lu, Zhiqiang Tian, Zhongyu Li
Under the NDT framework, this paper proposes a novel multi-view registration method, named 3D multi-view registration based on the normal distributions transform (3DMNDT), which integrates the K-means clustering and Lie algebra solver to achieve multi-view registration.
1 code implementation • 13 Dec 2020 • Yanlin Ma, Jihua Zhu, Zhongyu Li, Zhiqiang Tian, Yaochen Li
What's more, the t-distribution takes the noise with heavy-tail into consideration, which makes the proposed method be inherently robust to noises and outliers.
1 code implementation • 23 Oct 2020 • Xinsheng Wang, Siyuan Feng, Jihua Zhu, Mark Hasegawa-Johnson, Odette Scharenborg
This paper proposes a new model, referred to as the show and speak (SAS) model that, for the first time, is able to directly synthesize spoken descriptions of images, bypassing the need for any text or phonemes.
1 code implementation • 19 Oct 2020 • Qinghai Zheng, Jihua Zhu, Yuanyuan Ma, Zhongyu Li, Zhiqiang Tian
Furthermore, underlying graph information of multi-view data is always ignored in most existing multi-view subspace clustering methods.
no code implementations • 19 Oct 2020 • Qinghai Zheng, Yu Zhang, Jihua Zhu, Zhongyu Li, Haoyu Tang, Shuangxun Ma
It can be seen that specific information contained in different views is fully investigated by the rank preserving decomposition, and the high-order correlations of multi-view data are also mined by the low-rank tensor constraint.
no code implementations • 15 Oct 2020 • Qinghai Zheng, Jihua Zhu, Shuangxun Ma
This paper focuses on the multi-view clustering, which aims to promote clustering results with multi-view data.
1 code implementation • 22 Sep 2020 • Haoyu Tang, Jihua Zhu, Meng Liu, Member, IEEE, Zan Gao, Zhiyong Cheng
Another contribution is that we propose an additional predictor to utilize the internal frames in the model training to improve the localization accuracy.
no code implementations • 21 Aug 2020 • Dou Xu, Chang Cai, Chaowei Fang, Bin Kong, Jihua Zhu, Zhongyu Li
To thisend, we present a novel method for the unsupervised domain adaptationin histopathological image analysis, based on a backbone for embeddinginput images into a feature space, and a graph neural layer for propa-gating the supervision signals of images with labels.
no code implementations • 7 Jul 2020 • Xinyuan Liu, Jihua Zhu, Qinghai Zheng, Zhongyu Li, Ruixin Liu, Jun Wang
More specifically, this novel loss function not only considers the mapping errors generated from the projection of the input space into the output one but also accounts for the reconstruction errors generated from the projection of the output space back to the input one.
2 code implementations • 14 May 2020 • Xinsheng Wang, Tingting Qiao, Jihua Zhu, Alan Hanjalic, Odette Scharenborg
An estimated half of the world's languages do not have a written form, making it impossible for these languages to benefit from any existing text-based technologies.
no code implementations • 21 Apr 2020 • Jihua Zhu, Jie Hu, Huimin Lu, Badong Chen, Zhongyu Li
Recently, the motion averaging method has been introduced as an effective means to solve the multi-view registration problem.
no code implementations • 7 Apr 2020 • Qinghai Zheng, Jihua Zhu, Haoyu Tang, Xinyuan Liu, Zhongyu Li, Huimin Lu
Recently, label distribution learning (LDL) has drawn much attention in machine learning, where LDL model is learned from labelel instances.
no code implementations • 7 Apr 2020 • Qinghai Zheng, Jihua Zhu, Zhongyu Li, Shanmin Pang, Jun Wang, Lei Chen
The complementary graph regularizer investigates the specific information of multiple views.
1 code implementation • 18 Feb 2020 • Jihua Zhu, Jing Zhang, Huimin Lu, Zhongyu Li
Registration of multi-view point sets is a prerequisite for 3D model reconstruction.
no code implementations • 1 Feb 2020 • Xinsheng Wang, Shanmin Pang, Jihua Zhu
In the generalized zero-shot learning, synthesizing unseen data with generative models has been the most popular method to address the imbalance of training data between seen and unseen classes.
no code implementations • 30 Jun 2019 • Xinsheng Wang, Shanmin Pang, Jihua Zhu, Zhongyu Li, Zhiqiang Tian, Yaochen Li
The other is to optimize the visual feature structure in an intermediate embedding space, and in this method we successfully devise a multilayer perceptron framework based algorithm that is able to learn the common intermediate embedding space and meanwhile to make the visual data structure more distinctive.
1 code implementation • 19 Jun 2019 • Qinghai Zheng, Jihua Zhu, Zhiqiang Tian, Zhongyu Li, Shanmin Pang, Xiuyi Jia
Multi-view clustering is an important and fundamental problem.
1 code implementation • 30 Jan 2019 • Qinghai Zheng, Jihua Zhu, Zhongyu Li, Shanmin Pang, Jun Wang, Yaochen Li
To this end, this paper proposes a novel multi-view subspace clustering approach dubbed Feature Concatenation Multi-view Subspace Clustering (FCMSC), which boosts the clustering performance by exploring the consensus information of multi-view data.
no code implementations • 24 Nov 2018 • Yaochen Li, Yuehu Liu, Jihua Zhu, Shiqi Ma, Zhenning Niu, Rui Guo
The data fidelity term in the MRF's energy function is jointly computed according to the superpixel features of color, texture and location.
1 code implementation • 22 May 2018 • Shanmin Pang, Jin Ma, Jianru Xue, Jihua Zhu, Vicente Ordonez
We show that by considering each deep feature as a heat source, our unsupervised aggregation method is able to avoid over-representation of \emph{bursty} features.
no code implementations • 21 Apr 2018 • Jihua Zhu, Siyu Xu, Zutao Jiang, Shanmin Pang, Jun Wang, Zhongyu Li
This paper proposes a global approach for the multi-view registration of unordered range scans.
no code implementations • 20 Mar 2018 • Jiaxing Wang, Jihua Zhu, Shanmin Pang, Zhongyu Li, Yaochen Li, Xueming Qian
Aggregating deep convolutional features into a global image vector has attracted sustained attention in image retrieval.
no code implementations • 14 Oct 2017 • Zutao Jiang, Jihua Zhu, Georgios D. Evangelidis, Changqing Zhang, Shanmin Pang, Yaochen Li
Subsequently, the shape comprised by all cluster centroids is used to sequentially estimate the rigid transformation for each point set.
no code implementations • 25 Sep 2017 • Congcong Jin, Jihua Zhu, Yaochen Li, Shanmin Pang, Lei Chen, Jun Wang
Then, it proposes the weighted LRS decomposition, where each block element is assigned with one estimated weight to denote its reliability.
no code implementations • 14 Jun 2017 • Zutao Jiang, Jihua Zhu, Yaochen Li, Zhongyu Li, Huimin Lu
The main idea of this approach is to recover all global motions for map merging from a set of relative motions.
no code implementations • 1 Jun 2017 • Congcong Jin, Jihua Zhu, Yaochen Li, Shaoyi Du, Zhongyu Li, Huimin Lu
For the registration of partially overlapping point clouds, this paper proposes an effective approach based on both the hard and soft assignments.
no code implementations • 28 Apr 2017 • Minmin Xu, Siyu Xu, Jihua Zhu, Yaochen Li, Jun Wang, Huimin Lu
This paper proposes an effective approach for the scaling registration of $m$-D point sets.
no code implementations • 21 Feb 2017 • Rui Guo, Jihua Zhu, Yaochen Li, Dapeng Chen, Zhongyu Li, Yongqin Zhang
With the overlapping percentage available, it views the overlapping percentage as the corresponding weight of each scan pair and proposes the weight motion averaging algorithm, which can pay more attention to reliable and accurate relative motions.