no code implementations • 2 Dec 2024 • Chunlin Yu, Hanqing Wang, Ye Shi, Haoyang Luo, Sibei Yang, Jingyi Yu, Jingya Wang
In this paper, we introduce the Sequential 3D Affordance Reasoning task, which extends the traditional paradigm by reasoning from cumbersome user intentions and then decomposing them into a series of segmentation maps.
no code implementations • 2 Dec 2024 • Bikang Pan, Qun Li, Xiaoying Tang, Wei Huang, Zhen Fang, Feng Liu, Jingya Wang, Jingyi Yu, Ye Shi
This matrix effectively partitions datasets into clean and noisy subsets, allowing for the application of cross-entropy loss to the clean subset and MAE loss to the noisy subset.
no code implementations • 30 Oct 2024 • Haixiang Sun, Ye Shi
In this paper, we utilize the Neural Collapse ($\mathcal{NC}$) as a tool to systematically analyze the representation of DEQ under both balanced and imbalanced conditions.
1 code implementation • 29 Sep 2024 • Bikang Pan, Wei Huang, Ye Shi
Integrating pretrained vision-language foundation models like CLIP into federated learning has attracted significant attention for enhancing generalization across diverse tasks.
1 code implementation • 30 Jul 2024 • Chaofan Huo, Ye Shi, Jingya Wang
Learning the prior knowledge of the 3D human-object spatial relation is crucial for reconstructing human-object interaction from images and understanding how humans interact with objects in 3D space.
1 code implementation • 30 Jul 2024 • Chaofan Huo, Ye Shi, Yuexin Ma, Lan Xu, Jingyi Yu, Jingya Wang
Modeling and capturing the 3D spatial arrangement of the human and the object is the key to perceiving 3D human-object interaction from monocular images.
1 code implementation • 2 Jul 2024 • Ye Shi, C. S. George Lee
Irregular distribution in latent space causes posterior collapse, misalignment between posterior and prior, and ill-sampling problem in Variational Autoencoders (VAEs).
no code implementations • 25 May 2024 • Shutong Ding, Ke Hu, Zhenhao Zhang, Kan Ren, Weinan Zhang, Jingyi Yu, Jingya Wang, Ye Shi
To overcome this, we propose a novel model-free diffusion-based online RL algorithm, Q-weighted Variational Policy Optimization (QVPO).
1 code implementation • 16 May 2024 • Tianyu Cui, Hongxia Li, Jingya Wang, Ye Shi
Federated Prompt Learning (FPL) incorporates large pre-trained Vision-Language models (VLM) into federated learning through prompt tuning.
1 code implementation • CVPR 2024 • Jiangnan Tang, Jingya Wang, Kaiyang Ji, Lan Xu, Jingyi Yu, Ye Shi
One of the biggest challenges to this task is the one-to-many mapping from sparse observations to dense full-body motions, which endowed inherent ambiguities.
no code implementations • 30 Mar 2024 • Juze Zhang, Jingyan Zhang, Zining Song, Zhanhe Shi, Chengfeng Zhao, Ye Shi, Jingyi Yu, Lan Xu, Jingya Wang
Humans naturally interact with both others and the surrounding multiple objects, engaging in various social activities.
no code implementations • 25 Mar 2024 • Qi Li, Ye Shi, Yuning Jiang, Yuanming Shi, Haoyu Wang, H. Vincent Poor
The distinctive contribution of this paper lies in its holistic approach to both static and dynamic uncertainties in smart grids.
no code implementations • 24 Mar 2024 • Jie Tian, Ran Ji, Lingxiao Yang, Yuexin Ma, Lan Xu, Jingyi Yu, Ye Shi, Jingya Wang
Gaze plays a crucial role in revealing human attention and intention, particularly in hand-object interaction scenarios, where it guides and synchronizes complex tasks that require precise coordination between the brain, hand, and object.
no code implementations • 17 Mar 2024 • Qianyang Wu, Ye Shi, Xiaoshui Huang, Jingyi Yu, Lan Xu, Jingya Wang
This paper addresses new methodologies to deal with the challenging task of generating dynamic Human-Object Interactions from textual descriptions (Text2HOI).
1 code implementation • CVPR 2024 • Hongxia Li, Wei Huang, Jingya Wang, Ye Shi
Specifically, for each client, we learn a global prompt to extract consensus knowledge among clients, and a local prompt to capture client-specific category characteristics.
no code implementations • 28 Feb 2024 • Bin Li, Ye Shi, Qian Yu, Jingya Wang
This paper introduces ProtoOT, a novel Optimal Transport formulation explicitly tailored for UCIR, which integrates intra-domain feature representation learning and cross-domain alignment into a unified framework.
1 code implementation • 5 Feb 2024 • Lingxiao Yang, Shutong Ding, Yifan Cai, Jingyi Yu, Jingya Wang, Ye Shi
We theoretically show the existence of manifold deviation by establishing a certain lower bound for the estimation error of the loss guidance.
no code implementations • CVPR 2024 • Juze Zhang, Jingyan Zhang, Zining Song, Zhanhe Shi, Chengfeng Zhao, Ye Shi, Jingyi Yu, Lan Xu, Jingya Wang
Humans naturally interact with both others and the surrounding multiple objects engaging in various social activities.
1 code implementation • 30 Dec 2023 • Yilan Dong, Chunlin Yu, Ruiyang Ha, Ye Shi, Yuexin Ma, Lan Xu, Yanwei Fu, Jingya Wang
Existing gait recognition benchmarks mostly include minor clothing variations in the laboratory environments, but lack persistent changes in appearance over time and space.
1 code implementation • NeurIPS 2023 • Zhongyi Cai, Ye Shi, Wei Huang, Jingya Wang
Specifically, the online model learns general knowledge that is shared among all clients, while the offline model is trained locally to learn the specialized knowledge of each individual client.
1 code implementation • NeurIPS 2023 • Chunlin Yu, Ye Shi, Jingya Wang
Previous endeavors in self-supervised learning have enlightened the research of deep clustering from an instance discrimination perspective.
1 code implementation • NeurIPS 2023 • Wanxing Chang, Ye Shi, Jingya Wang
However, the current approaches rely heavily on the model's predictions and evaluate each sample independently without considering either the global and local structure of the sample distribution.
1 code implementation • NeurIPS 2023 • Shutong Ding, Tianyu Cui, Jingya Wang, Ye Shi
Deep Equilibrium Models (DEQs) and Neural Ordinary Differential Equations (Neural ODEs) are two branches of implicit models that have achieved remarkable success owing to their superior performance and low memory consumption.
1 code implementation • NeurIPS 2023 • Shutong Ding, Jingya Wang, Yali Du, Ye Shi
To the best of our knowledge, RPO is the first attempt that introduces GRG to RL as a way of efficiently handling both equality and inequality hard constraints.
1 code implementation • 2 Feb 2023 • Juze Zhang, Ye Shi, Yuexin Ma, Lan Xu, Jingyi Yu, Jingya Wang
This paper presents an inverse kinematic optimization layer (IKOL) for 3D human pose and shape estimation that leverages the strength of both optimization- and regression-based methods within an end-to-end framework.
Ranked #30 on 3D Human Pose Estimation on 3DPW
no code implementations • CVPR 2023 • Juze Zhang, Haimin Luo, Hongdi Yang, Xinru Xu, Qianyang Wu, Ye Shi, Jingyi Yu, Lan Xu, Jingya Wang
We construct a dense multi-view dome to acquire a complex human object interaction dataset, named HODome, that consists of $\sim$75M frames on 10 subjects interacting with 23 objects.
1 code implementation • 29 Nov 2022 • Chunlin Yu, Ye Shi, Zimo Liu, Shenghua Gao, Jingya Wang
Lifelong person re-identification (LReID) is in significant demand for real-world development as a large amount of ReID data is captured from diverse locations over time and cannot be accessed at once inherently.
2 code implementations • ICCV 2023 • Yu-Tong Cao, Ye Shi, Baosheng Yu, Jingya Wang, DaCheng Tao
In this paper, we propose a federated active learning paradigm to efficiently learn a global model with limited annotation budget while protecting data privacy in a decentralized learning way.
1 code implementation • 3 Nov 2022 • Hongxia Li, Zhongyi Cai, Jingya Wang, Jiangnan Tang, Weiping Ding, Chin-Teng Lin, Ye Shi
Instead of using a vanilla personalization mechanism that maintains personalized self-attention layers of each client locally, we develop a learn-to-personalize mechanism to further encourage the cooperation among clients and to increase the scablability and generalization of FedTP.
1 code implementation • 31 Oct 2022 • Wanxing Chang, Ye Shi, Hoang Duong Tuan, Jingya Wang
Notably, UniOT is the first method with the capability to automatically discover and recognize private categories in the target domain for UniDA.
Ranked #3 on Universal Domain Adaptation on Office-Home
1 code implementation • 26 Oct 2022 • Leijie Zhang, Ye Shi, Yu-Cheng Chang, Chin-Teng Lin
ERL is inspired by the theory of biological evolution; it encourages rule variations while activating superior rules and deactivating inferior rules for local clients with non-IID data.
no code implementations • 4 Oct 2022 • Zixuan Zhang, Yuning Jiang, Yuanming Shi, Ye Shi, Wei Chen
This paper develops an optimal EV charging/discharging control strategy for different EV users under dynamic environments to maximize EV users' benefits.
1 code implementation • 3 Oct 2022 • Haixiang Sun, Ye Shi, Jingya Wang, Hoang Duong Tuan, H. Vincent Poor, DaCheng Tao
In this paper, we developed a new framework, named Alternating Differentiation (Alt-Diff), that differentiates optimization problems (here, specifically in the form of convex optimization problems with polyhedral constraints) in a fast and recursive way.
1 code implementation • 18 Sep 2022 • Ye Shi, Leijie Zhang, Zehong Cao, M. Tanveer, Chin-Teng Lin
In this work, we proposed a distributed Fuzzy C-means (DFCM) method and a distributed interpolation consistency regularization (DICR) built on the well-known alternating direction method of multipliers to respectively locate parameters in antecedent and consequent components of DSFR.
1 code implementation • 18 Sep 2022 • Leijie Zhang, Ye Shi, Yu-Cheng Chang, Chin-Teng Lin
The network is trained with a two-stage optimization algorithm, and the parameters at low levels of the hierarchy are learned with a scheme based on the well-known alternating direction method of multipliers, which does not reveal local data to other agents.
no code implementations • 16 Jul 2022 • Juze Zhang, Jingya Wang, Ye Shi, Fei Gao, Lan Xu, Jingyi Yu
This method first uses 2. 5D pose and geometry information to infer camera-centric root depths in a forward pass, and then exploits the root depths to further improve representation learning of 2. 5D pose estimation in a backward pass.
1 code implementation • 24 Jun 2022 • Lijun Sun, Yu-Cheng Chang, Chao Lyu, Ye Shi, Yuhui Shi, Chin-Teng Lin
The proposed distributed algorithm: fuzzy self-organizing cooperative coevolution (FSC2) is then leveraged to resolve the three challenges in multi-target SOP: distributed self-organizing search (SOS), distributed task allocation, and distributed single-target pursuit.
no code implementations • 11 Jul 2021 • Ye Shi, Shao-Yuan Li, Sheng-Jun Huang
Traditional supervised learning requires ground truth labels for the training data, whose collection can be difficult in many cases.