1 code implementation • 17 Dec 2024 • Sheng Yin, Xianghe Pang, Yuanzhuo Ding, Menglan Chen, Yutong Bi, Yichen Xiong, Wenhao Huang, Zhen Xiang, Jing Shao, Siheng Chen
With the integration of large language models (LLMs), embodied agents have strong capabilities to execute complicated instructions in natural language, paving a way for the potential deployment of embodied robots.
no code implementations • 11 Dec 2024 • Yuxi Wei, Jingbo Wang, Yuwen Du, Dingju Wang, Liang Pan, Chenxin Xu, Yao Feng, Bo Dai, Siheng Chen
Generating realistic and interactive dynamics of traffic participants according to specific instruction is critical for street scene simulation.
no code implementations • 5 Dec 2024 • Yifan Lu, Xuanchi Ren, Jiawei Yang, Tianchang Shen, Zhangjie Wu, Jun Gao, Yue Wang, Siheng Chen, Mike Chen, Sanja Fidler, Jiahui Huang
We present InfiniCube, a scalable method for generating unbounded dynamic 3D driving scenes with high fidelity and controllability.
no code implementations • 2 Dec 2024 • Zhixiang Wang, Guangnan Ye, Xiaosen Wang, Siheng Chen, Zhibo Wang, Xingjun Ma, Yu-Gang Jiang
However, most existing adversarial patch generation methods prioritize attack effectiveness over stealthiness, resulting in patches that are aesthetically unpleasing.
no code implementations • 2 Dec 2024 • Rui Ye, Xianghe Pang, Jingyi Chai, Jiaao Chen, Zhenfei Yin, Zhen Xiang, Xiaowen Dong, Jing Shao, Siheng Chen
However, the unchecked adoption of LLMs poses significant risks to the integrity of the peer review system.
no code implementations • 25 Nov 2024 • Yuchen Xia, Quan Yuan, Guiyang Luo, Xiaoyuan Fu, Yang Li, Xuanhan Zhu, Tianyou Luo, Siheng Chen, Jinglin Li
Collaborative perception in autonomous driving significantly enhances the perception capabilities of individual agents.
1 code implementation • 13 Nov 2024 • Xun Huang, Jinlong Wang, Qiming Xia, Siheng Chen, Bisheng Yang, Xin Li, Cheng Wang, Chenglu Wen
To achieve weather-robust detection, we additionally propose a Multi-modal Denoising Diffusion (MDD) module in our fusion pipeline.
no code implementations • 22 Oct 2024 • Yue Hu, Yuzhu Cai, Yaxin Du, Xinyu Zhu, Xiangrui Liu, Zijie Yu, Yuchen Hou, Shuo Tang, Siheng Chen
To extend coding capabilities beyond function-level tasks to more challenging software-level development, we further propose rSDE-Bench, a requirement-oriented software development benchmark, which features complex and diverse software requirements along with automatic evaluation of requirement correctness.
1 code implementation • 18 Oct 2024 • Shuo Tang, Xianghe Pang, Zexi Liu, Bohan Tang, Rui Ye, Xiaowen Dong, Yanfeng Wang, Siheng Chen
Post-training is essential for enabling large language models (LLMs) to follow human instructions.
no code implementations • 15 Oct 2024 • Yaxin Du, Rui Ye, Fengting Yuchi, Wanru Zhao, Jingjing Qu, Yanfeng Wang, Siheng Chen
To address this gap, we propose a new framework of federated instruction tuning of LLMs with data quality control (FedDQC), which measures data quality to facilitate the subsequent filtering and hierarchical training processes.
1 code implementation • 8 Oct 2024 • Wenhao Wang, Xiaoyu Liang, Rui Ye, Jingyi Chai, Siheng Chen, Yanfeng Wang
The success of large language models (LLMs) facilitate many parties to fine-tune LLMs on their own private data.
no code implementations • 30 Sep 2024 • Zezhou Wang, Yaxin Du, Zhuzhong Qian, Siheng Chen
Federated Domain-specific Instruction Tuning (FedDIT) utilizes limited cross-client private data together with server-side public data for instruction augmentation, ultimately boosting model performance within specific domains.
no code implementations • 11 Sep 2024 • Rui Ye, Rui Ge, Yuchi Fengting, Jingyi Chai, Yanfeng Wang, Siheng Chen
Federated instruction tuning enables multiple clients to collaboratively fine-tune a shared large language model (LLM) that can follow humans' instructions without directly sharing raw data.
no code implementations • 18 Jun 2024 • Zhenyang Ni, Zixing Lei, Yifan Lu, Dingju Wang, Chen Feng, Yanfeng Wang, Siheng Chen
However, existing collaborative perception systems heavily rely on precise localization systems to establish a consistent spatial coordinate system between agents.
no code implementations • 15 Jun 2024 • Rui Ye, Jingyi Chai, Xiangrui Liu, Yaodong Yang, Yanfeng Wang, Siheng Chen
Federated learning (FL) enables multiple parties to collaboratively fine-tune an large language model (LLM) without the need of direct data sharing.
no code implementations • 12 Jun 2024 • Peizhi Niu, Chao Pan, Siheng Chen, Olgica Milenkovic
Graph neural networks (GNNs) have become instrumental in diverse real-world applications, offering powerful graph learning capabilities for tasks such as social networks and medical data analysis.
2 code implementations • 7 Jun 2024 • Rui Ye, Rui Ge, Xinyu Zhu, Jingyi Chai, Yaxin Du, Yang Liu, Yanfeng Wang, Siheng Chen
Addressing this, we propose FedLLM-Bench, which involves 8 training methods, 4 training datasets, and 6 evaluation metrics, to offer a comprehensive testbed for the FedLLM community.
1 code implementation • 24 May 2024 • Junkai Xia, Chenxin Xu, Qingyao Xu, Chen Xie, Yanfeng Wang, Siheng Chen
To produce interactive traffic trajectories, we propose a code-to-trajectory decoder with interaction-aware feature aggregation that synergizes vehicle interactions with the environmental map and the vehicle moves.
1 code implementation • CVPR 2024 • Yue Hu, Juntong Peng, Sifei Liu, Junhao Ge, Si Liu, Siheng Chen
It inherently results in a fundamental trade-off between perception ability and communication cost.
1 code implementation • 5 May 2024 • Zixing Lei, Zhenyang Ni, Ruize Han, Shuo Tang, Dingju Wang, Chen Feng, Siheng Chen, Yanfeng Wang
To achieve this spatial-temporal alignment, traditional methods depend on external devices to provide localization and clock signals.
1 code implementation • 18 Apr 2024 • Yuzhu Cai, Sheng Yin, Yuxi Wei, Chenxin Xu, Weibo Mao, Felix Juefei-Xu, Siheng Chen, Yanfeng Wang
The burgeoning landscape of text-to-image models, exemplified by innovations such as Midjourney and DALLE 3, has revolutionized content creation across diverse sectors.
1 code implementation • 15 Apr 2024 • Genjia Liu, Yue Hu, Chenxin Xu, Weibo Mao, Junhao Ge, Zhengxiang Huang, Yifan Lu, Yinda Xu, Junkai Xia, Yafei Wang, Siheng Chen
This effort necessitates two key foundations: a platform capable of generating data to facilitate the training and testing of V2X-AD, and a comprehensive system that integrates full driving-related functionalities with mechanisms for information sharing.
no code implementations • 25 Mar 2024 • Si Liu, Zihan Ding, Jiahui Fu, Hongyu Li, Siheng Chen, Shifeng Zhang, Xu Zhou
The point cluster inherently preserves object information while packing messages, with weak relevance to the collaboration range, and supports explicit structure modeling.
1 code implementation • CVPR 2024 • Yifei Zhang, Hao Zhao, Hongyang Li, Siheng Chen
As such, the core of our method is the stochastic spectral sampling of correspondence graph.
no code implementations • 11 Mar 2024 • Shuo Tang, Rui Ye, Chenxin Xu, Xiaowen Dong, Siheng Chen, Yanfeng Wang
In this paper, we propose DeLAMA, a decentralized multi-agent lifelong collaborative learning algorithm with dynamic collaboration graphs.
no code implementations • 7 Mar 2024 • Wanru Zhao, Yaxin Du, Nicholas Donald Lane, Siheng Chen, Yanfeng Wang
In the current landscape of foundation model training, there is a significant reliance on public domain data, which is nearing exhaustion according to recent research.
2 code implementations • 10 Feb 2024 • Rui Ye, Wenhao Wang, Jingyi Chai, Dihan Li, Zexi Li, Yinda Xu, Yaxin Du, Yanfeng Wang, Siheng Chen
Trained on massive publicly available data, large language models (LLMs) have demonstrated tremendous success across various fields.
1 code implementation • CVPR 2024 • Yuxi Wei, Zi Wang, Yifan Lu, Chenxin Xu, Changxing Liu, Hao Zhao, Siheng Chen, Yanfeng Wang
Furthermore, to unleash the potential of extensive high-quality digital assets, ChatSim employs a novel multi-camera lighting estimation method to achieve scene-consistent assets' rendering.
no code implementations • 8 Feb 2024 • Xianghe Pang, Shuo Tang, Rui Ye, Yuxin Xiong, Bolun Zhang, Yanfeng Wang, Siheng Chen
Drawing from the sociological insight that acknowledging all parties' concerns is a key factor in shaping human values, this paper proposes a novel direction to align LLMs by themselves: social scene simulation.
no code implementations • 8 Feb 2024 • Bohan Tang, Zexi Liu, Keyue Jiang, Siheng Chen, Xiaowen Dong
Hypergraphs are crucial for modelling higher-order interactions in real-world data.
1 code implementation • 25 Jan 2024 • Yifan Lu, Yue Hu, Yiqi Zhong, Dequan Wang, Yanfeng Wang, Siheng Chen
In this paper, we introduce a new open heterogeneous problem: how to accommodate continually emerging new heterogeneous agent types into collaborative perception, while ensuring high perception performance and low integration cost?
1 code implementation • 23 Jan 2024 • Shaoheng Fang, Rui Ye, Wenhao Wang, Zuhong Liu, Yuxiao Wang, Yafei Wang, Siheng Chen, Yanfeng Wang
In this paper, we introduce FedRSU, an innovative federated learning framework for self-supervised scene flow estimation.
no code implementations • 23 Jan 2024 • Yue Hu, Xianghe Pang, Xiaoqi Qin, Yonina C. Eldar, Siheng Chen, Ping Zhang, Wenjun Zhang
Following this strategy, we first formulate a mathematical optimization framework for the perception-communication trade-off and then propose PragComm, a multi-agent collaborative perception system with two key components: i) single-agent detection and tracking and ii) pragmatic collaboration.
1 code implementation • 21 Jan 2024 • Shaoheng Fang, Zuhong Liu, Mingyu Wang, Chenxin Xu, Yiqi Zhong, Siheng Chen
Learning the dense bird's eye view (BEV) motion flow in a self-supervised manner is an emerging research for robotics and autonomous driving.
1 code implementation • 16 Jan 2024 • Kexin Lv, Rui Ye, Xiaolin Huang, Jie Yang, Siheng Chen
Personalized federated learning aims to address data heterogeneity across local clients in federated learning.
1 code implementation • 18 Dec 2023 • Zexi Liu, Bohan Tang, Ziyuan Ye, Xiaowen Dong, Siheng Chen, Yanfeng Wang
Hypergraphs play a pivotal role in the modelling of data featuring higher-order relations involving more than two entities.
1 code implementation • 15 Dec 2023 • Bohan Tang, Siheng Chen, Xiaowen Dong
Hypergraphs are vital in modelling data with higher-order relations containing more than two entities, gaining prominence in machine learning and signal processing.
no code implementations • 10 Dec 2023 • Rui Ye, Xinyu Zhu, Jingyi Chai, Siheng Chen, Yanfeng Wang
In this paper, we propose a novel FL framework termed FedGC, designed to mitigate data heterogeneity issues by diversifying private data with generative content.
no code implementations • 10 Dec 2023 • Rui Ye, Yaxin Du, Zhenyang Ni, Siheng Chen, Yanfeng Wang
FedCOG consists of two key components at the client side: complementary data generation, which generates data extracted from the shared global model to complement the original dataset, and knowledge-distillation-based model training, which distills knowledge from global model to local model based on the generated data to mitigate over-fitting the original heterogeneous dataset.
1 code implementation • 17 Oct 2023 • Yuxi Wei, Juntong Peng, Tong He, Chenxin Xu, Jian Zhang, Shirui Pan, Siheng Chen
To analyze multivariate time series, most previous methods assume regular subsampling of time series, where the interval between adjacent measurements and the number of samples remain unchanged.
no code implementations • 27 Aug 2023 • Bohan Tang, Siheng Chen, Xiaowen Dong
However, existing methods either adopt simple pre-defined rules that fail to precisely capture the distribution of the potential hypergraph structure, or learn a mapping between hypergraph structures and node features but require a large amount of labelled data, i. e., pre-existing hypergraph structures, for training.
1 code implementation • ICCV 2023 • Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Xinchao Wang, Yanfeng Wang
To work with auxiliary tasks, we propose a novel auxiliary-adapted transformer, which can handle incomplete, corrupted motion data and achieve coordinate recovery via capturing spatial-temporal dependencies.
Ranked #4 on Human Pose Forecasting on Human3.6M
1 code implementation • ICCV 2023 • Qingyao Xu, Weibo Mao, Jingze Gong, Chenxin Xu, Siheng Chen, Weidi Xie, Ya zhang, Yanfeng Wang
Multi-person motion prediction is a challenging problem due to the dependency of motion on both individual past movements and interactions with other people.
1 code implementation • 30 May 2023 • Rui Ye, Mingkai Xu, Jianyu Wang, Chenxin Xu, Siheng Chen, Yanfeng Wang
However, based on our empirical observations and theoretical analysis, we find that the dataset size is not optimal and the discrepancy between local and global category distributions could be a beneficial and complementary indicator for determining aggregation weights.
no code implementations • 24 Apr 2023 • Shunli Ren, Zixing Lei, Zi Wang, Mehrdad Dianati, Yafei Wang, Siheng Chen, Wenjun Zhang
To achieve comprehensive recovery, we design a communication-adaptive multi-scale spatial-temporal prediction model to extract multi-scale spatial-temporal features based on V2X communication conditions and capture the most significant information for the prediction of the missing information.
1 code implementation • CVPR 2023 • Yue Hu, Yifan Lu, Runsheng Xu, Weidi Xie, Siheng Chen, Yanfeng Wang
Camera-only 3D detection provides an economical solution with a simple configuration for localizing objects in 3D space compared to LiDAR-based detection systems.
1 code implementation • CVPR 2023 • Weibo Mao, Chenxin Xu, Qi Zhu, Siheng Chen, Yanfeng Wang
The core of the proposed LED is to leverage a trainable leapfrog initializer to directly learn an expressive multi-modal distribution of future trajectories, which skips a large number of denoising steps, significantly accelerating inference speed.
2 code implementations • CVPR 2023 • Chenxin Xu, Robby T. Tan, Yuhong Tan, Siheng Chen, Yu Guang Wang, Xinchao Wang, Yanfeng Wang
In motion prediction tasks, maintaining motion equivariance under Euclidean geometric transformations and invariance of agent interaction is a critical and fundamental principle.
Ranked #1 on Human Pose Forecasting on HARPER
no code implementations • CVPR 2023 • Shaoheng Fang, Zi Wang, Yiqi Zhong, Junhao Ge, Siheng Chen, Yanfeng Wang
Second, a spatial-temporal pyramid transformer is introduced to comprehensively extract multi-scale BEV features and predict future BEV states with the support of spatial-temporal priors.
Ranked #2 on Bird's-Eye View Semantic Segmentation on nuScenes (IoU ped - 224x480 - Vis filter. - 100x100 at 0.5 metric)
1 code implementation • ICCV 2023 • Yiming Li, Qi Fang, Jiamu Bai, Siheng Chen, Felix Juefei-Xu, Chen Feng
This leads to our hypothesize-and-verify framework: perception results with and without collaboration from a random subset of teammates are compared until reaching a consensus.
1 code implementation • 1 Jan 2023 • Zenan Huang, Jun Wen, Siheng Chen, Linchao Zhu, Nenggan Zheng
Domain adaptation methods reduce domain shift typically by learning domain-invariant features.
1 code implementation • 14 Nov 2022 • Yifan Lu, Quanhao Li, Baoan Liu, Mehrdad Dianati, Chen Feng, Siheng Chen, Yanfeng Wang
Collaborative 3D object detection exploits information exchange among multiple agents to enhance accuracy of object detection in presence of sensor impairments such as occlusion.
no code implementations • 3 Nov 2022 • Bohan Tang, Siheng Chen, Xiaowen Dong
Hypergraph structure learning, which aims to learn the hypergraph structures from the observed signals to capture the intrinsic high-order relationships among the entities, becomes crucial when a hypergraph topology is not readily available in the datasets.
no code implementations • 31 Oct 2022 • Enpei Zhang, Shuo Tang, Xiaowen Dong, Siheng Chen, Yanfeng Wang
To fill this gap, we propose a distributed multi-agent learning model inspired by human collaboration, in which the agents can autonomously detect suitable collaborators and refer to collaborators' model for better performance.
no code implementations • 18 Oct 2022 • Yangheng Zhao, Jun Wang, Xiaolong Li, Yue Hu, Ce Zhang, Yanfeng Wang, Siheng Chen
Instead of learning a single prototype for each class, in this paper, we propose to use an adaptive number of prototypes to dynamically describe the different point patterns within a semantic class.
Ranked #17 on 3D Semantic Segmentation on SemanticKITTI
no code implementations • 14 Oct 2022 • Rui Ye, Zhenyang Ni, Chenxin Xu, Jianyu Wang, Siheng Chen, Yonina C. Eldar
This method attempts to mitigate the negative effects of data heterogeneity in FL by aligning each client's feature space.
1 code implementation • 26 Sep 2022 • Yue Hu, Shaoheng Fang, Zixing Lei, Yiqi Zhong, Siheng Chen
Where2comm has two distinct advantages: i) it considers pragmatic compression and uses less communication to achieve higher perception performance by focusing on perceptually critical areas; and ii) it can handle varying communication bandwidth by dynamically adjusting spatial areas involved in communication.
Ranked #1 on Monocular 3D Object Detection on CoPerception-UAVs
no code implementations • 22 Aug 2022 • Shunli Ren, Siheng Chen, Wenjun Zhang
Perception is one of the crucial module of the autonomous driving system, which has made great progress recently.
1 code implementation • 8 Aug 2022 • Yue Hu, Siheng Chen, Xu Chen, Ya zhang, Xiao Gu
Visual relationship detection aims to detect the interactions between objects in an image; however, this task suffers from combinatorial explosion due to the variety of objects and interactions.
no code implementations • 8 Aug 2022 • Yue Hu, Shaoheng Fang, Weidi Xie, Siheng Chen
To fill the gap, this work proposes a dual-view detection system named DVDET to achieve aerial monocular object detection in both the 2D image space and the 3D physical space.
1 code implementation • 7 Aug 2022 • Tongyi Luo, Jia Xiao, Chuncao Zhang, Siheng Chen, Yuan Tian, Guangjun Yu, Kang Dang, Xiaowei Ding
Although general movements assessment(GMA) has shown promising results in early CP detection, it is laborious.
1 code implementation • 31 Jul 2022 • Maosen Li, Siheng Chen, Zijing Zhang, Lingxi Xie, Qi Tian, Ya zhang
To address the first issue, we propose adaptive graph scattering, which leverages multiple trainable band-pass graph filters to decompose pose features into richer graph spectrum bands.
no code implementations • 20 Jul 2022 • Yiqi Zhong, Zhenyang Ni, Siheng Chen, Ulrich Neumann
In this work, we re-introduce this information as a new type of input data for trajectory forecasting systems: the local behavior data, which we conceptualize as a collection of location-specific historical trajectories.
1 code implementation • 18 Jul 2022 • Zixing Lei, Shunli Ren, Yue Hu, Wenjun Zhang, Siheng Chen
Collaborative perception has recently shown great potential to improve perception capabilities over single-agent perception.
no code implementations • 11 Jul 2022 • Bohan Tang, Yiqi Zhong, Chenxin Xu, Wei-Tao Wu, Ulrich Neumann, Yanfeng Wang, Ya zhang, Siheng Chen
Further, we apply the proposed framework to current SOTA multi-agent multi-modal forecasting systems as a plugin module, which enables the SOTA systems to 1) estimate the uncertainty in the multi-agent multi-modal trajectory forecasting task; 2) rank the multiple predictions and select the optimal one based on the estimated uncertainty.
1 code implementation • 27 Jun 2022 • Chenxin Xu, Yuxi Wei, Bohan Tang, Sheng Yin, Ya zhang, Siheng Chen
Demystifying the interactions among multiple agents from their past trajectories is fundamental to precise and interpretable trajectory prediction.
no code implementations • 31 May 2022 • Mingxing Xu, Chenglin Li, Wenrui Dai, Siheng Chen, Junni Zou, Pascal Frossard, Hongkai Xiong
Specifically, adaptive spherical wavelets are learned with a lifting structure that consists of trainable lifting operators (i. e., update and predict operators).
1 code implementation • CVPR 2022 • Chenxin Xu, Maosen Li, Zhenyang Ni, Ya zhang, Siheng Chen
From the aspect of interaction capturing, we propose a trainable multiscale hypergraph to capture both pair-wise and group-wise interactions at multiple group sizes.
1 code implementation • CVPR 2022 • Chenxin Xu, Weibo Mao, Wenjun Zhang, Siheng Chen
However, in this way, the model parameters come from all seen instances, which means a huge amount of irrelevant seen instances might also involve in predicting the current situation, disturbing the performance.
Ranked #7 on Trajectory Prediction on Stanford Drone
no code implementations • 17 Feb 2022 • Yiming Li, Dekun Ma, Ziyan An, Zixun Wang, Yiqi Zhong, Siheng Chen, Chen Feng
Vehicle-to-everything (V2X) communication techniques enable the collaboration between vehicles and many other entities in the neighboring environment, which could fundamentally improve the perception system for autonomous driving.
2 code implementations • 17 Feb 2022 • Ming Jin, Yu Zheng, Yuan-Fang Li, Siheng Chen, Bin Yang, Shirui Pan
Multivariate time series forecasting has long received significant attention in real-world applications, such as energy consumption and traffic prediction.
no code implementations • CVPR 2022 • Yixuan Huang, Xiaoyun Zhang, Yu Fu, Siheng Chen, Ya zhang, Yan-Feng Wang, Dazhi He
Those methods conduct the super-resolution task of the input low-resolution(LR) image and the texture transfer task from the reference image together in one module, easily introducing the interference between LR and reference features.
1 code implementation • CVPR 2022 • Qi Yang, Yipeng Liu, Siheng Chen, Yiling Xu, Jun Sun
We present a novel no-reference quality assessment metric, the image transferred point cloud quality assessment (IT-PCQA), for 3D point clouds.
Ranked #4 on Point Cloud Quality Assessment on WPC
2 code implementations • NeurIPS 2021 • Yiming Li, Shunli Ren, Pengxiang Wu, Siheng Chen, Chen Feng, Wenjun Zhang
Our approach is validated on V2X-Sim 1. 0, a large-scale multi-agent perception dataset that we synthesized using CARLA and SUMO co-simulation.
Ranked #3 on 3D Object Detection on V2X-SIM
no code implementations • 30 Oct 2021 • Jian Du, Song Li, Xiangyi Chen, Siheng Chen, Mingyi Hong
The equivalent privacy costs controlled by maintaining the same gradient clipping thresholds and noise powers in each step result in unstable updates and a lower model accuracy when compared to the non-DP counterpart.
no code implementations • NeurIPS 2021 • Bohan Tang, Yiqi Zhong, Ulrich Neumann, Gang Wang, Ya zhang, Siheng Chen
2) The results of trajectory forecasting benchmarks demonstrate that the CU-based framework steadily helps SOTA systems improve their performances.
no code implementations • 23 Oct 2021 • Zida Cheng, Siheng Chen, Ya zhang
Spatio-temporal graph signal analysis has a significant impact on a wide range of applications, including hand/body pose action recognition.
no code implementations • 20 Oct 2021 • Zhen Xiang, David J. Miller, Siheng Chen, Xi Li, George Kesidis
Backdoor attacks (BA) are an emerging threat to deep neural network classifiers.
1 code implementation • NeurIPS 2021 • Xingyue Pu, Tianyue Cao, Xiaoyun Zhang, Xiaowen Dong, Siheng Chen
The model is trained in an end-to-end fashion with pairs of node data and graph samples.
1 code implementation • 16 Oct 2021 • Xin Yu, Jeroen van Baar, Siheng Chen
We use a coarse graph, derived from a dense graph, to estimate the human's 3D pose, and the dense graph to estimate the 3D shape.
Ranked #269 on 3D Human Pose Estimation on Human3.6M
no code implementations • 24 Sep 2021 • Jinxiang Liu, Yangheng Zhao, Siheng Chen, Ya zhang
To leverage the human body shape prior, LPNet exploits the topological information of the body mesh to learn an expressive visual representation for the target person in the 3D mesh space.
no code implementations • 25 Aug 2021 • Maosen Li, Siheng Chen, Yangheng Zhao, Ya zhang, Yanfeng Wang, Qi Tian
The core of MST-GNN is a multiscale spatio-temporal graph that explicitly models the relations in motions at various spatial and temporal scales.
no code implementations • ICCV 2021 • Tianyue Cao, Lianyu Du, Xiaoyun Zhang, Siheng Chen, Ya zhang, Yan-Feng Wang
To handle overlapping category transfer, we propose a double-supervision mean teacher to gather common category information and bridge the domain gap between two datasets.
no code implementations • 16 Jul 2021 • Zida Cheng, Siheng Chen, Ya zhang
Experiments are conducted on FPHA and HO-3D datasets.
no code implementations • 2 Jul 2021 • Maosen Li, Siheng Chen, Yanning Shen, Genjia Liu, Ivor W. Tsang, Ya zhang
This paper considers predicting future statuses of multiple agents in an online fashion by exploiting dynamic interactions in the system.
no code implementations • 6 Apr 2021 • Chen Ju, Peisen Zhao, Siheng Chen, Ya zhang, Xiaoyun Zhang, Qi Tian
To solve this issue, we introduce an adaptive mutual supervision framework (AMS) with two branches, where the base branch adopts CAS to localize the most discriminative action regions, while the supplementary branch localizes the less discriminative action regions through a novel adaptive sampler.
Ranked #7 on Weakly Supervised Action Localization on THUMOS14
Weakly Supervised Action Localization Weakly-supervised Temporal Action Localization +1
1 code implementation • 4 Mar 2021 • Qi Yang, Yujie Zhang, Siheng Chen, Yiling Xu, Jun Sun, Zhan Ma
In this paper, we propose a new distortion quantification method for point clouds, the multiscale potential energy discrepancy (MPED).
no code implementations • ICCV 2021 • Chen Ju, Peisen Zhao, Siheng Chen, Ya zhang, Yanfeng Wang, Qi Tian
Single-frame temporal action localization (STAL) aims to localize actions in untrimmed videos with only one timestamp annotation for each action instance.
1 code implementation • 17 Dec 2020 • Chenxin Xu, Siheng Chen, Maosen Li, Ya zhang
To handle the decomposition ambiguity in the teacher network, we propose a cycle-consistent architecture promoting a 3D rotation-invariant property to train the teacher network.
no code implementations • ICLR 2021 • Chao Pan, Siheng Chen, Antonio Ortega
Although spatio-temporal graph neural networks have achieved great empirical success in handling multiple correlated time series, they may be impractical in some real-world scenarios due to a lack of sufficient high-quality training data.
no code implementations • 3 Nov 2020 • Siheng Chen, Maosen Li, Ya zhang
Compared to previous analytical sampling and recovery, the proposed methods are able to flexibly learn a variety of graph signal models from data by leveraging the learning ability of neural networks; compared to previous neural-network-based sampling and recovery, the proposed methods are designed through exploiting specific graph properties and provide interpretability.
3 code implementations • 3 Nov 2020 • Xu Chen, Siheng Chen, Jiangchao Yao, Huangjie Zheng, Ya zhang, Ivor W Tsang
Thereby, designing a new GNN for these graphs is a burning issue to the graph learning community.
2 code implementations • NeurIPS 2020 • Maosen Li, Siheng Chen, Ya zhang, Ivor W. Tsang
Based on trainable hierarchical representations of a graph, GXN enables the interchange of intermediate features across scales to promote information flow.
no code implementations • 16 Jul 2020 • Chenyang Li, Xu Chen, Ya zhang, Siheng Chen, Dan Lv, Yan-Feng Wang
Most existing methods focus on preserving the first-order proximity between entities in the KG.
no code implementations • 16 Jul 2020 • Jingchao Su, Xu Chen, Ya zhang, Siheng Chen, Dan Lv, Chenyang Li
The two-level alignment acts as two different constraints on different relations of the shared entities and facilitates better knowledge transfer for relational learning on multiple bipartite graphs.
no code implementations • 17 Jun 2020 • Takuya Fujihashi, Toshiaki Koike-Akino, Siheng Chen, Takashi Watanabe
To prevent the cliff effect subject to channel quality fluctuation, we have proposed soft point cloud delivery called HoloCast.
no code implementations • 1 Jun 2020 • Siheng Chen, Yonina C. Eldar, Lingxiao Zhao
We unroll an iterative denoising algorithm by mapping each iteration into a single network layer where the feed-forward process is equivalent to iteratively denoising graph signals.
no code implementations • ICLR 2020 • Vassilis N. Ioannidis, Siheng Chen, Georgios B. Giannakis
Graph convolutional networks (GCNs) have achieved remarkable performance in a variety of network science learning tasks.
1 code implementation • 17 Mar 2020 • Maosen Li, Siheng Chen, Yangheng Zhao, Ya zhang, Yan-Feng Wang, Qi Tian
The core idea of DMGNN is to use a multiscale graph to comprehensively model the internal relations of a human body for motion feature learning.
2 code implementations • CVPR 2020 • Pengxiang Wu, Siheng Chen, Dimitris Metaxas
The backbone of MotionNet is a novel spatio-temporal pyramid network, which extracts deep spatial and temporal features in a hierarchical fashion.
1 code implementation • CVPR 2020 • Yue Hu, Siheng Chen, Ya zhang, Xiao Gu
Motion prediction is essential and challenging for autonomous vehicles and social robots.
no code implementations • 1 Mar 2020 • Siheng Chen, Baoan Liu, Chen Feng, Carlos Vallespi-Gonzalez, Carl Wellington
We present a review of 3D point cloud processing and learning for autonomous driving.
no code implementations • 6 Feb 2020 • Jingxiao Liu, Bingqing Chen, Siheng Chen, Mario Berges, Jacobo Bielak, HaeYoung Noh
We introduce a physics-guided signal processing approach to extract a damage-sensitive and domain-invariant (DS & DI) feature from acceleration response data of a vehicle traveling over a bridge to assess bridge health.
no code implementations • 27 Jan 2020 • Vassilis N. Ioannidis, Siheng Chen, Georgios B. Giannakis
The resultant pruning algorithm is guided by a graph-spectrum-inspired criterion, and retains informative scattering features on-the-fly while bypassing the exponential complexity associated with GSTs.
no code implementations • 5 Oct 2019 • Maosen Li, Siheng Chen, Xu Chen, Ya zhang, Yan-Feng Wang, Qi Tian
For the backbone, we propose multi-branch multi-scale graph convolution networks to extract spatial and temporal features.
Ranked #48 on Skeleton Based Action Recognition on NTU RGB+D
3 code implementations • 23 Jul 2019 • Xu Chen, Siheng Chen, Huangjie Zheng, Jiangchao Yao, Kenan Cui, Ya zhang, Ivor W. Tsang
NANG learns a unifying latent representation which is shared by both node attributes and graph structures and can be translated to different modalities.
no code implementations • 26 Jun 2019 • Siheng Chen, Sufeng. Niu, Tian Lan, Baoan Liu
We present a novel graph-neural-network-based system to effectively represent large-scale 3D point clouds with the applications to autonomous driving.
no code implementations • 11 May 2019 • Siheng Chen, Chaojing Duan, Yaoqing Yang, Duanshun Li, Chen Feng, Dong Tian
The experimental results show that (1) the proposed networks outperform the state-of-the-art methods in various tasks; (2) a graph topology can be inferred as auxiliary information without specific supervision on graph topology inference; and (3) graph filtering refines the reconstruction, leading to better performances.
no code implementations • 30 Apr 2019 • Chuan Wen, Jie Chang, Ya zhang, Siheng Chen, Yan-Feng Wang, Mei Han, Qi Tian
Automatic character generation is an appealing solution for new typeface design, especially for Chinese typefaces including over 3700 most commonly-used characters.
1 code implementation • CVPR 2019 • Maosen Li, Siheng Chen, Xu Chen, Ya zhang, Yan-Feng Wang, Qi Tian
We validate AS-GCN in action recognition using two skeleton data sets, NTU-RGB+D and Kinetics.
no code implementations • 9 Apr 2019 • Chaojing Duan, Siheng Chen, Jelena Kovacevic
NPD algorithm uses a neural network to estimate reference planes for points in noisy point clouds.
1 code implementation • 8 Jun 2017 • Sufeng. Niu, Siheng Chen, Hanyu Guo, Colin Targonski, Melissa C. Smith, Jelena Kovačević
GVIN emulates the value iteration algorithm by using a novel graph convolution operator, which enables GVIN to learn and plan on irregular spatial graphs.
no code implementations • 11 Feb 2017 • Siheng Chen, Dong Tian, Chen Feng, Anthony Vetro, Jelena Kovačević
We use a general feature-extraction operator to represent application-dependent features and propose a general reconstruction error to evaluate the quality of resampling.
no code implementations • 16 Dec 2015 • Siheng Chen, Rohan Varma, Aarti Singh, Jelena Kovačević
For each class, we provide an explicit definition of the graph signals and construct a corresponding graph dictionary with desirable properties.
no code implementations • 21 Jul 2015 • Siheng Chen, Rohan Varma, Aarti Singh, Jelena Kovačević
In this paper, we consider a statistical problem of learning a linear model from noisy samples.
no code implementations • 21 Apr 2015 • Siheng Chen, Rohan Varma, Aarti Singh, Jelena Kovačević
We study signal recovery on graphs based on two sampling strategies: random sampling and experimentally designed sampling.
no code implementations • 26 Nov 2014 • Siheng Chen, Aliaksei Sandryhaila, José M. F. Moura, Jelena Kovačević
We consider the problem of signal recovery on graphs as graphs model data with complex structure as signals on a graph.