Search Results for author: Chengzhong Xu

Found 44 papers, 12 papers with code

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

3 code implementations4 Nov 2024 Xingwu Sun, Yanfeng Chen, Yiqing Huang, Ruobing Xie, Jiaqi Zhu, Kai Zhang, Shuaipeng Li, Zhen Yang, Jonny Han, Xiaobo Shu, Jiahao Bu, Zhongzhi Chen, Xuemeng Huang, Fengzong Lian, Saiyong Yang, Jianfeng Yan, Yuyuan Zeng, Xiaoqin Ren, Chao Yu, Lulu Wu, Yue Mao, Jun Xia, Tao Yang, Suncong Zheng, Kan Wu, Dian Jiao, Jinbao Xue, Xipeng Zhang, Decheng Wu, Kai Liu, Dengpeng Wu, Guanghui Xu, Shaohua Chen, Shuang Chen, Xiao Feng, Yigeng Hong, Junqiang Zheng, Chengcheng Xu, Zongwei Li, Xiong Kuang, Jianglu Hu, Yiqi Chen, Yuchi Deng, Guiyang Li, Ao Liu, Chenchen Zhang, Shihui Hu, Zilong Zhao, Zifan Wu, Yao Ding, Weichao Wang, Han Liu, Roberts Wang, Hao Fei, Peijie Yu, Ze Zhao, Xun Cao, Hai Wang, Fusheng Xiang, Mengyuan Huang, Zhiyuan Xiong, Bin Hu, Xuebin Hou, Lei Jiang, Jianqiang Ma, Jiajia Wu, Yaping Deng, Yi Shen, Qian Wang, Weijie Liu, Jie Liu, Meng Chen, Liang Dong, Weiwen Jia, Hu Chen, Feifei Liu, Rui Yuan, Huilin Xu, Zhenxiang Yan, Tengfei Cao, Zhichao Hu, Xinhua Feng, Dong Du, TingHao Yu, Yangyu Tao, Feng Zhang, Jianchen Zhu, Chengzhong Xu, Xirui Li, Chong Zha, Wen Ouyang, Yinben Xia, Xiang Li, Zekun He, Rongpeng Chen, Jiawei Song, Ruibin Chen, Fan Jiang, Chongqing Zhao, Bo wang, Hao Gong, Rong Gan, Winston Hu, Zhanhui Kang, Yong Yang, Yuhong Liu, Di Wang, Jie Jiang

In this paper, we introduce Hunyuan-Large, which is currently the largest open-source Transformer-based mixture of experts model, with a total of 389 billion parameters and 52 billion activation parameters, capable of handling up to 256K tokens.

Logical Reasoning Mathematical Problem-Solving

Breaking the Memory Wall for Heterogeneous Federated Learning via Model Splitting

no code implementations12 Oct 2024 Chunlin Tian, Li Li, Kahou Tam, Yebo Wu, Chengzhong Xu

In this paper, we propose SmartSplit, a framework that effectively reduces the memory footprint on the device side while guaranteeing the training progress and model accuracy for heterogeneous FL through model splitting. Towards this end, SmartSplit employs a hierarchical structure to adaptively guide the overall training process.

Federated Learning

Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling

no code implementations2 Sep 2024 Haicheng Liao, Yongkang Li, Chengyue Wang, Songning Lai, Zhenning Li, Zilin Bian, Jaeyoung Lee, Zhiyong Cui, Guohui Zhang, Chengzhong Xu

The primary goal of traffic accident anticipation is to foresee potential accidents in real time using dashcam videos, a task that is pivotal for enhancing the safety and reliability of autonomous driving technologies.

Accident Anticipation Autonomous Driving +1

CRASH: Crash Recognition and Anticipation System Harnessing with Context-Aware and Temporal Focus Attentions

no code implementations25 Jul 2024 Haicheng Liao, Haoyu Sun, Huanming Shen, Chengyue Wang, Kahou Tam, Chunlin Tian, Li Li, Chengzhong Xu, Zhenning Li

To capture a wider range of visual cues, we further propose a multi-layer fusion that dynamically computes the temporal dependencies between different scenes and iteratively updates the correlations between different visual features for accurate and timely accident prediction.

Accident Anticipation Autonomous Driving

When, Where, and What? A Novel Benchmark for Accident Anticipation and Localization with Large Language Models

no code implementations23 Jul 2024 Haicheng Liao, Yongkang Li, Chengyue Wang, Yanchen Guan, Kahou Tam, Chunlin Tian, Li Li, Chengzhong Xu, Zhenning Li

As autonomous driving systems increasingly become part of daily transportation, the ability to accurately anticipate and mitigate potential traffic accidents is paramount.

Accident Anticipation Autonomous Driving

SUSTechGAN: Image Generation for Object Recognition in Adverse Conditions of Autonomous Driving

no code implementations18 Jul 2024 Gongjin Lan, Yang Peng, Qi Hao, Chengzhong Xu

We test the SUSTechGAN and the existing well-known GANs to generate driving images in adverse conditions of rain and night and apply the generated images to retrain object recognition networks.

Autonomous Driving Image Generation +2

Ranking-based Client Selection with Imitation Learning for Efficient Federated Learning

no code implementations7 May 2024 Chunlin Tian, Zhan Shi, Xinpeng Qin, Li Li, Chengzhong Xu

Federated Learning (FL) enables multiple devices to collaboratively train a shared model while ensuring data privacy.

Federated Learning Imitation Learning

MFTraj: Map-Free, Behavior-Driven Trajectory Prediction for Autonomous Driving

no code implementations2 May 2024 Haicheng Liao, Zhenning Li, Chengyue Wang, Huanming Shen, Bonan Wang, Dongping Liao, Guofa Li, Chengzhong Xu

This paper introduces a trajectory prediction model tailored for autonomous driving, focusing on capturing complex interactions in dynamic traffic scenarios without reliance on high-definition maps.

Autonomous Driving Computational Efficiency +1

HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

1 code implementation30 Apr 2024 Chunlin Tian, Zhan Shi, Zhijiang Guo, Li Li, Chengzhong Xu

Through a series of experiments, we have uncovered two critical insights that shed light on the training and parameter inefficiency of LoRA.

parameter-efficient fine-tuning

Breaking the Memory Wall for Heterogeneous Federated Learning with Progressive Training

no code implementations20 Apr 2024 Yebo Wu, Li Li, Chunlin Tian, Chengzhong Xu

In order to preserve the feature representation of each block, we decouple the whole training process into two stages: progressive model shrinking and progressive model growing.

Federated Learning

Integrated Sensing and Communication for Edge Inference with End-to-End Multi-View Fusion

no code implementations16 Apr 2024 Xibin Jin, Guoliang Li, Shuai Wang, Miaowen Wen, Chengzhong Xu, H. Vincent Poor

Integrated sensing and communication (ISAC) is a promising solution to accelerate edge inference via the dual use of wireless signals.

World Models for Autonomous Driving: An Initial Survey

no code implementations5 Mar 2024 Yanchen Guan, Haicheng Liao, Zhenning Li, Jia Hu, Runze Yuan, Yunjian Li, Guohui Zhang, Chengzhong Xu

In the rapidly evolving landscape of autonomous driving, the capability to accurately predict future events and assess their implications is paramount for both safety and efficiency, critically aiding the decision-making process.

Autonomous Driving Decision Making +1

A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving

1 code implementation29 Feb 2024 Haicheng Liao, Yongkang Li, Zhenning Li, Chengyue Wang, Zhiyong Cui, Shengbo Eben Li, Chengzhong Xu

In autonomous vehicle (AV) technology, the ability to accurately predict the movements of surrounding vehicles is paramount for ensuring safety and operational efficiency.

Autonomous Driving Decision Making +2

BAT: Behavior-Aware Human-Like Trajectory Prediction for Autonomous Driving

1 code implementation11 Dec 2023 Haicheng Liao, Zhenning Li, Huanming Shen, Wenxuan Zeng, Dongping Liao, Guofa Li, Shengbo Eben Li, Chengzhong Xu

The ability to accurately predict the trajectory of surrounding vehicles is a critical hurdle to overcome on the journey to fully autonomous vehicles.

Autonomous Driving Decision Making +1

GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models

1 code implementation6 Dec 2023 Haicheng Liao, Huanming Shen, Zhenning Li, Chengyue Wang, Guofa Li, Yiming Bie, Chengzhong Xu

In the field of autonomous vehicles (AVs), accurately discerning commander intent and executing linguistic commands within a visual context presents a significant challenge.

Autonomous Driving Decoder +1

Open Set Dandelion Network for IoT Intrusion Detection

no code implementations19 Nov 2023 Jiashu Wu, Hao Dai, Kenneth B. Kent, Jerome Yen, Chengzhong Xu, Yang Wang

The OSDN model performs intrusion knowledge transfer from the knowledge-rich source network intrusion domain to facilitate more accurate intrusion detection for the data-scarce target IoT intrusion domain.

Domain Adaptation Intrusion Detection +1

Heterogeneous Domain Adaptation for IoT Intrusion Detection: A Geometric Graph Alignment Approach

no code implementations24 Jan 2023 Jiashu Wu, Hao Dai, Yang Wang, Kejiang Ye, Chengzhong Xu

In this paper, a Geometric Graph Alignment (GGA) approach is leveraged to mask the geometric heterogeneities between domains for better intrusion knowledge transfer.

Domain Adaptation Network Intrusion Detection +2

Inflected Forms Are Redundant in Question Generation Models

no code implementations1 Jan 2023 Xingwu Sun, Hongyin Tang, Chengzhong Xu

Secondly, we propose to adapt QG as a combination of the following actions in the encode-decoder framework: generating a question word, copying a word from the source sequence or generating a word transformation type.

Decoder Question Generation +1

Toward Efficient Automated Feature Engineering

no code implementations26 Dec 2022 Kafeng Wang, Pengyang Wang, Chengzhong Xu

Specifically, we construct the AFE pipeline based on reinforcement learning setting, where each feature is assigned an agent to perform feature transformation \com{and} selection, and the evaluation score of the produced features in downstream tasks serve as the reward to update the policy.

Automated Feature Engineering Computational Efficiency +1

Joint Semantic Transfer Network for IoT Intrusion Detection

no code implementations28 Oct 2022 Jiashu Wu, Yang Wang, Binhui Xie, Shuang Li, Hao Dai, Kejiang Ye, Chengzhong Xu

The scenario semantic endows source NI and II domain with characteristics from each other to ease the knowledge transfer process via a confused domain discriminator and categorical distribution knowledge preservation.

Computational Efficiency Domain Adaptation +3

PECCO: A Profit and Cost-oriented Computation Offloading Scheme in Edge-Cloud Environment with Improved Moth-flame Optimisation

no code implementations9 Aug 2022 Jiashu Wu, Hao Dai, Yang Wang, Shigen Shen, Chengzhong Xu

With the fast growing quantity of data generated by smart devices and the exponential surge of processing demand in the Internet of Things (IoT) era, the resource-rich cloud centres have been utilised to tackle these challenges.

MaskRange: A Mask-classification Model for Range-view based LiDAR Segmentation

no code implementations24 Jun 2022 Yi Gu, Yuming Huang, Chengzhong Xu, Hui Kong

To answer this question, we propose a unified mask-classification model, MaskRange, for the range-view based LiDAR semantic and panoptic segmentation.

Classification Data Augmentation +3

Federated Deep Learning Meets Autonomous Vehicle Perception: Design and Verification

1 code implementation3 Jun 2022 Shuai Wang, Chengyang Li, Derrick Wing Kwan Ng, Yonina C. Eldar, H. Vincent Poor, Qi Hao, Chengzhong Xu

However, it is challenging to determine the network resources and road sensor placements for multi-stage training with multi-modal datasets in multi-variant scenarios.

Deep Learning Federated Learning +1

Deep Active Learning with Noise Stability

no code implementations26 May 2022 Xingjian Li, Pengkun Yang, Yangcheng Gu, Xueying Zhan, Tianyang Wang, Min Xu, Chengzhong Xu

We provide theoretical analyses by leveraging the small Gaussian noise theory and demonstrate that our method favors a subset with large and diverse gradients.

Active Learning

Learning Disentangled Representation for One-shot Progressive Face Swapping

1 code implementation24 Mar 2022 Qi Li, Weining Wang, Chengzhong Xu, Zhenan Sun, Ming-Hsuan Yang

The disentangled representation module comprises an attribute encoder and an identity encoder, which aims to achieve the disentanglement of the identity and attribute information.

Attribute Disentanglement +1

Towards Inadequately Pre-trained Models in Transfer Learning

no code implementations ICCV 2023 Andong Deng, Xingjian Li, Di Hu, Tianyang Wang, Haoyi Xiong, Chengzhong Xu

Based on the contradictory phenomenon between FE and FT that better feature extractor fails to be fine-tuned better accordingly, we conduct comprehensive analyses on features before softmax layer to provide insightful explanations.

Transfer Learning

Learning Moving-Object Tracking with FMCW LiDAR

no code implementations2 Mar 2022 Yi Gu, Hongzhi Cheng, Kafeng Wang, Dejing Dou, Chengzhong Xu, Hui Kong

In this paper, we propose a learning-based moving-object tracking method utilizing our newly developed LiDAR sensor, Frequency Modulated Continuous Wave (FMCW) LiDAR.

Contrastive Learning Object +1

Completion and Augmentation based Spatiotemporal Deep Learning Approach for Short-Term Metro Origin-Destination Matrix Prediction under Limited Observable Data

1 code implementation9 Aug 2021 Jiexia Ye, Juanjuan Zhao, Furong Zheng, Chengzhong Xu

Due to the delayed effect in latest complete OD flow collection, complex spatiotemporal correlations of OD flows in high dimension, it is more challengeable than other traffic prediction tasks of time series.

Management Time Series +2

Integrated Sensing and Communication from Learning Perspective: An SDP3 Approach

no code implementations20 Jul 2021 Guoliang Li, Shuai Wang, Jie Li, Rui Wang, Fan Liu, Xiaohui Peng, Tony Xiao Han, Chengzhong Xu

Characterizing the sensing and communication performance tradeoff in integrated sensing and communication (ISAC) systems is challenging in the applications of learning-based human motion recognition.

A Deep Reinforcement Learning Approach for Traffic Signal Control Optimization

no code implementations13 Jul 2021 Zhenning Li, Chengzhong Xu, Guohui Zhang

Inefficient traffic signal control methods may cause numerous problems, such as traffic congestion and waste of energy.

Deep Reinforcement Learning reinforcement-learning +2

Incorporating Reachability Knowledge into a Multi-Spatial Graph Convolution Based Seq2Seq Model for Traffic Forecasting

1 code implementation4 Jul 2021 Jiexia Ye, Furong Zheng, Juanjuan Zhao, Kejiang Ye, Chengzhong Xu

Our main novelties are three aspects: (1) We enrich the spatiotemporal information of model inputs by fusing multi-view features (time, location and traffic states) (2) We build multiple kinds of spatial correlations based on both prior knowledge and data-driven knowledge to improve model performance especially in insufficient or noisy data cases.

Decoder Traffic Prediction

Machine Learning-based Orchestration of Containers: A Taxonomy and Future Directions

no code implementations24 Jun 2021 Zhiheng Zhong, Minxian Xu, Maria Alejandra Rodriguez, Chengzhong Xu, Rajkumar Buyya

Containerization is a lightweight application virtualization technology, providing high environmental consistency, operating system distribution portability, and resource isolation.

BIG-bench Machine Learning Management

Federated Noisy Client Learning

1 code implementation24 Jun 2021 Kahou Tam, Li Li, Bo Han, Chengzhong Xu, Huazhu Fu

Federated learning (FL) collaboratively trains a shared global model depending on multiple local clients, while keeping the training data decentralized in order to preserve data privacy.

Federated Learning

One Shot Face Swapping on Megapixels

1 code implementation CVPR 2021 Yuhao Zhu, Qi Li, Jian Wang, Chengzhong Xu, Zhenan Sun

Extensive experiments demonstrate the superiority of MegaFS and the first megapixel level face swapping database is released for research on DeepFake detection and face image editing in the public domain.

DeepFake Detection Disentanglement +2

SMILE: Self-Distilled MIxup for Efficient Transfer LEarning

no code implementations25 Mar 2021 Xingjian Li, Haoyi Xiong, Chengzhong Xu, Dejing Dou

Performing mixup for transfer learning with pre-trained models however is not that simple, a high capacity pre-trained model with a large fully-connected (FC) layer could easily overfit to the target dataset even with samples-to-labels mixed up.

Transfer Learning Triplet

DEAL: Decremental Energy-Aware Learning in a Federated System

no code implementations5 Feb 2021 Wenting Zou, Li Li, Zichen Xu, Chengzhong Xu

To address the conflict between learning SLO and energy efficiency, we propose DEAL, an energy efficient learning system that saves energy and preserves privacy with a decremental learning design.

energy management Federated Learning +1

Towards Accurate Knowledge Transfer via Target-awareness Representation Disentanglement

no code implementations16 Oct 2020 Xingjian Li, Di Hu, Xuhong LI, Haoyi Xiong, Zhi Ye, Zhipeng Wang, Chengzhong Xu, Dejing Dou

Fine-tuning deep neural networks pre-trained on large scale datasets is one of the most practical transfer learning paradigm given limited quantity of training samples.

Disentanglement Transfer Learning

COLAM: Co-Learning of Deep Neural Networks and Soft Labels via Alternating Minimization

no code implementations26 Apr 2020 Xingjian Li, Haoyi Xiong, Haozhe An, Dejing Dou, Chengzhong Xu

Softening labels of training datasets with respect to data representations has been frequently used to improve the training of deep neural networks (DNNs).

General Classification

Cannot find the paper you are looking for? You can Submit a new open access paper.