Search Results for author: Jun Zhang

Found 297 papers, 100 papers with code

BMInf: An Efficient Toolkit for Big Model Inference and Tuning

1 code implementation ACL 2022 Xu Han, Guoyang Zeng, Weilin Zhao, Zhiyuan Liu, Zhengyan Zhang, Jie zhou, Jun Zhang, Jia Chao, Maosong Sun

In recent years, large-scale pre-trained language models (PLMs) containing billions of parameters have achieved promising results on various NLP tasks.

Quantization Scheduling

KERS: A Knowledge-Enhanced Framework for Recommendation Dialog Systems with Multiple Subgoals

1 code implementation Findings (EMNLP) 2021 Jun Zhang, Yan Yang, Chencai Chen, Liang He, Zhou Yu

Recommendation dialogs require the system to build a social bond with users to gain trust and develop affinity in order to increase the chance of a successful recommendation.

Question Answering Recommendation Systems +1

Reinforcement-Learning-Enabled Beam Alignment for Water-Air Direct Optical Wireless Communications

no code implementations5 Sep 2024 Jiayue Liu, Tianqi Mao, Dongxuan He, Yang Yang, Zhen Gao, Dezhi Zheng, Jun Zhang

The escalating interests on underwater exploration/reconnaissance applications have motivated high-rate data transmission from underwater to airborne relaying platforms, especially under high-sea scenarios.

ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model

no code implementations29 Aug 2024 Fangfu Liu, Wenqiang Sun, HanYang Wang, Yikai Wang, Haowen Sun, Junliang Ye, Jun Zhang, Yueqi Duan

Advancements in 3D scene reconstruction have transformed 2D images from the real world into 3D models, producing realistic 3D results from hundreds of input photos.

3D Scene Reconstruction

Exploring Selective Layer Fine-Tuning in Federated Learning

no code implementations28 Aug 2024 Yuchang Sun, Yuexiang Xie, Bolin Ding, Yaliang Li, Jun Zhang

Federated learning (FL) has emerged as a promising paradigm for fine-tuning foundation models using distributed data in a privacy-preserving manner.

Federated Learning Privacy Preserving

ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM

1 code implementation22 Aug 2024 Zhaochen Su, Jun Zhang, Xiaoye Qu, Tong Zhu, Yanshu Li, Jiashuo Sun, Juntao Li, Min Zhang, Yu Cheng

Only a few research explored the conflicts between the inherent knowledge of LLMs and the retrieved contextual knowledge.

Misinformation

A Landscape-Aware Differential Evolution for Multimodal Optimization Problems

no code implementations5 Aug 2024 Guo-Yun Lin, Zong-Gan Chen, Yuncheng Jiang, Zhi-Hui Zhan, Jun Zhang

First, a landscape-aware peak exploration helps each individual evolve adaptively to locate a peak and simulates the regions of the found peaks according to search history to avoid an individual locating a found peak.

Task-Oriented Communication for Vehicle-to-Infrastructure Cooperative Perception

no code implementations30 Jul 2024 Jiawei Shao, Teng Li, Jun Zhang

Despite its potential to improve perception accuracy and robustness, the large amount of raw sensor data inevitably results in high communication overhead.

Autonomous Driving feature selection

EaTVul: ChatGPT-based Evasion Attack Against Software Vulnerability Detection

1 code implementation27 Jul 2024 Shigang Liu, Di Cao, Junae Kim, Tamas Abraham, Paul Montague, Seyit Camtepe, Jun Zhang, Yang Xiang

Recently, deep learning has demonstrated promising results in enhancing the accuracy of vulnerability detection and identifying vulnerabilities in software.

Adversarial Attack Vulnerability Detection

Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model

no code implementations15 Jul 2024 Zhening Liu, Xinjie Zhang, Jiawei Shao, Zehong Lin, Jun Zhang

With the rapid advancement of stereo vision technologies, stereo image compression has emerged as a crucial field that continues to draw significant attention.

Image Compression MS-SSIM +2

Multimodal contrastive learning for spatial gene expression prediction using histology images

1 code implementation11 Jul 2024 Wenwen Min, Zhiceng Shi, Jun Zhang, Jun Wan, Changmiao Wang

In this paper, we propose \textbf{mclSTExp}, a multimodal contrastive learning with Transformer and Densenet-121 encoder for Spatial Transcriptomics Expression prediction.

Contrastive Learning whole slide images

Let the Code LLM Edit Itself When You Edit the Code

no code implementations3 Jul 2024 Zhenyu He, Jun Zhang, Shengjie Luo, Jingjing Xu, Zhi Zhang, Di He

Simply encoding the edited subsequence and integrating it to the original KV cache meets the temporal confusion problem, leading to significantly worse performance.

8k Code Generation +2

Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models

1 code implementation2 Jul 2024 Fei Shen, Hu Ye, Sibo Liu, Jun Zhang, Cong Wang, Xiao Han, Wei Yang

Moreover, RCDMs can generate consistent stories with a single forward inference compared to autoregressive models.

Story Visualization

Low-Complexity CSI Feedback for FDD Massive MIMO Systems via Learning to Optimize

no code implementations24 Jun 2024 Yifan Ma, Hengtao He, Shenghui Song, Jun Zhang, Khaled B. Letaief

In frequency-division duplex (FDD) massive multiple-input multiple-output (MIMO) systems, the growing number of base station antennas leads to prohibitive feedback overhead for downlink channel state information (CSI).

Compressive Sensing Decoder

CityBench: Evaluating the Capabilities of Large Language Model as World Model

1 code implementation20 Jun 2024 Jie Feng, Jun Zhang, Junbo Yan, Xin Zhang, Tianjian Ouyang, Tianhui Liu, Yuwei Du, Siqi Guo, Yong Li

Based on CitySim, we design 7 tasks in 2 categories of perception-understanding and decision-making group to evaluate the capability of LLMs as city-scale world model for urban domain.

Language Modelling Large Language Model

Timo: Towards Better Temporal Reasoning for Language Models

1 code implementation20 Jun 2024 Zhaochen Su, Jun Zhang, Tong Zhu, Xiaoye Qu, Juntao Li, Min Zhang, Yu Cheng

Therefore, we propose a crucial question: Can we build a universal framework to handle a variety of temporal reasoning tasks?

Question Answering

SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words

1 code implementation19 Jun 2024 Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu

We also conduct a comprehensive evaluation using objective evaluation methods (e. g. BLEU and ROUGE), subjective evaluations and LLM-based metrics for the generated responses.

Dialogue Understanding

A GPU-accelerated Large-scale Simulator for Transportation System Optimization Benchmarking

1 code implementation15 Jun 2024 Jun Zhang, Wenxuan Ao, Junbo Yan, Depeng Jin, Yong Li

Based on the simulator, we implement a set of microscopic and macroscopic controllable objects and metrics to support most typical transportation system optimization scenarios.

Benchmarking

Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR

no code implementations12 Jun 2024 Yerbolat Khassanov, Zhipeng Chen, Tianfeng Chen, Tze Yuang Chong, Wei Li, Jun Zhang, Lu Lu, Yuxuan Wang

This paper addresses challenges in integrating new languages into a pre-trained multilingual automatic speech recognition (mASR) system, particularly in scenarios where training data for existing languages is limited or unavailable.

Automatic Speech Recognition Decoder +2

Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

no code implementations7 Jun 2024 Jianbo Dong, Bin Luo, Jun Zhang, Pengcheng Zhang, Fei Feng, Yikai Zhu, Ang Liu, Zian Chen, Yi Shi, Hairong Jiao, Gang Lu, Yu Guan, Ennan Zhai, Wencong Xiao, Hanyu Zhao, Man Yuan, Siran Yang, Xiang Li, Jiamang Wang, Rui Men, Jianwei Zhang, Huang Zhong, Dennis Cai, Yuan Xie, Binzhang Fu

By leveraging this feature, C4 can rapidly identify the faulty components, swiftly isolate the anomaly, and restart the task, thereby avoiding resource wastage caused by delays in anomaly detection.

Anomaly Detection

Memorization in deep learning: A survey

no code implementations6 Jun 2024 Jiaheng Wei, Yanjun Zhang, Leo Yu Zhang, Ming Ding, Chao Chen, Kok-Leong Ong, Jun Zhang, Yang Xiang

Deep Learning (DL) powered by Deep Neural Networks (DNNs) has revolutionized various domains, yet understanding the intricacies of DNN decision-making and learning processes remains a significant challenge.

Decision Making Memorization

V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation

no code implementations4 Jun 2024 Cong Wang, Kuan Tian, Jun Zhang, Yonghang Guan, Feng Luo, Fei Shen, Zhiwei Jiang, Qing Gu, Xiao Han, Wei Yang

In our work on portrait video generation, we identified audio signals as particularly weak, often overshadowed by stronger signals such as facial pose and reference image.

Video Generation

DA-HFNet: Progressive Fine-Grained Forgery Image Detection and Localization Based on Dual Attention

no code implementations3 Jun 2024 Yang Liu, Xiaofei Li, Jun Zhang, Shengze Hu, Jun Lei

The increasing difficulty in accurately detecting forged images generated by AIGC(Artificial Intelligence Generative Content) poses many risks, necessitating the development of effective methods to identify and further locate forged areas.

Graph Neural Network Enhanced Retrieval for Question Answering of LLMs

no code implementations3 Jun 2024 Zijian Li, Qingyan Guo, Jiawei Shao, Lei Song, Jiang Bian, Jun Zhang, Rui Wang

A graph neural network (GNN) is then leveraged to exploit the relationships between passages and improve the retrieval of supporting passages.

Graph Neural Network Language Modelling +3

Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models

no code implementations2 Jun 2024 Wenqiang Sun, Zhengyi Wang, Shuo Chen, Yikai Wang, Zilong Chen, Jun Zhu, Jun Zhang

We first analyze the role of triplanes in feed-forward methods and find that the inconsistent multi-view images introduce high-frequency artifacts on triplanes, leading to low-quality 3D meshes.

3D geometry

Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning

1 code implementation28 May 2024 Xinran Li, Zifan Liu, Shibo Chen, Jun Zhang

In multi-agent reinforcement learning (MARL), effective exploration is critical, especially in sparse reward environments.

SMAC+ Starcraft

Ensembling Diffusion Models via Adaptive Feature Aggregation

1 code implementation27 May 2024 Cong Wang, Kuan Tian, Yonghang Guan, Jun Zhang, Zhiwei Jiang, Fei Shen, Xiao Han, Qing Gu, Wei Yang

In this paper, we propose a novel ensembling method, Adaptive Feature Aggregation (AFA), which dynamically adjusts the contributions of multiple models at the feature level according to various states (i. e., prompts, initial noises, denoising steps, and spatial locations), thereby keeping the advantages of multiple diffusion models, while suppressing their disadvantages.

Denoising

Dual-Delayed Asynchronous SGD for Arbitrarily Heterogeneous Data

no code implementations27 May 2024 Xiaolu Wang, Yuchang Sun, Hoi-To Wai, Jun Zhang

We consider the distributed learning problem with data dispersed across multiple workers under the orchestration of a central server.

WirelessLLM: Empowering Large Language Models Towards Wireless Intelligence

no code implementations27 May 2024 Jiawei Shao, Jingwen Tong, Qiong Wu, Wei Guo, Zijian Li, Zehong Lin, Jun Zhang

To empower LLMs with knowledge and expertise in the wireless domain, this paper proposes WirelessLLM, a comprehensive framework for adapting and enhancing LLMs to address the unique challenges and requirements of wireless communication networks.

Prompt Engineering

Tackling Distribution Shifts in Task-Oriented Communication with Information Bottleneck

1 code implementation15 May 2024 Hongru Li, Jiawei Shao, Hengtao He, Shenghui Song, Jun Zhang, Khaled B. Letaief

Specifically, we propose an invariant feature encoding approach based on the IB principle and IRM framework for domainshift generalization, which aims to find the causal relationship between the input data and task result by minimizing the complexity and domain dependence of the encoded feature.

Image Classification Semantic Shift Detection

Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration

1 code implementation10 May 2024 Li Ling, Jun Zhang, Nils Bore, John Folkesson, Anna Wåhlin

However, in the underwater domain, most registration of multibeam echo-sounder (MBES) point cloud data are still performed using classical methods in the iterative closest point (ICP) family.

Benchmarking Point Cloud Registration

Neural Graph Mapping for Dense SLAM with Efficient Loop Closure

no code implementations6 May 2024 Leonard Bruns, Jun Zhang, Patric Jensfelt

Existing neural field-based SLAM methods typically employ a single monolithic field as their scene representation.

Task-Aware Encoder Control for Deep Video Compression

no code implementations CVPR 2024 Xingtong Ge, Jixiang Luo, Xinjie Zhang, Tongda Xu, Guo Lu, Dailan He, Jing Geng, Yan Wang, Jun Zhang, Hongwei Qin

Prior research on deep video compression (DVC) for machine tasks typically necessitates training a unique codec for each specific task, mandating a dedicated decoder per task.

Decoder Video Compression

Data-Driven Online Resource Allocation for User Experience Improvement in Mobile Edge Clouds

no code implementations6 Apr 2024 Liqun Fu, Jingwen Tong, Tongtong Lin, Jun Zhang

Due to the learned objective model is typically non-convex and challenging to solve in real-time, we leverage the Lyapunov optimization to decouple the long-term average constraint and apply the prime-dual method to solve this decoupled resource allocation problem.

From Learning to Analytics: Improving Model Efficacy with Goal-Directed Client Selection

no code implementations30 Mar 2024 Jingwen Tong, Zhenzhen Chen, Liqun Fu, Jun Zhang, Zhu Han

To address the challenges posed by system and data heterogeneities in the FL process, we study a goal-directed client selection problem based on the model analytics framework by selecting a subset of clients for the model training.

Federated Learning

Decentralizing Coherent Joint Transmission Precoding via Fast ADMM with Deterministic Equivalents

no code implementations28 Mar 2024 Xinyu Bian, Yuhao Liu, Yizhou Xu, Tianqi Hou, Wenjie Wang, Yuyi Mao, Jun Zhang

Simulation results demonstrate the effectiveness of our proposed decentralized precoding scheme, which achieves performance similar to the optimal centralized precoding scheme.

Decentralizing Coherent Joint Transmission Precoding via Deterministic Equivalents

no code implementations15 Mar 2024 Yuhao Liu, Xinyu Bian, Yizhou Xu, Tianqi Hou, Wenjie Wang, Yuyi Mao, Jun Zhang

In order to control the inter-cell interference for a multi-cell multi-user multiple-input multiple-output network, we consider the precoder design for coordinated multi-point with downlink coherent joint transmission.

Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience

no code implementations15 Mar 2024 Xiaohang Yu, Zhengxian Yang, Shi Pan, Yuqi Han, Haoxiang Wang, Jun Zhang, Shi Yan, Borong Lin, Lei Yang, Tao Yu, Lu Fang

We have built a custom mobile multi-camera large-space dense light field capture system, which provides a series of high-quality and sufficiently dense light field images for various scenarios.

3D Reconstruction 3D Scene Reconstruction +1

Content-aware Masked Image Modeling Transformer for Stereo Image Compression

no code implementations13 Mar 2024 Xinjie Zhang, Shenyuan Gao, Zhening Liu, Jiawei Shao, Xingtong Ge, Dailan He, Tongda Xu, Yan Wang, Jun Zhang

Existing learning-based stereo image codec adopt sophisticated transformation with simple entropy models derived from single image codecs to encode latent representations.

Decoder Image Compression

GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting

1 code implementation13 Mar 2024 Xinjie Zhang, Xingtong Ge, Tongda Xu, Dailan He, Yan Wang, Hongwei Qin, Guo Lu, Jing Geng, Jun Zhang

In response, we propose a groundbreaking paradigm of image representation and compression by 2D Gaussian Splatting, named GaussianImage.

Quantization

Boosting Neural Representations for Videos with a Conditional Decoder

1 code implementation CVPR 2024 Xinjie Zhang, Ren Yang, Dailan He, Xingtong Ge, Tongda Xu, Yan Wang, Hongwei Qin, Jun Zhang

Implicit neural representations (INRs) have emerged as a promising approach for video storage and processing, showing remarkable versatility across various video tasks.

Decoder

Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access: A Free Probability Theory Approach

no code implementations28 Feb 2024 Xinyu Bian, Yuyi Mao, Jun Zhang

Grant-free random access (RA) has been recognized as a promising solution to support massive connectivity due to the removal of the uplink grant request procedures.

Action Detection Activity Detection

Training-Free Long-Context Scaling of Large Language Models

1 code implementation27 Feb 2024 Chenxin An, Fei Huang, Jun Zhang, Shansan Gong, Xipeng Qiu, Chang Zhou, Lingpeng Kong

The ability of Large Language Models (LLMs) to process and generate coherent text is markedly weakened when the number of input tokens exceeds their pretraining length.

16k

Achieving Linear Speedup in Asynchronous Federated Learning with Heterogeneous Clients

no code implementations17 Feb 2024 Xiaolu Wang, Zijian Li, Shi Jin, Jun Zhang

Federated learning (FL) is an emerging distributed training paradigm that aims to learn a common global model without exchanging or transferring the data that are stored locally at different clients.

Federated Learning

Federated Prompt-based Decision Transformer for Customized VR Services in Mobile Edge Computing System

no code implementations15 Feb 2024 Tailin Zhou, Jiadong Yu, Jun Zhang, Danny H. K. Tsang

This paper investigates resource allocation to provide heterogeneous users with customized virtual reality (VR) services in a mobile edge computing (MEC) system.

Edge-computing Federated Learning

FD-Vision Mamba for Endoscopic Exposure Correction

no code implementations9 Feb 2024 Zhuoran Zheng, Jun Zhang

In endoscopic imaging, the recorded images are prone to exposure abnormalities, so maintaining high-quality images is important to assist healthcare professionals in performing decision-making.

Decision Making Exposure Correction

Private Knowledge Sharing in Distributed Learning: A Survey

no code implementations8 Feb 2024 Yasas Supeksala, Dinh C. Nguyen, Ming Ding, Thilina Ranbaduge, Calson Chua, Jun Zhang, Jun Li, H. Vincent Poor

In this light, it is crucial to utilize information in learning processes that are either distributed or owned by different entities.

Spatial-Aware Latent Initialization for Controllable Image Generation

no code implementations29 Jan 2024 Wenqiang Sun, Teng Li, Zehong Lin, Jun Zhang

Recently, text-to-image diffusion models have demonstrated impressive ability to generate high-quality images conditioned on the textual input.

Denoising Image Generation

MIFI: MultI-camera Feature Integration for Roust 3D Distracted Driver Activity Recognition

1 code implementation25 Jan 2024 Jian Kuang, Wenjing Li, Fang Li, Jun Zhang, Zhongcheng Wu

Distracted driver activity recognition plays a critical role in risk aversion-particularly beneficial in intelligent transportation systems.

Activity Recognition

How to Collaborate: Towards Maximizing the Generalization Performance in Cross-Silo Federated Learning

no code implementations24 Jan 2024 Yuchang Sun, Marios Kountouris, Jun Zhang

We show that the generalization performance of a client can be improved only by collaborating with other clients that have more training data and similar data distribution.

Federated Learning Privacy Preserving

TransLoc4D: Transformer-based 4D Radar Place Recognition

no code implementations CVPR 2024 Guohao Peng, Heshan Li, Yangyang Zhao, Jun Zhang, Zhenyu Wu, Pengyu Zheng, Danwei Wang

To validate TransLoc4D we construct two datasets and set up benchmarks for 4D radar place recognition.

Context-aware Communication for Multi-agent Reinforcement Learning

1 code implementation25 Dec 2023 Xinran Li, Jun Zhang

Following this, agents utilize attention mechanisms in the second stage to selectively generate messages personalized for the receivers.

Multi-agent Reinforcement Learning Quantization +1

Joint Channel Estimation and Cooperative Localization for Near-Field Ultra-Massive MIMO

no code implementations21 Dec 2023 Ruoxiao Cao, Hengtao He, Xianghao Yu, Shenghui Song, Kaibin Huang, Jun Zhang, Yi Gong, Khaled B. Letaief

To address the joint channel estimation and cooperative localization problem for near-field UM-MIMO systems, we propose a variational Newtonized near-field channel estimation (VNNCE) algorithm and a Gaussian fusion cooperative localization (GFCL) algorithm.

Urban Generative Intelligence (UGI): A Foundational Platform for Agents in Embodied City Environment

1 code implementation19 Dec 2023 Fengli Xu, Jun Zhang, Chen Gao, Jie Feng, Yong Li

Urban environments, characterized by their complex, multi-layered networks encompassing physical, social, economic, and environmental dimensions, face significant challenges in the face of rapid urbanization.

Linear Attention via Orthogonal Memory

no code implementations18 Dec 2023 Jun Zhang, Shuyang Jiang, Jiangtao Feng, Lin Zheng, Lingpeng Kong

Given that orthogonal memory compresses global information, we further dissect the context to amplify fine-grained local information.

Causal Language Modeling Computational Efficiency +1

Bayes-Optimal Unsupervised Learning for Channel Estimation in Near-Field Holographic MIMO

no code implementations16 Dec 2023 Wentao Yu, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Ross Murch, Khaled B. Letaief

In this paper, we address the fundamental challenge of designing a low-complexity Bayes-optimal channel estimator in near-field HMIMO systems operating in unknown EM environments.

Denoising

Green Edge AI: A Contemporary Survey

no code implementations1 Dec 2023 Yuyi Mao, Xianghao Yu, Kaibin Huang, Ying-Jun Angela Zhang, Jun Zhang

Guided by these principles, we then explore energy-efficient design methodologies for the three critical tasks in edge AI systems, including training data acquisition, edge training, and edge inference.

Sluggish and Chemically-Biased Interstitial Diffusion in Concentrated Solid Solution Alloys: Mechanisms and Methods

1 code implementation28 Nov 2023 Biao Xu, Haijun Fu, Shasha Huang, Shihua Ma, Yaoxu Xiong, Jun Zhang, Xuepeng Xiang, Wenyu Lu, Ji-Jung Kai, Shijun Zhao

Interstitial diffusion is a pivotal process that governs the phase stability and irradiation response of materials in non-equilibrium conditions.

Surgical Temporal Action-aware Network with Sequence Regularization for Phase Recognition

no code implementations21 Nov 2023 Zhen Chen, Yuhao Zhai, Jun Zhang, Jinqiao Wang

Specifically, we propose an efficient multi-scale surgical temporal action (MS-STA) module, which integrates visual features with spatial and temporal knowledge of surgical actions at the cost of 2D networks.

Surgical phase recognition

KBioXLM: A Knowledge-anchored Biomedical Multilingual Pretrained Language Model

1 code implementation20 Nov 2023 Lei Geng, Xu Yan, Ziqiang Cao, Juntao Li, Wenjie Li, Sujian Li, Xinjie Zhou, Yang Yang, Jun Zhang

We achieve a biomedical multilingual corpus by incorporating three granularity knowledge alignments (entity, fact, and passage levels) into monolingual corpora.

Relation XLM-R

Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer

no code implementations15 Nov 2023 Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma

Deep biasing for the Transducer can improve the recognition performance of rare words or contextual entities, which is essential in practical applications, especially for streaming Automatic Speech Recognition (ASR).

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Learning Bayes-Optimal Channel Estimation for Holographic MIMO in Unknown EM Environments

no code implementations14 Nov 2023 Wentao Yu, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Ross D. Murch, Khaled B. Letaief

Holographic MIMO (HMIMO) has recently been recognized as a promising enabler for future 6G systems through the use of an ultra-massive number of antennas in a compact space to exploit the propagation characteristics of the electromagnetic (EM) channel.

Learning Decentralized Traffic Signal Controllers with Multi-Agent Graph Reinforcement Learning

no code implementations7 Nov 2023 Yao Zhang, Zhiwen Yu, Jun Zhang, Liang Wang, Tom H. Luan, Bin Guo, Chau Yuen

Nevertheless, existing MARL algorithms ignore effective information aggregation which is fundamental for improving the learning capacity of decentralized agents.

Graph Learning Multi-agent Reinforcement Learning +1

Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach

1 code implementation18 Oct 2023 Feng Luo, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang

To alleviate the huge computational cost required by pixel-based diffusion SR, latent-based methods utilize a feature encoder to transform the image and then implement the SR image generation in a compact latent space.

Blind Super-Resolution Decoder +2

Hyperspectral Image Fusion via Logarithmic Low-rank Tensor Ring Decomposition

no code implementations16 Oct 2023 Jun Zhang, Lipeng Zhu, Chao Wang, Shutao Li

On the other hand, the tensor nuclear norm (TNN)-based approaches have recently demonstrated to be more efficient on keeping high-dimensional low-rank structures in tensor recovery.

valid

Effortless Cross-Platform Video Codec: A Codebook-Based Method

no code implementations16 Oct 2023 Kuan Tian, Yonghang Guan, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang

Due to the absence of autoregressive modeling and optical flow alignment, we can design an extremely minimalist framework that can greatly benefit computational efficiency.

Computational Efficiency Optical Flow Estimation +1

Attentive Multi-Layer Perceptron for Non-autoregressive Generation

1 code implementation14 Oct 2023 Shuyang Jiang, Jun Zhang, Jiangtao Feng, Lin Zheng, Lingpeng Kong

Furthermore, we marry AMLP with popular NAR models, deriving a highly efficient NAR-AMLP architecture with linear time and space complexity.

Machine Translation Speech Synthesis +1

Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models

1 code implementation10 Oct 2023 Fei Shen, Hu Ye, Jun Zhang, Cong Wang, Xiao Han, Wei Yang

Specifically, in the first stage, we design a simple prior conditional diffusion model that predicts the global features of the target image by mining the global alignment relationship between pose coordinates and image appearance.

Image Generation

Mode Connectivity and Data Heterogeneity of Federated Learning

no code implementations29 Sep 2023 Tailin Zhou, Jun Zhang, Danny H. K. Tsang

Empirically, reducing data heterogeneity makes the connectivity on different paths more similar, forming more low-error overlaps between client and global modes.

Federated Learning Linear Mode Connectivity

Semantic Communications using Foundation Models: Design Approaches and Open Issues

no code implementations23 Sep 2023 Peiwen Jiang, Chao-Kai Wen, Xinping Yi, Xiao Li, Shi Jin, Jun Zhang

Foundation models (FMs), including large language models, have become increasingly popular due to their wide-ranging applicability and ability to understand human-like semantics.

Towards Real-Time Neural Video Codec for Cross-Platform Application Using Calibration Information

no code implementations20 Sep 2023 Kuan Tian, Yonghang Guan, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang

First, to solve the problem of inconsistency of codec caused by the uncertainty of floating point calculations across platforms, we design a calibration transmitting system to guarantee the consistent quantization of entropy parameters between the encoding and decoding stages.

Quantization

Deep Learning for Near-Field XL-MIMO Transceiver Design: Principles and Techniques

no code implementations18 Sep 2023 Wentao Yu, Yifan Ma, Hengtao He, Shenghui Song, Jun Zhang, Khaled B. Letaief

Massive multiple-input multiple-output (MIMO) has been a critical enabling technology in 5th generation (5G) wireless networks.

Client-side Gradient Inversion Against Federated Learning from Poisoning

no code implementations14 Sep 2023 Jiaheng Wei, Yanjun Zhang, Leo Yu Zhang, Chao Chen, Shirui Pan, Kok-Leong Ong, Jun Zhang, Yang Xiang

For the first time, we show the feasibility of a client-side adversary with limited knowledge being able to recover the training samples from the aggregated global model.

Federated Learning

FedCiR: Client-Invariant Representation Learning for Federated Non-IID Features

no code implementations30 Aug 2023 Zijian Li, Zehong Lin, Jiawei Shao, Yuyi Mao, Jun Zhang

However, devices often have non-independent and identically distributed (non-IID) data, meaning their local data distributions can vary significantly.

Federated Learning Representation Learning

Forensic Histopathological Recognition via a Context-Aware MIL Network Powered by Self-Supervised Contrastive Learning

no code implementations27 Aug 2023 Chen Shen, Jun Zhang, Xinggong Liang, Zeyi Hao, Kehan Li, Fan Wang, Zhenyuan Wang, Chunfeng Lian

Forensic pathology is critical in analyzing death manner and time from the microscopic aspect to assist in the establishment of reliable factual bases for criminal investigation.

Contrastive Learning Domain Generalization +3

Dynamic Low-Rank Instance Adaptation for Universal Neural Image Compression

1 code implementation15 Aug 2023 Yue Lv, Jinxi Xiang, Jun Zhang, Wenming Yang, Xiao Han, Wei Yang

We thus introduce a dynamic gating network on top of the low-rank adaptation method, in order to decide which decoder layer should employ adaptation.

Decoder Image Compression

IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models

3 code implementations13 Aug 2023 Hu Ye, Jun Zhang, Sibo Liu, Xiao Han, Wei Yang

Despite the simplicity of our method, an IP-Adapter with only 22M parameters can achieve comparable or even better performance to a fully fine-tuned image prompt model.

Diffusion Personalization Tuning Free Image Generation +2

A new solution and concrete implementation steps for Artificial General Intelligence

no code implementations12 Aug 2023 Yongcong Chen, Ting Zeng, Jun Zhang

At present, the mainstream artificial intelligence generally adopts the technical path of "attention mechanism + deep learning" + "reinforcement learning".

reinforcement-learning

Feature Matching Data Synthesis for Non-IID Federated Learning

no code implementations9 Aug 2023 Zijian Li, Yuchang Sun, Jiawei Shao, Yuyi Mao, Jessie Hui Wang, Jun Zhang

For better privacy preservation, we propose a hard feature augmentation method to transfer real features towards the decision boundary, with which the synthetic data not only improve the model generalization but also erase the information of real features.

Data Augmentation Federated Learning +1

Binary Federated Learning with Client-Level Differential Privacy

no code implementations7 Aug 2023 Lumin Liu, Jun Zhang, Shenghui Song, Khaled B. Letaief

To improve communication efficiency and achieve a better privacy-utility trade-off, we propose a communication-efficient FL training algorithm with differential privacy guarantee.

Federated Learning Privacy Preserving

L-Eval: Instituting Standardized Evaluation for Long Context Language Models

3 code implementations20 Jul 2023 Chenxin An, Shansan Gong, Ming Zhong, Xingjian Zhao, Mukai Li, Jun Zhang, Lingpeng Kong, Xipeng Qiu

Recently, there has been growing interest in extending the context length of large language models (LLMs), aiming to effectively process long inputs of one turn or conversations with more extensive histories.

Instruction Following

Large Language Models Empowered Autonomous Edge AI for Connected Intelligence

no code implementations6 Jul 2023 Yifei Shen, Jiawei Shao, Xinjie Zhang, Zehong Lin, Hao Pan, Dongsheng Li, Jun Zhang, Khaled B. Letaief

The evolution of wireless networks gravitates towards connected intelligence, a concept that envisions seamless interconnectivity among humans, objects, and intelligence in a hyper-connected cyber-physical world.

Code Generation Federated Learning +3

MimiC: Combating Client Dropouts in Federated Learning by Mimicking Central Updates

1 code implementation21 Jun 2023 Yuchang Sun, Yuyi Mao, Jun Zhang

Federated learning (FL) is a promising framework for privacy-preserving collaborative learning, where model training tasks are distributed to clients and only the model updates need to be collected at a server.

Federated Learning Privacy Preserving

CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training

no code implementations27 May 2023 Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma

We also observe the cross-modal representation extracted by CIF-PT obtains better performance than other neural interfaces for the tasks of SLU, including the dominant speech representation learned from self-supervised pre-training.

intent-classification Intent Classification +5

Channel and Gradient-Importance Aware Device Scheduling for Over-the-Air Federated Learning

no code implementations26 May 2023 Yuchang Sun, Zehong Lin, Yuyi Mao, Shi Jin, Jun Zhang

In this paper, we propose a probabilistic device scheduling framework for over-the-air FL, named PO-FL, to mitigate the negative impact of channel noise, where each device is scheduled according to a certain probability and its model update is reweighted using this probability in aggregation.

Federated Learning Privacy Preserving +1

Towards Large-scale Single-shot Millimeter-wave Imaging for Low-cost Security Inspection

no code implementations25 May 2023 Liheng Bian, Daoyu Li, Shuoguang Wang, Chunyang Teng, Huteng Liu, Hanwen Xu, Xuyang Chang, Guoqiang Zhao, Shiyong Li, Jun Zhang

These elements are then sampled based on the ranking, building the experimentally optimal sparse sampling strategy that reduces the cost of antenna array by up to one order of magnitude.

Image Reconstruction object-detection +1

Task-Oriented Communication with Out-of-Distribution Detection: An Information Bottleneck Framework

1 code implementation21 May 2023 Hongru Li, Wentao Yu, Hengtao He, Jiawei Shao, Shenghui Song, Jun Zhang, Khaled B. Letaief

Task-oriented communication is an emerging paradigm for next-generation communication networks, which extracts and transmits task-relevant information, instead of raw data, for downstream applications.

Informativeness Out-of-Distribution Detection

Joint Activity-Delay Detection and Channel Estimation for Asynchronous Massive Random Access

no code implementations21 May 2023 Xinyu Bian, Yuyi Mao, Jun Zhang

Most existing studies on joint activity detection and channel estimation for grant-free massive random access (RA) systems assume perfect synchronization among all active users, which is hard to achieve in practice.

Action Detection Activity Detection

Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data

1 code implementation13 May 2023 Tailin Zhou, Zehong Lin, Jun Zhang, Danny H. K. Tsang

To gain further insights into model averaging in FL, we decompose the expected loss of the global model into five factors related to the client models.

Federated Learning

Novel deep learning methods for 3D flow field segmentation and classification

no code implementations10 May 2023 Xiaorui Bai, Wenyong Wang, Jun Zhang, Yueqing Wang, Yu Xiang

Flow field segmentation and classification help researchers to understand vortex structure and thus turbulent flow.

Classification Segmentation

Invertible Coarse Graining with Physics-Informed Generative Artificial Intelligence

no code implementations2 May 2023 Jun Zhang, Xiaohan Lin, Weinan E, Yi Qin Gao

Multiscale molecular modeling is widely applied in scientific research of molecular properties over large time and length scales.

DABS: Data-Agnostic Backdoor attack at the Server in Federated Learning

no code implementations2 May 2023 Wenqiang Sun, Sen Li, Yuchang Sun, Jun Zhang

Federated learning (FL) attempts to train a global model by aggregating local models from distributed devices under the coordination of a central server.

Backdoor Attack Federated Learning

Evaluation of a Canonical Image Representation for Sidescan Sonar

1 code implementation18 Apr 2023 Weiqi Xu, Li Ling, Yiping Xie, Jun Zhang, John Folkesson

In this paper, a canonical transformation method consisting of intensity correction and slant range correction is proposed to decrease the above distortion.

Template Matching

CEC: Crowdsourcing-based Evolutionary Computation for Distributed Optimization

no code implementations12 Apr 2023 Feng-Feng Wei, Wei-neng Chen, Xiao-Qi Guo, Bowen Zhao, Sang-Woon Jeon, Jun Zhang

Inspired by this, this paper intends to introduce crowdsourcing into evolutionary computation (EC) to propose a crowdsourcing-based evolutionary computation (CEC) paradigm for distributed optimization.

Distributed Optimization

Grant-free Massive Random Access with Retransmission: Receiver Optimization and Performance Analysis

no code implementations12 Apr 2023 Xinyu Bian, Yuyi Mao, Jun Zhang

Specifically, by jointly leveraging the user activity correlation between adjacent transmission blocks and the historical channel estimation results, we first develop an activity-correlation-aware receiver for grant-free massive RA systems with retransmission based on the correlated approximate message passing (AMP) algorithm.

Action Detection Activity Detection

A Survey on Distributed Evolutionary Computation

no code implementations12 Apr 2023 Wei-neng Chen, Feng-Feng Wei, Tian-Fang Zhao, Kay Chen Tan, Jun Zhang

Based on this taxonomy, existing studies on DEC are reviewed in terms of purpose, parallel structure of the algorithm, parallel model for implementation, and the implementation environment.

Distributed Computing Distributed Optimization

Selective Knowledge Sharing for Privacy-Preserving Federated Distillation without A Good Teacher

1 code implementation4 Apr 2023 Jiawei Shao, Fangzhao Wu, Jun Zhang

While federated learning is promising for privacy-preserving collaborative learning without revealing local data, it remains vulnerable to white-box attacks and struggles to adapt to heterogeneous clients.

Federated Learning Knowledge Distillation +2

Generalized Relation Modeling for Transformer Tracking

1 code implementation CVPR 2023 Shenyuan Gao, Chunluan Zhou, Jun Zhang

Compared with previous two-stream trackers, the recent one-stream tracking pipeline, which allows earlier interaction between the template and search region, has achieved a remarkable performance gain.

Relation

When Evolutionary Computation Meets Privacy

no code implementations22 Mar 2023 Bowen Zhao, Wei-neng Chen, Xiaoguo Li, Ximeng Liu, Qingqi Pei, Jun Zhang

To this end, in this paper, we discuss three typical optimization paradigms (i. e., \textit{centralized optimization, distributed optimization, and data-driven optimization}) to characterize optimization modes of evolutionary computation and propose BOOM to sort out privacy concerns in evolutionary computation.

Distributed Computing Distributed Optimization +1

Low-complexity Deep Video Compression with A Distributed Coding Architecture

1 code implementation21 Mar 2023 Xinjie Zhang, Jiawei Shao, Jun Zhang

This has inspired a distributed coding architecture aiming at reducing the encoding complexity.

Decoder Motion Estimation +1

DSDP: A Blind Docking Strategy Accelerated by GPUs

no code implementations16 Mar 2023 Yupeng Huang, Hong Zhang, Siyuan Jiang, Dajiong Yue, Xiaohan Lin, Jun Zhang, Yi Qin Gao

In this study, we take the advantage of both traditional and machine-learning based methods, and present a method Deep Site and Docking Pose (DSDP) to improve the performance of blind docking.

Blind Docking Drug Discovery

CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

1 code implementation11 Mar 2023 Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen yang, Jinxi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, Min Xu, Mohammad Yaqub, Marie-Claire Blache, Benoît Piégu, Bertrand Vernay, Tim Scherr, Moritz Böhland, Katharina Löffler, Jiachen Li, Weiqin Ying, Chixin Wang, Dagmar Kainmueller, Carola-Bibiane Schönlieb, Shuolin Liu, Dhairya Talsania, Yughender Meda, Prakash Mishra, Muhammad Ridzuan, Oliver Neumann, Marcel P. Schilling, Markus Reischl, Ralf Mikut, Banban Huang, Hsiang-Chin Chien, Ching-Ping Wang, Chia-Yen Lee, Hong-Kun Lin, Zaiyi Liu, Xipeng Pan, Chu Han, Jijun Cheng, Muhammad Dawood, Srijay Deshpande, Raja Muhammad Saad Bashir, Adam Shephard, Pedro Costa, João D. Nunes, Aurélio Campilho, Jaime S. Cardoso, Hrishikesh P S, Densen Puthussery, Devika R G, Jiji C V, Ye Zhang, Zijie Fang, Zhifan Lin, Yongbing Zhang, Chunhui Lin, Liukun Zhang, Lijian Mao, Min Wu, Vi Thi-Tuong Vo, Soo-Hyung Kim, Taebum Lee, Satoshi Kondo, Satoshi Kasai, Pranay Dumbhare, Vedant Phuse, Yash Dubey, Ankush Jamthikar, Trinh Thi Le Vuong, Jin Tae Kwak, Dorsa Ziaei, Hyun Jung, Tianyi Miao, David Snead, Shan E Ahmed Raza, Fayyaz Minhas, Nasir M. Rajpoot

Nuclear detection, segmentation and morphometric profiling are essential in helping us further understand the relationship between histology and patient outcome.

Nuclear Segmentation Segmentation +2

AC2C: Adaptively Controlled Two-Hop Communication for Multi-Agent Reinforcement Learning

no code implementations24 Feb 2023 Xuefeng Wang, Xinran Li, Jiawei Shao, Jun Zhang

Learning communication strategies in cooperative multi-agent reinforcement learning (MARL) has recently attracted intensive attention.

Multi-agent Reinforcement Learning reinforcement-learning +2

Message Passing Meets Graph Neural Networks: A New Paradigm for Massive MIMO Systems

1 code implementation14 Feb 2023 Hengtao He, Xianghao Yu, Jun Zhang, Shenghui Song, Khaled B. Letaief

As one of the core technologies for 5G systems, massive multiple-input multiple-output (MIMO) introduces dramatic capacity improvements along with very high beamforming and spatial multiplexing gains.

Graph Neural Network

Federated attention consistent learning models for prostate cancer diagnosis and Gleason grading

no code implementations13 Feb 2023 Fei Kong, Xiyue Wang, Jinxi Xiang, Sen yang, Xinran Wang, Meng Yue, Jun Zhang, Junhan Zhao, Xiao Han, Yuhan Dong, Biyue Zhu, Fang Wang, Yueping Liu

We assessed the effectiveness of FACL in cancer diagnosis and Gleason grading tasks using 19, 461 whole-slide images of prostate cancer from multiple centers.

Contrastive Learning Ethics +2

Hyperspectral Image Super Resolution with Real Unaligned RGB Guidance

1 code implementation13 Feb 2023 Zeqiang Lai, Ying Fu, Jun Zhang

The features of RGB reference images are then processed by a multi-stage alignment module to explicitly align the features of RGB reference with the LR HSI.

Hyperspectral Image Super-Resolution Image Super-Resolution

In-Context Learning with Many Demonstration Examples

1 code implementation9 Feb 2023 Mukai Li, Shansan Gong, Jiangtao Feng, Yiheng Xu, Jun Zhang, Zhiyong Wu, Lingpeng Kong

Based on EVALM, we scale up the size of examples efficiently in both instruction tuning and in-context learning to explore the boundary of the benefits from more annotated data.

16k 8k +2

LDMIC: Learning-based Distributed Multi-view Image Coding

1 code implementation24 Jan 2023 Xinjie Zhang, Jiawei Shao, Jun Zhang

Multi-view image compression plays a critical role in 3D-related applications.

Data Compression Decoder +1

Artificial intelligence for diagnosing and predicting survival of patients with renal cell carcinoma: Retrospective multi-center study

no code implementations12 Jan 2023 Siteng Chen, Xiyue Wang, Jun Zhang, Liren Jiang, Ning Zhang, Feng Gao, Wei Yang, Jinxi Xiang, Sen yang, Junhua Zheng, Xiao Han

The OSrisk for the prediction of 5-year survival status achieved AUC of 0. 784 (0. 746-0. 819) in the TCGA cohort, which was further verified in the independent General cohort and the CPTAC cohort, with AUC of 0. 774 (0. 723-0. 820) and 0. 702 (0. 632-0. 765), respectively.

whole slide images

Machine Learning for Large-Scale Optimization in 6G Wireless Networks

no code implementations3 Jan 2023 Yandong Shi, Lixiang Lian, Yuanming Shi, Zixin Wang, Yong Zhou, Liqun Fu, Lin Bai, Jun Zhang, Wei zhang

The sixth generation (6G) wireless systems are envisioned to enable the paradigm shift from "connected things" to "connected intelligence", featured by ultra high density, large-scale, dynamic heterogeneity, diversified functional requirements and machine learning capabilities, which leads to a growing need for highly efficient intelligent algorithms.

Computational Efficiency Distributed Optimization +3

Large-scale single-photon imaging

no code implementations28 Dec 2022 Liheng Bian, Haoze Song, Lintao Peng, Xuyang Chang, Xi Yang, Roarke Horstmeyer, Lin Ye, Tong Qin, Dezhi Zheng, Jun Zhang

Benefiting from its single-photon sensitivity, single-photon avalanche diode (SPAD) array has been widely applied in various fields such as fluorescence lifetime imaging and quantum computing.

Super-Resolution

RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning

1 code implementation4 Dec 2022 Boxuan Zhao, Jun Zhang, Deheng Ye, Jian Cao, Xiao Han, Qiang Fu, Wei Yang

Most of the existing methods rely on a multiple instance learning framework that requires densely sampling local patches at high magnification.

Benchmarking Decision Making +4

iEnhancer-ELM: improve enhancer identification by extracting position-related multiscale contextual information based on enhancer language models

1 code implementation3 Dec 2022 Jiahao Li, Zhourun Wu, Wenhao Lin, Jiawei Luo, Jun Zhang, Qingcai Chen, Junjie Chen

Although many feature extraction methods have been proposed to improve the performance of enhancer identification, they cannot learn position-related multiscale contextual information from raw DNA sequences.

Language Modelling Position

An Adaptive and Robust Deep Learning Framework for THz Ultra-Massive MIMO Channel Estimation

1 code implementation29 Nov 2022 Wentao Yu, Yifei Shen, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Khaled B. Letaief

For practical usage, the proposed framework is further extended to wideband THz UM-MIMO systems with beam squint effect.

Lightweight and Flexible Deep Equilibrium Learning for CSI Feedback in FDD Massive MIMO

no code implementations28 Nov 2022 Yifan Ma, Wentao Yu, Xianghao Yu, Jun Zhang, Shenghui Song, Khaled B. Letaief

In this paper, we propose a lightweight and flexible deep learning-based CSI feedback approach by capitalizing on deep equilibrium models.

Task-Oriented Communication for Edge Video Analytics

1 code implementation25 Nov 2022 Jiawei Shao, Xinjie Zhang, Jun Zhang

With the development of artificial intelligence (AI) techniques and the increasing popularity of camera-equipped devices, many edge video analytics applications are emerging, calling for the deployment of computation-intensive AI models at the network edge.

Informativeness

Blind Performance Prediction for Deep Learning Based Ultra-Massive MIMO Channel Estimation

no code implementations15 Nov 2022 Wentao Yu, Hengtao He, Xianghao Yu, Shenghui Song, Jun Zhang, Khaled B. Letaief

Reliability is of paramount importance for the physical layer of wireless systems due to its decisive impact on end-to-end performance.

Stochastic Coded Federated Learning: Theoretical Analysis and Incentive Mechanism Design

no code implementations8 Nov 2022 Yuchang Sun, Jiawei Shao, Yuyi Mao, Songze Li, Jun Zhang

During training, the server computes gradients on the global coded dataset to compensate for the missing model updates of the straggling devices.

Federated Learning Privacy Preserving

Deep Latent Mixture Model for Recommendation

no code implementations27 Oct 2022 Jun Zhang, Ping Li, Wei Wang

Recent advances in neural networks have been successfully applied to many tasks in online recommendation applications.

CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling

1 code implementation14 Oct 2022 Jun Zhang, Shuyang Jiang, Jiangtao Feng, Lin Zheng, Lingpeng Kong

In this paper, we propose Comprehensive Attention Benchmark (CAB) under a fine-grained attention taxonomy with four distinguishable attention patterns, namely, noncausal self, causal self, noncausal cross, and causal cross attentions.

Benchmarking Long-range modeling

PARAGEN : A Parallel Generation Toolkit

1 code implementation7 Oct 2022 Jiangtao Feng, Yi Zhou, Jun Zhang, Xian Qian, Liwei Wu, Zhexi Zhang, Yanming Liu, Mingxuan Wang, Lei LI, Hao Zhou

PARAGEN is a PyTorch-based NLP toolkit for further development on parallel generation.

Model Selection

DReS-FL: Dropout-Resilient Secure Federated Learning for Non-IID Clients via Secret Data Sharing

no code implementations6 Oct 2022 Jiawei Shao, Yuchang Sun, Songze Li, Jun Zhang

Federated learning (FL) strives to enable collaborative training of machine learning models without centrally collecting clients' private data.

Federated Learning

SoccerNet 2022 Challenges Results

7 code implementations5 Oct 2022 Silvio Giancola, Anthony Cioppa, Adrien Deliège, Floriane Magera, Vladimir Somers, Le Kang, Xin Zhou, Olivier Barnich, Christophe De Vleeschouwer, Alexandre Alahi, Bernard Ghanem, Marc Van Droogenbroeck, Abdulrahman Darwish, Adrien Maglo, Albert Clapés, Andreas Luyts, Andrei Boiarov, Artur Xarles, Astrid Orcesi, Avijit Shah, Baoyu Fan, Bharath Comandur, Chen Chen, Chen Zhang, Chen Zhao, Chengzhi Lin, Cheuk-Yiu Chan, Chun Chuen Hui, Dengjie Li, Fan Yang, Fan Liang, Fang Da, Feng Yan, Fufu Yu, Guanshuo Wang, H. Anthony Chan, He Zhu, Hongwei Kan, Jiaming Chu, Jianming Hu, Jianyang Gu, Jin Chen, João V. B. Soares, Jonas Theiner, Jorge De Corte, José Henrique Brito, Jun Zhang, Junjie Li, Junwei Liang, Leqi Shen, Lin Ma, Lingchi Chen, Miguel Santos Marques, Mike Azatov, Nikita Kasatkin, Ning Wang, Qiong Jia, Quoc Cuong Pham, Ralph Ewerth, Ran Song, RenGang Li, Rikke Gade, Ruben Debien, Runze Zhang, Sangrok Lee, Sergio Escalera, Shan Jiang, Shigeyuki Odashima, Shimin Chen, Shoichi Masui, Shouhong Ding, Sin-wai Chan, Siyu Chen, Tallal El-Shabrawy, Tao He, Thomas B. Moeslund, Wan-Chi Siu, Wei zhang, Wei Li, Xiangwei Wang, Xiao Tan, Xiaochuan Li, Xiaolin Wei, Xiaoqing Ye, Xing Liu, Xinying Wang, Yandong Guo, YaQian Zhao, Yi Yu, YingYing Li, Yue He, Yujie Zhong, Zhenhua Guo, Zhiheng Li

The SoccerNet 2022 challenges were the second annual video understanding challenges organized by the SoccerNet team.

Action Spotting Camera Calibration +3

Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval

no code implementations27 Sep 2022 Chengzhi Lin, AnCong Wu, Junwei Liang, Jun Zhang, Wenhang Ge, Wei-Shi Zheng, Chunhua Shen

To address this problem, we propose a Text-Adaptive Multiple Visual Prototype Matching model, which automatically captures multiple prototypes to describe a video by adaptive aggregation of video token features.

Cross-Modal Retrieval Text Retrieval +1

Multi-dataset Training of Transformers for Robust Action Recognition

1 code implementation26 Sep 2022 Junwei Liang, Enwei Zhang, Jun Zhang, Chunhua Shen

We study the task of robust feature representations, aiming to generalize well on multiple datasets for action recognition.

Action Recognition Temporal Action Localization

18