Search Results for author: Cheng Luo

Found 57 papers, 28 papers with code

MOM: Memory-Efficient Offloaded Mini-Sequence Inference for Long Context Language Models

no code implementations16 Apr 2025 Junyang Zhang, Tianyi Zhu, Cheng Luo, Anima Anandkumar

On Meta-Llama-3. 2-8B, MOM extends the maximum context length from 155k to 455k tokens on a single A100 80GB GPU, while keeping outputs identical and not compromising accuracy.

Algorithm Design and Prototype Validation for Reconfigurable Intelligent Sensing Surface: Forward-Only Transmission

no code implementations31 Mar 2025 Cheng Luo, Luping Xiang, Jie Hu, Kun Yang

By leveraging sensing results from the active elements, we propose communication enhancement and robust interference suppression schemes for both near-field and far-field models, implemented through the passive elements.

Low-Complexity Beamforming Design for Null Space-based Simultaneous Wireless Information and Power Transfer Systems

no code implementations11 Mar 2025 Cheng Luo, Jie Hu, Luping Xiang, Kun Yang

Simultaneous wireless information and power transfer (SWIPT) is a promising technology for the upcoming sixth-generation (6G) communication networks, enabling internet of things (IoT) devices and sensors to extend their operational lifetimes.

Reconfigurable Intelligent Sensing Surface enables Wireless Powered Communication Networks: Interference Suppression and Massive Wireless Energy Transfer

no code implementations11 Mar 2025 Cheng Luo, Jie Hu, Luping Xiang, Kun Yang

Recently, a novel structures of reconfigurable intelligent surface (RIS) integrating both passive and active elements, termed reconfigurable intelligent sensing surface (RISS), efficiently addresses challenges in RIS channel estimation and mitigates issues related to multiplicative path loss by processing the signal at the RISS.

Fairness

Bedrock Models in Communication and Sensing: Advancing Generalization, Transferability, and Performance

no code implementations11 Mar 2025 Cheng Luo, Luping Xiang, Jie Hu, Kun Yang

Deep learning (DL) has emerged as a powerful tool for addressing the intricate challenges inherent in communication and sensing systems, significantly enhancing the intelligence of future sixth-generation (6G) networks.

CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation

1 code implementation25 Feb 2025 Haitao Li, Jiaying Ye, Yiran Hu, Jia Chen, Qingyao Ai, Yueyue Wu, Junjie Chen, Yifan Chen, Cheng Luo, Quan Zhou, Yiqun Liu

To the best of our knowledge, CaseGen is the first benchmark designed to evaluate LLMs in the context of legal case document generation.

Legal Reasoning

Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts

no code implementations18 Feb 2025 Leiyu Pan, Zhenpeng Su, Minxuan Lv, Yizhe Xiong, Xiangwen Zhang, Zijia Lin, Hui Chen, Jungong Han, Guiguang Ding, Cheng Luo, Di Zhang, Kun Gai, Deyi Xiong

Moreover, we find that Finedeep achieves optimal results when balancing depth and width, specifically by adjusting the number of expert sub-layers and the number of experts per sub-layer.

Efficient Exploration

HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading

1 code implementation18 Feb 2025 Cheng Luo, Zefan Cai, Hanshi Sun, Jinqi Xiao, Bo Yuan, Wen Xiao, Junjie Hu, Jiawei Zhao, Beidi Chen, Anima Anandkumar

Extending the context length has disproportionately shifted the memory footprint of LLMs during inference to the key-value cache (KV cache).

Computational Efficiency

Benchmarking Graph Representations and Graph Neural Networks for Multivariate Time Series Classification

1 code implementation14 Jan 2025 Wennuo Yang, shiling Wu, Yuzhi Zhou, Cheng Luo, Xilin He, Weicheng Xie, Linlin Shen, Siyang Song

Multivariate Time Series Classification (MTSC) enables the analysis if complex temporal data, and thus serves as a cornerstone in various real-world applications, ranging from healthcare to finance.

Benchmarking Graph Representation Learning +2

Towards Combating Frequency Simplicity-biased Learning for Domain Generalization

1 code implementation21 Oct 2024 Xilin He, Jingyu Hu, Qinliang Lin, Cheng Luo, Weicheng Xie, Siyang Song, Muhammad Haris Khan, Linlin Shen

Given the theoretical justification of models' biased learning behavior on different spatial frequency components, which is based on the dataset frequency properties, we argue that the learning behavior on various frequency components could be manipulated by changing the dataset statistical structure in the Fourier domain.

Data Augmentation Domain Generalization

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

no code implementations13 Oct 2024 Xilin He, Cheng Luo, Xiaole Xian, Bing Li, Siyang Song, Muhammad Haris Khan, Weicheng Xie, Linlin Shen, ZongYuan Ge

Facial expression datasets remain limited in scale due to privacy concerns, the subjectivity of annotations, and the labor-intensive nature of data collection.

Facial Expression Recognition Pseudo Label +1

CLIP Multi-modal Hashing for Multimedia Retrieval

no code implementations10 Oct 2024 Jian Zhu, Mingkai Sheng, Zhangmin Huang, Jingfei Chang, Jinling Jiang, Jian Long, Cheng Luo, Lei Liu

Multi-modal hashing methods are widely used in multimedia retrieval, which can fuse multi-source data to generate binary hash code.

Retrieval

Optimizing Placement and Power Allocation in Reconfigurable Intelligent Sensing Surfaces for Enhanced Sensing and Communication Performance

no code implementations10 Sep 2024 Cheng Luo, Jie Hu, Luping Xiang, Kun Yang, Bo Lei

In this letter, we investigate the design of multiple reconfigurable intelligent sensing surfaces (RISSs) that enhance both communication and sensing tasks.

Multi-SIGATnet: A multimodal schizophrenia MRI classification algorithm using sparse interaction mechanisms and graph attention networks

no code implementations25 Aug 2024 Yuhong Jiao, Jiaqing Miao, Jinnan Gong, Hui He, Ping Liang, Cheng Luo, Ying Tan

To effectively capture the topological information of brain neural networks, a novel multimodal graph attention network based on sparse interaction mechanism (Multi-SIGATnet) was proposed for SZ classification was proposed for SZ classification.

Graph Attention MRI classification +1

Mini-Sequence Transformer: Optimizing Intermediate Memory for Long Sequences Training

1 code implementation22 Jul 2024 Cheng Luo, Jiawei Zhao, Zhuoming Chen, Beidi Chen, Anima Anandkumar

We introduce Mini-Sequence Transformer (MsT), a simple and effective methodology for highly efficient and accurate LLM training with extremely long sequences.

LeKUBE: A Legal Knowledge Update BEnchmark

1 code implementation19 Jul 2024 Changyue Wang, Weihang Su, Hu Yiran, Qingyao Ai, Yueyue Wu, Cheng Luo, Yiqun Liu, Min Zhang, Shaoping Ma

Existing benchmarks for evaluating knowledge update methods are mostly designed for the open domain and cannot address the specific challenges of the legal domain, such as the nuanced application of new legal knowledge, the complexity and lengthiness of legal regulations, and the intricate nature of legal reasoning.

Legal Reasoning

ISPO: An Integrated Ontology of Symptom Phenotypes for Semantic Integration of Traditional Chinese Medical Data

no code implementations8 Jul 2024 Zixin Shu, Rui Hua, Dengying Yan, Chenxia Lu, Ning Xu, Jun Li, Hui Zhu, Jia Zhang, Dan Zhao, Chenyang Hui, Junqiu Ye, Chu Liao, Qi Hao, Wen Ye, Cheng Luo, Xinyan Wang, Chuang Cheng, XiaoDong Li, Baoyan Liu, Xiaji Zhou, Runshun Zhang, Min Xu, Xuezhong Zhou

Methods: To construct an integrated ontology of symptom phenotypes (ISPO), we manually annotated classical TCM textbooks and large-scale Chinese electronic medical records (EMRs) to collect symptom terms with support from a medical text annotation system.

text annotation

Enhancing Vision-Language Models Generalization via Diversity-Driven Novel Feature Synthesis

no code implementations4 May 2024 Siyuan Yan, Cheng Luo, Zhen Yu, ZongYuan Ge

To address this, we propose a plug-and-play feature synthesis method called LDFS (Language-Guided Diverse Feature Synthesis) to synthesize new domain features and improve existing CLIP fine-tuning strategies.

Diversity Zero-shot Generalization

Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition

1 code implementation CVPR 2024 Zihan Wang, Siyang Song, Cheng Luo, Songhe Deng, Weicheng Xie, Linlin Shen

Human facial action units (AUs) are mutually related in a hierarchical manner, as not only they are associated with each other in both spatial and temporal domains but also AUs located in the same/close facial regions show stronger relationships than those of different facial regions.

Facial Action Unit Detection

Conversational Disease Diagnosis via External Planner-Controlled Large Language Models

1 code implementation4 Apr 2024 Zhoujian Sun, Cheng Luo, Ziyi Liu, Zhengxing Huang

We demonstrated that our system obtained impressive performance in both disease screening and differential diagnoses tasks.

Active Learning Decision Making +4

Deep Prompt Multi-task Network for Abuse Language Detection

no code implementations8 Mar 2024 Jian Zhu, YuPing Ruan, Jingfei Chang, Wenhui Sun, Hui Wan, Jian Long, Cheng Luo

To address the problem, we propose a novel Deep Prompt Multi-task Network (DPMN) for abuse language detection.

Abusive Language General Knowledge +1

REACT 2024: the Second Multiple Appropriate Facial Reaction Generation Challenge

2 code implementations10 Jan 2024 Siyang Song, Micol Spitale, Cheng Luo, Cristina Palmero, German Barquero, Hengde Zhu, Sergio Escalera, Michel Valstar, Tobias Baur, Fabien Ringeval, Elisabeth Andre, Hatice Gunes

In dyadic interactions, humans communicate their intentions and state of mind using verbal and non-verbal cues, where multiple different facial reactions might be appropriate in response to a specific speaker behaviour.

IEKM: A Model Incorporating External Keyword Matrices

no code implementations21 Nov 2023 Cheng Luo, Qin Li, Zhao Yan, Mengliang Rao, Yunbo Cao

In this paper, we propose an incorporation external keywords matrices model (IEKM) to address these challenges.

model Semantic Similarity +3

Massive Wireless Energy Transfer without Channel State Information via Imperfect Intelligent Reflecting Surfaces

no code implementations15 Nov 2023 Cheng Luo, Jie Hu, Luping Xiang, Kun Yang, Kai-Kit Wong

Intelligent Reflecting Surface (IRS) utilizes low-cost, passive reflecting elements to enhance the passive beam gain, improve Wireless Energy Transfer (WET) efficiency, and enable its deployment for numerous Internet of Things (IoT) devices.

RTP: Rethinking Tensor Parallelism with Memory Deduplication

1 code implementation2 Nov 2023 Cheng Luo, Tianle Zhong, Geoffrey Fox

In the evolving landscape of neural network models, one prominent challenge stand out: the significant memory overheads associated with training expansive models.

CTP:A Causal Interpretable Model for Non-Communicable Disease Progression Prediction

1 code implementation18 Aug 2023 Zhoujian Sun, Wenzhuo Zhang, Zhengxing Huang, Nai Ding, Cheng Luo

The CTP model combines trajectory prediction and causal discovery to enable accurate prediction of disease progression trajectories and uncover causal relationships between features.

Causal Discovery Decision Making +2

MRecGen: Multimodal Appropriate Reaction Generator

no code implementations5 Jul 2023 Jiaqi Xu, Cheng Luo, Weicheng Xie, Linlin Shen, Xiaofeng Liu, Lu Liu, Hatice Gunes, Siyang Song

Verbal and non-verbal human reaction generation is a challenging task, as different reactions could be appropriate for responding to the same behaviour.

REACT2023: the first Multi-modal Multiple Appropriate Facial Reaction Generation Challenge

2 code implementations11 Jun 2023 Siyang Song, Micol Spitale, Cheng Luo, German Barquero, Cristina Palmero, Sergio Escalera, Michel Valstar, Tobias Baur, Fabien Ringeval, Elisabeth Andre, Hatice Gunes

The Multi-modal Multiple Appropriate Facial Reaction Generation Challenge (REACT2023) is the first competition event focused on evaluating multimedia processing and machine learning techniques for generating human-appropriate facial reactions in various dyadic interaction scenarios, with all participants competing strictly under the same conditions.

ReactFace: Online Multiple Appropriate Facial Reaction Generation in Dyadic Interactions

1 code implementation25 May 2023 Cheng Luo, Siyang Song, Weicheng Xie, Micol Spitale, ZongYuan Ge, Linlin Shen, Hatice Gunes

To address these limitations, this paper reformulates the task as an extrapolation or prediction problem, and proposes an novel framework (called ReactFace) to generate multiple different but appropriate facial reactions from a speaker behaviour rather than merely replicating the corresponding listener facial behaviours.

Semi-Supervised RF Fingerprinting with Consistency-Based Regularization

no code implementations28 Apr 2023 Weidong Wang, Cheng Luo, Jiancheng An, Lu Gan, Hongshu Liao, Chau Yuen

As a promising non-password authentication technology, radio frequency (RF) fingerprinting can greatly improve wireless security.

Data Augmentation

Temporal Dynamic Synchronous Functional Brain Network for Schizophrenia Diagnosis and Lateralization Analysis

1 code implementation31 Mar 2023 Cheng Zhu, Ying Tan, Shuqi Yang, Jiaqing Miao, JiaYi Zhu, Huan Huang, Dezhong Yao, Cheng Luo

The available evidence suggests that dynamic functional connectivity (dFC) can capture time-varying abnormalities in brain activity in resting-state cerebral functional magnetic resonance imaging (rs-fMRI) data and has a natural advantage in uncovering mechanisms of abnormal brain activity in schizophrenia(SZ) patients.

Functional Connectivity

Spatio-Temporal AU Relational Graph Representation Learning For Facial Action Units Detection

1 code implementation19 Mar 2023 Zihan Wang, Siyang Song, Cheng Luo, Yuzhi Zhou, shiling Wu, Weicheng Xie, Linlin Shen

This paper presents our Facial Action Units (AUs) detection submission to the fifth Affective Behavior Analysis in-the-wild Competition (ABAW).

Graph Learning Graph Representation Learning

Shift from Texture-bias to Shape-bias: Edge Deformation-based Augmentation for Robust Object Recognition

no code implementations ICCV 2023 Xilin He, Qinliang Lin, Cheng Luo, Weicheng Xie, Siyang Song, Feng Liu, Linlin Shen

Recent studies have shown the vulnerability of CNNs under perturbation noises, which is partially caused by the reason that the well-trained CNNs are too biased toward the object texture, i. e., they make predictions mainly based on texture cues.

Object Recognition

GRATIS: Deep Learning Graph Representation with Task-specific Topology and Multi-dimensional Edge Features

1 code implementation19 Nov 2022 Siyang Song, Yuxin Song, Cheng Luo, Zhiyuan Song, Selim Kuzucu, Xi Jia, Zhijiang Guo, Weicheng Xie, Linlin Shen, Hatice Gunes

Our framework is effective, robust and flexible, and is a plug-and-play module that can be combined with different backbones and Graph Neural Networks (GNNs) to generate a task-specific graph representation from various graph and non-graph data.

Graph Representation Learning

A Benchmarking Dataset with 2440 Organic Molecules for Volume Distribution at Steady State

1 code implementation10 Nov 2022 Wenwen Liu, Cheng Luo, Hecheng Wang, Fanwang Meng

Conclusions: To the best of our knowledge, this is the largest dataset for VDss, which can be used as the benchmark for computational studies of VDss.

Benchmarking feature selection +1

SemFormer: Semantic Guided Activation Transformer for Weakly Supervised Semantic Segmentation

1 code implementation26 Oct 2022 Junliang Chen, Xiaodong Zhao, Cheng Luo, Linlin Shen

Recent mainstream weakly supervised semantic segmentation (WSSS) approaches are mainly based on Class Activation Map (CAM) generated by a CNN (Convolutional Neural Network) based image classifier.

Weakly supervised Semantic Segmentation Weakly-Supervised Semantic Segmentation

Scale-free and Task-agnostic Attack: Generating Photo-realistic Adversarial Patterns with Patch Quilting Generator

no code implementations12 Aug 2022 Xiangbo Gao, Cheng Luo, Qinliang Lin, Weicheng Xie, Minmin Liu, Linlin Shen, Keerthy Kusumam, Siyang Song

\noindent Traditional L_p norm-restricted image attack algorithms suffer from poor transferability to black box scenarios and poor robustness to defense algorithms.

Adversarial Attack Image Classification +4

GAN-Based Multi-View Video Coding with Spatio-Temporal EPI Reconstruction

no code implementations7 May 2022 Chengdong Lan, Hao Yan, Cheng Luo, Tiesong Zhao

At the decoder side, we combine the SI and adjacent viewpoints to reconstruct intermediate views using the GAN generator.

Decoder Generative Adversarial Network +1

Learning Multi-dimensional Edge Feature-based AU Relation Graph for Facial Action Unit Recognition

2 code implementations2 May 2022 Cheng Luo, Siyang Song, Weicheng Xie, Linlin Shen, Hatice Gunes

While the relationship between a pair of AUs can be complex and unique, existing approaches fail to specifically and explicitly represent such cues for each pair of AUs in each facial display.

Facial Action Unit Detection Relation

Modality-Balanced Embedding for Video Retrieval

no code implementations18 Apr 2022 Xun Wang, Bingqing Ke, Xuanping Li, Fangyu Liu, Mingyu Zhang, Xiao Liang, Qiushi Xiao, Cheng Luo, Yue Yu

This modality imbalanceresults from a) modality gap: the relevance between a query and a video text is much easier to learn as the query is also a piece of text, with the same modality as the video text; b) data bias: most training samples can be solved solely by text matching.

Retrieval Text Matching +1

Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity

1 code implementation CVPR 2022 Cheng Luo, Qinliang Lin, Weicheng Xie, Bizhu Wu, Jinheng Xie, Linlin Shen

Current adversarial attack research reveals the vulnerability of learning-based classifiers against carefully crafted perturbations.

Adversarial Attack Semantic Similarity +1

Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization

1 code implementation ICCV 2021 Jinheng Xie, Cheng Luo, Xiangping Zhu, Ziqi Jin, Weizeng Lu, Linlin Shen

In the first stage, an activation map generator produces activation maps based on the low-level feature maps in the classifier, such that rich contextual object information is included in an online manner.

Object Weakly-Supervised Object Localization

Prediction, Selection, and Generation: Exploration of Knowledge-Driven Conversation System

no code implementations23 Apr 2021 Cheng Luo, Dayiheng Liu, Chanjuan Li, Li Lu, Jiancheng Lv

The system includes modules such as dialogue topic prediction, knowledge matching and dialogue generation.

Dialogue Generation

CrossoverScheduler: Overlapping Multiple Distributed Training Applications in a Crossover Manner

no code implementations14 Mar 2021 Cheng Luo, Lei Qu, Youshan Miao, Peng Cheng, Yongqiang Xiong

Distributed deep learning workloads include throughput-intensive training tasks on the GPU clusters, where the Distributed Stochastic Gradient Descent (SGD) incurs significant communication delays after backward propagation, forces workers to wait for the gradient synchronization via a centralized parameter server or directly in decentralized workers.

Deep Learning Image Classification

Let's be Humorous: Knowledge Enhanced Humor Generation

no code implementations ACL 2020 Hang Zhang, Dayiheng Liu, Jiancheng Lv, Cheng Luo

To our knowledge, this is the first attempt to generate punchlines with knowledge enhanced model.

Sentence

Neural Document Expansion with User Feedback

1 code implementation8 Aug 2019 Yue Yin, Chenyan Xiong, Cheng Luo, Zhiyuan Liu

This paper presents a neural document expansion approach (NeuDEF) that enriches document representations for neural ranking models.

Temporal Relational Ranking for Stock Prediction

3 code implementations25 Sep 2018 Fuli Feng, Xiangnan He, Xiang Wang, Cheng Luo, Yiqun Liu, Tat-Seng Chua

Our RSR method advances existing solutions in two major aspects: 1) tailoring the deep learning models for stock ranking, and 2) capturing the stock relations in a time-sensitive manner.

Deep Learning Prediction +3

Unbiased Learning to Rank with Unbiased Propensity Estimation

1 code implementation16 Apr 2018 Qingyao Ai, Keping Bi, Cheng Luo, Jiafeng Guo, W. Bruce Croft

We find that the problem of estimating a propensity model from click data is a dual problem of unbiased learning to rank.

Learning-To-Rank parameter estimation

The Dependent Random Measures with Independent Increments in Mixture Models

no code implementations27 Jun 2016 Cheng Luo, Richard Yi Da Xu, Yang Xiang

One of the propositions of the dependent random measures is that the atoms of the posterior distribution are shared amongst groups, and hence groups can borrow information from each other.

Smoothed Hierarchical Dirichlet Process: A Non-Parametric Approach to Constraint Measures

no code implementations16 Apr 2016 Cheng Luo, Yang Xiang, Richard Yi Da Xu

The key novelty of this model is that we place a temporal constraint amongst the nearby discrete measures $\{G_j\}$ in the form of symmetric Kullback-Leibler (KL) Divergence with a fixed bound $B$.

Cannot find the paper you are looking for? You can Submit a new open access paper.