Search Results for author: Yijia Zhang

Found 21 papers, 7 papers with code

ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM

no code implementations · 17 Mar 2025 · WenQiang Wang, Yijia Zhang, Zikai Zhang, Guanting Huo, Hao Liang, Shijie Cao, Ningyi Xu

In this work, we propose ROMA, a QLoRA accelerator with a hybrid storage architecture that uses ROM for quantized base models and SRAM for LoRA weights and KV cache.

Cognitive-Aligned Document Selection for Retrieval-augmented Generation

no code implementations · 17 Feb 2025 · Bingyu Wan, Fuxi Zhang, Zhongpeng Qi, Jiayi Ding, Jijun Li, Baoshi Fan, Yijia Zhang, Jun Zhang

Large language models (LLMs) inherently display hallucinations since the precision of generated texts cannot be guaranteed purely by the parametric knowledge they include.

RAG, Retrieval

STAHGNet: Modeling Hybrid-grained Heterogenous Dependency Efficiently for Traffic Prediction

no code implementations · 23 Dec 2024 · Jiyao Wang, Zehua Peng, Yijia Zhang, Dengbo He, Lei Chen

Traffic flow prediction plays a critical role in intelligent transportation systems, and it is also a challenging task because of the underlying complex spatio-temporal patterns and heterogeneities that evolve over time.

Feature Engineering, Graph Attention +1

UMSPU: Universal Multi-Size Phase Unwrapping via Mutual Self-Distillation and Adaptive Boosting Ensemble Segmenters

no code implementations · 7 Dec 2024 · Lintong Du, Huazhen Liu, Yijia Zhang, Shuxin Liu, Yuan Qu, Zenghui Zhang, Jiamiao Yang

To address this issue, we propose a mutual self-distillation (MSD) mechanism and adaptive boosting ensemble segmenters to construct a universal multi-size phase unwrapping network (UMSPU).

Automating Energy-Efficient GPU Kernel Generation: A Fast Search-Based Compilation Approach

no code implementations · 28 Nov 2024 · Yijia Zhang, Zhihong Gou, Shijie Cao, Weigang Feng, Sicheng Zhang, Guohao Dai, Ningyi Xu

Furthermore, we introduce a dynamic updating strategy for the energy cost model, reducing the need for on-device energy measurements and accelerating the search process.

Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design

1 code implementation · 28 Oct 2024 · Xiangxin Zhou, Jiaqi Guan, Yijia Zhang, Xingang Peng, Liang Wang, Jianzhu Ma

Considering the tremendous success that deep generative models have achieved in structure-based drug design in recent years, we formulate dual-target drug design as a generative task and curate a novel dataset of potential target pairs based on synergistic drug combinations.

Drug Design

Swin-BERT: A Feature Fusion System designed for Speech-based Alzheimer's Dementia Detection

no code implementations · 9 Oct 2024 · Yilin Pan, Yanpei Shi, Yijia Zhang, Mingyu Lu

For the acoustic part, shifted-window multi-head attention, which was proposed to extract local and global information from images, is used to design our acoustic-based system.

Rhythm

Diff4VS: HIV-inhibiting Molecules Generation with Classifier Guidance Diffusion for Virtual Screening

1 code implementation · 20 Jul 2024 · Jiaqing Lyu, Changjie Chen, Bing Liang, Yijia Zhang

The DrugIndex is the ratio of the proportion of candidate drug molecules among the generated molecules to the proportion of candidate drug molecules in the training set.

Drug Design

BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation

2 code implementations · 16 Feb 2024 · Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu

The upscaling of Large Language Models (LLMs) has yielded impressive advances in natural language processing, yet it also poses significant deployment challenges.

Knowledge Distillation, Quantization

AFPQ: Asymmetric Floating Point Quantization for LLMs

1 code implementation · 3 Nov 2023 · Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu

Large language models (LLMs) show great performance in various tasks, but face deployment challenges from limited memory capacity and bandwidth.

Quantization

A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations

1 code implementation · 31 Oct 2023 · Hui Ma, Jian Wang, Hongfei Lin, Bo Zhang, Yijia Zhang, Bo Xu

Emotion recognition in conversations (ERC), the task of recognizing the emotion of each utterance in a conversation, is crucial for building empathetic machines.

Emotion Recognition in Conversation, Multimodal Emotion Recognition

TKwinFormer: Top k Window Attention in Vision Transformers for Feature Matching

no code implementations · 29 Aug 2023 · Yun Liao, Yide Di, Hao Zhou, Kaijun Zhu, Mingyu Lu, Yijia Zhang, Qing Duan, Junhui Liu

Local feature matching remains a challenging task, primarily due to difficulties in matching sparse keypoints and low-texture regions.

Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training

no code implementations · 31 May 2023 · Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu

We find that previous gradient accumulation reduces activation memory but fails to be compatible with gradient memory reduction due to a contradiction between preserving gradients and releasing gradients.

Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models

no code implementations · 21 May 2023 · Yijia Zhang, Lingran Zhao, Shijie Cao, WenQiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu

In this study, we conduct a comparative analysis of INT and FP quantization with the same bit-width, revealing that the optimal quantization format varies across different layers due to the complexity and diversity of tensor distribution.

Quantization

TC-GAT: Graph Attention Network for Temporal Causality Discovery

no code implementations · 21 Apr 2023 · Xiaosong Yuan, Ke Chen, Wanli Zuo, Yijia Zhang

The present study explores the intricacies of causal relationship extraction, a vital component in the pursuit of causality knowledge.

Graph Attention

A Unified Review of Deep Learning for Automated Medical Coding

no code implementations · 8 Jan 2022 · Shaoxiong Ji, Wei Sun, Xiaobo Li, Hang Dong, Ara Taalas, Yijia Zhang, Honghan Wu, Esa Pitkänen, Pekka Marttinen

Automated medical coding, an essential task for healthcare operation and delivery, makes unstructured data manageable by predicting medical codes from clinical documents.

Decoder, Deep Learning

Exploring Semi-supervised Variational Autoencoders for Biomedical Relation Extraction

no code implementations · 18 Jan 2019 · Yijia Zhang, Zhiyong Lu

Experimental results show that our method effectively exploits the unlabeled data to improve the performance and reduce the dependence on labeled data.

Decoder, Relation +1
