Search Results for author: Haoyang Li

Found 56 papers, 25 papers with code

Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification

no code implementations6 Jan 2025 YuBo Wang, Haoyang Li, Fei Teng, Lei Chen

While neural network-based models, such as CNN and BERT, have demonstrated remarkable performance in text classification, their effectiveness heavily relies on abundant labeled training data.

Data Integration Few-Shot Text Classification +2

Revisiting Graph Neural Networks on Graph-level Tasks: Comprehensive Experiments, Analysis, and Improvements

no code implementations1 Jan 2025 Haoyang Li, Yuming Xu, Chen Jason Zhang, Alexander Zhou, Lei Chen, Qing Li

Graph-level tasks, which predict properties or classes for the entire graph, are critical for applications, such as molecular property prediction and subgraph counting.

Contrastive Learning Graph Classification +3

A Survey on Large Language Model Acceleration based on KV Cache Management

1 code implementation27 Dec 2024 Haoyang Li, Yiming Li, Anxin Tian, Tianhao Tang, Zhanchao Xu, Xuejia Chen, Nicole Hu, Wei Dong, Qing Li, Lei Chen

This survey provides a comprehensive overview of KV cache management strategies for LLM acceleration, categorizing them into token-level, model-level, and system-level optimizations.

Language Modeling Language Modelling +5

An Investigation on the Potential of KAN in Speech Enhancement

no code implementations23 Dec 2024 Haoyang Li, Yuchen Hu, Chen Chen, Eng Siong Chng

High-fidelity speech enhancement often requires sophisticated modeling to capture intricate, multiscale patterns.

Kolmogorov-Arnold Networks Speech Enhancement

Demystifying Workload Imbalances in Large Transformer Model Training over Variable-length Sequences

no code implementations10 Dec 2024 Haoyang Li, Fangcheng Fu, Sheng Lin, Hao Ge, XuanYu Wang, Jiawen Niu, Jie Jiang, Bin Cui

To optimize large Transformer model training, efficient parallel computing and advanced data management are essential.

Management

Learning Symmetry-Independent Jet Representations via Jet-Based Joint Embedding Predictive Architecture

no code implementations5 Dec 2024 Subash Katel, Haoyang Li, Zihan Zhao, Raghav Kansal, Farouk Mokhtar, Javier Duarte

In high energy physics, self-supervised learning (SSL) methods have the potential to aid in the creation of machine learning models without the need for labeled datasets for a variety of tasks, including those related to jets -- narrow sprays of particles produced by quarks and gluons in high energy particle collisions.

Jet Tagging Self-Supervised Learning

Reconstruction of boosted and resolved multi-Higgs-boson events with symmetry-preserving attention networks

no code implementations5 Dec 2024 Haoyang Li, Marko Stamenkovic, Alexander Shmakov, Michael Fenton, Darius Shih-Chieh Chao, Kaitlyn Maiya White, Caden Mikkelsen, Jovan Mitic, Cristina Mantilla Suarez, Melissa Quinnan, Greg Landsberg, Harvey Newman, Pierre Baldi, Daniel Whiteson, Javier Duarte

However, the complexity of jet assignment increases when simultaneously considering both $H\rightarrow b\bar{b}$ reconstruction possibilities, i. e., two "resolved" small-radius jets each containing a shower initiated by a $b$-quark or one "boosted" large-radius jet containing a merged shower initiated by a $b\bar{b}$ pair.

Noro: A Noise-Robust One-shot Voice Conversion System with Hidden Speaker Representation Capabilities

no code implementations29 Nov 2024 Haorui He, Yuchen Song, Yuancheng Wang, Haoyang Li, Xueyao Zhang, Li Wang, Gongping Huang, Eng Siong Chng, Zhizheng Wu

Experimental results demonstrate that Noro outperforms our baseline system in both clean and noisy scenarios, highlighting its efficacy for real-world applications.

Representation Learning Self-Supervised Learning +1

Subgraph Aggregation for Out-of-Distribution Generalization on Graphs

no code implementations29 Oct 2024 Bowen Liu, Haoyang Li, Shuning Wang, Shuo Nie, Shanghang Zhang

To address these challenges, we propose a novel framework, SubGraph Aggregation (SuGAr), designed to learn a diverse set of subgraphs that are crucial for OOD generalization on graphs.

Molecular Property Prediction Out-of-Distribution Generalization +1

LlamaPartialSpoof: An LLM-Driven Fake Speech Dataset Simulating Disinformation Generation

1 code implementation23 Sep 2024 Hieu-Thi Luong, Haoyang Li, Lin Zhang, Kong Aik Lee, Eng Siong Chng

Previous fake speech datasets were constructed from a defender's perspective to develop countermeasure (CM) systems without considering diverse motivations of attackers.

Language Modeling Language Modelling +3

Impacts of Darwinian Evolution on Pre-trained Deep Neural Networks

no code implementations10 Aug 2024 Guodong Du, Runhua Jiang, Senqiao Yang, Haoyang Li, Wei Chen, Keren Li, Sim Kuan Goh, Ho-Kin Tang

The empirical results show that the proposed framework has positive impacts on the network, with reduced over-fitting and an order of magnitude lower time complexity compared to BP.

Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models

1 code implementation5 Aug 2024 Zi Liang, Haibo Hu, Qingqing Ye, Yaxin Xiao, Haoyang Li

In this paper, we analyze the underlying mechanism of prompt leakage, which we refer to as prompt memorization, and develop corresponding defending strategies.

Memorization

Text-based Talking Video Editing with Cascaded Conditional Diffusion

no code implementations20 Jul 2024 Bo Han, Heqing Zou, Haoyang Li, Guangcong Wang, Chng Eng Siong

The cascaded conditional diffusion model decomposes the complex talking editing task into two flexible generation tasks, which provides a generalizable talking-face representation, seamless audio-visual transitions, and identity-preserved faces on a small dataset.

Video Editing

PixelsDB: Serverless and NL-Aided Data Analytics with Flexible Service Levels and Prices

1 code implementation30 May 2024 Haoqiong Bian, Dongyang Geng, Haoyang Li, Yunpeng Chai, Anastasia Ailamaki

The queries are then executed by a serverless query engine that offers varying prices for different performance service levels (SLAs).

Scheduling

Cardinality Estimation on Hyper-relational Knowledge Graphs

no code implementations24 May 2024 Fei Teng, Haoyang Li, Shimin Di, Lei Chen

However, existing CE methods over KGs achieve unsatisfying performance on HKGs due to the complexity of qualifiers in HKGs.

Graph Neural Network Knowledge Graphs

UniCL: A Universal Contrastive Learning Framework for Large Time Series Models

no code implementations17 May 2024 Jiawei Li, Jingshu Peng, Haoyang Li, Lei Chen

Time-series analysis plays a pivotal role across a range of critical applications, from finance to healthcare, which involves various tasks, such as forecasting and classification.

Contrastive Learning Time Series +1

LLMTune: Accelerate Database Knob Tuning with Large Language Models

1 code implementation17 Apr 2024 Xinmei Huang, Haoyang Li, Jing Zhang, Xinxin Zhao, Zhiming Yao, Yiyan Li, Zhuohao Yu, Tieying Zhang, Hong Chen, Cuiping Li

Database knob tuning is a critical challenge in the database community, aiming to optimize knob values to enhance database performance for specific workloads.

Language Modelling Large Language Model

Spectral Invariant Learning for Dynamic Graphs under Distribution Shifts

1 code implementation NeurIPS 2023 Zeyang Zhang, Xin Wang, Ziwei Zhang, Zhou Qin, Weigao Wen, Hui Xue, Haoyang Li, Wenwu Zhu

In this paper, we discover that there exist cases with distribution shifts unobservable in the time domain while observable in the spectral domain, and propose to study distribution shifts on dynamic graphs in the spectral domain for the first time.

Link Prediction Node Classification

Intent-aware Recommendation via Disentangled Graph Contrastive Learning

no code implementations6 Mar 2024 Yuling Wang, Xiao Wang, Xiangzhou Huang, Yanhua Yu, Haoyang Li, Mengdi Zhang, Zirui Guo, Wei Wu

The other is different behaviors have different intent distributions, so how to establish their relations for a more explainable recommender system.

Contrastive Learning Graph Neural Network

CodeS: Towards Building Open-source Language Models for Text-to-SQL

1 code implementation26 Feb 2024 Haoyang Li, Jing Zhang, Hanbing Liu, Ju Fan, Xiaokang Zhang, Jun Zhu, Renjie Wei, Hongyan Pan, Cuiping Li, Hong Chen

To address the limitations, we introduce CodeS, a series of pre-trained language models with parameters ranging from 1B to 15B, specifically designed for the text-to-SQL task.

Data Augmentation Domain Adaptation +2

Bringing Generative AI to Adaptive Learning in Education

no code implementations2 Feb 2024 Hang Li, Tianlong Xu, Chaoli Zhang, Eason Chen, Jing Liang, Xing Fan, Haoyang Li, Jiliang Tang, Qingsong Wen

The recent surge in generative AI technologies, such as large language models and diffusion models, has boosted the development of AI applications in various domains, including science, finance, and education.

Position

Cross-Space Adaptive Filter: Integrating Graph Topology and Node Attributes for Alleviating the Over-smoothing Problem

1 code implementation26 Jan 2024 Chen Huang, Haoyang Li, Yifan Zhang, Wenqiang Lei, Jiancheng Lv

To this end, various methods have been proposed to create an adaptive filter by incorporating an extra filter (e. g., a high-pass filter) extracted from the graph topology.

Attribute Node Classification

Graph Invariant Learning with Subgraph Co-mixup for Out-Of-Distribution Generalization

1 code implementation18 Dec 2023 Tianrui Jia, Haoyang Li, Cheng Yang, Tao Tao, Chuan Shi

In this paper, we propose a novel graph invariant learning method based on invariant and variant patterns co-mixup strategy, which is capable of jointly generating mixed multiple environments and capturing invariant patterns from the mixed graph data.

Graph Representation Learning Out-of-Distribution Generalization

Out-of-Distribution Generalized Dynamic Graph Neural Network with Disentangled Intervention and Invariance Promotion

no code implementations24 Nov 2023 Zeyang Zhang, Xin Wang, Ziwei Zhang, Haoyang Li, Wenwu Zhu

In this paper, we propose Disentangled Intervention-based Dynamic graph Attention networks with Invariance Promotion (I-DIDA) to handle spatio-temporal distribution shifts in dynamic graphs by discovering and utilizing invariant patterns, i. e., structures and features whose predictive abilities are stable across distribution shifts.

Graph Attention Graph Neural Network

LLM4DyG: Can Large Language Models Solve Spatial-Temporal Problems on Dynamic Graphs?

1 code implementation26 Oct 2023 Zeyang Zhang, Xin Wang, Ziwei Zhang, Haoyang Li, Yijian Qin, Wenwu Zhu

Our main observations are: 1) LLMs have preliminary spatial-temporal understanding abilities on dynamic graphs, 2) Dynamic graph tasks show increasing difficulties for LLMs as the graph size and density increase, while not sensitive to the time span and data generation mechanism, 3) the proposed DST2 prompting method can help to improve LLMs' spatial-temporal understanding abilities on dynamic graphs for most tasks.

Automated Bioinformatics Analysis via AutoBA

1 code implementation6 Sep 2023 Juexiao Zhou, Bin Zhang, Xiuying Chen, Haoyang Li, Xiaopeng Xu, Siyuan Chen, Xin Gao

With the fast-growing and evolving omics data, the demand for streamlined and adaptable tools to handle the analysis continues to grow.

AI Agent Language Modeling +1

Graph Meets LLMs: Towards Large Graph Models

1 code implementation28 Aug 2023 Ziwei Zhang, Haoyang Li, Zeyang Zhang, Yijian Qin, Xin Wang, Wenwu Zhu

In order to promote applying large models for graphs forward, we present a perspective paper to discuss the challenges and opportunities associated with developing large graph models.

Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference

2 code implementations27 May 2023 Zihao Yu, Haoyang Li, Fangcheng Fu, Xupeng Miao, Bin Cui

The key intuition behind our approach is to utilize the semantic mapping between the minor modifications on the input text and the affected regions on the output image.

Text-to-Image Generation

Personalized and privacy-preserving federated heterogeneous medical image analysis with PPPML-HMI

1 code implementation20 Feb 2023 Juexiao Zhou, Longxi Zhou, Di Wang, Xiaopeng Xu, Haoyang Li, Yuetan Chu, Wenkai Han, Xin Gao

However, there are few open-source frameworks for federated heterogeneous medical image analysis with personalization and privacy protection simultaneously without the demand to modify the existing model structures or to share any private data.

Medical Image Analysis Privacy Preserving

Audit to Forget: A Unified Method to Revoke Patients' Private Data in Intelligent Healthcare

1 code implementation20 Feb 2023 Juexiao Zhou, Haoyang Li, Xingyu Liao, Bin Zhang, Wenjia He, Zhongxiao Li, Longxi Zhou, Xin Gao

Revoking personal private data is one of the basic human rights, which has already been sheltered by several privacy-preserving laws in many countries.

Deep Learning Privacy Preserving

RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL

1 code implementation12 Feb 2023 Haoyang Li, Jing Zhang, Cuiping Li, Hong Chen

Due to the structural property of the SQL queries, the seq2seq model takes the responsibility of parsing both the schema items (i. e., tables and columns) and the skeleton (i. e., SQL keywords).

Decoder Language Modeling +3

Curriculum Graph Machine Learning: A Survey

no code implementations6 Feb 2023 Haoyang Li, Xin Wang, Wenwu Zhu

To the best of our knowledge, this paper is the first survey for curriculum graph machine learning.

Model Optimization Survey

FAIR AI Models in High Energy Physics

no code implementations9 Dec 2022 Javier Duarte, Haoyang Li, Avik Roy, Ruike Zhu, E. A. Huerta, Daniel Diaz, Philip Harris, Raghav Kansal, Daniel S. Katz, Ishaan H. Kavoori, Volodymyr V. Kindratenko, Farouk Mokhtar, Mark S. Neubauer, Sang Eon Park, Melissa Quinnan, Roger Rusack, Zhizhen Zhao

The findable, accessible, interoperable, and reusable (FAIR) data principles provide a framework for examining, evaluating, and improving how data is shared to facilitate scientific discovery.

Graph Neural Network scientific discovery +1

Unsupervised Visual Defect Detection with Score-Based Generative Model

no code implementations29 Nov 2022 Yapeng Teng, Haoyang Li, Fuzhen Cai, Ming Shao, Siyu Xia

Thus, we focus on the unsupervised visual defect detection and localization tasks and propose a novel framework based on the recent score-based generative models, which synthesize the real image by iterative denoising through stochastic differential equations (SDEs).

Anomaly Detection Defect Detection +1

Use Classifier as Generator

1 code implementation10 Sep 2022 Haoyang Li

Image recognition/classification is a widely studied problem, but its reverse problem, image generation, has drawn much less attention until recently.

Image Generation

Out-Of-Distribution Generalization on Graphs: A Survey

1 code implementation16 Feb 2022 Haoyang Li, Xin Wang, Ziwei Zhang, Wenwu Zhu

This paper is the first systematic and comprehensive review of OOD generalization on graphs, to the best of our knowledge.

Out-of-Distribution Generalization Survey

Automated Graph Machine Learning: Approaches, Libraries, Benchmarks and Directions

1 code implementation4 Jan 2022 Xin Wang, Ziwei Zhang, Haoyang Li, Wenwu Zhu

However, as the literature on graph learning booms with a vast number of emerging methods and techniques, it becomes increasingly difficult to manually design the optimal machine learning algorithm for different graph-related tasks.

BIG-bench Machine Learning Graph Learning +1

OOD-GNN: Out-of-Distribution Generalized Graph Neural Network

no code implementations7 Dec 2021 Haoyang Li, Xin Wang, Ziwei Zhang, Wenwu Zhu

Our proposed OOD-GNN employs a novel nonlinear graph representation decorrelation method utilizing random Fourier features, which encourages the model to eliminate the statistical dependence between relevant and irrelevant graph representations through iteratively optimizing the sample graph weights and graph encoder.

Graph Neural Network Out-of-Distribution Generalization

Disentangled Contrastive Learning on Graphs

no code implementations NeurIPS 2021 Haoyang Li, Xin Wang, Ziwei Zhang, Zehuan Yuan, Hang Li, Wenwu Zhu

Then we propose a novel factor-wise discrimination objective in a contrastive learning manner, which can force the factorized representations to independently reflect the expressive information from different latent factors.

Contrastive Learning Self-Supervised Learning

Noise Modulation: Let Your Model Interpret Itself

no code implementations19 Mar 2021 Haoyang Li, Xinggang Wang

Given the great success of Deep Neural Networks(DNNs) and the black-box nature of it, the interpretability of these models becomes an important issue. The majority of previous research works on the post-hoc interpretation of a trained model. But recently, adversarial training shows that it is possible for a model to have an interpretable input-gradient through training. However, adversarial training lacks efficiency for interpretability. To resolve this problem, we construct an approximation of the adversarial perturbations and discover a connection between adversarial training and amplitude modulation.

Quantitative Evaluations on Saliency Methods: An Experimental Study

no code implementations31 Dec 2020 Xiao-Hui Li, Yuhan Shi, Haoyang Li, Wei Bai, Yuanwei Song, Caleb Chen Cao, Lei Chen

It has been long debated that eXplainable AI (XAI) is an important topic, but it lacks rigorous definition and fair metrics.

Explainable Artificial Intelligence (XAI)

Billion-scale Network Embedding with Iterative Random Projection

2 code implementations7 May 2018 Ziwei Zhang, Peng Cui, Haoyang Li, Xiao Wang, Wenwu Zhu

Network embedding, which learns low-dimensional vector representation for nodes in the network, has attracted considerable research attention recently.

Distributed Computing Link Prediction +2

Cannot find the paper you are looking for? You can Submit a new open access paper.