no code implementations • 6 Jan 2025 • YuBo Wang, Haoyang Li, Fei Teng, Lei Chen
While neural network-based models, such as CNN and BERT, have demonstrated remarkable performance in text classification, their effectiveness heavily relies on abundant labeled training data.
no code implementations • 1 Jan 2025 • Haoyang Li, Yuming Xu, Chen Jason Zhang, Alexander Zhou, Lei Chen, Qing Li
Graph-level tasks, which predict properties or classes for the entire graph, are critical for applications, such as molecular property prediction and subgraph counting.
1 code implementation • 27 Dec 2024 • Haoyang Li, Yiming Li, Anxin Tian, Tianhao Tang, Zhanchao Xu, Xuejia Chen, Nicole Hu, Wei Dong, Qing Li, Lei Chen
This survey provides a comprehensive overview of KV cache management strategies for LLM acceleration, categorizing them into token-level, model-level, and system-level optimizations.
no code implementations • 23 Dec 2024 • Haoyang Li, Yuchen Hu, Chen Chen, Eng Siong Chng
High-fidelity speech enhancement often requires sophisticated modeling to capture intricate, multiscale patterns.
1 code implementation • 18 Dec 2024 • Jiaqi Xiong, Nan Yin, Shiyang Liang, Haoyang Li, Yingxu Wang, Duo Ai, Fang Pan, JingJie Wang
However, most of these studies have not considered the skewed degree distribution of genes.
no code implementations • 10 Dec 2024 • Haoyang Li, Fangcheng Fu, Sheng Lin, Hao Ge, XuanYu Wang, Jiawen Niu, Jie Jiang, Bin Cui
To optimize large Transformer model training, efficient parallel computing and advanced data management are essential.
no code implementations • 5 Dec 2024 • Subash Katel, Haoyang Li, Zihan Zhao, Raghav Kansal, Farouk Mokhtar, Javier Duarte
In high energy physics, self-supervised learning (SSL) methods have the potential to aid in the creation of machine learning models without the need for labeled datasets for a variety of tasks, including those related to jets -- narrow sprays of particles produced by quarks and gluons in high energy particle collisions.
no code implementations • 5 Dec 2024 • Haoyang Li, Marko Stamenkovic, Alexander Shmakov, Michael Fenton, Darius Shih-Chieh Chao, Kaitlyn Maiya White, Caden Mikkelsen, Jovan Mitic, Cristina Mantilla Suarez, Melissa Quinnan, Greg Landsberg, Harvey Newman, Pierre Baldi, Daniel Whiteson, Javier Duarte
However, the complexity of jet assignment increases when simultaneously considering both $H\rightarrow b\bar{b}$ reconstruction possibilities, i. e., two "resolved" small-radius jets each containing a shower initiated by a $b$-quark or one "boosted" large-radius jet containing a merged shower initiated by a $b\bar{b}$ pair.
no code implementations • 29 Nov 2024 • Haorui He, Yuchen Song, Yuancheng Wang, Haoyang Li, Xueyao Zhang, Li Wang, Gongping Huang, Eng Siong Chng, Zhizheng Wu
Experimental results demonstrate that Noro outperforms our baseline system in both clean and noisy scenarios, highlighting its efficacy for real-world applications.
no code implementations • 29 Oct 2024 • Bowen Liu, Haoyang Li, Shuning Wang, Shuo Nie, Shanghang Zhang
To address these challenges, we propose a novel framework, SubGraph Aggregation (SuGAr), designed to learn a diverse set of subgraphs that are crucial for OOD generalization on graphs.
Molecular Property Prediction Out-of-Distribution Generalization +1
no code implementations • 25 Sep 2024 • Yiwen Hu, Jun Wei, Yuncheng Jiang, Haoyang Li, Shuguang Cui, Zhen Li, Song Wu
Limited by the expensive labeling, polyp segmentation models are plagued by data shortages.
1 code implementation • 23 Sep 2024 • Hieu-Thi Luong, Haoyang Li, Lin Zhang, Kong Aik Lee, Eng Siong Chng
Previous fake speech datasets were constructed from a defender's perspective to develop countermeasure (CM) systems without considering diverse motivations of attackers.
no code implementations • 10 Aug 2024 • Guodong Du, Runhua Jiang, Senqiao Yang, Haoyang Li, Wei Chen, Keren Li, Sim Kuan Goh, Ho-Kin Tang
The empirical results show that the proposed framework has positive impacts on the network, with reduced over-fitting and an order of magnitude lower time complexity compared to BP.
1 code implementation • 5 Aug 2024 • Zi Liang, Haibo Hu, Qingqing Ye, Yaxin Xiao, Haoyang Li
In this paper, we analyze the underlying mechanism of prompt leakage, which we refer to as prompt memorization, and develop corresponding defending strategies.
no code implementations • 5 Aug 2024 • Yiyan Li, Haoyang Li, Zhao Pu, Jing Zhang, Xinyi Zhang, Tao Ji, Luming Sun, Cuiping Li, Hong Chen
Knob tuning plays a crucial role in optimizing databases by adjusting knobs to enhance database performance.
no code implementations • 24 Jul 2024 • Xiuying Chen, Tairan Wang, Taicheng Guo, Kehan Guo, Juexiao Zhou, Haoyang Li, Mingchen Zhuge, Jürgen Schmidhuber, Xin Gao, Xiangliang Zhang
We hope our benchmark and model can facilitate and promote more research on chemical QA.
no code implementations • 20 Jul 2024 • Bo Han, Heqing Zou, Haoyang Li, Guangcong Wang, Chng Eng Siong
The cascaded conditional diffusion model decomposes the complex talking editing task into two flexible generation tasks, which provides a generalizable talking-face representation, seamless audio-visual transitions, and identity-preserved faces on a small dataset.
no code implementations • 8 Jun 2024 • Yanling Wang, Haoyang Li, Hao Zou, Jing Zhang, Xinlei He, Qi Li, Ke Xu
Despite advancements in large language models (LLMs), non-factual responses remain prevalent.
1 code implementation • 1 Jun 2024 • Shengyu Tao, Mengtian Zhang, Zixi Zhao, Haoyang Li, Ruifei Ma, Yunhong Che, Xin Sun, Lin Su, Xiangyu Chen, ZiHao Zhou, Heng Chang, Tingwei Cao, Xiao Xiao, Yaojun Liu, Wenjun Yu, Zhongling Xu, Yang Li, Han Hao, Xuan Zhang, Xiaosong Hu, Guangmin ZHou
Manufacturing complexities and uncertainties have impeded the transition from material prototypes to commercial batteries, making prototype verification critical to quality assessment.
1 code implementation • 30 May 2024 • Haoqiong Bian, Dongyang Geng, Haoyang Li, Yunpeng Chai, Anastasia Ailamaki
The queries are then executed by a serverless query engine that offers varying prices for different performance service levels (SLAs).
no code implementations • 24 May 2024 • Fei Teng, Haoyang Li, Shimin Di, Lei Chen
However, existing CE methods over KGs achieve unsatisfying performance on HKGs due to the complexity of qualifiers in HKGs.
no code implementations • 17 May 2024 • Jiawei Li, Jingshu Peng, Haoyang Li, Lei Chen
Time-series analysis plays a pivotal role across a range of critical applications, from finance to healthcare, which involves various tasks, such as forecasting and classification.
1 code implementation • 17 Apr 2024 • Xinmei Huang, Haoyang Li, Jing Zhang, Xinxin Zhao, Zhiming Yao, Yiyan Li, Zhuohao Yu, Tieying Zhang, Hong Chen, Cuiping Li
Database knob tuning is a critical challenge in the database community, aiming to optimize knob values to enhance database performance for specific workloads.
no code implementations • 18 Mar 2024 • Xufeng Yao, Haoyang Li, Tsz Ho Chan, Wenyi Xiao, Mingxuan Yuan, Yu Huang, Lei Chen, Bei Yu
In the domain of chip design, Hardware Description Languages (HDLs) play a pivotal role.
no code implementations • 13 Mar 2024 • Richard Tong, Haoyang Li, Joleen Liang, Qingsong Wen
Finally, we outline a strategic roadmap for stakeholders to implement these standards, fostering a cohesive and ethical AIED ecosystem.
1 code implementation • NeurIPS 2023 • Zeyang Zhang, Xin Wang, Ziwei Zhang, Zhou Qin, Weigao Wen, Hui Xue, Haoyang Li, Wenwu Zhu
In this paper, we discover that there exist cases with distribution shifts unobservable in the time domain while observable in the spectral domain, and propose to study distribution shifts on dynamic graphs in the spectral domain for the first time.
no code implementations • 6 Mar 2024 • Yuling Wang, Xiao Wang, Xiangzhou Huang, Yanhua Yu, Haoyang Li, Mengdi Zhang, Zirui Guo, Wei Wu
The other is different behaviors have different intent distributions, so how to establish their relations for a more explainable recommender system.
1 code implementation • 26 Feb 2024 • Haoyang Li, Jing Zhang, Hanbing Liu, Ju Fan, Xiaokang Zhang, Jun Zhu, Renjie Wei, Hongyan Pan, Cuiping Li, Hong Chen
To address the limitations, we introduce CodeS, a series of pre-trained language models with parameters ranging from 1B to 15B, specifically designed for the text-to-SQL task.
no code implementations • 2 Feb 2024 • Hang Li, Tianlong Xu, Chaoli Zhang, Eason Chen, Jing Liang, Xing Fan, Haoyang Li, Jiliang Tang, Qingsong Wen
The recent surge in generative AI technologies, such as large language models and diffusion models, has boosted the development of AI applications in various domains, including science, finance, and education.
1 code implementation • 26 Jan 2024 • Chen Huang, Haoyang Li, Yifan Zhang, Wenqiang Lei, Jiancheng Lv
To this end, various methods have been proposed to create an adaptive filter by incorporating an extra filter (e. g., a high-pass filter) extracted from the graph topology.
1 code implementation • 18 Dec 2023 • Tianrui Jia, Haoyang Li, Cheng Yang, Tao Tao, Chuan Shi
In this paper, we propose a novel graph invariant learning method based on invariant and variant patterns co-mixup strategy, which is capable of jointly generating mixed multiple environments and capturing invariant patterns from the mixed graph data.
Graph Representation Learning Out-of-Distribution Generalization
no code implementations • 24 Nov 2023 • Zeyang Zhang, Xin Wang, Ziwei Zhang, Haoyang Li, Wenwu Zhu
In this paper, we propose Disentangled Intervention-based Dynamic graph Attention networks with Invariance Promotion (I-DIDA) to handle spatio-temporal distribution shifts in dynamic graphs by discovering and utilizing invariant patterns, i. e., structures and features whose predictive abilities are stable across distribution shifts.
1 code implementation • 26 Oct 2023 • Zeyang Zhang, Xin Wang, Ziwei Zhang, Haoyang Li, Yijian Qin, Wenwu Zhu
Our main observations are: 1) LLMs have preliminary spatial-temporal understanding abilities on dynamic graphs, 2) Dynamic graph tasks show increasing difficulties for LLMs as the graph size and density increase, while not sensitive to the time span and data generation mechanism, 3) the proposed DST2 prompting method can help to improve LLMs' spatial-temporal understanding abilities on dynamic graphs for most tasks.
1 code implementation • 6 Sep 2023 • Juexiao Zhou, Bin Zhang, Xiuying Chen, Haoyang Li, Xiaopeng Xu, Siyuan Chen, Xin Gao
With the fast-growing and evolving omics data, the demand for streamlined and adaptable tools to handle the analysis continues to grow.
1 code implementation • 28 Aug 2023 • Ziwei Zhang, Haoyang Li, Zeyang Zhang, Yijian Qin, Xin Wang, Wenwu Zhu
In order to promote applying large models for graphs forward, we present a perspective paper to discuss the challenges and opportunities associated with developing large graph models.
2 code implementations • 27 May 2023 • Zihao Yu, Haoyang Li, Fangcheng Fu, Xupeng Miao, Bin Cui
The key intuition behind our approach is to utilize the semantic mapping between the minor modifications on the input text and the affected regions on the output image.
1 code implementation • 20 Feb 2023 • Juexiao Zhou, Longxi Zhou, Di Wang, Xiaopeng Xu, Haoyang Li, Yuetan Chu, Wenkai Han, Xin Gao
However, there are few open-source frameworks for federated heterogeneous medical image analysis with personalization and privacy protection simultaneously without the demand to modify the existing model structures or to share any private data.
1 code implementation • 20 Feb 2023 • Juexiao Zhou, Haoyang Li, Xingyu Liao, Bin Zhang, Wenjia He, Zhongxiao Li, Longxi Zhou, Xin Gao
Revoking personal private data is one of the basic human rights, which has already been sheltered by several privacy-preserving laws in many countries.
1 code implementation • 12 Feb 2023 • Haoyang Li, Jing Zhang, Cuiping Li, Hong Chen
Due to the structural property of the SQL queries, the seq2seq model takes the responsibility of parsing both the schema items (i. e., tables and columns) and the skeleton (i. e., SQL keywords).
Ranked #1 on Semantic Parsing on spider
no code implementations • 6 Feb 2023 • Haoyang Li, Xin Wang, Wenwu Zhu
To the best of our knowledge, this paper is the first survey for curriculum graph machine learning.
no code implementations • 9 Dec 2022 • Javier Duarte, Haoyang Li, Avik Roy, Ruike Zhu, E. A. Huerta, Daniel Diaz, Philip Harris, Raghav Kansal, Daniel S. Katz, Ishaan H. Kavoori, Volodymyr V. Kindratenko, Farouk Mokhtar, Mark S. Neubauer, Sang Eon Park, Melissa Quinnan, Roger Rusack, Zhizhen Zhao
The findable, accessible, interoperable, and reusable (FAIR) data principles provide a framework for examining, evaluating, and improving how data is shared to facilitate scientific discovery.
no code implementations • 29 Nov 2022 • Yapeng Teng, Haoyang Li, Fuzhen Cai, Ming Shao, Siyu Xia
Thus, we focus on the unsupervised visual defect detection and localization tasks and propose a novel framework based on the recent score-based generative models, which synthesize the real image by iterative denoising through stochastic differential equations (SDEs).
1 code implementation • 10 Sep 2022 • Haoyang Li
Image recognition/classification is a widely studied problem, but its reverse problem, image generation, has drawn much less attention until recently.
2 code implementations • International Conference on Data Engineering 2022 • Shendi Wang, Haoyang Li, Caleb Chen Cao, Xiao-Hui Li, Ng Ngai Fai, Jianxin Liu, Xun Xue, Hu Song, Jinyu Li, Guangye Gu, Lei Chen
Recently, neural networks based models have been widely used for recommender systems (RS).
no code implementations • 26 Mar 2022 • Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han, Zhenghao Liu, Ning Ding, Yongming Rao, Yizhao Gao, Liang Zhang, Ming Ding, Cong Fang, Yisen Wang, Mingsheng Long, Jing Zhang, Yinpeng Dong, Tianyu Pang, Peng Cui, Lingxiao Huang, Zheng Liang, HuaWei Shen, HUI ZHANG, Quanshi Zhang, Qingxiu Dong, Zhixing Tan, Mingxuan Wang, Shuo Wang, Long Zhou, Haoran Li, Junwei Bao, Yingwei Pan, Weinan Zhang, Zhou Yu, Rui Yan, Chence Shi, Minghao Xu, Zuobai Zhang, Guoqiang Wang, Xiang Pan, Mengjie Li, Xiaoyu Chu, Zijun Yao, Fangwei Zhu, Shulin Cao, Weicheng Xue, Zixuan Ma, Zhengyan Zhang, Shengding Hu, Yujia Qin, Chaojun Xiao, Zheni Zeng, Ganqu Cui, Weize Chen, Weilin Zhao, Yuan YAO, Peng Li, Wenzhao Zheng, Wenliang Zhao, Ziyi Wang, Borui Zhang, Nanyi Fei, Anwen Hu, Zenan Ling, Haoyang Li, Boxi Cao, Xianpei Han, Weidong Zhan, Baobao Chang, Hao Sun, Jiawen Deng, Chujie Zheng, Juanzi Li, Lei Hou, Xigang Cao, Jidong Zhai, Zhiyuan Liu, Maosong Sun, Jiwen Lu, Zhiwu Lu, Qin Jin, Ruihua Song, Ji-Rong Wen, Zhouchen Lin, LiWei Wang, Hang Su, Jun Zhu, Zhifang Sui, Jiajun Zhang, Yang Liu, Xiaodong He, Minlie Huang, Jian Tang, Jie Tang
With the rapid development of deep learning, training Big Models (BMs) for multiple downstream tasks becomes a popular paradigm.
1 code implementation • 16 Feb 2022 • Haoyang Li, Xin Wang, Ziwei Zhang, Wenwu Zhu
This paper is the first systematic and comprehensive review of OOD generalization on graphs, to the best of our knowledge.
1 code implementation • 4 Jan 2022 • Xin Wang, Ziwei Zhang, Haoyang Li, Wenwu Zhu
However, as the literature on graph learning booms with a vast number of emerging methods and techniques, it becomes increasingly difficult to manually design the optimal machine learning algorithm for different graph-related tasks.
no code implementations • 7 Dec 2021 • Haoyang Li, Xin Wang, Ziwei Zhang, Wenwu Zhu
Our proposed OOD-GNN employs a novel nonlinear graph representation decorrelation method utilizing random Fourier features, which encourages the model to eliminate the statistical dependence between relevant and irrelevant graph representations through iteratively optimizing the sample graph weights and graph encoder.
no code implementations • NeurIPS 2021 • Haoyang Li, Xin Wang, Ziwei Zhang, Zehuan Yuan, Hang Li, Wenwu Zhu
Then we propose a novel factor-wise discrimination objective in a contrastive learning manner, which can force the factorized representations to independently reflect the expressive information from different latent factors.
no code implementations • Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining 2021 • Xiao-Hui Li, Yuhan Shi, Haoyang Li, Wei Bai, Caleb Chen Cao, Lei Chen
It has been long debated that eXplainable AI (XAI) is an important technology for model and data exploration, validation, and debugging.
2 code implementations • ICLR Workshop GTRL 2021 • Ziwei Zhang, Yijian Qin, Zeyang Zhang, Chaoyu Guan, Jie Cai, Heng Chang, Jiyan Jiang, Haoyang Li, Zixin Sun, Beini Xie, Yang Yao, YiPeng Zhang, Xin Wang, Wenwu Zhu
To fill this gap, we present Automated Graph Learning (AutoGL), the first dedicated library for automated machine learning on graphs.
no code implementations • 19 Mar 2021 • Haoyang Li, Xinggang Wang
Given the great success of Deep Neural Networks(DNNs) and the black-box nature of it, the interpretability of these models becomes an important issue. The majority of previous research works on the post-hoc interpretation of a trained model. But recently, adversarial training shows that it is possible for a model to have an interpretable input-gradient through training. However, adversarial training lacks efficiency for interpretability. To resolve this problem, we construct an approximation of the adversarial perturbations and discover a connection between adversarial training and amplitude modulation.
no code implementations • 31 Dec 2020 • Xiao-Hui Li, Yuhan Shi, Haoyang Li, Wei Bai, Yuanwei Song, Caleb Chen Cao, Lei Chen
It has been long debated that eXplainable AI (XAI) is an important topic, but it lacks rigorous definition and fair metrics.
no code implementations • 7 Dec 2020 • Xinrun Wang, Tarun Nair, Haoyang Li, Yuh Sheng Reuben Wong, Nachiket Kelkar, Srinivas Vaidyanathan, Rajat Nayak, Bo An, Jagdish Krishnaswamy, Milind Tambe
Dams impact downstream river dynamics through flow regulation and disruption of upstream-downstream linkages.
2 code implementations • 18 Oct 2019 • Ignavier Ng, Shengyu Zhu, Zhuangyan Fang, Haoyang Li, Zhitang Chen, Jun Wang
This paper studies the problem of learning causal structures from observational data.
2 code implementations • 7 May 2018 • Ziwei Zhang, Peng Cui, Haoyang Li, Xiao Wang, Wenwu Zhu
Network embedding, which learns low-dimensional vector representation for nodes in the network, has attracted considerable research attention recently.