Search Results for author: Xiangliang Zhang

Found 123 papers, 35 papers with code

ArMATH: a Dataset for Solving Arabic Math Word Problems

1 code implementation LREC 2022 Reem Alghamdi, Zhenwen Liang, Xiangliang Zhang

In addition, a transfer learning model is built to let the high-resource Chinese MWP solver promote the performance of the low-resource Arabic MWP solver.

Deep Learning Math +1

Unsupervised Mitigating Gender Bias by Character Components: A Case Study of Chinese Word Embedding

no code implementations NAACL (GeBNLP) 2022 Xiuying Chen, Mingzhe Li, Rui Yan, Xin Gao, Xiangliang Zhang

Word embeddings learned from massive text collections have demonstrated significant levels of discriminative biases. However, debias on the Chinese language, one of the most spoken languages, has been less explored. Meanwhile, existing literature relies on manually created supplementary data, which is time- and energy-consuming. In this work, we propose the first Chinese Gender-neutral word Embedding model (CGE) based on Word2vec, which learns gender-neutral word embeddings without any labeled data. Concretely, CGE utilizes and emphasizes the rich feminine and masculine information contained in radicals, i. e., a kind of component in Chinese characters, during the training procedure. This consequently alleviates discriminative gender biases. Experimental results on public benchmark datasets show that our unsupervised method outperforms the state-of-the-art supervised debiased word embedding models without sacrificing the functionality of the embedding model.

Word Embeddings

Beyond Single-Value Metrics: Evaluating and Enhancing LLM Unlearning with Cognitive Diagnosis

no code implementations19 Feb 2025 Yicheng Lang, Kehan Guo, Yue Huang, Yujun Zhou, Haomin Zhuang, Tianyu Yang, Yao Su, Xiangliang Zhang

Due to the widespread use of LLMs and the rising critical ethical and safety concerns, LLM unlearning methods have been developed to remove harmful knowledge and undesirable capabilities.

cognitive diagnosis

Preference Leakage: A Contamination Problem in LLM-as-a-judge

1 code implementation3 Feb 2025 Dawei Li, Renliang Sun, Yue Huang, Ming Zhong, Bohan Jiang, Jiawei Han, Xiangliang Zhang, Wei Wang, Huan Liu

All of these findings imply that preference leakage is a widespread and challenging problem in the area of LLM-as-a-judge.

Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models

no code implementations22 Dec 2024 Lang Gao, Xiangliang Zhang, Preslav Nakov, Xiuying Chen

In particular, we introduce \textit{safety boundary}, and we find that jailbreaks shift harmful activations outside that safety boundary, where LLMs are less sensitive to harmful information.

Bayesian Optimization

UOE: Unlearning One Expert Is Enough For Mixture-of-experts LLMS

no code implementations27 Nov 2024 Haomin Zhuang, Yihua Zhang, Kehan Guo, Jinghan Jia, Gaowen Liu, Sijia Liu, Xiangliang Zhang

As MoE LLMs are celebrated for their exceptional performance and highly efficient inference processes, we ask: How can unlearning be performed effectively and efficiently on MoE LLMs?

Large Language Model

CLIPErase: Efficient Unlearning of Visual-Textual Associations in CLIP

no code implementations30 Oct 2024 Tianyu Yang, Lisen Dai, Zheyuan Liu, Xiangqi Wang, Meng Jiang, Yapeng Tian, Xiangliang Zhang

Machine unlearning (MU) has gained significant attention as a means to remove specific data from trained models without requiring a full retraining process.

Image Classification Machine Unlearning

Social Science Meets LLMs: How Reliable Are Large Language Models in Social Simulations?

no code implementations30 Oct 2024 Yue Huang, Zhengqing Yuan, Yujun Zhou, Kehan Guo, Xiangqi Wang, Haomin Zhuang, Weixiang Sun, Lichao Sun, Jindong Wang, Yanfang Ye, Xiangliang Zhang

To address this, we introduce TrustSim, an evaluation dataset covering 10 CSS-related topics, to systematically investigate the reliability of the LLM simulation.

Knowledge Graph Enhanced Language Agents for Recommendation

no code implementations25 Oct 2024 Taicheng Guo, Chaochun Liu, Hai Wang, Varun Mannam, Fang Wang, Xin Chen, Xiangliang Zhang, Chandan K. Reddy

Our key insight is that the paths in a KG can capture complex relationships between users and items, eliciting the underlying reasons for user preferences and enriching user profiles.

Knowledge Graphs Recommendation Systems

LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs

no code implementations18 Oct 2024 Yujun Zhou, Jingdong Yang, Kehan Guo, Pin-Yu Chen, Tian Gao, Werner Geyer, Nuno Moniz, Nitesh V Chawla, Xiangliang Zhang

With the increasing reliance on large language models (LLMs) for guidance in various fields, including laboratory settings, there is a growing concern about their reliability in critical safety-related decision-making.

Benchmarking Fairness +1

Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification

no code implementations5 Oct 2024 Zhenwen Liang, Ye Liu, Tong Niu, Xiangliang Zhang, Yingbo Zhou, Semih Yavuz

Moreover, to leverage the unique strengths of different reasoning strategies, we propose a novel collaborative method integrating Chain-of-Thought (CoT) and Program-of-Thought (PoT) solutions for verification.

GSM8K Math

Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge

no code implementations3 Oct 2024 Jiayi Ye, Yanbo Wang, Yue Huang, Dongping Chen, Qihui Zhang, Nuno Moniz, Tian Gao, Werner Geyer, Chao Huang, Pin-Yu Chen, Nitesh V Chawla, Xiangliang Zhang

LLM-as-a-Judge has been widely utilized as an evaluation method in various benchmarks and served as supervised rewards in model training.

XTraffic: A Dataset Where Traffic Meets Incidents with Explainability and More

no code implementations16 Jul 2024 Xiaochuan Gou, Ziyue Li, Tian Lan, Junpeng Lin, Zhishuai Li, Bingyu Zhao, Chen Zhang, Di Wang, Xiangliang Zhang

Our data can revolutionalize traditional traffic-related tasks towards higher interpretability and practice: instead of traditional prediction or classification tasks, we conduct: (1) post-incident traffic forecasting to quantify the impact of different incidents on traffic indexes; (2) incident classification using traffic indexes to determine the incidents types for precautions measures; (3) global causal analysis among the traffic indexes, meta-attributes, and incidents to give high-level guidance of the interrelations of various factors; (4) local causal analysis within road nodes to examine how different incidents affect the road segments' relations.

Advanced Framework for Animal Sound Classification With Features Optimization

no code implementations3 Jul 2024 Qiang Yang, Xiuying Chen, Changsheng Ma, Carlos M. Duarte, Xiangliang Zhang

The automatic classification of animal sounds presents an enduring challenge in bioacoustics, owing to the diverse statistical properties of sound signals, variations in recording equipment, and prevalent low Signal-to-Noise Ratio (SNR) conditions.

Classification Diversity +3

UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models

1 code implementation27 Jun 2024 Siyuan Wu, Yue Huang, Chujie Gao, Dongping Chen, Qihui Zhang, Yao Wan, Tianyi Zhou, Xiangliang Zhang, Jianfeng Gao, Chaowei Xiao, Lichao Sun

Large Language Models (LLMs) such as GPT-4 and Llama3 have significantly impacted various fields by enabling high-quality synthetic data generation and reducing dependence on expensive human-generated datasets.

Attribute Benchmarking +4

Quantifying AI Psychology: A Psychometrics Benchmark for Large Language Models

no code implementations25 Jun 2024 Yuan Li, Yue Huang, Hongyi Wang, Xiangliang Zhang, James Zou, Lichao Sun

Inspired by psychometrics, this paper presents a framework for investigating psychology in LLMs, including psychological dimension identification, assessment dataset curation, and assessment with results validation.

1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators?

no code implementations20 Jun 2024 Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun

This paper introduces a method to enhance the multilingual performance of LLMs by aggregating knowledge from diverse languages.

Jailbreaking Large Language Models Through Alignment Vulnerabilities in Out-of-Distribution Settings

1 code implementation19 Jun 2024 Yue Huang, Jingyu Tang, Dongping Chen, Bingda Tang, Yao Wan, Lichao Sun, Philip S. Yu, Xiangliang Zhang

Recently, Large Language Models (LLMs) have garnered significant attention for their exceptional natural language processing capabilities.

MolX: Enhancing Large Language Models for Molecular Learning with A Multi-Modal Extension

no code implementations10 Jun 2024 Khiem Le, Zhichun Guo, Kaiwen Dong, Xiaobao Huang, Bozhao Nan, Roshni Iyer, Xiangliang Zhang, Olaf Wiest, Wei Wang, Nitesh V. Chawla

Large Language Models (LLMs) with their strong task-handling capabilities have shown remarkable advancements across a spectrum of fields, moving beyond natural language understanding.

Natural Language Understanding Retrosynthesis

Flexible and Adaptable Summarization via Expertise Separation

1 code implementation8 Jun 2024 Xiuying Chen, Mingzhe Li, Shen Gao, Xin Cheng, Qingqing Zhu, Rui Yan, Xin Gao, Xiangliang Zhang

Our model's distinct separation of general and domain-specific summarization abilities grants it with notable flexibility and adaptability, all while maintaining parameter efficiency.

Write Summary Step-by-Step: A Pilot Study of Stepwise Summarization

no code implementations8 Jun 2024 Xiuying Chen, Shen Gao, Mingzhe Li, Qingqing Zhu, Xin Gao, Xiangliang Zhang

Hence, in this paper, we propose the task of Stepwise Summarization, which aims to generate a new appended summary each time a new document is proposed.

Abstractive Text Summarization Story Generation

HonestLLM: Toward an Honest and Helpful Large Language Model

1 code implementation1 Jun 2024 Chujie Gao, Siyuan Wu, Yue Huang, Dongping Chen, Qihui Zhang, Zhengyan Fu, Yao Wan, Lichao Sun, Xiangliang Zhang

Subsequently, we present two approaches to augmenting honesty and helpfulness in LLMs: a training-free enhancement and a fine-tuning-based improvement.

Language Modeling Language Modelling +1

MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

1 code implementation29 May 2024 Zhenwen Liang, Dian Yu, Wenhao Yu, Wenlin Yao, Zhihan Zhang, Xiangliang Zhang, Dong Yu

We evaluate the performance of various SOTA LLMs on the MathChat benchmark, and we observe that while these models excel in single turn question answering, they significantly underperform in more complex scenarios that require sustained reasoning and dialogue understanding.

Benchmarking Dialogue Understanding +5

Cross-Context Backdoor Attacks against Graph Prompt Learning

1 code implementation28 May 2024 Xiaoting Lyu, Yufei Han, Wei Wang, Hangwei Qian, Ivor Tsang, Xiangliang Zhang

Graph Prompt Learning (GPL) bridges significant disparities between pretraining and downstream applications to alleviate the knowledge transfer bottleneck in real-world graph learning.

Backdoor Attack Computational Efficiency +3

Zero-Shot Relational Learning for Multimodal Knowledge Graphs

no code implementations9 Apr 2024 Rui Cai, Shichao Pei, Xiangliang Zhang

Relational learning is an essential task in the domain of knowledge representation, particularly in knowledge graph completion (KGC).

Relational Reasoning

WebCode2M: A Real-World Dataset for Code Generation from Webpage Designs

no code implementations9 Apr 2024 Yi Gui, Zhen Li, Yao Wan, Yemin Shi, Hongyu Zhang, Yi Su, Bohua Chen, Dongping Chen, Siyuan Wu, Xing Zhou, Wenbin Jiang, Hai Jin, Xiangliang Zhang

The benchmarking results demonstrate that our dataset significantly improves the ability of MLLMs to generate code from webpage designs, confirming its effectiveness and usability for future applications in front-end design tools.

Benchmarking Code Generation

Defending Jailbreak Prompts via In-Context Adversarial Game

no code implementations20 Feb 2024 Yujun Zhou, Yufei Han, Haomin Zhuang, Kehan Guo, Zhenwen Liang, Hongyan Bao, Xiangliang Zhang

Large Language Models (LLMs) demonstrate remarkable capabilities across diverse applications.

UGMAE: A Unified Framework for Graph Masked Autoencoders

no code implementations12 Feb 2024 Yijun Tian, Chuxu Zhang, Ziyi Kou, Zheyuan Liu, Xiangliang Zhang, Nitesh V. Chawla

In light of this, we propose UGMAE, a unified framework for graph masked autoencoders to address these issues from the perspectives of adaptivity, integrity, complementarity, and consistency.

Self-Supervised Learning

Are we making much progress? Revisiting chemical reaction yield prediction from an imbalanced regression perspective

no code implementations6 Feb 2024 Yihong Ma, Xiaobao Huang, Bozhao Nan, Nuno Moniz, Xiangliang Zhang, Olaf Wiest, Nitesh V. Chawla

The yield of a chemical reaction quantifies the percentage of the target product formed in relation to the reactants consumed during the chemical reaction.

Prediction

Manipulating Predictions over Discrete Inputs in Machine Teaching

no code implementations31 Jan 2024 Xiaodong Wu, Yufei Han, Hayssam Dahrouj, Jianbing Ni, Zhenwen Liang, Xiangliang Zhang

Machine teaching often involves the creation of an optimal (typically minimal) dataset to help a model (referred to as the `student') achieve specific goals given by a teacher.

Combinatorial Optimization

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

1 code implementation21 Jan 2024 Taicheng Guo, Xiuying Chen, Yaqi Wang, Ruidi Chang, Shichao Pei, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang

To provide the community with an overview of this dynamic field, we present this survey to offer an in-depth discussion on the essential aspects of multi-agent systems based on LLMs, as well as the challenges.

Decision Making Language Modeling +2

What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks

1 code implementation NeurIPS 2023 Taicheng Guo, Kehan Guo, Bozhao Nan, Zhenwen Liang, Zhichun Guo, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang

In this paper, rather than pursuing state-of-the-art performance, we aim to evaluate capabilities of LLMs in a wide range of tasks across the chemistry domain.

In-Context Learning

Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation

no code implementations22 May 2023 Zhenwen Liang, Wenhao Yu, Tanmay Rajpurohit, Peter Clark, Xiangliang Zhang, Ashwin Kaylan

In this paper, we present a novel approach for distilling math word problem solving capabilities from large language models (LLMs) into smaller, more efficient student models.

Knowledge Tracing Math +1

A Topic-aware Summarization Framework with Different Modal Side Information

no code implementations19 May 2023 Xiuying Chen, Mingzhe Li, Shen Gao, Xin Cheng, Qiang Yang, Qishen Zhang, Xin Gao, Xiangliang Zhang

To address these two challenges, we first propose a unified topic encoder, which jointly discovers latent topics from the document and various kinds of side information.

Contrastive Learning Triplet

LogicRec: Recommendation with Users' Logical Requirements

1 code implementation23 Apr 2023 Zhenwei Tang, Griffin Floto, Armin Toroghi, Shichao Pei, Xiangliang Zhang, Scott Sanner

In this work, we formulate the problem of recommendation with users' logical requirements (LogicRec) and construct benchmark datasets for LogicRec.

Knowledge Graphs Recommendation Systems +1

Learning towards Selective Data Augmentation for Dialogue Generation

no code implementations17 Mar 2023 Xiuying Chen, Mingzhe Li, Jiayi Zhang, Xiaoqiang Xia, Chen Wei, Jianwei Cui, Xin Gao, Xiangliang Zhang, Rui Yan

As it is cumbersome and expensive to acquire a huge amount of data for training neural dialog models, data augmentation is proposed to effectively utilize existing training samples.

Data Augmentation Dialogue Generation +1

Knowledge Distillation on Graphs: A Survey

no code implementations1 Feb 2023 Yijun Tian, Shichao Pei, Xiangliang Zhang, Chuxu Zhang, Nitesh V. Chawla

Therefore, to improve the applicability of GNNs and fully encode the complicated topological information, knowledge distillation on graphs (KDG) has been introduced to build a smaller yet effective model and exploit more knowledge from data, leading to model compression and performance improvement.

Knowledge Distillation Model Compression +1

Follow the Timeline! Generating Abstractive and Extractive Timeline Summary in Chronological Order

1 code implementation2 Jan 2023 Xiuying Chen, Mingzhe Li, Shen Gao, Zhangming Chan, Dongyan Zhao, Xin Gao, Xiangliang Zhang, Rui Yan

Nowadays, time-stamped web documents related to a general news query floods spread throughout the Internet, and timeline summarization targets concisely summarizing the evolution trajectory of events along the timeline.

Decoder Document Summarization +2

Towards Efficient and Domain-Agnostic Evasion Attack with High-dimensional Categorical Inputs

no code implementations13 Dec 2022 Hongyan Bao, Yufei Han, Yujun Zhou, Xin Gao, Xiangliang Zhang

Our work targets at searching feasible adversarial perturbation to attack a classifier with high-dimensional categorical inputs in a domain-agnostic setting.

AdvCat: Domain-Agnostic Robustness Assessment for Cybersecurity-Critical Applications with Categorical Inputs

no code implementations13 Dec 2022 Helene Orsini, Hongyan Bao, Yujun Zhou, Xiangrui Xu, Yufei Han, Longyang Yi, Wei Wang, Xin Gao, Xiangliang Zhang

Machine Learning-as-a-Service systems (MLaaS) have been largely developed for cybersecurity-critical applications, such as detecting network intrusions and fake news campaigns.

Adversarial Robustness Fake News Detection +1

Scientific Paper Extractive Summarization Enhanced by Citation Graphs

no code implementations8 Dec 2022 Xiuying Chen, Mingzhe Li, Shen Gao, Rui Yan, Xin Gao, Xiangliang Zhang

We first propose a Multi-granularity Unsupervised Summarization model (MUS) as a simple and low-cost solution to the task.

Extractive Summarization Link Prediction +1

Generalizing Math Word Problem Solvers via Solution Diversification

1 code implementation1 Dec 2022 Zhenwen Liang, Jipeng Zhang, Lei Wang, Yan Wang, Jie Shao, Xiangliang Zhang

In this paper, we design a new training framework for an MWP solver by introducing a solution buffer and a solution discriminator.

Math

Analogical Math Word Problems Solving with Enhanced Problem-Solution Association

1 code implementation1 Dec 2022 Zhenwen Liang, Jipeng Zhang, Xiangliang Zhang

In this paper, we propose to build a novel MWP solver by leveraging analogical MWPs, which advance the solver's generalization ability across different kinds of MWPs.

Math Question Answering

Pairwise Instance Relation Augmentation for Long-tailed Multi-label Text Classification

no code implementations19 Nov 2022 Lin Xiao, Pengyu Xu, Liping Jing, Xiangliang Zhang

In response, we propose a Pairwise Instance Relation Augmentation Network (PIRAN) to augment tailed-label documents for balancing tail labels and head labels.

Diversity Multi Label Text Classification +3

Towards Improving Faithfulness in Abstractive Summarization

1 code implementation4 Oct 2022 Xiuying Chen, Mingzhe Li, Xin Gao, Xiangliang Zhang

The evaluation of factual consistency also shows that our model generates more faithful summaries than baselines.

Abstractive Text Summarization Decoder +3

NOSMOG: Learning Noise-robust and Structure-aware MLPs on Graphs

2 code implementations22 Aug 2022 Yijun Tian, Chuxu Zhang, Zhichun Guo, Xiangliang Zhang, Nitesh V. Chawla

Existing methods attempt to address this scalability issue by training multi-layer perceptrons (MLPs) exclusively on node content features using labels derived from trained GNNs.

TAR: Neural Logical Reasoning across TBox and ABox

no code implementations29 May 2022 Zhenwei Tang, Shichao Pei, Xi Peng, Fuzhen Zhuang, Xiangliang Zhang, Robert Hoehndorf

Neural logical reasoning (NLR) is a fundamental task to explore such knowledge bases, which aims at answering multi-hop queries with logical operations based on distributed representations of queries and answers.

Descriptive Logical Reasoning +1

Target-aware Abstractive Related Work Generation with Contrastive Learning

1 code implementation26 May 2022 Xiuying Chen, Hind Alamro, Mingzhe Li, Shen Gao, Rui Yan, Xin Gao, Xiangliang Zhang

The related work section is an important component of a scientific paper, which highlights the contribution of the target paper in the context of the reference papers.

Contrastive Learning TAG

Few-Shot Learning on Graphs

no code implementations17 Mar 2022 Chuxu Zhang, Kaize Ding, Jundong Li, Xiangliang Zhang, Yanfang Ye, Nitesh V. Chawla, Huan Liu

In light of this, few-shot learning on graphs (FSLG), which combines the strengths of graph representation learning and few-shot learning together, has been proposed to tackle the performance degradation in face of limited annotated data challenge.

Few-Shot Learning Graph Mining +2

Data-Driven State Estimation for Light-Emitting Diode Underwater Optical Communication

no code implementations30 Dec 2021 Yingquan Li, Zhenwen Liang, Ibrahima N'Doye, Xiangliang Zhang, Mohamed-Slim Alouini, Taous-Meriem Laleg-Kirati

Light-Emitting Diodes (LEDs) based underwater optical wireless communications (UOWCs), a technology with low latency and high data rates, have attracted significant importance for underwater robots.

Cross-modal Zero-shot Hashing by Label Attributes Embedding

no code implementations7 Nov 2021 Runmin Wang, Guoxian Yu, Lei Liu, Lizhen Cui, Carlotta Domeniconi, Xiangliang Zhang

Cross-modal hashing (CMH) is one of the most promising methods in cross-modal approximate nearest neighbor search.

Attribute

Crowdsourcing with Meta-Workers: A New Way to Save the Budget

no code implementations7 Nov 2021 Guangyang Han, Guoxian Yu, Lizhen Cui, Carlotta Domeniconi, Xiangliang Zhang

Due to the unreliability of Internet workers, it's difficult to complete a crowdsourcing project satisfactorily, especially when the tasks are multiple and the budget is limited.

Few-Shot Learning Image Classification

Meta Cross-Modal Hashing on Long-Tailed Data

no code implementations7 Nov 2021 Runmin Wang, Guoxian Yu, Carlotta Domeniconi, Xiangliang Zhang

Due to the lack of training samples in the tail classes, MetaCMH first learns direct features from data in different modalities, and then introduces an associative memory module to learn the memory features of samples of the tail classes.

Meta-Learning

Open-Set Crowdsourcing using Multiple-Source Transfer Learning

no code implementations7 Nov 2021 Guangyang Han, Guoxian Yu, Lei Liu, Lizhen Cui, Carlotta Domeniconi, Xiangliang Zhang

First, OSCrowd integrates crowd theme related datasets into a large source domain to facilitate partial transfer learning to approximate the label space inference of these tasks.

Transfer Learning

Towards Understanding the Robustness Against Evasion Attack on Categorical Data

no code implementations ICLR 2022 Hongyan Bao, Yufei Han, Yujun Zhou, Yun Shen, Xiangliang Zhang

Characterizing and assessing the adversarial vulnerability of classification models with categorical input has been a practically important, while rarely explored research problem.

Classification

Overview of the Arabic Sentiment Analysis 2021 Competition at KAUST

no code implementations29 Sep 2021 Hind Alamro, Manal Alshehri, Basma Alharbi, Zuhair Khayyat, Manal Kalkatawi, Inji Ibrahim Jaber, Xiangliang Zhang

From our recently released ASAD dataset, we provide the competitors with 55K tweets for training, and 20K tweets for validation, based on which the performance of participating teams are ranked on a leaderboard, https://www. kaggle. com/c/arabic-sentiment-analysis-2021-kaust.

Arabic Sentiment Analysis

A Simple and Debiased Sampling Method for Personalized Ranking

no code implementations29 Sep 2021 Lu Yu, Shichao Pei, Chuxu Zhang, Xiangliang Zhang

Pairwise ranking models have been widely used to address various problems, such as recommendation.

AppQ: Warm-starting App Recommendation Based on View Graphs

no code implementations8 Sep 2021 Dan Su, Jiqiang Liu, Sencun Zhu, Xiaoyang Wang, Wei Wang, Xiangliang Zhang

In this work, we propose AppQ, a novel app quality grading and recommendation system that extracts inborn features of apps based on app source code.

Recommendation Systems

FARF: A Fair and Adaptive Random Forests Classifier

no code implementations17 Aug 2021 Wenbin Zhang, Albert Bifet, Xiangliang Zhang, Jeremy C. Weiss, Wolfgang Nejdl

This algorithm, called FARF (Fair and Adaptive Random Forests), is based on using online component classifiers and updating them according to the current distribution, that also accounts for fairness and a single hyperparameters that alters fairness-accuracy balance.

Decision Making Fairness

Capturing Relations between Scientific Papers: An Abstractive Model for Related Work Section Generation

1 code implementation ACL 2021 Xiuying Chen, Hind Alamro, Mingzhe Li, Shen Gao, Xiangliang Zhang, Dongyan Zhao, Rui Yan

Hence, in this paper, we propose a Relation-aware Related work Generator (RRG), which generates an abstractive related work from the given multiple scientific papers in the same research area.

Relation

Attack Transferability Characterization for Adversarially Robust Multi-label Classification

1 code implementation29 Jun 2021 Zhuo Yang, Yufei Han, Xiangliang Zhang

We unveil how the transferability level of the attack determines the attackability of the classifier via establishing an information-theoretic analysis of the adversarial risk.

Adversarial Attack Classification +4

Socially-Aware Self-Supervised Tri-Training for Recommendation

1 code implementation7 Jun 2021 Junliang Yu, Hongzhi Yin, Min Gao, Xin Xia, Xiangliang Zhang, Nguyen Quoc Viet Hung

Under this scheme, only a bijective mapping is built between nodes in two different views, which means that the self-supervision signals from other nodes are being neglected.

Contrastive Learning Recommendation Systems +2

Quaternion Factorization Machines: A Lightweight Solution to Intricate Feature Interaction Modelling

no code implementations5 Apr 2021 Tong Chen, Hongzhi Yin, Xiangliang Zhang, Zi Huang, Yang Wang, Meng Wang

As a well-established approach, factorization machine (FM) is capable of automatically learning high-order interactions among features to make predictions without the need for manual feature engineering.

Feature Engineering

Uniting Heterogeneity, Inductiveness, and Efficiency for Graph Representation Learning

no code implementations4 Apr 2021 Tong Chen, Hongzhi Yin, Jie Ren, Zi Huang, Xiangliang Zhang, Hao Wang

In WIDEN, we propose a novel inductive, meta path-free message passing scheme that packs up heterogeneous node features with their associated edges from both low- and high-order neighbor nodes.

Graph Representation Learning Transductive Learning

Fast-adapting and Privacy-preserving Federated Recommender System

no code implementations2 Apr 2021 Qinyong Wang, Hongzhi Yin, Tong Chen, Junliang Yu, Alexander Zhou, Xiangliang Zhang

In the mobile Internet era, the recommender system has become an irreplaceable tool to help users discover useful items, and thus alleviating the information overload problem.

Federated Learning Meta-Learning +2

Hierarchical Hyperedge Embedding-based Representation Learning for Group Recommendation

no code implementations24 Mar 2021 Lei Guo, Hongzhi Yin, Tong Chen, Xiangliang Zhang, Kai Zheng

However, the representation learning for a group is most complex beyond the fusion of group member representation, as the personal preferences and group preferences may be in different spaces.

Representation Learning

Graph Embedding for Recommendation against Attribute Inference Attacks

no code implementations29 Jan 2021 Shijie Zhang, Hongzhi Yin, Tong Chen, Zi Huang, Lizhen Cui, Xiangliang Zhang

Specifically, in GERAI, we bind the information perturbation mechanism in differential privacy with the recommendation capability of graph convolutional networks.

Attribute Graph Embedding +2

Combat Data Shift in Few-shot Learning with Knowledge Graph

no code implementations27 Jan 2021 Yongchun Zhu, Fuzhen Zhuang, Xiangliang Zhang, Zhiyuan Qi, Zhiping Shi, Juan Cao, Qing He

However, in real-world applications, few-shot learning paradigm often suffers from data shift, i. e., samples in different tasks, even in the same task, could be drawn from various data distributions.

Few-Shot Learning

Does Head Label Help for Long-Tailed Multi-Label Text Classification

1 code implementation24 Jan 2021 Lin Xiao, Xiangliang Zhang, Liping Jing, Chi Huang, Mingyang Song

To address the challenge of insufficient training data on tail label classification, we propose a Head-to-Tail Network (HTTN) to transfer the meta-knowledge from the data-rich head labels to data-poor tail labels.

General Classification Multi Label Text Classification +2

Self-Supervised Multi-Channel Hypergraph Convolutional Network for Social Recommendation

4 code implementations16 Jan 2021 Junliang Yu, Hongzhi Yin, Jundong Li, Qinyong Wang, Nguyen Quoc Viet Hung, Xiangliang Zhang

In this paper, we fill this gap and propose a multi-channel hypergraph convolutional network to enhance social recommendation by leveraging high-order user relations.

Recommendation Systems Self-Supervised Learning

FENet: A Frequency Extraction Network for Obstructive Sleep Apnea Detection

no code implementations8 Jan 2021 Guanhua Ye, Hongzhi Yin, Tong Chen, Hongxu Chen, Lizhen Cui, Xiangliang Zhang

Obstructive Sleep Apnea (OSA) is a highly prevalent but inconspicuous disease that seriously jeopardizes the health of human beings.

Sleep apnea detection

Characterizing the Evasion Attackability of Multi-label Classifiers

no code implementations17 Dec 2020 Zhuo Yang, Yufei Han, Xiangliang Zhang

Evasion attack in multi-label learning systems is an interesting, widely witnessed, yet rarely explored research topic.

Computational Efficiency Multi-Label Learning

Self-Supervised Hypergraph Convolutional Networks for Session-based Recommendation

2 code implementations12 Dec 2020 Xin Xia, Hongzhi Yin, Junliang Yu, Qinyong Wang, Lizhen Cui, Xiangliang Zhang

Moreover, to enhance hypergraph modeling, we devise another graph convolutional network which is based on the line graph of the hypergraph and then integrate self-supervised learning into the training of the networks by maximizing mutual information between the session representations learned via the two networks, serving as an auxiliary task to improve the recommendation task.

Self-Supervised Learning Session-Based Recommendations

ASAD: A Twitter-based Benchmark Arabic Sentiment Analysis Dataset

no code implementations1 Nov 2020 Basma Alharbi, Hind Alamro, Manal Alshehri, Zuhair Khayyat, Manal Kalkatawi, Inji Ibrahim Jaber, Xiangliang Zhang

This paper provides a detailed description of a new Twitter-based benchmark dataset for Arabic Sentiment Analysis (ASAD), which is launched in a competition3, sponsored by KAUST for awarding 10000 USD, 5000 USD and 2000 USD to the first, second and third place winners, respectively.

Arabic Sentiment Analysis

Deterministic and probabilistic deep learning models for inverse design of broadband acoustic cloak

no code implementations28 Oct 2020 Waqas W. Ahmed, Mohamed Farhat, Xiangliang Zhang, Ying Wu

Concealing an object from incoming waves (light and/or sound) remained science fiction for a long time due to the absence of wave-shielding materials in nature.

Applied Physics Computational Physics

Multi-typed Objects Multi-view Multi-instance Multi-label Learning

no code implementations6 Oct 2020 Yuanlin Yang, Guoxian Yu, Jun Wang, Carlotta Domeniconi, Xiangliang Zhang

Multi-typed objects Multi-view Multi-instance Multi-label Learning (M4L) deals with interconnected multi-typed objects (or bags) that are made of diverse instances, represented with heterogeneous feature views and annotated with a set of non-exclusive but semantically related labels.

Multi-Label Learning

Deep Incomplete Multi-View Multiple Clusterings

no code implementations2 Oct 2020 Shaowei Wei, Jun Wang, Guoxian Yu, Carlotta Domeniconi, Xiangliang Zhang

Multi-view clustering aims at exploiting information from multiple heterogeneous views to promote clustering.

Clustering Decoder +1

SAIL: Self-Augmented Graph Contrastive Learning

no code implementations2 Sep 2020 Lu Yu, Shichao Pei, Lizhong Ding, Jun Zhou, Longfei Li, Chuxu Zhang, Xiangliang Zhang

This paper studies learning node representations with graph neural networks (GNNs) for unsupervised scenario.

Contrastive Learning Knowledge Distillation +1

PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization

no code implementations25 Aug 2020 Zhize Li, Hongyan Bao, Xiangliang Zhang, Peter Richtárik

Then, we show that PAGE obtains the optimal convergence results $O(n+\frac{\sqrt{n}}{\epsilon^2})$ (finite-sum) and $O(b+\frac{\sqrt{b}}{\epsilon^2})$ (online) matching our lower bounds for both nonconvex finite-sum and online problems.

Graph Factorization Machines for Cross-Domain Recommendation

no code implementations12 Jul 2020 Dongbo Xi, Fuzhen Zhuang, Yongchun Zhu, Pengpeng Zhao, Xiangliang Zhang, Qing He

In this paper, we propose a Graph Factorization Machine (GFM) which utilizes the popular Factorization Machine to aggregate multi-order interactions from neighborhood for recommendation.

Recommendation Systems

Attention-Aware Answers of the Crowd

no code implementations24 Dec 2019 Jingzheng Tu, Guoxian Yu, Jun Wang, Carlotta Domeniconi, Xiangliang Zhang

However, they all assume that workers' label quality is stable over time (always at the same level whenever they conduct the tasks).

Bayesian Inference

Multi-View Multiple Clusterings using Deep Matrix Factorization

no code implementations26 Nov 2019 Shaowei Wei, Jun Wang, Guoxian Yu, Carlotta, Xiangliang Zhang

Multi-view clustering aims at integrating complementary information from multiple heterogeneous views to improve clustering results.

Clustering Diversity

Prototypical Networks for Multi-Label Learning

no code implementations17 Nov 2019 Zhuo Yang, Yufei Han, Guoxian Yu, Qiang Yang, Xiangliang Zhang

We propose to formulate multi-label learning as a estimation of class distribution in a non-linear embedding space, where for each label, its positive data embeddings and negative data embeddings distribute compactly to form a positive component and negative component respectively, while the positive component and negative component are pushed away from each other.

Multi-Label Classification MUlTI-LABEL-ClASSIFICATION +1

Active Multi-Label Crowd Consensus

no code implementations7 Nov 2019 Jinzheng Tu, Guoxian Yu, Carlotta Domeniconi, Jun Wang, Xiangliang Zhang

AMCC accounts for the commonality and individuality of workers, and assumes that workers can be organized into different groups.

Triplet

Recurrent Attention Walk for Semi-supervised Classification

1 code implementation22 Oct 2019 Uchenna Akujuobi, Qiannan Zhang, Han Yufei, Xiangliang Zhang

We propose to explore the neighborhood in a reinforcement learning setting and find a walk path well-tuned for classifying the unlabelled target nodes.

Classification General Classification +2

Cross-modal Zero-shot Hashing

no code implementations19 Aug 2019 Xuanwu Liu, Zhao Li, Jun Wang, Guoxian Yu, Carlotta Domeniconi, Xiangliang Zhang

It then defines an objective function to achieve deep feature learning compatible with the composite similarity preserving, category attribute space learning, and hashing coding function learning.

Attribute Retrieval

Weakly-paired Cross-Modal Hashing

no code implementations29 May 2019 Xuanwu Liu, Jun Wang, Guoxian Yu, Carlotta Domeniconi, Xiangliang Zhang

FlexCMH first introduces a clustering-based matching strategy to explore the local structure of each cluster, and thus to find the potential correspondence between clusters (and samples therein) across modalities.

Clustering Retrieval

ActiveHNE: Active Heterogeneous Network Embedding

no code implementations14 May 2019 Xia Chen, Guoxian Yu, Jun Wang, Carlotta Domeniconi, Zhao Li, Xiangliang Zhang

To maximize the profit of utilizing the rare and valuable supervised information in HNEs, we develop a novel Active Heterogeneous Network Embedding (ActiveHNE) framework, which includes two components: Discriminative Heterogeneous Network Embedding (DHNE) and Active Query in Heterogeneous Networks (AQHN).

Network Embedding

Multi-View Multiple Clustering

no code implementations13 May 2019 Shixing Yao, Guoxian Yu, Jun Wang, Carlotta Domeniconi, Xiangliang Zhang

It then uses matrix factorization on the individual matrices, along with the shared matrix, to generate diverse clusterings of high-quality.

Clustering Representation Learning

Robust Federated Training via Collaborative Machine Teaching using Trusted Instances

no code implementations8 May 2019 Yufei Han, Xiangliang Zhang

In our work, we propose a collaborative and privacy-preserving machine teaching paradigm with multiple distributed teachers, to improve robustness of the federated training process against local data corruption.

Data Poisoning Federated Learning +1

Risk Convergence of Centered Kernel Ridge Regression with Large Dimensional Data

no code implementations19 Apr 2019 Khalil Elkhalil, Abla Kammoun, Xiangliang Zhang, Mohamed-Slim Alouini, Tareq Al-Naffouri

This paper carries out a large dimensional analysis of a variation of kernel ridge regression that we call \emph{centered kernel ridge regression} (CKRR), also known in the literature as kernel ridge regression with offset.

regression

P^2IR: Universal Deep Node Representation via Partial Permutation Invariant Set Functions

no code implementations27 Sep 2018 Shupeng Gui, Xiangliang Zhang, Shuang Qiu, Mingrui Wu, Jieping Ye, Ji Liu

Our method can 1) learn an arbitrary form of the representation function from the neighborhood, without losing any potential dependence structures, 2) automatically decide the significance of neighbors at different distances, and 3) be applicable to both homogeneous and heterogeneous graph embedding, which may contain multiple types of nodes.

Graph Embedding Representation Learning

CreditCoin: A Privacy-Preserving Blockchain-Based Incentive Announcement Network for Communications of Smart Vehicles

no code implementations7 Jul 2018 Lun Li, Jiqiang Liu, Lichen Cheng, Shuo Qiu, Wei Wang, Xiangliang Zhang, and Zonghua Zhang

The vehicular announcement network is one of the most promising utilities in the communications of smart vehicles and in the smart transportation systems.

Privacy Preserving

GESF: A Universal Discriminative Mapping Mechanism for Graph Representation Learning

no code implementations28 May 2018 Shupeng Gui, Xiangliang Zhang, Shuang Qiu, Mingrui Wu, Jieping Ye, Ji Liu

Graph embedding is a central problem in social network analysis and many other applications, aiming to learn the vector representation for each node.

Graph Embedding Graph Representation Learning

Graph Embedding with Rich Information through Heterogeneous Network

no code implementations18 Oct 2017 Guolei Sun, Xiangliang Zhang

In this paper, we proposed a novel and general framework of representation learning for graph with rich text information through constructing a bipartite heterogeneous network.

General Classification Graph Embedding +2

Coarse Grained Exponential Variational Autoencoders

no code implementations25 Feb 2017 Ke Sun, Xiangliang Zhang

Variational autoencoders (VAE) often use Gaussian or category distribution to model the inference process.

A PCA-Based Change Detection Framework for Multidimensional Data Streams: Change Detection in Multidimensional Data Streams

1 code implementation ACM SIGKDD international conference on Knowledge discovery and data mining 2015 Abdulhakim A. Qahtan, Basma Alharbi, Suojin Wang, Xiangliang Zhang

In this paper, we propose a framework for detecting changes in multidimensional data streams based on principal component analysis, which is used for projecting data into a lower dimensional space, thus facilitating density estimation and change-score calculations.

Change Detection Density Estimation

Cannot find the paper you are looking for? You can Submit a new open access paper.