Search Results for author: Cheng Yang

Found 88 papers, 38 papers with code

Real-time Topic-aware Influence Maximization Using Preprocessing

no code implementations1 Mar 2014 Wei Chen, Tian Lin, Cheng Yang

In this paper, we focus on the topic-aware influence maximization task.

Comprehend DeepWalk as Matrix Factorization

no code implementations2 Jan 2015 Cheng Yang, Zhiyuan Liu

Word2vec, as an efficient tool for learning vector representation of words has shown its effectiveness in many natural language processing tasks.

Learning and Transferring IDs Representation in E-commerce

no code implementations22 Dec 2017 Kui Zhao, Yuechuan Li, Zhaoqian Shuai, Cheng Yang

Many machine intelligence techniques are developed in E-commerce and one of the most essential components is the representation of IDs, including user ID, item ID, product ID, store ID, brand ID, category ID etc.

Adaptive Recurrent Neural Network Based on Mixture Layer

no code implementations24 Jan 2018 Kui Zhao, Yuechuan Li, Chi Zhang, Cheng Yang, Huan Xu

By leveraging the mixture layer, the proposed method can adaptively update states according to the similarities between encoded inputs and prototype vectors, leading to a stronger capacity in assimilating sequences with multiple patterns.

3D Point Cloud Denoising using Graph Laplacian Regularization of a Low Dimensional Manifold Model

no code implementations20 Mar 2018 Jin Zeng, Gene Cheung, Michael Ng, Jiahao Pang, Cheng Yang

Due to discrete observations of the patches on the manifold, we approximate the manifold dimension computation defined in the continuous domain with a patch-based graph Laplacian regularizer and propose a new discrete patch distance measure to quantify the similarity between two same-sized surface patches for graph construction that is robust to noise.

Denoising graph construction +2

Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement

no code implementations EMNLP 2018 Cheng Yang, Maosong Sun, Xiaoyuan Yi, Wenhao Li

The ability to write diverse poems in different styles under the same poetic imagery is an important characteristic of human poetry writing.

Disentanglement Machine Translation +1

CED: Credible Early Detection of Social Media Rumors

1 code implementation10 Nov 2018 Changhe Song, Cunchao Tu, Cheng Yang, Zhiyuan Liu, Maosong Sun

By regarding all reposts to a rumor candidate as a sequence, the proposed model will seek an early point-in-time for making a credible prediction.

Social and Information Networks

Graph Neural Networks: A Review of Methods and Applications

5 code implementations20 Dec 2018 Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, LiFeng Wang, Changcheng Li, Maosong Sun

Lots of learning tasks require dealing with graph data which contains rich relation information among elements.

Graph Attention

Neural Diffusion Model for Microscopic Cascade Prediction

1 code implementation21 Dec 2018 Cheng Yang, Maosong Sun, Haoran Liu, Shiyi Han, Zhiyuan Liu, Huanbo Luan

The strong assumptions oversimplify the complex diffusion mechanism and prevent these models from better fitting real-world cascade data.

Social and Information Networks Physics and Society

A Unified Framework for Marketing Budget Allocation

no code implementations4 Feb 2019 Kui Zhao, Junhao Hua, Ling Yan, Qi Zhang, Huan Xu, Cheng Yang

In our approach, a semi-black-box model is built to forecast the dynamic market response and an efficient optimization method is proposed to solve the complex allocation task.

Decision Making Marketing

Jiuge: A Human-Machine Collaborative Chinese Classical Poetry Generation System

no code implementations ACL 2019 Guo Zhipeng, Xiaoyuan Yi, Maosong Sun, Wenhao Li, Cheng Yang, Jiannan Liang, Huimin Chen, Yuhui Zhang, Ruoyu Li

By exposing the options of poetry genres, styles and revision modes, Jiuge, acting as a professional assistant, allows constant and active participation of users in poetic creation.

Cultural Vocal Bursts Intensity Prediction

GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification

2 code implementations ACL 2019 Jie Zhou, Xu Han, Cheng Yang, Zhiyuan Liu, LiFeng Wang, Changcheng Li, Maosong Sun

Fact verification (FV) is a challenging task which requires to retrieve relevant evidence from plain text and use the evidence to verify given claims.

Fact Verification

Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes

1 code implementation20 Oct 2019 Yujia Qin, Fanchao Qi, Sicong Ouyang, Zhiyuan Liu, Cheng Yang, Yasheng Wang, Qun Liu, Maosong Sun

Sememes, the minimum semantic units of human languages, have been successfully utilized in various natural language processing applications.

Adversarial Attack Language Modelling +2

Graph Neural News Recommendation with Long-term and Short-term Interest Modeling

no code implementations30 Oct 2019 Linmei Hu, Chen Li, Chuan Shi, Cheng Yang, Chao Shao

Existing methods on news recommendation mainly include collaborative filtering methods which rely on direct user-item interactions and content based methods which characterize the content of user reading history.

Collaborative Filtering News Recommendation +1

Graph Metric Learning via Gershgorin Disc Alignment

no code implementations28 Jan 2020 Cheng Yang, Gene Cheung, Wei Hu

We propose a fast general projection-free metric learning framework, where the minimization objective $\min_{\textbf{M} \in \mathcal{S}} Q(\textbf{M})$ is a convex differentiable function of the metric matrix $\textbf{M}$, and $\textbf{M}$ resides in the set $\mathcal{S}$ of generalized graph Laplacian matrices for connected graphs with positive edge weights and node degrees.

Metric Learning

MixPoet: Diverse Poetry Generation via Learning Controllable Mixed Latent Space

no code implementations13 Mar 2020 Xiaoyuan Yi, Ruoyu Li, Cheng Yang, Wenhao Li, Maosong Sun

Though recent neural models make prominent progress in some criteria of poetry quality, generated poems still suffer from the problem of poor diversity.

DCNAS: Densely Connected Neural Architecture Search for Semantic Image Segmentation

no code implementations CVPR 2021 Xiong Zhang, Hongmin Xu, Hong Mo, Jianchao Tan, Cheng Yang, Lei Wang, Wenqi Ren

Neural Architecture Search (NAS) has shown great potentials in automatically designing scalable network architectures for dense image predictions.

Ranked #13 on Semantic Segmentation on Cityscapes test (using extra training data)

Image Segmentation Neural Architecture Search +1

KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion

1 code implementation Findings (ACL) 2021 Jie Zhou, Shengding Hu, Xin Lv, Cheng Yang, Zhiyuan Liu, Wei Xu, Jie Jiang, Juanzi Li, Maosong Sun

Based on the datasets, we propose novel tasks such as multi-hop knowledge abstraction (MKA), multi-hop knowledge concretization (MKC) and then design a comprehensive benchmark.

Knowledge Graphs Transfer Learning

Fast Network Embedding Enhancement via High Order Proximity Approximation

2 code implementations ‏‏‎ ‎ 2020 Cheng Yang, Maosong Sun, Zhiyuan Liu, Cunchao Tu

Many Network Representation Learning (NRL) methods have been proposed to learn vector representations for vertices in a network recently.

Dimensionality Reduction Link Prediction +3

AP20-OLR Challenge: Three Tasks and Their Baselines

no code implementations4 Jun 2020 Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Li-Ming Song, Cheng Yang

Based on Kaldi and Pytorch, recipes for i-vector and x-vector systems are also conducted as baselines for the three tasks.

Dialect Identification

Signed Graph Metric Learning via Gershgorin Disc Perfect Alignment

1 code implementation15 Jun 2020 Cheng Yang, Gene Cheung, Wei Hu

Given a convex and differentiable objective $Q(\M)$ for a real symmetric matrix $\M$ in the positive definite (PD) cone -- used to compute Mahalanobis distances -- we propose a fast general metric learning framework that is entirely projection-free.

Binary Classification Metric Learning

Graph Neural News Recommendation with Unsupervised Preference Disentanglement

1 code implementation ACL 2020 Linmei Hu, Siyong Xu, Chen Li, Cheng Yang, Chuan Shi, Nan Duan, Xing Xie, Ming Zhou

Furthermore, the learned representations are disentangled with latent preference factors by a neighborhood routing algorithm, which can enhance expressiveness and interpretability.

Disentanglement News Recommendation

Adaptive Graph Encoder for Attributed Graph Embedding

1 code implementation3 Jul 2020 Ganqu Cui, Jie zhou, Cheng Yang, Zhiyuan Liu

Experimental results show that AGE consistently outperforms state-of-the-art graph embedding methods considerably on these tasks.

Clustering Graph Embedding +2

Few-shot Knowledge Transfer for Fine-grained Cartoon Face Generation

4 code implementations27 Jul 2020 Nan Zhuang, Cheng Yang

In this paper, we are interested in generating fine-grained cartoon faces for various groups.

Face Generation Transfer Learning +1

Identification of deep breath while moving forward based on multiple body regions and graph signal analysis

no code implementations20 Oct 2020 Yunlu Wang, Cheng Yang, Menghan Hu, Jian Zhang, Qingli Li, Guangtao Zhai, Xiao-Ping Zhang

This paper presents an unobtrusive solution that can automatically identify deep breath when a person is walking past the global depth camera.

Nonseparable Symplectic Neural Networks

no code implementations ICLR 2021 Shiying Xiong, Yunjin Tong, Xingzhe He, Shuqi Yang, Cheng Yang, Bo Zhu

The enabling mechanics of our approach is an augmented symplectic time integrator to decouple the position and momentum energy terms and facilitate their evolution.

Position

Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup

no code implementations27 Nov 2020 Cheng Yang, Shengnan Wang, Chao Yang, Yuechuan Li, Ru He, Jingqiao Zhang

In BERT training, the backward computation is much more time-consuming than the forward computation, especially in the distributed training setting in which the backward computation time further includes the communication time for gradient synchronization.

CoRe: An Efficient Coarse-refined Training Framework for BERT

no code implementations27 Nov 2020 Cheng Yang, Shengnan Wang, Yuechuan Li, Chao Yang, Ming Yan, Jingqiao Zhang, Fangquan Lin

In the second phase, we transform the trained relaxed BERT model into the original BERT and further retrain the model.

Incorporating Vision Bias into Click Models for Image-oriented Search Engine

no code implementations7 Jan 2021 Ningxin Xu, Cheng Yang, Yixin Zhu, Xiaowei Hu, Changhu Wang

Most typical click models assume that the probability of a document to be examined by users only depends on position, such as PBM and UBM.

Position

Re-rank Coarse Classification with Local Region Enhanced Features for Fine-Grained Image Recognition

no code implementations19 Feb 2021 Shaokang Yang, Shuai Liu, Cheng Yang, Changhu Wang

In this paper, a retrieval-based coarse-to-fine framework is proposed, where we re-rank the TopN classification results by using the local region enhanced embedding features to improve the Top1 accuracy (based on the observation that the correct category usually resides in TopN results).

Fine-Grained Image Classification Fine-Grained Image Recognition +2

Extract the Knowledge of Graph Neural Networks and Go Beyond it: An Effective Knowledge Distillation Framework

1 code implementation4 Mar 2021 Cheng Yang, Jiawei Liu, Chuan Shi

Our framework extracts the knowledge of an arbitrary learned GNN model (teacher model), and injects it into a well-designed student model.

Knowledge Distillation Node Classification

TransFG: A Transformer Architecture for Fine-grained Recognition

2 code implementations14 Mar 2021 Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang

Fine-grained visual classification (FGVC) which aims at recognizing objects from subcategories is a very challenging task due to the inherently subtle inter-class differences.

Fine-Grained Image Classification

Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image

1 code implementation16 Apr 2021 Cheng Yang, Jia Zheng, Xili Dai, Rui Tang, Yi Ma, Xiaojun Yuan

Single-image room layout reconstruction aims to reconstruct the enclosed 3D structure of a room from a single image.

Room Layout Estimation

Markdowns in E-Commerce Fresh Retail: A Counterfactual Prediction and Multi-Period Optimization Approach

no code implementations18 May 2021 Junhao Hua, Ling Yan, Huan Xu, Cheng Yang

In this paper, by leveraging abundant observational transaction data, we propose a novel data-driven and interpretable pricing approach for markdowns, consisting of counterfactual prediction and multi-period price optimization.

counterfactual

Rethinking Re-Sampling in Imbalanced Semi-Supervised Learning

1 code implementation1 Jun 2021 Ju He, Adam Kortylewski, Shaokang Yang, Shuai Liu, Cheng Yang, Changhu Wang, Alan Yuille

In particular, we decouple the training of the representation and the classifier, and systematically investigate the effects of different data re-sampling techniques when training the whole network including a classifier as well as fine-tuning the feature extractor only.

Projection-free Graph-based Classifier Learning using Gershgorin Disc Perfect Alignment

no code implementations NeurIPS 2021 Cheng Yang, Gene Cheung, Guangtao Zhai

We repose the SDR dual for solution $\bar{\mathbf{H}}$, then replace the PSD cone constraint $\bar{\mathbf{H}} \succeq 0$ with linear constraints derived from GDPA -- sufficient conditions to ensure $\bar{\mathbf{H}}$ is PSD -- so that the optimization becomes an LP per iteration.

SAS: Self-Augmentation Strategy for Language Model Pre-training

1 code implementation14 Jun 2021 Yifei Xu, Jingqiao Zhang, Ru He, Liangzhu Ge, Chao Yang, Cheng Yang, Ying Nian Wu

In this paper, we propose a self-augmentation strategy (SAS) where a single network is utilized for both regular pre-training and contextualized data augmentation for the training in later epochs.

Data Augmentation Language Modelling +2

Evaluating Modules in Graph Contrastive Learning

1 code implementation15 Jun 2021 Ganqu Cui, Yufeng Du, Cheng Yang, Jie zhou, Liang Xu, Xing Zhou, Xingyi Cheng, Zhiyuan Liu

The recent emergence of contrastive learning approaches facilitates the application on graph representation learning (GRL), introducing graph contrastive learning (GCL) into the literature.

Contrastive Learning Graph Classification +1

Copy and Paste method based on Pose for Re-identification

no code implementations22 Jul 2021 Cheng Yang

To solve the aforementioned problem, a simple and effective method to generate images in several new scenarios was proposed, which is names the Copy and Paste method based on Pose(CPP).

OLR 2021 Challenge: Datasets, Rules and Baselines

no code implementations23 Jul 2021 Binling Wang, Wenxuan Hu, Jing Li, Yiming Zhi, Zheng Li, Qingyang Hong, Lin Li, Dong Wang, Liming Song, Cheng Yang

In addition to the Language Identification (LID) tasks, multilingual Automatic Speech Recognition (ASR) tasks are introduced to OLR 2021 Challenge for the first time.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Hand Image Understanding via Deep Multi-Task Learning

1 code implementation ICCV 2021 Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, Ji Liu

To further improve the performance of these tasks, we propose a novel Hand Image Understanding (HIU) framework to extract comprehensive information of the hand object from a single RGB image, by jointly considering the relationships between these tasks.

3D Hand Pose Estimation Multi-Task Learning +1

Unfolding Projection-free SDP Relaxation of Binary Graph Classifier via GDPA Linearization

no code implementations10 Sep 2021 Cheng Yang, Gene Cheung, Wai-tian Tan, Guangtao Zhai

Algorithm unfolding creates an interpretable and parsimonious neural network architecture by implementing each iteration of a model-based algorithm as a neural layer.

Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration

2 code implementations NeurIPS 2021 Xiao Wang, Hongrui Liu, Chuan Shi, Cheng Yang

Specifically, we first verify that the confidence distribution in a graph has homophily property, and this finding inspires us to design a calibration GNN model (CaGCN) to learn the calibration function.

PartImageNet: A Large, High-Quality Dataset of Parts

1 code implementation2 Dec 2021 Ju He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jie-Neng Chen, Shuai Liu, Cheng Yang, Qihang Yu, Alan Yuille

To help address this problem, we propose PartImageNet, a large, high-quality dataset with part segmentation annotations.

Activity Recognition Few-Shot Learning +6

Space4HGNN: A Novel, Modularized and Reproducible Platform to Evaluate Heterogeneous Graph Neural Network

1 code implementation18 Feb 2022 Tianyu Zhao, Cheng Yang, Yibo Li, Quan Gan, Zhenyi Wang, Fengqi Liang, Huan Zhao, Yingxia Shao, Xiao Wang, Chuan Shi

Heterogeneous Graph Neural Network (HGNN) has been successfully employed in various tasks, but we cannot accurately know the importance of different design dimensions of HGNNs due to diverse architectures and applied scenarios.

Spatial Autoregressive Coding for Graph Neural Recommendation

no code implementations19 May 2022 Jiayi Zheng, Ling Yang, Heyuan Wang, Cheng Yang, Yinghong Li, Xiaowei Hu, Shenda Hong

To adequately leverage neighbor proximity and high-order information, we design a novel spatial autoregressive paradigm.

Graph Embedding

An Adaptive Contrastive Learning Model for Spike Sorting

no code implementations24 May 2022 Lang Qian, Shengjie Zheng, Chunshan Deng, Cheng Yang, Xiaojian Li

But for BCIs used in neuroscience research, it is important to separate out the activity of individual neurons.

Binary Classification Brain Computer Interface +3

Improving Contrastive Learning of Sentence Embeddings with Case-Augmented Positives and Retrieved Negatives

1 code implementation6 Jun 2022 Wei Wang, Liangzhu Ge, Jingqiao Zhang, Cheng Yang

Following SimCSE, contrastive learning based methods have achieved the state-of-the-art (SOTA) performance in learning sentence embeddings.

Attribute Contrastive Learning +5

GUIM -- General User and Item Embedding with Mixture of Representation in E-commerce

no code implementations2 Jul 2022 Chao Yang, Ru He, Fangquan Lin, Suoyuan Song, Jingqiao Zhang, Cheng Yang

Our goal is to build general representation (embedding) for each user and each product item across Alibaba's businesses, including Taobao and Tmall which are among the world's biggest e-commerce websites.

Contrastive Learning Marketing

KDD CUP 2022 Wind Power Forecasting Team 88VIP Solution

no code implementations18 Aug 2022 Fangquan Lin, Wei Jiang, Hanwei Zhang, Cheng Yang

KDD CUP 2022 proposes a time-series forecasting task on spatial dynamic wind power dataset, in which the participants are required to predict the future generation given the historical context factors.

Imputation Time Series +1

xTrimoABFold: De novo Antibody Structure Prediction without MSA

no code implementations30 Nov 2022 Yining Wang, Xumeng Gong, Shaochuan Li, Bing Yang, YiWu Sun, Chuan Shi, Yangang Wang, Cheng Yang, Hui Li, Le Song

Its improvement in both accuracy and efficiency makes it a valuable tool for de novo antibody design and could make further improvements in immuno-theory.

Computational Efficiency Protein Language Model

MA-GCL: Model Augmentation Tricks for Graph Contrastive Learning

1 code implementation14 Dec 2022 Xumeng Gong, Cheng Yang, Chuan Shi

We argue that typical data augmentation techniques (e. g., edge dropping) in GCL cannot generate diverse enough contrastive views to filter out noises.

Contrastive Learning Data Augmentation +2

Vision Learners Meet Web Image-Text Pairs

no code implementations17 Jan 2023 Bingchen Zhao, Quan Cui, Hao Wu, Osamu Yoshie, Cheng Yang, Oisin Mac Aodha

In this work, given the excellent scalability of web data, we consider self-supervised pre-training on noisy web sourced image-text paired data.

Benchmarking Self-Supervised Learning +1

Neural Partial Differential Equations with Functional Convolution

no code implementations10 Mar 2023 Ziqian Wu, Xingzhe He, Yijun Li, Cheng Yang, Rui Liu, Shiying Xiong, Bo Zhu

We present a lightweighted neural PDE representation to discover the hidden structure and predict the solution of different nonlinear PDEs.

Abnormal Event Detection via Hypergraph Contrastive Learning

no code implementations2 Apr 2023 Bo Yan, Cheng Yang, Chuan Shi, Jiawei Liu, Xiaochen Wang

AEHCL designs the intra-event and inter-event contrastive modules to exploit self-supervised AHIN information.

Contrastive Learning Event Detection

Graph Mining for Cybersecurity: A Survey

no code implementations2 Apr 2023 Bo Yan, Cheng Yang, Chuan Shi, Yong Fang, Qi Li, Yanfang Ye, Junping Du

In recent years, with the proliferation of graph mining techniques, many researchers investigated these techniques for capturing correlations between cyber entities and achieving high performance.

Graph Mining

Real-World Denoising via Diffusion Model

no code implementations8 May 2023 Cheng Yang, Lijing Liang, Zhixun Su

We introduce a diffusion process with linear interpolation, and the intermediate noisy image is interpolated from the original clean image and the corresponding real-world noisy image, so that this diffusion model can handle the level of added noise.

Image Denoising Image Generation

Question Answering as Programming for Solving Time-Sensitive Questions

1 code implementation23 May 2023 Xinyu Zhu, Cheng Yang, Bei Chen, Siheng Li, Jian-Guang Lou, Yujiu Yang

Question answering plays a pivotal role in human daily life because it involves our acquisition of knowledge about the world.

Natural Language Understanding Question Answering

An Efficient Virtual Data Generation Method for Reducing Communication in Federated Learning

no code implementations21 Jun 2023 Cheng Yang, Xue Yang, Dongxian Wu, Xiaohu Tang

Then the server aggregates all the proxy datasets to form a central dummy dataset, which is used to finetune aggregated global model.

Federated Learning

Interactive Molecular Discovery with Natural Language

1 code implementation21 Jun 2023 Zheni Zeng, Bangchen Yin, Shipeng Wang, Jiarui Liu, Cheng Yang, Haishen Yao, Xingzhi Sun, Maosong Sun, Guotong Xie, Zhiyuan Liu

Natural language is expected to be a key medium for various human-machine interactions in the era of large language models.

Property Prediction

Communicative Agents for Software Development

1 code implementation16 Jul 2023 Chen Qian, Xin Cong, Wei Liu, Cheng Yang, Weize Chen, Yusheng Su, Yufan Dang, Jiahao Li, Juyuan Xu, Dahai Li, Zhiyuan Liu, Maosong Sun

At the core of this paradigm lies ChatDev, a virtual chat-powered software development company that mirrors the established waterfall model, meticulously dividing the development process into four distinct chronological stages: designing, coding, testing, and documenting.

Decision Making

Does Correction Remain A Problem For Large Language Models?

no code implementations3 Aug 2023 Xiaowu Zhang, Xiaotian Zhang, Cheng Yang, Hang Yan, Xipeng Qiu

As large language models, such as GPT, continue to advance the capabilities of natural language processing (NLP), the question arises: does the problem of correction still persist?

Few-Shot Learning

Deep Learning-Based Knowledge Injection for Metaphor Detection: A Comprehensive Review

no code implementations8 Aug 2023 Cheng Yang, Zheng Li, Zhiyue Liu, Qingbao Huang

Finally, we explore the current issues facing knowledge injection methods and provide an outlook on future research directions.

AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models

no code implementations12 Aug 2023 Siheng Li, Cheng Yang, Yichun Yin, Xinyu Zhu, Zesen Cheng, Lifeng Shang, Xin Jiang, Qun Liu, Yujiu Yang

Information-seeking conversation, which aims to help users gather information through conversation, has achieved great progress in recent years.

Few-Shot Learning Language Modelling

NewsDialogues: Towards Proactive News Grounded Conversation

1 code implementation12 Aug 2023 Siheng Li, Yichun Yin, Cheng Yang, Wangjie Jiang, Yiwei Li, Zesen Cheng, Lifeng Shang, Xin Jiang, Qun Liu, Yujiu Yang

In this paper, we propose a novel task, Proactive News Grounded Conversation, in which a dialogue system can proactively lead the conversation based on some key topics of the news.

Response Generation

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

1 code implementation21 Aug 2023 Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie zhou

Autonomous agents empowered by Large Language Models (LLMs) have undergone significant improvements, enabling them to generalize across a broad spectrum of tasks.

A Stochastic Online Forecast-and-Optimize Framework for Real-Time Energy Dispatch in Virtual Power Plants under Uncertainty

no code implementations15 Sep 2023 Wei Jiang, Zhongkai Yi, Li Wang, Hanwei Zhang, Jihai Zhang, Fangquan Lin, Cheng Yang

Aggregating distributed energy resources in power systems significantly increases uncertainties, in particular caused by the fluctuation of renewable energy generation.

Data Augmentation energy management +2

Data-centric Graph Learning: A Survey

no code implementations8 Oct 2023 Yuxin Guo, Deyu Bo, Cheng Yang, Zhiyuan Lu, Zhongjian Zhang, Jixi Liu, Yufei Peng, Chuan Shi

Recently, instead of designing more complex neural architectures as model-centric approaches, the attention of AI community has shifted to data-centric ones, which focuses on better processing data to strengthen the ability of neural models.

Graph Learning

Towards Graph Foundation Models: A Survey and Beyond

no code implementations18 Oct 2023 Jiawei Liu, Cheng Yang, Zhiyuan Lu, Junze Chen, Yibo Li, Mengmei Zhang, Ting Bai, Yuan Fang, Lichao Sun, Philip S. Yu, Chuan Shi

Foundation models have emerged as critical components in a variety of artificial intelligence applications, and showcase significant success in natural language processing and several other domains.

Graph Learning

Specialist or Generalist? Instruction Tuning for Specific NLP Tasks

no code implementations23 Oct 2023 Chufan Shi, Yixuan Su, Cheng Yang, Yujiu Yang, Deng Cai

Although instruction tuning has proven to be a data-efficient method for transforming LLMs into such generalist models, their performance still lags behind specialist models trained exclusively for specific tasks.

Specificity

Improving Vision-and-Language Reasoning via Spatial Relations Modeling

no code implementations9 Nov 2023 Cheng Yang, Rui Xu, Ye Guo, Peixiang Huang, Yiru Chen, Wenkui Ding, Zhongyuan Wang, Hong Zhou

Further, we design two pre-training tasks named object position regression (OPR) and spatial relation classification (SRC) to learn to reconstruct the spatial relation graph respectively.

Position regression Relation +3

Graph Invariant Learning with Subgraph Co-mixup for Out-Of-Distribution Generalization

no code implementations18 Dec 2023 Tianrui Jia, Haoyang Li, Cheng Yang, Tao Tao, Chuan Shi

In this paper, we propose a novel graph invariant learning method based on invariant and variant patterns co-mixup strategy, which is capable of jointly generating mixed multiple environments and capturing invariant patterns from the mixed graph data.

Graph Representation Learning Out-of-Distribution Generalization

Experiential Co-Learning of Software-Developing Agents

1 code implementation28 Dec 2023 Chen Qian, Yufan Dang, Jiahao Li, Wei Liu, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents.

AllSpark: A Multimodal Spatio-Temporal General Intelligence Model with Thirteen Modalities

no code implementations31 Dec 2023 Run Shao, Cheng Yang, Qiujun Li, Qing Zhu, Yongjun Zhang, Yansheng Li, Yu Liu, Yong Tang, Dapeng Liu, Shizhong Yang, Haifeng Li

We introduce the Language as Reference Framework (LaRF), a fundamental principle for constructing a multimodal unified model, aiming to strike a trade-off between the cohesion and autonomy among different modalities.

AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents

2 code implementations24 Jan 2024 Chang Ma, Junlei Zhang, Zhihao Zhu, Cheng Yang, Yujiu Yang, Yaohui Jin, Zhenzhong Lan, Lingpeng Kong, Junxian He

Evaluating large language models (LLMs) as general-purpose agents is essential for understanding their capabilities and facilitating their integration into practical applications.

Benchmarking

GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks

1 code implementation11 Feb 2024 Mengmei Zhang, Mingwei Sun, Peng Wang, Shen Fan, Yanhu Mo, Xiaoxiao Xu, Hong Liu, Cheng Yang, Chuan Shi

Large language models (LLMs) like ChatGPT, exhibit powerful zero-shot and instruction-following capabilities, have catalyzed a revolutionary transformation across diverse fields, especially for open-ended tasks.

Graph Question Answering Instruction Following +4

Endowing Pre-trained Graph Models with Provable Fairness

1 code implementation19 Feb 2024 Zhongjian Zhang, Mengmei Zhang, Yue Yu, Cheng Yang, Jiawei Liu, Chuan Shi

Furthermore, with GraphPAR, we quantify whether the fairness of each node is provable, i. e., predictions are always fair within a certain range of sensitive attribute semantics.

Attribute Fairness +1

Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication

1 code implementation28 Feb 2024 Weize Chen, Chenfei Yuan, Jiarui Yuan, Yusheng Su, Chen Qian, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

Natural language (NL) has long been the predominant format for human cognition and communication, and by extension, has been similarly pivotal in the development and application of Large Language Models (LLMs).

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

1 code implementation14 Mar 2024 Sun Ao, Weilin Zhao, Xu Han, Cheng Yang, Zhiyuan Liu, Chuan Shi, Maosong Sun, Shengnan Wang, Teng Su

Effective attention modules have played a crucial role in the success of Transformer-based large language models (LLMs), but the quadratic time and memory complexities of these attention modules also pose a challenge when processing long sequences.

FairSIN: Achieving Fairness in Graph Neural Networks through Sensitive Information Neutralization

no code implementations19 Mar 2024 Cheng Yang, Jixi Liu, Yunhe Yan, Chuan Shi

The F3 are expected to statistically neutralize the sensitive bias in node representations and provide additional nonsensitive information.

Fairness

Treasures Outside Contexts: Improving Event Detection via Global Statistics

1 code implementation EMNLP 2021 Rui Li, Wenlin Zhao, Cheng Yang, Sen Su

Event detection (ED) aims at identifying event instances of specified types in given texts, which has been formalized as a sequence labeling task.

Event Detection

Cannot find the paper you are looking for? You can Submit a new open access paper.