Search Results for author: Cheng Yang

Found 118 papers, 53 papers with code

Treasures Outside Contexts: Improving Event Detection via Global Statistics

1 code implementation EMNLP 2021 Rui Li, Wenlin Zhao, Cheng Yang, Sen Su

Event detection (ED) aims at identifying event instances of specified types in given texts, which has been formalized as a sequence labeling task.

Event Detection

An End-to-End Real-World Camera Imaging Pipeline

no code implementations16 Nov 2024 Kepeng Xu, Zijia Ma, Li Xu, Gang He, Yunsong Li, Wenxin Yu, Taichu Han, Cheng Yang

Nevertheless, the real-world imaging pipeline still faces challenges including the lack of joint optimization in system components, computational redundancies, and optical distortions such as lens shading. In light of this, we propose an end-to-end camera imaging pipeline (RealCamNet) to enhance real-world camera imaging performance.

Image Compression Tone Mapping

Large Language Models Can Self-Improve in Long-context Reasoning

1 code implementation12 Nov 2024 Siheng Li, Cheng Yang, Zesen Cheng, Lemao Liu, Mo Yu, Yujiu Yang, Wai Lam

Large language models (LLMs) have achieved substantial progress in processing long contexts but still struggle with long-context reasoning.

Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System

no code implementations10 Oct 2024 Weize Chen, Jiarui Yuan, Chen Qian, Cheng Yang, Zhiyuan Liu, Maosong Sun

Large Language Model (LLM) based multi-agent systems (MAS) show remarkable potential in collaborative problem-solving, yet they still face critical challenges: low communication efficiency, poor scalability, and a lack of effective parameter-updating optimization methods.

Large Language Model Question Answering

Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models

1 code implementation29 Sep 2024 Xin Li, Weize Chen, Qizhi Chu, Haopeng Li, Zhaojun Sun, Ran Li, Chen Qian, Yiwei Wei, Zhiyuan Liu, Chuan Shi, Maosong Sun, Cheng Yang

Our results underscore that the capabilities of LLMs in handling structured data are still under-explored, and show the effectiveness of LLM4Graph in enhancing LLMs' proficiency of graph analysis.

Recommendation Systems

A Survey on the Honesty of Large Language Models

2 code implementations27 Sep 2024 Siheng Li, Cheng Yang, Taiqiang Wu, Chufan Shi, Yuji Zhang, Xinyu Zhu, Zesen Cheng, Deng Cai, Mo Yu, Lemao Liu, Jie zhou, Yujiu Yang, Ngai Wong, Xixin Wu, Wai Lam

Honesty is a fundamental principle for aligning large language models (LLMs) with human values, requiring these models to recognize what they know and don't know and be able to faithfully express their knowledge.

Survey

GLinSAT: The General Linear Satisfiability Neural Network Layer By Accelerated Gradient Descent

1 code implementation26 Sep 2024 Hongtai Zeng, Chao Yang, Yanzhen Zhou, Cheng Yang, Qinglai Guo

In this paper, we consider making a batch of neural network outputs satisfy bounded and general linear constraints.

Decision Making Graph Matching

Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models

no code implementations14 Sep 2024 Yuanzhao Zhai, Tingkai Yang, Kele Xu, Feng Dawei, Cheng Yang, Bo Ding, Huaimin Wang

Agents significantly enhance the capabilities of standalone Large Language Models (LLMs) by perceiving environments, making decisions, and executing actions.

4k Decision Making

Efficient Multi-task Prompt Tuning for Recommendation

no code implementations30 Aug 2024 Ting Bai, Le Huang, Yue Yu, Cheng Yang, Cheng Hou, Zhe Zhao, Chuan Shi

A novel two-stage prompt-tuning MTL framework (MPT-Rec) is proposed to address task irrelevance and training efficiency problems in multi-task recommender systems.

Multi-Task Learning Recommendation Systems

Can Large Language Models Improve the Adversarial Robustness of Graph Neural Networks?

no code implementations16 Aug 2024 Zhongjian Zhang, Xiao Wang, Huichi Zhou, Yue Yu, Mengmei Zhang, Cheng Yang, Chuan Shi

By presenting the empirical results, we find that despite that LLMs can improve the robustness of GNNs, there is still an average decrease of 23. 1% in accuracy, implying that the GNNs remain extremely vulnerable against topology attack.

Adversarial Robustness

An Energy-based Model for Word-level AutoCompletion in Computer-aided Translation

1 code implementation29 Jul 2024 Cheng Yang, Guoping Huang, Mo Yu, Zhirui Zhang, Siheng Li, Mingming Yang, Shuming Shi, Yujiu Yang, Lemao Liu

Existing work addresses this task through a classification model based on a neural network that maps the hidden vector of the input context into its corresponding label (i. e., the candidate target word is treated as a label).

Sentence

LSReGen: Large-Scale Regional Generator via Backward Guidance Framework

no code implementations21 Jul 2024 BoWen Zhang, Cheng Yang, Xuanhui Liu

The third approach depends on phenomena specific to certain model architectures, complicating its application to large-scale image generation. To address these issues, we propose a novel controllable generation framework that offers a generalized interpretation of backward guidance without relying on specific assumptions.

Image Generation

Solving General Natural-Language-Description Optimization Problems with Large Language Models

no code implementations9 Jul 2024 Jihai Zhang, Wei Wang, Siyan Guo, Li Wang, Fangquan Lin, Cheng Yang, Wotao Yin

Optimization problems seek to find the best solution to an objective under a set of constraints, and have been widely investigated in real-world applications.

Decision Making

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

1 code implementation9 Jul 2024 Weize Chen, Ziming You, Ran Li, Yitong Guan, Chen Qian, Chenyang Zhao, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

The rapid advancement of large language models (LLMs) has paved the way for the development of highly capable autonomous agents.

A Pairwise DomMix Attentive Adversarial Network for Unsupervised Domain Adaptive Object Detection

no code implementations3 Jul 2024 Jie Shao, Jiacheng Wu, Wenzhong Shen, Cheng Yang

Then a pairwise attentive adversarial network is applied with attentive encoding on both image-level and instance-level features at different scales and optimizes domain alignment by adversarial learning.

object-detection Object Detection

CountFormer: Multi-View Crowd Counting Transformer

1 code implementation2 Jul 2024 Hong Mo, Xiong Zhang, Jianchao Tan, Cheng Yang, Qiong Gu, Bo Hang, Wenqi Ren

Multi-view counting (MVC) methods have shown their superiority over single-view counterparts, particularly in situations characterized by heavy occlusion and severe perspective distortions.

Crowd Counting

Data Issues in Industrial AI System: A Meta-Review and Research Strategy

no code implementations22 Jun 2024 Xuejiao Li, Cheng Yang, Charles Møller, Jay Lee

How to address these data issues stands as a significant concern confronting both industry and academia.

Data Integration

Autonomous Agents for Collaborative Task under Information Asymmetry

2 code implementations21 Jun 2024 Wei Liu, Chenxi Wang, Yifei Wang, Zihao Xie, Rennai Qiu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Chen Qian

Together with InfoNav, iAgents organizes human information in a mixed memory to provide agents with accurate and comprehensive information for exchange.

Language Modelling Large Language Model +1

HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing

no code implementations17 Jun 2024 Jing Chen, Xinyu Zhu, Cheng Yang, Chufan Shi, Yadong Xi, Yuxiang Zhang, Junjie Wang, Jiashu Pu, Rongsheng Zhang, Yujiu Yang, Tian Feng

Generative AI has demonstrated unprecedented creativity in the field of computer vision, yet such phenomena have not been observed in natural language processing.

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

1 code implementation14 Jun 2024 Chufan Shi, Cheng Yang, Yaxin Liu, Bo Shui, Junjie Wang, Mohan Jing, Linran Xu, Xinyu Zhu, Siheng Li, Yuxiang Zhang, Gongye Liu, Xiaomei Nie, Deng Cai, Yujiu Yang

We introduce a new benchmark, ChartMimic, aimed at assessing the visually-grounded code generation capabilities of large multimodal models (LMMs).

Code Generation

Multi-Agent Software Development through Cross-Team Collaboration

1 code implementation13 Jun 2024 Zhuoyun Du, Chen Qian, Wei Liu, Zihao Xie, Yifei Wang, Yufan Dang, Weize Chen, Cheng Yang

We anticipate that our work will guide LLM agents towards a cross-team paradigm and contribute to their significant growth in but not limited to software development.

Story Generation

Scaling Large-Language-Model-based Multi-Agent Collaboration

1 code implementation11 Jun 2024 Chen Qian, Zihao Xie, Yifei Wang, Wei Liu, Yufan Dang, Zhuoyun Du, Weize Chen, Cheng Yang, Zhiyuan Liu, Maosong Sun

Pioneering advancements in large language model-powered agents have underscored the design pattern of multi-agent collaboration, demonstrating that collective intelligence can surpass the capabilities of each individual.

Language Modelling Large Language Model

Non-autoregressive Personalized Bundle Generation

no code implementations11 Jun 2024 Wenchuan Yang, Cheng Yang, Jichao Li, Yuejin Tan, Xin Lu, Chuan Shi

The personalized bundle generation problem, which aims to create a preferred bundle for user from numerous candidate items, receives increasing attention in recommendation.

Decoder Graph Neural Network +1

Online Self-Preferring Language Models

no code implementations23 May 2024 Yuanzhao Zhai, Zhuo Zhang, Kele Xu, Hanyang Peng, Yue Yu, Dawei Feng, Cheng Yang, Bo Ding, Huaimin Wang

To overcome this limitation, we propose Online Self-Preferring (OSP) language models to learn from self-generated response pairs and self-judged preference strengths.

Iterative Experience Refinement of Software-Developing Agents

no code implementations7 May 2024 Chen Qian, Jiahao Li, Yufan Dang, Wei Liu, Yifei Wang, Zihao Xie, Weize Chen, Cheng Yang, Yingli Zhang, Zhiyuan Liu, Maosong Sun

We propose two fundamental patterns: the successive pattern, refining based on nearest experiences within a task batch, and the cumulative pattern, acquiring experiences across all previous task batches.

Federated Graph Condensation with Information Bottleneck Principles

no code implementations7 May 2024 Bo Yan, Sihao He, Cheng Yang, Shang Liu, Yang Cao, Chuan Shi

However, existing graph condensation methods rely on centralized data storage, which is unfeasible for real-world decentralized data distribution, and overlook data holders' privacy-preserving requirements.

Graph Learning Privacy Preserving

FairSIN: Achieving Fairness in Graph Neural Networks through Sensitive Information Neutralization

no code implementations19 Mar 2024 Cheng Yang, Jixi Liu, Yunhe Yan, Chuan Shi

The F3 are expected to statistically neutralize the sensitive bias in node representations and provide additional nonsensitive information.

Fairness

BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences

1 code implementation14 Mar 2024 Ao Sun, Weilin Zhao, Xu Han, Cheng Yang, Zhiyuan Liu, Chuan Shi, Maosong Sun

Effective attention modules have played a crucial role in the success of Transformer-based large language models (LLMs), but the quadratic time and memory complexities of these attention modules also pose a challenge when processing long sequences.

Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication

1 code implementation28 Feb 2024 Weize Chen, Chenfei Yuan, Jiarui Yuan, Yusheng Su, Chen Qian, Cheng Yang, Ruobing Xie, Zhiyuan Liu, Maosong Sun

Natural language (NL) has long been the predominant format for human cognition and communication, and by extension, has been similarly pivotal in the development and application of Large Language Models (LLMs).

Endowing Pre-trained Graph Models with Provable Fairness

1 code implementation19 Feb 2024 Zhongjian Zhang, Mengmei Zhang, Yue Yu, Cheng Yang, Jiawei Liu, Chuan Shi

Furthermore, with GraphPAR, we quantify whether the fairness of each node is provable, i. e., predictions are always fair within a certain range of sensitive attribute semantics.

Attribute Fairness +1

GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks

1 code implementation11 Feb 2024 Mengmei Zhang, Mingwei Sun, Peng Wang, Shen Fan, Yanhu Mo, Xiaoxiao Xu, Hong Liu, Cheng Yang, Chuan Shi

Large language models (LLMs) like ChatGPT, exhibit powerful zero-shot and instruction-following capabilities, have catalyzed a revolutionary transformation across diverse fields, especially for open-ended tasks.

Graph Question Answering Instruction Following +4

AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents

2 code implementations24 Jan 2024 Chang Ma, Junlei Zhang, Zhihao Zhu, Cheng Yang, Yujiu Yang, Yaohui Jin, Zhenzhong Lan, Lingpeng Kong, Junxian He

Evaluating large language models (LLMs) as general-purpose agents is essential for understanding their capabilities and facilitating their integration into practical applications.

Benchmarking

AllSpark: A Multimodal Spatio-Temporal General Intelligence Model with Thirteen Modalities

no code implementations31 Dec 2023 Run Shao, Cheng Yang, Qiujun Li, Qing Zhu, Yongjun Zhang, Yansheng Li, Yu Liu, Yong Tang, Dapeng Liu, Shizhong Yang, Haifeng Li

We introduce the Language as Reference Framework (LaRF), a fundamental principle for constructing a multimodal unified model, aiming to strike a trade-off between the cohesion and autonomy among different modalities.

Experiential Co-Learning of Software-Developing Agents

1 code implementation28 Dec 2023 Chen Qian, Yufan Dang, Jiahao Li, Wei Liu, Zihao Xie, Yifei Wang, Weize Chen, Cheng Yang, Xin Cong, Xiaoyin Che, Zhiyuan Liu, Maosong Sun

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents.

Graph Invariant Learning with Subgraph Co-mixup for Out-Of-Distribution Generalization

1 code implementation18 Dec 2023 Tianrui Jia, Haoyang Li, Cheng Yang, Tao Tao, Chuan Shi

In this paper, we propose a novel graph invariant learning method based on invariant and variant patterns co-mixup strategy, which is capable of jointly generating mixed multiple environments and capturing invariant patterns from the mixed graph data.

Graph Representation Learning Out-of-Distribution Generalization

Improving Vision-and-Language Reasoning via Spatial Relations Modeling

no code implementations9 Nov 2023 Cheng Yang, Rui Xu, Ye Guo, Peixiang Huang, Yiru Chen, Wenkui Ding, Zhongyuan Wang, Hong Zhou

Further, we design two pre-training tasks named object position regression (OPR) and spatial relation classification (SRC) to learn to reconstruct the spatial relation graph respectively.

Position regression Relation +3

Specialist or Generalist? Instruction Tuning for Specific NLP Tasks

no code implementations23 Oct 2023 Chufan Shi, Yixuan Su, Cheng Yang, Yujiu Yang, Deng Cai

Although instruction tuning has proven to be a data-efficient method for transforming LLMs into such generalist models, their performance still lags behind specialist models trained exclusively for specific tasks.

Specificity

Towards Graph Foundation Models: A Survey and Beyond

no code implementations18 Oct 2023 Jiawei Liu, Cheng Yang, Zhiyuan Lu, Junze Chen, Yibo Li, Mengmei Zhang, Ting Bai, Yuan Fang, Lichao Sun, Philip S. Yu, Chuan Shi

Foundation models have emerged as critical components in a variety of artificial intelligence applications, and showcase significant success in natural language processing and several other domains.

Graph Learning Survey

Data-centric Graph Learning: A Survey

no code implementations8 Oct 2023 Yuxin Guo, Deyu Bo, Cheng Yang, Zhiyuan Lu, Zhongjian Zhang, Jixi Liu, Yufei Peng, Chuan Shi

Recently, instead of designing more complex neural architectures as model-centric approaches, the attention of AI community has shifted to data-centric ones, which focuses on better processing data to strengthen the ability of neural models.

Graph Learning Survey

A Stochastic Online Forecast-and-Optimize Framework for Real-Time Energy Dispatch in Virtual Power Plants under Uncertainty

no code implementations15 Sep 2023 Wei Jiang, Zhongkai Yi, Li Wang, Hanwei Zhang, Jihai Zhang, Fangquan Lin, Cheng Yang

Aggregating distributed energy resources in power systems significantly increases uncertainties, in particular caused by the fluctuation of renewable energy generation.

Data Augmentation energy management +2

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

1 code implementation21 Aug 2023 Weize Chen, Yusheng Su, Jingwei Zuo, Cheng Yang, Chenfei Yuan, Chi-Min Chan, Heyang Yu, Yaxi Lu, Yi-Hsin Hung, Chen Qian, Yujia Qin, Xin Cong, Ruobing Xie, Zhiyuan Liu, Maosong Sun, Jie zhou

Autonomous agents empowered by Large Language Models (LLMs) have undergone significant improvements, enabling them to generalize across a broad spectrum of tasks.

NewsDialogues: Towards Proactive News Grounded Conversation

1 code implementation12 Aug 2023 Siheng Li, Yichun Yin, Cheng Yang, Wangjie Jiang, Yiwei Li, Zesen Cheng, Lifeng Shang, Xin Jiang, Qun Liu, Yujiu Yang

In this paper, we propose a novel task, Proactive News Grounded Conversation, in which a dialogue system can proactively lead the conversation based on some key topics of the news.

Response Generation

AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models

no code implementations12 Aug 2023 Siheng Li, Cheng Yang, Yichun Yin, Xinyu Zhu, Zesen Cheng, Lifeng Shang, Xin Jiang, Qun Liu, Yujiu Yang

Information-seeking conversation, which aims to help users gather information through conversation, has achieved great progress in recent years.

Few-Shot Learning Language Modelling

Deep Learning-Based Knowledge Injection for Metaphor Detection: A Comprehensive Review

no code implementations8 Aug 2023 Cheng Yang, Zheng Li, Zhiyue Liu, Qingbao Huang

Finally, we explore the current issues facing knowledge injection methods and provide an outlook on future research directions.

Does Correction Remain A Problem For Large Language Models?

no code implementations3 Aug 2023 Xiaowu Zhang, Xiaotian Zhang, Cheng Yang, Hang Yan, Xipeng Qiu

As large language models, such as GPT, continue to advance the capabilities of natural language processing (NLP), the question arises: does the problem of correction still persist?

Few-Shot Learning

ChatDev: Communicative Agents for Software Development

1 code implementation16 Jul 2023 Chen Qian, Wei Liu, Hongzhang Liu, Nuo Chen, Yufan Dang, Jiahao Li, Cheng Yang, Weize Chen, Yusheng Su, Xin Cong, Juyuan Xu, Dahai Li, Zhiyuan Liu, Maosong Sun

Numerous studies used deep learning to improve specific phases in a waterfall model, such as design, coding, and testing.

Decision Making

An Efficient Virtual Data Generation Method for Reducing Communication in Federated Learning

no code implementations21 Jun 2023 Cheng Yang, Xue Yang, Dongxian Wu, Xiaohu Tang

Then the server aggregates all the proxy datasets to form a central dummy dataset, which is used to finetune aggregated global model.

Federated Learning

Interactive Molecular Discovery with Natural Language

1 code implementation21 Jun 2023 Zheni Zeng, Bangchen Yin, Shipeng Wang, Jiarui Liu, Cheng Yang, Haishen Yao, Xingzhi Sun, Maosong Sun, Guotong Xie, Zhiyuan Liu

Natural language is expected to be a key medium for various human-machine interactions in the era of large language models.

Property Prediction

Question Answering as Programming for Solving Time-Sensitive Questions

1 code implementation23 May 2023 Xinyu Zhu, Cheng Yang, Bei Chen, Siheng Li, Jian-Guang Lou, Yujiu Yang

Question answering plays a pivotal role in human daily life because it involves our acquisition of knowledge about the world.

Natural Language Understanding Question Answering

Real-World Denoising via Diffusion Model

no code implementations8 May 2023 Cheng Yang, Lijing Liang, Zhixun Su

We introduce a diffusion process with linear interpolation, and the intermediate noisy image is interpolated from the original clean image and the corresponding real-world noisy image, so that this diffusion model can handle the level of added noise.

Image Denoising Image Generation

Abnormal Event Detection via Hypergraph Contrastive Learning

no code implementations2 Apr 2023 Bo Yan, Cheng Yang, Chuan Shi, Jiawei Liu, Xiaochen Wang

AEHCL designs the intra-event and inter-event contrastive modules to exploit self-supervised AHIN information.

Contrastive Learning Event Detection +1

Graph Mining for Cybersecurity: A Survey

no code implementations2 Apr 2023 Bo Yan, Cheng Yang, Chuan Shi, Yong Fang, Qi Li, Yanfang Ye, Junping Du

In recent years, with the proliferation of graph mining techniques, many researchers investigated these techniques for capturing correlations between cyber entities and achieving high performance.

Graph Mining Survey

Neural Partial Differential Equations with Functional Convolution

no code implementations10 Mar 2023 Ziqian Wu, Xingzhe He, Yijun Li, Cheng Yang, Rui Liu, Shiying Xiong, Bo Zhu

We present a lightweighted neural PDE representation to discover the hidden structure and predict the solution of different nonlinear PDEs.

Vision Learners Meet Web Image-Text Pairs

no code implementations17 Jan 2023 Bingchen Zhao, Quan Cui, Hao Wu, Osamu Yoshie, Cheng Yang, Oisin Mac Aodha

In this work, given the excellent scalability of web data, we consider self-supervised pre-training on noisy web sourced image-text paired data.

Benchmarking Self-Supervised Learning +1

MA-GCL: Model Augmentation Tricks for Graph Contrastive Learning

1 code implementation14 Dec 2022 Xumeng Gong, Cheng Yang, Chuan Shi

We argue that typical data augmentation techniques (e. g., edge dropping) in GCL cannot generate diverse enough contrastive views to filter out noises.

Contrastive Learning Data Augmentation +2

xTrimoABFold: De novo Antibody Structure Prediction without MSA

no code implementations30 Nov 2022 Yining Wang, Xumeng Gong, Shaochuan Li, Bing Yang, YiWu Sun, Chuan Shi, Yangang Wang, Cheng Yang, Hui Li, Le Song

Its improvement in both accuracy and efficiency makes it a valuable tool for de novo antibody design and could make further improvements in immuno-theory.

Computational Efficiency Protein Language Model

KDD CUP 2022 Wind Power Forecasting Team 88VIP Solution

no code implementations18 Aug 2022 Fangquan Lin, Wei Jiang, Hanwei Zhang, Cheng Yang

KDD CUP 2022 proposes a time-series forecasting task on spatial dynamic wind power dataset, in which the participants are required to predict the future generation given the historical context factors.

Imputation Time Series +1

GUIM -- General User and Item Embedding with Mixture of Representation in E-commerce

no code implementations2 Jul 2022 Chao Yang, Ru He, Fangquan Lin, Suoyuan Song, Jingqiao Zhang, Cheng Yang

Our goal is to build general representation (embedding) for each user and each product item across Alibaba's businesses, including Taobao and Tmall which are among the world's biggest e-commerce websites.

Contrastive Learning Demand Forecasting +1

Improving Contrastive Learning of Sentence Embeddings with Case-Augmented Positives and Retrieved Negatives

1 code implementation6 Jun 2022 Wei Wang, Liangzhu Ge, Jingqiao Zhang, Cheng Yang

Following SimCSE, contrastive learning based methods have achieved the state-of-the-art (SOTA) performance in learning sentence embeddings.

Attribute Contrastive Learning +5

An Adaptive Contrastive Learning Model for Spike Sorting

no code implementations24 May 2022 Lang Qian, Shengjie Zheng, Chunshan Deng, Cheng Yang, Xiaojian Li

But for BCIs used in neuroscience research, it is important to separate out the activity of individual neurons.

Binary Classification Brain Computer Interface +3

Spatial Autoregressive Coding for Graph Neural Recommendation

no code implementations19 May 2022 Jiayi Zheng, Ling Yang, Heyuan Wang, Cheng Yang, Yinghong Li, Xiaowei Hu, Shenda Hong

To adequately leverage neighbor proximity and high-order information, we design a novel spatial autoregressive paradigm.

Graph Embedding

Space4HGNN: A Novel, Modularized and Reproducible Platform to Evaluate Heterogeneous Graph Neural Network

1 code implementation18 Feb 2022 Tianyu Zhao, Cheng Yang, Yibo Li, Quan Gan, Zhenyi Wang, Fengqi Liang, Huan Zhao, Yingxia Shao, Xiao Wang, Chuan Shi

Heterogeneous Graph Neural Network (HGNN) has been successfully employed in various tasks, but we cannot accurately know the importance of different design dimensions of HGNNs due to diverse architectures and applied scenarios.

Graph Neural Network

PartImageNet: A Large, High-Quality Dataset of Parts

1 code implementation2 Dec 2021 Ju He, Shuo Yang, Shaokang Yang, Adam Kortylewski, Xiaoding Yuan, Jie-Neng Chen, Shuai Liu, Cheng Yang, Qihang Yu, Alan Yuille

To help address this problem, we propose PartImageNet, a large, high-quality dataset with part segmentation annotations.

Activity Recognition Few-Shot Learning +6

Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration

2 code implementations NeurIPS 2021 Xiao Wang, Hongrui Liu, Chuan Shi, Cheng Yang

Specifically, we first verify that the confidence distribution in a graph has homophily property, and this finding inspires us to design a calibration GNN model (CaGCN) to learn the calibration function.

Unfolding Projection-free SDP Relaxation of Binary Graph Classifier via GDPA Linearization

no code implementations10 Sep 2021 Cheng Yang, Gene Cheung, Wai-tian Tan, Guangtao Zhai

Algorithm unfolding creates an interpretable and parsimonious neural network architecture by implementing each iteration of a model-based algorithm as a neural layer.

Hand Image Understanding via Deep Multi-Task Learning

1 code implementation ICCV 2021 Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, Ji Liu

To further improve the performance of these tasks, we propose a novel Hand Image Understanding (HIU) framework to extract comprehensive information of the hand object from a single RGB image, by jointly considering the relationships between these tasks.

3D Hand Pose Estimation Multi-Task Learning +1

OLR 2021 Challenge: Datasets, Rules and Baselines

no code implementations23 Jul 2021 Binling Wang, Wenxuan Hu, Jing Li, Yiming Zhi, Zheng Li, Qingyang Hong, Lin Li, Dong Wang, Liming Song, Cheng Yang

In addition to the Language Identification (LID) tasks, multilingual Automatic Speech Recognition (ASR) tasks are introduced to OLR 2021 Challenge for the first time.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Copy and Paste method based on Pose for Re-identification

no code implementations22 Jul 2021 Cheng Yang

To solve the aforementioned problem, a simple and effective method to generate images in several new scenarios was proposed, which is names the Copy and Paste method based on Pose(CPP).

Evaluating Modules in Graph Contrastive Learning

1 code implementation15 Jun 2021 Ganqu Cui, Yufeng Du, Cheng Yang, Jie zhou, Liang Xu, Xing Zhou, Xingyi Cheng, Zhiyuan Liu

The recent emergence of contrastive learning approaches facilitates the application on graph representation learning (GRL), introducing graph contrastive learning (GCL) into the literature.

Contrastive Learning Graph Classification +1

SAS: Self-Augmentation Strategy for Language Model Pre-training

1 code implementation14 Jun 2021 Yifei Xu, Jingqiao Zhang, Ru He, Liangzhu Ge, Chao Yang, Cheng Yang, Ying Nian Wu

In this paper, we propose a self-augmentation strategy (SAS) where a single network is utilized for both regular pre-training and contextualized data augmentation for the training in later epochs.

Data Augmentation Language Modelling +2

Projection-free Graph-based Classifier Learning using Gershgorin Disc Perfect Alignment

no code implementations NeurIPS 2021 Cheng Yang, Gene Cheung, Guangtao Zhai

We repose the SDR dual for solution $\bar{\mathbf{H}}$, then replace the PSD cone constraint $\bar{\mathbf{H}} \succeq 0$ with linear constraints derived from GDPA -- sufficient conditions to ensure $\bar{\mathbf{H}}$ is PSD -- so that the optimization becomes an LP per iteration.

Rethinking Re-Sampling in Imbalanced Semi-Supervised Learning

1 code implementation1 Jun 2021 Ju He, Adam Kortylewski, Shaokang Yang, Shuai Liu, Cheng Yang, Changhu Wang, Alan Yuille

In particular, we decouple the training of the representation and the classifier, and systematically investigate the effects of different data re-sampling techniques when training the whole network including a classifier as well as fine-tuning the feature extractor only.

Markdowns in E-Commerce Fresh Retail: A Counterfactual Prediction and Multi-Period Optimization Approach

no code implementations18 May 2021 Junhao Hua, Ling Yan, Huan Xu, Cheng Yang

In this paper, by leveraging abundant observational transaction data, we propose a novel data-driven and interpretable pricing approach for markdowns, consisting of counterfactual prediction and multi-period price optimization.

counterfactual

Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image

1 code implementation16 Apr 2021 Cheng Yang, Jia Zheng, Xili Dai, Rui Tang, Yi Ma, Xiaojun Yuan

Single-image room layout reconstruction aims to reconstruct the enclosed 3D structure of a room from a single image.

Room Layout Estimation

TransFG: A Transformer Architecture for Fine-grained Recognition

2 code implementations14 Mar 2021 Ju He, Jie-Neng Chen, Shuai Liu, Adam Kortylewski, Cheng Yang, Yutong Bai, Changhu Wang

Fine-grained visual classification (FGVC) which aims at recognizing objects from subcategories is a very challenging task due to the inherently subtle inter-class differences.

Fine-Grained Image Classification

Extract the Knowledge of Graph Neural Networks and Go Beyond it: An Effective Knowledge Distillation Framework

1 code implementation4 Mar 2021 Cheng Yang, Jiawei Liu, Chuan Shi

Our framework extracts the knowledge of an arbitrary learned GNN model (teacher model), and injects it into a well-designed student model.

Knowledge Distillation Node Classification

Re-rank Coarse Classification with Local Region Enhanced Features for Fine-Grained Image Recognition

no code implementations19 Feb 2021 Shaokang Yang, Shuai Liu, Cheng Yang, Changhu Wang

In this paper, a retrieval-based coarse-to-fine framework is proposed, where we re-rank the TopN classification results by using the local region enhanced embedding features to improve the Top1 accuracy (based on the observation that the correct category usually resides in TopN results).

Fine-Grained Image Classification Fine-Grained Image Recognition +2

Incorporating Vision Bias into Click Models for Image-oriented Search Engine

no code implementations7 Jan 2021 Ningxin Xu, Cheng Yang, Yixin Zhu, Xiaowei Hu, Changhu Wang

Most typical click models assume that the probability of a document to be examined by users only depends on position, such as PBM and UBM.

Position

CoRe: An Efficient Coarse-refined Training Framework for BERT

no code implementations27 Nov 2020 Cheng Yang, Shengnan Wang, Yuechuan Li, Chao Yang, Ming Yan, Jingqiao Zhang, Fangquan Lin

In the second phase, we transform the trained relaxed BERT model into the original BERT and further retrain the model.

Progressively Stacking 2.0: A Multi-stage Layerwise Training Method for BERT Training Speedup

no code implementations27 Nov 2020 Cheng Yang, Shengnan Wang, Chao Yang, Yuechuan Li, Ru He, Jingqiao Zhang

In BERT training, the backward computation is much more time-consuming than the forward computation, especially in the distributed training setting in which the backward computation time further includes the communication time for gradient synchronization.

Nonseparable Symplectic Neural Networks

no code implementations ICLR 2021 Shiying Xiong, Yunjin Tong, Xingzhe He, Shuqi Yang, Cheng Yang, Bo Zhu

The enabling mechanics of our approach is an augmented symplectic time integrator to decouple the position and momentum energy terms and facilitate their evolution.

Position

Identification of deep breath while moving forward based on multiple body regions and graph signal analysis

no code implementations20 Oct 2020 Yunlu Wang, Cheng Yang, Menghan Hu, Jian Zhang, Qingli Li, Guangtao Zhai, Xiao-Ping Zhang

This paper presents an unobtrusive solution that can automatically identify deep breath when a person is walking past the global depth camera.

Few-shot Knowledge Transfer for Fine-grained Cartoon Face Generation

4 code implementations27 Jul 2020 Nan Zhuang, Cheng Yang

In this paper, we are interested in generating fine-grained cartoon faces for various groups.

Face Generation Transfer Learning +1

Adaptive Graph Encoder for Attributed Graph Embedding

1 code implementation3 Jul 2020 Ganqu Cui, Jie zhou, Cheng Yang, Zhiyuan Liu

Experimental results show that AGE consistently outperforms state-of-the-art graph embedding methods considerably on these tasks.

Clustering Graph Embedding +2

Graph Neural News Recommendation with Unsupervised Preference Disentanglement

1 code implementation ACL 2020 Linmei Hu, Siyong Xu, Chen Li, Cheng Yang, Chuan Shi, Nan Duan, Xing Xie, Ming Zhou

Furthermore, the learned representations are disentangled with latent preference factors by a neighborhood routing algorithm, which can enhance expressiveness and interpretability.

Disentanglement News Recommendation

Signed Graph Metric Learning via Gershgorin Disc Perfect Alignment

1 code implementation15 Jun 2020 Cheng Yang, Gene Cheung, Wei Hu

Given a convex and differentiable objective $Q(\M)$ for a real symmetric matrix $\M$ in the positive definite (PD) cone -- used to compute Mahalanobis distances -- we propose a fast general metric learning framework that is entirely projection-free.

Binary Classification Metric Learning

AP20-OLR Challenge: Three Tasks and Their Baselines

no code implementations4 Jun 2020 Zheng Li, Miao Zhao, Qingyang Hong, Lin Li, Zhiyuan Tang, Dong Wang, Li-Ming Song, Cheng Yang

Based on Kaldi and Pytorch, recipes for i-vector and x-vector systems are also conducted as baselines for the three tasks.

Dialect Identification

Fast Network Embedding Enhancement via High Order Proximity Approximation

2 code implementations ‏‏‎ ‎ 2020 Cheng Yang, Maosong Sun, Zhiyuan Liu, Cunchao Tu

Many Network Representation Learning (NRL) methods have been proposed to learn vector representations for vertices in a network recently.

Dimensionality Reduction Link Prediction +3

KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion

1 code implementation Findings (ACL) 2021 Jie Zhou, Shengding Hu, Xin Lv, Cheng Yang, Zhiyuan Liu, Wei Xu, Jie Jiang, Juanzi Li, Maosong Sun

Based on the datasets, we propose novel tasks such as multi-hop knowledge abstraction (MKA), multi-hop knowledge concretization (MKC) and then design a comprehensive benchmark.

Knowledge Graphs Transfer Learning

DCNAS: Densely Connected Neural Architecture Search for Semantic Image Segmentation

no code implementations CVPR 2021 Xiong Zhang, Hongmin Xu, Hong Mo, Jianchao Tan, Cheng Yang, Lei Wang, Wenqi Ren

Neural Architecture Search (NAS) has shown great potentials in automatically designing scalable network architectures for dense image predictions.

Ranked #13 on Semantic Segmentation on Cityscapes test (using extra training data)

Image Segmentation Neural Architecture Search +1

MixPoet: Diverse Poetry Generation via Learning Controllable Mixed Latent Space

no code implementations13 Mar 2020 Xiaoyuan Yi, Ruoyu Li, Cheng Yang, Wenhao Li, Maosong Sun

Though recent neural models make prominent progress in some criteria of poetry quality, generated poems still suffer from the problem of poor diversity.

Diversity

Graph Metric Learning via Gershgorin Disc Alignment

no code implementations28 Jan 2020 Cheng Yang, Gene Cheung, Wei Hu

We propose a fast general projection-free metric learning framework, where the minimization objective $\min_{\textbf{M} \in \mathcal{S}} Q(\textbf{M})$ is a convex differentiable function of the metric matrix $\textbf{M}$, and $\textbf{M}$ resides in the set $\mathcal{S}$ of generalized graph Laplacian matrices for connected graphs with positive edge weights and node degrees.

Metric Learning

Graph Neural News Recommendation with Long-term and Short-term Interest Modeling

no code implementations30 Oct 2019 Linmei Hu, Chen Li, Chuan Shi, Cheng Yang, Chao Shao

Existing methods on news recommendation mainly include collaborative filtering methods which rely on direct user-item interactions and content based methods which characterize the content of user reading history.

Collaborative Filtering News Recommendation +1

Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes

1 code implementation20 Oct 2019 Yujia Qin, Fanchao Qi, Sicong Ouyang, Zhiyuan Liu, Cheng Yang, Yasheng Wang, Qun Liu, Maosong Sun

Sememes, the minimum semantic units of human languages, have been successfully utilized in various natural language processing applications.

Adversarial Attack Language Modelling +2

GEAR: Graph-based Evidence Aggregating and Reasoning for Fact Verification

2 code implementations ACL 2019 Jie Zhou, Xu Han, Cheng Yang, Zhiyuan Liu, LiFeng Wang, Changcheng Li, Maosong Sun

Fact verification (FV) is a challenging task which requires to retrieve relevant evidence from plain text and use the evidence to verify given claims.

Fact Verification

Jiuge: A Human-Machine Collaborative Chinese Classical Poetry Generation System

no code implementations ACL 2019 Guo Zhipeng, Xiaoyuan Yi, Maosong Sun, Wenhao Li, Cheng Yang, Jiannan Liang, Huimin Chen, Yuhui Zhang, Ruoyu Li

By exposing the options of poetry genres, styles and revision modes, Jiuge, acting as a professional assistant, allows constant and active participation of users in poetic creation.

Cultural Vocal Bursts Intensity Prediction

A Unified Framework for Marketing Budget Allocation

no code implementations4 Feb 2019 Kui Zhao, Junhao Hua, Ling Yan, Qi Zhang, Huan Xu, Cheng Yang

In our approach, a semi-black-box model is built to forecast the dynamic market response and an efficient optimization method is proposed to solve the complex allocation task.

Decision Making Marketing

Neural Diffusion Model for Microscopic Cascade Prediction

1 code implementation21 Dec 2018 Cheng Yang, Maosong Sun, Haoran Liu, Shiyi Han, Zhiyuan Liu, Huanbo Luan

The strong assumptions oversimplify the complex diffusion mechanism and prevent these models from better fitting real-world cascade data.

Social and Information Networks Physics and Society

Graph Neural Networks: A Review of Methods and Applications

5 code implementations20 Dec 2018 Jie Zhou, Ganqu Cui, Shengding Hu, Zhengyan Zhang, Cheng Yang, Zhiyuan Liu, LiFeng Wang, Changcheng Li, Maosong Sun

Lots of learning tasks require dealing with graph data which contains rich relation information among elements.

Graph Attention Graph Neural Network

CED: Credible Early Detection of Social Media Rumors

1 code implementation10 Nov 2018 Changhe Song, Cunchao Tu, Cheng Yang, Zhiyuan Liu, Maosong Sun

By regarding all reposts to a rumor candidate as a sequence, the proposed model will seek an early point-in-time for making a credible prediction.

Social and Information Networks

Stylistic Chinese Poetry Generation via Unsupervised Style Disentanglement

no code implementations EMNLP 2018 Cheng Yang, Maosong Sun, Xiaoyuan Yi, Wenhao Li

The ability to write diverse poems in different styles under the same poetic imagery is an important characteristic of human poetry writing.

Disentanglement Machine Translation +1

3D Point Cloud Denoising using Graph Laplacian Regularization of a Low Dimensional Manifold Model

no code implementations20 Mar 2018 Jin Zeng, Gene Cheung, Michael Ng, Jiahao Pang, Cheng Yang

Due to discrete observations of the patches on the manifold, we approximate the manifold dimension computation defined in the continuous domain with a patch-based graph Laplacian regularizer and propose a new discrete patch distance measure to quantify the similarity between two same-sized surface patches for graph construction that is robust to noise.

Denoising graph construction +2

Adaptive Recurrent Neural Network Based on Mixture Layer

no code implementations24 Jan 2018 Kui Zhao, Yuechuan Li, Chi Zhang, Cheng Yang, Huan Xu

By leveraging the mixture layer, the proposed method can adaptively update states according to the similarities between encoded inputs and prototype vectors, leading to a stronger capacity in assimilating sequences with multiple patterns.

Learning and Transferring IDs Representation in E-commerce

no code implementations22 Dec 2017 Kui Zhao, Yuechuan Li, Zhaoqian Shuai, Cheng Yang

Many machine intelligence techniques are developed in E-commerce and one of the most essential components is the representation of IDs, including user ID, item ID, product ID, store ID, brand ID, category ID etc.

Comprehend DeepWalk as Matrix Factorization

no code implementations2 Jan 2015 Cheng Yang, Zhiyuan Liu

Word2vec, as an efficient tool for learning vector representation of words has shown its effectiveness in many natural language processing tasks.

Real-time Topic-aware Influence Maximization Using Preprocessing

no code implementations1 Mar 2014 Wei Chen, Tian Lin, Cheng Yang

In this paper, we focus on the topic-aware influence maximization task.

Cannot find the paper you are looking for? You can Submit a new open access paper.