Search Results for author: Yaming Yang

Found 29 papers, 12 papers with code

Enhancing Self-Attention with Knowledge-Assisted Attention Maps

no code implementations NAACL 2022 Jiangang Bai, Yujing Wang, Hong Sun, Ruonan Wu, Tianmeng Yang, Pengfei Tang, Defu Cao, Mingliang Zhang, Yunhai Tong, Yaming Yang, Jing Bai, Ruofei Zhang, Hao Sun, Wei Shen

Large-scale pre-trained language models have attracted extensive attention in the research community and have shown promising results on various natural language processing tasks.

Multi-Task Learning · Natural Language Understanding

Deep Learning-Based Detection for Marker Codes over Insertion and Deletion Channels

no code implementations 2 Jan 2024 Guochen Ma, Xiaopeng Jiao, Jianjun Mu, Hui Han, Yaming Yang

In this paper, we propose two CSI-agnostic detection algorithms for marker codes based on deep learning.

NodeMixup: Tackling Under-Reaching for Graph Neural Networks

1 code implementation 20 Dec 2023 Weigang Lu, Ziyu Guan, Wei Zhao, Yaming Yang, Long Jin

However, because labeled nodes are unevenly distributed across the graph, they are reachable from only a small portion of unlabeled nodes, leading to the under-reaching issue.

Node Classification
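
As a rough illustration of the mixup idea behind the NodeMixup entry above (the pairing strategy, Beta parameter, and function name are assumptions, not the paper's exact design), the sketch below interpolates features and soft labels of node pairs so that supervision from labeled nodes can reach parts of the graph it would otherwise not influence.

```python
# Hypothetical node-level mixup sketch; not the paper's exact NodeMixup scheme.
import torch
import torch.nn.functional as F

def node_mixup(x, y_soft, pairs, alpha=1.0):
    # x: [N, F] node features; y_soft: [N, C] soft labels (one-hot or predictions)
    # pairs: [M, 2] node index pairs to mix; alpha: Beta distribution parameter
    lam = torch.distributions.Beta(alpha, alpha).sample((pairs.size(0), 1))
    i, j = pairs[:, 0], pairs[:, 1]
    x_mix = lam * x[i] + (1 - lam) * x[j]
    y_mix = lam * y_soft[i] + (1 - lam) * y_soft[j]
    return x_mix, y_mix

# Usage: mix labeled nodes with unlabeled nodes that carry model predictions.
x = torch.randn(6, 16)                                           # 6 nodes, 16-dim features
y = F.one_hot(torch.tensor([0, 1, 2, 0, 1, 2]), num_classes=3).float()
x_mix, y_mix = node_mixup(x, y, pairs=torch.tensor([[0, 3], [1, 4]]))
```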

Self-Supervised Multi-Modal Sequential Recommendation

1 code implementation 26 Apr 2023 Kunzhe Song, Qingfeng Sun, Can Xu, Kai Zheng, Yaming Yang

To address this issue, we propose a dual-tower retrieval architecture for sequential recommendation.

Contrastive Learning · Retrieval +1
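
The dual-tower retrieval architecture mentioned in the entry above can be pictured with a minimal, generic two-tower sketch: one tower encodes the interaction sequence, the other encodes candidate items, and relevance is the inner product of the two embeddings. The module and parameter names are assumptions, and the paper's multi-modal item encoder is not reproduced here.

```python
# Generic two-tower (dual-tower) retrieval sketch; illustrative only.
import torch
import torch.nn as nn

class DualTowerSketch(nn.Module):
    def __init__(self, num_items, dim=64):
        super().__init__()
        self.item_emb = nn.Embedding(num_items, dim)            # item tower (ID-based here)
        self.seq_encoder = nn.GRU(dim, dim, batch_first=True)   # sequence (user) tower

    def forward(self, seq, candidates):
        # seq: [B, L] interacted item ids; candidates: [B, K] candidate item ids
        _, h = self.seq_encoder(self.item_emb(seq))              # h: [1, B, dim]
        user_vec = h.squeeze(0)                                  # [B, dim]
        cand_vec = self.item_emb(candidates)                     # [B, K, dim]
        return torch.einsum('bd,bkd->bk', user_vec, cand_vec)   # scores: [B, K]

model = DualTowerSketch(num_items=1000)
scores = model(torch.randint(0, 1000, (2, 5)), torch.randint(0, 1000, (2, 10)))
```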

Pseudo Contrastive Learning for Graph-based Semi-supervised Learning

no code implementations 19 Feb 2023 Weigang Lu, Ziyu Guan, Wei Zhao, Yaming Yang, Yuanhai Lv, Lining Xing, Baosheng Yu, DaCheng Tao

Pseudo Labeling is a technique used to improve the performance of semi-supervised Graph Neural Networks (GNNs) by generating additional pseudo-labels based on confident predictions.

Contrastive Learning · Data Augmentation
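
A minimal sketch of the confidence-thresholded pseudo-labeling step described in the entry above; the threshold and function name are assumptions, and the paper's contrastive treatment of the resulting pseudo-labels is not shown.

```python
# Keep only unlabeled nodes whose top predicted probability exceeds a threshold
# and treat the argmax class as a pseudo-label.
import torch

def generate_pseudo_labels(logits, unlabeled_idx, threshold=0.9):
    # logits: [N, C] model outputs for all nodes; returns the indices of confident
    # unlabeled nodes and their argmax pseudo-labels.
    probs = torch.softmax(logits[unlabeled_idx], dim=-1)
    conf, labels = probs.max(dim=-1)
    keep = conf >= threshold
    return unlabeled_idx[keep], labels[keep]

logits = torch.randn(100, 7)                 # e.g. 100 nodes, 7 classes
unlabeled_idx = torch.arange(20, 100)        # nodes without ground-truth labels
confident_idx, pseudo_labels = generate_pseudo_labels(logits, unlabeled_idx)
```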

Convolution-enhanced Evolving Attention Networks

1 code implementation 16 Dec 2022 Yujing Wang, Yaming Yang, Zhuo Li, Jiangang Bai, Mingliang Zhang, Xiangtai Li, Jing Yu, Ce Zhang, Gao Huang, Yunhai Tong

To the best of our knowledge, this is the first work that explicitly models the layer-wise evolution of attention maps.

Image Classification · Machine Translation +3

Self-supervised Heterogeneous Graph Pre-training Based on Structural Clustering

1 code implementation 19 Oct 2022 Yaming Yang, Ziyu Guan, Zhe Wang, Wei Zhao, Cai Xu, Weigang Lu, Jianbin Huang

The two modules can effectively utilize and enhance each other, encouraging the model to learn discriminative embeddings.

Clustering

Entropy Induced Pruning Framework for Convolutional Neural Networks

no code implementations 13 Aug 2022 Yiheng Lu, Ziyu Guan, Yaming Yang, Maoguo Gong, Wei Zhao, Kaiyuan Feng

By leveraging the proposed AFIE, the framework yields a stable importance evaluation for each filter regardless of whether the original model is fully trained.

Image Classification

Binary Classification with Positive Labeling Sources

no code implementations 2 Aug 2022 Jieyu Zhang, Yujing Wang, Yaming Yang, Yang Luo, Alexander Ratner

Thus, in this work, we study the application of weak supervision (WS) to binary classification tasks with only positive labeling sources.

Benchmarking · Binary Classification +1

Privacy-preserving Online AutoML for Domain-Specific Face Detection

no code implementations CVPR 2022 Chenqian Yan, Yuge Zhang, Quanlu Zhang, Yaming Yang, Xinyang Jiang, Yuqing Yang, Baoyuan Wang

Thanks to HyperFD, each local task (client) is able to effectively leverage the learning "experience" of previous tasks without uploading raw images to the platform; meanwhile, the meta-feature extractor is continuously learned to better trade off the bias and variance.

AutoML · Face Detection +1

Learning Multi-granularity User Intent Unit for Session-based Recommendation

1 code implementation 25 Dec 2021 Jiayan Guo, Yaming Yang, Xiangchen Song, Yuan Zhang, Yujing Wang, Jing Bai, Yan Zhang

Specifically, we propose a Multi-granularity Intent Heterogeneous Session Graph, which captures the interactions between intent units of different granularities and eases the burden of modeling long-range dependencies.

Session-Based Recommendations

SkipNode: On Alleviating Performance Degradation for Deep Graph Convolutional Networks

no code implementations 22 Dec 2021 Weigang Lu, Yibing Zhan, Binbin Lin, Ziyu Guan, Liu Liu, Baosheng Yu, Wei Zhao, Yaming Yang, DaCheng Tao

In this paper, we conduct theoretical and experimental analyses to explore the fundamental causes of performance degradation in deep GCNs: over-smoothing and gradient vanishing have a mutually reinforcing effect that makes performance deteriorate more quickly as depth increases.

Link Prediction · Node Classification

Multimodal Dialogue Response Generation

no code implementations ACL 2022 Qingfeng Sun, Yujing Wang, Can Xu, Kai Zheng, Yaming Yang, Huang Hu, Fei Xu, Jessica Zhang, Xiubo Geng, Daxin Jiang

In such a low-resource setting, we devise a novel conversational agent, Divter, in order to isolate parameters that depend on multimodal dialogues from the entire generation model.

Dialogue Generation · Response Generation +1

Graph Pointer Neural Networks

no code implementations 3 Oct 2021 Tianmeng Yang, Yujing Wang, Zhihan Yue, Yaming Yang, Yunhai Tong, Jing Bai

On the one hand, multi-hop-based approaches do not explicitly distinguish relevant nodes from a large number of multi-hop neighborhoods, leading to a severe over-smoothing problem.

Node Classification

WRENCH: A Comprehensive Benchmark for Weak Supervision

1 code implementation 23 Sep 2021 Jieyu Zhang, Yue Yu, Yinghao Li, Yujing Wang, Yaming Yang, Mao Yang, Alexander Ratner

To address these problems, we introduce a benchmark platform, WRENCH, for thorough and standardized evaluation of WS approaches.

Attentive Knowledge-aware Graph Convolutional Networks with Collaborative Guidance for Personalized Recommendation

no code implementations 5 Sep 2021 Yankai Chen, Yaming Yang, Yujing Wang, Jing Bai, Xiangchen Song, Irwin King

However, simply integrating KGs into current KG-based RS models does not necessarily guarantee improved recommendation performance and may even weaken the overall model capability.

Click-Through Rate Prediction · Knowledge-Aware Recommendation +1

Syntax-BERT: Improving Pre-trained Transformers with Syntax Trees

1 code implementation EACL 2021 Jiangang Bai, Yujing Wang, Yiren Chen, Yaming Yang, Jing Bai, Jing Yu, Yunhai Tong

Pre-trained language models like BERT achieve superior performance on various NLP tasks without explicitly considering syntactic information.

Natural Language Understanding

Evolving Attention with Residual Convolutions

2 code implementations 20 Feb 2021 Yujing Wang, Yaming Yang, Jiangang Bai, Mingliang Zhang, Jing Bai, Jing Yu, Ce Zhang, Gao Huang, Yunhai Tong

In this paper, we propose a novel and generic mechanism based on evolving attention to improve the performance of transformers.

Image Classification · Machine Translation +2

Predictive Attention Transformer: Improving Transformer with Attention Map Prediction

no code implementations 1 Jan 2021 Yujing Wang, Yaming Yang, Jiangang Bai, Mingliang Zhang, Jing Bai, Jing Yu, Ce Zhang, Yunhai Tong

Instead, we model their dependencies via a chain of prediction models that take the previous layer's attention maps as input and predict the attention maps of the new layer with convolutional neural networks.

Machine Translation
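
A rough sketch of the mechanism described in the entry above: a small convolution over the previous layer's attention maps provides a prior that is mixed with the current layer's query-key logits. The module name, mixing rule, and kernel size are illustrative assumptions, not the paper's implementation.

```python
# Illustrative attention layer that "evolves" the previous layer's attention maps.
import torch
import torch.nn as nn

class ConvEvolvedAttentionSketch(nn.Module):
    def __init__(self, dim, heads=4, alpha=0.5):
        super().__init__()
        self.heads, self.alpha = heads, alpha
        self.scale = (dim // heads) ** -0.5
        self.qkv = nn.Linear(dim, 3 * dim)
        self.attn_conv = nn.Conv2d(heads, heads, kernel_size=3, padding=1)  # map -> map

    def forward(self, x, prev_attn=None):
        B, L, D = x.shape
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        q, k, v = (t.view(B, L, self.heads, -1).transpose(1, 2) for t in (q, k, v))
        logits = (q @ k.transpose(-2, -1)) * self.scale          # [B, H, L, L]
        if prev_attn is not None:                                # evolve the previous map
            logits = self.alpha * logits + (1 - self.alpha) * self.attn_conv(prev_attn)
        attn = logits.softmax(dim=-1)
        out = (attn @ v).transpose(1, 2).reshape(B, L, D)
        return out, attn                                         # feed attn to the next layer

layer = ConvEvolvedAttentionSketch(dim=64)
out1, attn1 = layer(torch.randn(2, 10, 64))
out2, attn2 = layer(out1, prev_attn=attn1)
```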

How Does Supernet Help in Neural Architecture Search?

no code implementations 16 Oct 2020 Yuge Zhang, Quanlu Zhang, Yaming Yang

Weight sharing, as an approach to speed up architecture performance estimation, has received wide attention.

Neural Architecture Search
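
As a toy illustration of the weight-sharing idea referred to in the entry above, the sketch below builds a small "supernet" in which candidate operations share one set of parameters, and any sampled architecture is evaluated with those shared weights instead of being trained from scratch. The layer and operation choices are assumptions, not the setup analyzed in the paper.

```python
# Toy supernet with shared weights; illustrative only.
import random
import torch
import torch.nn as nn

class SuperLayer(nn.Module):
    def __init__(self, dim):
        super().__init__()
        # candidate operations whose weights are shared across all sampled subnets
        self.ops = nn.ModuleList([
            nn.Linear(dim, dim),
            nn.Sequential(nn.Linear(dim, dim), nn.ReLU()),
            nn.Identity(),
        ])

    def forward(self, x, choice):
        return self.ops[choice](x)

supernet = nn.ModuleList([SuperLayer(32) for _ in range(4)])

def run_subnet(x, arch):
    # arch: one operation choice per layer, e.g. [0, 2, 1, 0]
    for layer, choice in zip(supernet, arch):
        x = layer(x, choice)
    return x

arch = [random.randrange(3) for _ in range(4)]    # sample an architecture
out = run_subnet(torch.randn(8, 32), arch)        # evaluate it with shared weights
```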

Interpretable and Efficient Heterogeneous Graph Convolutional Network

1 code implementation 27 May 2020 Yaming Yang, Ziyu Guan, Jian-Xin Li, Wei Zhao, Jiangtao Cui, Quan Wang

However, regarding Heterogeneous Information Networks (HINs), existing HIN-oriented GCN methods still suffer from two deficiencies: (1) they cannot flexibly explore all possible meta-paths and extract the most useful ones for a target object, which hinders both effectiveness and interpretability; (2) they often need to generate intermediate meta-path-based dense graphs, which leads to high computational complexity.

Object

LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression

no code implementations COLING 2020 Yihuan Mao, Yujing Wang, Chufan Wu, Chen Zhang, Yang Wang, Yaming Yang, Quanlu Zhang, Yunhai Tong, Jing Bai

BERT is a cutting-edge language representation model pre-trained on a large corpus, which achieves superior performance on various natural language understanding tasks.

Blocking · Knowledge Distillation +2

Deeper Insights into Weight Sharing in Neural Architecture Search

1 code implementation 6 Jan 2020 Yuge Zhang, Zejun Lin, Junyang Jiang, Quanlu Zhang, Yujing Wang, Hui Xue, Chen Zhang, Yaming Yang

With the success of deep neural networks, Neural Architecture Search (NAS), as an approach to automatic model design, has attracted wide attention.

Neural Architecture Search

DeGNN: Characterizing and Improving Graph Neural Networks with Graph Decomposition

no code implementations 10 Oct 2019 Xupeng Miao, Nezihe Merve Gürel, Wentao Zhang, Zhichao Han, Bo Li, Wei Min, Xi Rao, Hansheng Ren, Yinan Shan, Yingxia Shao, Yujie Wang, Fan Wu, Hui Xue, Yaming Yang, Zitao Zhang, Yang Zhao, Shuai Zhang, Yujing Wang, Bin Cui, Ce Zhang

Despite the wide application of Graph Convolutional Networks (GCNs), one major limitation is that they do not benefit from increasing depth and suffer from the over-smoothing problem.
