Search Results for author: Jianyong Wang

Found 34 papers, 18 papers with code

Are LLM-based Recommenders Already the Best? Simple Scaled Cross-entropy Unleashes the Potential of Traditional Sequential Recommenders

1 code implementation26 Aug 2024 Cong Xu, Zhangchi Zhu, Mo Yu, Jun Wang, Jianyong Wang, Wei zhang

Some studies have observed that LLMs, when fine-tuned by the cross-entropy (CE) loss with a full softmax, could achieve `state-of-the-art' performance in sequential recommendation.

Sequential Recommendation

FocusLLM: Scaling LLM's Context by Parallel Decoding

no code implementations21 Aug 2024 Zhenyu Li, Yike Zhang, Tengyu Pan, Yutao Sun, Zhichao Duan, Junjie Fang, Rong Han, Zixuan Wang, Jianyong Wang

Empowering LLMs with the ability to utilize useful information from a long context is crucial for many downstream applications.

8k Decoder +1

You Only Cache Once: Decoder-Decoder Architectures for Language Models

1 code implementation8 May 2024 Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei

We introduce a decoder-decoder architecture, YOCO, for large language models, which only caches key-value pairs once.

Decoder Retrieval

Understanding the Role of Cross-Entropy Loss in Fairly Evaluating Large Language Model-based Recommendation

no code implementations9 Feb 2024 Cong Xu, Zhangchi Zhu, Jun Wang, Jianyong Wang, Wei zhang

Large language models (LLMs) have gained much attention in the recommendation community; some studies have observed that LLMs, fine-tuned by the cross-entropy loss with a full softmax, could achieve state-of-the-art performance already.

Language Modelling Large Language Model

Interpretable Knowledge Tracing via Response Influence-based Counterfactual Reasoning

1 code implementation1 Dec 2023 Jiajun Cui, Minghe Yu, Bo Jiang, Aimin Zhou, Jianyong Wang, Wei zhang

Knowledge tracing (KT) plays a crucial role in computer-aided education and intelligent tutoring systems, aiming to assess students' knowledge proficiency by predicting their future performance on new questions based on their past response records.

counterfactual Counterfactual Reasoning +1

Learning Interpretable Rules for Scalable Data Representation and Classification

1 code implementation22 Oct 2023 Zhuo Wang, Wei zhang, Ning Liu, Jianyong Wang

Rule-based models, e. g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity.

Classification

Graph-enhanced Optimizers for Structure-aware Recommendation Embedding Evolution

1 code implementation24 Sep 2023 Cong Xu, Jun Wang, Jianyong Wang, Wei zhang

Embedding plays a key role in modern recommender systems because they are virtual representations of real-world entities and the foundation for subsequent decision-making models.

Decision Making Graph Neural Network

FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering

1 code implementation23 Aug 2023 Zhenyu Li, Sunqi Fan, Yu Gu, Xiuxing Li, Zhichao Duan, Bowen Dong, Ning Liu, Jianyong Wang

Knowledge base question answering (KBQA) is a critical yet challenging task due to the vast number of entities within knowledge bases and the diversity of natural language questions posed by users.

Knowledge Base Question Answering

Knowledge-aware Collaborative Filtering with Pre-trained Language Model for Personalized Review-based Rating Prediction

1 code implementation2 Aug 2023 Quanxiu Wang, Xinlei Cao, Jianyong Wang, Wei zhang

For the first issue, to utilize rich knowledge, KCF-PLM develops a transformer network to model the interactions of the extracted aspects w. r. t.

Collaborative Filtering Language Modelling

Retentive Network: A Successor to Transformer for Large Language Models

9 code implementations17 Jul 2023 Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, Furu Wei

In this work, we propose Retentive Network (RetNet) as a foundation architecture for large language models, simultaneously achieving training parallelism, low-cost inference, and good performance.

Language Modelling

Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today

no code implementations2 Jun 2023 Zhuo Wang, Rongzhen Li, Bowen Dong, Jie Wang, Xiuxing Li, Ning Liu, Chenhui Mao, Wei zhang, Liling Dong, Jing Gao, Jianyong Wang

In this paper, we explore the potential of LLMs such as GPT-4 to outperform traditional AI tools in dementia diagnosis.

Bridging the Language Gap: Knowledge Injected Multilingual Question Answering

no code implementations6 Apr 2023 Zhichao Duan, Xiuxing Li, Zhengyan Zhang, Zhenyu Li, Ning Liu, Jianyong Wang

As a popular topic in natural language processing tasks, extractive question answering task (extractive QA) has gained extensive attention in the past few years.

Cross-Lingual Transfer Extractive Question-Answering +3

Optimization Techniques for Unsupervised Complex Table Reasoning via Self-Training Framework

2 code implementations20 Dec 2022 Zhenyu Li, Xiuxing Li, Sunqi Fan, Jianyong Wang

However, constructing labeled data for complex reasoning tasks is labor intensive, and the quantity of annotated data remains insufficient to support the intricate demands of real-world applications.

Data Augmentation Fact Verification +1

Joint Open Knowledge Base Canonicalization and Linking

no code implementations Proceedings of the 2021 International Conference on Management of Data 2021 Yinan Liu, Wei Shen, Yuanfei Wang, Jianyong Wang, Zhenglu Yang, Xiaojie Yuan

However, noun phrases (NPs) and relation phrases (RPs) in OKBs are not canonicalized and often appear in different paraphrased textual variants, which leads to redundant and ambiguous facts.

Open Information Extraction Relation

Learning Entity Linking Features for Emerging Entities

1 code implementation8 Aug 2022 Chenwei Ran, Wei Shen, Jianbo Gao, Yuhan Li, Jianyong Wang, Yantao Jia

Entity linking (EL) is the process of linking entity mentions appearing in text with their corresponding entities in a knowledge base.

Entity Linking

Effective Few-Shot Named Entity Linking by Meta-Learning

1 code implementation12 Jul 2022 Xiuxing Li, Zhenyu Li, Zhengyan Zhang, Ning Liu, Haitao Yuan, Wei zhang, Zhiyuan Liu, Jianyong Wang

In this paper, we endeavor to solve the problem of few-shot entity linking, which only requires a minimal amount of in-domain labeled data and is more practical in real situations.

Entity Linking Knowledge Base Completion +2

Prompt Tuning for Discriminative Pre-trained Language Models

1 code implementation Findings (ACL) 2022 Yuan YAO, Bowen Dong, Ao Zhang, Zhengyan Zhang, Ruobing Xie, Zhiyuan Liu, Leyu Lin, Maosong Sun, Jianyong Wang

Recent works have shown promising results of prompt tuning in stimulating pre-trained language models (PLMs) for natural language processing (NLP) tasks.

Language Modelling Question Answering +2

Data-Free Knowledge Transfer: A Survey

no code implementations31 Dec 2021 Yuang Liu, Wei zhang, Jun Wang, Jianyong Wang

In this paper, we provide a comprehensive survey on data-free knowledge transfer from the perspectives of knowledge distillation and unsupervised domain adaptation, to help readers have a better understanding of the current research status and ideas.

Data-free Knowledge Distillation Model Compression +3

Scalable Rule-Based Representation Learning for Interpretable Classification

2 code implementations NeurIPS 2021 Zhuo Wang, Wei zhang, Ning Liu, Jianyong Wang

Rule-based models, e. g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity.

Classification Representation Learning

Entity Linking Meets Deep Learning: Techniques and Solutions

no code implementations26 Sep 2021 Wei Shen, Yuhan Li, Yinan Liu, Jiawei Han, Jianyong Wang, Xiaojie Yuan

Entity linking (EL) is the process of linking entity mentions appearing in web text with their corresponding entities in a knowledge base.

Deep Learning Entity Linking +4

Learning Dual Dynamic Representations on Time-Sliced User-Item Interaction Graphs for Sequential Recommendation

1 code implementation24 Sep 2021 Zeyuan Chen, Wei zhang, Junchi Yan, Gang Wang, Jianyong Wang

Sequential Recommendation aims to recommend items that a target user will interact with in the near future based on the historically interacted items.

Representation Learning Sequential Recommendation

Toward Tweet Entity Linking with Heterogeneous Information Networks

1 code implementation IEEE Transactions on Knowledge and Data Engineering 2021 Wei Shen, Yuwei Yin, Yang Yang, Jiawei Han, Jianyong Wang, Xiaojie Yuan

The task of linking an entity mention in a tweet with its corresponding entity in a heterogeneous information network is of great importance, for the purpose of enriching heterogeneous information networks with the abundant and fresh knowledge embedded in tweets.

Entity Linking Metric Learning

Social Link Inference via Multi-View Matching Network from Spatio-Temporal Trajectories

no code implementations20 Mar 2021 Wei zhang, Xin Lai, Jianyong Wang

In this paper, we investigate the problem of social link inference in a target Location-aware Social Network (LSN), which aims at predicting the unobserved links between users within the network.

Link Prediction Time Series Analysis

Graph-Based Tri-Attention Network for Answer Ranking in CQA

no code implementations5 Mar 2021 Wei zhang, Zeyuan Chen, Chao Dong, Wen Wang, Hongyuan Zha, Jianyong Wang

However, they encounter two main limitations: (1) Correlations between answers in the same question are often overlooked.

Question Answering

RRL: A Scalable Classifier for Interpretable Rule-Based Representation Learning

no code implementations1 Jan 2021 Zhuo Wang, Wei zhang, Ning Liu, Jianyong Wang

Rule-based models, e. g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity.

Representation Learning

Transparent Classification with Multilayer Logical Perceptrons and Random Binarization

1 code implementation10 Dec 2019 Zhuo Wang, Wei zhang, Ning Liu, Jianyong Wang

In this paper, we propose a new hierarchical rule-based model for classification tasks, named Concept Rule Sets (CRS), which has both a strong expressive ability and a transparent inner structure.

Binarization Classification +1

Attentive Representation Learning with Adversarial Training for Short Text Clustering

no code implementations8 Dec 2019 Wei Zhang, Chao Dong, Jianhua Yin, Jianyong Wang

Relying on this, the representation learning and clustering for short texts are seamlessly integrated into a unified model.

Clustering Information Retrieval +3

Style Transfer as Unsupervised Machine Translation

no code implementations23 Aug 2018 Zhirui Zhang, Shuo Ren, Shujie Liu, Jianyong Wang, Peng Chen, Mu Li, Ming Zhou, Enhong Chen

Language style transferring rephrases text with specific stylistic attributes while preserving the original attribute-independent content.

Attribute NMT +4

A Dirichlet Multinomial Mixture Model-based Approach for Short Text Clustering

no code implementations1 Aug 2014 Jianhua Yin, Jianyong Wang

Short text clustering has become an increasingly important task with the popularity of social media like Twitter, Google+, and Facebook.

Clustering Short Text Clustering

Cannot find the paper you are looking for? You can Submit a new open access paper.