1 code implementation • 26 Aug 2024 • Cong Xu, Zhangchi Zhu, Mo Yu, Jun Wang, Jianyong Wang, Wei zhang
Some studies have observed that LLMs, when fine-tuned by the cross-entropy (CE) loss with a full softmax, could achieve `state-of-the-art' performance in sequential recommendation.
no code implementations • 21 Aug 2024 • Zhenyu Li, Yike Zhang, Tengyu Pan, Yutao Sun, Zhichao Duan, Junjie Fang, Rong Han, Zixuan Wang, Jianyong Wang
Empowering LLMs with the ability to utilize useful information from a long context is crucial for many downstream applications.
1 code implementation • 8 May 2024 • Yutao Sun, Li Dong, Yi Zhu, Shaohan Huang, Wenhui Wang, Shuming Ma, Quanlu Zhang, Jianyong Wang, Furu Wei
We introduce a decoder-decoder architecture, YOCO, for large language models, which only caches key-value pairs once.
no code implementations • 9 Feb 2024 • Cong Xu, Zhangchi Zhu, Jun Wang, Jianyong Wang, Wei zhang
Large language models (LLMs) have gained much attention in the recommendation community; some studies have observed that LLMs, fine-tuned by the cross-entropy loss with a full softmax, could achieve state-of-the-art performance already.
1 code implementation • 1 Dec 2023 • Jiajun Cui, Minghe Yu, Bo Jiang, Aimin Zhou, Jianyong Wang, Wei zhang
Knowledge tracing (KT) plays a crucial role in computer-aided education and intelligent tutoring systems, aiming to assess students' knowledge proficiency by predicting their future performance on new questions based on their past response records.
1 code implementation • 22 Oct 2023 • Zhuo Wang, Wei zhang, Ning Liu, Jianyong Wang
Rule-based models, e. g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity.
1 code implementation • 24 Sep 2023 • Cong Xu, Jun Wang, Jianyong Wang, Wei zhang
Embedding plays a key role in modern recommender systems because they are virtual representations of real-world entities and the foundation for subsequent decision-making models.
1 code implementation • 23 Aug 2023 • Zhenyu Li, Sunqi Fan, Yu Gu, Xiuxing Li, Zhichao Duan, Bowen Dong, Ning Liu, Jianyong Wang
Knowledge base question answering (KBQA) is a critical yet challenging task due to the vast number of entities within knowledge bases and the diversity of natural language questions posed by users.
1 code implementation • 2 Aug 2023 • Quanxiu Wang, Xinlei Cao, Jianyong Wang, Wei zhang
For the first issue, to utilize rich knowledge, KCF-PLM develops a transformer network to model the interactions of the extracted aspects w. r. t.
9 code implementations • 17 Jul 2023 • Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, Furu Wei
In this work, we propose Retentive Network (RetNet) as a foundation architecture for large language models, simultaneously achieving training parallelism, low-cost inference, and good performance.
no code implementations • 2 Jun 2023 • Zhuo Wang, Rongzhen Li, Bowen Dong, Jie Wang, Xiuxing Li, Ning Liu, Chenhui Mao, Wei zhang, Liling Dong, Jing Gao, Jianyong Wang
In this paper, we explore the potential of LLMs such as GPT-4 to outperform traditional AI tools in dementia diagnosis.
no code implementations • 6 Apr 2023 • Zhichao Duan, Xiuxing Li, Zhengyan Zhang, Zhenyu Li, Ning Liu, Jianyong Wang
As a popular topic in natural language processing tasks, extractive question answering task (extractive QA) has gained extensive attention in the past few years.
2 code implementations • 20 Dec 2022 • Zhenyu Li, Xiuxing Li, Sunqi Fan, Jianyong Wang
However, constructing labeled data for complex reasoning tasks is labor intensive, and the quantity of annotated data remains insufficient to support the intricate demands of real-world applications.
no code implementations • Proceedings of the 2021 International Conference on Management of Data 2021 • Yinan Liu, Wei Shen, Yuanfei Wang, Jianyong Wang, Zhenglu Yang, Xiaojie Yuan
However, noun phrases (NPs) and relation phrases (RPs) in OKBs are not canonicalized and often appear in different paraphrased textual variants, which leads to redundant and ambiguous facts.
no code implementations • 10 Nov 2022 • Zhichao Duan, Xiuxing Li, Zhenyu Li, Zhuo Wang, Jianyong Wang
Document-level relation extraction (DocRE) aims to identify semantic labels among entities within a single document.
1 code implementation • 8 Aug 2022 • Chenwei Ran, Wei Shen, Jianbo Gao, Yuhan Li, Jianyong Wang, Yantao Jia
Entity linking (EL) is the process of linking entity mentions appearing in text with their corresponding entities in a knowledge base.
1 code implementation • 12 Jul 2022 • Xiuxing Li, Zhenyu Li, Zhengyan Zhang, Ning Liu, Haitao Yuan, Wei zhang, Zhiyuan Liu, Jianyong Wang
In this paper, we endeavor to solve the problem of few-shot entity linking, which only requires a minimal amount of in-domain labeled data and is more practical in real situations.
1 code implementation • Findings (ACL) 2022 • Yuan YAO, Bowen Dong, Ao Zhang, Zhengyan Zhang, Ruobing Xie, Zhiyuan Liu, Leyu Lin, Maosong Sun, Jianyong Wang
Recent works have shown promising results of prompt tuning in stimulating pre-trained language models (PLMs) for natural language processing (NLP) tasks.
no code implementations • 31 Dec 2021 • Yuang Liu, Wei zhang, Jun Wang, Jianyong Wang
In this paper, we provide a comprehensive survey on data-free knowledge transfer from the perspectives of knowledge distillation and unsupervised domain adaptation, to help readers have a better understanding of the current research status and ideas.
2 code implementations • NeurIPS 2021 • Zhuo Wang, Wei zhang, Ning Liu, Jianyong Wang
Rule-based models, e. g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity.
no code implementations • 26 Sep 2021 • Wei Shen, Yuhan Li, Yinan Liu, Jiawei Han, Jianyong Wang, Xiaojie Yuan
Entity linking (EL) is the process of linking entity mentions appearing in web text with their corresponding entities in a knowledge base.
1 code implementation • 24 Sep 2021 • Zeyuan Chen, Wei zhang, Junchi Yan, Gang Wang, Jianyong Wang
Sequential Recommendation aims to recommend items that a target user will interact with in the near future based on the historically interacted items.
1 code implementation • IEEE Transactions on Knowledge and Data Engineering 2021 • Wei Shen, Yuwei Yin, Yang Yang, Jiawei Han, Jianyong Wang, Xiaojie Yuan
The task of linking an entity mention in a tweet with its corresponding entity in a heterogeneous information network is of great importance, for the purpose of enriching heterogeneous information networks with the abundant and fresh knowledge embedded in tweets.
no code implementations • 20 Mar 2021 • Wei zhang, Xin Lai, Jianyong Wang
In this paper, we investigate the problem of social link inference in a target Location-aware Social Network (LSN), which aims at predicting the unobserved links between users within the network.
no code implementations • 5 Mar 2021 • Wei zhang, Zeyuan Chen, Chao Dong, Wen Wang, Hongyuan Zha, Jianyong Wang
However, they encounter two main limitations: (1) Correlations between answers in the same question are often overlooked.
no code implementations • 1 Jan 2021 • Zhuo Wang, Wei zhang, Ning Liu, Jianyong Wang
Rule-based models, e. g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity.
1 code implementation • 10 Dec 2019 • Zhuo Wang, Wei zhang, Ning Liu, Jianyong Wang
In this paper, we propose a new hierarchical rule-based model for classification tasks, named Concept Rule Sets (CRS), which has both a strong expressive ability and a transparent inner structure.
no code implementations • 8 Dec 2019 • Wei Zhang, Chao Dong, Jianhua Yin, Jianyong Wang
Relying on this, the representation learning and clustering for short texts are seamlessly integrated into a unified model.
no code implementations • 2019 IEEE 35th International Conference on Data Engineering (ICDE) 2019 • Ning Liu, Pan Lu, Wei zhang, Jianyong Wang
To address the above issues, we propose novel Knowledge-aware Deep Dual Networks (K-DDN) for the text-based mortality prediction task.
no code implementations • Advances in Knowledge Discovery and Data Mining (PAKDD 2019) 2019 • Yuanquan Lu, Wei zhang, Pan Lu, Jianyong Wang
Nowadays, the online interactions between users and items become diverse, and may include textual reviews as well as numerical ratings.
no code implementations • 23 Aug 2018 • Zhirui Zhang, Shuo Ren, Shujie Liu, Jianyong Wang, Peng Chen, Mu Li, Ming Zhou, Enhong Chen
Language style transferring rephrases text with specific stylistic attributes while preserving the original attribute-independent content.
Ranked #3 on Unsupervised Text Style Transfer on GYAFC
1 code implementation • 24 May 2018 • Pan Lu, Lei Ji, Wei zhang, Nan Duan, Ming Zhou, Jianyong Wang
To better utilize semantic knowledge in images, we propose a novel framework to learn visual relation facts for VQA.
1 code implementation • 18 Nov 2017 • Pan Lu, Hongsheng Li, Wei zhang, Jianyong Wang, Xiaogang Wang
Existing VQA methods mainly adopt the visual attention mechanism to associate the input question with corresponding image regions for effective question answering.
no code implementations • 1 Aug 2014 • Jianhua Yin, Jianyong Wang
Short text clustering has become an increasingly important task with the popularity of social media like Twitter, Google+, and Facebook.