Search Results for author: Mengqi Zhang

Found 24 papers, 10 papers with code

Constrained Auto-Regressive Decoding Constrains Generative Retrieval

no code implementations14 Apr 2025 Shiguang Wu, Zhaochun Ren, Xin Xin, Jiyuan Yang, Mengqi Zhang, Zhumin Chen, Maarten de Rijke, Pengjie Ren

This paper aims to improve our theoretical understanding of the generalization capabilities of the auto-regressive decoding retrieval paradigm, laying a foundation for its limitations and inspiring future advancements toward more robust and generalizable generative retrieval.

Retrieval

UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets

no code implementations6 Mar 2025 Wenyu Wang, Mengqi Zhang, Xiaotian Ye, Zhaochun Ren, Zhumin Chen, Pengjie Ren

Large Language Models (LLMs) inevitably acquire harmful information during training on massive datasets.

Numerical solutions of fixed points in two-dimensional Kuramoto-Sivashinsky equation expedited by reinforcement learning

1 code implementation27 Dec 2024 Juncheng Jiang, Dongdong Wan, Mengqi Zhang

This paper presents a combined approach to enhancing the effectiveness of Jacobian-Free Newton-Krylov (JFNK) method by deep reinforcement learning (DRL) in identifying fixed points within the 2D Kuramoto-Sivashinsky Equation (KSE).

Deep Reinforcement Learning Navigate +1

ASLoRA: Adaptive Sharing Low-Rank Adaptation Across Layers

no code implementations13 Dec 2024 Junyan Hu, Xue Xiao, Mengqi Zhang, Yao Chen, Zhaochun Ren, Zhumin Chen, Pengjie Ren

As large language models (LLMs) grow in size, traditional full fine-tuning becomes increasingly impractical due to its high computational and storage costs.

parameter-efficient fine-tuning

Uncovering Overfitting in Large Language Model Editing

no code implementations10 Oct 2024 Mengqi Zhang, Xiaotian Ye, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen

Knowledge editing has been proposed as an effective method for updating and correcting the internal knowledge of Large Language Models (LLMs).

Attribute In-Context Learning +5

Impact of ChatGPT on the writing style of condensed matter physicists

no code implementations30 Aug 2024 Shaojun Xu, Xiaohui Ye, Mengqi Zhang, Pei Wang

We apply a state-of-the-art difference-in-differences approach to estimate the impact of ChatGPT's release on the writing style of condensed matter papers on arXiv.

Enhancing Multi-hop Reasoning through Knowledge Erasure in Large Language Model Editing

no code implementations22 Aug 2024 Mengqi Zhang, Bowen Fang, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen, Liang Wang

Building on the validated hypothesis, we propose a novel knowledge editing method that incorporates a Knowledge Erasure mechanism for Large language model Editing (KELE).

knowledge editing Language Modeling +2

Learning Domain-Invariant Features for Out-of-Context News Detection

no code implementations11 Jun 2024 Yimeng Gu, Mengqi Zhang, Ignacio Castro, Shu Wu, Gareth Tyson

In order to effectively adapt the detection model to unlabeled news topics or agencies, we propose ConDA-TTA (Contrastive Domain Adaptation with Test-Time Adaptation) which applies contrastive learning and maximum mean discrepancy (MMD) to learn domain-invariant features.

Contrastive Learning Misinformation +1

ExcluIR: Exclusionary Neural Information Retrieval

1 code implementation26 Apr 2024 WenHao Zhang, Mengqi Zhang, Shiguang Wu, Jiahuan Pei, Zhaochun Ren, Maarten de Rijke, Zhumin Chen, Pengjie Ren

However, in information retrieval community, there is little research on exclusionary retrieval, where users express what they do not want in their queries.

Information Retrieval Retrieval

Generative Retrieval as Multi-Vector Dense Retrieval

1 code implementation31 Mar 2024 Shiguang Wu, Wenda Wei, Mengqi Zhang, Zhumin Chen, Jun Ma, Zhaochun Ren, Maarten de Rijke, Pengjie Ren

Both methods compute relevance as a sum of products of query and document vectors and an alignment matrix.

Decoder Retrieval

Uncovering Selective State Space Model's Capabilities in Lifelong Sequential Recommendation

1 code implementation25 Mar 2024 Jiyuan Yang, Yuanzi Li, Jingyu Zhao, Hanbing Wang, Muyang Ma, Jun Ma, Zhaochun Ren, Mengqi Zhang, Xin Xin, Zhumin Chen, Pengjie Ren

We conduct extensive experiments to evaluate the performance of representative sequential recommendation models in the setting of lifelong sequences.

2k Mamba +1

MELoRA: Mini-Ensemble Low-Rank Adapters for Parameter-Efficient Fine-Tuning

1 code implementation27 Feb 2024 Pengjie Ren, Chengshun Shi, Shiguang Wu, Mengqi Zhang, Zhaochun Ren, Maarten de Rijke, Zhumin Chen, Jiahuan Pei

Parameter-efficient fine-tuning (PEFT) is a popular method for tailoring pre-trained large language models (LLMs), especially as the models' scale and the diversity of tasks increase.

Diversity Instruction Following +2

Knowledge Graph Enhanced Large Language Model Editing

no code implementations21 Feb 2024 Mengqi Zhang, Xiaotian Ye, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen

Large language models (LLMs) are pivotal in advancing natural language processing (NLP) tasks, yet their efficacy is hampered by inaccuracies and outdated knowledge.

Knowledge Graphs Language Modeling +3

Self-Supervised Position Debiasing for Large Language Models

no code implementations2 Jan 2024 Zhongkun Liu, Zheng Chen, Mengqi Zhang, Zhaochun Ren, Pengjie Ren, Zhumin Chen

Existing debiasing methods for LLMs require external bias knowledge or annotated non-biased samples, which is lacking for position debiasing and impractical in reality.

Position

Patched Denoising Diffusion Models For High-Resolution Image Synthesis

1 code implementation2 Aug 2023 Zheng Ding, Mengqi Zhang, Jiajun Wu, Zhuowen Tu

Feature collage systematically crops and combines partial features of the neighboring patches to predict the features of a shifted image patch, allowing the seamless generation of the entire image due to the overlap in the patch feature space.

Denoising Image Generation

MetaTKG: Learning Evolutionary Meta-Knowledge for Temporal Knowledge Graph Reasoning

no code implementations2 Feb 2023 Yuwei Xia, Mengqi Zhang, Qiang Liu, Shu Wu, Xiao-Yu Zhang

Most existing works focus on exploring evolutionary information in history to obtain effective temporal embeddings for entities and relations, but they ignore the variation in evolution patterns of facts, which makes them struggle to adapt to future data with different evolution patterns.

Knowledge Graphs Meta-Learning

FF2: A Feature Fusion Two-Stream Framework for Punctuation Restoration

no code implementations9 Nov 2022 Yangjun Wu, Kebin Fang, Yao Zhao, Hao Zhang, Lifeng Shi, Mengqi Zhang

To accomplish punctuation restoration, most existing methods focus on introducing extra information (e. g., part-of-speech) or addressing the class imbalance problem.

Language Modeling Language Modelling +2

Latent Structure Mining with Contrastive Modality Fusion for Multimedia Recommendation

1 code implementation1 Nov 2021 Jinghao Zhang, Yanqiao Zhu, Qiang Liu, Mengqi Zhang, Shu Wu, Liang Wang

Although having access to multiple modalities might allow us to capture rich information, we argue that the simple coarse-grained fusion by linear combination or concatenation in previous work is insufficient to fully understand content information and item relationships. To this end, we propose a latent structure MIning with ContRastive mOdality fusion method (MICRO for brevity).

Collaborative Filtering Multimedia recommendation

Deep Contrastive Multiview Network Embedding

no code implementations16 Aug 2021 Mengqi Zhang, Yanqiao Zhu, Qiang Liu, Shu Wu, Liang Wang

In our work, different views can be obtained based on the various relations among nodes.

Attribute Contrastive Learning +2

Dynamic Graph Neural Networks for Sequential Recommendation

1 code implementation15 Apr 2021 Mengqi Zhang, Shu Wu, Xueli Yu, Qiang Liu, Liang Wang

We propose a new method named Dynamic Graph Neural Network for Sequential Recommendation (DGSR), which connects different user sequences through a dynamic graph structure, exploring the interactive behavior of users and items with time and order information.

Graph Attention Graph Neural Network +2

Dynamic Graph Collaborative Filtering

1 code implementation8 Jan 2021 Xiaohan Li, Mengqi Zhang, Shu Wu, Zheng Liu, Liang Wang, Philip S. Yu

Here we propose Dynamic Graph Collaborative Filtering (DGCF), a novel framework leveraging dynamic graphs to capture collaborative and sequential relations of both items and users at the same time.

Collaborative Filtering Recommendation Systems

Personalized Graph Neural Networks with Attention Mechanism for Session-Aware Recommendation

3 code implementations20 Oct 2019 Mengqi Zhang, Shu Wu, Meng Gao, Xin Jiang, Ke Xu, Liang Wang

The other is Dot-Product Attention mechanism, which draws on the Transformer net to explicitly model the effect of historical sessions on the current session.

Graph Neural Network Machine Translation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.