Search Results for author: Rui Meng

Found 41 papers, 18 papers with code

Investigating Factuality in Long-Form Text Generation: The Roles of Self-Known and Self-Unknown

no code implementations24 Nov 2024 Lifu Tu, Rui Meng, Shafiq Joty, Yingbo Zhou, Semih Yavuz

Interestingly, even without significant changes in the models' self-judgment (Self-Known and Self-Unknown), the number of unsupported claims can increases, likely as an artifact of long-form generation.

Text Generation

CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval

no code implementations19 Nov 2024 Ye Liu, Rui Meng, Shafiq Joty, Silvio Savarese, Caiming Xiong, Yingbo Zhou, Semih Yavuz

This gap leaves existing models unable to effectively capture the diversity of programming languages and tasks across different domains, highlighting the need for more focused research in code retrieval.

Diversity Natural Language Queries +2

TDGCN-Based Mobile Multiuser Physical-Layer Authentication for EI-Enabled IIoT

no code implementations13 Nov 2024 Rui Meng, Hangyu Zhao, Bingxuan Xu, Yining Wang, Xiaodong Xu, Suyu Lv, Xiaofeng Tao, Ping Zhang

Physical-Layer Authentication (PLA) offers endogenous security, lightweight implementation, and high reliability, making it a promising complement to upper-layer security methods in Edge Intelligence (EI)-empowered Industrial Internet of Things (IIoT).

Node Clustering

Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models

1 code implementation31 Oct 2024 Jianqun Zhou, Yuanlei Zheng, Wei Chen, Qianqian Zheng, Zeyuan Shang, Wei zhang, Rui Meng, Xiaoyu Shen

Instruction-following capabilities in large language models (LLMs) have significantly progressed, enabling more complex user interactions through detailed prompts.

Instruction Following Retrieval

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks

no code implementations7 Oct 2024 Ziyan Jiang, Rui Meng, Xinyi Yang, Semih Yavuz, Yingbo Zhou, Wenhu Chen

Embedding models have been crucial in enabling various downstream tasks such as semantic similarity, information retrieval, and clustering.

Information Retrieval Language Modelling +6

Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review

no code implementations4 Oct 2024 Zhuochun Li, Yuelyu Ji, Rui Meng, Daqing He

While reasoning capabilities typically emerge in large language models (LLMs) with tens of billions of parameters, recent research focuses on improving smaller open-source models through knowledge distillation (KD) from commercial LLMs.

Knowledge Distillation Logical Reasoning

Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations

no code implementations10 Jun 2024 Yujee Song, DongHyun Lee, Rui Meng, Won Hwa Kim

While most previous studies focus on the inter-event dependencies and their representations, how individual events influence the overall dynamics over time has been under-explored.

Density Estimation

RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts

1 code implementation21 May 2024 Yuelyu Ji, Zhuochun Li, Rui Meng, Sonish Sivarajkumar, Yanshan Wang, Zeshui Yu, Hui Ji, Yushui Han, Hanyu Zeng, Daqing He

This paper introduces the RAG-RLRC-LaySum framework, designed to make complex biomedical research understandable to laymen through advanced Natural Language Processing (NLP) techniques.

RAG Retrieval

Investigating Answerability of LLMs for Long-Form Question Answering

no code implementations15 Sep 2023 Meghana Moorthy Bhat, Rui Meng, Ye Liu, Yingbo Zhou, Semih Yavuz

As we embark on a new era of LLMs, it becomes increasingly crucial to understand their capabilities, limitations, and differences.

Long Form Question Answering Question Generation +1

XGen-7B Technical Report

1 code implementation7 Sep 2023 Erik Nijkamp, Tian Xie, Hiroaki Hayashi, Bo Pang, Congying Xia, Chen Xing, Jesse Vig, Semih Yavuz, Philippe Laban, Ben Krause, Senthil Purushwalkam, Tong Niu, Wojciech Kryściński, Lidiya Murakhovs'ka, Prafulla Kumar Choubey, Alex Fabbri, Ye Liu, Rui Meng, Lifu Tu, Meghana Bhat, Chien-Sheng Wu, Silvio Savarese, Yingbo Zhou, Shafiq Joty, Caiming Xiong

Most open-source LLMs, on the other hand, are limited in their ability to support longer sequence lengths, which is a key requirement for many tasks that require inference over an input context.

2k 8k

Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System

no code implementations16 Aug 2023 JianGuo Zhang, Stephen Roller, Kun Qian, Zhiwei Liu, Rui Meng, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong

End-to-end task-oriented dialogue (TOD) systems have achieved promising performance by leveraging sophisticated natural language understanding and natural language generation capabilities of pre-trained models.

Natural Language Understanding Retrieval +1

Efficient Gaussian Process Classification-based Physical-Layer Authentication with Configurable Fingerprints for 6G-Enabled IoT

no code implementations23 Jul 2023 Rui Meng, Fangzhou Zhu, Xiaodong Xu, Bizhu Wang, Bingxuan Xu, Ping Zhang

Furthermore, our proposed fingerprint selection algorithms achieve the reduction in the authentication error rate of up to 86. 93% compared to baseline active learning algorithms.

Active Learning Gaussian Processes

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI

1 code implementation19 Jul 2023 JianGuo Zhang, Kun Qian, Zhiwei Liu, Shelby Heinecke, Rui Meng, Ye Liu, Zhou Yu, Huan Wang, Silvio Savarese, Caiming Xiong

Despite advancements in conversational AI, language models encounter challenges to handle diverse conversational tasks, and existing dialogue dataset collections often lack diversity and comprehensiveness.

Conversational Recommendation Diversity +3

HPE:Answering Complex Questions over Text by Hybrid Question Parsing and Execution

no code implementations12 May 2023 Ye Liu, Semih Yavuz, Rui Meng, Dragomir Radev, Caiming Xiong, Yingbo Zhou

It comprises two central pillars: (1) We parse the question of varying complexity into an intermediate representation, named H-expression, which is composed of simple questions as the primitives and symbolic operations representing the relationships among them; (2) To execute the resulting H-expressions, we design a hybrid executor, which integrates the deterministic rules to translate the symbolic operations with a drop-in neural reader network to answer each decomposed simple question.

Knowledge Graphs Question Answering +1

Latent Semantic Diffusion-based Channel Adaptive De-Noising SemCom for Future 6G Systems

no code implementations19 Apr 2023 Bingxuan Xu, Rui Meng, Yue Chen, Xiaodong Xu, Chen Dong, Hao Sun

Upon the designed DNSC architecture, we further combine adversarial learning, variational autoencoder, and diffusion model to propose the Latent Diffusion DNSC (Latent-Diff DNSC) scheme to realize intelligent online de-noising.

Semantic Communication SSIM

Uni-Parser: Unified Semantic Parser for Question Answering on Knowledge Base and Database

no code implementations9 Nov 2022 Ye Liu, Semih Yavuz, Rui Meng, Dragomir Radev, Caiming Xiong, Yingbo Zhou

Parsing natural language questions into executable logical forms is a useful and interpretable way to perform question answering on structured data such as knowledge bases (KB) or databases (DB).

Question Answering Semantic Parsing

General-to-Specific Transfer Labeling for Domain Adaptable Keyphrase Generation

1 code implementation20 Aug 2022 Rui Meng, Tong Wang, Xingdi Yuan, Yingbo Zhou, Daqing He

Finally, we fine-tune the model with limited data with true labels to fully adapt it to the target domain.

Keyphrase Generation

Compressed Predictive Information Coding

no code implementations3 Mar 2022 Rui Meng, Tianyi Luo, Kristofer Bouchard

The key insight of our framework is to learn representations by minimizing the compression complexity and maximizing the predictive information in latent space.

Mutual Information Estimation

Bayesian Inference in High-Dimensional Time-Serieswith the Orthogonal Stochastic Linear Mixing Model

no code implementations25 Jun 2021 Rui Meng, Kristofer Bouchard

Stochastic linear mixing models (SLMM) assume the mixture coefficients depend on input, making them more flexible and effective to capture complex output dependence.

Bayesian Inference Gaussian Processes +2

Stochastic Collapsed Variational Inference for Structured Gaussian Process Regression Network

no code implementations1 Jun 2021 Rui Meng, Herbie Lee, Kristofer Bouchard

This paper presents an efficient variational inference framework for deriving a family of structured gaussian process regression network (SGPRN) models.

Imputation Missing Values +3

Unsupervised Deep Keyphrase Generation

1 code implementation18 Apr 2021 Xianjie Shen, Yinghan Wang, Rui Meng, Jingbo Shang

Keyphrase generation aims to summarize long documents with a collection of salient phrases.

Keyphrase Generation

Predicting User Engagement Status for Online Evaluation of Intelligent Assistants

no code implementations1 Oct 2020 Rui Meng, Zhen Yue, Alyssa Glass

Therefore, we consider predicting user engagement status as the very first and critical step to online evaluation for intelligent assistants.

Recommendation Systems

An Empirical Study on Neural Keyphrase Generation

1 code implementation NAACL 2021 Rui Meng, Xingdi Yuan, Tong Wang, Sanqiang Zhao, Adam Trischler, Daqing He

Recent years have seen a flourishing of neural keyphrase generation (KPG) works, including the release of several large-scale datasets and a host of new models to tackle them.

Keyphrase Generation

Spatiotemporal Attention for Multivariate Time Series Prediction and Interpretation

2 code implementations11 Aug 2020 Tryambak Gangopadhyay, Sin Yong Tan, Zhanhong Jiang, Rui Meng, Soumik Sarkar

Accurate interpretation of such prediction outcomes from a machine learning model that explicitly captures temporal correlations can significantly benefit the domain experts.

BIG-bench Machine Learning Time Series +1

Nonstationary Multivariate Gaussian Processes for Electronic Health Records

no code implementations13 Oct 2019 Rui Meng, Braden Soper, Herbert Lee, Vincent X. Liu, John D. Greene, Priyadip Ray

We propose multivariate nonstationary Gaussian processes for jointly modeling multiple clinical variables, where the key parameters, length-scales, standard deviations and the correlations between the observed output, are all time dependent.

Gaussian Processes

Hierarchical Hidden Markov Jump Processes for Cancer Screening Modeling

no code implementations13 Oct 2019 Rui Meng, Soper Braden, Jan Nygard, Mari Nygrad, Herbert Lee

In this paper, we propose a piece-wise stationary transition matrix to explain the heterogeneity in time.

Regularized Sparse Gaussian Processes

no code implementations13 Oct 2019 Rui Meng, Herbert Lee, Soper Braden, Priyadip Ray

An issue faced by SGP, especially in latent variable models, is the inefficient learning of the inducing inputs, which leads to poor model prediction.

Facial Expression Recognition (FER) Gaussian Processes +3

Does Order Matter? An Empirical Study on Generating Multiple Keyphrases as a Sequence

1 code implementation9 Sep 2019 Rui Meng, Xingdi Yuan, Tong Wang, Peter Brusilovsky, Adam Trischler, Daqing He

Recently, concatenating multiple keyphrases as a target sequence has been proposed as a new learning paradigm for keyphrase generation.

Keyphrase Generation

Subjective Knowledge Acquisition and Enrichment Powered By Crowdsourcing

no code implementations16 May 2017 Rui Meng, Hao Xin, Lei Chen, Yangqiu Song

In our work, we propose a system, called crowdsourced subjective knowledge acquisition (CoSKA), for subjective knowledge acquisition powered by crowdsourcing and existing KBs.

Deep Keyphrase Generation

4 code implementations ACL 2017 Rui Meng, Sanqiang Zhao, Shuguang Han, Daqing He, Peter Brusilovsky, Yu Chi

Keyphrase provides highly-condensed information that can be effectively used for understanding, organizing and retrieving text content.

Decoder Keyphrase Extraction +1

Cannot find the paper you are looking for? You can Submit a new open access paper.