Search Results for author: Xiao Yu

Found 25 papers, 11 papers with code

Entity Attribute Relation Extraction with Attribute-Aware Embeddings

no code implementations EMNLP (DeeLIO) 2020 Dan Iter, Xiao Yu, Fangtao Li

Entity-attribute relations are a fundamental component for building large-scale knowledge bases, which are widely employed in modern search engines.

Attribute Attribute Extraction +2

Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation

no code implementations14 Jun 2024 Nameer Hirschkind, Xiao Yu, Mahesh Kumar Nandwana, Joseph Liu, Eloi DuBois, Dao Le, Nicolas Thiebaut, Colin Sinclair, Kyle Spence, Charles Shang, Zoe Abrams, Morgan McGuire

We introduce DiffuseST, a low-latency, direct speech-to-speech translation system capable of preserving the input speaker's voice zero-shot while translating from multiple source languages into English.

Speech-to-Speech Translation Translation

LocalRQA: From Generating Data to Locally Training, Testing, and Deploying Retrieval-Augmented QA Systems

1 code implementation1 Mar 2024 Xiao Yu, Yunan Lu, Zhou Yu

Retrieval-augmented question-answering systems combine retrieval techniques with large language models to provide answers that are more accurate and informative.

Question Answering Retrieval

Formal Synthesis of Controllers for Safety-Critical Autonomous Systems: Developments and Challenges

no code implementations20 Feb 2024 Xiang Yin, Bingzhao Gao, Xiao Yu

This paper provides a comprehensive review of formal controller synthesis techniques for safety-critical autonomous systems.

ConFit: Improving Resume-Job Matching using Data Augmentation and Contrastive Learning

no code implementations29 Jan 2024 Xiao Yu, Jinzhong Zhang, Zhou Yu

A reliable resume-job matching system helps a company find suitable candidates from a pool of resumes, and helps a job seeker find relevant jobs from a list of job posts.

Contrastive Learning Data Augmentation

Teaching Language Models to Self-Improve through Interactive Demonstrations

1 code implementation20 Oct 2023 Xiao Yu, Baolin Peng, Michel Galley, Jianfeng Gao, Zhou Yu

The self-improving ability of large language models (LLMs), enabled by prompting them to analyze and revise their own outputs, has garnered significant interest in recent research.

Math

Distantly-Supervised Joint Extraction with Noise-Robust Learning

1 code implementation8 Oct 2023 Yufei Li, Xiao Yu, Yanghong Guo, Yanchi Liu, Haifeng Chen, Cong Liu

Joint entity and relation extraction is a process that identifies entity pairs and their relations using a single model.

Joint Entity and Relation Extraction Language Modelling +2

Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning

1 code implementation23 May 2023 Xiao Yu, Maximillian Chen, Zhou Yu

Planning for goal-oriented dialogue often requires simulating future dialogue interactions and estimating task progress.

Language Modelling Large Language Model

DPIC: Decoupling Prompt and Intrinsic Characteristics for LLM Generated Text Detection

no code implementations21 May 2023 Xiao Yu, Yuang Qi, Kejiang Chen, Guoqiang Chen, Xi Yang, Pengyuan Zhu, Xiuwei Shang, Weiming Zhang, Nenghai Yu

Then, the similarity between the candidate text and the regenerated text is used as a detection feature, thus eliminating the prompt in the detection process, which allows the detector to focus on the intrinsic characteristics of the generative model.

Language Modelling Large Language Model +2

Controllable Mixed-Initiative Dialogue Generation through Prompting

1 code implementation6 May 2023 Maximillian Chen, Xiao Yu, Weiyan Shi, Urvi Awasthi, Zhou Yu

The standard approach has been fine-tuning pre-trained language models to perform generation conditioned on these intents.

Dialogue Generation

Uncertainty-Aware Bootstrap Learning for Joint Extraction on Distantly-Supervised Data

1 code implementation5 May 2023 Yufei Li, Xiao Yu, Yanchi Liu, Haifeng Chen, Cong Liu

To mitigate such impact, we propose uncertainty-aware bootstrap learning, which is motivated by the intuition that the higher uncertainty of an instance, the more likely the model confidence is inconsistent with the ground truths.

Relation Extraction

KRLS: Improving End-to-End Response Generation in Task Oriented Dialog with Reinforced Keywords Learning

1 code implementation30 Nov 2022 Xiao Yu, Qingyang Wu, Kun Qian, Zhou Yu

In task-oriented dialogs (TOD), reinforcement learning (RL) algorithms train a model to directly optimize response for task-related metrics.

Language Modelling reinforcement-learning +2

Improving Model Training via Self-learned Label Representations

1 code implementation9 Sep 2022 Xiao Yu, Nakul Verma

In this work, we demonstrate that more sophisticated label representations are better for classification than the usual one-hot encoding.

Classification

Diverse Title Generation for Stack Overflow Posts with Multiple Sampling Enhanced Transformer

1 code implementation24 Aug 2022 Fengji Zhang, Jin Liu, Yao Wan, Xiao Yu, Xiao Liu, Jacky Keung

Stack Overflow is one of the most popular programming communities where developers can seek help for their encountered problems.

A recipe of training neural network-based LDPC decoders

no code implementations1 May 2022 Guangwen Li, Xiao Yu

It is known belief propagation decoding variants of LDPC codes can be unrolled easily as neural networks after assigning differed weights to message passing edges flexibly.

FastKASSIM: A Fast Tree Kernel-Based Syntactic Similarity Metric

1 code implementation15 Mar 2022 Maximillian Chen, Caitlyn Chen, Xiao Yu, Zhou Yu

Syntax is a fundamental component of language, yet few metrics have been employed to capture syntactic similarity or coherence at the utterance- and document-level.

Authorship Attribution

Improving Stack Overflow question title generation with copying enhanced CodeBERT model and bi-modal information

1 code implementation27 Sep 2021 Fengji Zhang, Xiao Yu, Jacky Keung, Fuyang Li, Zhiwen Xie, Zhen Yang, Caoyuan Ma, Zhimin Zhang

However, only using the code snippets in the question body cannot provide sufficient information for title generation, and LSTMs cannot capture the long-range dependencies between tokens.

Decoder

Asymptotic spreading of KPP reactive fronts in heterogeneous shifting environments

no code implementations17 Jan 2021 King-Yeung Lam, Xiao Yu

We study the asymptotic spreading of Kolmogorov-Petrovsky-Piskunov (KPP) fronts in heterogeneous shifting habitats, with any number of shifting speeds, by further developing the method based on the theory of viscosity solutions of Hamilton-Jacobi equations.

Analysis of PDEs 35B40, 35K57, 35R10, 35D40

Adaptive Transfer Learning of Multi-View Time Series Classification

no code implementations14 Oct 2019 Donglin Zhan, Shiyu Yi, Dongli Xu, Xiao Yu, Denglin Jiang, Siqi Yu, Haoting Zhang, Wenfang Shangguan, Weihua Zhang

In this paper, we first proposed a general adaptive transfer learning framework for multi-view time series data, which shows strong ability in storing inter-view importance value in the process of knowledge transfer.

Classification Density Estimation +5

Learning to Generate Posters of Scientific Papers by Probabilistic Graphical Models

no code implementations21 Feb 2017 Yu-ting Qiang, Yanwei Fu, Xiao Yu, Yanwen Guo, Zhi-Hua Zhou, Leonid Sigal

In order to bridge the gap between panel attributes and the composition within each panel, we also propose a recursive page splitting algorithm to generate the panel layout for a poster.

Cannot find the paper you are looking for? You can Submit a new open access paper.