Search Results for author: Haitao Mi

Found 33 papers, 8 papers with code

Semi-supervised Clustering for Short Text via Deep Representation Learning

no code implementations CoNLL 2016 Zhiguo Wang, Haitao Mi, Abraham Ittycheriah

In this work, we propose a semi-supervised method for short text clustering, where we represent texts as distributed vectors with neural networks, and use a small amount of labeled data to specify our intention for clustering.

Clustering · Representation Learning +1
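A minimal sketch of the seeding idea above, not the paper's method: a few labeled texts pin down the intended clusters, and k-means handles the unlabeled rest. Random vectors stand in for the neural text embeddings.

```python
import numpy as np

def seeded_kmeans(X, labeled_idx, labels, n_iter=20):
    k = len(set(labels))
    # Seed each centroid with the mean of its labeled examples.
    centroids = np.stack([
        X[[i for i, y in zip(labeled_idx, labels) if y == c]].mean(axis=0)
        for c in range(k)
    ])
    for _ in range(n_iter):
        # Assign every vector to its nearest centroid ...
        assign = np.argmin(((X[:, None] - centroids[None]) ** 2).sum(-1), axis=1)
        # ... but keep labeled points pinned to their given clusters.
        assign[labeled_idx] = labels
        centroids = np.stack([X[assign == c].mean(axis=0) for c in range(k)])
    return assign

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 32))                 # stand-in text embeddings
print(seeded_kmeans(X, [0, 1, 2], [0, 1, 2])[:10])
```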

Sentence Similarity Learning by Lexical Decomposition and Composition

1 code implementation COLING 2016 Zhiguo Wang, Haitao Mi, Abraham Ittycheriah

Most conventional sentence similarity methods focus only on the similar parts of two input sentences and simply ignore the dissimilar parts, which often provide useful clues about the sentences' semantics.

Paraphrase Identification · Question Answering +2

Vocabulary Manipulation for Neural Machine Translation

no code implementations ACL 2016 Haitao Mi, Zhiguo Wang, Abe Ittycheriah

Our method simply takes into account the translation options of each word or phrase in the source sentence, and picks a very small target vocabulary for each sentence based on a word-to-word translation model or a bilingual phrase library learned from a traditional machine translation model.

Machine Translation · Sentence +2
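The per-sentence vocabulary selection described above can be pictured with a toy lexical table; the table contents and the `frequent` list below are illustrative stand-ins, not the paper's resources.

```python
# Toy word-to-word lexical table, e.g. as learned by a word aligner.
lex_table = {
    "haus": ["house", "home"],
    "klein": ["small", "little"],
}
frequent = ["the", "a", "is", "<eos>"]   # always-included common target words

def sentence_vocab(src_tokens, topk=2):
    vocab = set(frequent)
    for w in src_tokens:
        # Add the top translation candidates of each source word.
        vocab.update(lex_table.get(w, [])[:topk])
    return sorted(vocab)

# The decoder's softmax is then computed over a handful of words
# instead of the full (tens of thousands strong) target vocabulary.
print(sentence_vocab(["das", "haus", "ist", "klein"]))
```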

Coverage Embedding Models for Neural Machine Translation

no code implementations EMNLP 2016 Haitao Mi, Baskaran Sankaran, Zhiguo Wang, Abe Ittycheriah

In this paper, we enhance the attention-based neural machine translation (NMT) by adding explicit coverage embedding models to alleviate issues of repeating and dropping translations in NMT.

Machine Translation · NMT +1
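A simplified sketch of the coverage intuition, using an accumulated soft coverage vector rather than the paper's learned coverage embeddings; the scores are random stand-ins for a trained attention scorer.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

rng = np.random.default_rng(0)
src_len, steps, penalty = 6, 4, 2.0
coverage = np.zeros(src_len)           # attention mass received so far
for t in range(steps):
    scores = rng.normal(size=src_len)  # stand-in for learned attention scores
    # Source positions already covered get their scores pushed down,
    # discouraging repeated translation; uncovered ones stay available.
    attn = softmax(scores - penalty * coverage)
    coverage += attn
    print(f"step {t}: attends mostly to position {attn.argmax()}")
```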

Supervised Attentions for Neural Machine Translation

no code implementations EMNLP 2016 Haitao Mi, Zhiguo Wang, Abe Ittycheriah

We simply compute the distance between the machine attentions and the "true" alignments, and minimize this cost in the training procedure.

Machine Translation · Sentence +1
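The supervision term above lends itself to a compact sketch; the squared distance and the 0.5 weight are illustrative choices, not necessarily the paper's exact cost function.

```python
import numpy as np

def attention_supervision_loss(attn, gold):
    # Squared distance between predicted attention and the reference
    # alignment; the paper's exact distance function may differ.
    return ((attn - gold) ** 2).mean()

attn = np.array([[0.7, 0.2, 0.1],   # attention of target step 0 over 3 source words
                 [0.1, 0.8, 0.1]])  # target step 1
gold = np.array([[1.0, 0.0, 0.0],   # "true" word alignments
                 [0.0, 1.0, 0.0]])

nll = 1.25                          # stand-in for the usual translation loss
total = nll + 0.5 * attention_supervision_loss(attn, gold)
print(total)
```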

Temporal Attention Model for Neural Machine Translation

no code implementations 9 Aug 2016 Baskaran Sankaran, Haitao Mi, Yaser Al-Onaizan, Abe Ittycheriah

Attention-based Neural Machine Translation (NMT) models suffer from attention deficiency issues, as has been observed in recent research.

Machine Translation · NMT +2

Multi-Perspective Context Matching for Machine Comprehension

1 code implementation 13 Dec 2016 Zhiguo Wang, Haitao Mi, Wael Hamza, Radu Florian

Based on this dataset, we propose a Multi-Perspective Context Matching (MPCM) model, which is an end-to-end system that directly predicts the answer beginning and ending points in a passage.

Question Answering · Reading Comprehension
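A minimal sketch of begin/end point prediction over a passage, with toy scores standing in for MPCM's matching representations.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Toy per-position scores for a 5-token passage.
begin_scores = np.array([0.1, 2.0, 0.3, -1.0, 0.0])
end_scores   = np.array([-0.5, 0.2, 1.8, 0.4, 0.1])
p_begin, p_end = softmax(begin_scores), softmax(end_scores)

# Pick the span (b, e) with b <= e that maximizes p_begin[b] * p_end[e].
best = max(((b, e) for b in range(5) for e in range(b, 5)),
           key=lambda s: p_begin[s[0]] * p_end[s[1]])
print("predicted answer span:", best)
```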

R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling

1 code implementation ACL 2021 Xiang Hu, Haitao Mi, Zujie Wen, Yafang Wang, Yi Su, Jing Zheng, Gerard de Melo

Human language understanding operates at multiple levels of granularity (e.g., words, phrases, and sentences) with increasing levels of abstraction that can be hierarchically combined.

Language Modelling

A Dialogue-based Information Extraction System for Medical Insurance Assessment

no code implementations Findings (ACL) 2021 Shuang Peng, Mengdi Zhou, Minghui Yang, Haitao Mi, Shaosheng Cao, Zujie Wen, Teng Xu, Hongbin Wang, Lei Liu

In the Chinese medical insurance industry, the assessor's role is essential and requires significant efforts to converse with the claimant.

DP-FP: Differentially Private Forward Propagation for Large Models

no code implementations 29 Dec 2021 Jian Du, Haitao Mi

Our DP-FP employs novel (1) representation clipping followed by noise addition in the forward propagation stage, as well as (2) micro-batch construction via subsampling to achieve DP amplification and reduce noise power to $1/M$, where $M$ is the number of micro-batches in a step.

Privacy Preserving · Privacy Preserving Deep Learning
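The two ingredients named in the abstract can be sketched directly; the clipping bound, noise scale, and batch sizes below are illustrative, not calibrated to any privacy budget.

```python
import numpy as np

def clip_and_noise(h, C=1.0, sigma=0.5, rng=None):
    rng = rng or np.random.default_rng()
    # (1) Clip each example's forward representation to norm bound C,
    # then add Gaussian noise scaled to that bound.
    norms = np.linalg.norm(h, axis=1, keepdims=True)
    h = h * np.minimum(1.0, C / norms)
    return h + rng.normal(scale=sigma * C, size=h.shape)

rng = np.random.default_rng(0)
M = 8                                          # number of micro-batches
micro_batches = [rng.normal(size=(4, 16)) for _ in range(M)]
# (2) Averaging the noisy representations over M micro-batches cuts
# the noise power by a factor of 1/M.
noisy = np.mean([clip_and_noise(h, rng=rng) for h in micro_batches], axis=0)
print(noisy.shape)
```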

Learning a Grammar Inducer from Massive Uncurated Instructional Videos

1 code implementation 22 Oct 2022 Songyang Zhang, Linfeng Song, Lifeng Jin, Haitao Mi, Kun Xu, Dong Yu, Jiebo Luo

While previous work focuses on building systems for inducing grammars on text that is well-aligned with video content, we investigate the scenario in which text and video are only in loose correspondence.

Language Acquisition · Video Alignment

Discover, Explanation, Improvement: An Automatic Slice Detection Framework for Natural Language Processing

no code implementations 8 Nov 2022 Wenyue Hua, Lifeng Jin, Linfeng Song, Haitao Mi, Yongfeng Zhang, Dong Yu

Pretrained natural language processing (NLP) models have achieved high overall performance, but they still make systematic errors.

Friend-training: Learning from Models of Different but Related Tasks

no code implementations 31 Jan 2023 Mian Zhang, Lifeng Jin, Linfeng Song, Haitao Mi, Xiabing Zhou, Dong Yu

Current self-training methods, such as standard self-training, co-training, and tri-training, often focus on improving model performance on a single task by exploiting differences in input features, model architectures, and training processes.

Dialogue Rewriting · Dialogue Understanding +1

Search-Engine-augmented Dialogue Response Generation with Cheaply Supervised Query Production

1 code implementation 16 Feb 2023 Ante Wang, Linfeng Song, Qi Liu, Haitao Mi, Longyue Wang, Zhaopeng Tu, Jinsong Su, Dong Yu

We propose a dialogue model that can access the vast and dynamic information from any search engine for response generation.

Chatbot · Response Generation

Stabilizing RLHF through Advantage Model and Selective Rehearsal

no code implementations 18 Sep 2023 Baolin Peng, Linfeng Song, Ye Tian, Lifeng Jin, Haitao Mi, Dong Yu

Large Language Models (LLMs) have revolutionized natural language processing, yet aligning these models with human values and preferences using RLHF remains a significant challenge.

Inconsistent dialogue responses and how to recover from them

1 code implementation 18 Jan 2024 Mian Zhang, Lifeng Jin, Linfeng Song, Haitao Mi, Dong Yu

One critical issue for chat systems is staying consistent about their own preferences, opinions, beliefs, and facts, which has been shown to be a difficult problem.

Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation

no code implementations 14 Feb 2024 Xiaoying Zhang, Baolin Peng, Ye Tian, Jingyan Zhou, Lifeng Jin, Linfeng Song, Haitao Mi, Helen Meng

Despite showing increasingly human-like abilities, large language models (LLMs) often struggle with factual inaccuracies, i.e., "hallucinations", even when they hold relevant knowledge.

Fine-Grained Self-Endorsement Improves Factuality and Reasoning

no code implementations 23 Feb 2024 Ante Wang, Linfeng Song, Baolin Peng, Ye Tian, Lifeng Jin, Haitao Mi, Jinsong Su, Dong Yu

Experiments on Biographies show that our method can effectively improve the factuality of generations with simple and intuitive prompts across different scales of LLMs.

GSM8K · Language Modelling +2

Collaborative decoding of critical tokens for boosting factuality of large language models

no code implementations 28 Feb 2024 Lifeng Jin, Baolin Peng, Linfeng Song, Haitao Mi, Ye Tian, Dong Yu

The most common training pipeline for large language models includes pretraining, finetuning, and aligning phases, each with its own resulting model, such as the pretrained model and the finetuned model.

Hallucination · Instruction Following

A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation

no code implementations 6 Mar 2024 Xiangci Li, Linfeng Song, Lifeng Jin, Haitao Mi, Jessica Ouyang, Dong Yu

In this paper, we present a high-quality benchmark named multi-source Wizard of Wikipedia (Ms. WoW) for evaluating multi-source dialogue knowledge selection and response generation.

Dialogue Generation · Response Generation

Self-Consistency Boosts Calibration for Math Reasoning

no code implementations 14 Mar 2024 Ante Wang, Linfeng Song, Ye Tian, Baolin Peng, Lifeng Jin, Haitao Mi, Jinsong Su, Dong Yu

Calibration, which establishes the correlation between accuracy and model confidence, is important for LLM development.

GSM8K · Math
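The self-consistency signal can be sketched in a few lines: the vote share of the majority answer serves as the confidence estimate. The samples below are stand-ins for answers extracted from an LLM's sampled reasoning paths.

```python
from collections import Counter

# Toy stand-ins for final answers parsed from sampled reasoning chains.
samples = ["42", "42", "41", "42", "42", "17", "42", "42"]
votes = Counter(samples)
answer, count = votes.most_common(1)[0]
confidence = count / len(samples)          # agreement rate as confidence
print(f"answer={answer}, confidence={confidence:.2f}")   # 42, 0.75
```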

Entropy Guided Extrapolative Decoding to Improve Factuality in Large Language Models

no code implementations 14 Apr 2024 Souvik Das, Lifeng Jin, Linfeng Song, Haitao Mi, Baolin Peng, Dong Yu

Current state-of-the-art approaches refine decoding by contrasting early-exit distributions from a lower layer with the final layer to exploit information related to factuality within the model forward procedure.

Hallucination
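A minimal sketch of the layer-contrastive idea described above: subtracting a lower layer's early-exit log-probabilities from the final layer's boosts tokens whose evidence emerges in later layers. The two distributions here are toy stand-ins for a model's early-exit and final-layer predictions.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

vocab = ["Paris", "London", "Rome"]
logp_final = np.log(softmax(np.array([2.5, 1.0, 0.5])))  # final layer
logp_early = np.log(softmax(np.array([1.2, 1.1, 0.9])))  # lower layer

# Contrast amplifies knowledge that appears only in deeper layers.
contrast = logp_final - logp_early
print(vocab[int(np.argmax(contrast))])                    # "Paris"
```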

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

no code implementations 18 Apr 2024 Ye Tian, Baolin Peng, Linfeng Song, Lifeng Jin, Dian Yu, Haitao Mi, Dong Yu

Despite the impressive capabilities of Large Language Models (LLMs) on various tasks, they still struggle with scenarios that involve complex reasoning and planning.

Mathematical Reasoning · Self-Learning
