Search Results for author: Xinyi Wang

Found 77 papers, 32 papers with code

CMU’s IWSLT 2022 Dialect Speech Translation System

no code implementations • IWSLT (ACL) 2022 • Brian Yan, Patrick Fernandes, Siddharth Dalmia, Jiatong Shi, Yifan Peng, Dan Berrebbi, Xinyi Wang, Graham Neubig, Shinji Watanabe

We use additional paired Modern Standard Arabic data (MSA) to directly improve the speech recognition (ASR) and machine translation (MT) components of our cascaded systems.

Knowledge Distillation Machine Translation +3

Paper
Add Code

Grid Monitoring and Protection with Continuous Point-on-Wave Measurements and Generative AI

no code implementations • 11 Mar 2024 • Lang Tong, Xinyi Wang, Qing Zhao

Purpose This article presents a case for a next-generation grid monitoring and control system, leveraging recent advances in generative artificial intelligence (AI), machine learning, and statistical inference.

Data Compression Fault Detection

Paper
Add Code

Forecasting Electricity Market Signals via Generative AI

no code implementations • 9 Mar 2024 • Xinyi Wang, Qing Zhao, Lang Tong

This paper presents a generative artificial intelligence approach to probabilistic forecasting of electricity market signals, such as real-time locational marginal prices and area control error signals.

Time Series

Paper
Add Code

Multitask Multilingual Model Adaptation with Featurized Low-Rank Mixtures

no code implementations • 27 Feb 2024 • Chu-Cheng Lin, Xinyi Wang, Jonathan H. Clark, Han Lu, Yun Zhu, Chenxi Whitehouse, Hongkun Yu

By composing feature-specific parameters for each dataset, FLix can accommodate diverse dataset mixtures and generalize better to unseen datasets.

Paper
Add Code

A Survey on Data Selection for Language Models

1 code implementation • 26 Feb 2024 • Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre, Nathan Lambert, Xinyi Wang, Niklas Muennighoff, Bairu Hou, Liangming Pan, Haewon Jeong, Colin Raffel, Shiyu Chang, Tatsunori Hashimoto, William Yang Wang

A major factor in the recent success of large language models is the use of enormous and ever-growing text datasets for unsupervised pre-training.

Unsupervised Pre-training

Paper
Code

Generative Probabilistic Time Series Forecasting and Applications in Grid Operations

no code implementations • 21 Feb 2024 • Xinyi Wang, Lang Tong, Qing Zhao

Generative probabilistic forecasting produces future time series samples according to the conditional probability distribution given past time series observations.

Decision Making Probabilistic Time Series Forecasting +1

Paper
Add Code

Rate-Quality or Energy-Quality Pareto Fronts for Adaptive Video Streaming?

no code implementations • 10 Feb 2024 • Angeliki Katsenou, Xinyi Wang, Daniel Schien, David Bull

Adaptive video streaming is a key enabler for optimising the delivery of offline encoded video content.

Paper
Add Code

Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation

1 code implementation • 5 Feb 2024 • Xinyi Wang, Alfonso Amayuelas, Kexun Zhang, Liangming Pan, Wenhu Chen, William Yang Wang

To understand how pre-training with a next-token prediction objective contributes to the emergence of such reasoning capability, we propose that we can view an LM as deriving new conclusions by aggregating indirect reasoning paths seen at pre-training time.

Knowledge Graphs Math

Paper
Code

Tweets to Citations: Unveiling the Impact of Social Media Influencers on AI Research Visibility

no code implementations • 24 Jan 2024 • Iain Xie Weissburg, Mehir Arora, Xinyi Wang, Liangming Pan, William Yang Wang

As the number of accepted papers at AI and ML conferences reaches into the thousands, it has become unclear how researchers access and read research publications.

Causal Inference

Paper
Add Code

GE-AdvGAN: Improving the transferability of adversarial samples by gradient editing-based adversarial generative model

1 code implementation • 11 Jan 2024 • Zhiyu Zhu, Huaming Chen, Xinyi Wang, Jiayu Zhang, Zhibo Jin, Kim-Kwang Raymond Choo, Jun Shen, Dong Yuan

With the functional and characteristic similarity analysis, we introduce a novel gradient editing (GE) mechanism and verify its feasibility in generating transferable samples on various models.

Adversarial Attack

Paper
Code

MFABA: A More Faithful and Accelerated Boundary-based Attribution Method for Deep Neural Networks

1 code implementation • 21 Dec 2023 • Zhiyu Zhu, Huaming Chen, Jiayu Zhang, Xinyi Wang, Zhibo Jin, Minhui Xue, Dongxiao Zhu, Kim-Kwang Raymond Choo

To better understand the output of deep neural networks (DNN), attribution based methods have been an important approach for model interpretability, which assign a score for each input dimension to indicate its importance towards the model outcome.

Paper
Code

Comparative Study of Hardware and Software Power Measurements in Video Compression

no code implementations • 19 Dec 2023 • Angeliki Katsenou, Xinyi Wang, Daniel Schien, David Bull

The environmental impact of video streaming services has been discussed as part of the strategies towards sustainable information and communication technologies.

Video Compression

Paper
Add Code

The Good, The Bad, and Why: Unveiling Emotions in Generative AI

no code implementations • 18 Dec 2023 • Cheng Li, Jindong Wang, Yixuan Zhang, Kaijie Zhu, Xinyi Wang, Wenxin Hou, Jianxun Lian, Fang Luo, Qiang Yang, Xing Xie

Through extensive experiments involving language and multi-modal models on semantic understanding, logical reasoning, and generation tasks, we demonstrate that both textual and visual EmotionPrompt can boost the performance of AI models while EmotionAttack can hinder it.

Logical Reasoning

Paper
Add Code

Seeing through the Mask: Multi-task Generative Mask Decoupling Face Recognition

no code implementations • 20 Nov 2023 • Zhaohui Wang, Sufang Zhang, Jianteng Peng, Xinyi Wang, Yandong Guo

Therefore, this paper proposes a Multi-task gEnerative mask dEcoupling face Recognition (MEER) network to jointly handle these two tasks, which can learn occlusionirrelevant and identity-related representation while achieving unmasked face synthesis.

Face Generation Face Recognition

Paper
Add Code

SiRA: Sparse Mixture of Low Rank Adaptation

no code implementations • 15 Nov 2023 • Yun Zhu, Nevan Wichers, Chu-Cheng Lin, Xinyi Wang, Tianlong Chen, Lei Shu, Han Lu, Canoee Liu, Liangchen Luo, Jindong Chen, Lei Meng

Parameter Efficient Tuning has been an prominent approach to adapt the Large Language Model to downstream tasks.

Language Modelling Large Language Model

Paper
Add Code

Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization

no code implementations • 15 Nov 2023 • Alexandra Chronopoulou, Jonas Pfeiffer, Joshua Maynez, Xinyi Wang, Sebastian Ruder, Priyanka Agrawal

Parameter-efficient fine-tuning (PEFT) using labeled task data can significantly improve the performance of large language models (LLMs) on the downstream task.

Text Generation Zero-Shot Cross-Lingual Transfer

Paper
Add Code

Continual Event Extraction with Semantic Confusion Rectification

1 code implementation • 24 Oct 2023 • Zitao Wang, Xinyi Wang, Wei Hu

We study continual event extraction, which aims to extract incessantly emerging event information while avoiding forgetting.

Event Extraction Sentence

Paper
Code

DANAA: Towards transferable attacks with double adversarial neuron attribution

1 code implementation • 16 Oct 2023 • Zhibo Jin, Zhiyu Zhu, Xinyi Wang, Jiayu Zhang, Jun Shen, Huaming Chen

While deep neural networks have excellent results in many fields, they are susceptible to interference from attacking samples resulting in erroneous judgments.

Feature Importance

Paper
Code

Guiding Language Model Math Reasoning with Planning Tokens

no code implementations • 9 Oct 2023 • Xinyi Wang, Lucas Caccia, Oleksiy Ostapenko, Xingdi Yuan, William Yang Wang, Alessandro Sordoni

Large language models (LLMs) have recently attracted considerable interest for their ability to perform complex reasoning tasks, such as chain-of-thought reasoning.

Language Modelling Math

Paper
Add Code

The Robust Semantic Segmentation UNCV2023 Challenge Results

no code implementations • 27 Sep 2023 • Xuanlong Yu, Yi Zuo, Zitao Wang, Xiaowen Zhang, Jiaxuan Zhao, Yuting Yang, Licheng Jiao, Rui Peng, Xinyi Wang, Junpei Zhang, Kexin Zhang, Fang Liu, Roberto Alcover-Couso, Juan C. SanMiguel, Marcos Escudero-Viñolo, Hanlin Tian, Kenta Matsui, Tianhao Wang, Fahmy Adan, Zhitong Gao, Xuming He, Quentin Bouniot, Hossein Moghaddam, Shyam Nandan Rai, Fabio Cermelli, Carlo Masone, Andrea Pilzer, Elisa Ricci, Andrei Bursuc, Arno Solin, Martin Trapp, Rui Li, Angela Yao, Wenlong Chen, Ivor Simpson, Neill D. F. Campbell, Gianni Franchi

This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023.

Autonomous Driving Segmentation +2

Paper
Add Code

FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning

no code implementations • 9 Sep 2023 • Xinyi Wang, John Wieting, Jonathan H. Clark

Learning paradigms for large language models (LLMs) currently tend to fall within either in-context learning (ICL) or full fine-tuning.

In-Context Learning

Paper
Add Code

Multi-model fusion for Aerial Vision and Dialog Navigation based on human attention aids

no code implementations • 27 Aug 2023 • Xinyi Wang, Xuan Cui, Danxu Li, Fang Liu, Licheng Jiao

Drones have been widely used in many areas of our daily lives.

Paper
Add Code

UGC Quality Assessment: Exploring the Impact of Saliency in Deep Feature-Based Quality Assessment

no code implementations • 13 Aug 2023 • Xinyi Wang, Angeliki Katsenou, David Bull

Preliminary results indicate that high correlations are achieved by using only deep features while adding saliency is not always boosting the performance.

Paper
Add Code

Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategies

1 code implementation • 6 Aug 2023 • Liangming Pan, Michael Saxon, Wenda Xu, Deepak Nathani, Xinyi Wang, William Yang Wang

Large language models (LLMs) have demonstrated remarkable performance across a wide array of NLP tasks.

Hallucination

308

Paper
Code

Non-parametric Probabilistic Time Series Forecasting via Innovations Representation

no code implementations • 5 Jun 2023 • Xinyi Wang, Meijen Lee, Qing Zhao, Lang Tong

Probabilistic time series forecasting predicts the conditional probability distributions of the time series at a future time given past realizations.

Decision Making Probabilistic Time Series Forecasting +1

Paper
Add Code

mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations

no code implementations • 23 May 2023 • Jonas Pfeiffer, Francesco Piccinno, Massimo Nicosia, Xinyi Wang, Machel Reid, Sebastian Ruder

Multilingual sequence-to-sequence models perform poorly with increased language coverage and fail to consistently generate text in the correct target language in few-shot settings.

Hallucination Natural Language Understanding

Paper
Add Code

Evaluating and Modeling Attribution for Cross-Lingual Question Answering

no code implementations • 23 May 2023 • Benjamin Muller, John Wieting, Jonathan H. Clark, Tom Kwiatkowski, Sebastian Ruder, Livio Baldini Soares, Roee Aharoni, Jonathan Herzig, Xinyi Wang

Based on these models, we improve the attribution level of a cross-lingual question-answering system.

Attribute Cross-Lingual Question Answering +1

Paper
Add Code

TheoremQA: A Theorem-driven Question Answering dataset

1 code implementation • 21 May 2023 • Wenhu Chen, Ming Yin, Max Ku, Pan Lu, Yixin Wan, Xueguang Ma, Jianyu Xu, Xinyi Wang, Tony Xia

We evaluate a wide spectrum of 16 large language and code models with different prompting strategies like Chain-of-Thoughts and Program-of-Thoughts.

Ranked #1 on Natural Questions on TheoremQA

Math Question Answering

152

Paper
Code

Logic-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning

1 code implementation • 20 May 2023 • Liangming Pan, Alon Albalak, Xinyi Wang, William Yang Wang

We also introduce a self-refinement module, which utilizes the symbolic solver's error messages to revise symbolic formalizations.

Logical Reasoning

172

Paper
Code

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

1 code implementation • 19 May 2023 • Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson, Dmitry Panteleev, Partha Talukdar

We evaluate commonly used models on the benchmark.

In-Context Learning Multilingual NLP +3

Paper
Code

Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation

no code implementations • 18 May 2023 • Wanrong Zhu, Xinyi Wang, Yujie Lu, Tsu-Jui Fu, Xin Eric Wang, Miguel Eckstein, William Yang Wang

We conduct a series of experiments to compare the common edits made by humans and GPT-k, evaluate the performance of GPT-k in prompting T2I, and examine factors that may influence this process.

Text Generation Text-to-Image Generation

Paper
Add Code

Serial Contrastive Knowledge Distillation for Continual Few-shot Relation Extraction

1 code implementation • 11 May 2023 • Xinyi Wang, Zitao Wang, Wei Hu

Continual few-shot relation extraction (RE) aims to continuously train a model for new relations with few labeled training data, of which the major challenges are the catastrophic forgetting of old relations and the overfitting caused by data sparsity.

Contrastive Learning Knowledge Distillation +3

Paper
Code

Complexity and Enumeration in Models of Genome Rearrangement

no code implementations • 3 May 2023 • Lora Bailey, Heather Smith Blake, Garner Cochran, Nathan Fox, Michael Levet, Reem Mahmoud, Elizabeth Matson, Inne Singgih, Grace Stadnyk, Xinyi Wang, Alexander Wiedemann

In this paper, we examine the computational complexity of enumeration in certain genome rearrangement models.

Paper
Add Code

Air-Ground Integrated Sensing and Communications: Opportunities and Challenges

no code implementations • 13 Feb 2023 • Zesong Fei, Xinyi Wang, Nan Wu, Jingxuan Huang, J. Andrew Zhang

The air-ground integrated sensing and communications (AG-ISAC) network, which consists of unmanned aerial vehicles (UAVs) and ground terrestrial networks, offers unique capabilities and demands special design techniques.

Paper
Add Code

Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning

1 code implementation • NeurIPS 2023 • Xinyi Wang, Wanrong Zhu, Michael Saxon, Mark Steyvers, William Yang Wang

This study aims to examine the in-context learning phenomenon through a Bayesian lens, viewing real-world LLMs as latent variable models.

Few-Shot Learning GSM8K +5

Paper
Code

A Survey of Face Recognition

no code implementations • 26 Dec 2022 • Xinyi Wang, Jianteng Peng, Sufang Zhang, Bihui Chen, Yi Wang, Yandong Guo

Recent years witnessed the breakthrough of face recognition with deep convolutional neural networks.

Face Recognition

Paper
Add Code

Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks

2 code implementations • 22 Nov 2022 • Wenhu Chen, Xueguang Ma, Xinyi Wang, William W. Cohen

By combining PoT with self-consistency decoding, we can achieve SoTA performance on all math problem datasets and near-SoTA performance on financial datasets.

Math

1,017

Paper
Code

TW-BAG: Tensor-wise Brain-aware Gate Network for Inpainting Disrupted Diffusion Tensor Imaging

no code implementations • 31 Oct 2022 • Zihao Tang, Xinyi Wang, Lihaowen Zhu, Mariano Cabezas, Dongnan Liu, Michael Barnett, Weidong Cai, Chengyu Wang

Diffusion Weighted Imaging (DWI) is an advanced imaging technique commonly used in neuroscience and neurological clinical research through a Diffusion Tensor Imaging (DTI) model.

Paper
Add Code

Novelty Detection in Time Series via Weak Innovations Representation: A Deep Learning Approach

no code implementations • 24 Oct 2022 • Xinyi Wang, Mei-jen Lee, Qing Zhao, Lang Tong

We consider novelty detection in time series with unknown and nonparametric probability structures.

Novelty Detection Time Series +1

Paper
Add Code

A Multi-dimensional Evaluation of Tokenizer-free Multilingual Pretrained Models

no code implementations • 13 Oct 2022 • Jimin Sun, Patrick Fernandes, Xinyi Wang, Graham Neubig

Recent work on tokenizer-free multilingual pretrained models show promising results in improving cross-lingual transfer and reducing engineering overhead (Clark et al., 2022; Xue et al., 2022).

Cross-Lingual Transfer

Paper
Add Code

Semantic Preserving Adversarial Attack Generation with Autoencoder and Genetic Algorithm

no code implementations • 25 Aug 2022 • Xinyi Wang, Simon Yusuf Enoch, Dong Seong Kim

Widely used deep learning models are found to have poor robustness.

Adversarial Attack

Paper
Add Code

Enhancing Document-level Relation Extraction by Entity Knowledge Injection

1 code implementation • 23 Jul 2022 • Xinyi Wang, Zitao Wang, Weijian Sun, Wei Hu

Document-level relation extraction (RE) aims to identify the relations between entities throughout an entire document.

Ranked #23 on Relation Extraction on DocRED

Document-level Relation Extraction Knowledge Graphs +1

Paper
Code

A Comprehensive Review on Deep Supervision: Theories and Applications

no code implementations • 6 Jul 2022 • Renjie Li, Xinyi Wang, Guan Huang, Wenli Yang, Kaining Zhang, Xiaotong Gu, Son N. Tran, Saurabh Garg, Jane Alty, Quan Bai

Deep supervision, or known as 'intermediate supervision' or 'auxiliary supervision', is to add supervision at hidden layers of a neural network.

Paper
Add Code

Taxonomy of Benchmarks in Graph Representation Learning

1 code implementation • 15 Jun 2022 • Renming Liu, Semih Cantürk, Frederik Wenkel, Sarah McGuire, Xinyi Wang, Anna Little, Leslie O'Bray, Michael Perlmutter, Bastian Rieck, Matthew Hirn, Guy Wolf, Ladislav Rampášek

Graph Neural Networks (GNNs) extend the success of neural networks to graph-structured data by accounting for their intrinsic geometry.

Benchmarking Graph Representation Learning

Paper
Code

Causal Balancing for Domain Generalization

1 code implementation • 10 Jun 2022 • Xinyi Wang, Michael Saxon, Jiachen Li, Hongyang Zhang, Kun Zhang, William Yang Wang

While machine learning models rapidly advance the state-of-the-art on various real-world tasks, out-of-domain (OOD) generalization remains a challenging problem given the vulnerability of these models to spurious correlations.

Domain Generalization

Paper
Code

Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation

1 code implementation • ACL 2022 • Xinyi Wang, Sebastian Ruder, Graham Neubig

The performance of multilingual pretrained models is highly dependent on the availability of monolingual or parallel text present in a target language.

Paper
Code

Towards Bi-directional Skip Connections in Encoder-Decoder Architectures and Beyond

no code implementations • 11 Mar 2022 • Tiange Xiang, Chaoyi Zhang, Xinyi Wang, Yang song, Dongnan Liu, Heng Huang, Weidong Cai

With the backward skip connections, we propose a U-Net based network family, namely Bi-directional O-shape networks, which set new benchmarks on multiple public medical imaging segmentation datasets.

Medical Image Segmentation Neural Architecture Search +1

Paper
Add Code

Asynchronous Decentralized Federated Learning for Collaborative Fault Diagnosis of PV Stations

no code implementations • 28 Feb 2022 • Qi Liu, Bo Yang, Zhaojian Wang, Dafeng Zhu, Xinyi Wang, Kai Ma, Xinping Guan

Therefore, federated learning can be exploited to train a collaborative fault diagnosis model.

Federated Learning

Paper
Add Code

PECO: Examining Single Sentence Label Leakage in Natural Language Inference Datasets through Progressive Evaluation of Cluster Outliers

no code implementations • 16 Dec 2021 • Michael Saxon, Xinyi Wang, Wenda Xu, William Yang Wang

Building natural language inference (NLI) benchmarks that are both challenging for modern techniques, and free from shortcut biases is difficult.

Natural Language Inference Sentence

Paper
Add Code

Efficient Test Time Adapter Ensembling for Low-resource Language Varieties

1 code implementation • Findings (EMNLP) 2021 • Xinyi Wang, Yulia Tsvetkov, Sebastian Ruder, Graham Neubig

Adapters are light-weight modules that allow parameter-efficient fine-tuning of pretrained models.

Cross-Lingual Transfer named-entity-recognition +4

Paper
Code

A Dataset for Answering Time-Sensitive Questions

1 code implementation • 13 Aug 2021 • Wenhu Chen, Xinyi Wang, William Yang Wang

Lots of facts can evolve with respect to time.

Benchmarking

Paper
Code

BiX-NAS: Searching Efficient Bi-directional Architecture for Medical Image Segmentation

1 code implementation • 26 Jun 2021 • Xinyi Wang, Tiange Xiang, Chaoyi Zhang, Yang song, Dongnan Liu, Heng Huang, Weidong Cai

We evaluate BiX-NAS on two segmentation tasks using three different medical image datasets, and the experimental results show that our BiX-NAS searched architecture achieves the state-of-the-art performance with significantly lower computational cost.

Image Segmentation Medical Image Segmentation +3

Paper
Code

Innovations Autoencoder and its Application in One-class Anomalous Sequence Detection

no code implementations • 23 Jun 2021 • Xinyi Wang, Lang Tong

An innovations sequence of a time series is a sequence of independent and identically distributed random variables with which the original time series has a causal representation.

Anomaly Detection Gaussian Processes +2

Paper
Add Code

RefBERT: Compressing BERT by Referencing to Pre-computed Representations

no code implementations • 11 Jun 2021 • Xinyi Wang, Haiqin Yang, Liang Zhao, Yang Mo, Jianping Shen

Differently, in this paper, we propose RefBERT to leverage the knowledge learned from the teacher, i. e., facilitating the pre-computed BERT representation on the reference sample and compressing BERT into a smaller student model.

Knowledge Distillation

Paper
Add Code

Counterfactual Maximum Likelihood Estimation for Training Deep Networks

1 code implementation • NeurIPS 2021 • Xinyi Wang, Wenhu Chen, Michael Saxon, William Yang Wang

Although deep learning models have driven state-of-the-art performance on a wide array of tasks, they are prone to spurious correlations that should not be learned as predictive clues.

counterfactual Domain Generalization +2

Paper
Code

Applications of Artificial Intelligence to aid detection of dementia: a narrative review on current capabilities and future directions

no code implementations • 29 Apr 2021 • Renjie Li, Xinyi Wang, Katherine Lawler, Saurabh Garg, Quan Bai, Jane Alty

With populations ageing, the number of people with dementia worldwide is expected to triple to 152 million by 2050.

Paper
Add Code

Multi-view Subword Regularization

1 code implementation • NAACL 2021 • Xinyi Wang, Sebastian Ruder, Graham Neubig

Multilingual pretrained representations generally rely on subword segmentation algorithms to create a shared multilingual vocabulary.

Cross-Lingual Transfer Segmentation

Paper
Code

Gradient-guided Loss Masking for Neural Machine Translation

no code implementations • 26 Feb 2021 • Xinyi Wang, Ankur Bapna, Melvin Johnson, Orhan Firat

To mitigate the negative effect of low quality training data on the performance of neural machine translation models, most existing strategies focus on filtering out harmful data before training starts.

Machine Translation Translation

Paper
Add Code

Meta Back-translation

1 code implementation • ICLR 2021 • Hieu Pham, Xinyi Wang, Yiming Yang, Graham Neubig

Back-translation is an effective strategy to improve the performance of Neural Machine Translation~(NMT) by generating pseudo-parallel data.

Machine Translation Meta-Learning +2

32,798

Paper
Code

Fast Dynamics in a Model Metallic Glass-forming Material

no code implementations • 28 Jan 2021 • Hao Zhang, Xinyi Wang, Hai-Bin Yu, Jack F. Douglas

We investigate the fast $\beta$- and Johari-Goldstein (JG) $\beta$-relaxation processes, along with the elastic scattering response of glass-forming (GF) liquids and the Boson peak, in a simulated Al-Sm GF material exhibiting a fragile-strong (FS) transition.

Materials Science

Paper
Add Code

Dynamic Heterogeneity, Cooperative Motion, and Johari-Goldstein $β$-Relaxation in a Metallic Glass-Forming Material Exhibiting a Fragile to Strong Transition

no code implementations • 27 Jan 2021 • Hao Zhang, Xinyi Wang, Hai-Bin Yu, Jack F. Douglas

We investigate the Johari-Goldstein (JG) $\beta$-relaxation process in a model metallic glass-forming (GF) material (Al90Sm10), previously studied extensively by both frequency-dependent mechanical measurements and simulation studies devoted to equilibrium properties, by molecular dynamics simulations based on validated and optimized interatomic potentials with the primary aim of better understanding the nature of this universal relaxation process from a dynamic heterogeneity (DH) perspective.

Materials Science

Paper
Add Code

Modeling Disclosive Transparency in NLP Application Descriptions

1 code implementation • EMNLP 2021 • Michael Saxon, Sharon Levy, Xinyi Wang, Alon Albalak, William Yang Wang

Broader disclosive transparency$-$truth and clarity in communication regarding the function of AI systems$-$is widely considered desirable.

Fairness Language Modelling +1

Paper
Code

A Deep Learning Approach to Anomaly Sequence Detection for High-Resolution Monitoring of Power Systems

no code implementations • 9 Dec 2020 • Kursat Rasim Mestav, Xinyi Wang, Lang Tong

A deep learning approach is proposed to detect data and system anomalies using high-resolution continuous point-on-wave (CPOW) or phasor measurements.

Anomaly Detection Generative Adversarial Network

Paper
Add Code

Improving Target-side Lexical Transfer in Multilingual Neural Machine Translation

no code implementations • Findings of the Association for Computational Linguistics 2020 • Luyu Gao, Xinyi Wang, Graham Neubig

To improve the performance of Neural Machine Translation~(NMT) for low-resource languages~(LRL), one effective strategy is to leverage parallel data from a related high-resource language~(HRL).

Machine Translation NMT +1

Paper
Add Code

Adaptive Subband Compression for Streaming of Continuous Point-on-Wave and PMU Data

no code implementations • 23 Aug 2020 • Xinyi Wang, Yilu Liu, Lang Tong

A data compression system capable of providing real-time streaming of high-resolution continuous point-on-wave (CPOW) and phasor measurement unit (PMU) measurements is proposed.

Data Compression

Paper
Add Code

Balancing Training for Multilingual Neural Machine Translation

2 code implementations • ACL 2020 • Xinyi Wang, Yulia Tsvetkov, Graham Neubig

When training multilingual machine translation (MT) models that can translate to/from multiple languages, we are faced with imbalanced training sets: some languages have much more training data than others.

Machine Translation Translation

Paper
Code

A Probabilistic Formulation of Unsupervised Text Style Transfer

5 code implementations • ICLR 2020 • Junxian He, Xinyi Wang, Graham Neubig, Taylor Berg-Kirkpatrick

Across all style transfer tasks, our approach yields substantial gains over state-of-the-art non-generative baselines, including the state-of-the-art unsupervised machine translation techniques that our approach generalizes.

Decipherment Language Modelling +6

222

Paper
Code

Optimizing Data Usage via Differentiable Rewards

1 code implementation • ICML 2020 • Xinyi Wang, Hieu Pham, Paul Michel, Antonios Anastasopoulos, Jaime Carbonell, Graham Neubig

To acquire a new skill, humans learn better and faster if a tutor, based on their current knowledge level, informs them of how much attention they should pay to particular content or practice problems.

Image Classification Machine Translation

Paper
Code

Improving Conditioning in Context-Aware Sequence to Sequence Models

no code implementations • 21 Nov 2019 • Xinyi Wang, Jason Weston, Michael Auli, Yacine Jernite

Neural sequence to sequence models are well established for applications which can be cast as mapping a single input sequence into a single output sequence.

Ranked #6 on Open-Domain Question Answering on ELI5

abstractive question answering Data Augmentation +2

Paper
Add Code

Domain Differential Adaptation for Neural Machine Translation

1 code implementation • WS 2019 • Zi-Yi Dou, Xinyi Wang, Junjie Hu, Graham Neubig

We then use these learned domain differentials to adapt models for the target task accordingly.

Domain Adaptation Machine Translation +1

Paper
Code

Target Conditioned Sampling: Optimizing Data Selection for Multilingual Neural Machine Translation

no code implementations • ACL 2019 • Xinyi Wang, Graham Neubig

To improve low-resource Neural Machine Translation (NMT) with multilingual corpora, training on the most related high-resource language only is often more effective than using all data available (Neubig and Hu, 2018).

Low-Resource Neural Machine Translation NMT +2

Paper
Add Code

compare-mt: A Tool for Holistic Comparison of Language Generation Systems

2 code implementations • NAACL 2019 • Graham Neubig, Zi-Yi Dou, Junjie Hu, Paul Michel, Danish Pruthi, Xinyi Wang, John Wieting

In this paper, we describe compare-mt, a tool for holistic analysis and comparison of the results of systems for language generation tasks such as machine translation.

Machine Translation Sentence +2

461

Paper
Code

The ARIEL-CMU Systems for LoReHLT18

no code implementations • 24 Feb 2019 • Aditi Chaudhary, Siddharth Dalmia, Junjie Hu, Xinjian Li, Austin Matthews, Aldrian Obaja Muis, Naoki Otani, Shruti Rijhwani, Zaid Sheikh, Nidhi Vyas, Xinyi Wang, Jiateng Xie, Ruochen Xu, Chunting Zhou, Peter J. Jansen, Yiming Yang, Lori Levin, Florian Metze, Teruko Mitamura, David R. Mortensen, Graham Neubig, Eduard Hovy, Alan W. black, Jaime Carbonell, Graham V. Horwood, Shabnam Tafreshi, Mona Diab, Efsun S. Kayi, Noura Farra, Kathleen McKeown

This paper describes the ARIEL-CMU submissions to the Low Resource Human Language Technologies (LoReHLT) 2018 evaluations for the tasks Machine Translation (MT), Entity Discovery and Linking (EDL), and detection of Situation Frames in Text and Speech (SF Text and Speech).

Machine Translation Translation

Paper
Add Code

Multilingual Neural Machine Translation With Soft Decoupled Encoding

1 code implementation • ICLR 2019 • Xinyi Wang, Hieu Pham, Philip Arthur, Graham Neubig

Multilingual training of neural machine translation (NMT) systems has led to impressive accuracy improvements on low-resource languages.

Machine Translation NMT +1

Paper
Code

A Tree-based Decoder for Neural Machine Translation

1 code implementation • EMNLP 2018 • Xinyi Wang, Hieu Pham, Pengcheng Yin, Graham Neubig

Recent advances in Neural Machine Translation (NMT) show that adding syntactic information to NMT systems can improve the quality of their translations.

Machine Translation NMT +2

Paper
Code

SwitchOut: an Efficient Data Augmentation Algorithm for Neural Machine Translation

no code implementations • EMNLP 2018 • Xinyi Wang, Hieu Pham, Zihang Dai, Graham Neubig

In this work, we examine methods for data augmentation for text-based tasks such as neural machine translation (NMT).

Data Augmentation Machine Translation +3

Paper
Add Code

XNMT: The eXtensible Neural Machine Translation Toolkit

1 code implementation • WS 2018 • Graham Neubig, Matthias Sperber, Xinyi Wang, Matthieu Felix, Austin Matthews, Sarguna Padmanabhan, Ye Qi, Devendra Singh Sachan, Philip Arthur, Pierre Godard, John Hewitt, Rachid Riad, Liming Wang

In this paper we describe the design of XNMT and its experiment configuration system, and demonstrate its utility on the tasks of machine translation, speech recognition, and multi-tasked machine translation/parsing.

Machine Translation NMT +3

185

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.