1 code implementation • EMNLP 2021 • Yichen Jiang, Mohit Bansal
Motivated by the failure of a Transformer model on the SCAN compositionality challenge (Lake and Baroni, 2018), which requires parsing a command into actions, we propose two auxiliary sequence prediction tasks that track the progress of function and argument semantics, as additional training supervision.
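The entry above describes auxiliary sequence prediction tasks used as extra training supervision. Below is a minimal sketch of how such multi-task supervision can be wired into a seq2seq Transformer; the two auxiliary heads (func_head, arg_head), the shared embedding, the label format, and the loss weight are all illustrative assumptions, not the paper's actual design.

```python
# Hypothetical sketch: a seq2seq Transformer with two auxiliary prediction
# heads trained jointly with the main action-prediction task. Only the
# multi-task loss structure mirrors the entry's description; everything
# else (head names, label formats, shared vocab/embedding) is assumed.
import torch
import torch.nn as nn

class AuxSupervisedSeq2Seq(nn.Module):
    def __init__(self, vocab_size, aux_vocab_size, d_model=128):
        super().__init__()
        # One shared embedding for commands and actions is a simplification.
        self.embed = nn.Embedding(vocab_size, d_model)
        self.transformer = nn.Transformer(d_model=d_model, batch_first=True)
        self.action_head = nn.Linear(d_model, vocab_size)    # main task: next action
        self.func_head = nn.Linear(d_model, aux_vocab_size)  # aux task 1: function progress
        self.arg_head = nn.Linear(d_model, aux_vocab_size)   # aux task 2: argument progress

    def forward(self, src, tgt_in):
        h = self.transformer(self.embed(src), self.embed(tgt_in))
        return self.action_head(h), self.func_head(h), self.arg_head(h)

def multitask_loss(model, src, tgt_in, tgt_out, func_labels, arg_labels, aux_weight=0.5):
    ce = nn.CrossEntropyLoss()
    act, func, arg = model(src, tgt_in)
    main = ce(act.flatten(0, 1), tgt_out.flatten())
    aux = ce(func.flatten(0, 1), func_labels.flatten()) \
        + ce(arg.flatten(0, 1), arg_labels.flatten())
    return main + aux_weight * aux

# Dummy usage with SCAN-like shapes (labels are placeholders):
model = AuxSupervisedSeq2Seq(vocab_size=20, aux_vocab_size=10)
src = torch.randint(0, 20, (4, 9))    # batch of commands
tgt = torch.randint(0, 20, (4, 12))   # corresponding action sequences
func = torch.randint(0, 10, (4, 12))  # per-step function labels (assumed format)
arg = torch.randint(0, 10, (4, 12))   # per-step argument labels (assumed format)
loss = multitask_loss(model, src, tgt[:, :-1], tgt[:, 1:], func[:, 1:], arg[:, 1:])
```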
1 code implementation • 28 Nov 2022 • Yichen Jiang, Xiang Zhou, Mohit Bansal
Recent datasets expose the lack of systematic generalization ability in standard sequence-to-sequence models.
no code implementations • MTSummit 2021 • Paul Soulos, Sudha Rao, Caitlin Smith, Eric Rosen, Asli Celikyilmaz, R. Thomas McCoy, Yichen Jiang, Coleman Haley, Roland Fernandez, Hamid Palangi, Jianfeng Gao, Paul Smolensky
Machine translation has seen rapid progress with the advent of Transformer-based models.
1 code implementation • Findings (EMNLP) 2021 • Yichen Jiang, Mohit Bansal
On examples with a maximum source and target length of 30 from the De-En, WMT'16 English-Romanian, and WMT'21 English-Chinese translation tasks, our learned order outperforms all heuristic generation orders on four of the six tasks.
1 code implementation • NAACL 2021 • Yichen Jiang, Asli Celikyilmaz, Paul Smolensky, Paul Soulos, Sudha Rao, Hamid Palangi, Roland Fernandez, Caitlin Smith, Mohit Bansal, Jianfeng Gao
On several syntactic and semantic probing tasks, we demonstrate the emergent structural information in the role vectors and improved syntactic interpretability in the TPR layer outputs.
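The entry above refers to role vectors and a TPR (Tensor Product Representation) layer. As a hedged illustration, here is a minimal sketch of a TPR-style binding layer in which each token's content ("filler") vector is bound to a learned structural "role" vector; whether binding uses a Hadamard product or a full outer product, and how roles are actually computed, are assumptions here rather than the paper's exact layer.

```python
# Hypothetical sketch of a TPR-style binding layer. The elementwise
# (Hadamard) binding and the sigmoid role gate are assumptions for
# illustration; see the paper for the actual TPR layer.
import torch
import torch.nn as nn

class TPRBinding(nn.Module):
    def __init__(self, d_model):
        super().__init__()
        self.filler_proj = nn.Linear(d_model, d_model)
        self.role_proj = nn.Linear(d_model, d_model)

    def forward(self, x):                        # x: (batch, seq, d_model)
        filler = self.filler_proj(x)             # content: "what" the token carries
        role = torch.sigmoid(self.role_proj(x))  # structural role: "where/how" it fits
        return filler * role                     # elementwise filler-role binding

x = torch.randn(2, 5, 64)
out = TPRBinding(64)(x)  # same shape as x: (2, 5, 64)
```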
no code implementations • Findings (EMNLP) 2020 • Yichen Jiang, Shikha Bordia, Zheng Zhong, Charles Dognin, Maneesh Singh, Mohit Bansal
We introduce HoVer (HOppy VERification), a dataset for many-hop evidence extraction and fact verification.
1 code implementation • IJCNLP 2019 • Yichen Jiang, Mohit Bansal
Multi-hop QA requires a model to connect multiple pieces of evidence scattered in a long context to answer the question.
1 code implementation • ACL 2019 • Yichen Jiang, Mohit Bansal
After adversarial training, the baseline's performance improves but is still limited on the adversarial evaluation.
1 code implementation • ACL 2019 • Yichen Jiang, Nitish Joshi, Yen-Chun Chen, Mohit Bansal
Multi-hop reading comprehension requires a model to explore and connect relevant information from multiple sentences or documents in order to answer a question about the context.
no code implementations • EMNLP 2018 • Yichen Jiang, Mohit Bansal
A good neural sequence-to-sequence summarization model should have a strong encoder that can distill and memorize the important information from long input texts so that the decoder can generate salient summaries based on the encoder's memory.
Ranked #37 on Abstractive Text Summarization on CNN / Daily Mail
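As a hedged illustration of the encoder-memory view this entry describes, here is a minimal sketch in which the encoder distills a long input into a memory and the decoder attends over that memory while generating the summary; the module sizes and wiring are illustrative, not the paper's architecture.

```python
# Minimal sketch of the encoder-memory view of seq2seq summarization:
# the encoder produces a memory of the source document and the decoder
# reads from that memory at every step. Shapes and sizes are illustrative.
import torch
import torch.nn as nn

d_model = 64
encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True), num_layers=2)
decoder = nn.TransformerDecoder(
    nn.TransformerDecoderLayer(d_model, nhead=4, batch_first=True), num_layers=2)

src = torch.randn(1, 100, d_model)  # long input document (already embedded)
tgt = torch.randn(1, 20, d_model)   # summary prefix (already embedded)

memory = encoder(src)                  # encoder "memory" distilled from the source
summary_states = decoder(tgt, memory)  # decoder generates by reading encoder memory
```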