no code implementations • EMNLP (Louhi) 2020 • Tarek Sakakini, Jong Yoon Lee, Aditya Duri, Renato F.L. Azevedo, Victor Sadauskas, Kuangxiao Gu, Suma Bhat, Dan Morrow, James Graumlich, Saqib Walayat, Mark Hasegawa-Johnson, Thomas Huang, Ann Willemsen-Dunlap, Donald Halpin
We also show the enhanced accuracy of our system over directly-supervised neural methods in this low-resource setting.
no code implementations • ACL (MWE) 2021 • Jianing Zhou, Hongyu Gong, Suma Bhat
Idiomatic expressions (IE) play an important role in natural language, and have long been a “pain in the neck” for NLP systems.
no code implementations • EMNLP 2021 • Jianing Zhou, Suma Bhat
This paper focuses on paraphrase generation, which is a widely studied natural language generation task in NLP.
no code implementations • INLG (ACL) 2020 • Hongyu Gong, Linfeng Song, Suma Bhat
Text style transfer aims to change an input sentence to an output sentence by changing its text style while preserving the content.
1 code implementation • 1 Oct 2022 • S Ashwin Hebbar, Viraj Nadkarni, Ashok Vardhan Makkuva, Suma Bhat, Sewoong Oh, Pramod Viswanath
We design a principled curriculum, guided by information-theoretic insights, to train CRISP and show that it outperforms the successive-cancellation (SC) decoder and attains near-optimal reliability performance on the Polar(32, 16) and Polar(64, 22) codes.
no code implementations • 8 Jul 2022 • Ziheng Zeng, Suma Bhat
Idiomatic expressions (IEs), characterized by their non-compositionality, are an important part of natural language.
no code implementations • 16 Dec 2021 • Jianing Zhou, Ziheng Zeng, Hongyu Gong, Suma Bhat
In this paper, we study the task of idiomatic sentence paraphrasing (ISP), which aims to paraphrase a sentence with an IE by replacing the IE with its literal paraphrase.
1 code implementation • 19 Oct 2021 • Ziheng Zeng, Suma Bhat
Idiomatic expressions are an integral part of natural language and constantly being added to a language.
1 code implementation • Findings (EMNLP) 2021 • Wanzheng Zhu, Suma Bhat
It is a well-known approach for fringe groups and organizations to use euphemisms -- ordinary-sounding and innocent-looking words with a secret meaning -- to conceal what they are discussing.
1 code implementation • Findings (ACL) 2021 • Wanzheng Zhu, Suma Bhat
Countermeasures to effectively fight the ever increasing hate speech online without blocking freedom of speech is of great social interest.
1 code implementation • 24 May 2021 • Hongyu Gong, Alberto Valido, Katherine M. Ingram, Giulia Fanti, Suma Bhat, Dorothy L. Espelage
Abusive language is a massive problem in online social platforms.
no code implementations • 13 Apr 2021 • Jianing Zhou, Hongyu Gong, Srihari Nanniyur, Suma Bhat
We study a new application for text generation -- idiomatic sentence generation -- which aims to transfer literal phrases in sentences into their idiomatic counterparts.
1 code implementation • 31 Mar 2021 • Wanzheng Zhu, Hongyu Gong, Rohan Bansal, Zachary Weinberg, Nicolas Christin, Giulia Fanti, Suma Bhat
It is usually apparent to a human moderator that a word is being used euphemistically, but they may not know what the secret meaning is, and therefore whether the message violates policy.
2 code implementations • Findings of the Association for Computational Linguistics 2020 • Wanzheng Zhu, Suma Bhat
Automatic evaluation metrics are indispensable for evaluating generated text.
1 code implementation • CONLL 2020 • Hongyu Gong, Suma Bhat, Pramod Viswanath
The meaning of a word is closely linked to sociocultural factors that can change over time and location, resulting in corresponding meaning changes.
no code implementations • WS 2020 • Hongyu Gong, Kshitij Gupta, Akriti Jain, Suma Bhat
Metaphors are rhetorical use of words based on the conceptual mapping as opposed to their literal use.
1 code implementation • 10 Oct 2019 • Wanzheng Zhu, Hongyu Gong, Jiaming Shen, Chao Zhang, Jingbo Shang, Suma Bhat, Jiawei Han
In this paper, we study the task of multi-faceted set expansion, which aims to capture all semantic facets in the seed set and return multiple sets of entities, one for each semantic facet.
no code implementations • IJCNLP 2019 • Omer Anjum, Hongyu Gong, Suma Bhat, Wen-mei Hwu, JinJun Xiong
Finding the right reviewers to assess the quality of conference submissions is a time consuming process for conference organizers.
no code implementations • WS 2019 • Tarek Sakakini, Hongyu Gong, Jong Yoon Lee, Robert Schloss, JinJun Xiong, Suma Bhat
One of the challenges of building natural language processing (NLP) applications for education is finding a large domain-specific corpus for the subject of interest (e. g., history or science).
no code implementations • NAACL 2019 • Hongyu Gong, Suma Bhat, Lingfei Wu, JinJun Xiong, Wen-mei Hwu
Our generator employs an attention-based encoder-decoder to transfer a sentence from the source style to the target style.
1 code implementation • ACL 2018 • Hongyu Gong, Tarek Sakakini, Suma Bhat, JinJun Xiong
This is because of the lexical, contextual and the abstraction gaps between a long document of rich details and its concise summary of abstract information.
no code implementations • 23 Jan 2019 • Hongyu Gong, Yuchen Li, Suma Bhat, Pramod Viswanath
Misspelled words of the malicious kind work by changing specific keywords and are intended to thwart existing automated applications for cyber-environment control such as harassing content detection on the Internet and email spam detection.
1 code implementation • EMNLP 2018 • Hongyu Gong, Jiaqi Mu, Suma Bhat, Pramod Viswanath
Prepositions are highly polysemous, and their variegated senses encode significant semantic information.
no code implementations • NAACL 2018 • Hongyu Gong, Suma Bhat, Pramod Viswanath
Prepositions are among the most frequent words in English and play complex roles in the syntax and semantics of sentences.
no code implementations • ACL 2017 • Jiaqi Mu, Suma Bhat, Pramod Viswanath
Sentences are important semantic units of natural language.
no code implementations • ACL 2017 • Tarek Sakakini, Suma Bhat, Pramod Viswanath
We present in this paper a novel framework for morpheme segmentation which uses the morpho-syntactic regularities preserved by word representations, in addition to orthographic features, to segment words into morphemes.
no code implementations • 7 Feb 2017 • Tarek Sakakini, Suma Bhat, Pramod Viswanath
We present an unsupervised and language-agnostic method for learning root-and-pattern morphology in Semitic languages.
no code implementations • 5 Feb 2017 • Hongyu Gong, Jiaqi Mu, Suma Bhat, Pramod Viswanath
Prepositions are highly polysemous, and their variegated senses encode significant semantic information.
4 code implementations • ICLR 2018 • Jiaqi Mu, Suma Bhat, Pramod Viswanath
The postprocessing is empirically validated on a variety of lexical-level intrinsic tasks (word similarity, concept categorization, word analogy) and sentence-level tasks (semantic textural similarity and { text classification}) on multiple datasets and with a variety of representation methods and hyperparameter choices in multiple languages; in each case, the processed representations are consistently better than the original ones.
Ranked #10 on
Sentiment Analysis
on MR
1 code implementation • 29 Nov 2016 • Hongyu Gong, Suma Bhat, Pramod Viswanath
This paper proposes a simple test for compositionality (i. e., literal usage) of a word or phrase in a context-specific way.
no code implementations • 24 Oct 2016 • Jiaqi Mu, Suma Bhat, Pramod Viswanath
Vector representations of words have heralded a transformational approach to classical problems in NLP; the most popular example is word2vec.