4 code implementations • ICLR 2018 • Jiaqi Mu, Suma Bhat, Pramod Viswanath
The postprocessing is empirically validated on a variety of lexical-level intrinsic tasks (word similarity, concept categorization, word analogy) and sentence-level tasks (semantic textual similarity and text classification) on multiple datasets and with a variety of representation methods and hyperparameter choices in multiple languages; in each case, the processed representations are consistently better than the original ones.
Ranked #11 on Sentiment Analysis on MR
1 code implementation • ACL 2018 • Hongyu Gong, Tarek Sakakini, Suma Bhat, JinJun Xiong
This is because of the lexical, contextual and the abstraction gaps between a long document of rich details and its concise summary of abstract information.
1 code implementation • 31 Mar 2021 • Wanzheng Zhu, Hongyu Gong, Rohan Bansal, Zachary Weinberg, Nicolas Christin, Giulia Fanti, Suma Bhat
It is usually apparent to a human moderator that a word is being used euphemistically, but they may not know what the secret meaning is, and therefore whether the message violates policy.
1 code implementation • Findings (EMNLP) 2021 • Wanzheng Zhu, Suma Bhat
It is a well-known approach for fringe groups and organizations to use euphemisms -- ordinary-sounding and innocent-looking words with a secret meaning -- to conceal what they are discussing.
2 code implementations • Findings of the Association for Computational Linguistics 2020 • Wanzheng Zhu, Suma Bhat
Automatic evaluation metrics are indispensable for evaluating generated text.
1 code implementation • Findings (ACL) 2021 • Wanzheng Zhu, Suma Bhat
Countermeasures to effectively fight the ever-increasing hate speech online without blocking freedom of speech are of great social interest.
1 code implementation • 19 Oct 2021 • Ziheng Zeng, Suma Bhat
Idiomatic expressions are an integral part of natural language and are constantly being added to a language.
1 code implementation • 29 Nov 2016 • Hongyu Gong, Suma Bhat, Pramod Viswanath
This paper proposes a simple test for compositionality (i.e., literal usage) of a word or phrase in a context-specific way.
1 code implementation • 10 Oct 2019 • Wanzheng Zhu, Hongyu Gong, Jiaming Shen, Chao Zhang, Jingbo Shang, Suma Bhat, Jiawei Han
In this paper, we study the task of multi-faceted set expansion, which aims to capture all semantic facets in the seed set and return multiple sets of entities, one for each semantic facet.
1 code implementation • 24 May 2021 • Hongyu Gong, Alberto Valido, Katherine M. Ingram, Giulia Fanti, Suma Bhat, Dorothy L. Espelage
Abusive language is a massive problem in online social platforms.
1 code implementation • CONLL 2020 • Hongyu Gong, Suma Bhat, Pramod Viswanath
The meaning of a word is closely linked to sociocultural factors that can change over time and location, resulting in corresponding meaning changes.
1 code implementation • 1 Oct 2022 • S Ashwin Hebbar, Viraj Nadkarni, Ashok Vardhan Makkuva, Suma Bhat, Sewoong Oh, Pramod Viswanath
We design a principled curriculum, guided by information-theoretic insights, to train CRISP and show that it outperforms the successive-cancellation (SC) decoder and attains near-optimal reliability performance on the Polar(32, 16) and Polar(64, 22) codes.
1 code implementation • EMNLP 2018 • Hongyu Gong, Jiaqi Mu, Suma Bhat, Pramod Viswanath
Prepositions are highly polysemous, and their variegated senses encode significant semantic information.
1 code implementation • 11 Dec 2023 • Ziheng Zeng, Kellen Tan Cheng, Srihari Venkat Nanniyur, Jianing Zhou, Suma Bhat
Unlike prior works that enable IE comprehension through fine-tuning PTLMs with sentences containing IEs, in this work, we construct IEKG, a commonsense knowledge graph for figurative interpretations of IEs.
no code implementations • NAACL 2018 • Hongyu Gong, Suma Bhat, Pramod Viswanath
Prepositions are among the most frequent words in English and play complex roles in the syntax and semantics of sentences.
no code implementations • ACL 2017 • Tarek Sakakini, Suma Bhat, Pramod Viswanath
We present in this paper a novel framework for morpheme segmentation which uses the morpho-syntactic regularities preserved by word representations, in addition to orthographic features, to segment words into morphemes.
no code implementations • ACL 2017 • Jiaqi Mu, Suma Bhat, Pramod Viswanath
Sentences are important semantic units of natural language.
no code implementations • 7 Feb 2017 • Tarek Sakakini, Suma Bhat, Pramod Viswanath
We present an unsupervised and language-agnostic method for learning root-and-pattern morphology in Semitic languages.
no code implementations • 5 Feb 2017 • Hongyu Gong, Jiaqi Mu, Suma Bhat, Pramod Viswanath
Prepositions are highly polysemous, and their variegated senses encode significant semantic information.
no code implementations • 24 Oct 2016 • Jiaqi Mu, Suma Bhat, Pramod Viswanath
Vector representations of words have heralded a transformational approach to classical problems in NLP; the most popular example is word2vec.
no code implementations • 23 Jan 2019 • Hongyu Gong, Yuchen Li, Suma Bhat, Pramod Viswanath
Misspelled words of the malicious kind work by changing specific keywords and are intended to thwart existing automated applications for cyber-environment control such as harassing content detection on the Internet and email spam detection.
no code implementations • NAACL 2019 • Hongyu Gong, Suma Bhat, Lingfei Wu, JinJun Xiong, Wen-mei Hwu
Our generator employs an attention-based encoder-decoder to transfer a sentence from the source style to the target style.
no code implementations • WS 2019 • Tarek Sakakini, Hongyu Gong, Jong Yoon Lee, Robert Schloss, JinJun Xiong, Suma Bhat
One of the challenges of building natural language processing (NLP) applications for education is finding a large domain-specific corpus for the subject of interest (e.g., history or science).
no code implementations • IJCNLP 2019 • Omer Anjum, Hongyu Gong, Suma Bhat, Wen-mei Hwu, JinJun Xiong
Finding the right reviewers to assess the quality of conference submissions is a time-consuming process for conference organizers.
no code implementations • WS 2020 • Hongyu Gong, Kshitij Gupta, Akriti Jain, Suma Bhat
Metaphors are rhetorical uses of words based on conceptual mapping, as opposed to their literal use.
no code implementations • 13 Apr 2021 • Jianing Zhou, Hongyu Gong, Srihari Nanniyur, Suma Bhat
We study a new application for text generation -- idiomatic sentence generation -- which aims to transfer literal phrases in sentences into their idiomatic counterparts.
no code implementations • ACL (MWE) 2021 • Jianing Zhou, Hongyu Gong, Suma Bhat
Idiomatic expressions (IE) play an important role in natural language, and have long been a “pain in the neck” for NLP systems.
no code implementations • EMNLP 2021 • Jianing Zhou, Suma Bhat
This paper focuses on paraphrase generation, which is a widely studied natural language generation task in NLP.
no code implementations • INLG (ACL) 2020 • Hongyu Gong, Linfeng Song, Suma Bhat
Text style transfer aims to change an input sentence to an output sentence by changing its text style while preserving the content.
no code implementations • EMNLP (Louhi) 2020 • Tarek Sakakini, Jong Yoon Lee, Aditya Duri, Renato F.L. Azevedo, Victor Sadauskas, Kuangxiao Gu, Suma Bhat, Dan Morrow, James Graumlich, Saqib Walayat, Mark Hasegawa-Johnson, Thomas Huang, Ann Willemsen-Dunlap, Donald Halpin
We also show the enhanced accuracy of our system over directly-supervised neural methods in this low-resource setting.
no code implementations • 16 Dec 2021 • Jianing Zhou, Ziheng Zeng, Hongyu Gong, Suma Bhat
In this paper, we study the task of idiomatic sentence paraphrasing (ISP), which aims to paraphrase a sentence with an IE by replacing the IE with its literal paraphrase.
1 code implementation • 8 Jul 2022 • Ziheng Zeng, Suma Bhat
Idiomatic expressions (IEs), characterized by their non-compositionality, are an important part of natural language.
1 code implementation • 29 Oct 2023 • Ziheng Zeng, Suma Bhat
Accurate processing of non-compositional language relies on generating good representations for such expressions.