1 code implementation • EMNLP 2020 • Dheeraj Mekala, Xinyang Zhang, Jingbo Shang
Based on seed words, we rank and filter motif instances to distill highly label-indicative ones as {``}seed motifs{''}, which provide additional weak supervision.
no code implementations • 25 May 2022 • Dheeraj Mekala, Tu Vu, Jingbo Shang
We improve generative data augmentation by formulating the data generation as context generation task and use question answering (QA) datasets for intermediate training.
1 code implementation • 25 May 2022 • Dheeraj Mekala, chengyu dong, Jingbo Shang
Weakly supervised text classification methods typically train a deep neural classifier based on pseudo-labels.
no code implementations • EMNLP 2021 • Dheeraj Mekala, Varun Gangal, Jingbo Shang
Existing text classification methods mainly focus on a fixed label set, whereas many real-world applications require extending to new fine-grained classes as the number of samples per label increases.
no code implementations • Findings (EMNLP) 2021 • Zichao Li, Dheeraj Mekala, chengyu dong, Jingbo Shang
To recognize the poisoned subset, we examine the training samples with these identified triggers as the most suspicious token, and check if removing the trigger will change the poisoned model's prediction.
1 code implementation • 18 Apr 2021 • Xiuwen Zheng, Dheeraj Mekala, Amarnath Gupta, Jingbo Shang
Hashtag annotation for microblog posts has been recently formulated as a sequence generation problem to handle emerging hashtags that are unseen in the training set.
2 code implementations • NAACL 2021 • Zihan Wang, Dheeraj Mekala, Jingbo Shang
Finally, we pick the most confident documents from each cluster to train a text classifier.
1 code implementation • ACL 2020 • Dheeraj Mekala, Jingbo Shang
Weakly supervised text classification based on a few user-provided seed words has recently attracted much attention from researchers.
no code implementations • 17 Feb 2018 • Dheeraj Mekala, Vivek Gupta, Purushottam Kar, Harish Karnick
We extend the consistency of hierarchical classification algorithm over asymmetric tree distance loss.
4 code implementations • EMNLP 2017 • Dheeraj Mekala, Vivek Gupta, Bhargavi Paranjape, Harish Karnick
We present a feature vector formation technique for documents - Sparse Composite Document Vector (SCDV) - which overcomes several shortcomings of the current distributional paragraph vector representations that are widely used for text representation.
no code implementations • 20 Dec 2016 • Rahul Wadbude, Vivek Gupta, Dheeraj Mekala, Harish Karnick
Review score prediction of text reviews has recently gained a lot of attention in recommendation systems.