no code implementations • ICON 2020 • . Chandrahas, Nilesh Agrawal, Partha Talukdar
For this distinction, simple functions of vectors’ dimensions, called interactions, are used.
1 code implementation • 25 Apr 2024 • Harman Singh, Nitish Gupta, Shikhar Bharadwaj, Dinesh Tewari, Partha Talukdar
To facilitate research on multilingual LLM evaluation, we release IndicGenBench - the largest benchmark for evaluating LLMs on user-facing generation tasks across a diverse set 29 of Indic languages covering 13 scripts and 4 language families.
1 code implementation • 4 Jan 2024 • Rachit Bansal, Bidisha Samanta, Siddharth Dalmia, Nitish Gupta, Shikhar Vashishth, Sriram Ganapathy, Abhishek Bapna, Prateek Jain, Partha Talukdar
Foundational models with billions of parameters which have been trained on large corpora of data have demonstrated non-trivial skills in a variety of domains.
no code implementations • 2 Nov 2023 • Megh Thakkar, Tolga Bolukbasi, Sriram Ganapathy, Shikhar Vashishth, Sarath Chandar, Partha Talukdar
Once the pre-training corpus has been assembled, all data samples in the corpus are treated with equal importance during LM pre-training.
no code implementations • 19 Sep 2023 • Shikhar Bharadwaj, Min Ma, Shikhar Vashishth, Ankur Bapna, Sriram Ganapathy, Vera Axelrod, Siddharth Dalmia, Wei Han, Yu Zhang, Daan van Esch, Sandy Ritchie, Partha Talukdar, Jason Riesa
Spoken language identification refers to the task of automatically predicting the spoken language in a given utterance.
no code implementations • 7 Jun 2023 • Shikhar Vashishth, Shikhar Bharadwaj, Sriram Ganapathy, Ankur Bapna, Min Ma, Wei Han, Vera Axelrod, Partha Talukdar
In this paper, we propose a novel framework of combining self-supervised representation learning with the language label information for the pre-training task.
1 code implementation • 19 May 2023 • Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson, Dmitry Panteleev, Partha Talukdar
We evaluate commonly used models on the benchmark.
no code implementations • 22 Mar 2023 • Jeremy R. Cole, Aditi Chaudhary, Bhuwan Dhingra, Partha Talukdar
First, we find that SSM alone improves the downstream performance on three temporal tasks by an avg.
no code implementations • 21 Nov 2022 • Shaily Bhatt, Sunipa Dev, Partha Talukdar, Shachi Dave, Vinodkumar Prabhakaran
Recent research has revealed undesirable biases in NLP data and models.
no code implementations • 14 Nov 2022 • Sagar Gubbi Venkatesh, Partha Talukdar, Srini Narayanan
We compare the performance of different LLMs including PaLM and GPT-3 and find that the end-to-end task completion rate is 48% for English UI but the performance drops to 32% for other languages.
no code implementations • 13 Oct 2022 • Abhijeet Awasthi, Nitish Gupta, Bidisha Samanta, Shachi Dave, Sunita Sarawagi, Partha Talukdar
Despite cross-lingual generalization demonstrated by pre-trained multilingual models, the translate-train paradigm of transferring English datasets across multiple languages remains to be a key mechanism for training task-specific multilingual models.
no code implementations • 12 Oct 2022 • Aditya Sharma, Apoorv Saxena, Chitrank Gupta, Seyed Mehran Kazemi, Partha Talukdar, Soumen Chakrabarti
Recent years have witnessed much interest in temporal reasoning over knowledge graphs (KG) for complex question answering (QA), but there remains a substantial gap in human capabilities.
1 code implementation • 25 Sep 2022 • Shaily Bhatt, Sunipa Dev, Partha Talukdar, Shachi Dave, Vinodkumar Prabhakaran
In this paper, we focus on NLP fair-ness in the context of India.
no code implementations • 14 Sep 2022 • Kartikeya Badola, Shachi Dave, Partha Talukdar
We address this challenge by proposing LAFT-URIEL, a parameter-efficient finetuning strategy which aims to increase the number of languages on which the model improves after an update, while reducing the magnitude of loss in performance for the remaining languages.
no code implementations • 25 May 2022 • Simran Khanuja, Sebastian Ruder, Partha Talukdar
In order for NLP technology to be widely applicable, fair, and useful, it needs to serve a diverse set of speakers across the world's languages, be equitable, i. e., not unduly biased towards any particular language, and be inclusive of all users, particularly in low-resource settings where compute constraints are common.
1 code implementation • ACL 2022 • Vaidehi Patil, Partha Talukdar, Sunita Sarawagi
This results in improved zero-shot transfer from related HRLs to LRLs without reducing HRL representation and accuracy.
2 code implementations • NAACL 2022 • Ameet Deshpande, Partha Talukdar, Karthik Narasimhan
While recent work on multilingual language models has demonstrated their capacity for cross-lingual zero-shot transfer on downstream tasks, there is a lack of consensus in the community as to what shared properties between languages enable such transfer.
no code implementations • ACL 2022 • Kalpesh Krishna, Deepak Nathani, Xavier Garcia, Bidisha Samanta, Partha Talukdar
When compared to prior work, our model achieves 2-3x better performance in formality transfer and code-mixing addition across seven languages.
1 code implementation • AKBC 2021 • Keshav Kolluru, Martin Rezk, Pat Verga, William W. Cohen, Partha Talukdar
This makes it challenging to link KG facts to sentences in languages other than the limited set of languages.
1 code implementation • ACL 2021 • Yash Khemchandani, Sarvesh Mehtani, Vaidehi Patil, Abhijeet Awasthi, Partha Talukdar, Sunita Sarawagi
RelateLM uses transliteration to convert the unseen script of limited LRL text into the script of a Related Prominent Language (RPL) (Hindi in our case).
no code implementations • 5 Jun 2021 • Simran Khanuja, Melvin Johnson, Partha Talukdar
Pre-trained multilingual language models (LMs) have achieved state-of-the-art results in cross-lingual transfer, but they often lead to an inequitable representation of languages due to limited capacity, skewed pre-training data, and sub-optimal vocabularies.
2 code implementations • ACL 2021 • Apoorv Saxena, Soumen Chakrabarti, Partha Talukdar
Temporal Knowledge Graphs (Temporal KGs) extend regular Knowledge Graphs by providing temporal scopes (start and end times) on each edge in the KG.
Ranked #5 on Question Answering on CronQuestions
1 code implementation • Findings (ACL) 2021 • Sawan Kumar, Partha Talukdar
Finally, we analyze the learned prompts to reveal novel insights, including the idea that two training examples in the right order alone can provide competitive performance for sentiment classification and natural language inference.
1 code implementation • 19 Mar 2021 • Simran Khanuja, Diksha Bansal, Sarvesh Mehtani, Savya Khosla, Atreyee Dey, Balaji Gopalan, Dilip Kumar Margam, Pooja Aggarwal, Rajiv Teja Nagipogu, Shachi Dave, Shruti Gupta, Subhash Chandra Bose Gali, Vish Subramanian, Partha Talukdar
This can be explained by the fact that multilingual language models (LMs) are often trained on 100+ languages together, leading to a small representation of IN languages in their vocabulary and training data.
no code implementations • 26 Dec 2020 • Sagar Gubbi Venkatesh, Anirban Biswas, Raviteja Upadrashta, Vikram Srinivasan, Partha Talukdar, Bharadwaj Amrutur
Robots that can manipulate objects in unstructured environments and collaborate with humans can benefit immensely by understanding natural language.
2 code implementations • ACL 2020 • Apoorv Saxena, Aditay Tripathi, Partha Talukdar
In a separate line of research, KG embedding methods have been proposed to reduce KG sparsity by performing missing link prediction.
1 code implementation • ACL 2020 • Sawan Kumar, Partha Talukdar
In this work, we focus on the task of natural language inference (NLI) and address the following question: can we build NLI systems which produce labels with high accuracy, while also generating faithful explanations of its decisions?
1 code implementation • 18 May 2020 • Vivek Gupta, Ankit Saw, Pegah Nokhiz, Praneeth Netrapalli, Piyush Rai, Partha Talukdar
One of the key reasons is that a longer document is likely to contain words from many different topics; hence, creating a single vector while ignoring all the topical structure is unlikely to yield an effective document representation.
2 code implementations • TACL 2020 • Ashutosh Kumar, Kabir Ahuja, Raghuram Vadapalli, Partha Talukdar
In these methods, syntactic-guidance is sourced from a separate exemplar sentence.
1 code implementation • NeurIPS 2019 • Naganand Yadati, Madhav Nimishakavi, Prateek Yadav, Vikram Nitin, Anand Louis, Partha Talukdar
In many real-world network datasets such as co-authorship, co-citation, email communication, etc., relationships are complex and go beyond pairwise.
1 code implementation • 18 Nov 2019 • Vivek Gupta, Ankit Saw, Pegah Nokhiz, Harshit Gupta, Partha Talukdar
Through extensive experiments on multiple real-world datasets, we show that SCDV-MS embeddings outperform previous state-of-the-art embeddings on multi-class and multi-label text categorization tasks.
Ranked #5 on Document Classification on Reuters-21578 (F1 metric)
2 code implementations • ACL 2020 • Zhiqing Sun, Shikhar Vashishth, Soumya Sanyal, Partha Talukdar, Yiming Yang
Knowledge Graph Completion (KGC) aims at automatically predicting missing links for large-scale knowledge graphs.
Ranked #25 on Link Prediction on FB15k-237 (MR metric)
4 code implementations • ICLR 2020 • Shikhar Vashishth, Soumya Sanyal, Vikram Nitin, Partha Talukdar
Multi-relational graphs are a more general and prevalent form of graphs where each edge has a label and direction associated with it.
Ranked #23 on Link Prediction on FB15k-237
no code implementations • IJCNLP 2019 • Swapnil Gupta, Sreyash Kenkre, Partha Talukdar
Organization of such triples in the form of a graph with noun phrases (NPs) as nodes and relation phrases (RPs) as edges results in the construction of Open Knowledge Graphs (OpenKGs).
1 code implementation • 1 Nov 2019 • Shikhar Vashishth, Soumya Sanyal, Vikram Nitin, Nilesh Agrawal, Partha Talukdar
In this paper, we analyze how increasing the number of these interactions affects link prediction performance, and utilize our observations to propose InteractE.
Ranked #11 on Link Prediction on YAGO3-10
no code implementations • 25 Sep 2019 • Naganand Yadati, Tingran Gao, Shahab Asoodeh, Partha Talukdar, Anand Louis
In this paper, we explore GNNs for graph-based SSL of histograms.
1 code implementation • ACL 2019 • Sawan Kumar, Sharmistha Jat, Karan Saxena, Partha Talukdar
To overcome this challenge, we propose Extended WSD Incorporating Sense Embeddings (EWISE), a supervised model to perform WSD by predicting over a continuous sense embedding space as opposed to a discrete label space.
1 code implementation • ACL 2019 • Sharmistha Jat, Hao Tang, Partha Talukdar, Tom Mitchell
To the best of our knowledge, this is the first work showing that the MEG brain recording when reading a word in a sentence can be used to distinguish earlier words in the sentence.
1 code implementation • NAACL 2019 • Ashutosh Kumar, Satwik Bhattamishra, Bh, Manik ari, Partha Talukdar
Inducing diversity in the task of paraphrasing is an important problem in NLP with applications in data augmentation and conversational agents.
no code implementations • ICLR 2019 • Naganand Yadati, Vikram Nitin, Madhav Nimishakavi, Prateek Yadav, Anand Louis, Partha Talukdar
Additionally, there is need to represent the direction from reactants to products.
1 code implementation • 1 Feb 2019 • Shikhar Vashishth, Prince Jain, Partha Talukdar
Open Information Extraction (OpenIE) methods extract (noun phrase, relation phrase, noun phrase) triples from text, resulting in the construction of large Open Knowledge Bases (Open KBs).
Ranked #1 on Noun Phrase Canonicalization on Ambiguous Dataset
1 code implementation • ACL 2018 • Shikhar Vashishth, Shib Sankar Dasgupta, Swayambhu Nath Ray, Partha Talukdar
While existing approaches for these tasks assume accurate knowledge of the document date, this is not always available, especially for arbitrary documents from the Web.
Ranked #1 on Document Dating on APW
1 code implementation • 24 Jan 2019 • Shikhar Vashishth, Prateek Yadav, Manik Bhandari, Partha Talukdar
Graph-based Semi-Supervised Learning (SSL) methods aim to address this problem by labeling a small subset of the nodes as seeds and then utilizing the graph structure to predict label scores for the rest of the nodes in the graph.
1 code implementation • EMNLP 2018 • Swayambhu Nath Ray, Shib Sankar Dasgupta, Partha Talukdar
Knowledge of the creation date of documents facilitates several tasks such as summarization, event extraction, temporally focused information extraction etc.
1 code implementation • EMNLP 2018 • Shikhar Vashishth, Rishabh Joshi, Sai Suman Prayaga, Chiranjib Bhattacharyya, Partha Talukdar
In this paper, we propose RESIDE, a distantly-supervised neural relation extraction method which utilizes additional side information from KBs for improved relation extraction.
Ranked #5 on Relation Extraction on NYT Corpus
1 code implementation • 14 Nov 2018 • Soumya Sanyal, Janakiraman Balachandran, Naganand Yadati, Abhishek Kumar, Padmini Rajagopalan, Suchismita Sanyal, Partha Talukdar
Some of the major challenges involved in developing such models are, (i) limited availability of materials data as compared to other fields, (ii) lack of universal descriptor of materials to predict its various properties.
Ranked #4 on Band Gap on Materials Project
1 code implementation • EMNLP 2018 • Shib Sankar Dasgupta, Swayambhu Nath Ray, Partha Talukdar
Knowledge Graph (KG) embedding has emerged as an active area of research resulting in the development of several KG embedding methods.
1 code implementation • ACL 2019 • Shikhar Vashishth, Manik Bhandari, Prateek Yadav, Piyush Rai, Chiranjib Bhattacharyya, Partha Talukdar
Word embeddings have been widely adopted across several NLP applications.
1 code implementation • 7 Sep 2018 • Naganand Yadati, Madhav Nimishakavi, Prateek Yadav, Vikram Nitin, Anand Louis, Partha Talukdar
In many real-world network datasets such as co-authorship, co-citation, email communication, etc., relationships are complex and go beyond pairwise.
1 code implementation • ACL 2018 • {Ch, rahas}, Aditya Sharma, Partha Talukdar
These KG embedding methods represent KG entities and relations as vectors in a high-dimensional space.
1 code implementation • NAACL 2018 • Priya Radhakrishnan, Partha Talukdar, Vasudeva Varma
Entity Linking (EL) systems aim to automatically map mentions of an entity in text to the corresponding entity in a Knowledge Graph (KG).
Ranked #3 on Entity Linking on CoNLL-Aida
1 code implementation • 29 May 2018 • Prateek Yadav, Madhav Nimishakavi, Naganand Yadati, Shikhar Vashishth, Arun Rajkumar, Partha Talukdar
We analyse local and global properties of graphs and demonstrate settings where LCNs tend to work better than GCNs.
5 code implementations • 19 Apr 2018 • Sharmistha Jat, Siddhesh Khandelwal, Partha Talukdar
Relation extraction is the problem of classifying the relationship between two entities in a given sentence.
no code implementations • 18 Feb 2018 • Madhav Nimishakavi, Bamdev Mishra, Manish Gupta, Partha Talukdar
Besides the tensors, in many real world scenarios, side information is also available in the form of matrices which also grow in size with time.
no code implementations • 26 Oct 2017 • Tushar Nagarajan, Sharmistha, Partha Talukdar
The unsupervised nature of this technique allows it to scale to web-scale relation extraction tasks, at the expense of noise in the training data.
1 code implementation • EMNLP 2017 • Aditya Sharma, Zarana Parekh, Partha Talukdar
RLIE-DQN is a recently proposed Reinforcement Learning-based Information Extraction (IE) technique which is able to incorporate external evidence during the extraction process.
no code implementations • EMNLP 2017 • Prakhar Ojha, Partha Talukdar
Automatic construction of large knowledge graphs (KG) by mining web-scale text datasets has received considerable attention recently.
1 code implementation • ACL 2018 • Madhav Nimishakavi, Partha Talukdar
Relation Schema Induction (RSI) is the problem of identifying type signatures of arguments of relations from unlabeled text.
no code implementations • 21 Oct 2016 • Prakhar Ojha, Partha Talukdar
Automatic construction of large knowledge graphs (KG) by mining web-scale text datasets has received considerable attention recently.
1 code implementation • EMNLP 2016 • Madhav Nimishakavi, Uday Singh Saini, Partha Talukdar
To the best of our knowledge, this is the first application of tensor factorization for the RSI problem.
no code implementations • 6 Dec 2015 • Danish, Yogesh Dahiya, Partha Talukdar
We hypothesize that these factors contribute with varying degrees towards getting responses from others for a given question.