Search Results for author: Mayank Singh

Found 53 papers, 13 papers with code

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

6 code implementations • 9 Nov 2022 • BigScience Workshop, :, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel van Strien, David Ifeoluwa Adelani, Dragomir Radev, Eduardo González Ponferrada, Efrat Levkovizh, Ethan Kim, Eyal Bar Natan, Francesco De Toni, Gérard Dupont, Germán Kruszewski, Giada Pistilli, Hady Elsahar, Hamza Benyamina, Hieu Tran, Ian Yu, Idris Abdulmumin, Isaac Johnson, Itziar Gonzalez-Dios, Javier de la Rosa, Jenny Chim, Jesse Dodge, Jian Zhu, Jonathan Chang, Jörg Frohberg, Joseph Tobing, Joydeep Bhattacharjee, Khalid Almubarak, Kimbo Chen, Kyle Lo, Leandro von Werra, Leon Weber, Long Phan, Loubna Ben allal, Ludovic Tanguy, Manan Dey, Manuel Romero Muñoz, Maraim Masoud, María Grandury, Mario Šaško, Max Huang, Maximin Coavoux, Mayank Singh, Mike Tian-Jian Jiang, Minh Chien Vu, Mohammad A. Jauhar, Mustafa Ghaleb, Nishant Subramani, Nora Kassner, Nurulaqilla Khamis, Olivier Nguyen, Omar Espejel, Ona de Gibert, Paulo Villegas, Peter Henderson, Pierre Colombo, Priscilla Amuok, Quentin Lhoest, Rheza Harliman, Rishi Bommasani, Roberto Luis López, Rui Ribeiro, Salomey Osei, Sampo Pyysalo, Sebastian Nagel, Shamik Bose, Shamsuddeen Hassan Muhammad, Shanya Sharma, Shayne Longpre, Somaieh Nikpoor, Stanislav Silberberg, Suhas Pai, Sydney Zink, Tiago Timponi Torrent, Timo Schick, Tristan Thrush, Valentin Danchev, Vassilina Nikoulina, Veronika Laippala, Violette Lepercq, Vrinda Prabhu, Zaid Alyafeai, Zeerak Talat, Arun Raja, Benjamin Heinzerling, Chenglei Si, Davut Emre Taşar, Elizabeth Salesky, Sabrina J. Mielke, Wilson Y. Lee, Abheesht Sharma, Andrea Santilli, Antoine Chaffin, Arnaud Stiegler, Debajyoti Datta, Eliza Szczechla, Gunjan Chhablani, Han Wang, Harshit Pandey, Hendrik Strobelt, Jason Alan Fries, Jos Rozen, Leo Gao, Lintang Sutawika, M Saiful Bari, Maged S. Al-shaibani, Matteo Manica, Nihal Nayak, Ryan Teehan, Samuel Albanie, Sheng Shen, Srulik Ben-David, Stephen H. Bach, Taewoon Kim, Tali Bers, Thibault Fevry, Trishala Neeraj, Urmish Thakker, Vikas Raunak, Xiangru Tang, Zheng-Xin Yong, Zhiqing Sun, Shaked Brody, Yallow Uri, Hadar Tojarieh, Adam Roberts, Hyung Won Chung, Jaesung Tae, Jason Phang, Ofir Press, Conglong Li, Deepak Narayanan, Hatim Bourfoune, Jared Casper, Jeff Rasley, Max Ryabinin, Mayank Mishra, Minjia Zhang, Mohammad Shoeybi, Myriam Peyrounette, Nicolas Patry, Nouamane Tazi, Omar Sanseviero, Patrick von Platen, Pierre Cornette, Pierre François Lavallée, Rémi Lacroix, Samyam Rajbhandari, Sanchit Gandhi, Shaden Smith, Stéphane Requena, Suraj Patil, Tim Dettmers, Ahmed Baruwa, Amanpreet Singh, Anastasia Cheveleva, Anne-Laure Ligozat, Arjun Subramonian, Aurélie Névéol, Charles Lovering, Dan Garrette, Deepak Tunuguntla, Ehud Reiter, Ekaterina Taktasheva, Ekaterina Voloshina, Eli Bogdanov, Genta Indra Winata, Hailey Schoelkopf, Jan-Christoph Kalo, Jekaterina Novikova, Jessica Zosa Forde, Jordan Clive, Jungo Kasai, Ken Kawamura, Liam Hazan, Marine Carpuat, Miruna Clinciu, Najoung Kim, Newton Cheng, Oleg Serikov, Omer Antverg, Oskar van der Wal, Rui Zhang, Ruochen Zhang, Sebastian Gehrmann, Shachar Mirkin, Shani Pais, Tatiana Shavrina, Thomas Scialom, Tian Yun, Tomasz Limisiewicz, Verena Rieser, Vitaly Protasov, Vladislav Mikhailov, Yada Pruksachatkun, Yonatan Belinkov, Zachary Bamberger, Zdeněk Kasner, Alice Rueda, Amanda Pestana, Amir Feizpour, Ammar Khan, Amy Faranak, Ana Santos, Anthony Hevia, Antigona Unldreaj, Arash Aghagol, Arezoo Abdollahi, Aycha Tammour, Azadeh HajiHosseini, Bahareh Behroozi, Benjamin Ajibade, Bharat Saxena, Carlos Muñoz Ferrandis, Daniel McDuff, Danish Contractor, David Lansky, Davis David, Douwe Kiela, Duong A. Nguyen, Edward Tan, Emi Baylor, Ezinwanne Ozoani, Fatima Mirza, Frankline Ononiwu, Habib Rezanejad, Hessie Jones, Indrani Bhattacharya, Irene Solaiman, Irina Sedenko, Isar Nejadgholi, Jesse Passmore, Josh Seltzer, Julio Bonis Sanz, Livia Dutra, Mairon Samagaio, Maraim Elbadri, Margot Mieskes, Marissa Gerchick, Martha Akinlolu, Michael McKenna, Mike Qiu, Muhammed Ghauri, Mykola Burynok, Nafis Abrar, Nazneen Rajani, Nour Elkott, Nour Fahmy, Olanrewaju Samuel, Ran An, Rasmus Kromann, Ryan Hao, Samira Alizadeh, Sarmad Shubber, Silas Wang, Sourav Roy, Sylvain Viguier, Thanh Le, Tobi Oyebade, Trieu Le, Yoyo Yang, Zach Nguyen, Abhinav Ramesh Kashyap, Alfredo Palasciano, Alison Callahan, Anima Shukla, Antonio Miranda-Escalada, Ayush Singh, Benjamin Beilharz, Bo wang, Caio Brito, Chenxi Zhou, Chirag Jain, Chuxin Xu, Clémentine Fourrier, Daniel León Periñán, Daniel Molano, Dian Yu, Enrique Manjavacas, Fabio Barth, Florian Fuhrimann, Gabriel Altay, Giyaseddin Bayrak, Gully Burns, Helena U. Vrabec, Imane Bello, Ishani Dash, Jihyun Kang, John Giorgi, Jonas Golde, Jose David Posada, Karthik Rangasai Sivaraman, Lokesh Bulchandani, Lu Liu, Luisa Shinzato, Madeleine Hahn de Bykhovetz, Maiko Takeuchi, Marc Pàmies, Maria A Castillo, Marianna Nezhurina, Mario Sänger, Matthias Samwald, Michael Cullan, Michael Weinberg, Michiel De Wolf, Mina Mihaljcic, Minna Liu, Moritz Freidank, Myungsun Kang, Natasha Seelam, Nathan Dahlberg, Nicholas Michio Broad, Nikolaus Muellner, Pascale Fung, Patrick Haller, Ramya Chandrasekhar, Renata Eisenberg, Robert Martin, Rodrigo Canalli, Rosaline Su, Ruisi Su, Samuel Cahyawijaya, Samuele Garda, Shlok S Deshmukh, Shubhanshu Mishra, Sid Kiblawi, Simon Ott, Sinee Sang-aroonsiri, Srishti Kumar, Stefan Schweter, Sushil Bharati, Tanmay Laud, Théo Gigant, Tomoya Kainuma, Wojciech Kusa, Yanis Labrak, Yash Shailesh Bajaj, Yash Venkatraman, Yifan Xu, Yingxin Xu, Yu Xu, Zhe Tan, Zhongli Xie, Zifan Ye, Mathilde Bras, Younes Belkada, Thomas Wolf

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions.

Language Modelling Multilingual NLP

2,181

Paper
Code

Charting the Right Manifold: Manifold Mixup for Few-shot Learning

7 code implementations • 28 Jul 2019 • Puneet Mangla, Mayank Singh, Abhishek Sinha, Nupur Kumari, Vineeth N. Balasubramanian, Balaji Krishnamurthy

A recent regularization technique - Manifold Mixup focuses on learning a general-purpose representation, robust to small changes in the data distribution.

Ranked #13 on Few-Shot Image Classification on Tiered ImageNet 5-way (5-shot)

Few-Shot Image Classification Few-Shot Learning +2

465

Paper
Code

Augmented Convolutional LSTMs for Generation of High-Resolution Climate Change Projections

1 code implementation • 23 Sep 2020 • Nidhin Harilal, Udit Bhatia, Mayank Singh

Projection of changes in extreme indices of climate variables such as temperature and precipitation are critical to assess the potential impacts of climate change on human-made and natural systems, including critical infrastructures and ecosystems.

Super-Resolution Vocal Bursts Intensity Prediction

Paper
Code

LineEX: Data Extraction from Scientific Line Charts

1 code implementation • Winter Conference on Applications of Computer Vision 2023 • Shivasankaran V P, Muhammad Yusuf Hassan, Mayank Singh

In this paper, we introduce LINEEX that extracts data from scientific line charts.

Paper
Code

Attributional Robustness Training using Input-Gradient Spatial Alignment

1 code implementation • ECCV 2020 • Mayank Singh, Nupur Kumari, Puneet Mangla, Abhishek Sinha, Vineeth N. Balasubramanian, Balaji Krishnamurthy

Safe deployment of machine learning system mandates that the prediction and its explanation be reliable and robust.

Ranked #1 on Weakly-Supervised Object Localization on CUB-200-2011 (Top-1 Error Rate metric)

BIG-bench Machine Learning Weakly-Supervised Object Localization

Paper
Code

Harnessing the Vulnerability of Latent Layers in Adversarially Trained Models

1 code implementation • 13 May 2019 • Mayank Singh, Abhishek Sinha, Nupur Kumari, Harshitha Machiraju, Balaji Krishnamurthy, Vineeth N. Balasubramanian

We analyze the adversarially trained robust models to study their vulnerability against adversarial attacks at the level of the latent layers.

Adversarial Attack

Paper
Code

IIT Gandhinagar at SemEval-2019 Task 3: Contextual Emotion Detection Using Deep Learning

1 code implementation • SEMEVAL 2019 • Arik Pamnani, Rajat Goel, Jayesh Choudhari, Mayank Singh

Recent advancements in Internet and Mobile infrastructure have resulted in the development of faster and efficient platforms of communication.

Chatbot

Paper
Code

Bollyrics: Automatic Lyrics Generator for Romanised Hindi

1 code implementation • 25 Jul 2020 • Naman Jain, Ankush Chauhan, Atharva Chewale, Ojas Mithbavkar, Ujjaval Shah, Mayank Singh

Song lyrics convey a meaningful story in a creative manner with complex rhythmic patterns.

Paper
Code

COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer Reviews

1 code implementation • 9 Aug 2021 • Shruti Singh, Mayank Singh, Pawan Goyal

We present COMPARE, a taxonomy and a dataset of comparison discussions in peer reviews of research papers in the domain of experimental deep learning.

Paper
Code

The Inefficiency of Language Models in Scholarly Retrieval: An Experimental Walk-through

1 code implementation • Findings (ACL) 2022 • Shruti Singh, Mayank Singh

Language models are increasingly becoming popular in AI-powered scientific IR systems.

Retrieval

Paper
Code

Cross-lingual Editing in Multilingual Language Models

1 code implementation • 19 Jan 2024 • Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh

The training of large language models (LLMs) necessitates substantial data and computational resources, and updating outdated LLMs entails significant efforts and resources.

Model Editing

Paper
Code

How Robust are the Tabular QA Models for Scientific Tables? A Study using Customized Dataset

1 code implementation • 30 Mar 2024 • Akash Ghosh, B Venkata Sahith, Niloy Ganguly, Pawan Goyal, Mayank Singh

Question-answering (QA) on hybrid scientific tabular and textual data deals with scientific information, and relies on complex numerical reasoning.

Question Answering

Paper
Code

Neural Networks in Adversarial Setting and Ill-Conditioned Weight Space

no code implementations • 3 Jan 2018 • Mayank Singh, Abhishek Sinha, Balaji Krishnamurthy

Recently, Neural networks have seen a huge surge in its adoption due to their ability to provide high accuracy on various tasks.

Paper
Add Code

AppTechMiner: Mining Applications and Techniques from Scientific Articles

no code implementations • 10 Sep 2017 • Mayank Singh, Soham Dan, Sanyam Agarwal, Pawan Goyal, Animesh Mukherjee

We also categorize individual research articles based on their application areas and the techniques proposed/improved in the article.

Information Retrieval Retrieval

Paper
Add Code

Which techniques does your application use?: An information extraction framework for scientific articles

no code implementations • 23 Aug 2016 • Soham Dan, Sanyam Agarwal, Mayank Singh, Pawan Goyal, Animesh Mukherjee

Every field of research consists of multiple application areas with various techniques routinely used to solve problems in these wide range of application areas.

Language Modelling

Paper
Add Code

Automated Early Leaderboard Generation From Comparative Tables

no code implementations • 13 Feb 2018 • Mayank Singh, Rajdeep Sarkar, Atharva Vyas, Pawan Goyal, Animesh Mukherjee, Soumen Chakrabarti

We propose several approaches to rank papers from these noisy 'match' outcomes.

Paper
Add Code

NLPExplorer: Exploring the Universe of NLP Papers

no code implementations • 16 Oct 2019 • Monarch Parmar, Naman jain, Pranjali Jain, P Jayakrishna Sahit, Soham Pachpande, Shruti Singh, Mayank Singh

Also, it provides temporal statistics such as yearwise popularity of topics, datasets, and seminal papers.

Paper
Add Code

Weakly-Supervised Deep Learning for Domain Invariant Sentiment Classification

no code implementations • 29 Oct 2019 • Pratik Kayal, Mayank Singh, Pawan Goyal

The task of learning a sentiment classification model that adapts well to any target domain, different from the source domain, is a challenging problem.

Classification General Classification +2

Paper
Add Code

A Method for Computing Class-wise Universal Adversarial Perturbations

no code implementations • 1 Dec 2019 • Tejus Gupta, Abhishek Sinha, Nupur Kumari, Mayank Singh, Balaji Krishnamurthy

We present an algorithm for computing class-specific universal adversarial perturbations for deep neural networks.

Paper
Add Code

PHINC: A Parallel Hinglish Social Media Code-Mixed Corpus for Machine Translation

no code implementations • EMNLP (WNUT) 2020 • Vivek Srivastava, Mayank Singh

Code-mixing is the phenomenon of using more than one language in a sentence.

Machine Translation Sentence +1

Paper
Add Code

On the Benefits of Models with Perceptually-Aligned Gradients

no code implementations • 4 May 2020 • Gunjan Aggarwal, Abhishek Sinha, Nupur Kumari, Mayank Singh

In this paper, we leverage models with interpretable perceptually-aligned features and show that adversarial training with low max-perturbation bound can improve the performance of models for zero-shot and weakly supervised localization tasks.

Paper
Add Code

Gandhipedia: A one-stop AI-enabled portal for browsing Gandhian literature, life-events and his social network

no code implementations • 5 Jun 2020 • Sayantan Adak, Atharva Vyas, Animesh Mukherjee, Heer Ambavi, Pritam Kadasi, Mayank Singh, Shivam Patel

We introduce an AI-enabled portal that presents an excellent visualization of Mahatma Gandhi's life events by constructing temporal and spatial social networks from the Gandhian literature.

Paper
Add Code

SEAL: Scientific Keyphrase Extraction and Classification

no code implementations • 5 Jun 2020 • Ayush Garg, Sammed Shantinath Kagi, Mayank Singh

Automatic scientific keyphrase extraction is a challenging problem facilitating several downstream scholarly tasks like search, recommendation, and ranking.

Classification General Classification +1

Paper
Add Code

IIT Gandhinagar at SemEval-2020 Task 9: Code-Mixed Sentiment Classification Using Candidate Sentence Generation and Selection

no code implementations • SEMEVAL 2020 • Vivek Srivastava, Mayank Singh

Code-mixing is the phenomenon of using multiple languages in the same utterance of a text or speech.

Humor Detection intent-classification +4

Paper
Add Code

Identification, Tracking and Impact: Understanding the trade secret of catchphrases

no code implementations • 20 Jul 2020 • Jagriti Jalal, Mayank Singh, Arindam Pal, Lipika Dey, Animesh Mukherjee

Understanding the topical evolution in industrial innovation is a challenging problem.

Paper
Add Code

LT-GAN: Self-Supervised GAN with Latent Transformation Detection

no code implementations • 19 Oct 2020 • Parth Patel, Nupur Kumari, Mayank Singh, Balaji Krishnamurthy

We propose a self-supervised approach (LT-GAN) to improve the generation quality and diversity of images by estimating the GAN-induced transformation (i. e. transformation induced in the generated images by perturbing the latent space of generator).

Ranked #4 on Image Generation on CelebA-HQ 128x128

Image Generation

Paper
Add Code

CovidExplorer: A Multi-faceted AI-based Search and Visualization Engine for COVID-19 Information

no code implementations • 30 Nov 2020 • Heer Ambavi, Kavita Vaishnaw, Udit Vyas, Abhisht Tiwari, Mayank Singh

The entire world is engulfed in the fight against the COVID-19 pandemic, leading to a significant surge in research experiments, government policies, and social media discussions.

Data Visualization

Paper
Add Code

Data InStance Prior (DISP) in Generative Adversarial Networks

no code implementations • 8 Dec 2020 • Puneet Mangla, Nupur Kumari, Mayank Singh, Balaji Krishnamurthy, Vineeth N Balasubramanian

Previous works have addressed training in low data setting by leveraging transfer learning and data augmentation techniques.

Data Augmentation Image Generation +2

Paper
Add Code

TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables

no code implementations • 12 May 2021 • Harsh Desai, Pratik Kayal, Mayank Singh

Information Extraction (IE) from the tables present in scientific articles is challenging due to complicated tabular representations and complex embedded text.

Table Extraction

Paper
Add Code

SentEmojiBot: Empathising Conversations Generation with Emojis

no code implementations • 26 May 2021 • Akhilesh Ravi, Amit Yadav, Jainish Chauhan, Jatin Dholakia, Naman jain, Mayank Singh

The increasing use of dialogue agents makes it extremely desirable for them to understand and acknowledge the implied emotions to respond like humans with empathy.

Paper
Add Code

ICDAR 2021 Competition on Scientific Table Image Recognition to LaTeX

no code implementations • 30 May 2021 • Pratik Kayal, Mrinal Anand, Harsh Desai, Mayank Singh

This paper discusses the dataset, tasks, participants' methods, and results of the ICDAR 2021 Competition on Scientific Table Image Recognition to LaTeX.

Table Recognition

Paper
Add Code

Challenges and Considerations with Code-Mixed NLP for Multilingual Societies

no code implementations • 15 Jun 2021 • Vivek Srivastava, Mayank Singh

Multilingualism refers to the high degree of proficiency in two or more languages in the written and oral communication modes.

Management Multilingual NLP

Paper
Add Code

Challenges and Limitations with the Metrics Measuring the Complexity of Code-Mixed Text

no code implementations • NAACL (CALCS) 2021 • Vivek Srivastava, Mayank Singh

Code-mixing is a frequent communication style among multilingual speakers where they mix words and phrases from two different languages in the same utterance of text or speech.

Paper
Add Code

TweeNLP: A Twitter Exploration Portal for Natural Language Processing

no code implementations • ACL 2021 • Viraj Shah, Shruti Singh, Mayank Singh

It supports multiple features such as TweetExplorer to explore tweets by topics, visualize insights from Twitter activity throughout the organization cycle of conferences, discover popular research papers and researchers.

Paper
Add Code

On Adversarial Robustness of Synthetic Code Generation

no code implementations • 22 Jun 2021 • Mrinal Anand, Pratik Kayal, Mayank Singh

In this paper, we specifically experiment with \textsc{AlgoLisp} DSL-based generative models and showcase the existence of significant dataset bias through different classes of adversarial examples.

Adversarial Robustness Code Generation

Paper
Add Code

HinGE: A Dataset for Generation and Evaluation of Code-Mixed Hinglish Text

no code implementations • EMNLP (Eval4NLP) 2021 • Vivek Srivastava, Mayank Singh

Text generation is a highly active area of research in the computational linguistic community.

Text Generation

Paper
Add Code

MIPE: A Metric Independent Pipeline for Effective Code-Mixed NLG Evaluation

no code implementations • EMNLP (Eval4NLP) 2021 • Ayush Garg, Sammed S Kagi, Vivek Srivastava, Mayank Singh

Code-mixing is a phenomenon of mixing words and phrases from two or more languages in a single utterance of speech and text.

nlg evaluation Text Generation

Paper
Add Code

Quality Evaluation of the Low-Resource Synthetically Generated Code-Mixed Hinglish Text

no code implementations • INLG (ACL) 2021 • Vivek Srivastava, Mayank Singh

In this shared task, we seek the participating teams to investigate the factors influencing the quality of the code-mixed text generation systems.

Text Generation

Paper
Add Code

Data Instance Prior for Transfer Learning in GANs

no code implementations • 28 Sep 2020 • Puneet Mangla, Nupur Kumari, Mayank Singh, Vineeth N. Balasubramanian, Balaji Krishnamurthy

Recent advances in generative adversarial networks (GANs) have shown remarkable progress in generating high-quality images.

Data Augmentation Image Generation +2

Paper
Add Code

AutoCoder: Leveraging Transformers for Automatic Code Synthesis

no code implementations • NeurIPS Workshop AIPLANS 2021 • Mrinal Anand, Pratik Kayal, Mayank Singh

Program synthesis from natural language descriptions is a challenging task.

Program Synthesis

Paper
Add Code

Adversarial Robustness of Program Synthesis Models

no code implementations • NeurIPS Workshop AIPLANS 2021 • Mrinal Anand, Pratik Kayal, Mayank Singh

The resurgence of automatic program synthesis has been observed with the rise of deep learning.

Adversarial Robustness Program Synthesis

Paper
Add Code

The Bull and the Bear: Summarizing Stock Market Discussions

no code implementations • LREC 2022 • Ayush Kumar, Dhyey Jani, Jay Shah, Devanshu Thakar, Varun Jain, Mayank Singh

We plan to make the dataset publicly available.

Abstractive Text Summarization

Paper
Add Code

Tables to LaTeX: structure and content extraction from scientific tables

no code implementations • 31 Oct 2022 • Pratik Kayal, Mrinal Anand, Harsh Desai, Mayank Singh

In this paper, we adapt the transformer-based language modeling paradigm for scientific table structure and content extraction.

Language Modelling

Paper
Add Code

MUTANT: A Multi-sentential Code-mixed Hinglish Dataset

no code implementations • 23 Feb 2023 • Rahul Gupta, Vivek Srivastava, Mayank Singh

As a use case, we leverage multilingual articles from two different data sources and build a first-of-its-kind multi-sentential code-mixed Hinglish dataset i. e., MUTANT.

Paper
Add Code

MMT: A Multilingual and Multi-Topic Indian Social Media Dataset

no code implementations • 2 Apr 2023 • Dwip Dalal, Vivek Srivastava, Mayank Singh

Social media plays a significant role in cross-cultural communication.

Language Identification named-entity-recognition +1

Paper
Add Code

Analogy-Forming Transformers for Few-Shot 3D Parsing

no code implementations • 27 Apr 2023 • Nikolaos Gkanatsios, Mayank Singh, Zhaoyuan Fang, Shubham Tulsiani, Katerina Fragkiadaki

We present Analogical Networks, a model that encodes domain knowledge explicitly, in a collection of structured labelled 3D scenes, in addition to implicitly, as model parameters, and segments 3D object scenes with analogical reasoning: instead of mapping a scene to part segments directly, our model first retrieves related scenes from memory and their corresponding part structures, and then predicts analogous part structures for the input scene, via an end-to-end learnable modulation mechanism.

Few-Shot Learning

Paper
Add Code

Metric@CustomerN: Evaluating Metrics at a Customer Level in E-Commerce

no code implementations • 31 Jul 2023 • Mayank Singh, Emily Ray, Marc Ferradou, Andrea Barraza-Urbina

Accuracy measures such as Recall, Precision, and Hit Rate have been a standard way of evaluating Recommendation Systems.

Recommendation Systems

Paper
Add Code

Modeling interdisciplinary interactions among Physics, Mathematics & Computer Science

no code implementations • 19 Sep 2023 • Rima Hazra, Mayank Singh, Pawan Goyal, Bibhas Adhikari, Animesh Mukherjee

Interdisciplinarity has over the recent years have gained tremendous importance and has become one of the key ways of doing cutting edge research.

Paper
Add Code

Unlocking Model Insights: A Dataset for Automated Model Card Generation

no code implementations • 22 Sep 2023 • Shruti Singh, Hitesh Lodwal, Husain Malwat, Rakesh Thakur, Mayank Singh

To automate model card generation, we introduce a dataset of 500 question-answer pairs for 25 ML models that cover crucial aspects of the model, such as its training configurations, datasets, biases, architecture details, and training resources.

Paper
Add Code

Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance

no code implementations • 23 Oct 2023 • Pritam Kadasi, Mayank Singh

The NLP community has long advocated for the construction of multi-annotator datasets to better capture the nuances of language interpretation, subjectivity, and ambiguity.

Paper
Add Code

PythonSaga: Redefining the Benchmark to Evaluate Code Generating LLM

no code implementations • 8 Jan 2024 • Ankit Yadav, Mayank Singh

Driven by the surge in code generation using large language models (LLMs), numerous benchmarks have emerged to evaluate these LLMs capabilities.

Code Generation

Paper
Add Code

LEGOBench: Scientific Leaderboard Generation Benchmark

1 code implementation • 11 Jan 2024 • Shruti Singh, Shoaib Alam, Husain Malwat, Mayank Singh

We present four graph-based and two language model-based leaderboard generation task configurations.

Language Modelling

Paper
Code

Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

no code implementations • 19 Feb 2024 • Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh

Large Language Models (LLMs) are increasingly becoming ubiquitous, yet their ability to reason about and retain temporal information remains limited.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.