Search Results for author: Mayank Singh

Found 53 papers, 13 papers with code

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

6 code implementations9 Nov 2022 BigScience Workshop, :, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, Iz Beltagy, Huu Nguyen, Lucile Saulnier, Samson Tan, Pedro Ortiz Suarez, Victor Sanh, Hugo Laurençon, Yacine Jernite, Julien Launay, Margaret Mitchell, Colin Raffel, Aaron Gokaslan, Adi Simhi, Aitor Soroa, Alham Fikri Aji, Amit Alfassy, Anna Rogers, Ariel Kreisberg Nitzav, Canwen Xu, Chenghao Mou, Chris Emezue, Christopher Klamm, Colin Leong, Daniel van Strien, David Ifeoluwa Adelani, Dragomir Radev, Eduardo González Ponferrada, Efrat Levkovizh, Ethan Kim, Eyal Bar Natan, Francesco De Toni, Gérard Dupont, Germán Kruszewski, Giada Pistilli, Hady Elsahar, Hamza Benyamina, Hieu Tran, Ian Yu, Idris Abdulmumin, Isaac Johnson, Itziar Gonzalez-Dios, Javier de la Rosa, Jenny Chim, Jesse Dodge, Jian Zhu, Jonathan Chang, Jörg Frohberg, Joseph Tobing, Joydeep Bhattacharjee, Khalid Almubarak, Kimbo Chen, Kyle Lo, Leandro von Werra, Leon Weber, Long Phan, Loubna Ben allal, Ludovic Tanguy, Manan Dey, Manuel Romero Muñoz, Maraim Masoud, María Grandury, Mario Šaško, Max Huang, Maximin Coavoux, Mayank Singh, Mike Tian-Jian Jiang, Minh Chien Vu, Mohammad A. Jauhar, Mustafa Ghaleb, Nishant Subramani, Nora Kassner, Nurulaqilla Khamis, Olivier Nguyen, Omar Espejel, Ona de Gibert, Paulo Villegas, Peter Henderson, Pierre Colombo, Priscilla Amuok, Quentin Lhoest, Rheza Harliman, Rishi Bommasani, Roberto Luis López, Rui Ribeiro, Salomey Osei, Sampo Pyysalo, Sebastian Nagel, Shamik Bose, Shamsuddeen Hassan Muhammad, Shanya Sharma, Shayne Longpre, Somaieh Nikpoor, Stanislav Silberberg, Suhas Pai, Sydney Zink, Tiago Timponi Torrent, Timo Schick, Tristan Thrush, Valentin Danchev, Vassilina Nikoulina, Veronika Laippala, Violette Lepercq, Vrinda Prabhu, Zaid Alyafeai, Zeerak Talat, Arun Raja, Benjamin Heinzerling, Chenglei Si, Davut Emre Taşar, Elizabeth Salesky, Sabrina J. Mielke, Wilson Y. Lee, Abheesht Sharma, Andrea Santilli, Antoine Chaffin, Arnaud Stiegler, Debajyoti Datta, Eliza Szczechla, Gunjan Chhablani, Han Wang, Harshit Pandey, Hendrik Strobelt, Jason Alan Fries, Jos Rozen, Leo Gao, Lintang Sutawika, M Saiful Bari, Maged S. Al-shaibani, Matteo Manica, Nihal Nayak, Ryan Teehan, Samuel Albanie, Sheng Shen, Srulik Ben-David, Stephen H. Bach, Taewoon Kim, Tali Bers, Thibault Fevry, Trishala Neeraj, Urmish Thakker, Vikas Raunak, Xiangru Tang, Zheng-Xin Yong, Zhiqing Sun, Shaked Brody, Yallow Uri, Hadar Tojarieh, Adam Roberts, Hyung Won Chung, Jaesung Tae, Jason Phang, Ofir Press, Conglong Li, Deepak Narayanan, Hatim Bourfoune, Jared Casper, Jeff Rasley, Max Ryabinin, Mayank Mishra, Minjia Zhang, Mohammad Shoeybi, Myriam Peyrounette, Nicolas Patry, Nouamane Tazi, Omar Sanseviero, Patrick von Platen, Pierre Cornette, Pierre François Lavallée, Rémi Lacroix, Samyam Rajbhandari, Sanchit Gandhi, Shaden Smith, Stéphane Requena, Suraj Patil, Tim Dettmers, Ahmed Baruwa, Amanpreet Singh, Anastasia Cheveleva, Anne-Laure Ligozat, Arjun Subramonian, Aurélie Névéol, Charles Lovering, Dan Garrette, Deepak Tunuguntla, Ehud Reiter, Ekaterina Taktasheva, Ekaterina Voloshina, Eli Bogdanov, Genta Indra Winata, Hailey Schoelkopf, Jan-Christoph Kalo, Jekaterina Novikova, Jessica Zosa Forde, Jordan Clive, Jungo Kasai, Ken Kawamura, Liam Hazan, Marine Carpuat, Miruna Clinciu, Najoung Kim, Newton Cheng, Oleg Serikov, Omer Antverg, Oskar van der Wal, Rui Zhang, Ruochen Zhang, Sebastian Gehrmann, Shachar Mirkin, Shani Pais, Tatiana Shavrina, Thomas Scialom, Tian Yun, Tomasz Limisiewicz, Verena Rieser, Vitaly Protasov, Vladislav Mikhailov, Yada Pruksachatkun, Yonatan Belinkov, Zachary Bamberger, Zdeněk Kasner, Alice Rueda, Amanda Pestana, Amir Feizpour, Ammar Khan, Amy Faranak, Ana Santos, Anthony Hevia, Antigona Unldreaj, Arash Aghagol, Arezoo Abdollahi, Aycha Tammour, Azadeh HajiHosseini, Bahareh Behroozi, Benjamin Ajibade, Bharat Saxena, Carlos Muñoz Ferrandis, Daniel McDuff, Danish Contractor, David Lansky, Davis David, Douwe Kiela, Duong A. Nguyen, Edward Tan, Emi Baylor, Ezinwanne Ozoani, Fatima Mirza, Frankline Ononiwu, Habib Rezanejad, Hessie Jones, Indrani Bhattacharya, Irene Solaiman, Irina Sedenko, Isar Nejadgholi, Jesse Passmore, Josh Seltzer, Julio Bonis Sanz, Livia Dutra, Mairon Samagaio, Maraim Elbadri, Margot Mieskes, Marissa Gerchick, Martha Akinlolu, Michael McKenna, Mike Qiu, Muhammed Ghauri, Mykola Burynok, Nafis Abrar, Nazneen Rajani, Nour Elkott, Nour Fahmy, Olanrewaju Samuel, Ran An, Rasmus Kromann, Ryan Hao, Samira Alizadeh, Sarmad Shubber, Silas Wang, Sourav Roy, Sylvain Viguier, Thanh Le, Tobi Oyebade, Trieu Le, Yoyo Yang, Zach Nguyen, Abhinav Ramesh Kashyap, Alfredo Palasciano, Alison Callahan, Anima Shukla, Antonio Miranda-Escalada, Ayush Singh, Benjamin Beilharz, Bo wang, Caio Brito, Chenxi Zhou, Chirag Jain, Chuxin Xu, Clémentine Fourrier, Daniel León Periñán, Daniel Molano, Dian Yu, Enrique Manjavacas, Fabio Barth, Florian Fuhrimann, Gabriel Altay, Giyaseddin Bayrak, Gully Burns, Helena U. Vrabec, Imane Bello, Ishani Dash, Jihyun Kang, John Giorgi, Jonas Golde, Jose David Posada, Karthik Rangasai Sivaraman, Lokesh Bulchandani, Lu Liu, Luisa Shinzato, Madeleine Hahn de Bykhovetz, Maiko Takeuchi, Marc Pàmies, Maria A Castillo, Marianna Nezhurina, Mario Sänger, Matthias Samwald, Michael Cullan, Michael Weinberg, Michiel De Wolf, Mina Mihaljcic, Minna Liu, Moritz Freidank, Myungsun Kang, Natasha Seelam, Nathan Dahlberg, Nicholas Michio Broad, Nikolaus Muellner, Pascale Fung, Patrick Haller, Ramya Chandrasekhar, Renata Eisenberg, Robert Martin, Rodrigo Canalli, Rosaline Su, Ruisi Su, Samuel Cahyawijaya, Samuele Garda, Shlok S Deshmukh, Shubhanshu Mishra, Sid Kiblawi, Simon Ott, Sinee Sang-aroonsiri, Srishti Kumar, Stefan Schweter, Sushil Bharati, Tanmay Laud, Théo Gigant, Tomoya Kainuma, Wojciech Kusa, Yanis Labrak, Yash Shailesh Bajaj, Yash Venkatraman, Yifan Xu, Yingxin Xu, Yu Xu, Zhe Tan, Zhongli Xie, Zifan Ye, Mathilde Bras, Younes Belkada, Thomas Wolf

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions.

Language Modelling Multilingual NLP

Augmented Convolutional LSTMs for Generation of High-Resolution Climate Change Projections

1 code implementation23 Sep 2020 Nidhin Harilal, Udit Bhatia, Mayank Singh

Projection of changes in extreme indices of climate variables such as temperature and precipitation are critical to assess the potential impacts of climate change on human-made and natural systems, including critical infrastructures and ecosystems.

Super-Resolution Vocal Bursts Intensity Prediction

Harnessing the Vulnerability of Latent Layers in Adversarially Trained Models

1 code implementation13 May 2019 Mayank Singh, Abhishek Sinha, Nupur Kumari, Harshitha Machiraju, Balaji Krishnamurthy, Vineeth N. Balasubramanian

We analyze the adversarially trained robust models to study their vulnerability against adversarial attacks at the level of the latent layers.

Adversarial Attack

IIT Gandhinagar at SemEval-2019 Task 3: Contextual Emotion Detection Using Deep Learning

1 code implementation SEMEVAL 2019 Arik Pamnani, Rajat Goel, Jayesh Choudhari, Mayank Singh

Recent advancements in Internet and Mobile infrastructure have resulted in the development of faster and efficient platforms of communication.

Chatbot

Bollyrics: Automatic Lyrics Generator for Romanised Hindi

1 code implementation25 Jul 2020 Naman Jain, Ankush Chauhan, Atharva Chewale, Ojas Mithbavkar, Ujjaval Shah, Mayank Singh

Song lyrics convey a meaningful story in a creative manner with complex rhythmic patterns.

COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer Reviews

1 code implementation9 Aug 2021 Shruti Singh, Mayank Singh, Pawan Goyal

We present COMPARE, a taxonomy and a dataset of comparison discussions in peer reviews of research papers in the domain of experimental deep learning.

Cross-lingual Editing in Multilingual Language Models

1 code implementation19 Jan 2024 Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh

The training of large language models (LLMs) necessitates substantial data and computational resources, and updating outdated LLMs entails significant efforts and resources.

Model Editing

How Robust are the Tabular QA Models for Scientific Tables? A Study using Customized Dataset

1 code implementation30 Mar 2024 Akash Ghosh, B Venkata Sahith, Niloy Ganguly, Pawan Goyal, Mayank Singh

Question-answering (QA) on hybrid scientific tabular and textual data deals with scientific information, and relies on complex numerical reasoning.

Question Answering

Neural Networks in Adversarial Setting and Ill-Conditioned Weight Space

no code implementations3 Jan 2018 Mayank Singh, Abhishek Sinha, Balaji Krishnamurthy

Recently, Neural networks have seen a huge surge in its adoption due to their ability to provide high accuracy on various tasks.

AppTechMiner: Mining Applications and Techniques from Scientific Articles

no code implementations10 Sep 2017 Mayank Singh, Soham Dan, Sanyam Agarwal, Pawan Goyal, Animesh Mukherjee

We also categorize individual research articles based on their application areas and the techniques proposed/improved in the article.

Information Retrieval Retrieval

Which techniques does your application use?: An information extraction framework for scientific articles

no code implementations23 Aug 2016 Soham Dan, Sanyam Agarwal, Mayank Singh, Pawan Goyal, Animesh Mukherjee

Every field of research consists of multiple application areas with various techniques routinely used to solve problems in these wide range of application areas.

Language Modelling

NLPExplorer: Exploring the Universe of NLP Papers

no code implementations16 Oct 2019 Monarch Parmar, Naman jain, Pranjali Jain, P Jayakrishna Sahit, Soham Pachpande, Shruti Singh, Mayank Singh

Also, it provides temporal statistics such as yearwise popularity of topics, datasets, and seminal papers.

Weakly-Supervised Deep Learning for Domain Invariant Sentiment Classification

no code implementations29 Oct 2019 Pratik Kayal, Mayank Singh, Pawan Goyal

The task of learning a sentiment classification model that adapts well to any target domain, different from the source domain, is a challenging problem.

Classification General Classification +2

A Method for Computing Class-wise Universal Adversarial Perturbations

no code implementations1 Dec 2019 Tejus Gupta, Abhishek Sinha, Nupur Kumari, Mayank Singh, Balaji Krishnamurthy

We present an algorithm for computing class-specific universal adversarial perturbations for deep neural networks.

On the Benefits of Models with Perceptually-Aligned Gradients

no code implementations4 May 2020 Gunjan Aggarwal, Abhishek Sinha, Nupur Kumari, Mayank Singh

In this paper, we leverage models with interpretable perceptually-aligned features and show that adversarial training with low max-perturbation bound can improve the performance of models for zero-shot and weakly supervised localization tasks.

Gandhipedia: A one-stop AI-enabled portal for browsing Gandhian literature, life-events and his social network

no code implementations5 Jun 2020 Sayantan Adak, Atharva Vyas, Animesh Mukherjee, Heer Ambavi, Pritam Kadasi, Mayank Singh, Shivam Patel

We introduce an AI-enabled portal that presents an excellent visualization of Mahatma Gandhi's life events by constructing temporal and spatial social networks from the Gandhian literature.

SEAL: Scientific Keyphrase Extraction and Classification

no code implementations5 Jun 2020 Ayush Garg, Sammed Shantinath Kagi, Mayank Singh

Automatic scientific keyphrase extraction is a challenging problem facilitating several downstream scholarly tasks like search, recommendation, and ranking.

Classification General Classification +1

LT-GAN: Self-Supervised GAN with Latent Transformation Detection

no code implementations19 Oct 2020 Parth Patel, Nupur Kumari, Mayank Singh, Balaji Krishnamurthy

We propose a self-supervised approach (LT-GAN) to improve the generation quality and diversity of images by estimating the GAN-induced transformation (i. e. transformation induced in the generated images by perturbing the latent space of generator).

Image Generation

CovidExplorer: A Multi-faceted AI-based Search and Visualization Engine for COVID-19 Information

no code implementations30 Nov 2020 Heer Ambavi, Kavita Vaishnaw, Udit Vyas, Abhisht Tiwari, Mayank Singh

The entire world is engulfed in the fight against the COVID-19 pandemic, leading to a significant surge in research experiments, government policies, and social media discussions.

Data Visualization

Data InStance Prior (DISP) in Generative Adversarial Networks

no code implementations8 Dec 2020 Puneet Mangla, Nupur Kumari, Mayank Singh, Balaji Krishnamurthy, Vineeth N Balasubramanian

Previous works have addressed training in low data setting by leveraging transfer learning and data augmentation techniques.

Data Augmentation Image Generation +2

TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables

no code implementations12 May 2021 Harsh Desai, Pratik Kayal, Mayank Singh

Information Extraction (IE) from the tables present in scientific articles is challenging due to complicated tabular representations and complex embedded text.

Table Extraction

SentEmojiBot: Empathising Conversations Generation with Emojis

no code implementations26 May 2021 Akhilesh Ravi, Amit Yadav, Jainish Chauhan, Jatin Dholakia, Naman jain, Mayank Singh

The increasing use of dialogue agents makes it extremely desirable for them to understand and acknowledge the implied emotions to respond like humans with empathy.

ICDAR 2021 Competition on Scientific Table Image Recognition to LaTeX

no code implementations30 May 2021 Pratik Kayal, Mrinal Anand, Harsh Desai, Mayank Singh

This paper discusses the dataset, tasks, participants' methods, and results of the ICDAR 2021 Competition on Scientific Table Image Recognition to LaTeX.

Table Recognition

Challenges and Considerations with Code-Mixed NLP for Multilingual Societies

no code implementations15 Jun 2021 Vivek Srivastava, Mayank Singh

Multilingualism refers to the high degree of proficiency in two or more languages in the written and oral communication modes.

Management Multilingual NLP

Challenges and Limitations with the Metrics Measuring the Complexity of Code-Mixed Text

no code implementations NAACL (CALCS) 2021 Vivek Srivastava, Mayank Singh

Code-mixing is a frequent communication style among multilingual speakers where they mix words and phrases from two different languages in the same utterance of text or speech.

TweeNLP: A Twitter Exploration Portal for Natural Language Processing

no code implementations ACL 2021 Viraj Shah, Shruti Singh, Mayank Singh

It supports multiple features such as TweetExplorer to explore tweets by topics, visualize insights from Twitter activity throughout the organization cycle of conferences, discover popular research papers and researchers.

On Adversarial Robustness of Synthetic Code Generation

no code implementations22 Jun 2021 Mrinal Anand, Pratik Kayal, Mayank Singh

In this paper, we specifically experiment with \textsc{AlgoLisp} DSL-based generative models and showcase the existence of significant dataset bias through different classes of adversarial examples.

Adversarial Robustness Code Generation

Quality Evaluation of the Low-Resource Synthetically Generated Code-Mixed Hinglish Text

no code implementations INLG (ACL) 2021 Vivek Srivastava, Mayank Singh

In this shared task, we seek the participating teams to investigate the factors influencing the quality of the code-mixed text generation systems.

Text Generation

Data Instance Prior for Transfer Learning in GANs

no code implementations28 Sep 2020 Puneet Mangla, Nupur Kumari, Mayank Singh, Vineeth N. Balasubramanian, Balaji Krishnamurthy

Recent advances in generative adversarial networks (GANs) have shown remarkable progress in generating high-quality images.

Data Augmentation Image Generation +2

Tables to LaTeX: structure and content extraction from scientific tables

no code implementations31 Oct 2022 Pratik Kayal, Mrinal Anand, Harsh Desai, Mayank Singh

In this paper, we adapt the transformer-based language modeling paradigm for scientific table structure and content extraction.

Language Modelling

MUTANT: A Multi-sentential Code-mixed Hinglish Dataset

no code implementations23 Feb 2023 Rahul Gupta, Vivek Srivastava, Mayank Singh

As a use case, we leverage multilingual articles from two different data sources and build a first-of-its-kind multi-sentential code-mixed Hinglish dataset i. e., MUTANT.

Analogy-Forming Transformers for Few-Shot 3D Parsing

no code implementations27 Apr 2023 Nikolaos Gkanatsios, Mayank Singh, Zhaoyuan Fang, Shubham Tulsiani, Katerina Fragkiadaki

We present Analogical Networks, a model that encodes domain knowledge explicitly, in a collection of structured labelled 3D scenes, in addition to implicitly, as model parameters, and segments 3D object scenes with analogical reasoning: instead of mapping a scene to part segments directly, our model first retrieves related scenes from memory and their corresponding part structures, and then predicts analogous part structures for the input scene, via an end-to-end learnable modulation mechanism.

Few-Shot Learning

Metric@CustomerN: Evaluating Metrics at a Customer Level in E-Commerce

no code implementations31 Jul 2023 Mayank Singh, Emily Ray, Marc Ferradou, Andrea Barraza-Urbina

Accuracy measures such as Recall, Precision, and Hit Rate have been a standard way of evaluating Recommendation Systems.

Recommendation Systems

Modeling interdisciplinary interactions among Physics, Mathematics & Computer Science

no code implementations19 Sep 2023 Rima Hazra, Mayank Singh, Pawan Goyal, Bibhas Adhikari, Animesh Mukherjee

Interdisciplinarity has over the recent years have gained tremendous importance and has become one of the key ways of doing cutting edge research.

Unlocking Model Insights: A Dataset for Automated Model Card Generation

no code implementations22 Sep 2023 Shruti Singh, Hitesh Lodwal, Husain Malwat, Rakesh Thakur, Mayank Singh

To automate model card generation, we introduce a dataset of 500 question-answer pairs for 25 ML models that cover crucial aspects of the model, such as its training configurations, datasets, biases, architecture details, and training resources.

Unveiling the Multi-Annotation Process: Examining the Influence of Annotation Quantity and Instance Difficulty on Model Performance

no code implementations23 Oct 2023 Pritam Kadasi, Mayank Singh

The NLP community has long advocated for the construction of multi-annotator datasets to better capture the nuances of language interpretation, subjectivity, and ambiguity.

PythonSaga: Redefining the Benchmark to Evaluate Code Generating LLM

no code implementations8 Jan 2024 Ankit Yadav, Mayank Singh

Driven by the surge in code generation using large language models (LLMs), numerous benchmarks have emerged to evaluate these LLMs capabilities.

Code Generation

LEGOBench: Scientific Leaderboard Generation Benchmark

1 code implementation11 Jan 2024 Shruti Singh, Shoaib Alam, Husain Malwat, Mayank Singh

We present four graph-based and two language model-based leaderboard generation task configurations.

Language Modelling

Remember This Event That Year? Assessing Temporal Information and Reasoning in Large Language Models

no code implementations19 Feb 2024 Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh

Large Language Models (LLMs) are increasingly becoming ubiquitous, yet their ability to reason about and retain temporal information remains limited.

Cannot find the paper you are looking for? You can Submit a new open access paper.