no code implementations • EACL (AdaptNLP) 2021 • Sebastin Santy, Anirudh Srinivasan, Monojit Choudhury
Models such as mBERT and XLMR have shown success in solving Code-Mixed NLP tasks even though they were not exposed to such text during pretraining.
no code implementations • 27 May 2024 • Thao Nguyen, Matthew Wallingford, Sebastin Santy, Wei-Chiu Ma, Sewoong Oh, Ludwig Schmidt, Pang Wei Koh, Ranjay Krishna
By translating all multilingual image-text pairs from a raw web crawl to English and re-filtering them, we increase the prevalence of (translated) multilingual data in the resulting training set.
no code implementations • 10 May 2024 • Rock Yuren Pang, Sebastin Santy, René Just, Katharina Reinecke
Digital technologies have positively transformed society, but they have also led to undesirable consequences not anticipated at the time of design or development.
1 code implementation • 16 Apr 2024 • Huihan Li, Liwei Jiang, Jena D. Hwang, Hyunwoo Kim, Sebastin Santy, Taylor Sorensen, Bill Yuchen Lin, Nouha Dziri, Xiang Ren, Yejin Choi
As the utilization of large language models (LLMs) has proliferated world-wide, it is crucial for them to have adequate knowledge and fair representation for diverse global cultures.
no code implementations • 22 Oct 2023 • Andre Ye, Sebastin Santy, Jena D. Hwang, Amy X. Zhang, Ranjay Krishna
Computer vision often treats human perception as homogeneous: an implicit assumption that visual stimuli are perceived similarly by everyone.
1 code implementation • 2 Jun 2023 • Sebastin Santy, Jenny T. Liang, Ronan Le Bras, Katharina Reinecke, Maarten Sap
We introduce NLPositionality, a framework for characterizing design biases and quantifying the positionality of NLP datasets and models.
no code implementations • 18 Dec 2022 • Sebastin Santy, Prasanta Bhattacharya, Rishabh Mehrotra
With the steady emergence of community question answering (CQA) platforms like Quora, StackExchange, and WikiHow, users now have unprecedented access to information on various kinds of queries and tasks.
1 code implementation • 29 Nov 2022 • Devansh Mehta, Harshita Diddee, Ananya Saxena, Anurag Shukla, Sebastin Santy, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava, Alok Sharma, Vishnu Prasad, Venkanna U, Kalika Bali
The primary obstacle to developing technologies for low-resource languages is the lack of representative, usable data.
no code implementations • 11 Jun 2021 • Sebastin Santy, Prasanta Bhattacharya
Recent advances in AI and ML applications have benefited from rapid progress in NLP research.
no code implementations • Findings (ACL) 2021 • Sebastin Santy, Anku Rani, Monojit Choudhury
Ethical aspects of research in language technologies have received much attention recently.
no code implementations • LREC 2020 • Devansh Mehta, Sebastin Santy, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava, Alok Sharma, Anurag Shukla, Vishnu Prasad, Venkanna U, Amit Sharma, Kalika Bali
The primary obstacle to developing technologies for low-resource languages is the lack of usable data.
1 code implementation • ACL 2020 • Pratik Joshi, Sebastin Santy, Amar Budhiraja, Kalika Bali, Monojit Choudhury
Language technologies contribute to promoting multilingualism and linguistic diversity around the world.
no code implementations • ICON 2019 • Pratik Joshi, Christain Barnes, Sebastin Santy, Simran Khanuja, Sanket Shah, Anirudh Srinivasan, Satwik Bhattamishra, Sunayana Sitaram, Monojit Choudhury, Kalika Bali
In this paper, we examine and analyze the challenges associated with developing and introducing language technologies to low-resource language communities.
no code implementations • WS 2019 • Sanket Shah, Pratik Joshi, Sebastin Santy, Sunayana Sitaram
Code-switching refers to the alternation of two or more languages in a conversation or utterance and is common in multilingual communities across the world.
1 code implementation • IJCNLP 2019 • Sebastin Santy, Sandipan Dandapat, Monojit Choudhury, Kalika Bali
In this paper, we demonstrate an Interactive Machine Translation interface that assists human translators with on-the-fly hints and suggestions.
no code implementations • 28 Nov 2018 • Sebastin Santy, Wazeer Zulfikar, Rishabh Mehrotra, Emine Yilmaz
We consider the problem of understanding real world tasks depicted in visual images.
1 code implementation • The Journal of Open Source Software 2018 • Lyndon White, Sebastin Santy
DataDepsGenerators.jl is a tool written to help users of the Julia programming language (Bezanson, Edelman, Karpinski, & Shah, 2017) observe best practices when making use of published datasets.
no code implementations • 14 Sep 2018 • Wazeer Zulfikar, Sebastin Santy, Sahith Dambekodi, Tirtharaj Dash
Specifically, the present work is a comprehensive study on the implementation of an auto-encoder based Boundary Equilibrium GAN (BEGAN) to generate frontal faces using an interpolation of a side view face and its mirrored view.