Search Results for author: Sebastin Santy

Found 18 papers, 6 papers with code

BERTologiCoMix: How does Code-Mixing interact with Multilingual BERT?

no code implementations EACL (AdaptNLP) 2021 Sebastin Santy, Anirudh Srinivasan, Monojit Choudhury

Models such as mBERT and XLMR have shown success in solving Code-Mixed NLP tasks even though they were not exposed to such text during pretraining.

Multilingual Diversity Improves Vision-Language Representations

no code implementations27 May 2024 Thao Nguyen, Matthew Wallingford, Sebastin Santy, Wei-Chiu Ma, Sewoong Oh, Ludwig Schmidt, Pang Wei Koh, Ranjay Krishna

By translating all multilingual image-text pairs from a raw web crawl to English and re-filtering them, we increase the prevalence of (translated) multilingual data in the resulting training set.

Diversity Text Retrieval

BLIP: Facilitating the Exploration of Undesirable Consequences of Digital Technologies

no code implementations10 May 2024 Rock Yuren Pang, Sebastin Santy, René Just, Katharina Reinecke

Digital technologies have positively transformed society, but they have also led to undesirable consequences not anticipated at the time of design or development.

Diversity

CULTURE-GEN: Revealing Global Cultural Perception in Language Models through Natural Language Prompting

1 code implementation16 Apr 2024 Huihan Li, Liwei Jiang, Jena D. Hwang, Hyunwoo Kim, Sebastin Santy, Taylor Sorensen, Bill Yuchen Lin, Nouha Dziri, Xiang Ren, Yejin Choi

As the utilization of large language models (LLMs) has proliferated world-wide, it is crucial for them to have adequate knowledge and fair representation for diverse global cultures.

Diversity Fairness

Computer Vision Datasets and Models Exhibit Cultural and Linguistic Diversity in Perception

no code implementations22 Oct 2023 Andre Ye, Sebastin Santy, Jena D. Hwang, Amy X. Zhang, Ranjay Krishna

Computer vision often treats human perception as homogeneous: an implicit assumption that visual stimuli are perceived similarly by everyone.

Diversity Graph Embedding

NLPositionality: Characterizing Design Biases of Datasets and Models

1 code implementation2 Jun 2023 Sebastin Santy, Jenny T. Liang, Ronan Le Bras, Katharina Reinecke, Maarten Sap

We introduce NLPositionality, a framework for characterizing design biases and quantifying the positionality of NLP datasets and models.

Hate Speech Detection

Task Preferences across Languages on Community Question Answering Platforms

no code implementations18 Dec 2022 Sebastin Santy, Prasanta Bhattacharya, Rishabh Mehrotra

With the steady emergence of community question answering (CQA) platforms like Quora, StackExchange, and WikiHow, users now have an unprecedented access to information on various kind of queries and tasks.

Community Question Answering

CoSSAT: Code-Switched Speech Annotation Tool

no code implementations WS 2019 Sanket Shah, Pratik Joshi, Sebastin Santy, Sunayana Sitaram

Code-switching refers to the alternation of two or more languages in a conversation or utterance and is common in multilingual communities across the world.

INMT: Interactive Neural Machine Translation Prediction

1 code implementation IJCNLP 2019 Sebastin Santy, D, S apat, ipan, Monojit Choudhury, Kalika Bali

In this paper, we demonstrate an Interactive Machine Translation interface, that assists human translators with on-the-fly hints and suggestions.

Machine Translation Translation

DataDepsGenerators. jl: making reusing data easy by automatically generating DataDeps. jl registration code

1 code implementation The Journal of Open Source Software 2018 Lyndon White, Sebastin Santy

DataDepsGenerators. jl is a tool written to help users of the Julia programming language(Bezanson, Edelman, Karpinski, & Shah, 2017), to observe best practices when making use of published datasets.

A study on the use of Boundary Equilibrium GAN for Approximate Frontalization of Unconstrained Faces to aid in Surveillance

no code implementations14 Sep 2018 Wazeer Zulfikar, Sebastin Santy, Sahith Dambekodi, Tirtharaj Dash

Specifically, the present work is a comprehensive study on the implementation of an auto-encoder based Boundary Equilibrium GAN (BEGAN) to generate frontal faces using an interpolation of a side view face and its mirrored view.

Face Generation Generative Adversarial Network

Cannot find the paper you are looking for? You can Submit a new open access paper.