Search Results for author: Vivek Kulkarni

Found 37 papers, 12 papers with code

CTM - A Model for Large-Scale Multi-View Tweet Topic Classification

no code implementations • NAACL (ACL) 2022 • Vivek Kulkarni, Kenny Leung, Aria Haghighi

In contrast to most prior work which only focuses on post-classification into a small number of topics (10-20), we consider the task of large-scale topic classification in the context of Twitter where the topic space is 10 times larger with potentially multiple topic associations per Tweet.

Classification Topic Classification

Paper
Add Code

DialectGram: Automatic Detection of Dialectal Changes with Multi-geographic Resolution Analysis

1 code implementation • SCiL 2020 • Hang Jiang, Haoshen Hong, Yuxing Chen, Vivek Kulkarni

Paper
Code

mEdIT: Multilingual Text Editing via Instruction Tuning

1 code implementation • 26 Feb 2024 • Vipul Raheja, Dimitris Alikaniotis, Vivek Kulkarni, Bashar Alhafni, Dhruv Kumar

We introduce mEdIT, a multi-lingual extension to CoEdIT -- the recent state-of-the-art text editing models for writing assistance.

Grammatical Error Correction Text Simplification

Paper
Code

Personalized Text Generation with Fine-Grained Linguistic Control

1 code implementation • 7 Feb 2024 • Bashar Alhafni, Vivek Kulkarni, Dhruv Kumar, Vipul Raheja

As the text generation capabilities of large language models become increasingly prominent, recent studies have focused on controlling particular aspects of the generated text to make it more personalized.

Text Generation

Paper
Code

SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks

no code implementations • 3 Feb 2024 • Gourab Dey, Adithya V Ganesan, Yash Kumar Lal, Manal Shah, Shreyashee Sinha, Matthew Matero, Salvatore Giorgi, Vivek Kulkarni, H. Andrew Schwartz

Social science NLP tasks, such as emotion or humor detection, are required to capture the semantics along with the implicit pragmatics from text, often with limited amounts of training data.

Humor Detection Reading Comprehension

Paper
Add Code

Writing Assistants Should Model Social Factors of Language

no code implementations • 28 Mar 2023 • Vivek Kulkarni, Vipul Raheja

Intelligent writing assistants powered by large language models (LLMs) are more popular today than ever before, but their further widespread adoption is precluded by sub-optimal performance.

Position

Paper
Add Code

NTULM: Enriching Social Media Text Representations with Non-Textual Units

no code implementations • COLING (WNUT) 2022 • Jinning Li, Shubhanshu Mishra, Ahmed El-Kishky, Sneha Mehta, Vivek Kulkarni

We refer to these annotations as Non-Textual Units (NTUs).

Language Modelling

Paper
Add Code

CTM -- A Model for Large-Scale Multi-View Tweet Topic Classification

no code implementations • 3 May 2022 • Vivek Kulkarni, Kenny Leung, Aria Haghighi

In contrast to most prior work which only focuses on post classification into a small number of topics ($10$-$20$), we consider the task of large-scale topic classification in the context of Twitter where the topic space is $10$ times larger with potentially multiple topic associations per Tweet.

Classification Topic Classification

Paper
Add Code

LMSOC: An Approach for Socially Sensitive Pretraining

1 code implementation • Findings (EMNLP) 2021 • Vivek Kulkarni, Shubhanshu Mishra, Aria Haghighi

Although language depends heavily on the geographical, temporal, and other social contexts of the speaker, these elements have not been incorporated into modern transformer-based language models.

Cloze Test Graph Representation Learning +1

Paper
Code

Learning, Computing, and Trustworthiness in Intelligent IoT Environments: Performance-Energy Tradeoffs

no code implementations • 4 Oct 2021 • Beatriz Soret, Lam D. Nguyen, Jan Seeger, Arne Bröring, Chaouki Ben Issaid, Sumudu Samarakoon, Anis El Gabli, Vivek Kulkarni, Mehdi Bennis, Petar Popovski

An Intelligent IoT Environment (iIoTe) is comprised of heterogeneous devices that can collaboratively execute semi-autonomous IoT applications, examples of which include highly automated manufacturing cells or autonomously interacting harvesting machines.

Edge-computing Total Energy

Paper
Add Code

TopicBERT for Energy Efficient Document Classification

1 code implementation • Findings of the Association for Computational Linguistics 2020 • Yatin Chaudhary, Pankaj Gupta, Khushbu Saxena, Vivek Kulkarni, Thomas Runkler, Hinrich Schütze

Our work thus focuses on optimizing the computational cost of fine-tuning for document classification.

Classification Document Classification +1

Paper
Code

DialectGram: Detecting Dialectal Variation at Multiple Geographic Resolutions

1 code implementation • 4 Oct 2019 • Hang Jiang, Haoshen Hong, Yuxing Chen, Vivek Kulkarni

In this work, we propose a model that enables detection of dialectal variation at multiple levels of geographic resolution obviating the need for a-priori definition of the resolution level.

Word Embeddings

Paper
Code

What Should I Ask? Using Conversationally Informative Rewards for Goal-Oriented Visual Dialog

no code implementations • 28 Jul 2019 • Pushkar Shukla, Carlos Elmadjian, Richika Sharan, Vivek Kulkarni, Matthew Turk, William Yang Wang

In this work, we focus on the task of goal-oriented visual dialogue, aiming to automatically generate a series of questions about an image with a single objective.

Visual Dialog

Paper
Add Code

TWEETQA: A Social Media Focused Question Answering Dataset

no code implementations • ACL 2019 • Wenhan Xiong, Jiawei Wu, Hong Wang, Vivek Kulkarni, Mo Yu, Shiyu Chang, Xiaoxiao Guo, William Yang Wang

With social media becoming increasingly pop-ular on which lots of news and real-time eventsare reported, developing automated questionanswering systems is critical to the effective-ness of many applications that rely on real-time knowledge.

Question Answering

Paper
Add Code

What Should I Ask? Using Conversationally Informative Rewards for Goal-oriented Visual Dialog.

no code implementations • ACL 2019 • Pushkar Shukla, Carlos Elmadjian, Richika Sharan, Vivek Kulkarni, Matthew Turk, William Yang Wang

In this work, we focus on the task of goal-oriented visual dialogue, aiming to automatically generate a series of questions about an image with a single objective.

Visual Dialog

Paper
Add Code

MOHONE: Modeling Higher Order Network Effects in KnowledgeGraphs via Network Infused Embeddings

no code implementations • 1 Nov 2018 • Hao Yu, Vivek Kulkarni, William Wang

First, we introduce methods that learn network representations of entities in the knowledge graph capturing these varied aspects of similarity.

Knowledge Graph Embedding Knowledge Graph Embeddings +2

Paper
Add Code

DOLORES: Deep Contextualized Knowledge Graph Embeddings

no code implementations • AKBC 2020 • Haoyu Wang, Vivek Kulkarni, William Yang Wang

We introduce a new method DOLORES for learning knowledge graph embeddings that effectively captures contextual cues and dependencies among entities and relations.

Knowledge Graph Embeddings Knowledge Graphs +3

Paper
Add Code

Multi-view Models for Political Ideology Detection of News Articles

no code implementations • EMNLP 2018 • Vivek Kulkarni, Junting Ye, Steven Skiena, William Yang Wang

A news article's title, content and link structure often reveal its political ideology.

Representation Learning

Paper
Add Code

Simple Neologism Based Domain Independent Models to Predict Year of Authorship

no code implementations • COLING 2018 • Vivek Kulkarni, Yingtao Tian, D, Parth iwala, Steve Skiena

We present domain independent models to date documents based only on neologism usage patterns.

Domain Adaptation Information Retrieval

Paper
Add Code

Simple Models for Word Formation in Slang

1 code implementation • NAACL 2018 • Vivek Kulkarni, William Yang Wang

We propose the first generative models for three types of extra-grammatical word formation phenomena abounding in slang: Blends, Clippings, and Reduplicatives.

Paper
Code

Hate Lingo: A Target-based Linguistic Analysis of Hate Speech in Social Media

2 code implementations • 11 Apr 2018 • Mai ElSherief, Vivek Kulkarni, Dana Nguyen, William Yang Wang, Elizabeth Belding

While social media empowers freedom of expression and individual voices, it also enables anti-social behavior, online harassment, cyberbullying, and hate speech.

Paper
Code

Simple Models for Word Formation in English Slang

1 code implementation • 7 Apr 2018 • Vivek Kulkarni, William Yang Wang

We propose generative models for three types of extra-grammatical word formation phenomena abounding in English slang: Blends, Clippings, and Reduplicatives.

Paper
Code

TFW, DamnGina, Juvie, and Hotsie-Totsie: On the Linguistic and Social Aspects of Internet Slang

no code implementations • 22 Dec 2017 • Vivek Kulkarni, William Yang Wang

In this work, we use UrbanDictionary to conduct the first large-scale linguistic analysis of slang and its social aspects on the Internet to yield insights into this variety of language that is increasingly used all over the world online.

Question Answering

Paper
Add Code

Human Centered NLP with User-Factor Adaptation

no code implementations • EMNLP 2017 • Veronica Lynn, Youngseo Son, Vivek Kulkarni, Niranjan Balasubramanian, H. Andrew Schwartz

We pose the general task of user-factor adaptation {--} adapting supervised learning models to real-valued user factors inferred from a background of their language, reflecting the idea that a piece of text should be understood within the context of the user that wrote it.

Document Classification Domain Adaptation +5

Paper
Add Code

On the Distribution of Lexical Features at Multiple Levels of Analysis

no code implementations • ACL 2017 • Fatemeh Almodaresi, Lyle Ungar, Vivek Kulkarni, Mohsen Zakeri, Salvatore Giorgi, H. Andrew Schwartz

Natural language processing has increasingly moved from modeling documents and words toward studying the people behind the language.

Document Classification Sentiment Analysis

Paper
Add Code

Latent Human Traits in the Language of Social Media: An Open-Vocabulary Approach

no code implementations • 22 May 2017 • Vivek Kulkarni, Margaret L. Kern, David Stillwell, Michal Kosinski, Sandra Matz, Lyle Ungar, Steven Skiena, H. Andrew Schwartz

Taking advantage of linguistic information available through Facebook, we study the process of inferring a new set of potential human traits based on unprompted language use.

Paper
Add Code

Domain Adaptation for Named Entity Recognition in Online Media with Word Embeddings

no code implementations • 1 Dec 2016 • Vivek Kulkarni, Yashar Mehdad, Troy Chevalier

In this paper, we propose methods to effectively adapt models learned on one domain onto other domains using distributed word representations.

Domain Adaptation named-entity-recognition +3

Paper
Add Code

Data Centroid Based Multi-Level Fuzzy Min-Max Neural Network

no code implementations • 19 Aug 2016 • Shraddha Deshmukh, Sagar Gandhi, Pratap Sanap, Vivek Kulkarni

In this brief, an extension of MLF is proposed which defines a new boundary region, where the previously proposed methods mark decisions with less confidence and hence misclassification is more frequent.

Classification General Classification

Paper
Add Code

On the Convergent Properties of Word Embedding Methods

no code implementations • 12 May 2016 • Yingtao Tian, Vivek Kulkarni, Bryan Perozzi, Steven Skiena

Do word embeddings converge to learn similar things over different initializations?

Word Embeddings

Paper
Add Code

Theano: A Python framework for fast computation of mathematical expressions

1 code implementation • 9 May 2016 • The Theano Development Team, Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermueller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre-Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul Christiano, Tim Cooijmans, Marc-Alexandre Côté, Myriam Côté, Aaron Courville, Yann N. Dauphin, Olivier Delalleau, Julien Demouth, Guillaume Desjardins, Sander Dieleman, Laurent Dinh, Mélanie Ducoffe, Vincent Dumoulin, Samira Ebrahimi Kahou, Dumitru Erhan, Ziye Fan, Orhan Firat, Mathieu Germain, Xavier Glorot, Ian Goodfellow, Matt Graham, Caglar Gulcehre, Philippe Hamel, Iban Harlouchet, Jean-Philippe Heng, Balázs Hidasi, Sina Honari, Arjun Jain, Sébastien Jean, Kai Jia, Mikhail Korobov, Vivek Kulkarni, Alex Lamb, Pascal Lamblin, Eric Larsen, César Laurent, Sean Lee, Simon Lefrancois, Simon Lemieux, Nicholas Léonard, Zhouhan Lin, Jesse A. Livezey, Cory Lorenz, Jeremiah Lowin, Qianli Ma, Pierre-Antoine Manzagol, Olivier Mastropietro, Robert T. McGibbon, Roland Memisevic, Bart van Merriënboer, Vincent Michalski, Mehdi Mirza, Alberto Orlandi, Christopher Pal, Razvan Pascanu, Mohammad Pezeshki, Colin Raffel, Daniel Renshaw, Matthew Rocklin, Adriana Romero, Markus Roth, Peter Sadowski, John Salvatier, François Savard, Jan Schlüter, John Schulman, Gabriel Schwartz, Iulian Vlad Serban, Dmitriy Serdyuk, Samira Shabanian, Étienne Simon, Sigurd Spieckermann, S. Ramana Subramanyam, Jakub Sygnowski, Jérémie Tanguay, Gijs van Tulder, Joseph Turian, Sebastian Urban, Pascal Vincent, Francesco Visin, Harm de Vries, David Warde-Farley, Dustin J. Webb, Matthew Willson, Kelvin Xu, Lijun Xue, Li Yao, Saizheng Zhang, Ying Zhang

Since its introduction, it has been one of the most used CPU and GPU mathematical compilers - especially in the machine learning community - and has shown steady performance improvements.

BIG-bench Machine Learning Clustering +2

9,851

Paper
Code

Don't Walk, Skip! Online Learning of Multi-scale Network Embeddings

2 code implementations • 6 May 2016 • Bryan Perozzi, Vivek Kulkarni, Haochen Chen, Steven Skiena

We present Walklets, a novel approach for learning multiscale representations of vertices in a network.

Social and Information Networks Physics and Society

375

Paper
Code

Freshman or Fresher? Quantifying the Geographic Variation of Internet Language

2 code implementations • 22 Oct 2015 • Vivek Kulkarni, Bryan Perozzi, Steven Skiena

Our analysis of British and American English over a period of 100 years reveals that semantic variation between these dialects is shrinking.

Paper
Code

To Drop or Not to Drop: Robustness, Consistency and Differential Privacy Properties of Dropout

no code implementations • 6 Mar 2015 • Prateek Jain, Vivek Kulkarni, Abhradeep Thakurta, Oliver Williams

Moreover, using the above mentioned stability properties of dropout, we design dropout based differentially private algorithms for solving ERMs.

L2 Regularization

Paper
Add Code

Statistically Significant Detection of Linguistic Change

no code implementations • 12 Nov 2014 • Vivek Kulkarni, Rami Al-Rfou, Bryan Perozzi, Steven Skiena

We propose a new computational approach for tracking and detecting statistically significant linguistic shifts in the meaning and usage of words.

Change Point Detection Time Series +1

Paper
Add Code

POLYGLOT-NER: Massive Multilingual Named Entity Recognition

no code implementations • 14 Oct 2014 • Rami Al-Rfou, Vivek Kulkarni, Bryan Perozzi, Steven Skiena

We describe a system that builds Named Entity Recognition (NER) annotators for 40 major languages using Wikipedia and Freebase.

Information Retrieval Machine Translation +7

Paper
Add Code

Exploring the power of GPU's for training Polyglot language models

no code implementations • 5 Apr 2014 • Vivek Kulkarni, Rami Al-Rfou', Bryan Perozzi, Steven Skiena

We evaluate the performance of training the model on the GPU and present optimizations that boost the performance on the GPU. One of the key optimizations, we propose increases the performance of a function involved in calculating and updating the gradient by approximately 50 times on the GPU for sufficiently large batch sizes.

Paper
Add Code

Inducing Language Networks from Continuous Space Word Representations

no code implementations • 6 Mar 2014 • Bryan Perozzi, Rami Al-Rfou, Vivek Kulkarni, Steven Skiena

Recent advancements in unsupervised feature learning have developed powerful latent representations of words.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.