Search Results for author: Arun Kumar

Found 26 papers, 5 papers with code

Learning Agglutinative Morphology of Indian Languages with Linguistically Motivated Adaptor Grammars

no code implementations • RANLP 2015 • Arun Kumar, Lluís Padró, Antoni Oliver

Unsupervised learning of agglutinated morphology using nested Pitman-Yor process based morpheme induction algorithm

no code implementations • RANLP 2015 • Arun Kumar

Information Retrieval Machine Translation

Joint Bayesian Morphology Learning for Dravidian Languages

no code implementations • WS 2015 • Arun Kumar, Lluís Padró, Antoni Oliver

Morphological Analysis

Bolt-on Differential Privacy for Scalable Stochastic Gradient Descent-based Analytics

1 code implementation • 15 Jun 2016 • Xi Wu, Fengan Li, Arun Kumar, Kamalika Chaudhuri, Somesh Jha, Jeffrey F. Naughton

This paper takes a first step to remedy this disconnect and proposes a private SGD algorithm to address both issues in an integrated manner.

Novelty Learning via Collaborative Proximity Filtering

no code implementations • 21 Oct 2016 • Arun Kumar, Paul Schrater

We meet these challenges by developing a model of novelty preferences that learns and tracks latent user tastes.

Recommendation Systems

Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent

no code implementations • 22 Feb 2017 • Fengan Li, Lingjiao Chen, Yijing Zeng, Arun Kumar, Jeffrey F. Naughton, Jignesh M. Patel, Xi Wu

We fill this crucial research gap by proposing a new lossless compression scheme we call tuple-oriented compression (TOC) that is inspired by an unlikely source, the string/text compression scheme Lempel-Ziv-Welch, but tailored to MGD in a way that preserves tuple boundaries within mini-batches.

Data Compression Open-Ended Question Answering +1

Morphological Analysis of the Dravidian Language Family

no code implementations • EACL 2017 • Arun Kumar, Ryan Cotterell, Lluís Padró, Antoni Oliver

The Dravidian languages are one of the most widely spoken language families in the world, yet there are very few annotated resources available to NLP researchers.

Morphological Analysis Segmentation

Are Key-Foreign Key Joins Safe to Avoid when Learning High-Capacity Classifiers?

no code implementations • 3 Apr 2017 • Vraj Shah, Arun Kumar, Xiaojin Zhu

Our results show that these high-capacity classifiers are surprisingly and counter-intuitively more robust to avoiding KFK joins compared to linear classifiers, refuting an intuition from the prior work's analysis.

Management

Dialogue Act Sequence Labeling using Hierarchical encoder with CRF

3 code implementations • 13 Sep 2017 • Harshit Kumar, Arvind Agarwal, Riddhiman Dasgupta, Sachindra Joshi, Arun Kumar

Dialogue act recognition associates dialogue acts (i.e., semantic labels) with utterances in a conversation.

Ranked #7 on Dialogue Act Classification on ICSI Meeting Recorder Dialog Act (MRDA) corpus

Dialogue Act Classification

In-RDBMS Hardware Acceleration of Advanced Analytics

no code implementations • 8 Jan 2018 • Divya Mahajan, Joon Kyung Kim, Jacob Sacks, Adel Ardalan, Arun Kumar, Hadi Esmaeilzadeh

The data revolution is fueled by advances in machine learning, databases, and hardware design.

Model-based Pricing for Machine Learning in a Data Marketplace

no code implementations • 26 May 2018 • Lingjiao Chen, Paraschos Koutris, Arun Kumar

Finally, we conduct extensive experiments, which validate that the MBP framework can provide high revenue to the seller and high affordability to the buyer while operating at low runtime cost.

BIG-bench Machine Learning

Bipedal Walking Robot using Deep Deterministic Policy Gradient

3 code implementations • 16 Jul 2018 • Arun Kumar, Navneet Paul, S. N. Omkar

The control systems community has started to show interest in several machine learning algorithms from sub-domains such as supervised learning, imitation learning, and reinforcement learning to achieve autonomous control and intelligent decision making.

BIG-bench Machine Learning Imitation Learning +2

Deep Domain Adaptation under Deep Label Scarcity

no code implementations • 20 Sep 2018 • Amar Prakash Azad, Dinesh Garg, Priyanka Agrawal, Arun Kumar

The goal behind Domain Adaptation (DA) is to leverage the labeled examples from a source domain to infer an accurate model in a target domain where labels are unavailable or, at best, scarce.

Domain Adaptation Transductive Learning

Fine Grained Classification of Personal Data Entities

no code implementations • 23 Nov 2018 • Riddhiman Dasgupta, Balaji Ganesan, Aswin Kannan, Berthold Reinwald, Arun Kumar

Entity Type Classification can be defined as the task of assigning category labels to entity mentions in documents.

Classification General Classification

Document Structure Measure for Hypernym discovery

no code implementations • 30 Nov 2018 • Aswin Kannan, Shanmukha C Guttula, Balaji Ganesan, Hima P Karanam, Arun Kumar

Hypernym discovery is the problem of finding terms that have an is-a relationship with a given term.

Hypernym Discovery Position

Belief dynamics extraction

no code implementations • 2 Feb 2019 • Arun Kumar, Zhengwei Wu, Xaq Pitkow, Paul Schrater

Estimating the structure of these internal states is crucial for understanding the neural basis of behavior.

Model-based Reinforcement Learning

MLSys: The New Frontier of Machine Learning Systems

no code implementations • 29 Mar 2019 • Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood, Furong Huang, Martin Jaggi, Kevin Jamieson, Michael I. Jordan, Gauri Joshi, Rania Khalaf, Jason Knight, Jakub Konečný, Tim Kraska, Arun Kumar, Anastasios Kyrillidis, Aparna Lakshmiratan, Jing Li, Samuel Madden, H. Brendan McMahan, Erik Meijer, Ioannis Mitliagkas, Rajat Monga, Derek Murray, Kunle Olukotun, Dimitris Papailiopoulos, Gennady Pekhimenko, Theodoros Rekatsinas, Afshin Rostamizadeh, Christopher Ré, Christopher De Sa, Hanie Sedghi, Siddhartha Sen, Virginia Smith, Alex Smola, Dawn Song, Evan Sparks, Ion Stoica, Vivienne Sze, Madeleine Udell, Joaquin Vanschoren, Shivaram Venkataraman, Rashmi Vinayak, Markus Weimer, Andrew Gordon Wilson, Eric Xing, Matei Zaharia, Ce Zhang, Ameet Talwalkar

Machine learning (ML) techniques are enjoying rapidly increasing adoption.

BIG-bench Machine Learning

Predicting Eating Events in Free Living Individuals -- A Technical Report

no code implementations • 14 Aug 2019 • Jiayi Wang, Jiue-An Yang, Supun Nakandala, Arun Kumar, Marta M. Jankowska

For predicting food purchasing events, the RBF-SVM model (0.7395) outperforms others.

Morphological Segmentation Inside-Out

no code implementations • EMNLP 2016 • Ryan Cotterell, Arun Kumar, Hinrich Schütze

Morphological segmentation has traditionally been modeled with non-hierarchical models, which yield flat segmentations as output.

Morphological Analysis Segmentation

A novel method of fuzzy time series forecasting based on interval index number and membership value using support vector machine

no code implementations • 20 Oct 2020 • Kiran Bisht, Arun Kumar

Four factors generally determine the performance of a forecasting method: (1) the number of intervals (NOIs) and the length of the intervals used to partition the universe of discourse (UOD); (2) the fuzzification rules, or feature representation, of the crisp time series; (3) the method of establishing fuzzy logic rules (FLRs) between input and target values; and (4) the defuzzification rule used to obtain the crisp forecasted value.

Clustering Time Series +1

Hydra: A System for Large Multi-Model Deep Learning

1 code implementation • 16 Oct 2021 • Kabir Nagrecha, Arun Kumar

In this paper, we present Hydra, a system designed to tackle such challenges by enabling out-of-the-box scaling for multi-large-model DL workloads on even commodity GPUs in a resource-efficient manner.

Ranked #5 on Language Modelling on WikiText-2 (using extra training data)

Language Modelling Model Selection +2

Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

no code implementations • 1 Nov 2022 • Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, S Umesh, Rajeev Sangal

Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, and text-to-speech synthesis, followed by isochronous lip-syncing to the original video.

Chunking Speech Synthesis +1

Objects as Spatio-Temporal 2.5D points

no code implementations • 6 Dec 2022 • Paridhi Singh, Gaurav Singh, Arun Kumar

Determining accurate bird's eye view (BEV) positions of objects and tracks in a scene is vital for various perception tasks, including object interactions mapping, scenario extraction, etc. However, the level of supervision required to accomplish that is extremely challenging to procure.

Depth Estimation Depth Prediction +3

Saturn: An Optimized Data System for Large Model Deep Learning Workloads

1 code implementation • 3 Sep 2023 • Kabir Nagrecha, Arun Kumar

Such models need multiple GPUs due to both their size and computational load, driving the development of a bevy of "model parallelism" techniques & tools.

Model Selection Scheduling

Saturn: Efficient Multi-Large-Model Deep Learning

no code implementations • 6 Nov 2023 • Kabir Nagrecha, Arun Kumar

In this paper, we propose Saturn, a new data system to improve the efficiency of multi-large-model training (e.g., during model selection/hyperparameter optimization).

Hyperparameter Optimization Model Selection +1

KIX: A Metacognitive Generalization Framework

no code implementations • 8 Feb 2024 • Arun Kumar, Paul Schrater

Humans and other animals aptly exhibit general intelligence behaviors in solving a variety of tasks with flexibility and the ability to adapt to novel situations by reusing and applying high-level knowledge acquired over time.
