Search Results for author: Arun Kumar

Found 26 papers, 5 papers with code

Bolt-on Differential Privacy for Scalable Stochastic Gradient Descent-based Analytics

1 code implementation15 Jun 2016 Xi Wu, Fengan Li, Arun Kumar, Kamalika Chaudhuri, Somesh Jha, Jeffrey F. Naughton

This paper takes a first step to remedy this disconnect and proposes a private SGD algorithm to address \emph{both} issues in an integrated manner.

Novelty Learning via Collaborative Proximity Filtering

no code implementations21 Oct 2016 Arun Kumar, Paul Schrater

We meet these challenges by developing a model of novelty preferences that learns and tracks latent user tastes.

Recommendation Systems

Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent

no code implementations22 Feb 2017 Fengan Li, Lingjiao Chen, Yijing Zeng, Arun Kumar, Jeffrey F. Naughton, Jignesh M. Patel, Xi Wu

We fill this crucial research gap by proposing a new lossless compression scheme we call tuple-oriented compression (TOC) that is inspired by an unlikely source, the string/text compression scheme Lempel-Ziv-Welch, but tailored to MGD in a way that preserves tuple boundaries within mini-batches.

Data Compression Open-Ended Question Answering +1

Morphological Analysis of the Dravidian Language Family

no code implementations EACL 2017 Arun Kumar, Ryan Cotterell, Llu{\'\i}s Padr{\'o}, Antoni Oliver

The Dravidian languages are one of the most widely spoken language families in the world, yet there are very few annotated resources available to NLP researchers.

Morphological Analysis Segmentation

Are Key-Foreign Key Joins Safe to Avoid when Learning High-Capacity Classifiers?

no code implementations3 Apr 2017 Vraj Shah, Arun Kumar, Xiaojin Zhu

Our results show that these high-capacity classifiers are surprisingly and counter-intuitively more robust to avoiding KFK joins compared to linear classifiers, refuting an intuition from the prior work's analysis.

Management

In-RDBMS Hardware Acceleration of Advanced Analytics

no code implementations8 Jan 2018 Divya Mahajan, Joon Kyung Kim, Jacob Sacks, Adel Ardalan, Arun Kumar, Hadi Esmaeilzadeh

The data revolution is fueled by advances in machine learning, databases, and hardware design.

Model-based Pricing for Machine Learning in a Data Marketplace

no code implementations26 May 2018 Lingjiao Chen, Paraschos Koutris, Arun Kumar

Finally, we conduct extensive experiments, which validate that the MBP framework can provide high revenue to the seller, high affordability to the buyer, and also operate on low runtime cost.

BIG-bench Machine Learning

Bipedal Walking Robot using Deep Deterministic Policy Gradient

3 code implementations16 Jul 2018 Arun Kumar, Navneet Paul, S. N. Omkar

The control systems community has started to show interest towards several machine learning algorithms from the sub-domains such as supervised learning, imitation learning and reinforcement learning to achieve autonomous control and intelligent decision making.

BIG-bench Machine Learning Imitation Learning +2

Deep Domain Adaptation under Deep Label Scarcity

no code implementations20 Sep 2018 Amar Prakash Azad, Dinesh Garg, Priyanka Agrawal, Arun Kumar

The goal behind Domain Adaptation (DA) is to leverage the labeled examples from a source domain so as to infer an accurate model in a target domain where labels are not available or in scarce at the best.

Domain Adaptation Transductive Learning

Fine Grained Classification of Personal Data Entities

no code implementations23 Nov 2018 Riddhiman Dasgupta, Balaji Ganesan, Aswin Kannan, Berthold Reinwald, Arun Kumar

Entity Type Classification can be defined as the task of assigning category labels to entity mentions in documents.

Classification General Classification

Document Structure Measure for Hypernym discovery

no code implementations30 Nov 2018 Aswin Kannan, Shanmukha C Guttula, Balaji Ganesan, Hima P Karanam, Arun Kumar

Hypernym discovery is the problem of finding terms that have is-a relationship with a given term.

Hypernym Discovery Position

Belief dynamics extraction

no code implementations2 Feb 2019 Arun Kumar, Zhengwei Wu, Xaq Pitkow, Paul Schrater

Estimating the structure of these internal states is crucial for understanding the neural basis of behavior.

Model-based Reinforcement Learning

Morphological Segmentation Inside-Out

no code implementations EMNLP 2016 Ryan Cotterell, Arun Kumar, Hinrich Schütze

Morphological segmentation has traditionally been modeled with non-hierarchical models, which yield flat segmentations as output.

Morphological Analysis Segmentation

A novel method of fuzzy time series forecasting based on interval index number and membership value using support vector machine

no code implementations20 Oct 2020 Kiran Bisht, Arun Kumar

There are generally, four factors that determine the performance of the forecasting method (1) number of intervals (NOIs) and length of intervals to partition universe of discourse (UOD) (2) fuzzification rules or feature representation of crisp time series (3) method of establishing fuzzy logic rule (FLRs) between input and target values (4) defuzzification rule to get crisp forecasted value.

Clustering Time Series +1

Hydra: A System for Large Multi-Model Deep Learning

1 code implementation16 Oct 2021 Kabir Nagrecha, Arun Kumar

In this paper, we present Hydra, a system designed to tackle such challenges by enabling out-of-the-box scaling for multi-large-model DL workloads on even commodity GPUs in a resource-efficient manner.

Ranked #5 on Language Modelling on WikiText-2 (using extra training data)

Language Modelling Model Selection +2

Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

no code implementations1 Nov 2022 Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, S Umesh, Rajeev Sangal

Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video.

Chunking Speech Synthesis +1

Objects as Spatio-Temporal 2.5D points

no code implementations6 Dec 2022 Paridhi Singh, Gaurav Singh, Arun Kumar

Determining accurate bird's eye view (BEV) positions of objects and tracks in a scene is vital for various perception tasks including object interactions mapping, scenario extraction etc., however, the level of supervision required to accomplish that is extremely challenging to procure.

Depth Estimation Depth Prediction +3

Saturn: An Optimized Data System for Large Model Deep Learning Workloads

1 code implementation3 Sep 2023 Kabir Nagrecha, Arun Kumar

Such models need multiple GPUs due to both their size and computational load, driving the development of a bevy of "model parallelism" techniques & tools.

Model Selection Scheduling

Saturn: Efficient Multi-Large-Model Deep Learning

no code implementations6 Nov 2023 Kabir Nagrecha, Arun Kumar

In this paper, we propose Saturn, a new data system to improve the efficiency of multi-large-model training (e. g., during model selection/hyperparameter optimization).

Hyperparameter Optimization Model Selection +1

KIX: A Metacognitive Generalization Framework

no code implementations8 Feb 2024 Arun Kumar, Paul Schrater

Humans and other animals aptly exhibit general intelligence behaviors in solving a variety of tasks with flexibility and ability to adapt to novel situations by reusing and applying high level knowledge acquired over time.

Cannot find the paper you are looking for? You can Submit a new open access paper.