Search Results for author: Gautam Shroff

Found 54 papers, 4 papers with code

Intent Detection and Discovery from User Logs via Deep Semi-Supervised Contrastive Clustering

no code implementations NAACL 2022 Rajat Kumar, Mayur Patidar, Vaibhav Varshney, Lovekesh Vig, Gautam Shroff

However, even skilled domain experts are often unable to foresee all possible user intents at design time and for practical applications, novel intents may have to be inferred incrementally on-the-fly from user utterances.

Clustering Intent Detection +4

Prompt Augmented Generative Replay via Supervised Contrastive Learning for Lifelong Intent Detection

no code implementations Findings (NAACL) 2022 Vaibhav Varshney, Mayur Patidar, Rajat Kumar, Lovekesh Vig, Gautam Shroff

This typically entails repeated retraining of the intent detector on both the existing and novel intents which can be expensive and would require storage of all past data corresponding to prior intents.

Continual Learning Contrastive Learning +2

SmartFlow: Robotic Process Automation using LLMs

no code implementations21 May 2024 Arushi Jain, Shubham Paliwal, Monika Sharma, Lovekesh Vig, Gautam Shroff

To assess the effectiveness of SmartFlow, we have developed a dataset that includes a set of generic enterprise applications with diverse layouts, which we are releasing for research use.

Acceleron: A Tool to Accelerate Research Ideation

no code implementations7 Mar 2024 Harshit Nigam, Manasi Patwardhan, Lovekesh Vig, Gautam Shroff

To aid with research ideation, we propose `Acceleron', a research accelerator for different phases of the research life cycle, and which is specially designed to aid the ideation process.

Conservative Predictions on Noisy Financial Data

no code implementations18 Oct 2023 Omkar Nabar, Gautam Shroff

We apply a similar approach, where a model abstains from making a prediction on data points that it is uncertain on.

Neuro-symbolic Meta Reinforcement Learning for Trading

no code implementations15 Jan 2023 S I Harini, Gautam Shroff, Ashwin Srinivasan, Prayushi Faldu, Lovekesh Vig

We model short-duration (e. g. day) trading in financial markets as a sequential decision-making problem under uncertainty, with the added complication of continual concept-drift.

Decision Making Meta Reinforcement Learning +3

Calibrating Deep Neural Networks using Explicit Regularisation and Dynamic Data Pruning

no code implementations20 Dec 2022 Ramya Hebbalaguppe, Rishabh Patra, Tirtharaj Dash, Gautam Shroff, Lovekesh Vig

Contemporary model calibration techniques mitigate the problem of overconfident predictions by pushing down the confidence of the winning class while increasing the confidence of the remaining classes across all test samples.

Image Classification

Neural Feature-Adaptation for Symbolic Predictions Using Pre-Training and Semantic Loss

no code implementations29 Nov 2022 Vedant Shah, Aditya Agrawal, Lovekesh Vig, Ashwin Srinivasan, Gautam Shroff, Tanmay Verlekar

Real data is often imperfect meaning that even if the symbolic theory remains unchanged, we may still need to address the problem of mapping raw data to high-level symbols, each time there is a change in the data acquisition environment or equipment.

Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces

no code implementations19 Sep 2022 Vishwa Shah, Aditya Sharma, Gautam Shroff, Lovekesh Vig, Tirtharaj Dash, Ashwin Srinivasan

However, connectionist models struggle to include explicit domain knowledge for deductive reasoning.

Continual Learning for Multivariate Time Series Tasks with Variable Input Dimensions

no code implementations14 Mar 2022 Vibhor Gupta, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

We note that existing continual learning methods do not take into account variability in input dimensions arising due to different subsets of sensors being available across tasks, and struggle to adapt to such variable input dimensions (VID) tasks.

Activity Recognition Continual Learning +2

Learning to Liquidate Forex: Optimal Stopping via Adaptive Top-K Regression

no code implementations25 Feb 2022 Diksha Garg, Pankaj Malhotra, Anil Bhatia, Sanjay Bhat, Lovekesh Vig, Gautam Shroff

We consider learning a trading agent acting on behalf of the treasury of a firm earning revenue in a foreign currency (FC) and incurring expenses in the home currency (HC).

regression

DRTCI: Learning Disentangled Representations for Temporal Causal Inference

no code implementations20 Jan 2022 Garima Gupta, Lovekesh Vig, Gautam Shroff

Medical professionals evaluating alternative treatment plans for a patient often encounter time varying confounders, or covariates that affect both the future treatment assignment and the patient outcome.

Causal Inference counterfactual +1

Solving Visual Analogies Using Neural Algorithmic Reasoning

no code implementations19 Nov 2021 Atharv Sonwane, Gautam Shroff, Lovekesh Vig, Ashwin Srinivasan, Tirtharaj Dash

We consider a class of visual analogical reasoning problems that involve discovering the sequence of transformations by which pairs of input/output images are related, so as to analogously transform future inputs.

Program Synthesis Visual Analogies

CAMTA: Causal Attention Model for Multi-touch Attribution

no code implementations21 Dec 2020 Sachin Kumar, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

Advertising channels have evolved from conventional print media, billboards and radio advertising to online digital advertising (ad), where the users are exposed to a sequence of ad campaigns via social networks, display ads, search etc.

Selection bias

Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation

no code implementations16 Dec 2020 Diksha Garg, Priyanka Gupta, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

Most of the existing deep reinforcement learning (RL) approaches for session-based recommendations either rely on costly online interactions with real users, or rely on potentially biased rule-based or data-driven user-behavior models for learning.

Distributional Reinforcement Learning Offline RL +3

Handling Variable-Dimensional Time Series with Graph Neural Networks

no code implementations1 Jul 2020 Vibhor Gupta, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

Such a combinatorial generalization is achieved by conditioning the layers of a core neural network-based time series model with a "conditioning vector" that carries information of the available combination of sensors for each time series.

Activity Recognition Graph Neural Network +3

Graph Neural Networks for Leveraging Industrial Equipment Structure: An application to Remaining Useful Life Estimation

no code implementations30 Jun 2020 Jyoti Narwariya, Pankaj Malhotra, Vishnu Tv, Lovekesh Vig, Gautam Shroff

Deep learning models such as those based on recurrent neural networks (RNNs) or convolutional neural networks (CNNs) fail to explicitly leverage this potentially rich source of domain-knowledge into the learning procedure.

Time Series Time Series Analysis

MultiMBNN: Matched and Balanced Causal Inference with Neural Networks

no code implementations28 Apr 2020 Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

Causal inference (CI) in observational studies has received a lot of attention in healthcare, education, ad attribution, policy evaluation, etc.

Causal Inference

Privacy Guidelines for Contact Tracing Applications

no code implementations28 Apr 2020 Manish Shukla, Rajan M A, Sachin Lodha, Gautam Shroff, Ramesh Raskar

Due to this there is an emergence of mobile based applications for contact tracing.

MetaCI: Meta-Learning for Causal Inference in a Heterogeneous Population

no code implementations9 Dec 2019 Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

Performing inference on data obtained through observational studies is becoming extremely relevant due to the widespread availability of data in fields such as healthcare, education, retail, etc.

Causal Inference counterfactual +1

From Monolingual to Multilingual FAQ Assistant using Multilingual Co-training

no code implementations WS 2019 Mayur Patidar, Surabhi Kumari, Manasi Patwardhan, Kar, Shirish e, Puneet Agarwal, Lovekesh Vig, Gautam Shroff

We observe that the proposed approach provides consistent gains in the performance of BERT for multiple benchmark datasets (e. g. 1. 0{\%} gain on MLDocs, and 1. 2{\%} gain on XNLI over translate-train with BERT), while requiring a single model for multiple languages.

Cross-Lingual Transfer

Meta-Learning for Few-Shot Time Series Classification

no code implementations13 Sep 2019 Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff, Vishnu Tv

We overcome this limitation in order to train a common agent across domains with each domain having different number of target classes, we utilize a triplet-loss based learning procedure that does not require any constraints to be enforced on the number of classes for the few-shot TSC tasks.

Activity Recognition Classification +5

NISER: Normalized Item and Session Representations to Handle Popularity Bias

2 code implementations10 Sep 2019 Priyanka Gupta, Diksha Garg, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

The models using normalized item and session-graph representations perform significantly better: i. for the less popular long-tail items in the offline setting, and ii.

Session-Based Recommendations

Meta-Learning for Black-box Optimization

no code implementations16 Jul 2019 Vishnu TV, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff

Recently, neural networks trained as optimizers under the "learning to learn" or meta-learning framework have been shown to be effective for a broad range of optimization tasks including derivative-free black-box function optimization.

Meta-Learning

One-shot Information Extraction from Document Images using Neuro-Deductive Program Synthesis

no code implementations6 Jun 2019 Vishal Sunder, Ashwin Srinivasan, Lovekesh Vig, Gautam Shroff, Rohit Rahul

Our interest in this paper is in meeting a rapidly growing industrial demand for information extraction from images of documents such as invoices, bills, receipts etc.

Program Synthesis

Fast Online "Next Best Offers" using Deep Learning

no code implementations31 May 2019 Rekha Singhal, Gautam Shroff, Mukund Kumar, Sharod Roy, Sanket Kadarkar, Rupinder virk, Siddharth Verma, Vartika Tiwari

In this paper, we present iPrescribe, a scalable low-latency architecture for recommending 'next-best-offers' in an online setting.

BIG-bench Machine Learning

ConvTimeNet: A Pre-trained Deep Convolutional Neural Network for Time Series Classification

no code implementations29 Apr 2019 Kathan Kashiparekh, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

We also provide qualitative insights into the working of CTN by: i) analyzing the activations and filters of first convolution layer suggesting the filters in CTN are generically useful, ii) analyzing the impact of the design decision to incorporate multiple length decisions, and iii) finding regions of time series that affect the final classification decision via occlusion sensitivity analysis.

Computational Efficiency General Classification +3

Transfer Learning for Clinical Time Series Analysis using Deep Neural Networks

no code implementations1 Apr 2019 Priyanka Gupta, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff

We, therefore, conclude that pre-trained deep models like TimeNet and HealthNet allow leveraging the advantages of deep learning for clinical time series analysis tasks, while also minimize dependence on hand-crafted features, deal robustly with scarce labeled training data scenarios without overfitting, as well as reduce dependence on expertise and resources required to train deep networks from scratch.

Domain Adaptation Time Series +2

Data-driven Prognostics with Predictive Uncertainty Estimation using Ensemble of Deep Ordinal Regression Models

no code implementations23 Mar 2019 Vishnu TV, Diksha, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

We propose a novel deep learning based approach for Prognostics with Uncertainty Quantification that is useful in scenarios where: (i) access to labeled failure data is scarce due to rarity of failures (ii) future operational conditions are unobserved and (iii) inherent noise is present in the sensor readings.

regression Time Series Analysis +1

MEETING BOT: Reinforcement Learning for Dialogue Based Meeting Scheduling

no code implementations28 Dec 2018 Vishwanath D, Lovekesh Vig, Gautam Shroff, Puneet Agarwal

In this paper we present Meeting Bot, a reinforcement learning based conversational system that interacts with multiple users to schedule meetings.

reinforcement-learning Reinforcement Learning (RL) +1

Deep Reader: Information extraction from Document images via relation extraction and Natural Language

no code implementations11 Dec 2018 Vishwanath D, Rohit Rahul, Gunjan Sehgal, Swati, Arindam Chowdhury, Monika Sharma, Lovekesh Vig, Gautam Shroff, Ashwin Srinivasan

In this paper, we propose a novel enterprise based end-to-end framework called DeepReader which facilitates information extraction from document images via identification of visual entities and populating a meta relational model across different entities in the document image.

Optical Character Recognition Optical Character Recognition (OCR) +2

Transfer Learning for Clinical Time Series Analysis using Recurrent Neural Networks

no code implementations4 Jul 2018 Priyanka Gupta, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff

We consider transferring the knowledge captured in an RNN trained on several source tasks simultaneously using a large labeled dataset to build the model for a target task with limited labeled data.

Mortality Prediction Time Series +2

Information Bottleneck Inspired Method For Chat Text Segmentation

no code implementations IJCNLP 2017 S Vishal, Mohit Yadav, Lovekesh Vig, Gautam Shroff

We present a novel technique for segmenting chat conversations using the information bottleneck method (Tishby et al., 2000), augmented with sequential continuity constraints.

Representation Learning Text Generation +2

Crop Planning using Stochastic Visual Optimization

no code implementations25 Oct 2017 Gunjan Sehgal, Bindu Gupta, Kaushal Paneri, Karamjit Singh, Geetika Sharma, Gautam Shroff

Given the weather and soil properties, farmers need to take critical decisions such as which seed variety to plant and in what proportion, in order to maximize productivity.

Decision Making

Predicting Remaining Useful Life using Time Series Embeddings based on Recurrent Neural Networks

no code implementations4 Sep 2017 Narendhar Gugulothu, Vishnu Tv, Pankaj Malhotra, Lovekesh Vig, Puneet Agarwal, Gautam Shroff

We propose Embed-RUL: a novel approach for RUL estimation from sensor data that does not rely on any degradation-trend assumptions, is robust to noise, and handles missing values.

Missing Values Time Series +1

Comparative Benchmarking of Causal Discovery Techniques

no code implementations18 Aug 2017 Karamjit Singh, Garima Gupta, Vartika Tewari, Gautam Shroff

In this paper we present a comprehensive view of prominent causal discovery algorithms, categorized into two main categories (1) assuming acyclic and no latent variables, and (2) allowing both cycles and latent variables, along with experimental results comparing them from three perspectives: (a) structural accuracy, (b) standard predictive accuracy, and (c) accuracy of counterfactual inference.

Benchmarking Causal Discovery +2

TimeNet: Pre-trained deep recurrent neural network for time series classification

2 code implementations23 Jun 2017 Pankaj Malhotra, Vishnu Tv, Lovekesh Vig, Puneet Agarwal, Gautam Shroff

Inspired by the tremendous success of deep Convolutional Neural Networks as generic feature extractors for images, we propose TimeNet: a deep recurrent neural network (RNN) trained on diverse time series in an unsupervised manner using sequence to sequence (seq2seq) models to extract features from time series.

Dynamic Time Warping General Classification +3

Learning and Knowledge Transfer with Memory Networks for Machine Comprehension

no code implementations EACL 2017 Mohit Yadav, Lovekesh Vig, Gautam Shroff

Motivated by these practical issues, we propose a novel curriculum inspired training procedure for Memory Networks to improve the performance for machine comprehension with relatively small volumes of training data.

Question Answering Reading Comprehension +1

Deep Convolutional Neural Networks for Pairwise Causality

no code implementations3 Jan 2017 Karamjit Singh, Garima Gupta, Lovekesh Vig, Gautam Shroff, Puneet Agarwal

Discovering causal models from observational and interventional data is an important first step preceding what-if analysis or counterfactual reasoning.

Attribute Causal Discovery +2

Neuro-symbolic EDA-based Optimisation using ILP-enhanced DBNs

no code implementations20 Dec 2016 Sarmimala Saikia, Lovekesh Vig, Ashwin Srinivasan, Gautam Shroff, Puneet Agarwal, Richa Rawat

We investigate solving discrete optimisation problems using the estimation of distribution (EDA) approach via a novel combination of deep belief networks(DBN) and inductive logic programming (ILP). While DBNs are used to learn the structure of successively better feasible solutions, ILP enables the incorporation of domain-based background knowledge related to the goodness of solutions. Recent work showed that ILP could be an effective way to use domain knowledge in an EDA scenario. However, in a purely ILP-based EDA, sampling successive populations is either inefficient or not straightforward. In our Neuro-symbolic EDA, an ILP engine is used to construct a model for good solutions using domain-based background knowledge. These rules are introduced as Boolean features in the last hidden layer of DBNs used for EDA-based optimization. This incorporation of logical ILP features requires some changes while training and sampling from DBNs: (a)our DBNs need to be trained with data for units at the input layer as well as some units in an otherwise hidden layer, and (b)we would like the samples generated to be drawn from instances entailed by the logical model. We demonstrate the viability of our approach on instances of two optimisation problems: predicting optimal depth-of-win for the KRK endgame, and jobshop scheduling. Our results are promising: (i)On each iteration of distribution estimation, samples obtained with an ILP-assisted DBN have a substantially greater proportion of good solutions than samples generated using a DBN without ILP features, and (ii)On termination of distribution estimation, samples obtained using an ILP-assisted DBN contain more near-optimal samples than samples from a DBN without ILP features. These results suggest that the use of ILP-constructed theories could be useful for incorporating complex domain-knowledge into deep models for estimation of distribution based procedures.

Inductive logic programming

Multi-Sensor Prognostics using an Unsupervised Health Index based on LSTM Encoder-Decoder

no code implementations22 Aug 2016 Pankaj Malhotra, Vishnu Tv, Anusha Ramakrishnan, Gaurangi Anand, Lovekesh Vig, Puneet Agarwal, Gautam Shroff

Many approaches for estimation of Remaining Useful Life (RUL) of a machine, using its operational sensor data, make assumptions about how a system degrades or a fault evolves, e. g., exponential degradation.

Decoder Exponential degradation +2

Generation of Near-Optimal Solutions Using ILP-Guided Sampling

no code implementations3 Aug 2016 Ashwin Srinivasan, Gautam Shroff, Lovekesh Vig, Sarmimala Saikia, Puneet Agarwal

To answer this in the affirmative, we need: (a)a general-purpose technique for the incorporation of domain knowledge when constructing models for optimal values; and (b)a way of using these models to generate new data samples.

Inductive logic programming Job Shop Scheduling +1

LSTM-based Encoder-Decoder for Multi-sensor Anomaly Detection

8 code implementations1 Jul 2016 Pankaj Malhotra, Anusha Ramakrishnan, Gaurangi Anand, Lovekesh Vig, Puneet Agarwal, Gautam Shroff

Mechanical devices such as engines, vehicles, aircrafts, etc., are typically instrumented with numerous sensors to capture the behavior and health of the machine.

Anomaly Detection Decoder +4

ODE - Augmented Training Improves Anomaly Detection in Sensor Data from Machines

no code implementations5 May 2016 Mohit Yadav, Pankaj Malhotra, Lovekesh Vig, K Sriram, Gautam Shroff

The available data is then augmented with data generated from the ODE, and the anomaly detector is retrained on this augmented dataset.

Anomaly Detection Time Series +1

Long Short Term Memory Networks for Anomaly Detection in Time Series

1 code implementation ESANN 2015 Pankaj Malhotra, Lovekesh Vig, Gautam Shroff, Puneet Agarwal

Long Short Term Memory (LSTM) networks have been demonstrated to be particularly useful for learning sequences containing longer term patterns of unknown length, due to their ability to maintain long term memory.

Anomaly Detection Fault Detection +2

Efficiently Discovering Frequent Motifs in Large-scale Sensor Data

no code implementations2 Jan 2015 Puneet Agarwal, Gautam Shroff, Sarmimala Saikia, Zaigham Khan

While analyzing vehicular sensor data, we found that frequently occurring waveforms could serve as features for further analysis, such as rule mining, classification, and anomaly detection.

Anomaly Detection Clustering +2

Warranty Cost Estimation Using Bayesian Network

no code implementations11 Nov 2014 Karamjit Singh, Puneet Agarwal, Gautam Shroff

All multi-component product manufacturing companies face the problem of warranty cost estimation.

Multi-Sensor Event Detection using Shape Histograms

no code implementations16 Aug 2014 Ehtesham Hassan, Gautam Shroff, Puneet Agarwal

We present results on real-life vehicular sensor data and show that our technique performs better than available pattern detection implementations on our data, and that it can also be used to combine features from multiple sensors resulting in better accuracy than using any single sensor.

Event Detection Time Series +1

Cannot find the paper you are looking for? You can Submit a new open access paper.