no code implementations • 15 Sep 2024 • Bhawna Paliwal, Deepak Saini, Mudit Dhawan, Siddarth Asokan, Nagarajan Natarajan, Surbhi Aggarwal, Pankaj Malhotra, Jian Jiao, Manik Varma
In response, we propose Cross-encoders with Joint Efficient Modeling (CROSS-JEM), a novel ranking approach that enables transformer-based models to jointly score multiple items for a query, maximizing parameter utilization.
no code implementations • 4 May 2024 • Siddhant Kharbanda, Devaansh Gupta, Gururaj K, Pankaj Malhotra, Cho-Jui Hsieh, Rohit Babbar
While such methods have shown empirical success, we observe two key uncharted aspects, (i) DE training typically uses only a single positive relation even for datasets which offer more, (ii) existing approaches fixate on using only OvA reduction of the multi-label problem.
Extreme Multi-Label Classification
MUlTI-LABEL-ClASSIFICATION
no code implementations • 28 Feb 2024 • Anshul Mittal, Shikhar Mohan, Deepak Saini, Siddarth Asokan, Suchith C. Prabhu, Lakshya Kumar, Pankaj Malhotra, Jain jiao, Amit Singh, Sumeet Agarwal, Soumen Chakrabarti, Purushottam Kar, Manik Varma
The paper notices that in these settings, it is much more effective to use graph data to regularize encoder training than to implement a GCN.
no code implementations • 14 Mar 2022 • Vibhor Gupta, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff
We note that existing continual learning methods do not take into account variability in input dimensions arising due to different subsets of sensors being available across tasks, and struggle to adapt to such variable input dimensions (VID) tasks.
no code implementations • 25 Feb 2022 • Diksha Garg, Pankaj Malhotra, Anil Bhatia, Sanjay Bhat, Lovekesh Vig, Gautam Shroff
We consider learning a trading agent acting on behalf of the treasury of a firm earning revenue in a foreign currency (FC) and incurring expenses in the home currency (HC).
no code implementations • 11 Feb 2022 • Jyoti Narwariya, Chetan Verma, Pankaj Malhotra, Lovekesh Vig, Easwara Subramanian, Sanjay Bhat
One of the objectives of such demand response management is to incentivize the consumers to adjust their consumption so that the overall electricity procurement in the wholesale markets is minimized, e. g. it is desirable that consumers consume less during peak hours when cost of procurement for brokers from wholesale markets are high.
1 code implementation • 10 Feb 2021 • Hritik Bansal, Gantavya Bhatt, Pankaj Malhotra, Prathosh A. P
Systematic generalization aims to evaluate reasoning about novel combinations from known components, an intrinsic property of human cognition.
no code implementations • 16 Dec 2020 • Diksha Garg, Priyanka Gupta, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff
Most of the existing deep reinforcement learning (RL) approaches for session-based recommendations either rely on costly online interactions with real users, or rely on potentially biased rule-based or data-driven user-behavior models for learning.
Deep Reinforcement Learning
Distributional Reinforcement Learning
+4
no code implementations • 1 Jul 2020 • Vibhor Gupta, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff
Such a combinatorial generalization is achieved by conditioning the layers of a core neural network-based time series model with a "conditioning vector" that carries information of the available combination of sensors for each time series.
no code implementations • 30 Jun 2020 • Jyoti Narwariya, Pankaj Malhotra, Vishnu Tv, Lovekesh Vig, Gautam Shroff
Deep learning models such as those based on recurrent neural networks (RNNs) or convolutional neural networks (CNNs) fail to explicitly leverage this potentially rich source of domain-knowledge into the learning procedure.
no code implementations • 13 Sep 2019 • Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff, Vishnu Tv
We overcome this limitation in order to train a common agent across domains with each domain having different number of target classes, we utilize a triplet-loss based learning procedure that does not require any constraints to be enforced on the number of classes for the few-shot TSC tasks.
2 code implementations • 10 Sep 2019 • Priyanka Gupta, Diksha Garg, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff
The models using normalized item and session-graph representations perform significantly better: i. for the less popular long-tail items in the offline setting, and ii.
Ranked #2 on
Session-Based Recommendations
on Last.FM
no code implementations • 16 Jul 2019 • Vishnu TV, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff
Recently, neural networks trained as optimizers under the "learning to learn" or meta-learning framework have been shown to be effective for a broad range of optimization tasks including derivative-free black-box function optimization.
no code implementations • 29 Apr 2019 • Kathan Kashiparekh, Jyoti Narwariya, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff
We also provide qualitative insights into the working of CTN by: i) analyzing the activations and filters of first convolution layer suggesting the filters in CTN are generically useful, ii) analyzing the impact of the design decision to incorporate multiple length decisions, and iii) finding regions of time series that affect the final classification decision via occlusion sensitivity analysis.
no code implementations • 1 Apr 2019 • Priyanka Gupta, Pankaj Malhotra, Jyoti Narwariya, Lovekesh Vig, Gautam Shroff
We, therefore, conclude that pre-trained deep models like TimeNet and HealthNet allow leveraging the advantages of deep learning for clinical time series analysis tasks, while also minimize dependence on hand-crafted features, deal robustly with scarce labeled training data scenarios without overfitting, as well as reduce dependence on expertise and resources required to train deep networks from scratch.
no code implementations • 23 Mar 2019 • Vishnu TV, Diksha, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff
We propose a novel deep learning based approach for Prognostics with Uncertainty Quantification that is useful in scenarios where: (i) access to labeled failure data is scarce due to rarity of failures (ii) future operational conditions are unobserved and (iii) inherent noise is present in the sensor readings.
no code implementations • 4 Jul 2018 • Priyanka Gupta, Pankaj Malhotra, Lovekesh Vig, Gautam Shroff
We consider transferring the knowledge captured in an RNN trained on several source tasks simultaneously using a large labeled dataset to build the model for a target task with limited labeled data.
no code implementations • 4 Sep 2017 • Narendhar Gugulothu, Vishnu Tv, Pankaj Malhotra, Lovekesh Vig, Puneet Agarwal, Gautam Shroff
We propose Embed-RUL: a novel approach for RUL estimation from sensor data that does not rely on any degradation-trend assumptions, is robust to noise, and handles missing values.
2 code implementations • 23 Jun 2017 • Pankaj Malhotra, Vishnu Tv, Lovekesh Vig, Puneet Agarwal, Gautam Shroff
Inspired by the tremendous success of deep Convolutional Neural Networks as generic feature extractors for images, we propose TimeNet: a deep recurrent neural network (RNN) trained on diverse time series in an unsupervised manner using sequence to sequence (seq2seq) models to extract features from time series.
no code implementations • 22 Aug 2016 • Pankaj Malhotra, Vishnu Tv, Anusha Ramakrishnan, Gaurangi Anand, Lovekesh Vig, Puneet Agarwal, Gautam Shroff
Many approaches for estimation of Remaining Useful Life (RUL) of a machine, using its operational sensor data, make assumptions about how a system degrades or a fault evolves, e. g., exponential degradation.
8 code implementations • 1 Jul 2016 • Pankaj Malhotra, Anusha Ramakrishnan, Gaurangi Anand, Lovekesh Vig, Puneet Agarwal, Gautam Shroff
Mechanical devices such as engines, vehicles, aircrafts, etc., are typically instrumented with numerous sensors to capture the behavior and health of the machine.
no code implementations • 5 May 2016 • Mohit Yadav, Pankaj Malhotra, Lovekesh Vig, K Sriram, Gautam Shroff
The available data is then augmented with data generated from the ODE, and the anomaly detector is retrained on this augmented dataset.
1 code implementation • ESANN 2015 • Pankaj Malhotra, Lovekesh Vig, Gautam Shroff, Puneet Agarwal
Long Short Term Memory (LSTM) networks have been demonstrated to be particularly useful for learning sequences containing longer term patterns of unknown length, due to their ability to maintain long term memory.