no code implementations • 29 Jan 2025 • Yoshua Bengio, Sören Mindermann, Daniel Privitera, Tamay Besiroglu, Rishi Bommasani, Stephen Casper, Yejin Choi, Philip Fox, Ben Garfinkel, Danielle Goldfarb, Hoda Heidari, Anson Ho, Sayash Kapoor, Leila Khalatbari, Shayne Longpre, Sam Manning, Vasilios Mavroudis, Mantas Mazeika, Julian Michael, Jessica Newman, Kwan Yee Ng, Chinasa T. Okolo, Deborah Raji, Girish Sastry, Elizabeth Seger, Theodora Skeadas, Tobin South, Emma Strubell, Florian Tramèr, Lucia Velasco, Nicole Wheeler, Daron Acemoglu, Olubayo Adekanmbi, David Dalrymple, Thomas G. Dietterich, Edward W. Felten, Pascale Fung, Pierre-Olivier Gourinchas, Fredrik Heintz, Geoffrey Hinton, Nick Jennings, Andreas Krause, Susan Leavy, Percy Liang, Teresa Ludermir, Vidushi Marda, Helen Margetts, John McDermid, Jane Munga, Arvind Narayanan, Alondra Nelson, Clara Neppel, Alice Oh, Gopal Ramchurn, Stuart Russell, Marietje Schaake, Bernhard Schölkopf, Dawn Song, Alvaro Soto, Lee Tiedrich, Gaël Varoquaux, Andrew Yao, Ya-Qin Zhang, Fahad Albalawi, Marwan Alserkal, Olubunmi Ajala, Guillaume Avrin, Christian Busch, André Carlos Ponce de Leon Ferreira de Carvalho, Bronwyn Fox, Amandeep Singh Gill, Ahmet Halit Hatip, Juha Heikkilä, Gill Jolly, Ziv Katzir, Hiroaki Kitano, Antonio Krüger, Chris Johnson, Saif M. Khan, Kyoung Mu Lee, Dominic Vincent Ligot, Oleksii Molchanovskyi, Andrea Monti, Nusu Mwamanzi, Mona Nemer, Nuria Oliver, José Ramón López Portillo, Balaraman Ravindran, Raquel Pezoa Rivera, Hammam Riza, Crystal Rugege, Ciarán Seoighe, Jerry Sheehan, Haroon Sheikh, Denise Wong, Yi Zeng
The first International AI Safety Report comprehensively synthesizes the current evidence on the capabilities, risks, and safety of advanced AI systems.
no code implementations • 20 Jan 2025 • Ambreesh Parthasarathy, Chandrasekar Subramanian, Ganesh Senrayan, Shreyash Adappanavar, Aparna Taneja, Balaraman Ravindran, Milind Tambe
In this work, we study the effects on both task performance and fairness when the DLM algorithm, a recent work on using LLMs to design reward functions for RMABs, is prompted with non-English language commands.
no code implementations • 30 Nov 2024 • Sai Kiran Narayanaswami, Gopalakrishnan Srinivasan, Balaraman Ravindran
We propose optimizations that enhance the efficiency of QuAKE in commonly used exponential non-linearities such as Softmax, GELU, and the Logistic function.
no code implementations • 7 May 2024 • Atharvan Dogra, Krishna Pillutla, Ameet Deshpande, Ananya B Sai, John Nay, Tanmay Rajpurohit, Ashwin Kalyan, Balaraman Ravindran
We explore the ability of large language model (LLM)-based agents to engage in subtle deception such as strategically phrasing and intentionally manipulating information to misguide and deceive other agents.
no code implementations • 16 Feb 2024 • Yogesh Tripathi, Raghav Donakanti, Sahil Girhepuje, Ishan Kavathekar, Bhaskara Hanuma Vedula, Gokul S Krishnan, Shreya Goyal, Anmol Goel, Balaraman Ravindran, Ponnurangam Kumaraguru
Task performance and fairness scores of LLaMA and LLaMA--2 models indicate that the proposed $LSS_{\beta}$ metric can effectively determine the readiness of a model for safe usage in the legal sector.
no code implementations • 4 Dec 2023 • Gokul S Krishnan, Sarala Padi, Craig S. Greenberg, Balaraman Ravindran, Dinesh Manoch, Ram D. Sriram
To overcome these limitations, we propose novel line conversation graph convolutional network (LineConGCN) and graph attention (LineConGAT) models for ERC analysis.
no code implementations • 20 Nov 2023 • Omkar Shelke, Pranavi Pathakota, Anandsingh Chauhan, Harshad Khadilkar, Hardik Meisheri, Balaraman Ravindran
This paper presents an integrated algorithmic framework for minimising product delivery costs in e-commerce (known as the cost-to-serve or C2S).
no code implementations • 25 Jun 2023 • Saket Gurukar, Shaileshh Bojja Venkatakrishnan, Balaraman Ravindran, Srinivasan Parthasarathy
Specifically, the subgraph-based sampling approaches such as ClusterGCN and GraphSAINT have achieved state-of-the-art performance on the node classification tasks.
no code implementations • 30 May 2023 • Returaj Burnwal, Anirban Santara, Nirav P. Bhatt, Balaraman Ravindran, Gaurav Aggarwal
We propose a novel approach that uses a generative adversarial network (GAN) to minimize the Jensen-Shannon divergence between the state-trajectory distributions of the demonstrator and the imitator.
1 code implementation • 23 May 2023 • Sudarsun Santhiappan, Nitin Shravan, Balaraman Ravindran
We also propose an end-to-end Automated ML system for data classification based on our model selection method.
no code implementations • 12 Apr 2023 • Aravind Venugopal, Stephanie Milani, Fei Fang, Balaraman Ravindran
Unlike existing models, MABL is capable of encoding essential global information into the latent states during training while guaranteeing the decentralized execution of learned policies.
no code implementations • 13 Mar 2023 • Sahil Girhepuje, Anmol Goel, Gokul S Krishnan, Shreya Goyal, Satyendra Pandey, Ponnurangam Kumaraguru, Balaraman Ravindran
We highlight the propagation of learnt algorithmic biases in the bail prediction task for models trained on Hindi legal documents.
1 code implementation • 5 Dec 2022 • Adithya Ramesh, Balaraman Ravindran
In these environments, the physics-informed version of our algorithm achieves significantly better average-return and sample efficiency.
no code implementations • 27 Nov 2022 • Neeraja Kirtane, Jeshuren Chelladurai, Balaraman Ravindran, Ashish Tendulkar
Changing data composition is a popular way to address the imbalance in node classification.
no code implementations • 14 Jul 2022 • Jeshuren Chelladurai, Sudarsun Santhiappan, Balaraman Ravindran
We propose to automate this manual process by automatically constructing a query for the IR system using the entities auto-extracted from the clinical notes.
1 code implementation • 25 Jun 2022 • Jatin Chauhan, Aravindan Raghuveer, Rishi Saket, Jay Nandy, Balaraman Ravindran
Through systematic experiments across 4 datasets and 5 forecast models, we show that our technique is able to recover close to 95\% performance of the models even when only 15\% of the original variables are present.
no code implementations • 12 Jun 2022 • Kushal Chauhan, Soumya Chatterjee, Akash Reddy, Balaraman Ravindran, Pradeep Shenoy
The options framework in Hierarchical Reinforcement Learning breaks down overall goals into a combination of options or simpler tasks and associated policies, allowing for abstraction in the action space.
no code implementations • 29 Apr 2022 • Adam Żychowski, Jacek Mańdziuk, Elizabeth Bondi, Aravind Venugopal, Milind Tambe, Balaraman Ravindran
Green Security Games have become a popular way to model scenarios involving the protection of natural resources, such as wildlife.
no code implementations • 12 Mar 2022 • Shreya Goyal, Sumanth Doddapaneni, Mitesh M. Khapra, Balaraman Ravindran
In the past few years, it has become increasingly evident that deep neural networks are not resilient enough to withstand adversarial perturbations in input data, leaving them vulnerable to attack.
no code implementations • 29 Nov 2021 • Pavan Ravishankar, Pranshu Malviya, Balaraman Ravindran
We prove this result for the non-trivial non-parametric model setting when the cumulative unfairness cannot be expressed in terms of edge unfairness.
no code implementations • 3 Nov 2021 • Sapana Chaudhary, Balaraman Ravindran
We call our new smooth IL algorithm \textit{Smooth Policy and Cost Imitation Learning} (SPaCIL, pronounced 'Special').
no code implementations • 15 Oct 2021 • Amrit Diggavi Seshadri, Balaraman Ravindran
Synthesizing high-quality, realistic images from text-descriptions is a challenging task, and current methods synthesize images from text in a multi-stage manner, typically by first generating a rough initial image and then refining image details at subsequent stages.
no code implementations • 15 Oct 2021 • Harsha Kokel, Arjun Manoharan, Sriraam Natarajan, Balaraman Ravindran, Prasad Tadepalli
State abstraction enables sample-efficient learning and better task transfer in complex reinforcement learning environments.
1 code implementation • 5 Oct 2021 • Anasua Mitra, Priyesh Vijayan, Ranbir Sanasam, Diganta Goswami, Srinivasan Parthasarathy, Balaraman Ravindran
Multiplex networks are complex graph structures in which a set of entities are connected to each other via multiple types of relations, each relation representing a distinct layer.
no code implementations • ICLR 2022 • Chandrasekar Subramanian, Balaraman Ravindran
We study a contextual bandit setting where the learning agent has the ability to perform interventions on targeted subsets of the population, apart from possessing qualitative causal side-information.
no code implementations • AKBC 2021 • Karthik V, Beethika Tripathi, Mitesh M Khapra, Balaraman Ravindran
However, we find that existing approaches suffer from one or more of four drawbacks – 1) They are not modular with respect to the choice of the KG embedding model 2) They ignore best practices for aligning two embedding spaces 3) They do not account for differences in training strategy needed when presented with datasets with different description sizes and 4) They do not produce entity embeddings for use by downstream tasks.
1 code implementation • 11 May 2021 • Pranshu Malviya, Balaraman Ravindran, Sarath Chandar
We also show that our method performs better than several state-of-the-art methods in lifelong learning on complex datasets with a large number of tasks.
no code implementations • 7 Mar 2021 • Siddharth Nishtala, Lovish Madaan, Aditya Mate, Harshavardhan Kamarthi, Anirudh Grama, Divy Thakkar, Dhyanesh Narayanan, Suresh Chaudhary, Neha Madhiwalla, Ramesh Padmanabhan, Aparna Hegde, Pradeep Varakantham, Balaraman Ravindran, Milind Tambe
India has a maternal mortality ratio of 113 and child mortality ratio of 2830 per 100, 000 live births.
1 code implementation • 6 Feb 2021 • Deepak Maurya, Balaraman Ravindran
This is further used to propose a hyperedge prediction algorithm.
no code implementations • 7 Jan 2021 • Jahnvi Patel, Devika Jay, Balaraman Ravindran, K. Shanti Swarup
In this paper, a pioneer work on learning optimal bidding strategies from observation and experience in a three-stage reactive power market is reported.
no code implementations • 7 Jan 2021 • Nahas Pareekutty, Francis James, Balaraman Ravindran, Suril V. Shah
This paper presents a sampling-based method for optimal motion planning in non-holonomic systems in the absence of known cost functions.
no code implementations • 18 Dec 2020 • Aravind Venugopal, Elizabeth Bondi, Harshavardhan Kamarthi, Keval Dholakia, Balaraman Ravindran, Milind Tambe
We therefore first propose a novel GSG model that combines defender allocation, patrolling, real-time drone notification to human patrollers, and drones sending warning signals to attackers.
Decision Making
Multiagent Systems
1 code implementation • 16 Dec 2020 • Ashutosh Kakadiya, Sriraam Natarajan, Balaraman Ravindran
Contextual bandits algorithms have become essential in real-world user interaction problems in recent years.
no code implementations • 16 Nov 2020 • Deepak Maurya, Balaraman Ravindran
We also show improvement for the min-cut solution on 2-uniform hypergraphs (graphs) over the standard spectral partitioning algorithm.
no code implementations • 30 Oct 2020 • Anand A. Rajasekar, Karthik Raman, Balaraman Ravindran
One challenging and essential task in biochemistry is the generation of novel molecules with desired properties.
no code implementations • 2 Oct 2020 • Anirban Santara, Sohan Rudra, Sree Aditya Buridi, Meha Kaushik, Abhishek Naik, Bharat Kaul, Balaraman Ravindran
In this work, we present MADRaS, an open-source multi-agent driving simulator for use in the design and evaluation of motion planning algorithms for autonomous driving.
no code implementations • 18 Aug 2020 • Siddharth Nayak, Balaraman Ravindran
The performance of a trained object detection neural network depends a lot on the image quality.
no code implementations • 10 Jul 2020 • Pavan Ravishankar, Pranshu Malviya, Balaraman Ravindran
Unlike previous works that only make cautionary claims of discrimination and de-biases data after its generation, this paper attempts to prioritize unfair sources before mitigating their unfairness in the real-world.
no code implementations • 13 Jun 2020 • Siddharth Nishtala, Harshavardhan Kamarthi, Divy Thakkar, Dhyanesh Narayanan, Anirudh Grama, Aparna Hegde, Ramesh Padmanabhan, Neha Madhiwalla, Suresh Chaudhary, Balaraman Ravindran, Milind Tambe
India accounts for 11% of maternal deaths globally where a woman dies in childbirth every fifteen minutes.
1 code implementation • 7 Jun 2020 • Nazneen N Sultana, Hardik Meisheri, Vinita Baniwal, Somjit Nath, Balaraman Ravindran, Harshad Khadilkar
This paper describes the application of reinforcement learning (RL) to multi-product inventory management in supply chains.
1 code implementation • WS 2020 • Nikita Moghe, Priyesh Vijayan, Balaraman Ravindran, Mitesh M. Khapra
This requires capturing structural, sequential and semantic information from the conversation context and the background resources.
1 code implementation • 9 May 2020 • Sravan Mylavarapu, Mahtab Sandhu, Priyesh Vijayan, K. Madhava Krishna, Balaraman Ravindran, Anoop Namboodiri
We present a novel Multi-Relational Graph Convolutional Network (MRGCN) based framework to model on-road vehicle behaviors from a sequence of temporally ordered frames as grabbed by a moving monocular camera.
Ranked #1 on
Test results
on KITTI
2 code implementations • ACL 2020 • Akash Kumar Mohankumar, Preksha Nema, Sharan Narasimhan, Mitesh M. Khapra, Balaji Vasan Srinivasan, Balaraman Ravindran
To make attention mechanisms more faithful and plausible, we propose a modified LSTM cell with a diversity-driven training objective that ensures that the hidden representations learned at different time steps are diverse.
1 code implementation • ICLR 2020 • Sanchari Sen, Balaraman Ravindran, Anand Raghunathan
Our results indicate that EMPIR boosts the average adversarial accuracies by 42. 6%, 15. 2% and 10. 5% for the DNN models trained on the MNIST, CIFAR-10 and ImageNet datasets respectively, when compared to single full-precision models, without sacrificing accuracy on the unperturbed inputs.
1 code implementation • Proceedings of the 2020 SIAM International Conference on Data Mining 2020 • Anasua Mitra, Priyesh Vijayan, Srinivasan Parthasarathy, Balaraman Ravindran
We propose a Semi-Supervised Learning (SSL) methodology that explicitly encodes different necessary priors to learn efficient representations for nodes in a network.
1 code implementation • 3 Feb 2020 • Sravan Mylavarapu, Mahtab Sandhu, Priyesh Vijayan, K. Madhava Krishna, Balaraman Ravindran, Anoop Namboodiri
Understanding on-road vehicle behaviour from a temporal sequence of sensor data is gaining in popularity.
no code implementations • 15 Jan 2020 • Rohan Saphal, Balaraman Ravindran, Dheevatsa Mudigere, Sasikanth Avancha, Bharat Kaul
However, ensemble methods are relatively less popular in reinforcement learning owing to the high sample complexity and computational expense involved in obtaining a diverse ensemble.
no code implementations • 1 Oct 2019 • Hardik Meisheri, Vinita Baniwal, Nazneen N Sultana, Balaraman Ravindran, Harshad Khadilkar
This paper describes a purely data-driven solution to a class of sequential decision-making problems with a large number of concurrent online decisions, with applications to computing systems and operations research.
no code implementations • 9 Sep 2019 • Arjun Manoharan, Rahul Ramesh, Balaraman Ravindran
Option discovery and skill acquisition frameworks are integral to the functioning of a Hierarchically organized Reinforcement learning agent.
1 code implementation • IJCNLP 2019 • Preksha Nema, Akash Kumar Mohankumar, Mitesh M. Khapra, Balaji Vasan Srinivasan, Balaraman Ravindran
It is desired that the generated question should be (i) grammatically correct (ii) answerable from the passage and (iii) specific to the given answer.
1 code implementation • 8 Jul 2019 • Harshavardhan Kamarthi, Priyesh Vijayan, Bryan Wilder, Balaraman Ravindran, Milind Tambe
A serious challenge when finding influential actors in real-world social networks is the lack of knowledge about the structure of the underlying network.
no code implementations • 27 Jun 2019 • Anirban Santara, Rishabh Madan, Balaraman Ravindran, Pabitra Mitra
Given an optimal policy in a related task-environment, we show that its bisimulation distance from the current task-environment gives a lower bound on the optimal advantage of state-action pairs in the current task-environment.
no code implementations • 17 Jun 2019 • Abhishek Ghose, Balaraman Ravindran
Additionally, we show that (1) it is more accurate than its predecessor, (2) requires only one hyperparameter to be set in practice, (3) accommodates a multi-variate notion of model size, e. g., both maximum depth of a tree and number of trees in Gradient Boosted Models, and (4) works across different feature spaces between the uncertainty oracle and the interpretable model, e. g., a GRU might act as an oracle for a decision tree that ingests n-grams.
no code implementations • 17 May 2019 • Manan Tomar, Akhil Sathuluri, Balaraman Ravindran
Shaping in humans and animals has been shown to be a powerful tool for learning complex tasks as compared to learning in a randomized fashion.
1 code implementation • 14 May 2019 • Rahul Ramesh, Manan Tomar, Balaraman Ravindran
This work adopts a complementary approach, where we attempt to discover options that navigate to landmark states.
no code implementations • 4 May 2019 • Abhishek Ghose, Balaraman Ravindran
Our technique identifies the training data distribution to learn from that leads to the highest accuracy for a model of a given size.
1 code implementation • 2 May 2019 • Saket Gurukar, Priyesh Vijayan, Aakash Srinivasan, Goonmeet Bajaj, Chen Cai, Moniba Keymanesh, Saravana Kumar, Pranav Maneriker, Anasua Mitra, Vedang Patel, Balaraman Ravindran, Srinivasan Parthasarathy
An important area of research that has emerged over the last decade is the use of graphs as a vehicle for non-linear dimensionality reduction in a manner akin to previous efforts based on manifold learning with uses for downstream database processing, machine learning and visualization.
no code implementations • 5 Feb 2019 • Harish Kumar, Balaraman Ravindran
On top of our LSTM neural network that learnt musical sequences in this representation, we built an RL agent that learnt to find combinations of songs whose joint dominance produced pleasant compositions.
no code implementations • 1 Jan 2019 • Sai Kiran Narayanaswami, Nandan Sudarsanam, Balaraman Ravindran
Robust Policy Search is the problem of learning policies that do not degrade in performance when subject to unseen environment model parameters.
no code implementations • 28 Dec 2018 • Tarun Kumar, Sankaran Vaidyanathan, Harini Ananthapadmanabhan, Srinivasan Parthasarathy, Balaraman Ravindran
Clustering on hypergraphs has been garnering increased attention with potential applications in network analysis, VLSI design and computer vision, among others.
1 code implementation • 26 Dec 2018 • Deepak Mittal, Shweta Bhardwaj, Mitesh M. Khapra, Balaraman Ravindran
In this work, we report experiments which suggest that the comparable performance of the pruned network is not due to the specific criterion chosen but due to the inherent plasticity of deep neural networks which allows them to recover from the loss of pruned filters once the rest of the filters are fine-tuned.
no code implementations • 23 Dec 2018 • Vignesh Prasad, Karmesh Yadav, Rohitashva Singh Saurabh, Swapnil Daga, Nahas Pareekutty, K. Madhava Krishna, Balaraman Ravindran, Brojeshwar Bhowmick
Monocular SLAM refers to using a single camera to estimate robot ego motion while building a map of the environment.
Robotics
no code implementations • 1 Dec 2018 • Ameet Deshpande, Harshavardhan Kamarthi, Balaraman Ravindran
Learning options that allow agents to exhibit temporally higher order behavior has proven to be useful in increasing exploration, reducing sample complexity and for various transfer scenarios.
no code implementations • 27 Sep 2018 • Manan Tomar*, Rahul Ramesh*, Balaraman Ravindran
Additionally, we describe an Incremental Successor options model that iteratively builds options and explores in environments where exploration through primitive actions is inadequate to form the Successor Representations.
Hierarchical Reinforcement Learning
reinforcement-learning
+2
no code implementations • 16 Sep 2018 • Ameet Deshpande, Srikanth Sarma, Ashutosh Jha, Balaraman Ravindran
One such approach is Hindsight Experience replay which uses an off-policy Reinforcement Learning algorithm to learn a goal conditioned policy.
no code implementations • 5 Sep 2018 • Athindran Ramesh Kumar, Balaraman Ravindran, Anand Raghunathan
Based on these observations, we propose Pack and Detect (PaD), an approach to reduce the computational requirements of object detection in videos.
1 code implementation • 31 May 2018 • Priyesh Vijayan, Yash Chandak, Mitesh M. Khapra, Srinivasan Parthasarathy, Balaraman Ravindran
Given a graph where every node has certain attributes associated with it and some nodes have labels associated with them, Collective Classification (CC) is the task of assigning labels to every unlabeled node using information from the node as well as its neighbors.
1 code implementation • 31 May 2018 • Priyesh Vijayan, Yash Chandak, Mitesh M. Khapra, Srinivasan Parthasarathy, Balaraman Ravindran
State-of-the-art models for node classification on such attributed graphs use differentiable recursive functions that enable aggregation and filtering of neighborhood information from multiple hops.
no code implementations • 17 May 2018 • Ghulam Ahmed Ansari, Sagar J P, Sarath Chandar, Balaraman Ravindran
Text-based games are suitable test-beds for designing agents that can learn by interaction with the environment in the form of natural language text.
no code implementations • 27 Feb 2018 • Parijat Dewangan, S Phaniteja, K. Madhava Krishna, Abhishek Sarkar, Balaraman Ravindran
In this paper, we propose a new approach for simultaneous training of multiple tasks sharing a set of common actions in continuous action spaces, which we call as DiGrad (Differential Policy Gradient).
1 code implementation • 31 Jan 2018 • Deepak Mittal, Shweta Bhardwaj, Mitesh M. Khapra, Balaraman Ravindran
In this work, we report experiments which suggest that the comparable performance of the pruned network is not due to the specific criterion chosen but due to the inherent plasticity of deep neural networks which allows them to recover from the loss of pruned filters once the rest of the filters are fine-tuned.
no code implementations • 14 Dec 2017 • Nandan Sudarsanam, Nishanth Kumar, Abhishek Sharma, Balaraman Ravindran
We present a comprehensive analysis of 50 interestingness measures and classify them in accordance with the two properties.
no code implementations • 9 Nov 2017 • Subhojyoti Mukherjee, K. P. Naveen, Nandan Sudarsanam, Balaraman Ravindran
We propose a novel variant of the UCB algorithm (referred to as Efficient-UCB-Variance (EUCBV)) for minimizing cumulative regret in the stochastic multi-armed bandit (MAB) setting.
no code implementations • 14 Sep 2017 • Rakesh R. Menon, Balaraman Ravindran
Deep Reinforcement Learning has been able to achieve amazing successes in a variety of domains from video games to continuous control by trying to maximize the cumulative reward.
1 code implementation • 20 Jul 2017 • Anirban Santara, Abhishek Naik, Balaraman Ravindran, Dipankar Das, Dheevatsa Mudigere, Sasikanth Avancha, Bharat Kaul
Generative Adversarial Imitation Learning (GAIL) is a state-of-the-art algorithm for learning policies when the expert's behavior is available as a fixed set of trajectories.
no code implementations • ICLR 2018 • Sahil Sharma, Girish Raguvir J, Srivatsan Ramesh, Balaraman Ravindran
Our second major contribution is that we propose a generalization of lambda-returns called Confidence-based Autodidactic Returns (CAR), wherein the RL agent learns the weighting of the n-step returns in an end-to-end manner.
no code implementations • 20 May 2017 • Sahil Sharma, Aravind Suresh, Rahul Ramesh, Balaraman Ravindran
Deep Reinforcement Learning (DRL) methods have performed well in an increasing numbering of high-dimensional visual decision making domains.
2 code implementations • ACL 2017 • Preksha Nema, Mitesh Khapra, Anirban Laha, Balaraman Ravindran
Abstractive summarization aims to generate a shorter version of the document covering all the salient points in a compact and coherent fashion.
Ranked #2 on
Query-Based Extractive Summarization
on Debatepedia
no code implementations • 7 Apr 2017 • Subhojyoti Mukherjee, K. P. Naveen, Nandan Sudarsanam, Balaraman Ravindran
In this paper we propose the Augmented-UCB (AugUCB) algorithm for a fixed-budget version of the thresholding bandit problem (TBP), where the objective is to identify a set of arms whose quality is above a threshold.
no code implementations • 4 Apr 2017 • Sanjay Ganapathy, Swagath Venkataramani, Balaraman Ravindran, Anand Raghunathan
Complementary to these approaches, DyVEDeep is a dynamic approach that exploits the heterogeneity in the inputs to DNNs to improve their compute efficiency with comparable classification accuracy.
no code implementations • 20 Feb 2017 • Sahil Sharma, Aravind Srinivas, Balaraman Ravindran
Reinforcement Learning algorithms can learn complex behavioral patterns for sequential decision making tasks wherein an agent interacts with an environment and acquires feedback in the form of rewards sampled from it.
1 code implementation • ICLR 2018 • Sahil Sharma, Ashutosh Jha, Parikshit Hegde, Balaraman Ravindran
In this work, we propose an efficient multi-task learning framework which solves multiple goal-directed tasks in an on-line setup without the need for expert supervision.
no code implementations • 29 Nov 2016 • Sai Praveen Bangaru, JS Suhas, Balaraman Ravindran
Exploration in multi-task reinforcement learning is critical in training agents to deduce the underlying MDP.
no code implementations • 5 Oct 2016 • Aravind Rajeswaran, Sarvjeet Ghotra, Balaraman Ravindran, Sergey Levine
Sample complexity and safety are major challenges when learning policies with reinforcement learning for real-world tasks, especially when the policies are represented using rich function approximators like deep neural networks.
no code implementations • 17 May 2016 • Aravind Srinivas, Ramnandan Krishnamurthy, Peeyush Kumar, Balaraman Ravindran
This paper introduces an automated skill acquisition framework in reinforcement learning which involves identifying a hierarchical description of the given task in terms of abstract states and extended actions between abstract states.
no code implementations • 17 May 2016 • Aravind Srinivas, Sahil Sharma, Balaraman Ravindran
Deep Reinforcement Learning methods have achieved state of the art performance in learning control policies for the games in the Atari 2600 domain.
no code implementations • 4 May 2016 • Nandan Sudarsanam, Balaraman Ravindran
One of the proposed methods, X-Random bootstrap, performs better than the baselines in-terms of cumulative regret across various degrees of noise and different number of trials.
1 code implementation • NAACL 2016 • Janarthanan Rajendran, Mitesh M. Khapra, Sarath Chandar, Balaraman Ravindran
In this work, we address a real-world scenario where no direct parallel data is available between two views of interest (say, $V_1$ and $V_2$) but parallel data is available between each of these views and a pivot view ($V_3$).
2 code implementations • 10 Oct 2015 • Janarthanan Rajendran, Aravind Srinivas, Mitesh M. Khapra, P. Prasanna, Balaraman Ravindran
Second, the agent should be able to selectively transfer, which is the ability to select and transfer from different and multiple source tasks for different parts of the state space of the target task.
no code implementations • 10 Oct 2015 • P. Prasanna, Sarath Chandar, Balaraman Ravindran
In this paper, we propose TSEB, a Thompson Sampling based algorithm with adaptive exploration bonus that aims to solve the problem with tighter PAC guarantees, while being cautious on the regret as well.
no code implementations • 24 Jul 2015 • Abhinav Garlapati, aditi raghunathan, Vaishnavh Nagarajan, Balaraman Ravindran
Online decision tree learning algorithms typically examine all features of a new data point to update model parameters.
2 code implementations • 27 Apr 2015 • Sarath Chandar, Mitesh M. Khapra, Hugo Larochelle, Balaraman Ravindran
CCA based approaches learn a joint representation by maximizing correlation of the views when projected to the common subspace.
no code implementations • NeurIPS 2014 • Sarath Chandar A P, Stanislas Lauly, Hugo Larochelle, Mitesh M. Khapra, Balaraman Ravindran, Vikas Raykar, Amrita Saha
Cross-language learning allows us to use training data from one language to build models for a different language.