no code implementations • 20 Nov 2023 • Omkar Shelke, Pranavi Pathakota, Anandsingh Chauhan, Harshad Khadilkar, Hardik Meisheri, Balaraman Ravindran
This paper presents an integrated algorithmic framework for minimising product delivery costs in e-commerce (known as the cost-to-serve or C2S).
no code implementations • 28 Jun 2023 • Pranavi Pathakota, Hardik Meisheri, Harshad Khadilkar
The ability to learn robust policies while generalizing over large discrete action spaces is an open challenge for intelligent systems, especially in noisy environments that face the curse of dimensionality.
no code implementations • 28 Oct 2022 • Harshad Khadilkar, Hardik Meisheri
A significant challenge in reinforcement learning is quantifying the complex relationship between actions and long-term rewards.
no code implementations • 2 Mar 2022 • Hardik Meisheri, Somjit Nath, Mayank Baranwal, Harshad Khadilkar
Through empirical evaluations, it is further shown that the inventory management with uncertain lead times is not only equivalent to that of delay in information sharing across multiple echelons (\emph{observation delay}), a model trained to handle one kind of delay is capable to handle delays of another kind without requiring to be retrained.
no code implementations • 2 Mar 2022 • Durgesh Kalwar, Omkar Shelke, Somjit Nath, Hardik Meisheri, Harshad Khadilkar
Exploration methods have been used to sample better trajectories in large environments while auxiliary tasks have been incorporated where the reward is sparse.
no code implementations • 16 Dec 2021 • Pranavi Pathakota, Kunwar Zaid, Anulekha Dhara, Hardik Meisheri, Shaun D Souza, Dheeraj Shah, Harshad Khadilkar
We describe a novel decision-making problem developed in response to the demands of retail electronic commerce (e-commerce).
no code implementations • 23 Feb 2021 • Omkar Shelke, Hardik Meisheri, Harshad Khadilkar
In this paper, we focus on developing a curriculum for learning a robust and promising policy in a constrained computational budget of 100, 000 games, starting from a fixed base policy (which is itself trained to imitate a noisy expert policy).
no code implementations • 1 Nov 2020 • Hardik Meisheri, Harshad Khadilkar
We describe our solution approach for Pommerman TeamRadio, a competition environment associated with NeurIPS 2019.
1 code implementation • 7 Jun 2020 • Nazneen N Sultana, Hardik Meisheri, Vinita Baniwal, Somjit Nath, Balaraman Ravindran, Harshad Khadilkar
This paper describes the application of reinforcement learning (RL) to multi-product inventory management in supply chains.
no code implementations • 12 Nov 2019 • Hardik Meisheri, Omkar Shelke, Richa Verma, Harshad Khadilkar
Our methodology involves training an agent initially through imitation learning on a noisy expert policy, followed by a proximal-policy optimization (PPO) reinforcement learning algorithm.
no code implementations • 1 Oct 2019 • Hardik Meisheri, Vinita Baniwal, Nazneen N Sultana, Balaraman Ravindran, Harshad Khadilkar
This paper describes a purely data-driven solution to a class of sequential decision-making problems with a large number of concurrent online decisions, with applications to computing systems and operations research.
no code implementations • WS 2018 • Hardik Meisheri, Harshad Khadilkar
Most of the existing state of the art sentiment classification techniques involve the use of pre-trained embeddings.
no code implementations • SEMEVAL 2018 • Hardik Meisheri, Lipika Dey
This paper presents system description of our submission to the SemEval-2018 task-1: Affect in tweets for the English language.
no code implementations • 25 Feb 2018 • Hardik Meisheri, Nagraj Ramrao, Suman Mitra
In addition to this non-stationarity present in the features extracted from the CSP present a challenge in classification.
no code implementations • 7 Oct 2017 • Kaustubh Mani, Ishan Verma, Hardik Meisheri, Lipika Dey
As the number of documents on the web is growing exponentially, multi-document summarization is becoming more and more important since it can provide the main ideas in a document set in short time.
no code implementations • WS 2017 • Hardik Meisheri, Rupsa Saha, Priyanka Sinha, Lipika Dey
This paper describes our approach to the Emotion Intensity shared task.