no code implementations • 1 Dec 2024 • Phu Pham, Damon Conover, Aniket Bera
We present FlashSLAM, a novel SLAM approach that leverages 3D Gaussian Splatting for efficient and robust 3D scene reconstruction.
1 code implementation • 31 Oct 2024 • AmirMohammad Tahmasbi, MohammadSaleh Faghfoorian, Saeed Khodaygan, Aniket Bera
Path planning in high-dimensional spaces poses significant challenges, particularly in achieving both time efficiency and a fair success rate.
no code implementations • 26 Oct 2024 • Kai Cheng, Zhengyuan Li, Xingpeng Sun, Byung-Cheol Min, Amrit Singh Bedi, Aniket Bera
Embodied Question Answering (EQA) is an essential yet challenging task for robotic home assistants.
no code implementations • 30 Sep 2024 • Divyanshu Daiya, Damon Conover, Aniket Bera
We propose a novel framework COLLAGE for generating collaborative agent-object-agent interactions by leveraging large language models (LLMs) and hierarchical motion-specific vector-quantized variational autoencoders (VQ-VAEs).
no code implementations • 28 Sep 2024 • Yi Wu, Zikang Xiong, Yiran Hu, Shreyash S. Iyengar, Nan Jiang, Aniket Bera, Lin Tan, Suresh Jagannathan
We demonstrate the effectiveness and generalizability of SELP across different robot agents and tasks, including drone navigation and robot manipulation.
no code implementations • 25 Sep 2024 • Phu Pham, Dipam Patel, Damon Conover, Aniket Bera
We introduce Go-SLAM, a novel framework that utilizes 3D Gaussian Splatting SLAM to reconstruct dynamic environments while embedding object-level information within the scene representations.
no code implementations • 25 Sep 2024 • Vineet Punyamoorty, Pascal Jutras-Dubé, Ruqi Zhang, Vaneet Aggarwal, Damon Conover, Aniket Bera
By framing reinforcement learning as a sequence modeling problem, recent work has enabled the use of generative models, such as diffusion models, for planning.
1 code implementation • 25 Sep 2024 • Apoorva Vashisth, Dipam Patel, Damon Conover, Aniket Bera
We propose a novel deep reinforcement learning approach for multi-robot informative path planning to map targets-of-interest in an unknown 3D environment.
no code implementations • 10 Sep 2024 • Phu Pham, Aradhya N. Mathur, Ojaswa Sharma, Aniket Bera
The field of text-to-3D content generation has made significant progress in generating realistic 3D objects, with existing methodologies like Score Distillation Sampling (SDS) offering promising guidance.
no code implementations • 2 Aug 2024 • Pascal Jutras-Dubé, Ruqi Zhang, Aniket Bera
Planning with generative models has emerged as an effective decision-making paradigm across a wide range of domains, including reinforcement learning and autonomous navigation.
1 code implementation • 4 Jul 2024 • Hrishikesh Viswanath, Yue Chang, Julius Berner, Peter Yichen Chen, Aniket Bera
We propose accelerating the simulation of Lagrangian dynamics, such as fluid flows, granular flows, and elastoplasticity, with neural-operator-based reduced-order modeling.
no code implementations • 26 Jun 2024 • Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha
To enhance the plausibility of synthesis, we use an adversarial discriminator that learns to differentiate between the face and pose motions computed from the original videos and our synthesized motions based on their affective expressions.
no code implementations • CVPR 2024 • Juanwu Lu, Can Cui, Yunsheng Ma, Aniket Bera, Ziran Wang
In this paper, we propose the Sequential Neural Variational Agent (SeNeVA), a generative model that describes the distribution of future trajectories for a single moving object.
no code implementations • 5 Feb 2024 • Xingpeng Sun, Haoming Meng, Souradip Chakraborty, Amrit Singh Bedi, Aniket Bera
While LLMs excel in processing text in these human conversations, they struggle with the nuances of verbal instructions in scenarios like social navigation, where ambiguity and uncertainty can erode trust in robotic and other AI systems.
1 code implementation • CVPR 2024 • Lu Ling, Yichen Sheng, Zhi Tu, Wentian Zhao, Cheng Xin, Kun Wan, Lantao Yu, Qianyu Guo, Zixun Yu, Yawen Lu, Xuanmao Li, Xingpeng Sun, Rohan Ashok, Aniruddha Mukherjee, Hao Kang, Xiangrui Kong, Gang Hua, Tianyi Zhang, Bedrich Benes, Aniket Bera
We have witnessed significant progress in deep learning-based 3D vision, ranging from neural radiance field (NeRF) based 3D representation learning to applications in novel view synthesis (NVS).
no code implementations • 8 Dec 2023 • Aradhya N. Mathur, Phu Pham, Aniket Bera, Ojaswa Sharma
Further, the recent work of Denoising Diffusion Policy Optimization (DDPO) demonstrates that the diffusion process is compatible with policy gradient methods and has been demonstrated to improve the 2D diffusion models using an aesthetic scoring function.
1 code implementation • CVPR 2024 • Yunsheng Ma, Can Cui, Xu Cao, Wenqian Ye, Peiran Liu, Juanwu Lu, Amr Abdelraouf, Rohit Gupta, Kyungtae Han, Aniket Bera, James M. Rehg, Ziran Wang
Autonomous driving (AD) has made significant strides in recent years.
no code implementations • 6 Dec 2023 • Nihal Gunukula, Kshitij Tiwari, Aniket Bera
In emergency scenarios, mobile robots must navigate like humans, interpreting stimuli to locate potential victims rapidly without interfering with first responders.
no code implementations • 1 Nov 2023 • Nishaant Shah, Kshitij Tiwari, Aniket Bera
Urban search and rescue missions require rapid first response to minimize loss of life and damage.
no code implementations • 29 Sep 2023 • Dipam Patel, Phu Pham, Kshitij Tiwari, Aniket Bera
This paper introduces DREAM - Decentralized Reinforcement Learning for Exploration and Efficient Energy Management in Multi-Robot Systems, a comprehensive framework that optimizes the allocation of resources for efficient exploration.
no code implementations • 19 Sep 2023 • Phu Pham, Aniket Bera
Multi-Agent Path Finding (MAPF) in crowded environments presents a challenging problem in motion planning, aiming to find collision-free paths for all agents in the system.
no code implementations • 12 Sep 2023 • Aaditya Kharel, Manas Paranjape, Aniket Bera
With the rise in manipulated media, deepfake detection has become an imperative task for preserving the authenticity of digital content.
no code implementations • 16 Aug 2023 • Hrishikesh Viswanath, Aneesh Bhattacharya, Pascal Jutras-Dubé, Prerit Gupta, Mridu Prashanth, Yashvardhan Khaitan, Aniket Bera
We showcase the language-independent emotion modeling capability of the quantized emotional embeddings learned from a bilingual (English and Chinese) speech corpus with an emotion transfer task from a reference speech to a target speech.
no code implementations • 29 Jun 2023 • Anthony Francis, Claudia Pérez-D'Arpino, Chengshu Li, Fei Xia, Alexandre Alahi, Rachid Alami, Aniket Bera, Abhijat Biswas, Joydeep Biswas, Rohan Chandra, Hao-Tien Lewis Chiang, Michael Everett, Sehoon Ha, Justin Hart, Jonathan P. How, Haresh Karnan, Tsang-Wei Edward Lee, Luis J. Manso, Reuth Mirksy, Sören Pirk, Phani Teja Singamaneni, Peter Stone, Ada V. Taylor, Peter Trautman, Nathan Tsoi, Marynel Vázquez, Xuesu Xiao, Peng Xu, Naoki Yokoyama, Alexander Toshev, Roberto Martín-Martín
A major challenge to deploying robots widely is navigation in human-populated environments, commonly referred to as social robot navigation.
no code implementations • 8 Mar 2023 • Rashmi Bhaskara, Maurice Chiu, Aniket Bera
Our research aims to develop a model that can predict the movements of pedestrians and perceptually-social groups in crowded environments.
no code implementations • 8 Mar 2023 • Dipam Patel, Phu Pham, Aniket Bera
Using drones in conjunction with NeRF provides a unique and dynamic way to generate novel views of a scene, especially with limited scene capabilities of restricted movements.
no code implementations • 2 Mar 2023 • Xijun Wang, Ruiqi Xian, Tianrui Guan, Celso M. de Melo, Stephen M. Nogar, Aniket Bera, Dinesh Manocha
We propose a novel approach for aerial video action recognition.
Ranked #1 on Action Recognition on RoCoG-v2
no code implementations • 30 Jan 2023 • Hrishikesh Viswanath, Md Ashiqur Rahman, Abhijeet Vyas, Andrey Shor, Beatriz Medeiros, Stephanie Hernandez, Suhas Eswarappa Prameela, Aniket Bera
This article aims to provide a comprehensive insight into how data-driven approaches can complement conventional techniques to solve engineering and physics problems, while also noting some of the major pitfalls of machine learning-based approaches.
no code implementations • 25 Nov 2022 • Md Ashiqur Rahman, Jasorsi Ghosh, Hrishikesh Viswanath, Kamyar Azizzadenesheli, Aniket Bera
In contrast to the concurrent works, which mainly focus on generating the motion of a single actor from the textual description, we generate the motion of one of the actors from the motion of the other participating actor in the action.
1 code implementation • 19 Nov 2022 • Hrishikesh Viswanath, Md Ashiqur Rahman, Rashmi Bhaskara, Aniket Bera
We present, AdaFNIO - Adaptive Fourier Neural Interpolation Operator, a neural operator-based architecture to perform video frame interpolation.
no code implementations • 13 Sep 2022 • James F. Mullen Jr, Divya Kothandaraman, Aniket Bera, Dinesh Manocha
We compare our method, which we call PAAK, with prior approaches, including POSA, PROX ground truth, and a motion synthesis method, and highlight the benefits of our method with a perceptual study.
no code implementations • CVPR 2022 • Vikram Gupta, Trisha Mittal, Puneet Mathur, Vaibhav Mishra, Mayank Maheshwari, Aniket Bera, Debdoot Mukherjee, Dinesh Manocha
We present 3MASSIV, a multilingual, multimodal and multi-aspect, expertly-annotated dataset of diverse short videos extracted from short-video social media platform - Moj.
2 code implementations • CVPR 2021 • Trisha Mittal, Puneet Mathur, Aniket Bera, Dinesh Manocha
We use an LSTM-based learning model for emotion perception.
no code implementations • 17 Nov 2020 • Venkatraman Narayanan, Bala Murali Manoghar, Rama Prashanth RV, Phu Pham, Aniket Bera
Amodal recognition is the ability of the system to detect occluded objects.
no code implementations • 17 Nov 2020 • Videsh Suman, Phu Pham, Aniket Bera
A key aspect of driving a road vehicle is to interact with other road users, assess their intentions and make risk-aware tactical decisions.
no code implementations • 17 Nov 2020 • Pooja Guhan, Naman Awasthi, and Kathryn McDonald, Kristin Bussell, Dinesh Manocha, Gloria Reeves, Aniket Bera
We discuss MET, a learning-based algorithm proposed for perceiving a patient's level of engagement during telehealth sessions.
no code implementations • 1 Nov 2020 • Vishnu Sashank Dorbala, Arjun Srinivasan, Aniket Bera
We utilize both these trust metrics into an optimal cognitive reasoning scheme that decides when and when not to trust the given guidance.
no code implementations • 18 Sep 2020 • Abhishek Banerjee, Uttaran Bhattacharya, Aniket Bera
Our task is to map gestures to novel emotion categories not encountered in training.
no code implementations • 14 Mar 2020 • Trisha Mittal, Uttaran Bhattacharya, Rohan Chandra, Aniket Bera, Dinesh Manocha
Additionally, we extract and compare affective cues corresponding to perceived emotion from the two modalities within a video to infer whether the input video is "real" or "fake".
no code implementations • CVPR 2020 • Trisha Mittal, Pooja Guhan, Uttaran Bhattacharya, Rohan Chandra, Aniket Bera, Dinesh Manocha
We report an AP of 65. 83 across 4 categories on GroupWalk, which is also an improvement over prior methods.
Ranked #2 on Emotion Recognition in Context on CAER
Emotion Recognition in Context Multimodal Emotion Recognition
1 code implementation • 2 Mar 2020 • Venkatraman Narayanan, Bala Murali Manoghar, Vishnu Sashank Dorbala, Dinesh Manocha, Aniket Bera
Our approach predicts the perceived emotions of a pedestrian from walking gaits, which is then used for emotion-guided navigation taking into account social and proxemic constraints.
Ranked #1 on Emotion Classification on EWALK
no code implementations • 14 Dec 2019 • Tanmay Randhavane, Uttaran Bhattacharya, Kyra Kapsaskis, Kurt Gray, Aniket Bera, Dinesh Manocha
We present a data-driven deep neural algorithm for detecting deceptive walking behavior using nonverbal cues like gaits and gestures.
no code implementations • arXiv 2019 • Rohan Chandra, Tianrui Guan, Srujan Panuganti, Trisha Mittal, Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha
In practice, our approach reduces the average prediction error by more than 54% over prior algorithms and achieves a weighted average accuracy of 91. 2% for behavior prediction.
Ranked #1 on Trajectory Prediction on ApolloScape
Robotics
no code implementations • ECCV 2020 • Uttaran Bhattacharya, Christian Roncal, Trisha Mittal, Rohan Chandra, Kyra Kapsaskis, Kurt Gray, Aniket Bera, Dinesh Manocha
For the annotated data, we also train a classifier to map the latent embeddings to emotion labels.
no code implementations • 9 Nov 2019 • Trisha Mittal, Uttaran Bhattacharya, Rohan Chandra, Aniket Bera, Dinesh Manocha
Our approach combines cues from multiple co-occurring modalities (such as face, text, and speech) and also is more robust than other methods to sensor noise in any of the individual modalities.
1 code implementation • 28 Oct 2019 • Uttaran Bhattacharya, Trisha Mittal, Rohan Chandra, Tanmay Randhavane, Aniket Bera, Dinesh Manocha
We use hundreds of annotated real-world gait videos and augment them with thousands of annotated synthetic gaits generated using a novel generative network called STEP-Gen, built on an ST-GCN based Conditional Variational Autoencoder (CVAE).
1 code implementation • 20 Jul 2019 • Rohan Chandra, Uttaran Bhattacharya, Christian Roncal, Aniket Bera, Dinesh Manocha
RobustTP is an approach that first computes trajectories using a combination of a non-linear motion model and a deep learning-based instance segmentation algorithm.
Robotics
no code implementations • 30 Jun 2019 • Tanmay Randhavane, Aniket Bera, Kyra Kapsaskis, Kurt Gray, Dinesh Manocha
We also investigate the perception of a user in an AR setting and observe that an FVA has a statistically significant improvement in terms of the perceived friendliness and social presence of a user compared to an agent without the friendliness modeling.
1 code implementation • 25 Jun 2019 • Rohan Chandra, Uttaran Bhattacharya, Tanmay Randhavane, Aniket Bera, Dinesh Manocha
We present a realtime tracking algorithm, RoadTrack, to track heterogeneous road-agents in dense traffic videos.
Robotics
no code implementations • 14 Jun 2019 • Tanmay Randhavane, Uttaran Bhattacharya, Kyra Kapsaskis, Kurt Gray, Aniket Bera, Dinesh Manocha
We also present an EWalk (Emotion Walk) dataset that consists of videos of walking individuals with gaits and labeled emotions.
2 code implementations • CVPR 2019 • Rohan Chandra, Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha
We evaluate the performance of our prediction algorithm, TraPHic, on the standard datasets and also introduce a new dense, heterogeneous traffic dataset corresponding to urban Asian videos and agent trajectories.
Ranked #1 on Trajectory Prediction on TRAF
Trajectory Prediction Robotics
no code implementations • 15 Oct 2018 • Tanmay Randhavane, Aniket Bera, Emily Kubin, Austin Wang, Kurt Gray, Dinesh Manocha
We present a Pedestrian Dominance Model (PDM) to identify the dominance characteristics of pedestrians for robot navigation.
Robotics
no code implementations • 28 Sep 2018 • Aniket Bera, Tanmay Randhavane, Emily Kubin, Husam Shaik, Kurt Gray, Dinesh Manocha
We also present a novel interactive multi-agent simulation algorithm to model entitative groups and conduct a VR user study to validate the socio-emotional predictive power of our algorithm.
Graphics Human-Computer Interaction
no code implementations • 30 May 2018 • Yuanfu Luo, Panpan Cai, Aniket Bera, David Hsu, Wee Sun Lee, Dinesh Manocha
Our planning system combines a POMDP algorithm with the pedestrian motion model and runs in near real time.
Robotics
no code implementations • 2 Mar 2018 • Ernest Cheung, Aniket Bera, Emily Kubin, Kurt Gray, Dinesh Manocha
We present a novel approach to automatically identify driver behaviors from vehicle trajectories and use them for safe navigation of autonomous vehicles.
Robotics
no code implementations • 28 Jul 2017 • Ernest C. Cheung, Tsan Kwong Wong, Aniket Bera, Dinesh Manocha
We present a new method for training pedestrian detectors on an unannotated set of images.
no code implementations • 29 Jun 2016 • Ernest Cheung, Tsan Kwong Wong, Aniket Bera, Xiaogang Wang, Dinesh Manocha
We present a novel procedural framework to generate an arbitrary number of labeled crowd videos (LCrowdV).
no code implementations • 16 Sep 2014 • Aniket Bera, David Wolinski, Julien Pettré, Dinesh Manocha
We automatically compute the optimal parameters for each of these different models based on prior tracked data and use the best model as motion prior for our particle-filter based tracking algorithm.
no code implementations • 11 Feb 2014 • Aniket Bera, Dinesh Manocha
We present a novel, realtime algorithm to compute the trajectory of each pedestrian in moderately dense crowd scenes.