Search Results for author: Aniket Bera

Found 59 papers, 12 papers with code

FlashSLAM: Accelerated RGB-D SLAM for Real-Time 3D Scene Reconstruction with Gaussian Splatting

no code implementations1 Dec 2024 Phu Pham, Damon Conover, Aniket Bera

We present FlashSLAM, a novel SLAM approach that leverages 3D Gaussian Splatting for efficient and robust 3D scene reconstruction.

3D Reconstruction 3D Scene Reconstruction +2

Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone Connectivity

1 code implementation31 Oct 2024 AmirMohammad Tahmasbi, MohammadSaleh Faghfoorian, Saeed Khodaygan, Aniket Bera

Path planning in high-dimensional spaces poses significant challenges, particularly in achieving both time efficiency and a fair success rate.

Q-Learning Reinforcement Learning (RL)

COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models

no code implementations30 Sep 2024 Divyanshu Daiya, Damon Conover, Aniket Bera

We propose a novel framework COLLAGE for generating collaborative agent-object-agent interactions by leveraging large language models (LLMs) and hierarchical motion-specific vector-quantized variational autoencoders (VQ-VAEs).

Denoising Diversity +2

SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models

no code implementations28 Sep 2024 Yi Wu, Zikang Xiong, Yiran Hu, Shreyash S. Iyengar, Nan Jiang, Aniket Bera, Lin Tan, Suresh Jagannathan

We demonstrate the effectiveness and generalizability of SELP across different robot agents and tasks, including drone navigation and robot manipulation.

Drone navigation Robot Manipulation +1

Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM

no code implementations25 Sep 2024 Phu Pham, Dipam Patel, Damon Conover, Aniket Bera

We introduce Go-SLAM, a novel framework that utilizes 3D Gaussian Splatting SLAM to reconstruct dynamic environments while embedding object-level information within the scene representations.

3D Scene Reconstruction Object +1

Dynamic Obstacle Avoidance through Uncertainty-Based Adaptive Planning with Diffusion

no code implementations25 Sep 2024 Vineet Punyamoorty, Pascal Jutras-Dubé, Ruqi Zhang, Vaneet Aggarwal, Damon Conover, Aniket Bera

By framing reinforcement learning as a sequence modeling problem, recent work has enabled the use of generative models, such as diffusion models, for planning.

Collision Avoidance Decision Making +1

Multi-Robot Informative Path Planning for Efficient Target Mapping using Deep Reinforcement Learning

1 code implementation25 Sep 2024 Apoorva Vashisth, Dipam Patel, Damon Conover, Aniket Bera

We propose a novel deep reinforcement learning approach for multi-robot informative path planning to map targets-of-interest in an unknown 3D environment.

Collision Avoidance Deep Reinforcement Learning +1

MVGaussian: High-Fidelity text-to-3D Content Generation with Multi-View Guidance and Surface Densification

no code implementations10 Sep 2024 Phu Pham, Aradhya N. Mathur, Ojaswa Sharma, Aniket Bera

The field of text-to-3D content generation has made significant progress in generating realistic 3D objects, with existing methodologies like Score Distillation Sampling (SDS) offering promising guidance.

Text to 3D

Adaptive Planning with Generative Models under Uncertainty

no code implementations2 Aug 2024 Pascal Jutras-Dubé, Ruqi Zhang, Aniket Bera

Planning with generative models has emerged as an effective decision-making paradigm across a wide range of domains, including reinforcement learning and autonomous navigation.

Autonomous Navigation Decision Making +1

Reduced-Order Neural Operators: Learning Lagrangian Dynamics on Highly Sparse Graphs

1 code implementation4 Jul 2024 Hrishikesh Viswanath, Yue Chang, Julius Berner, Peter Yichen Chen, Aniket Bera

We propose accelerating the simulation of Lagrangian dynamics, such as fluid flows, granular flows, and elastoplasticity, with neural-operator-based reduced-order modeling.

Speech2UnifiedExpressions: Synchronous Synthesis of Co-Speech Affective Face and Body Expressions from Affordable Inputs

no code implementations26 Jun 2024 Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha

To enhance the plausibility of synthesis, we use an adversarial discriminator that learns to differentiate between the face and pose motions computed from the original videos and our synthesized motions based on their affective expressions.

Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture

no code implementations CVPR 2024 Juanwu Lu, Can Cui, Yunsheng Ma, Aniket Bera, Ziran Wang

In this paper, we propose the Sequential Neural Variational Agent (SeNeVA), a generative model that describes the distribution of future trajectories for a single moving object.

Autonomous Vehicles motion prediction

Beyond Text: Utilizing Vocal Cues to Improve Decision Making in LLMs for Robot Navigation Tasks

no code implementations5 Feb 2024 Xingpeng Sun, Haoming Meng, Souradip Chakraborty, Amrit Singh Bedi, Aniket Bera

While LLMs excel in processing text in these human conversations, they struggle with the nuances of verbal instructions in scenarios like social navigation, where ambiguity and uncertainty can erode trust in robotic and other AI systems.

Decision Making Language Modelling +1

RL Dreams: Policy Gradient Optimization for Score Distillation based 3D Generation

no code implementations8 Dec 2023 Aradhya N. Mathur, Phu Pham, Aniket Bera, Ojaswa Sharma

Further, the recent work of Denoising Diffusion Policy Optimization (DDPO) demonstrates that the diffusion process is compatible with policy gradient methods and has been demonstrated to improve the 2D diffusion models using an aesthetic scoring function.

3D Generation Denoising +2

MIRACLE: Inverse Reinforcement and Curriculum Learning Model for Human-inspired Mobile Robot Navigation

no code implementations6 Dec 2023 Nihal Gunukula, Kshitij Tiwari, Aniket Bera

In emergency scenarios, mobile robots must navigate like humans, interpreting stimuli to locate potential victims rapidly without interfering with first responders.

Navigate Robot Navigation

DREAM: Decentralized Reinforcement Learning for Exploration and Efficient Energy Management in Multi-Robot Systems

no code implementations29 Sep 2023 Dipam Patel, Phu Pham, Kshitij Tiwari, Aniket Bera

This paper introduces DREAM - Decentralized Reinforcement Learning for Exploration and Efficient Energy Management in Multi-Robot Systems, a comprehensive framework that optimizes the allocation of resources for efficient exploration.

Efficient Exploration energy management +4

Optimizing Crowd-Aware Multi-Agent Path Finding through Local Communication with Graph Neural Networks

no code implementations19 Sep 2023 Phu Pham, Aniket Bera

Multi-Agent Path Finding (MAPF) in crowded environments presents a challenging problem in motion planning, aiming to find collision-free paths for all agents in the system.

Decision Making Motion Planning +2

DF-TransFusion: Multimodal Deepfake Detection via Lip-Audio Cross-Attention and Facial Self-Attention

no code implementations12 Sep 2023 Aaditya Kharel, Manas Paranjape, Aniket Bera

With the rise in manipulated media, deepfake detection has become an imperative task for preserving the authenticity of digital content.

DeepFake Detection Face Swapping

AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis

no code implementations16 Aug 2023 Hrishikesh Viswanath, Aneesh Bhattacharya, Pascal Jutras-Dubé, Prerit Gupta, Mridu Prashanth, Yashvardhan Khaitan, Aniket Bera

We showcase the language-independent emotion modeling capability of the quantized emotional embeddings learned from a bilingual (English and Chinese) speech corpus with an emotion transfer task from a reference speech to a target speech.

Attribute Speech Synthesis +1

SG-LSTM: Social Group LSTM for Robot Navigation Through Dense Crowds

no code implementations8 Mar 2023 Rashmi Bhaskara, Maurice Chiu, Aniket Bera

Our research aims to develop a model that can predict the movements of pedestrians and perceptually-social groups in crowded environments.

Social Navigation

DroNeRF: Real-time Multi-agent Drone Pose Optimization for Computing Neural Radiance Fields

no code implementations8 Mar 2023 Dipam Patel, Phu Pham, Aniket Bera

Using drones in conjunction with NeRF provides a unique and dynamic way to generate novel views of a scene, especially with limited scene capabilities of restricted movements.

3D Reconstruction Novel View Synthesis +2

Neural Operator: Is data all you need to model the world? An insight into the impact of Physics Informed Machine Learning

no code implementations30 Jan 2023 Hrishikesh Viswanath, Md Ashiqur Rahman, Abhijeet Vyas, Andrey Shor, Beatriz Medeiros, Stephanie Hernandez, Suhas Eswarappa Prameela, Aniket Bera

This article aims to provide a comprehensive insight into how data-driven approaches can complement conventional techniques to solve engineering and physics problems, while also noting some of the major pitfalls of machine learning-based approaches.

Operator learning Physics-informed machine learning

PaCMO: Partner Dependent Human Motion Generation in Dyadic Human Activity using Neural Operators

no code implementations25 Nov 2022 Md Ashiqur Rahman, Jasorsi Ghosh, Hrishikesh Viswanath, Kamyar Azizzadenesheli, Aniket Bera

In contrast to the concurrent works, which mainly focus on generating the motion of a single actor from the textual description, we generate the motion of one of the actors from the motion of the other participating actor in the action.

Motion Generation

AdaFNIO: Adaptive Fourier Neural Interpolation Operator for video frame interpolation

1 code implementation19 Nov 2022 Hrishikesh Viswanath, Md Ashiqur Rahman, Rashmi Bhaskara, Aniket Bera

We present, AdaFNIO - Adaptive Fourier Neural Interpolation Operator, a neural operator-based architecture to perform video frame interpolation.

SSIM Video Frame Interpolation

Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes

no code implementations13 Sep 2022 James F. Mullen Jr, Divya Kothandaraman, Aniket Bera, Dinesh Manocha

We compare our method, which we call PAAK, with prior approaches, including POSA, PROX ground truth, and a motion synthesis method, and highlight the benefits of our method with a perceptual study.

Human Animation Motion Synthesis

3MASSIV: Multilingual, Multimodal and Multi-Aspect dataset of Social Media Short Videos

no code implementations CVPR 2022 Vikram Gupta, Trisha Mittal, Puneet Mathur, Vaibhav Mishra, Mayank Maheshwari, Aniket Bera, Debdoot Mukherjee, Dinesh Manocha

We present 3MASSIV, a multilingual, multimodal and multi-aspect, expertly-annotated dataset of diverse short videos extracted from short-video social media platform - Moj.

RAIST: Learning Risk Aware Traffic Interactions via Spatio-Temporal Graph Convolutional Networks

no code implementations17 Nov 2020 Videsh Suman, Phu Pham, Aniket Bera

A key aspect of driving a road vehicle is to interact with other road users, assess their intentions and make risk-aware tactical decisions.

Can a Robot Trust You? A DRL-Based Approach to Trust-Driven Human-Guided Navigation

no code implementations1 Nov 2020 Vishnu Sashank Dorbala, Arjun Srinivasan, Aniket Bera

We utilize both these trust metrics into an optimal cognitive reasoning scheme that decides when and when not to trust the given guidance.

Deep Reinforcement Learning Navigate +1

Emotions Don't Lie: An Audio-Visual Deepfake Detection Method Using Affective Cues

no code implementations14 Mar 2020 Trisha Mittal, Uttaran Bhattacharya, Rohan Chandra, Aniket Bera, Dinesh Manocha

Additionally, we extract and compare affective cues corresponding to perceived emotion from the two modalities within a video to infer whether the input video is "real" or "fake".

DeepFake Detection Face Swapping +1

ProxEmo: Gait-based Emotion Learning and Multi-view Proxemic Fusion for Socially-Aware Robot Navigation

1 code implementation2 Mar 2020 Venkatraman Narayanan, Bala Murali Manoghar, Vishnu Sashank Dorbala, Dinesh Manocha, Aniket Bera

Our approach predicts the perceived emotions of a pedestrian from walking gaits, which is then used for emotion-guided navigation taking into account social and proxemic constraints.

Emotion Classification Emotion Recognition +3

The Liar's Walk: Detecting Deception with Gait and Gesture

no code implementations14 Dec 2019 Tanmay Randhavane, Uttaran Bhattacharya, Kyra Kapsaskis, Kurt Gray, Aniket Bera, Dinesh Manocha

We present a data-driven deep neural algorithm for detecting deceptive walking behavior using nonverbal cues like gaits and gestures.

Action Classification

Forecasting Trajectory and Behavior of Road-Agents Using Spectral Clustering in Graph-LSTMs

no code implementations arXiv 2019 Rohan Chandra, Tianrui Guan, Srujan Panuganti, Trisha Mittal, Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha

In practice, our approach reduces the average prediction error by more than 54% over prior algorithms and achieves a weighted average accuracy of 91. 2% for behavior prediction.

Robotics

M3ER: Multiplicative Multimodal Emotion Recognition Using Facial, Textual, and Speech Cues

no code implementations9 Nov 2019 Trisha Mittal, Uttaran Bhattacharya, Rohan Chandra, Aniket Bera, Dinesh Manocha

Our approach combines cues from multiple co-occurring modalities (such as face, text, and speech) and also is more robust than other methods to sensor noise in any of the individual modalities.

Multimodal Emotion Recognition

STEP: Spatial Temporal Graph Convolutional Networks for Emotion Perception from Gaits

1 code implementation28 Oct 2019 Uttaran Bhattacharya, Trisha Mittal, Rohan Chandra, Tanmay Randhavane, Aniket Bera, Dinesh Manocha

We use hundreds of annotated real-world gait videos and augment them with thousands of annotated synthetic gaits generated using a novel generative network called STEP-Gen, built on an ST-GCN based Conditional Variational Autoencoder (CVAE).

General Classification

RobustTP: End-to-End Trajectory Prediction for Heterogeneous Road-Agents in Dense Traffic with Noisy Sensor Inputs

1 code implementation20 Jul 2019 Rohan Chandra, Uttaran Bhattacharya, Christian Roncal, Aniket Bera, Dinesh Manocha

RobustTP is an approach that first computes trajectories using a combination of a non-linear motion model and a deep learning-based instance segmentation algorithm.

Robotics

FVA: Modeling Perceived Friendliness of Virtual Agents Using Movement Characteristics

no code implementations30 Jun 2019 Tanmay Randhavane, Aniket Bera, Kyra Kapsaskis, Kurt Gray, Dinesh Manocha

We also investigate the perception of a user in an AR setting and observe that an FVA has a statistically significant improvement in terms of the perceived friendliness and social presence of a user compared to an agent without the friendliness modeling.

RoadTrack: Realtime Tracking of Road Agents in Dense and Heterogeneous Environments

1 code implementation25 Jun 2019 Rohan Chandra, Uttaran Bhattacharya, Tanmay Randhavane, Aniket Bera, Dinesh Manocha

We present a realtime tracking algorithm, RoadTrack, to track heterogeneous road-agents in dense traffic videos.

Robotics

Identifying Emotions from Walking using Affective and Deep Features

no code implementations14 Jun 2019 Tanmay Randhavane, Uttaran Bhattacharya, Kyra Kapsaskis, Kurt Gray, Aniket Bera, Dinesh Manocha

We also present an EWalk (Emotion Walk) dataset that consists of videos of walking individuals with gaits and labeled emotions.

Emotion Recognition

TraPHic: Trajectory Prediction in Dense and Heterogeneous Traffic Using Weighted Interactions

2 code implementations CVPR 2019 Rohan Chandra, Uttaran Bhattacharya, Aniket Bera, Dinesh Manocha

We evaluate the performance of our prediction algorithm, TraPHic, on the standard datasets and also introduce a new dense, heterogeneous traffic dataset corresponding to urban Asian videos and agent trajectories.

Trajectory Prediction Robotics

Pedestrian Dominance Modeling for Socially-Aware Robot Navigation

no code implementations15 Oct 2018 Tanmay Randhavane, Aniket Bera, Emily Kubin, Austin Wang, Kurt Gray, Dinesh Manocha

We present a Pedestrian Dominance Model (PDM) to identify the dominance characteristics of pedestrians for robot navigation.

Robotics

Data-Driven Modeling of Group Entitativity in Virtual Environments

no code implementations28 Sep 2018 Aniket Bera, Tanmay Randhavane, Emily Kubin, Husam Shaik, Kurt Gray, Dinesh Manocha

We also present a novel interactive multi-agent simulation algorithm to model entitative groups and conduct a VR user study to validate the socio-emotional predictive power of our algorithm.

Graphics Human-Computer Interaction

PORCA: Modeling and Planning for Autonomous Driving among Many Pedestrians

no code implementations30 May 2018 Yuanfu Luo, Panpan Cai, Aniket Bera, David Hsu, Wee Sun Lee, Dinesh Manocha

Our planning system combines a POMDP algorithm with the pedestrian motion model and runs in near real time.

Robotics

Identifying Driver Behaviors using Trajectory Features for Vehicle Navigation

no code implementations2 Mar 2018 Ernest Cheung, Aniket Bera, Emily Kubin, Kurt Gray, Dinesh Manocha

We present a novel approach to automatically identify driver behaviors from vehicle trajectories and use them for safe navigation of autonomous vehicles.

Robotics

Real-time Crowd Tracking using Parameter Optimized Mixture of Motion Models

no code implementations16 Sep 2014 Aniket Bera, David Wolinski, Julien Pettré, Dinesh Manocha

We automatically compute the optimal parameters for each of these different models based on prior tracked data and use the best model as motion prior for our particle-filter based tracking algorithm.

Combinatorial Optimization

Realtime Multilevel Crowd Tracking using Reciprocal Velocity Obstacles

no code implementations11 Feb 2014 Aniket Bera, Dinesh Manocha

We present a novel, realtime algorithm to compute the trajectory of each pedestrian in moderately dense crowd scenes.

Cannot find the paper you are looking for? You can Submit a new open access paper.