Search Results for author: Yutong Ban

Found 21 papers, 5 papers with code

Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning

no code implementations20 May 2025 Yunpeng Jiang, Jianshu Hu, Paul Weng, Yutong Ban

Symmetry is pervasive in robotics and has been widely exploited to improve sample efficiency in deep reinforcement learning (DRL).

Deep Reinforcement Learning reinforcement-learning

Tracking-Aware Deformation Field Estimation for Non-rigid 3D Reconstruction in Robotic Surgeries

no code implementations4 Mar 2025 Zeqing Wang, Han Fang, Yihong Xu, Yutong Ban

Then, the 2D deformation field is smoothly incorporated with a neural implicit reconstruction network to obtain tissue deformation in the 3D space.

3D Reconstruction NeRF

ASAP: Learning Generalizable Online Bin Packing via Adaptive Selection After Pruning

no code implementations29 Jan 2025 Han Fang, Paul Weng, Yutong Ban

To learn these policies, we propose a training scheme based on a meta-learning phase of both policies followed by a finetuning phase of the sole selection policy to rapidly adapt it to a test distribution.

3D Bin Packing Decision Making +2

Surgical Scene Segmentation by Transformer With Asymmetric Feature Enhancement

1 code implementation23 Oct 2024 Cheng Yuan, Yutong Ban

Surgical scene segmentation is a fundamental task for robotic-assisted laparoscopic surgery understanding.

Anatomy Scene Segmentation +2

State-Novelty Guided Action Persistence in Deep Reinforcement Learning

no code implementations9 Sep 2024 Jianshu Hu, Paul Weng, Yutong Ban

While a powerful and promising approach, deep reinforcement learning (DRL) still suffers from sample inefficiency, which can be notably improved by resorting to more sophisticated techniques to address the exploration-exploitation dilemma.

Deep Reinforcement Learning reinforcement-learning +1

Understanding and Reducing the Class-Dependent Effects of Data Augmentation with A Two-Player Game Approach

no code implementations31 May 2024 Yunpeng Jiang, Paul Weng, Yutong Ban

By rewriting this optimization problem as an adversarial two-player game, we propose a novel multiplicative weight algorithm, for which we prove the convergence.

Data Augmentation Fairness +1

INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer

1 code implementation4 Feb 2024 Han Fang, Zhihao Song, Paul Weng, Yutong Ban

Recently, deep reinforcement learning has shown promising results for learning fast heuristics to solve routing problems.

Deep Reinforcement Learning

Hypergraph-Transformer (HGT) for Interactive Event Prediction in Laparoscopic and Robotic Surgery

no code implementations3 Feb 2024 Lianhao Yin, Yutong Ban, Jennifer Eckhoff, Ozanan Meireles, Daniela Rus, Guy Rosman

Understanding and anticipating intraoperative events and actions is critical for intraoperative assistance and decision-making during minimally invasive surgery.

Decision Making Knowledge Graphs

Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models

no code implementations26 Oct 2023 Tsun-Hsuan Wang, Alaa Maalouf, Wei Xiao, Yutong Ban, Alexander Amini, Guy Rosman, Sertac Karaman, Daniela Rus

As autonomous driving technology matures, end-to-end methodologies have emerged as a leading strategy, promising seamless integration from perception to control via deep learning.

Autonomous Driving Data Augmentation

Infrastructure-based End-to-End Learning and Prevention of Driver Failure

no code implementations21 Mar 2023 Noam Buckman, Shiva Sreeram, Mathias Lechner, Yutong Ban, Ramin Hasani, Sertac Karaman, Daniela Rus

FailureNet observes the poses of vehicles as they approach an intersection and detects whether a failure is present in the autonomy stack, warning cross-traffic of potentially dangerous drivers.

Autonomous Vehicles

On the Forward Invariance of Neural ODEs

no code implementations10 Oct 2022 Wei Xiao, Tsun-Hsuan Wang, Ramin Hasani, Mathias Lechner, Yutong Ban, Chuang Gan, Daniela Rus

We propose a new method to ensure neural ordinary differential equations (ODEs) satisfy output specifications by using invariance set propagation.

Autonomous Vehicles Collision Avoidance +2

Concept Graph Neural Networks for Surgical Video Understanding

no code implementations27 Feb 2022 Yutong Ban, Jennifer A. Eckhoff, Thomas M. Ward, Daniel A. Hashimoto, Ozanan R. Meireles, Daniela Rus, Guy Rosman

We constantly integrate our knowledge and understanding of the world to enhance our interpretation of what we see.

Video Understanding

SUPR-GAN: SUrgical PRediction GAN for Event Anticipation in Laparoscopic and Robotic Surgery

no code implementations10 May 2021 Yutong Ban, Guy Rosman, Jennifer A. Eckhoff, Thomas M. Ward, Daniel A. Hashimoto, Taisei Kondo, Hidekazu Iwaki, Ozanan R. Meireles, Daniela Rus

Comprehension of surgical workflow is the foundation upon which artificial intelligence (AI) and machine learning (ML) holds the potential to assist intraoperative decision-making and risk mitigation.

Decision Making Generative Adversarial Network

TransCenter: Transformers with Dense Representations for Multiple-Object Tracking

2 code implementations28 Mar 2021 Yihong Xu, Yutong Ban, Guillaume Delorme, Chuang Gan, Daniela Rus, Xavier Alameda-Pineda

Methodologically, we propose the use of image-related dense detection queries and efficient sparse tracking queries produced by our carefully designed query learning networks (QLN).

Ranked #18 on Multi-Object Tracking on MOT20 (MOTA metric, using extra training data)

Decoder image-classification +6

Aggregating Long-Term Context for Learning Laparoscopic and Robot-Assisted Surgical Workflows

no code implementations1 Sep 2020 Yutong Ban, Guy Rosman, Thomas Ward, Daniel Hashimoto, Taisei Kondo, Hidekazu Iwaki, Ozanan Meireles, Daniela Rus

With the understanding of the complete surgical workflow, the robots are able to assist the surgeons in intra-operative events, such as by giving a warning when the surgeon is entering specific keys or high-risk phases.

Surgical phase recognition

How To Train Your Deep Multi-Object Tracker

2 code implementations CVPR 2020 Yihong Xu, Aljosa Osep, Yutong Ban, Radu Horaud, Laura Leal-Taixe, Xavier Alameda-Pineda

In this paper, we bridge this gap by proposing a differentiable proxy of MOTA and MOTP, which we combine in a loss function suitable for end-to-end training of deep multi-object trackers.

Multi-Object Tracking Multiple Object Tracking +1

Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers

no code implementations28 Sep 2018 Yutong Ban, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud

We propose a variational inference model which amounts to approximate the joint distribution with a factorized distribution.

Bayesian Inference Variational Inference +1

A Deep Network for Arousal-Valence Emotion Prediction with Acoustic-Visual Cues

1 code implementation2 May 2018 Songyou Peng, Le Zhang, Yutong Ban, Meng Fang, Stefan Winkler

In this paper, we comprehensively describe the methodology of our submissions to the One-Minute Gradual-Emotion Behavior Challenge 2018.

Cannot find the paper you are looking for? You can Submit a new open access paper.