Search Results for author: Shubhankar Borse

Found 18 papers, 6 papers with code

PosSAM: Panoptic Open-vocabulary Segment Anything

1 code implementation14 Mar 2024 Vibashan VS, Shubhankar Borse, Hyojin Park, Debasmit Das, Vishal Patel, Munawar Hayat, Fatih Porikli

In this paper, we introduce an open-vocabulary panoptic segmentation model that effectively unifies the strengths of the Segment Anything Model (SAM) with the vision-language CLIP model in an end-to-end framework.

Open Vocabulary Panoptic Segmentation Open Vocabulary Semantic Segmentation +2

Multi-camera Bird's Eye View Perception for Autonomous Driving

no code implementations16 Sep 2023 David Unger, Nikhil Gosala, Varun Ravi Kumar, Shubhankar Borse, Abhinav Valada, Senthil Yogamani

Surround vision systems that are pretty common in new vehicles use the IPM principle to generate a BEV image and to show it on display to the driver.

Autonomous Driving Sensor Fusion

X-Align++: cross-modal cross-view alignment for Bird's-eye-view segmentation

no code implementations6 Jun 2023 Shubhankar Borse, Senthil Yogamani, Marvin Klingner, Varun Ravi, Hong Cai, Abdulaziz Almuzairee, Fatih Porikli

Bird's-eye-view (BEV) grid is a typical representation of the perception of road components, e. g., drivable area, in autonomous driving.

Autonomous Driving Segmentation

X$^3$KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection

no code implementations3 Mar 2023 Marvin Klingner, Shubhankar Borse, Varun Ravi Kumar, Behnaz Rezaei, Venkatraman Narayanan, Senthil Yogamani, Fatih Porikli

Specifically, we propose cross-task distillation from an instance segmentation teacher (X-IS) in the PV feature extraction stage providing supervision without ambiguous error backpropagation through the view transformation.

3D Object Detection Instance Segmentation +3

DejaVu: Conditional Regenerative Learning to Enhance Dense Prediction

no code implementations CVPR 2023 Shubhankar Borse, Debasmit Das, Hyojin Park, Hong Cai, Risheek Garrepalli, Fatih Porikli

Next, we use a conditional regenerator, which takes the redacted image and the dense predictions as inputs, and reconstructs the original image by filling in the missing structural information.

Depth Estimation

TransAdapt: A Transformative Framework for Online Test Time Adaptive Semantic Segmentation

no code implementations24 Feb 2023 Debasmit Das, Shubhankar Borse, Hyojin Park, Kambiz Azarian, Hong Cai, Risheek Garrepalli, Fatih Porikli

Test-time adaptive (TTA) semantic segmentation adapts a source pre-trained image semantic segmentation model to unlabeled batches of target domain test images, different from real-world, where samples arrive one-by-one in an online fashion.

Segmentation Semantic Segmentation +1

X3KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection

no code implementations CVPR 2023 Marvin Klingner, Shubhankar Borse, Varun Ravi Kumar, Behnaz Rezaei, Venkatraman Narayanan, Senthil Yogamani, Fatih Porikli

Specifically, we propose cross-task distillation from an instance segmentation teacher (X-IS) in the PV feature extraction stage providing supervision without ambiguous error backpropagation through the view transformation.

3D Object Detection Instance Segmentation +3

X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View Segmentation

no code implementations13 Oct 2022 Shubhankar Borse, Marvin Klingner, Varun Ravi Kumar, Hong Cai, Abdulaziz Almuzairee, Senthil Yogamani, Fatih Porikli

Bird's-eye-view (BEV) grid is a common representation for the perception of road components, e. g., drivable area, in autonomous driving.

Autonomous Driving Segmentation

Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the Wild

1 code implementation13 Oct 2022 Kaifeng Zhang, Yang Fu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang

While 6D object pose estimation has wide applications across computer vision and robotics, it remains far from being solved due to the lack of annotations.

6D Pose Estimation 6D Pose Estimation using RGB +2

Learning Implicit Feature Alignment Function for Semantic Segmentation

1 code implementation17 Jun 2022 Hanzhe Hu, Yinbo Chen, Jiarui Xu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang

As such, IFA implicitly aligns the feature maps at different levels and is capable of producing segmentation maps in arbitrary resolutions.

Segmentation Semantic Segmentation

Simple and Efficient Architectures for Semantic Segmentation

1 code implementation16 Jun 2022 Dushyant Mehta, Andrii Skliar, Haitam Ben Yahia, Shubhankar Borse, Fatih Porikli, Amirhossein Habibian, Tijmen Blankevoort

Though the state-of-the architectures for semantic segmentation, such as HRNet, demonstrate impressive accuracy, the complexity arising from their salient design choices hinders a range of model acceleration tools, and further they make use of operations that are inefficient on current hardware.

Image Classification Segmentation +1

HS3: Learning with Proper Task Complexity in Hierarchically Supervised Semantic Segmentation

no code implementations3 Nov 2021 Shubhankar Borse, Hong Cai, Yizhe Zhang, Fatih Porikli

While deeply supervised networks are common in recent literature, they typically impose the same learning objective on all transitional layers despite their varying representation powers.

Ranked #4 on Semantic Segmentation on Cityscapes test (using extra training data)

Segmentation Semantic Segmentation

X-Distill: Improving Self-Supervised Monocular Depth via Cross-Task Distillation

no code implementations24 Oct 2021 Hong Cai, Janarbek Matai, Shubhankar Borse, Yizhe Zhang, Amin Ansari, Fatih Porikli

In order to enable such knowledge distillation across two different visual tasks, we introduce a small, trainable network that translates the predicted depth map to a semantic segmentation map, which can then be supervised by the teacher network.

Knowledge Distillation Monocular Depth Estimation +2

AuxAdapt: Stable and Efficient Test-Time Adaptation for Temporally Consistent Video Semantic Segmentation

1 code implementation24 Oct 2021 Yizhe Zhang, Shubhankar Borse, Hong Cai, Fatih Porikli

Since inconsistency mainly arises from the model's uncertainty in its output, we propose an adaptation scheme where the model learns from its own segmentation decisions as it streams a video, which allows producing more confident and temporally consistent labeling for similarly-looking pixels across frames.

Optical Flow Estimation Segmentation +4

Perceptual Consistency in Video Segmentation

no code implementations24 Oct 2021 Yizhe Zhang, Shubhankar Borse, Hong Cai, Ying Wang, Ning Bi, Xiaoyun Jiang, Fatih Porikli

More specifically, by measuring the perceptual consistency between the predicted segmentation and the available ground truth on a nearby frame and combining it with the segmentation confidence, we can accurately assess the classification correctness on each pixel.

Segmentation Semantic Segmentation +2

InverseForm: A Loss Function for Structured Boundary-Aware Segmentation

1 code implementation CVPR 2021 Shubhankar Borse, Ying Wang, Yizhe Zhang, Fatih Porikli

We present a novel boundary-aware loss term for semantic segmentation using an inverse-transformation network, which efficiently learns the degree of parametric transformations between estimated and target boundaries.

Ranked #5 on Semantic Segmentation on Cityscapes test (using extra training data)

Segmentation Semantic Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.